Amazon Redshift connection🔗

This page covers key resources you'll need to connect your Amazon Redshift instance to the Data Productivity Cloud. You must configure this connection before running any data pipelines.

Full SaaS or Hybrid SaaS?🔗

The Data Productivity Cloud supports both Full SaaS and Hybrid SaaS deployment architectures.

Amazon Redshift on AWS is compatible with both deployment types.
Amazon Redshift Serverless is optimized for Full SaaS environments, but can also be used in Hybrid SaaS. Read Amazon Redshift serverless for details.

Authentication methods🔗

You can connect to Amazon Redshift using several authentication options:

IAM authentication: Use AWS IAM roles for secure, credential-free access. This method supports role-based access control. Read the Amazon Redshift documentation to learn more.
Username/Password authentication: Authenticate with standard database credentials. Ensure the Amazon Redshift cluster is configured with appropriate user access policies.
SSL encryption: Enforce SSL for secure connections between the Data Productivity Cloud and Amazon Redshift. Learn more about Redshift SSL configuration

Compute types🔗

The following Amazon Redshift compute options are supported:

Provisioned clusters: Manually configured clusters for predictable workloads.
Amazon Redshift serverless (recommended): Automatically scales based on workload demand.
RA3 instances: Support managed storage with high performance. Read Amazon Redshift instance types to learn more.

Feature support and considerations🔗

The Data Productivity Cloud supports key Amazon Redshift features to enhance data workflows:

External tables via Amazon Redshift spectrum: Query external data stored in Amazon S3.
Stored procedures and functions: Build modular SQL-based transformations.
Materialized views: Improve performance with pre-computed queries.
Workload management (WLM): Prioritize workloads with query queues.
Federated queries: Access data across other AWS services (e.g., RDS, Aurora).

Role privileges and access management🔗

Amazon Redshift uses a role-based access control (RBAC) model. Assign roles to control user access and responsibilities:

Role	Responsibilities
Superuser	Full administrative access to Amazon Redshift.
DBA role	Manage database objects, permissions, and monitoring.
ETL role	Execute data transformations and load operations.
Read-only	Query access without modification privileges.
External Schema role	Access Amazon Redshift Spectrum and AWS Glue external schemas.

Best practices for managing Amazon Redshift roles🔗

Prefer IAM roles over static credentials.
Grant least privilege access.
Enable RBAC to segment responsibilities.
Regularly audit roles and logs.