Distinct
Pass on only the distinct (or unique) input records to the next component.
Properties
Name
= string
A human-readable name for the component.
Columns
= dual listbox
Only these selected columns are passed to the next component. Duplicate records are removed, leaving only distinct values.
Note
On Redshift, since records may be located on many nodes, determining uniqueness can be expensive.
Strategy
Generates a select distinct query.
Snowflake | Delta Lake on Databricks | Amazon Redshift | Google BigQuery | Azure Synapse Analytics |
---|---|---|---|---|
✅ | ✅ | ✅ | ✅ | ✅ |