Skip to content

Detect Changes

The Detect Changes component lets users scan two separate (but similar) tables, and insert a new column detailing if data has been inserted, deleted, changed, or even if the data is unchanged.

Any rows with key columns that contain NULL values will be ignored. NULL comparison values are considered equal.


Properties

Name = string

A human-readable name for the component.


Master Table = drop-down

Select a master table from the two inputs. This table is the one treated as default in the comparison with the second table.


Match Keys = dual listbox

Select the key columns to join the two tables on. These columns must appear in both tables. NULL values are ignored.


Compare Columns = dual listbox

Select the columns that will be checked for changes. Just like the keys, these columns must appear in both tables; however, the two lists should not overlap.


Output Column Mapping = column editor

  • Input Column: Select input columns to map to output names. Sensible defaults are provided automatically; however, these can be changed.
  • Output Column: Name output columns to which selected input columns will map.

Indicator Column = string

Input a name for the new column in the output. By default, this column is named "Indicator". This column contains an indicator that shows the status of each record:

C the record has been changed. D the record has been deleted. I the record is identical. N the record is new.

Switching the master table in the Master Table property will reverse the meaning of new (N) and deleted (D).


Strategy

Detects changed, unchanged, added, or deleted data in a comparison table relative to the designated master table.


Indicators

Indicators are single-letter codes that indicate what the state of a row is with regard to Detect Changes. The table below shows all indicators and their meanings.

Indicator Description
C Changed: the record is present in both tables, with different values, but with the same ID.
D Deleted: the record is present in the master table, but not in the second table.
I Identical: the same record is present in both tables with no changes.
N New: the record is not present in the master table, but is present in the second table.

Snowflake Delta Lake on Databricks Amazon Redshift Google BigQuery Azure Synapse Analytics