Skip to content

Table Iterator

Overview

Loop over rows of data within a table or view and run another component for each row.

This component implements a simple loop over rows of data in an existing table. It enables you to run an attached component multiple times, each time with different values set on any number of variables taken from columns of that table. Those variables can then be referenced from the attached component.

To attach the iterator to another component, use the blue output connector and link to the desired component. To detach, right-click on the attached component and click Disconnect from Iterator.

If you need to iterate more than one component, put them into a separate orchestration job or transformation job and use a Run Transformation or Run Orchestration component attached to the iterator. In this way, you can run an entire ETL flow multiple times, once for each row of variable values.

The iterations are set up in advance, so the connection to the input table can be closed before any iterations are performed. If the attached component modifies the iteration table, those changes will not be reflected during the current run. Furthermore, to control runaway processes and control resources, only a limited number of rows of a table are considered for iteration. If you are iterating many rows, it is instead recommended that you use the table in a transformation job, and join the table being iterated instead.

All iterator components are limited to a maximum 5000 iterations.


Properties

Name = string

A human-readable name for the component.


Mode = drop-down

  • Basic: Construct the query using the clickable properties. This is the default setting.
  • Advanced: Manually write the SQL query.

Query = SQL editor

Manually write the SQL query. The Metadata Explorer can be used, and the query can be sampled as it is written.


Database = drop-down

Select the Snowflake database. The special value, [Environment Default], will use the database defined in the Matillion ETL environment. For more information, read Databases, Tables, & Views.


Schema = drop-down

Select the Snowflake schema. The special value, [Environment Default], will use the schema defined in the Matillion ETL environment. For more information, read Database, Schema, & Share DDL.


Target Table Name = drop-down

The name of the target table.


Concurrency = drop-down

  • Concurrent: Iterations run concurrently. This requires all "Variables to Iterate" to be defined as copied variables, so that each iteration gets its own copy of the variable isolated from the same variable being used by other concurrent executions.
  • Sequential: Iterations run in sequence, waiting for each to complete before starting the next. This is the default setting.

The maximum concurrency is limited by the number of available threads (2x the number of processors on your cloud instance).


Column Mapping = columns editor

  • Column Name: The table column name.
  • Variable Name: Choose an existing variable name. If you wish to define a new environment variable, click Project and then click Manage Environment Variables.

To learn more, read Environment Variables.


Order By = dual listbox

Move columns to the right-hand box to define the "order by" sequence.

This property is only available when Concurrency is set to Sequential.


Sort = drop-down

Choose to order by ascending or descending. The default is ascending.

This property is only available when Concurrency is set to Sequential.


Break on Failure = drop-down

  • No: Attempt to run the attached component for each iteration, regardless of success or failure. This is the default setting.
  • Yes: If the attached component does not run successfully, fail immediately.

If a failure occurs during any iteration, the failure link is followed. This parameter controls whether it is followed immediately or after all iterations have been attempted.

This property is only available when Concurrency is set to Sequential. When set to Concurrent, all iterations will be attempted.


Record Values In Task History = drop-down

Choose whether to record iteration values in the Matillion ETL Task History. The default setting is Yes.


Stop On Condition = drop-down

Select Yes to stop the iteration based on a condition specified in the Condition property. The default setting is No.

For this property to be available, set Concurrency to Sequential.


Mode = drop-down

Select the method of creating the condition.

  • Simple: A no-code Condition UI will open, where users must specify an Input Variable, Qualifier, Comparator, and Value using drop-down menus and text fields. This is the default setting.
  • Advanced: An editor will open, where users must write the condition manually using SQL.

Condition (Simple mode) = columns editor

  • Input Variable: An input variable to form a condition around.
  • Qualifier:
    • Is: Compares the input variable to the value using the Comparator.
    • Not: Reverses the effect of the comparison, so "Equals" becomes "Not equals", "Less than" becomes "Greater than or equal to", etc.
  • Comparator: Select the comparator. Available comparison operators include "Less than", "Less than or equal to", "Equal to", "Greater than or equal to", "Greater than", and "Blank".
  • Value: Specify the value to be compared.

Condition (Advanced mode) = text editor

Manually write the condition in the editor. This editor accepts conditions written in JavaScript.


Combine Conditions = drop-down

Use the defined conditions in combination with one another according to either And or Or.

This property is only available when Mode is set to Simple.

Name = string

A human-readable name for the component.


Mode = drop-down

  • Basic: Construct the query using the clickable properties. This is the default setting.
  • Advanced: Manually write the SQL query.

Query = SQL editor

Manually write the SQL query. The Metadata Explorer can be used, and the query can be sampled as it is written.


Schema = drop-down

Select the Redshift schema. The special value, [Environment Default], will use the schema defined in the Matillion ETL environment. For more information, read Schemas.


Target Table Name = drop-down

The name of the target table.


Concurrency = drop-down

  • Concurrent: Iterations run concurrently. This requires all "Variables to Iterate" to be defined as copied variables, so that each iteration gets its own copy of the variable isolated from the same variable being used by other concurrent executions.
  • Sequential: Iterations run in sequence, waiting for each to complete before starting the next. This is the default setting.

The maximum concurrency is limited by the number of available threads (2x the number of processors on your cloud instance).


Column Mapping = columns editor

  • Column Name: The table column name.
  • Variable Name: Choose an existing variable name. If you wish to define a new environment variable, click Project and then click Manage Environment Variables.

To learn more, read Environment Variables.


Order By = dual listbox

Move columns to the right-hand box to define the "order by" sequence.

This property is only available when Concurrency is set to Sequential.


Sort = drop-down

Choose to order by ascending or descending. The default is ascending.

This property is only available when Concurrency is set to Sequential.


Break on Failure = drop-down

  • No: Attempt to run the attached component for each iteration, regardless of success or failure. This is the default setting.
  • Yes: If the attached component does not run successfully, fail immediately.

If a failure occurs during any iteration, the failure link is followed. This parameter controls whether it is followed immediately or after all iterations have been attempted.

This property is only available when Concurrency is set to Sequential. When set to Concurrent, all iterations will be attempted.


Record Values In Task History = drop-down

Choose whether to record iteration values in the Matillion ETL Task History. The default setting is Yes.


Stop On Condition = drop-down

Select Yes to stop the iteration based on a condition specified in the Condition property. The default setting is No.

For this property to be available, set Concurrency to Sequential.


Mode = drop-down

Select the method of creating the condition.

  • Simple: A no-code Condition UI will open, where users must specify an Input Variable, Qualifier, Comparator, and Value using drop-down menus and text fields. This is the default setting.
  • Advanced: An editor will open, where users must write the condition manually using SQL.

Condition (Simple mode) = columns editor

  • Input Variable: An input variable to form a condition around.
  • Qualifier:
    • Is: Compares the input variable to the value using the Comparator.
    • Not: Reverses the effect of the comparison, so "Equals" becomes "Not equals", "Less than" becomes "Greater than or equal to", etc.
  • Comparator: Select the comparator. Available comparison operators include "Less than", "Less than or equal to", "Equal to", "Greater than or equal to", "Greater than", and "Blank".
  • Value: Specify the value to be compared.

Condition (Advanced mode) = text editor

Manually write the condition in the editor. This editor accepts conditions written in JavaScript.


Combine Conditions = drop-down

Use the defined conditions in combination with one another according to either And or Or.

This property is only available when Mode is set to Simple.

Name = string

A human-readable name for the component.


Mode = drop-down

  • Basic: Construct the query using the clickable properties. This is the default setting.
  • Advanced: Manually write the SQL query.

Query = SQL editor

Manually write the SQL query. The Metadata Explorer can be used, and the query can be sampled as it is written.


Project = drop-down

Select the Google Cloud project. The special value, [Environment Default], will use the project defined in the environment.

For more information, read Creating and managing projects.


Dataset = drop-down

Select the Google Cloud dataset to load data into. The special value, [Environment Default], will use the dataset defined in the environment.

For more information, read Introduction to datasets.


Target Table Name = drop-down

The name of the target table.


Concurrency = drop-down

  • Concurrent: Iterations run concurrently. This requires all "Variables to Iterate" to be defined as copied variables, so that each iteration gets its own copy of the variable isolated from the same variable being used by other concurrent executions.
  • Sequential: Iterations run in sequence, waiting for each to complete before starting the next. This is the default setting.

The maximum concurrency is limited by the number of available threads (2x the number of processors on your cloud instance).


Column Mapping = columns editor

  • Column Name: The table column name.
  • Variable Name: Choose an existing variable name. If you wish to define a new environment variable, click Project and then click Manage Environment Variables.

To learn more, read Environment Variables.


Order By = dual listbox

Move columns to the right-hand box to define the "order by" sequence.

This property is only available when Concurrency is set to Sequential.


Sort = drop-down

Choose to order by ascending or descending. The default is ascending.

This property is only available when Concurrency is set to Sequential.


Break on Failure = drop-down

  • No: Attempt to run the attached component for each iteration, regardless of success or failure. This is the default setting.
  • Yes: If the attached component does not run successfully, fail immediately.

If a failure occurs during any iteration, the failure link is followed. This parameter controls whether it is followed immediately or after all iterations have been attempted.

This property is only available when Concurrency is set to Sequential. When set to Concurrent, all iterations will be attempted.


Record Values In Task History = drop-down

Choose whether to record iteration values in the Matillion ETL Task History. The default setting is Yes.


Stop On Condition = drop-down

Select Yes to stop the iteration based on a condition specified in the Condition property. The default setting is No.

For this property to be available, set Concurrency to Sequential.


Mode = drop-down

Select the method of creating the condition.

  • Simple: A no-code Condition UI will open, where users must specify an Input Variable, Qualifier, Comparator, and Value using drop-down menus and text fields. This is the default setting.
  • Advanced: An editor will open, where users must write the condition manually using SQL.

Condition (Simple mode) = columns editor

  • Input Variable: An input variable to form a condition around.
  • Qualifier:
    • Is: Compares the input variable to the value using the Comparator.
    • Not: Reverses the effect of the comparison, so "Equals" becomes "Not equals", "Less than" becomes "Greater than or equal to", etc.
  • Comparator: Select the comparator. Available comparison operators include "Less than", "Less than or equal to", "Equal to", "Greater than or equal to", "Greater than", and "Blank".
  • Value: Specify the value to be compared.

Condition (Advanced mode) = text editor

Manually write the condition in the editor. This editor accepts conditions written in JavaScript.


Combine Conditions = drop-down

Use the defined conditions in combination with one another according to either And or Or.

This property is only available when Mode is set to Simple.


Variable Exports

This component makes the following values available to export into variables:

Source Description
Iteration Attempted The number of iterations that this component attempts to reach (Max Iterations parameter).
Iteration Generated The number of iterations that have been initiated. Iterators terminate after failure, so this number will be the successful iterations plus any potential failures.
Iteration Successful The number of iterations successfully performed. This is the max iteration number, minus failures and any unattempted iterations (since the component terminates after failure).

Video


Snowflake Delta Lake on Databricks Amazon Redshift Google BigQuery Azure Synapse Analytics