
Vacuum Table

Perform a vacuum operation on a list of tables. On Databricks, vacuum is a housekeeping task that removes data files that are no longer referenced by a Delta table and are older than the retention threshold, reclaiming the storage they occupied. Vacuum is typically run at the end of an orchestration pipeline.

For more information about the vacuum process, read the Databricks Vacuum documentation.


Properties

Name = string

A human-readable name for the component.


Catalog = drop-down

Select a Databricks Unity Catalog. The special value, [Environment Default], will use the catalog specified in the environment setup. Selecting a catalog will determine which databases are available in the next parameter.


Database = drop-down

Select the Delta Lake database. The special value, [Environment Default], will use the database specified in the environment setup.


Tables to Vacuum = dual listbox

Select which tables to vacuum.
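As an illustration of how the Catalog, Database, and Tables to Vacuum selections could combine into fully qualified table names, here is a minimal Python sketch. The function name qualify_tables, the env_catalog and env_database arguments, and the sample values are hypothetical and are not part of the component; the sketch only assumes that [Environment Default] is replaced by the value configured in the environment.

```python
ENV_DEFAULT = "[Environment Default]"


def quote(identifier: str) -> str:
    """Wrap an identifier in backticks, doubling any embedded backtick."""
    return "`" + identifier.replace("`", "``") + "`"


def qualify_tables(catalog, database, tables, env_catalog, env_database):
    """Return catalog.database.table names for every selected table.

    Hypothetical helper: the special value [Environment Default] falls back
    to the catalog or database supplied by the environment.
    """
    resolved_catalog = env_catalog if catalog == ENV_DEFAULT else catalog
    resolved_database = env_database if database == ENV_DEFAULT else database
    return [
        ".".join(quote(part) for part in (resolved_catalog, resolved_database, table))
        for table in tables
    ]


# Sample values are made up for illustration only.
names = qualify_tables(
    ENV_DEFAULT,
    "sales",
    ["orders", "customers"],
    env_catalog="main",
    env_database="default",
)
print(names)
```

Quoting each part separately keeps names containing spaces or dots unambiguous.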


Retention Period = integer

The retention threshold. Only files older than this threshold are eligible for removal. The default is 7, with the unit specified in Retention Unit.


Retention Unit = drop-down

Select the unit of the Retention Period. Options are Day, Hour, or Week. The default is Day.
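The Databricks VACUUM statement expresses its retention threshold in hours (VACUUM table_name RETAIN num HOURS), so a period given in days or weeks has to be converted. The Python sketch below shows one way that conversion could look. The function name vacuum_statement and the table name are hypothetical, and the SQL that the component actually generates is not documented here, so treat this as an assumption and not as the component's implementation.

```python
# Hours in each unit offered by the Retention Unit drop-down.
HOURS_PER_UNIT = {"Hour": 1, "Day": 24, "Week": 168}


def vacuum_statement(table: str, retention_period: int = 7, retention_unit: str = "Day") -> str:
    """Build a VACUUM statement with the retention converted to hours.

    Hypothetical helper; defaults mirror the documented defaults (7, Day).
    """
    if retention_unit not in HOURS_PER_UNIT:
        raise ValueError(f"Unsupported retention unit: {retention_unit!r}")
    if retention_period < 0:
        raise ValueError("Retention period must not be negative")
    hours = retention_period * HOURS_PER_UNIT[retention_unit]
    return f"VACUUM {table} RETAIN {hours} HOURS"


# The table name is made up for illustration only.
statement = vacuum_statement("main.sales.orders")
print(statement)
```

With the defaults of 7 and Day, the threshold works out to 168 hours. Databricks applies a safety check to retention values shorter than that default, so short values are best tried on non-critical tables first.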

