Google Analytics
This page describes how to configure a Google Analytics data source. With Data Loader, you can replicate and load your source data into your target destination. You can choose between two data sources: Google Universal Analytics, the default data source, and Google Analytics 4 (GA4). You can select the GA4 data source using the Advanced Settings options. Refer to the note in the Connect to Google Analytics section for more information.
Schema Drift Support: Yes. Read Schema Drift to learn more.
Return to any page of this wizard by clicking Previous.
Click X in the upper-right of the UI and then click Yes, discard to close the pipeline creation wizard.
Prerequisites
- Read the Allowed IP addresses topic before you begin. You may not be able to connect to certain data sources without first allowing the Batch IP addresses. In these circumstances, connection tests will always fail and you will not be able to complete the pipeline.
- You must have an active Google Analytics account.
Create pipeline
- In Data Loader, click Add pipeline.
- Choose Google Analytics from the grid of data sources.
- Choose Batch Loading.
Connect to Google Analytics
Configure the Google Analytics database connection settings, specifying the following:
Property | Description |
---|---|
Google Analytics Connection | Select a connection from the drop-down menu, or click Add Connection if one doesn't exist. |
Connection Name | Give a unique name for the connection, and click Connect. A new browser tab will open, where Google will ask you to confirm authorization using valid credentials. |
Advanced settings | Additional JDBC parameters or connection settings. Expand the Advanced settings, and choose a parameter from the drop-down menu. Enter a value for the parameter, and click Add parameter for any extra parameters you want to add. Read the Google Analytics data model for more information about Google Analytics JDBC connection settings. For a list of compatible connection properties, read Allowed connection properties. |
To select the Google Analytics 4 (GA4) data source, access the Advanced Settings and add a parameter with the following selections: Parameter = Schema, Value = GoogleAnalytics4
.
Click Continue.
Choose sources
Choose any data sources (tables) you wish to include in the pipeline. Use the arrow buttons to move tables to the Sources to extract and load listbox and then reorder any sources with click-and-drag. Additionally, select multiple sources using the SHIFT
key.
The list of available sources will vary according to whether you are using the Google Universal Analytics or Google Analytics 4 (GA4) data source.
Select the start date of the data by clicking the date button and selecting from the calendar.
Click Continue with X sources to move forward.
Review your data set
Choose the columns from each table to include in the pipeline. By default, Data Loader selects all columns from a table.
Click Configure on a table to open Select columns. Use the arrow buttons to move columns out of the Columns to extract and load listbox. Order columns with click-and-drag. Select multiple columns using SHIFT
.
Click Done to continue.
Click Continue once you have configured each table.
Choose destination
- Choose an existing destination or click Add a new destination.
- Select a destination from Snowflake, Amazon Redshift, or Google BigQuery.
Set frequency
Property | Description |
---|---|
Pipeline name | A descriptive label for your pipeline. This is how the pipeline appears on the pipeline dashboard and how Data Loader refers to the pipeline. |
Sync every | The frequency at which the pipeline should sync. Day values include 1—7. Hour values include 1—23. Minute values include 5—59. The input is also the length of delay before the first sync. |
Currently, you can't specify a start time.
Once you are happy with your pipeline configuration, click Create pipeline to complete the process and add the pipeline to your dashboard.