Description
|
Google Analytics is a web analytics service offered by Google that tracks and reports website traffic and also the mobile app traffic & events.
|
Supported Replication
|
Initial Sync
Continuous Sync
|
Authentication Type
|
OAuth (BigQuery Rest API)
Google Service Account (BigQuery Storage API)
|
Configurations
Prerequisites
To integrate Google Analytics data into our system, the customer must set up BigQuery Export from their Google Analytics account to their own Big Query data warehouse, and provide the project ID and database ID. Please refer to the Google Documentation for instructions on performing the BigQuery Export.
Once set up, their Google Account will export tables to BigQuery daily. We can then pull data from BigQuery using one of the following methods:
- BigQuery REST API
- BigQuery API must be enabled
- Standard method for retrieving data from BigQuery. Responses are usually large and slow.
- Suitable and faster for smaller datasets.
- Authentication via OAuth.
- BigQuery Storage API
- BigQuery Storage API must be enabled
- A newer API from BigQuery that allows fetching binary data from BigQuery, making fetching much faster.
- Optimal for larger datasets.
- Free 300TB of data fetch + network charges.
- Authentication via Service Account.
- Service Account must have the
BigQuery Read Session User
role enabled.
Both fetching methods are priced, and the cost falls on the customer's side. Please refer to the BigQuery cost documentation for further details.
Connector Configurations
Intraday tables are tables that are updated throughout the day. Enabling this option will sync data from these tables each time the pipeline runs. Note that enabling this option will increase the amount of synced data, but the data will be the most up-to-date. Non-intraday tables will only be synced once.
Collections
Collection Name |
Full/Initial Sync |
Incremental/Continuous Sync |
Supports Update |
Supports Delete |
Notes |
ga_sessions |
✔ |
✔ |
✔ |
✖ |
|
ga_session_custom_dimensions |
✔ |
✔ |
✔ |
✖ |
Normalized table from ga_sessions |
session_hits |
✔ |
✔ |
✔ |
✖ |
Normalized table from ga_sessions |
products |
✔ |
✔ |
✔ |
✖ |
Normalized table from session_hits |
product_custom_dimensions |
✔ |
✔ |
✔ |
✖ |
Normalized table from products |
product_custom_metrics |
✔ |
✔ |
✔ |
✖ |
Normalized table from products |
promotions |
✔ |
✔ |
✔ |
✖ |
Normalized table from session_hits |
experiments |
✔ |
✔ |
✔ |
✖ |
Normalized table from session_hits |
session_hit_custom_variables |
✔ |
✔ |
✔ |
✖ |
Normalized table from session_hits |
session_hit_custom_dimensions |
✔ |
✔ |
✔ |
✖ |
Normalized table from session_hits |
session_hit_custom_metrics |
✔ |
✔ |
✔ |
✖ |
Normalized table from session_hits |
publisher_infos |
✔ |
✔ |
✔ |
✖ |
Normalized table from session_hits |