Skip to main content

Google Analytics Raw Data connector FAQs

Preview

The Google Analytics Raw Data connector is in Public Preview.

This page answers frequently asked questions about the Google Analytics Raw Data connector in Databricks Lakeflow Connect.

General managed connector FAQs

The answers in Managed connector FAQs apply to all managed connectors in Lakeflow Connect. Keep reading for GA4-specific FAQs.

Connector-specific FAQs

The answers in this section are specific to the GA4 connector.

How does the GA4 connector work?

First, the user must export their GA4 data to BigQuery using Google's provided APIs or UIs. From there, Databricks consumes the data from BigQuery using the following APIs:

  • The BigQuery API for metadata operations (for example, to list tables and schemas)
  • The BigQuery Storage API for data ingestion
  • The Cloud Resource Manager API for schema exploration

Why does the connector require BigQuery?

There are two ways to extract GA4 data:

  • Directly from GA4
  • Indirectly using BigQuery

Direct ingestion from GA4 is limited to aggregated data. Indirect ingestion using BigQuery allows the connector to pull the raw, event-level data.

Which Google licenses and permissions does this connector require?

The connector requires:

  • A Google Cloud Platform (GCP) project with the BigQuery API and BigQuery storage API enabled.
  • A GCP service account with the following roles:
    • BigQuery Data Viewer
    • BigQuery Job User
    • BigQuery Read Session User
  • Google Analytics properties that export data to BigQuery