Google Analytics Raw Data connector

The managed Google Analytics Raw Data connector in Lakeflow Connect allows you to ingest event-level data from Google Analytics 4 (GA4) into Databricks using BigQuery export.

What to know before you start

Topic

Why it matters

Databricks user persona

The workflow depends on your Databricks user persona:

  • Single-user: An admin user creates a Unity Catalog connection and an ingestion pipeline.
  • Multi-user: An admin user creates a connection that non-admin users then use to create pipelines.

Authentication method

The steps to create a connection depend on the authentication method you choose. For supported methods, see Authentication methods.

Interface

The steps to create a pipeline depend on the interface you use.

Ingestion frequency

The pipeline schedule depends on your latency and cost requirements.

Common patterns

Depending on your ingestion needs, the pipeline might use configurations like history tracking, column selection, and row filtering. Supported configurations vary by connector. See Feature availability.

Start ingesting from Google Analytics

The following table provides an overview of the end-to-end Google Analytics Raw Data ingestion flow, based on user type:

User

Steps

Admin

  1. Export your GA4 data to BigQuery. See Set up Google Analytics 4 and Google BigQuery for Databricks ingestion.
  2. Either:

Non-admin

Use any supported interface to create a pipeline from an existing connection. See Ingest data from Google Analytics 4.
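As a rough illustration of the non-admin step, the sketch below assembles a pipeline definition that points at an existing connection. It is not the connector's confirmed schema: the helper `build_ga4_pipeline_spec`, the field names (`ingestion_definition`, `connection_name`, `objects`, `source_catalog`, and so on), the `events` table name, and all sample values are assumptions made for illustration. Check them against the linked ingestion guide before use.

```python
import json


def build_ga4_pipeline_spec(pipeline_name, connection_name, gcp_project,
                            ga4_dataset, dest_catalog, dest_schema,
                            tables=("events",)):
    """Assemble a pipeline definition as a plain dict.

    All field names here are assumptions for illustration; they are not
    taken from this page.
    """
    objects = [
        {
            "table": {
                "source_catalog": gcp_project,    # GCP project holding the BigQuery export
                "source_schema": ga4_dataset,     # BigQuery dataset created by the GA4 export
                "source_table": table,            # assumed table name, e.g. "events"
                "destination_catalog": dest_catalog,
                "destination_schema": dest_schema,
            }
        }
        for table in tables
    ]
    return {
        "name": pipeline_name,
        "ingestion_definition": {
            "connection_name": connection_name,  # connection created earlier by an admin
            "objects": objects,
        },
    }


# Hypothetical values; replace them with your own.
spec = build_ga4_pipeline_spec(
    pipeline_name="ga4_raw_ingest",
    connection_name="ga4_connection",
    gcp_project="my-gcp-project",
    ga4_dataset="analytics_123456789",
    dest_catalog="main",
    dest_schema="ga4_raw",
)
print(json.dumps(spec, indent=2))
```

Building the definition as a plain dictionary keeps the sketch independent of any particular interface. The same structure could be serialized to JSON or adapted to whichever supported interface you use to create the pipeline.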