Connect to managed ingestion sources

This article describes how to create connections in Catalog Explorer that store authentication details for Lakeflow Connect managed ingestion sources. Any user with USE CONNECTION privileges or ALL PRIVILEGES on the connection can then create managed ingestion pipelines from sources like Salesforce and SQL Server.

An admin user must complete the steps in this article if the users who will create pipelines:

  • are non-admin users.
  • will use Databricks APIs, Databricks SDKs, the Databricks CLI, or Databricks Asset Bundles.

These interfaces require that users specify an existing connection when they create a pipeline.

Alternatively, admin users can create a connection and a pipeline at the same time in the data ingestion UI. See Managed connectors in Lakeflow Connect.
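
When using the programmatic interfaces, a pipeline author can first check which connections already exist and reference one by name. The following is a minimal sketch with the Databricks SDK for Python, assuming the databricks-sdk package is installed and default Databricks authentication is configured:

    # Minimal sketch: list existing Unity Catalog connections with the
    # Databricks SDK for Python, so a pipeline definition can reference
    # one by name.
    from databricks.sdk import WorkspaceClient

    # Uses standard Databricks authentication (environment variables,
    # a configuration profile, or notebook context).
    w = WorkspaceClient()

    for conn in w.connections.list():
        print(conn.name, conn.connection_type)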

Lakeflow Connect vs. Lakehouse Federation

Lakehouse Federation allows you to query external data sources without moving your data. When you have a choice between Lakeflow Connect and Lakehouse Federation, choose Lakehouse Federation for ad hoc reporting or proof-of-concept work on your ETL pipelines. See What is Lakehouse Federation?.

Privilege requirements

The user privileges required to connect to a managed ingestion source depend on the interface you choose:

  • Data ingestion UI

    Admin users can create a connection and a pipeline at the same time. This end-to-end ingestion wizard is only available in the UI. Not all managed ingestion connectors support UI-based pipeline authoring.

  • Catalog Explorer

    Using Catalog Explorer separates connection creation from pipeline creation, which allows an admin to create connections that non-admin users can then use to create pipelines.

    If the users who will create pipelines are non-admin users or plan to use Databricks APIs, Databricks SDKs, the Databricks CLI, or Databricks Asset Bundles, an admin must first create the connection in Catalog Explorer. These interfaces require that users specify an existing connection when they create a pipeline.

The following summarizes the supported interfaces and the required user privileges for each scenario.

Scenario 1: An admin user creates a connection and an ingestion pipeline at the same time.

Supported interfaces:

  • Data ingestion UI

Required user privileges:

  • CREATE CONNECTION on the metastore
  • USE CATALOG on the target catalog
  • (SaaS apps) USE SCHEMA and CREATE TABLE on an existing schema, or CREATE SCHEMA on the target catalog
  • (Databases) USE SCHEMA, CREATE TABLE, and CREATE VOLUME on an existing schema, or CREATE SCHEMA on the target catalog

Scenario 2: An admin user creates a connection for non-admin users to create pipelines with.

Supported interfaces:

  • Admin: Catalog Explorer
  • Non-admin: Data ingestion UI, Databricks APIs, Databricks SDKs, the Databricks CLI, or Databricks Asset Bundles

Required user privileges:

  • Admin: CREATE CONNECTION on the metastore
  • Non-admin:
    • USE CONNECTION or ALL PRIVILEGES on an existing connection
    • USE CATALOG on the target catalog
    • (SaaS apps) USE SCHEMA and CREATE TABLE on an existing schema, or CREATE SCHEMA on the target catalog
    • (Databases) USE SCHEMA, CREATE TABLE, and CREATE VOLUME on an existing schema, or CREATE SCHEMA on the target catalog
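
After creating a connection, an admin can grant USE CONNECTION to the users or groups who will create pipelines. As a rough illustration, this Databricks SDK for Python sketch grants the privilege on a hypothetical connection to a hypothetical group; both names are placeholders.

    from databricks.sdk import WorkspaceClient
    from databricks.sdk.service.catalog import (
        PermissionsChange,
        Privilege,
        SecurableType,
    )

    w = WorkspaceClient()

    # Grant USE CONNECTION on an existing connection so group members
    # can create ingestion pipelines with it.
    w.grants.update(
        securable_type=SecurableType.CONNECTION,
        full_name="my_ingestion_connection",  # placeholder connection name
        changes=[
            PermissionsChange(
                principal="data-engineers",  # placeholder group name
                add=[Privilege.USE_CONNECTION],
            )
        ],
    )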

Google Analytics Raw Data

To create a Google Analytics Raw Data connection in Catalog Explorer, do the following:

  1. In the Databricks workspace, click Catalog > External Data > Connections > Create connection.
  2. On the Connection basics page of the Set up connection wizard, specify a unique Connection name.
  3. In the Connection type drop-down menu, select Google Analytics Raw Data.
  4. (Optional) Add a comment.
  5. Click Next.
  6. In the service_account_json field, paste the service account JSON details that you downloaded from BigQuery in the source setup.
  7. Click Create connection.
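
Before pasting the service account JSON, it can help to confirm that the file parses and contains the standard Google service account fields. A minimal sketch; the file path is a placeholder.

    import json

    # Sanity-check a downloaded Google service account key file before
    # pasting its contents into the service_account_json field.
    with open("service-account.json") as f:  # placeholder path
        key = json.load(f)  # fails loudly if the file isn't valid JSON

    # Fields present in every Google service account key.
    for field in ("type", "project_id", "private_key", "client_email"):
        assert field in key, f"missing field: {field}"

    print("Key file looks well formed for", key["client_email"])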

Salesforce

Lakeflow Connect supports ingesting data from the Salesforce Platform. Databricks also offers a zero-copy connector in Lakehouse Federation to run federated queries on Salesforce Data Cloud.

To create a Salesforce ingestion connection in Catalog Explorer, do the following:

  1. In the Databricks workspace, click Catalog > External Data > Connections > Create connection.

  2. On the Connection basics page of the Set up connection wizard, specify a unique Connection name.

  3. In the Connection type drop-down menu, select Salesforce.

  4. (Optional) Add a comment.

  5. Click Next.

  6. If you're ingesting from a Salesforce sandbox account, set Is sandbox to true.

  7. Click Sign in with Salesforce.

    You're redirected to Salesforce.

  8. If you're ingesting from a Salesforce sandbox, click Use Custom Domain, provide the sandbox URL, and then click Continue.

  9. Enter your Salesforce credentials and click Log in. Databricks recommends logging in as a Salesforce user that's dedicated to Databricks ingestion.

    Important: For security purposes, only authenticate if you clicked an OAuth 2.0 link in the Databricks UI.

  10. After returning to the ingestion wizard, click Create connection.

ServiceNow

To create a ServiceNow connection in Catalog Explorer, do the following:

  1. Configure OAuth. For instructions, see Configure ServiceNow for Databricks ingestion.

  2. In the Databricks workspace, click Catalog > External Data > Connections > Create connection.

  3. On the Connection basics page of the Set up connection wizard, specify a unique Connection name.

  4. In the Connection type drop-down menu, select ServiceNow.

  5. (Optional) Add a comment.

  6. Click Next.

  7. On the Authentication page, enter the following:

    • Instance ID: Your ServiceNow instance ID.
    • OAuth scope: Leave the default value useraccount.
    • Client secret: The client secret that you obtained in the source setup.
    • Client ID: The client ID that you obtained in the source setup.
  8. Click Sign in with ServiceNow.

  9. Sign in using your ServiceNow credentials.

    You're redirected to the Databricks workspace.

  10. Click Create connection.

SharePoint

The steps to create a SharePoint connection in Catalog Explorer depend on the OAuth method you choose. The following methods are supported:

  • User-to-machine (U2M) authentication
  • Manual token refresh authentication

Databricks recommends U2M authentication because it computes and refreshes the token for you automatically, simplifies granting the Entra ID client access to your SharePoint files, and is more secure.

User-to-machine (U2M) authentication

  1. Complete the source setup. You'll use the authentication details that you obtain to create the connection.

  2. In the Databricks workspace, click Catalog > External Data > Connections > Create connection.

  3. On the Connection basics page of the Set up connection wizard, specify a unique Connection name.

  4. In the Connection type drop-down menu, select Microsoft SharePoint.

  5. In the Auth type drop-down menu, select OAuth.

  6. (Optional) Add a comment.

  7. Click Next.

  8. On the Authentication page, enter the following credentials for your Microsoft Entra ID app:

    • OAuth scope: Leave the OAuth scope set to the pre-filled value.
    • Client secret: The client secret that you retrieved in the source setup.
    • Client ID: The client ID that you retrieved in the source setup.
    • Domain: The SharePoint instance URL in the following format: https://MYINSTANCE.sharepoint.com
    • Tenant ID: The tenant ID that you retrieved in the source setup.

  9. Click Sign in with Microsoft SharePoint.

    A new window opens. After you sign in with your SharePoint credentials, the permissions you’re granting to the Entra ID app are shown.

  10. Click Accept.

    A Successfully authorized message appears, and you're redirected to the Databricks workspace.

  11. Click Create connection.

Manual token refresh authentication

  1. Complete the source setup. You'll use the authentication details that you obtain to create the connection.

  2. In the Databricks workspace, click Catalog > External Data > Connections > Create connection.

  3. On the Connection basics page of the Set up connection wizard, specify a unique Connection name.

  4. In the Connection type drop-down menu, select Microsoft SharePoint.

  5. In the Auth type drop-down menu, select OAuth Refresh Token.

  6. (Optional) Add a comment.

  7. Click Next.

  8. On the Authentication page, enter the following credentials for your Microsoft Entra ID app:

    • Tenant ID: The tenant ID that you retrieved in the source setup.
    • Client ID: The client ID that you retrieved in the source setup.
    • Client secret: The client secret that you retrieved in the source setup.
    • Refresh token: The refresh token that you retrieved in the source setup.

  9. Click Create connection.
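
The source setup covers how to obtain the refresh token. For orientation only, the sketch below shows the general shape of exchanging an authorization code for a refresh token at the Microsoft Entra ID v2.0 token endpoint; the identifiers, authorization code, redirect URI, and scopes are all placeholders that must come from your own app registration and source setup.

    import requests

    TENANT_ID = "<tenant-id>"          # placeholders from your Entra ID app
    CLIENT_ID = "<client-id>"
    CLIENT_SECRET = "<client-secret>"

    # Standard Microsoft Entra ID v2.0 token endpoint.
    token_url = f"https://login.microsoftonline.com/{TENANT_ID}/oauth2/v2.0/token"

    # Exchange an authorization code (from the browser consent step) for
    # tokens. Requesting offline_access is what yields a refresh token.
    resp = requests.post(
        token_url,
        data={
            "grant_type": "authorization_code",
            "client_id": CLIENT_ID,
            "client_secret": CLIENT_SECRET,
            "code": "<authorization-code>",           # placeholder
            "redirect_uri": "http://localhost:8000",  # must match the app registration
            "scope": "offline_access <your-scopes>",  # placeholder scopes
        },
    )
    resp.raise_for_status()
    print(resp.json()["refresh_token"])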

SQL Server

To create a Microsoft SQL Server connection in Catalog Explorer, do the following:

  1. In the Databricks workspace, click Catalog > External Data > Connections.
  2. Click Create connection.
  3. Enter a unique Connection name.
  4. For Connection type, select SQL Server.
  5. For Host, specify the SQL Server domain name.
  6. For User and Password, enter your SQL Server login credentials.
  7. Click Create.
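
If you prefer to script this step, the connection can also be created with the Databricks SDK for Python. The following is a minimal sketch, assuming the host, port, user, and password option keys; all values are placeholders, and storing the password in a secret is preferable to hard-coding it.

    from databricks.sdk import WorkspaceClient
    from databricks.sdk.service.catalog import ConnectionType

    w = WorkspaceClient()

    # Create a Unity Catalog connection for SQL Server. Option keys are
    # assumed; all values below are placeholders.
    w.connections.create(
        name="my_sqlserver_connection",
        connection_type=ConnectionType.SQLSERVER,
        options={
            "host": "myserver.example.com",
            "port": "1433",
            "user": "ingestion_user",
            "password": "<password>",  # prefer a secret over a literal
        },
    )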

Workday Reports

To create a Workday Reports connection in Catalog Explorer, do the following:

  1. Create Workday access credentials. For instructions, see Configure Workday reports for ingestion.
  2. In the Databricks workspace, click Catalog > External Data > Connections > Create connection.
  3. For Connection name, enter a unique name for the Workday connection.
  4. For Connection type, select Workday Reports.
  5. For Auth type, select OAuth Refresh Token.
  6. Enter the Client ID, Client secret, and Refresh token that you obtained in the source setup.
  7. On the Create Connection page, click Create.

Next step

After you create a connection to your managed ingestion source in Catalog Explorer, any user with USE CONNECTION privileges or ALL PRIVILEGES on the connection can create an ingestion pipeline in the following ways:

  • Ingestion wizard (supported connectors only)
  • Databricks Asset Bundles
  • Databricks APIs
  • Databricks SDKs
  • Databricks CLI

For instructions to create a pipeline, see the managed connector documentation.
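
As a rough illustration of the programmatic path, the sketch below uses the Databricks SDK for Python to create a managed ingestion pipeline that references an existing connection. The connection, catalog, schema, and table names are placeholders, and the exact object specification varies by connector, so treat the connector documentation as authoritative.

    from databricks.sdk import WorkspaceClient
    from databricks.sdk.service.pipelines import (
        IngestionConfig,
        IngestionPipelineDefinition,
        TableSpec,
    )

    w = WorkspaceClient()

    # Create an ingestion pipeline that reads one source table through
    # an existing Unity Catalog connection. All names are placeholders.
    pipeline = w.pipelines.create(
        name="my-ingestion-pipeline",
        ingestion_definition=IngestionPipelineDefinition(
            connection_name="my_ingestion_connection",
            objects=[
                IngestionConfig(
                    table=TableSpec(
                        source_schema="objects",        # placeholder (Salesforce-style)
                        source_table="Account",         # placeholder
                        destination_catalog="main",     # placeholder
                        destination_schema="ingested",  # placeholder
                    )
                )
            ],
        ),
    )
    print(pipeline.pipeline_id)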