Ingest data into Lakeflow Designer
This feature is in Public Preview.
This page describes the available options for bringing data into a Visual data prep in Lakeflow Designer. Designer can work with any data accessible through Databricks.
All data ingestion in Designer starts with the Source operator. When you open a Source operator's configuration pane, you have the following options.
Browse existing tables
Click Browse existing to open the asset selector, where you can:
- Browse tables and volumes in Unity Catalog.
- Browse files in the workspace file system.
Use this option when your data is already in Databricks.
Drag and drop a local file
Drag a CSV or Excel file from your local machine directly onto the canvas. Designer uploads the file to the workspace file system and automatically creates a Source operator.
Drag and drop of Excel files requires Excel File Format Support to be enabled. See Read Excel files.
Create a table from a file
Select Create table from file in the Source configuration pane to upload a local CSV or Excel file and create a Unity Catalog table from it. Use this option when you want the data persisted as a managed table rather than stored as a workspace file. Additionally, for large amounts of data, this provides the best performance.
Import from Google Drive
To import a file from Google Drive:
- In the Source configuration pane, select Import from Google Drive.
- Select a Unity Catalog connection that has access to Google Drive.
- Provide the file URL.
This requires a Unity Catalog connection configured for Google Drive access. See Ingest files from Google Drive for how to set up connections.
Import from SharePoint
To import a file from SharePoint:
- In the Source configuration pane, select Import from SharePoint.
- Select a Unity Catalog connection that has access to your SharePoint site.
- Provide the direct file link.
To get the correct URL, right-click the file in SharePoint, select Preview, and copy the URL from the browser address bar. Do not use a sharing link.
This requires a Unity Catalog connection configured for SharePoint access. See Ingest files from SharePoint for how to set up connections.
Use Lakeflow Connect for additional sources
If a data source is not available directly in Designer, check Lakeflow Connect for a managed connector. Lakeflow Connect supports ingestion from SaaS applications (Salesforce, Workday, HubSpot, and more), databases, cloud storage, and streaming sources. After you ingest data with Lakeflow Connect into a Unity Catalog table, you can use it as a Source in Designer.