Create a ServiceNow ingestion pipeline
The ServiceNow connector is in gated Public Preview. To participate in the preview, contact your Databricks account team.
This article describes how to create a ServiceNow ingestion pipeline using Databricks Lakeflow Connect.
Before you begin
To create an ingestion pipeline, you must meet the following requirements:
- Your workspace is enabled for Unity Catalog.
- Serverless compute is enabled for your workspace. See Enable serverless compute.
- If you plan to create a connection: You have `CREATE CONNECTION` privileges on the metastore. If you plan to use an existing connection: You have `USE CONNECTION` privileges or `ALL PRIVILEGES` on the connection object.
- You have `USE CATALOG` privileges on the target catalog.
- You have `USE SCHEMA` and `CREATE TABLE` privileges on an existing schema, or `CREATE SCHEMA` privileges on the target catalog.
To ingest from ServiceNow, see Configure ServiceNow for Databricks ingestion.
Create the ingestion pipeline
Permissions required: `USE CONNECTION` or `ALL PRIVILEGES` on a connection.
This step describes how to create the ingestion pipeline. Each ingested table is written to a streaming table with the same name.
You can create the pipeline using the Databricks UI, a Databricks notebook, or the Databricks CLI.

Databricks UI
- In the sidebar of the Databricks workspace, click Data Ingestion.
- On the Add data page, under Databricks connectors, click ServiceNow. The ingestion wizard opens.
- On the Ingestion pipeline page of the wizard, enter a unique name for the pipeline.
- In the Destination catalog drop-down menu, select a catalog. Ingested data and event logs will be written to this catalog. You'll select a destination schema later.
- Select the Unity Catalog connection that stores the credentials required to access the source data. If there are no existing connections to the source, click Create connection and enter the authentication details you obtained in Configure ServiceNow for Databricks ingestion. You must have `CREATE CONNECTION` privileges on the metastore.
- Click Create pipeline and continue.
- On the Source page, select the tables to ingest into Databricks, and then click Next. If you select All tables, the connector writes all existing and future tables in the source schema to the destination schema. There is a maximum of 250 tables per pipeline.
- On the Destination page, select the Unity Catalog catalog and schema to write to. If you don't want to use an existing schema, click Create schema. You must have `USE CATALOG` and `CREATE SCHEMA` privileges on the parent catalog.
- Click Save pipeline and continue.
- (Optional) On the Settings page, click Create schedule. Set the frequency at which to refresh the destination tables.
- (Optional) Set email notifications for pipeline operation success or failure.
- Click Save and run pipeline.
Databricks notebook

- Generate a personal access token and copy the token so you can paste it into a notebook later. See Databricks personal access tokens for workspace users.
- Import the following notebook to your workspace: Create a ServiceNow ingestion pipeline.
- Modify the following values in the notebook (an illustrative sketch of these cells follows these steps):
  Cell 1:
  - `api_token`: The personal access token you generated
  Cell 3:
  - `name`: A name for the pipeline
  - `connection_name`: The name of the Unity Catalog connection you created in Catalog Explorer (Catalog > External data > Connections). If you don't have an existing connection to the source, you can create one. You must have the `CREATE CONNECTION` privilege on the metastore.
  - `source_table`: The name of the source table
  - `destination_catalog`: A name for the destination catalog that will contain the ingested data
  - `destination_schema`: A name for the destination schema that will contain the ingested data
  - `scd_type`: The SCD method to use: `SCD_TYPE_1` or `SCD_TYPE_2`. For more information, see SCD type 1 vs. type 2.
- Click Run all.
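The notebook drives the Pipelines REST API with the values above. As a rough orientation only, here is a minimal sketch of what Cells 1 and 3 might look like. It is not the notebook's actual contents; the endpoint, payload field names, and placeholder values (workspace URL, connection, table, catalog, and schema names) are assumptions for illustration.

```python
import requests

# Cell 1: workspace URL and the personal access token you generated.
# Both values are illustrative placeholders.
workspace_url = "https://<your-workspace>.cloud.databricks.com"
api_token = "<personal-access-token>"

# Cell 3: pipeline settings. All names below are example values for this sketch.
name = "servicenow_ingestion_pipeline"
connection_name = "my_servicenow_connection"
source_table = "incident"
destination_catalog = "main"
destination_schema = "servicenow_ingest"
scd_type = "SCD_TYPE_2"  # or "SCD_TYPE_1"

# Build a managed ingestion pipeline definition and create the pipeline through
# the Pipelines REST API. Field names are assumed from the pipeline spec; the
# actual notebook may structure this differently.
pipeline_spec = {
    "name": name,
    "ingestion_definition": {
        "connection_name": connection_name,
        "objects": [
            {
                "table": {
                    "source_table": source_table,
                    "destination_catalog": destination_catalog,
                    "destination_schema": destination_schema,
                    "table_configuration": {"scd_type": scd_type},
                }
            }
        ],
    },
}

response = requests.post(
    f"{workspace_url}/api/2.0/pipelines",
    headers={"Authorization": f"Bearer {api_token}"},
    json=pipeline_spec,
)
response.raise_for_status()
print(response.json())  # includes the pipeline_id of the new pipeline
```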
Databricks CLI

To create the pipeline:
databricks pipelines create --json "<pipeline definition or json file path>"
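The pipeline definition you pass with `--json` follows the managed ingestion pipeline spec. The following is a minimal, illustrative sketch rather than a complete or authoritative definition; the field names and the connection, table, catalog, and schema values are assumptions for this example, so compare them against the output of `databricks pipelines get` for a pipeline created through the UI.

```json
{
  "name": "servicenow_ingestion_pipeline",
  "ingestion_definition": {
    "connection_name": "my_servicenow_connection",
    "objects": [
      {
        "table": {
          "source_table": "incident",
          "destination_catalog": "main",
          "destination_schema": "servicenow_ingest",
          "table_configuration": {
            "scd_type": "SCD_TYPE_2"
          }
        }
      }
    ]
  }
}
```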
To edit the pipeline:
databricks pipelines update --json "<pipeline definition or json file path>"
To get the pipeline definition:
databricks pipelines get "<pipeline-id>"
To delete the pipeline:
databricks pipelines delete "<pipeline-id>"
For more information, run:
databricks pipelines --help
databricks pipelines <create|update|get|delete|...> --help
Update your pipeline schedule and notifications
- After the pipeline has been created, revisit the Databricks workspace, and then click Pipelines. The new pipeline appears in the pipeline list.
- To view the pipeline details, click the pipeline name.
- On the pipeline details page, you can schedule the pipeline by clicking Schedule.
- To set notifications on the pipeline, click Settings, and then add a notification.
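Email notifications can also be managed outside the UI. As an illustrative sketch only, a notification might be expressed in the pipeline settings JSON that you pass to `databricks pipelines update --json` (see the CLI commands above). The `notifications` field, the alert names, and the recipient address shown here are assumptions to verify against your pipeline's definition from `databricks pipelines get`.

```json
{
  "notifications": [
    {
      "email_recipients": ["data-team@example.com"],
      "alerts": ["on-update-success", "on-update-failure"]
    }
  ]
}
```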