Skip to main content

Connect to Serverless GPU Compute

This article describes how to connect to Serverless GPU Compute from interactive notebooks, scheduled jobs, the Jobs API, and covers networking limitations.

Interactive (Notebooks)

This is the primary way to use Serverless GPU Compute. To connect your notebook and configure the environment:

  1. From a notebook, click the Connect drop-down menu at the top and select Serverless GPU.
  2. Click the Environment icon. to open the Environment side panel.
  3. Select A10 or H100 from the Accelerator field.
  4. Select None for the default environment or AI v4 for the AI environment from the Base environment field.
  5. Click Apply and then Confirm that you want to apply the Serverless GPU Compute to your notebook environment.
note

Connection to your compute auto-terminates after 60 minutes of inactivity.

tip

For operations that do not require GPUs (for example, cloning a Git repository, converting data formats, or exploratory data analysis), attach your notebook to a CPU cluster to preserve GPU resources.

Jobs (Scheduled)

You can schedule notebooks that use Serverless GPU Compute as recurring jobs. See Create and manage scheduled notebook jobs for more details.

After you open the notebook you want to use:

  1. Select the Schedule button on the top right.
  2. Select Add schedule.
  3. Populate the New schedule form with the Job name, Schedule, and Compute.
  4. Select Create.

You can also create and schedule jobs from the Jobs and pipelines UI. See Create a new job for step-by-step guidance.

note

Adding dependencies using the Environments panel is not supported for Serverless GPU Compute scheduled jobs. Dependencies must be installed programmatically within your notebook (for example, %pip install). Auto-recovery is not supported — if your job fails due to incompatible packages, you must manually fix and re-run. For workloads that may exceed the 7-day maximum runtime, implement manual checkpointing to allow resumption.

Jobs API and Databricks Asset Bundles

You can programmatically create and manage serverless GPU jobs using the Databricks Jobs API or Databricks Asset Bundles. Configure the compute type as serverless GPU in your job or bundle definition to automate deployment pipelines.

Networking

PrivateLink is not supported. Storage or pip repositories behind PrivateLink are not accessible from Serverless GPU Compute. Serverless GPU compute is not compatible with workspaces that have compliance security profiles enabled (for example, HIPAA or PCT). If your workspace uses Security Enhanced Groups (SEG) or PrivateLink restrictions, Serverless GPU Compute cannot be used.