Connect to AI Runtime
AI Runtime for single-node tasks is in Public Preview. The distributed training API for multi-GPU workloads remains in Beta.
This article describes how to connect to AI Runtime from interactive notebooks, scheduled jobs, and the Jobs API.
Interactive (Notebooks)
This is the primary way to use AI Runtime. To connect your notebook and configure the environment:
- From a notebook, click the Connect drop-down menu at the top and select Serverless GPU.
- Click the icon to open the Environment side panel.
- Select A10 or H100 from the Accelerator field.
- Select None for the default environment or AI v4 for the AI environment from the Base environment field.
- Click Apply, then click Confirm to apply AI Runtime to your notebook environment.
The connection to your compute auto-terminates after 60 minutes of inactivity.
For operations that do not require GPUs (for example, cloning a Git repository, converting data formats, or exploratory data analysis), attach your notebook to a CPU cluster to preserve GPU resources.
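When deciding which compute to attach, it can help to confirm from the notebook itself whether a GPU is actually visible. The following is a minimal sketch that assumes PyTorch is present in the AI environment and falls back gracefully when it is not, so the same cell also runs on a CPU cluster:

```python
def gpu_available() -> bool:
    """Return True if a CUDA-capable GPU is visible to PyTorch.

    Returns False when PyTorch is not installed, so this check is
    safe to run on CPU-only compute as well.
    """
    try:
        import torch  # assumed available in the AI environment
    except ImportError:
        return False
    return torch.cuda.is_available()

print("GPU available:", gpu_available())
```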
Jobs (Scheduled)
You can schedule notebooks that use serverless GPU as recurring jobs. See Create and manage scheduled notebook jobs for more details.
After you open the notebook you want to use:
- Select the Schedule button on the top right.
- Select Add schedule.
- Populate the New schedule form with the Job name, Schedule, and Compute.
- Select Create.
You can also create and schedule jobs from the Jobs and pipelines UI. See Create a new job for step-by-step guidance.
Adding dependencies using the Environments panel is not supported for serverless GPU scheduled jobs. Keep the following in mind:
- Install dependencies programmatically within your notebook (for example, with %pip install).
- Auto-recovery is not supported. If your job fails due to incompatible packages, fix the packages manually and re-run.
- For workloads that might exceed the 7-day maximum runtime, implement manual checkpointing so the job can resume.
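Manual checkpointing can be as simple as periodically writing training state to durable storage and reading it back on startup. The helper below is a hypothetical sketch using only the standard library; the checkpoint path, state contents, and loop are placeholders to adapt to your framework and storage location:

```python
import json
import os

# Hypothetical path for illustration; in practice, write to durable
# storage (for example, a Unity Catalog volume) rather than /tmp.
CHECKPOINT_PATH = "/tmp/train_checkpoint.json"

def load_checkpoint() -> dict:
    """Resume from the last saved state, or start fresh."""
    if os.path.exists(CHECKPOINT_PATH):
        with open(CHECKPOINT_PATH) as f:
            return json.load(f)
    return {"step": 0}

def save_checkpoint(state: dict) -> None:
    """Atomically persist training state so a re-run can resume."""
    tmp = CHECKPOINT_PATH + ".tmp"
    with open(tmp, "w") as f:
        json.dump(state, f)
    os.replace(tmp, CHECKPOINT_PATH)

state = load_checkpoint()
for step in range(state["step"], 10):  # stand-in for the real training loop
    state["step"] = step + 1
    if state["step"] % 5 == 0:  # checkpoint every 5 steps
        save_checkpoint(state)
```

If the job is re-run after a failure, `load_checkpoint` returns the last saved step and the loop resumes from there instead of from zero.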
Jobs API and Databricks Asset Bundles
You can programmatically create and manage AI Runtime jobs using the Databricks Jobs API or Databricks Asset Bundles. Configure the compute type as serverless GPU in your job or bundle definition to automate deployment pipelines.
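Before wiring this into a deployment pipeline, it can help to see the shape of the request body a call to the Jobs API `jobs/create` endpoint would carry. The sketch below only builds and prints the JSON payload; the field names mirror the asset bundle example, and the job name, notebook path, and accelerator value are placeholders:

```python
import json

# Hypothetical request body for the Jobs API jobs/create endpoint.
# Field names mirror the asset bundle example; values are placeholders.
payload = {
    "name": "sample_job_h100",
    "environments": [
        {"environment_key": "default", "spec": {"environment_version": "4"}}
    ],
    "tasks": [
        {
            "task_key": "notebook_task",
            "notebook_task": {
                "notebook_path": "/Workspace/Users/your_email/your_notebook"
            },
            "environment_key": "default",
            "compute": {"hardware_accelerator": "GPU_8xH100"},
        }
    ],
}

print(json.dumps(payload, indent=2))
```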
The following example shows a Databricks Asset Bundle configuration for an AI Runtime job on serverless GPU compute:
```yaml
resources:
  jobs:
    sample_job:
      name: sample_job_h100
      trigger:
        periodic:
          interval: 1
          unit: DAYS
      parameters:
        - name: catalog
          default: ${var.catalog}
        - name: schema
          default: ${var.schema}
      environments:
        - environment_key: default
          spec:
            environment_version: '4'
      tasks:
        - task_key: notebook_task
          notebook_task:
            notebook_path: /Workspace/Users/your_email/your_notebook
          environment_key: default
          compute:
            hardware_accelerator: GPU_8xH100
```