Skip to main content

Generative AI models maintenance policy

This article describes the model maintenance policy for the Foundation Model APIs pay-per-token, Foundation Model APIs provisioned throughput, and Foundation Model Fine-tuning offerings.

In order to continue supporting the most state-of-the-art models, Databricks might update supported models or retire older models for these offerings.

Model retirement policy

The following sections summarize the retirement policy for the indicated feature offerings. See Retired models for a list of currently retired models and planned retirement dates.

important

The retirement policies that apply to the Foundation Model APIs pay-per-token and Foundation Model Fine-tuning offerings only impact supported chat and completion models.

Foundation Model APIs pay-per-token

The following table summarizes the retirement policy for Foundation Model APIs pay-per-token.

Retirement notification

Transition to retirement

On the retirement date

Databricks takes the following steps to notify customers about a model that is set for retirement:

  • On the Serving page of your Databricks workspace, a warning message appears on the model card that indicates that the model is planned for retirement.
  • The applicable documentation contains a notice that indicates the model is planned for retirement and the start date it will no longer be supported.

Databricks will retire the model in three months. During this three-month period, customers can either:

  • Choose to migrate to a Foundation Model APIs provisioned throughput endpoint to continue using the model past its end-of-life date.
  • Migrate existing workflows to use recommended replacement models.

The model is no longer available for use and removed from the product. Applicable documentation is updated to recommend using a replacement model.

Foundation Model APIs provisioned throughput

The following table summarizes the retirement policy for Foundation Model APIs provisioned throughput.

Retirement notification

Transition to retirement

On the retirement date

Databricks takes the following steps to notify customers about a model that is set for retirement:

  • For endpoints that serve a deprecated model, a warning message appears on that serving endpoint's details page in your Databricks workspace. This message indicates that the model is planned for retirement and the applicable retirement date.
  • A tooltip message provides recommended alternate models for workload migration.
  • The applicable documentation contains a notice that indicates the model is planned for retirement and the start date it will no longer be supported.

Databricks will retire the model in six months. During this six-month period:

  • Customers can continue running existing provisioned throughput endpoints using the deprecated model until the retirement date.
  • Customers that are not actively using a deprecated model cannot create new provisioned throughput endpoints or restart stopped endpoints for a deprecated model.

The model is no longer available for use and removed from the product.

  • All endpoints using the retired model are transitioned to a failed state with a descriptive message. Any requests to these endpoints will fail.
  • The customer can delete endpoints that use the retired model, but cannot restart them.
  • Applicable documentation is updated to recommend using a replacement model.

Foundation Model Fine-tuning

The following table summarizes the retirement policy for Foundation Model Fine-tuning.

Retirement notification

Transition to retirement

On the retirement date

Databricks takes the following steps to notify customers about a model that is set for retirement:

  • In the Experiments tab, a warning message appears in the dropdown menu for Foundation Model Fine-tuning that indicates that the model is planned for retirement.
  • The applicable documentation contains a notice that indicates the model is planned for retirement and the start date it will no longer be supported.

Databricks retires the model in three months. During this three-month period, customers can migrate existing workflows to use recommended replacement models.

The model is no longer available for use and removed from the product. Applicable documentation is updated to recommend using a replacement model.

Model updates

Databricks might ship incremental updates to pay-per-token models to deliver optimizations. When a model is updated, the endpoint URL remains the same, but the model ID in the response object changes to reflect the date of the update. For example, if an update is shipped to meta-llama/Meta-Llama-3.3-70B on 3/4/2024, the model name in the response object updates to meta-llama/Meta-Llama-3.3-70B-030424. Databricks maintains a version history of the updates that you can refer to.

Retired models

The following sections summarize current and upcoming model retirements for the Foundation Model Fine-tuning, Foundation Model APIs pay-per-token, and Foundation Model APIs provisioned throughput offerings.

Foundation Model APIs pay-per-token retirements

The following table shows model retirements, their retirement dates, and recommended replacement models to use for Foundation Model APIs pay-per-token serving workloads. Databricks recommends that you migrate your applications to use replacement models before the indicated retirement date.

important

On December 11, 2024, Meta-Llama-3.3-70B-Instruct replaced support for Meta-Llama-3.1-70B-Instruct in Foundation Model APIs pay-per-token endpoints.

Model

Retirement date

Recommended replacement model

Meta-Llama-3.1-405B

March 31, 2026

OpenAI GPT OSS 120B

DBRX Instruct

April 30, 2025

Meta-Llama-4-Maverick

Mixtral-8x7B Instruct

April 30, 2025

Meta-Llama-4-Maverick

Meta-Llama-3.1-70B-Instruct

December 11, 2024

Meta-Llama-4-Maverick

Meta-Llama-3-70B-Instruct

July 23, 2024

Meta-Llama-4-Maverick

Meta-Llama-2-70B-Chat

October 30, 2024

Meta-Llama-4-Maverick

MPT 7B Instruct

August 30, 2024

Meta-Llama-4-Maverick

MPT 30B Instruct

August 30, 2024

Meta-Llama-4-Maverick

If you require long-term support for a specific model version, Databricks recommends using Foundation Model APIs provisioned throughput for your serving workloads.

Foundation Model APIs provisioned throughput retirements

The following table shows model family retirements, their retirement dates, and recommended replacement models to use for Foundation Model APIs provisioned throughput serving workloads. Databricks recommends that you migrate your applications to use replacement models before the indicated retirement date.

Model family

Retirement date

Recommended replacement model

Meta Llama 3.1 405B

March 31, 2026

OpenAI GPT OSS 120B

Meta Llama 3 70B

February 15, 2026

Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size.

Meta Llama 3 8B

February 15, 2026

Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size.

Meta Llama 2 70B

February 15, 2026

Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size.

Meta Llama 2 13B

February 15, 2026

Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size.

Meta Llama 2 7B

February 15, 2026

Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size.

DBRX

February 15, 2026

Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size.

Mistral 8x7B

February 15, 2026

Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size.

Mixtral 7B

February 15, 2026

Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size.

MPT 30B

February 15, 2026

Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size.

MPT 7B

February 15, 2026

Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size.

Foundation Model Fine-tuning retirements

The following table shows retired model families, their retirement dates, and recommended replacement model families to use for Foundation Model Fine-tuning workloads. Databricks recommends that you migrate your applications to use replacement models before the indicated retirement date.

Model family

Retirement date

Recommended replacement model family

DBRX

April 30, 2025

Llama-3.1-70B

Mixtral

April 30, 2025

Llama-3.1-70B

Mistral

April 30, 2025

Llama-3.1-8B

Meta-Llama-3.1-405B

January 30, 2025

Llama-3.1-70B

Meta-Llama-3

January 7, 2025

Meta-Llama-3.1

Meta-Llama-2

January 7, 2025

Meta-Llama-3.1

Code Llama

January 7, 2025

Meta-Llama-3.1