Generative AI models maintenance policy
This article describes the model maintenance policy for the Foundation Model APIs pay-per-token and Foundation Model APIs provisioned throughput offerings.
In order to continue supporting the most state-of-the-art models, Databricks might update supported models or retire older models for these offerings.
Model retirement policy
The following sections summarize the retirement policy for the indicated feature offerings.
The retirement policies that apply to the Foundation Model APIs pay-per-token and Foundation Model Fine-tuning offerings only impact supported chat and completion models.
Foundation Model APIs pay-per-token
The following table summarizes the retirement policy for Foundation Model APIs pay-per-token.
Retirement notification | Transition to retirement | On the retirement date |
|---|---|---|
Databricks takes the following steps to notify customers about a model that is set for retirement:
| Databricks will retire the model in three months. During this three-month period, customers can either:
| The model is no longer available for use and removed from the product. Applicable documentation is updated to recommend using a replacement model. |
Foundation Model APIs provisioned throughput
The following table summarizes the retirement policy for Foundation Model APIs provisioned throughput.
Retirement notification | Transition to retirement | On the retirement date |
|---|---|---|
Databricks takes the following steps to notify customers about a model that is set for retirement:
| Databricks will retire the model in six months. During this six-month period:
| The model is no longer available for use and removed from the product.
|
Model updates
Databricks might ship incremental updates to pay-per-token models to deliver optimizations. When a model is updated, the endpoint URL remains the same, but the model ID in the response object changes to reflect the date of the update. For example, if an update is shipped to meta-llama/Meta-Llama-3.3-70B on 3/4/2024, the model name in the response object updates to meta-llama/Meta-Llama-3.3-70B-030424. Databricks maintains a version history of the updates that you can refer to.
Retired models
The following sections summarize current and upcoming model retirements for the Foundation Model Fine-tuning, Foundation Model APIs pay-per-token, and Foundation Model APIs provisioned throughput offerings.
Foundation Model APIs pay-per-token retirements
The following table shows model retirements, their retirement dates, and recommended replacement models to use for Foundation Model APIs pay-per-token serving workloads. Databricks recommends that you migrate your applications to use replacement models before the indicated retirement date.
On December 11, 2024, Meta-Llama-3.3-70B-Instruct replaced support for Meta-Llama-3.1-70B-Instruct in Foundation Model APIs pay-per-token endpoints.
Model | Retirement date | Recommended replacement model |
|---|---|---|
Meta Llama 3.1 405B | February 15, 2026 | OpenAI GPT OSS 120B |
Anthropic Claude 3.7 Sonnet | March 10, 2026 | Anthropic Claude Sonnet 4.5 |
DBRX Instruct | April 30, 2025 | Meta-Llama-4-Maverick |
Mixtral-8x7B Instruct | April 30, 2025 | Meta-Llama-4-Maverick |
Meta-Llama-3.1-70B-Instruct | December 11, 2024 | Meta-Llama-4-Maverick |
Meta-Llama-3-70B-Instruct | July 23, 2024 | Meta-Llama-4-Maverick |
Meta-Llama-2-70B-Chat | October 30, 2024 | Meta-Llama-4-Maverick |
MPT 7B Instruct | August 30, 2024 | Meta-Llama-4-Maverick |
MPT 30B Instruct | August 30, 2024 | Meta-Llama-4-Maverick |
If you require long-term support for a specific model version, Databricks recommends using Foundation Model APIs provisioned throughput for your serving workloads.
Foundation Model APIs provisioned throughput retirements
The following table shows model family retirements, their retirement dates, and recommended replacement models to use for Foundation Model APIs provisioned throughput serving workloads. Databricks recommends that you migrate your applications to use replacement models before the indicated retirement date.
Model family | Retirement date | Recommended replacement model |
|---|---|---|
Meta Llama 3.1 405B | May 15, 2026 | OpenAI GPT OSS 120B |
Meta Llama 3 70B | February 15, 2026 | Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size. |
Meta Llama 3 8B | February 15, 2026 | Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size. |
Meta Llama 2 70B | February 15, 2026 | Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size. |
Meta Llama 2 13B | February 15, 2026 | Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size. |
Meta Llama 2 7B | February 15, 2026 | Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size. |
DBRX | February 15, 2026 | Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size. |
Mistral 8x7B | February 15, 2026 | Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size. |
Mixtral 7B | February 15, 2026 | Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size. |
MPT 30B | February 15, 2026 | Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size. |
MPT 7B | February 15, 2026 | Comparable model on the same offering, like Llama 3.2, 3.3, or 4 model of similar size. |