Log and register AI agents

Log AI agents using Mosaic AI Agent Framework. Logging an agent is the basis of the development process: it captures a point-in-time snapshot of the agent's code and configuration so you can evaluate the quality of that configuration.

Requirements

Create an AI agent before logging it.

Databricks recommends installing the latest version of the databricks-sdk.

Python
%pip install -U databricks-sdk

Code-based logging

Databricks recommends using MLflow's Models from Code functionality when logging agents.

In this approach, the agent's code is captured as a Python file, and the Python environment is captured as a list of packages. When the agent is deployed, the Python environment is restored, and the agent's code is executed to load the agent into memory so it can be invoked when the endpoint is called.

You can couple this approach with the use of pre-deployment validation APIs like mlflow.models.predict() to ensure that the agent runs reliably when deployed for serving.
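As a minimal sketch of such a pre-deployment check, the following assumes an agent has already been logged and that its URI is available to pass as `model_uri`. The helper names `chat_request` and `validate_before_deploy` are hypothetical and not part of MLflow. MLflow is imported inside the function so that the payload helper can be used without it.

```python
def chat_request(question: str) -> dict:
    """Build a chat-style request payload like the input examples on this page."""
    return {"messages": [{"role": "user", "content": question}]}


def validate_before_deploy(model_uri: str, request: dict) -> None:
    """Run one prediction against the logged agent in a restored environment.

    Raises if the agent fails to load or to handle the request.
    """
    import mlflow  # lazy import; requires MLflow to be installed

    mlflow.models.predict(model_uri=model_uri, input_data=request)


# Hypothetical usage, after the agent has been logged:
# validate_before_deploy(logged_agent_info.model_uri, chat_request("What is RAG?"))
```

If this call fails, the same problem would likely surface when the serving endpoint loads the agent, so it is usually cheaper to catch it here.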

To see an example of code-based logging, see ChatAgent authoring example notebooks.

Infer Model Signature during logging

note

Databricks recommends authoring an agent using the ChatAgent interface. If using ChatAgent, you can skip this section; MLflow automatically infers a valid signature for your agent.

If not using the ChatAgent interface, you must use one of the following methods to specify your agent's MLflow Model Signature at logging time:

Manually define the signature
Use MLflow's Model Signature inferencing capabilities to automatically generate the agent's signature based on an input example you provide. This approach is more convenient than manually defining the signature.

The MLflow model signature validates inputs and outputs to ensure the agent interacts correctly with downstream tools like AI Playground and the review app. It also guides other applications on how to use the agent effectively.

The LangChain and PyFunc examples below use Model Signature inferencing.

If you would rather explicitly define a Model Signature yourself at logging time, see MLflow docs - How to log models with signatures.
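For illustration, signature inference can also be run on its own with `mlflow.models.infer_signature()`. This sketch assumes a recent MLflow version that supports nested inputs and assumes the agent returns a plain string. `INPUT_EXAMPLE`, `OUTPUT_EXAMPLE`, and `infer_agent_signature` are hypothetical names chosen for this example.

```python
# Example request in the chat format used on this page
INPUT_EXAMPLE = {
    "messages": [
        {"role": "user", "content": "What is Retrieval-augmented Generation?"}
    ]
}

# Hypothetical response: this sketch assumes the agent returns a plain string
OUTPUT_EXAMPLE = "Retrieval-augmented generation combines retrieval with text generation."


def infer_agent_signature(input_example=INPUT_EXAMPLE, output_example=OUTPUT_EXAMPLE):
    """Infer a ModelSignature from one example input and one example output."""
    from mlflow.models import infer_signature  # lazy import; requires MLflow

    return infer_signature(model_input=input_example, model_output=output_example)
```

The returned signature could then be passed to `log_model()` through its `signature` parameter instead of relying on inference at logging time.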

Code-based logging with LangChain

The following instructions and code sample show you how to log an agent with LangChain.

Create a notebook or Python file with your code. For this example, the notebook or file is named agent.py. The notebook or file must contain a LangChain agent, referred to here as lc_agent.
Include mlflow.models.set_model(lc_agent) in the notebook or file.
Create a new notebook to serve as the driver notebook (called driver.py in this example).
In the driver notebook, use the following code to run agent.py and log the results to an MLflow model:
Python
```
mlflow.langchain.log_model(lc_model="/path/to/agent.py", resources=list_of_databricks_resources)
```
The resources parameter declares Databricks-managed resources needed to serve the agent, such as a vector search index or serving endpoint that serves a foundation model. For more information, see Authentication for Databricks resources.
Deploy the model. See Deploy an agent for generative AI applications.
When the serving environment is loaded, agent.py is executed.
When a serving request comes in, lc_agent.invoke(...) is called.

Python

import mlflow

code_path = "/Workspace/Users/first.last/agent.py"
config_path = "/Workspace/Users/first.last/config.yml"

# Input example used by MLflow to infer Model Signature
input_example = {
  "messages": [
    {
      "role": "user",
      "content": "What is Retrieval-augmented Generation?",
    }
  ]
}

# example using langchain
with mlflow.start_run():
  logged_agent_info = mlflow.langchain.log_model(
    lc_model=code_path,
    model_config=config_path, # If you specify this parameter, this configuration is used by agent code. The development_config is overwritten.
    artifact_path="agent", # This string is used as the path inside the MLflow model where artifacts are stored
    input_example=input_example, # Must be a valid input to the agent
    example_no_conversion=True, # Required
  )

print(f"MLflow Run: {logged_agent_info.run_id}")
print(f"Model URI: {logged_agent_info.model_uri}")

# To verify that the model has been logged correctly, load the agent and call `invoke`:
model = mlflow.langchain.load_model(logged_agent_info.model_uri)
model.invoke(input_example)

Code-based logging with PyFunc

The following instructions and code sample show you how to log an agent with PyFunc.

Create a notebook or Python file with your code. For this example, the notebook or file is named agent.py. The notebook or file must contain a PyFunc class, named PyFuncClass.
Include mlflow.models.set_model(PyFuncClass) in the notebook or file.
Create a new notebook to serve as the driver notebook (called driver.py in this example).
In the driver notebook, use the following code to run agent.py and use log_model() to log the results to an MLflow model:
Python
```
mlflow.pyfunc.log_model(python_model="/path/to/agent.py", resources=list_of_databricks_resources)
```
The resources parameter declares Databricks-managed resources needed to serve the agent, such as a vector search index or serving endpoint that serves a foundation model. For more information, see Authentication for Databricks resources.
Deploy the model. See Deploy an agent for generative AI applications.
When the serving environment is loaded, agent.py is executed.
When a serving request comes in, PyFuncClass.predict(...) is called.

Python
import mlflow
from mlflow.models.resources import (
    DatabricksServingEndpoint,
    DatabricksVectorSearchIndex,
)

code_path = "/Workspace/Users/first.last/agent.py"
config_path = "/Workspace/Users/first.last/config.yml"

# Input example used by MLflow to infer Model Signature
input_example = {
  "messages": [
    {
      "role": "user",
      "content": "What is Retrieval-augmented Generation?",
    }
  ]
}

with mlflow.start_run():
  logged_agent_info = mlflow.pyfunc.log_model(
    python_model=code_path,  # Path to agent.py, which calls mlflow.models.set_model()
    artifact_path="agent",
    input_example=input_example,
    example_no_conversion=True,
    resources=[
      DatabricksServingEndpoint(endpoint_name="databricks-meta-llama-3-3-70b-instruct"),
      DatabricksVectorSearchIndex(index_name="prod.agents.databricks_docs_index"),
    ]
  )

print(f"MLflow Run: {logged_agent_info.run_id}")
print(f"Model URI: {logged_agent_info.model_uri}")

# To verify that the model has been logged correctly, load the agent and call `predict`:
model = mlflow.pyfunc.load_model(logged_agent_info.model_uri)
model.predict(input_example)

Authentication for Databricks resources

AI agents often must authenticate to other resources to complete tasks. For example, an agent may need to access a Vector Search index to query unstructured data.

As described in Authentication for dependent resources, Model Serving supports authenticating to both Databricks-managed and external resources when you deploy the agent.

Model Serving supports two kinds of authentication for Databricks-managed resources:

System authentication: Allows the agent service principal to access any dependent resources specified at agent logging time. This is useful for accessing shared or non-sensitive resources, for example a vector search index containing public documentation.
[Beta] On-behalf-of-user authentication: Allows the agent to use end user credentials to access Databricks resources. This is useful for scenarios where your agent needs to access sensitive data or query remote APIs to take actions on a per-user basis.

Specify resources for automatic authentication passthrough (system authentication)

For the most common Databricks resource types, Databricks supports and recommends declaring the agent's resource dependencies upfront during logging. This enables automatic authentication passthrough when you deploy the agent: Databricks automatically provisions, rotates, and manages short-lived credentials to securely access these resource dependencies from within the agent endpoint.

To enable automatic authentication passthrough, specify dependent resources using the resources parameter of the log_model() API, as shown in the following code.

Python
import mlflow
from mlflow.models.resources import (
  DatabricksVectorSearchIndex,
  DatabricksServingEndpoint,
  DatabricksSQLWarehouse,
  DatabricksFunction,
  DatabricksGenieSpace,
  DatabricksTable,
  DatabricksUCConnection
)

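# Placeholders reused from the earlier examples on this page
agent_notebook_path = "/Workspace/Users/first.last/agent.py"
input_example = {"messages": [{"role": "user", "content": "What is Retrieval-augmented Generation?"}]}

with mlflow.start_run():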
  logged_agent_info = mlflow.pyfunc.log_model(
    python_model=agent_notebook_path,
    artifact_path="agent",
    input_example=input_example,
    example_no_conversion=True,
    # Specify resources for automatic authentication passthrough
    resources=[
      DatabricksVectorSearchIndex(index_name="prod.agents.databricks_docs_index"),
      DatabricksServingEndpoint(endpoint_name="databricks-meta-llama-3-3-70b-instruct"),
      DatabricksServingEndpoint(endpoint_name="databricks-bge-large-en"),
      DatabricksSQLWarehouse(warehouse_id="your_warehouse_id"),
      DatabricksFunction(function_name="ml.tools.python_exec"),
      DatabricksGenieSpace(genie_space_id="your_genie_space_id"),
      DatabricksTable(table_name="your_table_name"),
      DatabricksUCConnection(connection_name="your_connection_name"),
    ]
  )

Databricks recommends you manually specify resources for all agent flavors.

note

If you do not specify resources when logging LangChain agents using mlflow.langchain.log_model(...), MLflow performs best-effort automatic inference of resources. However, this may not capture all dependencies, resulting in authorization errors when serving or querying the agent.

The following table lists the Databricks resources that support automatic authentication passthrough and the minimum mlflow version required to log the resource.

Resource type	Minimum `mlflow` version required to log the resource
Vector Search index	Requires `mlflow` 2.13.1 or above
Model serving endpoint	Requires `mlflow` 2.13.1 or above
SQL warehouse	Requires `mlflow` 2.16.1 or above
Unity Catalog function	Requires `mlflow` 2.16.1 or above
Genie space	Requires `mlflow` 2.17.1 or above
Unity Catalog table	Requires `mlflow` 2.18.0 or above
Unity Catalog connection	Requires `mlflow` 2.17.1 or above
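The version requirements in the table can be checked programmatically before logging. The following is a minimal sketch that uses only the standard library. The names `MIN_MLFLOW_VERSION`, `parse_version`, and `loggable_resources` are hypothetical, and the parser is a simplification that treats a pre-release such as `2.18.0rc1` as `2.18.0`.

```python
import re

# Minimum MLflow versions, copied from the table above
MIN_MLFLOW_VERSION = {
    "Vector Search index": (2, 13, 1),
    "Model serving endpoint": (2, 13, 1),
    "SQL warehouse": (2, 16, 1),
    "Unity Catalog function": (2, 16, 1),
    "Genie space": (2, 17, 1),
    "Unity Catalog table": (2, 18, 0),
    "Unity Catalog connection": (2, 17, 1),
}


def parse_version(version: str) -> tuple:
    """Turn '2.17.1' or '2.18.0rc1' into a comparable (major, minor, patch) tuple."""
    parts = []
    for piece in version.split(".")[:3]:
        match = re.match(r"\d+", piece)
        parts.append(int(match.group()) if match else 0)
    parts += [0] * (3 - len(parts))  # pad short versions such as '2.18'
    return tuple(parts)


def loggable_resources(mlflow_version: str) -> list:
    """Return the resource types whose minimum version is met by mlflow_version."""
    installed = parse_version(mlflow_version)
    return [name for name, minimum in MIN_MLFLOW_VERSION.items() if installed >= minimum]
```

In a notebook, you might call `loggable_resources(mlflow.__version__)` to see which resource types the installed version can log.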

On-behalf-of-user authentication

Beta

This feature is in Beta.

When logging an agent that uses on-behalf-of-user authentication, specify the minimum set of Databricks API scopes needed to perform actions as the end user within agent code. This ensures that the agent has least-privilege access to perform actions on behalf of the end user when deployed, enhancing security by preventing unauthorized actions and minimizing the risk of token misuse.

Below is a list of scopes required to access several common types of Databricks resources.

Databricks Resource	API Scopes Required
Vector Search Index	`serving.serving-endpoints`,`vectorsearch.vector-search-endpoints`,`vectorsearch.vector-search-indexes`
Model Serving Endpoint	`serving.serving-endpoints`
SQL Warehouse	`sql.statement-execution`,`sql.warehouses`
UC Connections	`catalog.connections`
Genie Space	`dashboards.genie`
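One way to keep the scope list minimal is to derive it from the resource types the agent actually uses. The sketch below encodes the table above as a dictionary and merges the scopes without duplicates. The names `API_SCOPES` and `required_scopes` are hypothetical helpers, not part of MLflow or the Databricks SDK.

```python
# API scopes per resource type, copied from the table above
API_SCOPES = {
    "Vector Search Index": [
        "serving.serving-endpoints",
        "vectorsearch.vector-search-endpoints",
        "vectorsearch.vector-search-indexes",
    ],
    "Model Serving Endpoint": ["serving.serving-endpoints"],
    "SQL Warehouse": ["sql.statement-execution", "sql.warehouses"],
    "UC Connections": ["catalog.connections"],
    "Genie Space": ["dashboards.genie"],
}


def required_scopes(resource_types: list) -> list:
    """Return the de-duplicated scopes needed for the given resource types.

    Scopes are returned in first-seen order. Raises KeyError for an unknown type.
    """
    scopes = []
    for resource_type in resource_types:
        for scope in API_SCOPES[resource_type]:
            if scope not in scopes:
                scopes.append(scope)
    return scopes
```

The resulting list is the kind of value that would be passed as `api_scopes` when constructing a `UserAuthPolicy`.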

To enable on-behalf-of-user authentication, pass an MLflow AuthPolicy to log_model(), as shown in the example below. An MLflow AuthPolicy has two components:

system_auth_policy: Specify resources for system authentication. Typically, agents use system authentication for shared resources (for example, to query model serving endpoints) in combination with on-behalf-of-user authentication for accessing sensitive resources or APIs.
user_auth_policy: Specify the API scopes your agent needs for on-behalf-of-user authentication.

Python
from mlflow.models.resources import DatabricksServingEndpoint
from mlflow.models.auth_policy import SystemAuthPolicy, UserAuthPolicy, AuthPolicy

resources = [
    DatabricksServingEndpoint(endpoint_name="databricks-meta-llama-3-3-70b-instruct")
]
# Specify resources here for system authentication
system_auth_policy = SystemAuthPolicy(resources=resources)

# Specify the minimal set of API scopes needed for on-behalf-of-user authentication.
# When deployed, the agent can access Databricks resources and APIs
# on behalf of the end user, but only via REST APIs that are covered by the list of
# scopes below.

user_auth_policy = UserAuthPolicy(
    api_scopes=[
        "serving.serving-endpoints",
        "vectorsearch.vector-search-endpoints",
        "vectorsearch.vector-search-indexes",
    ]
)

with mlflow.start_run():
    logged_agent_info = mlflow.pyfunc.log_model(
        ...
        # Instead of passing `resources` (which only supports system authentication),
        # pass an auth_policy to log_model to enable both system authentication and
        # on-behalf-of-user authentication
        auth_policy=AuthPolicy(
            system_auth_policy=system_auth_policy,
            user_auth_policy=user_auth_policy
        )
    )

Automatic authentication for OpenAI clients

If your agent uses the OpenAI client, use the Databricks SDK to authenticate automatically during deployment. The Databricks SDK provides get_open_ai_client(), a wrapper that constructs an OpenAI client with authorization configured automatically. Run the following in your notebook:

Python
%pip install databricks-sdk[openai]

Python
from databricks.sdk import WorkspaceClient

# Intended as a method on your agent class (note the `self` parameter)
def openai_client(self):
  w = WorkspaceClient()
  return w.serving_endpoints.get_open_ai_client()

Then, specify the Model Serving endpoint as part of resources to authenticate automatically at deployment time.
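As a rough sketch of how the returned client might be used, the following sends one chat request to a serving endpoint through the standard OpenAI chat completions call. The helper names `build_messages` and `ask_endpoint` and the default system prompt are assumptions made for this example. The Databricks SDK is imported inside the function, so it is only needed when the endpoint is actually called.

```python
def build_messages(question: str, system_prompt: str = "You are a helpful assistant.") -> list:
    """Build an OpenAI-style message list with a system prompt and one user turn."""
    return [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": question},
    ]


def ask_endpoint(question: str, endpoint_name: str) -> str:
    """Send one chat request to a Model Serving endpoint and return the reply text."""
    from databricks.sdk import WorkspaceClient  # lazy import; requires databricks-sdk[openai]

    client = WorkspaceClient().serving_endpoints.get_open_ai_client()
    response = client.chat.completions.create(
        model=endpoint_name,  # the serving endpoint name is passed as the model
        messages=build_messages(question),
    )
    return response.choices[0].message.content


# Hypothetical usage:
# ask_endpoint("What is RAG?", "databricks-meta-llama-3-3-70b-instruct")
```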

Register the agent to Unity Catalog

Before you deploy the agent, you must register it to Unity Catalog. Registering the agent packages it as a model in Unity Catalog. As a result, you can use Unity Catalog permissions to authorize access to resources in the agent.

Python
import mlflow

mlflow.set_registry_uri("databricks-uc")

catalog_name = "test_catalog"
schema_name = "schema"
model_name = "agent_name"

# Unity Catalog models use a three-level name: catalog.schema.model
full_model_name = f"{catalog_name}.{schema_name}.{model_name}"
uc_model_info = mlflow.register_model(model_uri=logged_agent_info.model_uri, name=full_model_name)

See mlflow.register_model().
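The registration step can be wrapped in a small helper that builds the three-level name and rejects malformed parts before calling MLflow. This is a sketch only: `uc_model_name` and `register_agent` are hypothetical names, and the check that each part is non-empty and contains no period is a simplifying assumption, not the full set of Unity Catalog naming rules.

```python
def uc_model_name(catalog: str, schema: str, model: str) -> str:
    """Join catalog, schema, and model into a three-level Unity Catalog name."""
    parts = [catalog, schema, model]
    for part in parts:
        if not part or "." in part:
            raise ValueError(f"Invalid Unity Catalog name part: {part!r}")
    return ".".join(parts)


def register_agent(model_uri: str, catalog: str, schema: str, model: str):
    """Register a logged agent to Unity Catalog and return the model version info."""
    import mlflow  # lazy import; requires MLflow

    mlflow.set_registry_uri("databricks-uc")
    return mlflow.register_model(
        model_uri=model_uri,
        name=uc_model_name(catalog, schema, model),
    )


# Hypothetical usage, after the agent has been logged:
# register_agent(logged_agent_info.model_uri, "test_catalog", "schema", "agent_name")
```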
