Tracing a GenAI App (IDE)
This quickstart helps you integrate your GenAI app with MLflow Tracing if you use a local IDE as your development environment. If you use a Databricks Notebook, please use the Databricks Notebook quickstart instead.
What you'll achieve
By the end of this tutorial, you will have:
- An MLflow Experiment for your GenAI app
- Your local development environment connected to MLflow
- A simple GenAI application instrumented with MLflow Tracing
- A trace from that app in your MLflow Experiment
Prerequisites
- Databricks Workspace: Access to a Databricks workspace.
Step 1: Install MLflow
When working in your local IDE, you need to install MLflow with Databricks connectivity.
```bash
pip install --upgrade "mlflow[databricks]>=3.1" openai
```
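Optionally, you can confirm the installation from Python; any MLflow version from 3.1 up works for this quickstart:

```python
import mlflow

# Should print 3.1.0 or later
print(mlflow.__version__)
```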
Step 2: Create a new MLflow Experiment
An MLflow Experiment is the container for your GenAI application. Learn more about the Experiment and what it contains in the data model section.
- Open your Databricks workspace
- Go to Experiments in the left sidebar under Machine Learning
- At the top of the Experiments page, click on New GenAI Experiment
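Alternatively, you can create the experiment from code instead of the UI; mlflow.set_experiment creates the experiment if it does not already exist. A minimal sketch — the experiment path below is only an example, and authentication is covered in the next step:

```python
import mlflow

# Point MLflow at your Databricks workspace (authentication is set up in Step 3)
mlflow.set_tracking_uri("databricks")

# Creates the experiment if it doesn't exist, then sets it as the active one;
# "/Shared/docs-demo" is an example path - use any workspace path you like
mlflow.set_experiment("/Shared/docs-demo")
```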
Step 3: Connect your environment to MLflow
This quickstart describes using a Databricks Personal Access Token. MLflow also works with the other Databricks-supported authentication methods.
Option 1: Use environment variables

1. Click Generate API Key.
2. Copy and run the generated code in your terminal:

```bash
export DATABRICKS_TOKEN=<databricks-personal-access-token>
export DATABRICKS_HOST=https://<workspace-name>.cloud.databricks.com
export MLFLOW_TRACKING_URI=databricks
export MLFLOW_EXPERIMENT_ID=<experiment-id>
```

Option 2: Use a .env file

1. Click Generate API Key.
2. Copy the generated code to a .env file in your project root:

```bash
DATABRICKS_TOKEN=<databricks-personal-access-token>
DATABRICKS_HOST=https://<workspace-name>.cloud.databricks.com
MLFLOW_TRACKING_URI=databricks
MLFLOW_EXPERIMENT_ID=<experiment-id>
```

3. Install the python-dotenv package:

```bash
pip install python-dotenv
```

4. Load the environment variables at the beginning of your code:

```python
# At the beginning of your Python script
from dotenv import load_dotenv

# Load environment variables from the .env file
load_dotenv()
```
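Before moving on, you can verify the connection with a small sanity check; assuming the variables above are set, this should print the name of the experiment you created in Step 2:

```python
import os

import mlflow

# If you used a .env file, load it first (requires python-dotenv):
# from dotenv import load_dotenv; load_dotenv()

mlflow.set_tracking_uri(os.environ["MLFLOW_TRACKING_URI"])

# Look up the experiment by the ID you configured above
experiment = mlflow.get_experiment(os.environ["MLFLOW_EXPERIMENT_ID"])
print(f"Connected to experiment: {experiment.name}")
```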
Step 4: Create and instrument your application
Databricks provides out-of-the-box access to popular frontier and open source foundation LLMs. To run this quickstart, you can:
- Use the Databricks hosted LLMs
- Directly use your own API key from an LLM provider
- Create an external model to enable governed access to your LLM provider's API keys
The example below uses the OpenAI SDK to connect to a Databricks-hosted LLM. If you want to use your own OpenAI key, update the client = OpenAI(...) line.
If you prefer to use one of the other 20+ LLM SDKs (Anthropic, Bedrock, etc.) or GenAI authoring frameworks (LangGraph, etc.) that MLflow supports, follow the instructions in the MLflow Experiment UI from the previous step.
1. Create a Python file named app.py in your project directory.

2. Initialize an OpenAI client to connect to either Databricks-hosted LLMs or LLMs hosted by OpenAI.

Option 1: Databricks-hosted LLMs

Use MLflow to get an OpenAI client that connects to Databricks-hosted LLMs. Select a model from the available foundation models.

```python
import mlflow
from databricks.sdk import WorkspaceClient

# Enable MLflow's autologging to instrument your application with Tracing
mlflow.openai.autolog()

# Set up MLflow tracking to Databricks
mlflow.set_tracking_uri("databricks")
mlflow.set_experiment("/Shared/docs-demo")

# Create an OpenAI client that is connected to Databricks-hosted LLMs
w = WorkspaceClient()
client = w.serving_endpoints.get_open_ai_client()

# Select an LLM
model_name = "databricks-claude-sonnet-4"
```

Option 2: OpenAI-hosted LLMs

Use the native OpenAI SDK to connect to OpenAI-hosted models. Select a model from the available OpenAI models.

```python
import os

import mlflow
import openai

# Ensure your OPENAI_API_KEY is set in your environment
# os.environ["OPENAI_API_KEY"] = "<YOUR_API_KEY>"  # Uncomment and set if not globally configured

# Enable auto-tracing for OpenAI
mlflow.openai.autolog()

# Set up MLflow tracking to Databricks
mlflow.set_tracking_uri("databricks")
mlflow.set_experiment("/Shared/docs-demo")

# Create an OpenAI client connected to OpenAI
client = openai.OpenAI()

# Select an LLM
model_name = "gpt-4o-mini"
```
3. Define and run your application. Use the @mlflow.trace decorator, which makes it easy to trace any Python application, combined with the OpenAI automatic instrumentation to capture the details of the call to the OpenAI SDK.

```python
# Use the trace decorator to capture the application's entry point
@mlflow.trace
def my_app(input: str):
    # This call is automatically instrumented by `mlflow.openai.autolog()`
    response = client.chat.completions.create(
        # This example uses a Databricks-hosted LLM - you can replace this with
        # any AI Gateway or Model Serving endpoint. If you provide your own OpenAI
        # credentials, replace it with a valid OpenAI model, e.g., gpt-4o.
        model=model_name,
        messages=[
            {
                "role": "system",
                "content": "You are a helpful assistant.",
            },
            {
                "role": "user",
                "content": input,
            },
        ],
    )
    return response.choices[0].message.content


result = my_app(input="What is MLflow?")
print(result)
```
4. Run the application:

```bash
python app.py
```
Step 5: View the Trace in MLflow
- Navigate back to the MLflow Experiment UI
- You will now see the generated trace in the Traces tab
- Click on the trace to view its details
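You can also list traces programmatically; a minimal sketch using mlflow.search_traces, assuming the environment from Step 3 is configured:

```python
import mlflow

mlflow.set_tracking_uri("databricks")

# Returns a pandas DataFrame with one row per trace in the experiment;
# replace the placeholder with your experiment ID from Step 3
traces = mlflow.search_traces(experiment_ids=["<experiment-id>"])
print(traces.head())
```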
Understanding the Trace
The trace you've just created shows:
- Root Span: Represents the inputs to the my_app(...) function
- Child Span: Represents the OpenAI completion request
- Attributes: Contains metadata like model name, token counts, and timing information
- Inputs: The messages sent to the model
- Outputs: The response received from the model
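To see this structure in code, you can fetch a single trace and walk its spans; a minimal sketch using mlflow.get_trace (the trace ID placeholder comes from the Traces tab):

```python
import mlflow

# Copy a trace ID from the Traces tab in the MLflow UI
trace = mlflow.get_trace("<trace-id>")

# Walk the span tree: the root span is my_app,
# with the OpenAI completion call nested under it
for span in trace.data.spans:
    print(span.name, span.span_type)
```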
This simple trace already provides valuable insights into your application's behavior, such as:
- What was asked
- What response was generated
- How long the request took
- How many tokens were used (affecting cost)
For more complex applications like RAG systems or multi-step agents, MLflow Tracing provides even more value by revealing the inner workings of each component and step.
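For instance, decorating each step produces nested spans under a single trace; a minimal sketch with a hypothetical retriever (a real RAG app would query a vector store):

```python
import mlflow

@mlflow.trace
def retrieve_docs(query: str) -> list[str]:
    # Hypothetical stand-in; a real retriever would query a vector store
    return ["MLflow Tracing captures the execution of GenAI applications."]

@mlflow.trace
def answer_question(question: str) -> str:
    docs = retrieve_docs(question)  # recorded as a child span of answer_question
    # A real app would pass the retrieved context to an LLM here
    return f"Based on {len(docs)} document(s): {docs[0]}"

# Produces one trace with answer_question as the root span
# and retrieve_docs nested beneath it
print(answer_question("What does MLflow Tracing capture?"))
```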
Next steps
Continue your journey with these recommended actions and tutorials.
- Evaluate your app's quality - Measure and improve your GenAI app's quality with MLflow's evaluation capabilities
- Collect human feedback - Learn how to collect feedback from users and domain experts
- Track users & sessions - Add user and conversation context to your traces
Reference guides
Explore detailed documentation for concepts and features mentioned in this guide.
- Tracing concepts - Understand the fundamentals of MLflow Tracing
- Tracing data model - Learn about traces, spans, and how MLflow structures observability data
- Manual tracing APIs - Explore advanced tracing techniques for custom instrumentation