
Add context to traces

Adding context to traces enables you to track execution details, analyze user behavior, debug issues across environments, and monitor application performance. MLflow provides standardized metadata fields for common context types plus the flexibility to add custom metadata specific to your application.

Requirements

Install the appropriate package for tracing based on your environment:

For production deployments, install the mlflow-tracing package:

Bash
pip install --upgrade mlflow-tracing

The mlflow-tracing package is optimized for production use, with a smaller dependency footprint and better performance characteristics than the full mlflow package.

note

MLflow 3 is required for context tracking. MLflow 2.x is not supported due to performance limitations and missing features essential for production use.

Implementation

To add metadata and tags to traces:

  1. Trace your application. Most commonly, you will use the @mlflow.trace decorator to trace functions automatically.
  2. During your application's execution, call mlflow.update_current_trace() to add context to traces using tags or metadata. After your application completes and the trace is logged, tags can still be updated, but metadata is immutable.
Python
import mlflow

# user_id and session_id come from your application's request context
mlflow.update_current_trace(
    metadata={
        "mlflow.trace.user": user_id,
        "mlflow.trace.session": session_id,
    },
    tags={
        "query_category": "chat",  # Example of a custom tag
    },
)

To access metadata and tags in trace logs, use the metadata and tags fields in the pandas DataFrame returned by mlflow.search_traces(), or use the Trace.info.trace_metadata and Trace.info.tags fields from Trace objects.
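
For example, the following sketch reads those fields back. The experiment ID is a placeholder, and the DataFrame column names reflect the MLflow 3 schema, which may vary slightly between versions:

Python
import mlflow

# Bulk access: one row per trace in a pandas DataFrame
traces_df = mlflow.search_traces(experiment_ids=["<your-experiment-id>"])
print(traces_df[["trace_metadata", "tags"]].head())

# Single-trace access via the Trace object
trace = mlflow.get_trace(traces_df.iloc[0]["trace_id"])
print(trace.info.trace_metadata.get("mlflow.trace.user"))
print(trace.info.tags.get("query_category"))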

See Tutorial: Trace and analyze users and environments for a full tutorial.

Types of context metadata

Production applications need to track multiple pieces of context simultaneously. MLflow has standardized metadata fields to capture important contextual information:

| Context type | Use cases | MLflow field |
|---|---|---|
| Client request ID | Link traces to specific client requests or API calls for end-to-end debugging | TraceInfo parameter client_request_id |
| User session ID | Group traces from multi-turn conversations, allowing you to analyze the full conversational flow | mlflow.trace.session metadata |
| User ID | Associate traces with specific users for personalization, cohort analysis, and user-specific debugging | mlflow.trace.user metadata |
| Environment data | Track deployment context (environment, version, region) for operational insights and debugging across different deployments | Automatic metadata and custom metadata |
| Custom metadata | Add application-specific metadata for organizing, searching, and filtering traces | (your metadata keys) |
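
To attach a client request ID, one hedged approach is the client_request_id argument of mlflow.update_current_trace(); the X-Request-ID header name and the answer() helper below are illustrative, not part of MLflow:

Python
import mlflow


@mlflow.trace
def handle_request(question: str, headers: dict) -> str:
    # Propagate the caller's request ID so this trace can be joined with client-side logs.
    # The X-Request-ID header name is an assumption about your serving stack.
    mlflow.update_current_trace(
        client_request_id=headers.get("X-Request-ID", "unknown")
    )
    return answer(question)  # hypothetical downstream call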

Track users and sessions

Tracking users and sessions in your GenAI application provides essential context for understanding user behavior, analyzing conversation flows, and improving personalization.

Why track users and sessions?

User and session tracking enables powerful analytics and improvements:

  1. User behavior analysis - Understand how different users interact with your application
  2. Conversation flow tracking - Analyze multi-turn conversations and context retention
  3. Personalization insights - Identify patterns to improve user-specific experiences
  4. Quality per user - Track performance metrics across different user segments
  5. Session continuity - Maintain context across multiple interactions

Standard metadata fields for users and sessions

MLflow provides two standard metadata fields for session and user tracking:

  • mlflow.trace.user - Associates traces with specific users
  • mlflow.trace.session - Groups traces belonging to multi-turn conversations

When you use these standard metadata fields, MLflow automatically enables filtering and grouping in the UI. Unlike tags, metadata cannot be updated once the trace is logged, making it ideal for immutable identifiers like user and session IDs.
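
A minimal sketch of tagging each turn of a conversation with these fields; user_id and session_id are assumed to come from your application's request payload, and generate_reply() is a hypothetical model call:

Python
import mlflow


@mlflow.trace
def chat_turn(message: str, user_id: str, session_id: str) -> str:
    # Standard fields: the MLflow UI can filter and group traces on these.
    mlflow.update_current_trace(
        metadata={
            "mlflow.trace.user": user_id,
            "mlflow.trace.session": session_id,
        }
    )
    return generate_reply(message)  # hypothetical model call


chat_turn("What is MLflow Tracing?", user_id="user-123", session_id="session-abc")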

Track environments and versions

Tracking the execution environment and application version of your GenAI application lets you tie performance and quality issues back to the specific code and deployment that produced them. This metadata enables:

  • Environment-specific analysis across development, staging, and production
  • Performance/quality tracking and regression detection across app versions
  • Faster root cause analysis when issues occur

For deployment metadata such as environments and versions, your application should generally read these values from environment variables rather than hard-coding them. Environment variables simplify deployment because the same build can be promoted across environments without code changes. For example:

Python
import mlflow
import os

# In your application logic
mlflow.update_current_trace(
    metadata={
        "mlflow.source.type": os.getenv("APP_ENVIRONMENT", "development"),  # Override default
    }
)

note

For a comprehensive overview of how versioning works, see Version Tracking.

Automatically populated metadata

MLflow automatically sets certain standard metadata fields based on your execution environment.

You can override any of the automatically populated metadata fields using mlflow.update_current_trace(). This is useful when the automatic detection does not meet your requirements. For example, override the source name using mlflow.update_current_trace(metadata={"mlflow.source.name": "custom_name"}).

| Category | Metadata field | Description | Automatic setting logic |
|---|---|---|---|
| Execution environment | mlflow.source.name | The entry point or script that generated the trace. | Automatically populated with the filename for Python scripts, or the notebook name for Databricks/Jupyter notebooks. |
| Execution environment | mlflow.source.git.commit | Git commit hash. | If run from a Git repository, the commit hash is automatically detected and populated. |
| Execution environment | mlflow.source.git.branch | Git branch. | If run from a Git repository, the current branch name is automatically detected and populated. |
| Execution environment | mlflow.source.git.repoURL | Git repo URL. | If run from a Git repository, the repository URL is automatically detected and populated. |
| Execution environment | mlflow.source.type | Captures the execution environment. | Automatically set to NOTEBOOK when running in a Jupyter or Databricks notebook, LOCAL when running a local Python script, and UNKNOWN otherwise. In a deployed app, we suggest overriding this value with the deployment environment, e.g. PRODUCTION or STAGING. |
| Execution environment | mlflow.sourceRun | The run ID of the source run that generated the trace. | Automatically set when the trace is generated within an active MLflow run. |
| Application version | mlflow.modelId | MLflow LoggedModel ID. | Automatically set to the model ID in the MLFLOW_ACTIVE_MODEL_ID environment variable, or the model ID set via the mlflow.set_active_model() function. |
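
As a hedged sketch of the second option, assuming MLflow 3's mlflow.set_active_model() API (the model name is illustrative):

Python
import mlflow

# Traces generated after this call carry the active LoggedModel's ID in their metadata.
mlflow.set_active_model(name="my-chat-app")  # model name is illustrative


@mlflow.trace
def predict(question: str) -> str:
    return "..."  # placeholder for your application logic


predict("Hello")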

Add custom metadata

You use custom metadata keys to capture any other application-specific context. For example, you might want to attach information such as:

  • Application version
  • Deployment ID
  • Deployment region
  • Feature flags
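
For instance, a sketch attaching this kind of context; the key names and environment variables are illustrative, not MLflow conventions, and metadata values should be strings:

Python
import mlflow
import os

# Inside your application logic, alongside the standard fields shown above
mlflow.update_current_trace(
    metadata={
        "app_version": os.getenv("APP_VERSION", "0.0.0"),
        "deployment_id": os.getenv("DEPLOYMENT_ID", "local"),
        "region": os.getenv("REGION", "unknown"),
        "feature_flags": os.getenv("FEATURE_FLAGS", ""),
    }
)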

Best practices

  1. Consistent ID formats - Use standardized formats for user and session IDs across your application
  2. Session boundaries - Define clear rules for when sessions start and end
  3. Environment variables - Populate metadata from environment variables rather than hard-coding values
  4. Combine context types - Track user, session, and environment context together for complete traceability
  5. Regular analysis - Set up dashboards to monitor user behavior, session patterns, and version performance
  6. Override defaults thoughtfully - Only override automatically populated metadata when necessary

Next steps