The track application versions guide showed how to track application versions using LoggedModel during development. When you deploy a LoggedModel to production, you need to link the traces it generates back to the specific version for monitoring and debugging. This guide shows how to configure your deployment to include version information in production traces.
Tip
Deploying on Databricks Model Serving? Trace linking is automatically configured for you. Skip to Trace Linking on Databricks Model Serving for details.
Prerequisites
For production deployments outside of Databricks Model Serving, install the mlflow-tracing package:
pip install --upgrade "mlflow-tracing>=3.1.0"
This package is specifically optimized for production environments, offering:
- Minimal dependencies for faster, leaner deployments
- Performance optimizations for high-volume tracing
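To confirm the package made it into your deployment image, a quick standard-library check works; a minimal sketch:
from importlib.metadata import PackageNotFoundError, version
try:
    # mlflow-tracing is the distribution name used in the pip install above
    print(f"mlflow-tracing version: {version('mlflow-tracing')}")
except PackageNotFoundError:
    print("mlflow-tracing is not installed")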
Note
MLflow 3 is required for production tracing. MLflow 2.x is not supported for production deployments due to performance limitations and missing features for production use.
Create an MLflow experiment by following the setup your environment quickstart.
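If you prefer to create the experiment in code rather than through the quickstart, a minimal sketch looks like this (the experiment path is a placeholder):
import mlflow
mlflow.set_tracking_uri("databricks")
# set_experiment creates the experiment if it does not already exist
mlflow.set_experiment("/Shared/production-genai-app")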
Environment Variable Configuration
- Navigate to the versions tab to get the LoggedModel ID. In your CI/CD pipeline, you can generate a new LoggedModel by using create_external_model() as shown below. We suggest using the current git commit hash as the version identifier, so each deployed commit maps to its own LoggedModel.
import mlflow
import subprocess

# Define your application and its version identifier
app_name = "customer_support_agent"

# Get current git commit hash for versioning
try:
    git_commit = (
        subprocess.check_output(["git", "rev-parse", "HEAD"])
        .decode("ascii")
        .strip()[:8]
    )
    version_identifier = f"git-{git_commit}"
except subprocess.CalledProcessError:
    version_identifier = "local-dev"  # Fallback if not in a git repo

logged_model_name = f"{app_name}-{version_identifier}"

# Create a new LoggedModel
model = mlflow.create_external_model(name=logged_model_name)
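# Optional: persist the generated model ID so a later CI/CD step can export it
# as MLFLOW_ACTIVE_MODEL_ID (the file name here is an arbitrary choice)
with open("active_model_id.txt", "w") as f:
    f.write(model.model_id)
print(f"Created LoggedModel with ID: {model.model_id}")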
- Add the LoggedModel ID to your production environment configuration in the MLFLOW_ACTIVE_MODEL_ID environment variable, alongside the standard MLflow tracing variables from the setup your environment quickstart.
# Standard MLflow tracing configuration
export DATABRICKS_HOST="https://your-workspace.databricks.com"
export DATABRICKS_TOKEN="your-databricks-token"
export MLFLOW_TRACKING_URI=databricks
# Either use MLFLOW_EXPERIMENT_NAME or MLFLOW_EXPERIMENT_ID
export MLFLOW_EXPERIMENT_NAME="/Shared/production-genai-app"
# Add LoggedModel version tracking by specifying your LoggedModel ID
# Ensure this matches a LoggedModel in your MLflow Experiment
export MLFLOW_ACTIVE_MODEL_ID="customer_support_agent-git-98207f02"
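Once these variables are set, you can verify at startup that MLflow picked up the ID; a minimal sketch using MLflow 3's get_active_model_id() accessor:
import mlflow
# Reflects MLFLOW_ACTIVE_MODEL_ID (or a value set via mlflow.set_active_model)
print(f"Active LoggedModel: {mlflow.get_active_model_id()}")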
Automatic Trace Linking
Important
When you set the MLFLOW_ACTIVE_MODEL_ID environment variable, all traces are automatically linked to that LoggedModel. You don't need to manually tag traces; MLflow handles this for you.
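If your platform makes it awkward to set environment variables, MLflow 3 also exposes mlflow.set_active_model() for setting the same value in code; a minimal sketch (the model ID is the placeholder used throughout this guide):
import mlflow
# Equivalent to setting MLFLOW_ACTIVE_MODEL_ID; call once at application startup
mlflow.set_active_model(model_id="customer_support_agent-git-98207f02")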
Your application code remains exactly the same as during development:
import mlflow
from fastapi import FastAPI, Request

app = FastAPI()

@mlflow.trace
def process_message(message: str) -> str:
    # Your actual application logic here
    # This is just a placeholder
    return f"Processed: {message}"

@app.post("/chat")
def handle_chat(request: Request, message: str):
    # Your traces are automatically linked to the LoggedModel
    # specified in MLFLOW_ACTIVE_MODEL_ID
    response = process_message(message)
    return {"response": response}
To add additional context to your traces (such as user IDs, session IDs, or custom metadata), see Adding context to production traces in the production tracing guide.
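As a quick illustration, the sketch below attaches a user and session to the current trace; it assumes MLflow 3's mlflow.update_current_trace() API and its reserved metadata keys, and the IDs shown are placeholders:
import mlflow

@mlflow.trace
def process_message(message: str, user_id: str, session_id: str) -> str:
    # Attach request context to the active trace; mlflow.trace.user and
    # mlflow.trace.session are reserved metadata keys in MLflow 3
    mlflow.update_current_trace(
        metadata={"mlflow.trace.user": user_id, "mlflow.trace.session": session_id}
    )
    return f"Processed: {message}"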
Deployment Examples
Docker
When deploying with Docker, pass all necessary environment variables through your container configuration:
# Dockerfile
FROM python:3.9-slim
# Install dependencies
COPY requirements.txt .
RUN pip install -r requirements.txt
# Copy application code
COPY . /app
WORKDIR /app
# Required environment variables (DATABRICKS_HOST, DATABRICKS_TOKEN,
# MLFLOW_TRACKING_URI, MLFLOW_EXPERIMENT_NAME, MLFLOW_ACTIVE_MODEL_ID)
# are supplied at runtime via `docker run -e` (see below)
CMD ["python", "app.py"]
Run the container with environment variables:
docker run -d \
-e DATABRICKS_HOST="https://your-workspace.databricks.com" \
-e DATABRICKS_TOKEN="your-databricks-token" \
-e MLFLOW_TRACKING_URI=databricks \
-e MLFLOW_EXPERIMENT_NAME="/Shared/production-genai-app" \
-e MLFLOW_ACTIVE_MODEL_ID="customer_support_agent-git-98207f02" \
-e APP_VERSION="1.0.0" \
your-app:latest
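To confirm the container is emitting linked traces, you can send a test request and then check the Traces tab of your experiment; a hedged sketch using the requests package (assumes the /chat route from the example above and that port 8000 is published, for example with -p 8000:8000):
import requests

# Hypothetical smoke test against the running container
resp = requests.post("http://localhost:8000/chat", params={"message": "hello"})
print(resp.status_code, resp.json())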
Kubernetes
For Kubernetes deployments, use ConfigMaps and Secrets to manage configuration:
# configmap.yaml
apiVersion: v1
kind: ConfigMap
metadata:
  name: mlflow-config
data:
  DATABRICKS_HOST: 'https://your-workspace.databricks.com'
  MLFLOW_TRACKING_URI: 'databricks'
  MLFLOW_EXPERIMENT_NAME: '/Shared/production-genai-app'
  MLFLOW_ACTIVE_MODEL_ID: 'customer_support_agent-git-98207f02'
---
# secret.yaml
apiVersion: v1
kind: Secret
metadata:
  name: databricks-secrets
type: Opaque
stringData:
  DATABRICKS_TOKEN: 'your-databricks-token'
---
# deployment.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: genai-app
spec:
  replicas: 2
  selector:
    matchLabels:
      app: genai-app
  template:
    metadata:
      labels:
        app: genai-app
    spec:
      containers:
        - name: app
          image: your-app:latest
          ports:
            - containerPort: 8000
          envFrom:
            - configMapRef:
                name: mlflow-config
            - secretRef:
                name: databricks-secrets
          env:
            - name: APP_VERSION
              value: '1.0.0'
          resources:
            requests:
              memory: '256Mi'
              cpu: '250m'
            limits:
              memory: '512Mi'
              cpu: '500m'
Querying Version-Specific Traces
Once deployed, you can view traces in the MLflow trace UI, or query traces for a specific model version with the SDK:
import mlflow

# Get the experiment ID
experiment = mlflow.get_experiment_by_name("/Shared/production-genai-app")

# Find all traces from a specific model version
traces = mlflow.search_traces(
    experiment_ids=[experiment.experiment_id],
    model_id="customer_support_agent-git-98207f02",
)
# View the results
print(f"Found {len(traces)} traces for this model version")
Trace Linking on Databricks Model Serving
When you deploy a LoggedModel to Databricks Model Serving using Agent Framework with MLflow 3 installed in your development environment, trace linking is configured automatically.
To view traces from your Databricks Model Serving endpoint:
- Navigate to the MLflow Experiment that was active when you called agents.deploy()
- Click on the Traces tab to view traces
- All traces are automatically linked to the specific model version serving the requests
The only requirement is that your application code uses MLflow tracing, either through autologging or manual instrumentation with @mlflow.trace.
Next Steps
For complete production tracing setup including authentication, monitoring, and feedback collection for deployments outside Databricks Model Serving, see Production observability with tracing.