Distributed tracing for agentic workflows with OpenTelemetry

A practical guide to setting up distributed tracing for multi-agent AI workflows using OpenTelemetry, based on lessons from building the it-self-service-agent AI quickstart. Covers context propagation across service boundaries using W3C Trace Context, auto-instrumentation for FastAPI and HTTPX clients, manual instrumentation for MCP servers using a decorator pattern, Llama Stack telemetry configuration for versions 0.2.x and 0.3.x, and deployment options ranging from Jaeger All-in-One for development to Red Hat OpenShift Distributed Tracing for production. Includes concrete code examples for span creation, context extraction/injection, error handling, and attribute best practices.

#python

#devops

#mcp

#opentelemetry

#agentic-ai

Apr 06•17m read time•From developers.redhat.com

Table of contents

About AI quickstarts Distributed tracing for agentic workloads What is OpenTelemetry?Context propagation Instrumenting Llama Stack Auto-instrumentation: HTTP clients and FastAPI Manual instrumentation: MCP servers Llama Stack tracing configuration Collect traces with a Jaeger All-in-One server Collect traces with Red Hat OpenShift Distributed Tracing Wrapping up

Comment

Bookmark

Copy

Sort: