Enhancing AI Agent reliability through advanced observability using NVIDIA NeMo and Docker Model Runner (DMR).

Cloud Native Now is a vibrant platform dedicated to exploring the ever-evolving landscape of cloud-native technologies. Through insightful articles, practical tutorials, and  analyses, readers can embark on a journey to master Kubernetes, containerization, microservices architecture, and DevOps practices. By staying updated with the latest trends and best practices in cloud-native development, readers can enhance their skills and propel their organizations towards digital transformation.

Cloud Native Now

A hands-on tutorial showing how to configure NVIDIA NeMo Agent Toolkit with Docker Model Runner (DMR) for local LLM inference, with a focus on adding observability to AI agents via OpenTelemetry tracing. Covers setting up DMR with the smollm2 model, configuring NeMo via YAML with a ReAct agent and Wikipedia search tool, and wiring up an OTel collector to capture spans from agent runs.

Configuring NVIDIA NeMo Agent Toolkit With Docker Model Runner