NVIDIA Dynamo 1.0 is now available to DigitalOcean customers to help drive performance enhancements and cost efficiency. NVIDIA Dynamo 1.0 offers a 7x inference performance increase and by pairing it with DigitalOcean’s Agentic Inference Cloud, customers can achieve higher performance at lower costs while benefiting from seamless deployment

DO (DigitalOcean) provides insights into cloud computing, infrastructure as code, and developer tools, offering tutorials and documentation for deploying and managing applications on the cloud. By exploring DO's curated content, developers can learn about cloud-native architectures, Kubernetes deployment patterns, and best practices for building scalable and resilient applications. Whether you're a startup founder, indie developer, or enterprise IT professional, DO offers resources to accelerate your cloud journey and optimize your infrastructure for success.

DigitalOcean

NVIDIA Dynamo 1.0, announced at GTC, is now available to DigitalOcean customers. It delivers up to 7x inference performance improvement on NVIDIA GB200 NVL systems through key features: KV-aware routing, disaggregated prefill/decode serving, and memory offloading via a KV Block Manager. Paired with DigitalOcean's Agentic Inference Cloud and Managed Kubernetes, Workato achieved 67% higher GPU throughput, 79% lower latency, and 67% lower model cost using half the GPUs. Customers can deploy Dynamo 1.0 as a container on Droplets or via DigitalOcean Kubernetes with inference runtimes like vLLM, SGLang, or TensorRT-LLM.

Meet the New Standard for High-Performance, Low-Cost Inference: NVIDIA Dynamo 1.0 is now available to DigitalOcean Customers

How DigitalOcean optimizes inference workloads with Dynamo to improve throughput and latency

The future of inference optimization with NVIDIA and DigitalOcean