Peer-to-Peer acceleration for AI model distribution with Dragonfly

Dragonfly, a CNCF Graduated P2P file distribution system, now natively supports the hf:// and modelscope:// URL protocols for downloading AI models directly from the Hugging Face and ModelScope hubs. Without it, each GPU node downloads large models independently: a 130 GB DeepSeek-V3 pulled by 200 nodes generates 130 GB × 200 = 26 TB of origin traffic. With Dragonfly, the seed peer fetches the model once and distributes pieces across the P2P mesh, so the origin serves roughly 130 GB instead of 26 TB, a reduction of about 99.5%. The new backends are implemented in Rust behind a pluggable Backend trait and support authentication tokens, revision pinning, recursive repository downloads, and single-file downloads. Key use cases include multi-node GPU cluster deployments, CI/CD ML pipelines, air-gapped environments, and dataset distribution for training jobs. The feature is available through the dfget CLI and integrates natively with Kubernetes via Helm.
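The traffic figures in the summary above can be reproduced with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical and not part of Dragonfly, and it assumes an idealized case in which exactly one seed peer fetches the model from the origin once, ignoring retries, metadata requests, and multiple seed peers.

```rust
/// Origin traffic in GB when every node downloads the model independently.
fn direct_origin_traffic_gb(model_gb: f64, nodes: u32) -> f64 {
    model_gb * f64::from(nodes)
}

/// Origin traffic in GB when a single seed peer fetches the model once.
/// Assumption: idealized P2P case with one origin fetch in total.
fn p2p_origin_traffic_gb(model_gb: f64) -> f64 {
    model_gb
}

/// Percentage reduction in origin traffic from direct downloads to P2P.
fn origin_reduction_percent(model_gb: f64, nodes: u32) -> f64 {
    let direct = direct_origin_traffic_gb(model_gb, nodes);
    let p2p = p2p_origin_traffic_gb(model_gb);
    (1.0 - p2p / direct) * 100.0
}

fn main() {
    // Figures taken from the summary: a 130 GB model on 200 nodes.
    let model_gb = 130.0;
    let nodes = 200;

    let direct_tb = direct_origin_traffic_gb(model_gb, nodes) / 1000.0;
    let reduction = origin_reduction_percent(model_gb, nodes);

    println!("direct origin traffic: {:.1} TB", direct_tb);
    println!("P2P origin traffic:    {:.1} GB", p2p_origin_traffic_gb(model_gb));
    println!("origin reduction:      {:.1}%", reduction);
}
```

Under this model the reduction depends only on the node count (1 - 1/N), so the saving grows as the cluster gets larger.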

#kubernetes

#rust

#peer-to-peer

Apr 06 • 12m read time • From cncf.io

Table of contents

The problem: AI model distribution is broken at scale
What is Dragonfly?
Introducing native model hub protocols in Dragonfly
The hf:// protocol: Hugging Face hub
The modelscope:// protocol: ModelScope hub
Under the hood: Technical deep dive
Real-world impact: Where this matters
Comparison: Why not just use platform CLIs?
Getting started
What’s next
Contributing
Conclusion
