DeepEP is a communication library designed for Mixture-of-Experts (MoE) and expert parallelism (EP), providing high-throughput, low-latency all-to-all GPU kernels for efficient data transfer. It supports low-precision operations and includes optimized kernels for asymmetric-domain bandwidth forwarding. The library is tested on
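The dispatch/combine pattern that such all-to-all kernels implement can be illustrated with a minimal CPU simulation. This is a conceptual sketch only, assuming a toy setup where expert `e` lives on rank `e % num_ranks`; the function names and data layout here are illustrative and are not DeepEP's actual API.

```python
# Toy simulation of MoE expert-parallel all-to-all routing (NOT DeepEP's API).

def dispatch(tokens_per_rank, expert_of, num_ranks):
    """Route each token to the rank owning its assigned expert (all-to-all send)."""
    inbox = [[] for _ in range(num_ranks)]
    for src, tokens in enumerate(tokens_per_rank):
        for i, tok in enumerate(tokens):
            dst = expert_of[src][i] % num_ranks  # hypothetical expert-to-rank mapping
            inbox[dst].append((src, i, tok))     # remember origin for the combine step
    return inbox

def combine(inbox, num_ranks, counts):
    """Return expert outputs to each token's source rank and slot (reverse all-to-all)."""
    out = [[None] * counts[r] for r in range(num_ranks)]
    for items in inbox:
        for src, i, tok in items:
            out[src][i] = tok * 2  # stand-in for the expert's computation
    return out

# Two ranks, two tokens each; expert assignments decide the routing.
inbox = dispatch([[1, 2], [3, 4]], expert_of=[[0, 1], [1, 0]], num_ranks=2)
result = combine(inbox, num_ranks=2, counts=[2, 2])
print(result)  # each token comes back to its original rank/slot, processed
```

In the real library these two phases run as fused GPU kernels over NVLink/RDMA rather than Python loops, which is where the throughput and latency gains come from.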

Table of contents

- Performance
- Quick start
- Network configurations
- Interfaces and examples
- Notices
- License
- Citation
