AI · Bullish · arXiv – CS AI · 9h ago · 6/10
NCCL EP: Towards a Unified Expert Parallel Communication API for NCCL
Researchers have developed NCCL EP, a communication library for Mixture-of-Experts (MoE) models that improves the performance of GPU-initiated communication. Built entirely on NVIDIA's NCCL Device API, it provides a unified API with a low-latency mode for inference and a high-throughput mode for training.
Nvidia