AI · Bullish · arXiv – CS AI · 7h ago · 7/10
Piper: A Programmable Distributed Training System
Piper is a new distributed training system that separates strategy design from runtime implementation, so researchers can compose multiple parallelism strategies without manual reconfiguration. It matches the performance of existing approaches such as ZeRO and gains efficiency in complex training scenarios by jointly optimizing computation and communication.