AIBullisharXiv – CS AI · 10h ago7/10
🧠
SwarmX: Agentic Scheduling for Low-Latency Agentic Systems
SwarmX is a new scheduling system designed to optimize GPU-CPU cluster performance for agentic AI applications that make multiple model calls and tool executions. The system uses neural predictors to reduce tail latency by up to 61.5% and sustain 2x higher throughput than production schedulers, addressing a critical infrastructure gap as AI agents become more complex.