AI · Bullish · arXiv – CS AI · 6h ago · 7/10
Cost-Optimal LLM Routing with Limited User Feedback under User Satisfaction Guarantees
Researchers introduced SLARouter, an online algorithm that routes LLM requests by learning cost-efficient policies from sparse user feedback while guaranteeing Service Level Agreement (SLA) compliance. The approach requires no per-benchmark tuning and reduces operating costs by up to 2.2x compared with existing solutions.