AIBullisharXiv – CS AI · 15h ago6/10
🧠
Natural Language Query to Configuration for Retrieval Agents
Researchers introduce BRANE, an AI system that dynamically selects optimal configurations for retrieval agents by analyzing natural-language queries at inference time. The method reduces serving costs by up to 89% while maintaining accuracy, demonstrating that per-query optimization outperforms traditional static pipeline tuning across multiple benchmarks.