AINeutralarXiv – CS AI · 15h ago6/10
🧠
When Does Deep RL Beat Calibrated Baselines? A Benchmark Study on Adaptive Resource Control
A comprehensive benchmark study reveals that properly calibrated rule-based autoscalers outperform six mainstream deep reinforcement learning algorithms on cost in adaptive resource control tasks. The research challenges assumptions about DRL superiority, identifying baseline calibration and reward engineering as greater bottlenecks than algorithm selection.