AINeutralarXiv – CS AI · 3h ago6/10
🧠
Benchmarking AI for low-resource contexts: Thinking beyond leaderboards
Researchers argue that current AI evaluation benchmarks fail to reflect real-world performance in low-resource environments, where factors like noisy inputs, poor connectivity, and low-end hardware significantly impact usability. The paper proposes a new evaluation framework that assesses deployed systems holistically rather than isolated models, with standardized reporting cards designed for policymakers and implementers.