AI · Neutral · arXiv – CS AI · 3h ago · 6/10
Dr-CiK: A Testbed for Foresight-Driven Agents
Researchers introduce Dr-CiK, a benchmark for testing whether AI agents can independently retrieve relevant context from noisy document sources to improve time series forecasting. The evaluation finds that current information-retrieval agents recover less than 5% of the supporting evidence and are frequently misled by irrelevant information, highlighting a critical gap in foresight-driven AI development.