AINeutralarXiv – CS AI · Mar 66/10
🧠
FinRetrieval: A Benchmark for Financial Data Retrieval by AI Agents
Researchers introduced FinRetrieval, a benchmark testing AI agents' ability to retrieve financial data, evaluating 14 configurations across major providers. The study found that tool availability dramatically impacts performance, with Claude Opus achieving 90.8% accuracy using structured APIs versus only 19.8% with web search alone.
🏢 OpenAI🏢 Anthropic🧠 Claude