AINeutralarXiv โ CS AI ยท Feb 275/106
๐ง
FIRE: A Comprehensive Benchmark for Financial Intelligence and Reasoning Evaluation
Researchers introduce FIRE, a comprehensive benchmark for evaluating Large Language Models' financial intelligence and reasoning capabilities. The benchmark includes theoretical financial knowledge tests from qualification exams and 3,000 practical financial scenario questions covering complex business domains.