AI · Neutral · arXiv – CS AI · 3h ago · 6/10
AlphaForgeBench: Benchmarking End-to-End Trading Strategy Design with Large Language Models
Researchers introduce AlphaForgeBench, a new evaluation framework that addresses critical instability issues in Large Language Models (LLMs) deployed as trading agents. Rather than having LLMs generate discrete trading actions, the framework recasts them as quantitative researchers who produce alpha factors and strategies, enabling deterministic, reproducible evaluation aligned with real-world financial workflows.