AI · Neutral · arXiv – CS AI · 8h ago · 6/10
Are LLMs Ready for Neural-integrated Mechanistic Modeling? A Benchmark and Agentic Framework
Researchers introduce NIMM, a benchmark that evaluates how well large language models can construct neural-integrated mechanistic models, which combine traditional scientific equations with neural networks. They also propose NIMMGen, an agentic framework that uses tree-guided search and significantly outperforms existing LLM approaches on this modeling task across three scientific domains.