AINeutralarXiv – CS AI · 18h ago7/10
🧠
AI Scientists Are Only as Good as Their Evidence: A Stratified Ablation of Proprietary Data and Reasoning Skills in Drug-Asset Valuation
Researchers demonstrate that AI agents' performance in drug-asset valuation is fundamentally limited by access to proprietary data rather than reasoning quality alone. A three-arm experiment shows that adding reasoning scaffolds and structured tools improves calibration but cannot overcome gaps in underlying evidence, with proprietary datasets enabling 96% recovery of expert valuations versus 38% for public-data-only systems.