AI · Bearish · arXiv – CS AI · 7h ago · 7/10
ASMR-Bench: Auditing for Sabotage in ML Research
Researchers introduced ASMR-Bench, a benchmark for detecting sabotage in ML research codebases. It shows that current frontier LLMs and human auditors both struggle to identify subtle implementation flaws that produce misleading results. Even the best-performing model (Gemini 3.1 Pro) achieved only 77% AUROC and a 42% fix rate, highlighting critical vulnerabilities in AI-assisted research validation.
Gemini