y0news
AnalyticsDigestsSourcesRSSAICrypto
#coding-benchmarks1 article
1 articles
AIBullisharXiv – CS AI Β· 7h ago7/10
🧠

Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents

Researchers introduce the Darwin GΓΆdel Machine (DGM), a self-improving AI system that can iteratively modify its own code and validate changes through benchmarks. The system demonstrated significant performance improvements, increasing coding capabilities from 20.0% to 50.0% on SWE-bench and from 14.2% to 30.7% on Polyglot benchmarks.