AINeutralarXiv – CS AI · May 297/10
🧠Researchers introduce the NOVA framework, which models AI knowledge discovery as an adaptive sampling process and identifies fundamental scaling limitations. The analysis reveals a contamination trap where false positives accumulate faster than genuine discoveries as knowledge becomes scarce, with cumulative generation costs following a Zipf-distributed scaling law demonstrating asymptotic diminishing returns.
AIBullisharXiv – CS AI · May 287/10
🧠Researchers propose COSE, a self-evolution framework for large language models that uses confidence signals to filter noisy self-generated training feedback without external verifiers. The method demonstrates consistent improvements across 19 benchmarks and multiple model sizes (0.6B–4B parameters), achieving state-of-the-art performance in reasoning and mathematics tasks.
🧠 Llama
AIBullisharXiv – CS AI · May 17/10
🧠Researchers introduce ANCORA, a self-play framework enabling language models to generate verifiable problems, solve them, and improve without human supervision. The method achieves 81.5% pass rate on Dafny2Verus tasks, significantly outperforming baseline approaches and demonstrating advances in autonomous AI reasoning capabilities.
AIBullisharXiv – CS AI · Mar 117/10
🧠PlayWorld introduces a breakthrough AI system that trains robot world simulators entirely from autonomous robot self-play, eliminating the need for human demonstrations. The system achieves 40% improvements in failure prediction and 65% policy performance gains when deployed in real-world scenarios.
AIBullisharXiv – CS AI · Mar 47/102
🧠Researchers introduce Tether, a breakthrough method enabling robots to perform autonomous functional play using minimal human demonstrations (≤10). The system generates over 1000 expert-level trajectories through continuous cycles of task execution and improvement, representing a significant advance in autonomous robotics learning.
AIBullishOpenAI News · Oct 117/104
🧠Researchers demonstrate that AI self-play training enables simulated agents to autonomously develop complex physical skills like tackling, ducking, and ball handling without explicit programming. Combined with successful Dota 2 results, this suggests self-play will be fundamental to future powerful AI systems.
AIBullisharXiv – CS AI · Jun 16/10
🧠Researchers introduce SCALE, a self-improving web agent framework that uses adversarial roles and cognitive-aware exploration to autonomously adapt to complex web environments without relying on handcrafted pipelines or expensive expert data. The framework includes SCALE-Hop, a graph exploration strategy, and SCALE-20k, a 20,000-sample dataset from 19 real-world websites that demonstrates improved performance across multiple multimodal large language models.
AIBullishDecrypt · Apr 146/10
🧠Nous Research has unveiled Hermes, an open-source AI agent featuring a built-in learning loop that enables it to create and improve skills from experience autonomously. The agent operates on terminal infrastructure and represents a significant advancement in self-improving AI systems, positioning itself as a competitor to proprietary alternatives like OpenAI's tools.
AIBullisharXiv – CS AI · Mar 176/10
🧠Researchers propose a new AI learning architecture inspired by human and animal cognition that integrates observational learning and active behavior learning. The framework includes a meta-control system that switches between learning modes, addressing current limitations in autonomous AI learning.