y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#self-improvement News & Analysis

30 articles tagged with #self-improvement. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

30 articles
AIBullisharXiv – CS AI · 5d ago6/10
🧠

CyberEvolver: Structured Self-Evolution for Cybersecurity Agents On the Fly

Researchers introduce CyberEvolver, an AI agent framework that autonomously improves its own architecture through iterative learning from failed cybersecurity tasks. The system demonstrates 13.6% average success rate improvements across CTF challenges and penetration testing, outperforming fixed human-designed alternatives and competing self-improvement methods.

AINeutralarXiv – CS AI · May 96/10
🧠

Conversation for Non-verifiable Learning: Self-Evolving LLMs through Meta-Evaluation

Researchers introduce CoNL, a framework that enables large language models to improve themselves through multi-agent self-play without requiring ground-truth labels or external judges. The system uses critiques that successfully improve solutions as training signals, allowing models to jointly optimize both generation and evaluation capabilities for non-verifiable tasks like creative writing and ethical reasoning.

AINeutralarXiv – CS AI · Apr 146/10
🧠

TokUR: Token-Level Uncertainty Estimation for Large Language Model Reasoning

Researchers propose TokUR, a framework that enables large language models to estimate uncertainty at the token level during reasoning tasks, allowing LLMs to self-assess response quality and improve performance on mathematical problems. The approach uses low-rank random weight perturbation to generate predictive distributions, demonstrating strong correlation with answer correctness and potential for enhancing LLM reliability.

AIBullisharXiv – CS AI · Mar 266/10
🧠

ELITE: Experiential Learning and Intent-Aware Transfer for Self-improving Embodied Agents

Researchers introduce ELITE, a new framework that enables AI embodied agents to learn from their own experiences and transfer knowledge to similar tasks. The system addresses failures in vision-language models when performing complex physical tasks by using self-reflective knowledge construction and intent-aware retrieval mechanisms.

AIBullisharXiv – CS AI · Mar 36/109
🧠

Provable and Practical In-Context Policy Optimization for Self-Improvement

Researchers introduce In-Context Policy Optimization (ICPO), a new method that allows AI models to improve their responses during inference through multi-round self-reflection without parameter updates. The practical ME-ICPO algorithm demonstrates competitive performance on mathematical reasoning tasks while maintaining affordable inference costs.

← PrevPage 2 of 2