y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#feedback-systems News & Analysis

5 articles tagged with #feedback-systems. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

5 articles
AINeutralarXiv – CS AI · Feb 277/108
🧠

A Mathematical Theory of Agency and Intelligence

Researchers propose a mathematical framework distinguishing agency from intelligence in AI systems, introducing 'bipredictability' as a measure of effective information sharing between observations, actions, and outcomes. Current AI systems achieve agency but lack true intelligence, which requires adaptive learning and self-monitoring capabilities.

AINeutralarXiv – CS AI · May 276/10
🧠

Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation

Researchers introduce CUDAnalyst, a new analysis framework that reveals how large language models make planning decisions when generating CUDA kernels by decomposing feedback signals. The study demonstrates that explicit planning helps only when feedback is well-aligned and that effective planning emerges from structured multi-feedback interactions, with findings showing robustness across different models and workloads.

AINeutralarXiv – CS AI · May 96/10
🧠

Prober.ai: Gated Inquiry-Based Feedback via LLM-Constrained Personas for Argumentative Writing Development

Prober.ai is an LLM-powered web-based writing environment that uses constrained AI personas and gated feedback mechanisms to improve argumentative writing through inquiry-based questioning rather than text generation. The system addresses cognitive outsourcing in education by forcing student reflection before revealing revision suggestions, grounded in Toulmin's argumentation theory and peer feedback research.

🧠 Gemini
AIBullisharXiv – CS AI · May 76/10
🧠

Efficiently Aligning Language Models with Online Natural Language Feedback

Researchers have developed methods to efficiently align language models using online natural language feedback in domains where human supervision is limited and difficult to quantify. By iteratively optimizing proxy reward models and collecting fresh expert feedback, the approach recovers 80-100% of full-supervision performance with 3-20x fewer expert samples, demonstrating significant improvements in training data efficiency.

🧠 Haiku
AIBullisharXiv – CS AI · Mar 55/10
🧠

Online Learning for Multi-Layer Hierarchical Inference under Partial and Policy-Dependent Feedback

Researchers developed a new variance-reduced EXP4-based algorithm for optimizing routing policies in multi-layer hierarchical inference systems. The solution addresses the challenge of sparse, policy-dependent feedback in AI systems where prediction errors are only revealed at terminal layers, improving stability and performance over standard importance-weighted approaches.