y0news

#problem-decomposition News & Analysis

1 article tagged with #problem-decomposition. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

1 article
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
TD-Grokking: Learning from Zero-Reward Problems by Training-Time Decomposition

Researchers introduce TD-Grokking, a training-time decomposition framework that enables large language models to learn from zero-reward problems by recursively breaking down unsolvable tasks into verifiable subproblems. This addresses a critical limitation in reinforcement learning with verifiable rewards (RLVR), where models typically fail to improve on challenging problems that produce uniform failure outcomes.
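To illustrate why uniform failure outcomes give no learning signal, here is a minimal sketch. It assumes a group-relative advantage estimator, as used in some RLVR setups; the summary does not say which estimator the paper uses, and the function name `group_relative_advantages` is hypothetical.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Center and scale each reward by its group's mean and standard deviation.

    Assumption: a group-relative estimator; not taken from the TD-Grokking paper.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# A problem the model sometimes solves: advantages differ in sign,
# so a policy-gradient update can favor the successful samples.
mixed = group_relative_advantages([1.0, 0.0, 0.0, 1.0])

# A zero-reward problem: every sample fails, every advantage is zero,
# and the update carries no signal.
all_failed = group_relative_advantages([0.0, 0.0, 0.0, 0.0])

print(mixed)
print(all_failed)
```

Under this assumption, decomposing a zero-reward problem into verifiable subproblems matters because some subproblems may yield mixed outcomes, which restores a nonzero signal.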