#llm-generalization News & Analysis

2 articles tagged with #llm-generalization. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2 articles

AIBullisharXiv – CS AI · Jun 17/10

🧠

MAVEN: Improving Generalization in Agentic Tool Calling

Researchers introduce MAVEN, a symbolic reasoning framework that improves language model generalization in tool-calling tasks by 23 percentage points (48% to 71% accuracy) on a new stress-test benchmark, while maintaining cost efficiency roughly 10x lower than frontier proprietary models. The work demonstrates that lightweight verification-centered scaffolds can enhance compositional reasoning without additional model training.

AIBullisharXiv – CS AI · May 127/10

🧠

Memorize Theorems, Not Instances: Probing SFT Generalization through Mathematical Reasoning

Researchers propose Theorem-SFT, a novel supervised fine-tuning approach that teaches language models to apply mathematical rules explicitly rather than memorize surface-level correlations between problems and solutions. The method demonstrates significant performance improvements across benchmarks while revealing that feed-forward layers, not memorization itself, are the primary locus of reasoning capability.