#reasoning-analysis News & Analysis

3 articles tagged with #reasoning-analysis. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

3 articles

AIBullisharXiv – CS AI · Jun 97/10

🧠

SLMJury: Can Small Language Models Judge as Well as Large Ones?

Researchers introduce SLMJury, a framework demonstrating that small language models (0.6B-14B parameters) can match or exceed large language models as judges for evaluating AI outputs. The study reveals that model size alone doesn't determine judging capability, with performance varying significantly by task domain and judgment type, challenging assumptions about requiring expensive proprietary LLMs for automated evaluation.

AINeutralarXiv – CS AI · May 286/10

🧠

Integrated and Cross-Architecture Interpretation of LLM Reasoning

Researchers present the Integrated cross-Architecture Reasoning (IAR) framework, a novel methodology for interpreting how large language models perform reasoning tasks by combining multiple analytical probes—bandwidth-calibrated Mutual Information Peak, Deep-Thinking Ratio analysis, and Jaccard stability metrics—across model layers and architectures. Testing on Qwen and Llama models across mathematics, code, logic, and common sense domains demonstrates that this multi-metric approach provides more reliable insights into LLM reasoning patterns than single-probe methods.

🧠 Llama

AINeutralarXiv – CS AI · May 286/10

🧠

RL Squeezes, SFT Expands: A Comparative Study of Reasoning LLMs

Researchers present a novel framework analyzing how reinforcement learning (RL) and supervised fine-tuning (SFT) differently shape reasoning in large language models. The study reveals that RL compresses incorrect reasoning paths while SFT expands correct ones, explaining why the two-stage training approach produces superior reasoning capabilities across models of 1.5B to 14B parameters.