y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#model-scaling News & Analysis

32 articles tagged with #model-scaling. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

32 articles
AIBullisharXiv – CS AI · May 126/10
🧠

LEVI: Stronger Search Architectures Can Substitute for Larger LLMs in Evolutionary Search

Researchers introduce LEVI, an open-source evolutionary search framework that achieves superior results on AI research benchmarks while reducing computational costs by 3.3x to 35x compared to existing methods. By optimizing search architecture rather than relying on larger language models, LEVI demonstrates that algorithmic efficiency can significantly reduce the expense of LLM-guided evolutionary discovery.

AINeutralarXiv – CS AI · May 116/10
🧠

When Does Critique Improve AI-Assisted Theoretical Physics? SCALAR: Structured Critic--Actor Loop for Agentic Reasoning

Researchers introduce SCALAR, an Actor-Critic-Judge framework that systematically evaluates how AI agents improve through human feedback on theoretical physics problems. The study reveals that multi-turn dialogue consistently outperforms single attempts, but the effectiveness of different feedback strategies depends heavily on the specific pairing of AI models used, with asymmetric model pairs benefiting most from structured critique.

AINeutralarXiv – CS AI · May 96/10
🧠

Strat-LLM: Stratified Strategy Alignment for LLM-based Stock Trading with Real-time Multi-Source Signals

Researchers introduce Strat-LLM, a framework that aligns large language models for stock trading by matching model architecture to operational modes (Free, Guided, Strict), finding that reasoning-heavy models excel with minimal constraints while standard models benefit from strict guardrails. Live-forward testing across 2025 on A-share and U.S. markets reveals that optimal performance depends on market regime and model scale, with mid-size models (35B) showing superior risk-adjusted returns under constraints.

AINeutralarXiv – CS AI · May 76/10
🧠

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Researchers introduce CreativityBench, a benchmark with 4K entities and 150K+ affordance annotations to evaluate how well large language models can creatively repurpose tools by reasoning about their properties rather than canonical uses. Evaluations across 10 state-of-the-art LLMs reveal significant limitations: models struggle to identify correct parts, affordances, and physical mechanisms needed for non-obvious solutions, with performance gains from scaling and reasoning strategies like Chain-of-Thought proving limited.

AINeutralarXiv – CS AI · Apr 206/10
🧠

LLM attribution analysis across different fine-tuning strategies and model scales for automated code compliance

Researchers conducted a comparative study of how large language models trained with different fine-tuning methods (full fine-tuning, LoRA, and quantized LoRA) interpret code compliance tasks. The study reveals that full fine-tuning produces more focused attribution patterns than parameter-efficient methods, and larger models develop distinct interpretive strategies despite performance gains plateauing above 7B parameters.

AIBullishMarkTechPost · Mar 167/10
🧠

Moonshot AI Releases 𝑨𝒕𝒕𝒆𝒏𝒕𝒊𝒐𝒏 𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔 to Replace Fixed Residual Mixing with Depth-Wise Attention for Better Scaling in Transformers

Moonshot AI has released Attention Residuals, a new approach that replaces traditional fixed residual connections in Transformer architectures with depth-wise attention mechanisms. The innovation addresses structural problems in PreNorm architectures where all prior layer outputs are mixed equally, potentially improving model scaling capabilities.

Moonshot AI Releases 𝑨𝒕𝒕𝒆𝒏𝒕𝒊𝒐𝒏 𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔 to Replace Fixed Residual Mixing with Depth-Wise Attention for Better Scaling in Transformers
AINeutralarXiv – CS AI · Mar 36/104
🧠

How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks

Researchers evaluated compact AI language models for 6G networks, finding that mid-scale models (1.5-3B parameters) offer the best balance of performance and computational efficiency for edge deployment. The study shows diminishing returns beyond 3B parameters, with accuracy improving from 22% at 135M to 70% at 7B parameters.

← PrevPage 2 of 2