#scaling-efficiency News & Analysis

2 articles tagged with #scaling-efficiency. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bullish · arXiv – CS AI · Jun 8 · 6/10

Dual Latent Memory for Visual Multi-agent System

Researchers propose L²-VMAS, a framework addressing the 'scaling wall' problem in Visual Multi-Agent Systems where adding more agents degrades performance despite higher computational costs. The solution uses dual latent memory and entropy-driven triggering to improve accuracy by 2.7-5.4% while reducing token usage by 21.3-44.8%.

AI · Bullish · arXiv – CS AI · May 27 · 6/10

More Expressive Feedforward Layers: Part I. Token-Adaptive Mixing of Activations

Researchers propose Mixture of Activations (MoA), a novel feedforward network design that dynamically selects activation functions per token rather than applying a single fixed function across all inputs. Theoretical analysis proves MoA offers strict expressivity advantages over fixed-activation networks, while empirical testing on language models up to 2B parameters demonstrates consistent improvements in loss metrics with minimal computational overhead.
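
To make the idea of per-token activation selection concrete, here is a minimal sketch. It is an illustration only, not the paper's formulation: the softmax gate, the candidate set (ReLU, GELU, SiLU), and the names `moa_ffn`, `w_in`, `w_gate`, and `w_out` are all assumptions, since the summary does not describe how the selection is computed.

```python
import math

# Candidate activations. This set is an assumption; the summary does not list them.
def relu(x):
    return max(0.0, x)

def gelu(x):
    # tanh approximation of GELU
    return 0.5 * x * (1.0 + math.tanh(math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)))

def silu(x):
    return x / (1.0 + math.exp(-x))

ACTIVATIONS = [relu, gelu, silu]

def softmax(scores):
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def matvec(matrix, vec):
    return [sum(w * v for w, v in zip(row, vec)) for row in matrix]

def moa_ffn(token, w_in, w_gate, w_out):
    """Hypothetical feedforward block whose activation is a per-token mixture.

    w_gate has one row per candidate activation, so the gate depends on the
    token itself (assumed gating scheme, not taken from the paper).
    """
    hidden = matvec(w_in, token)
    weights = softmax(matvec(w_gate, token))  # one mixing weight per activation
    mixed = [
        sum(w * act(h) for w, act in zip(weights, ACTIVATIONS))
        for h in hidden
    ]
    return matvec(w_out, mixed), weights
```

With an all-zero `w_gate`, every token gets a uniform blend of the three activations; a gate that strongly favors one row reduces the block to an ordinary fixed-activation feedforward layer, which is the baseline the summary contrasts MoA against.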