AIBullisharXiv – CS AI · Jun 27/10
🧠BitsMoE introduces a spectral-energy-guided quantization framework for compressing Mixture-of-Experts large language models, achieving significant improvements in the ultra-low-bit regime. The method uses SVD decomposition to intelligently allocate bits across expert weights, delivering 27.83 percentage point accuracy improvements over existing approaches at 2-bit quantization while accelerating inference speed by 1.76× on Qwen models.
AINeutralarXiv – CS AI · Jun 96/10
🧠Researchers propose a spectral graph neural network combined with reinforcement learning to optimize power grid recovery during outages, enabling real-time decision-making for network reconfiguration. The approach demonstrates near-optimal performance across IEEE test systems while generalizing effectively to diverse outage scenarios, addressing computational inefficiencies in traditional machine learning methods for smart grid management.
AINeutralarXiv – CS AI · Jun 46/10
🧠Researchers propose Low-Rank Decay (LRD), a spectral regularization technique that improves generalization in scale-invariant Transformer architectures by compressing weight singular values after memorization. Unlike standard L2 decay, LRD remains effective in normalized models and accelerates grokking—the delayed generalization phenomenon—on algorithmic tasks.
$UV
AINeutralarXiv – CS AI · Jun 26/10
🧠Researchers propose Bayesian Spectral Emotion Transition Discovery (BSETD), a framework that analyzes emotion dynamics in conversations by preserving multi-annotator disagreement rather than collapsing it into single labels. The method successfully identifies distinct emotion transition patterns across psychological theories and demonstrates strong cross-corpus validation, bridging computational linguistics with established emotion science.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers introduce CDLinear, a neural network layer based on the Communication Dynamics framework that achieves 3.8× parameter reduction compared to dense layers while maintaining comparable accuracy. The layer uses block-circulant matrices with FFT-diagonalization to dramatically improve Hessian conditioning, reducing the condition number by 310× in empirical tests.
$MATIC
AINeutralarXiv – CS AI · May 126/10
🧠Researchers introduce intrinsic Muon (iMuon), a unified optimization framework that extends the Muon optimizer to Riemannian manifolds while preserving symmetries and enabling closed-form solutions. The approach demonstrates applications in LLM fine-tuning, image classification, and subspace learning with convergence guarantees dependent only on manifold dimension rather than factor conditioning.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers propose Spectral Transformer Neural Processes (STNPs), an enhanced machine learning architecture that improves how neural networks handle periodic and quasi-periodic data by incorporating frequency-domain analysis. The method addresses a key limitation of existing Neural Processes by embedding spectral information directly into transformer models, enabling better generalization beyond training data.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers introduce a spectral filtering method for learning complex-valued linear dynamical systems with sector-bounded spectrum, achieving dimension-free regret bounds for sequence prediction. The approach uses Slepian basis functions and demonstrates that learning efficiency depends on an effective dimension independent of state space size, with applications to signal processing and quantum systems.