AIBearisharXiv – CS AI · Apr 137/10
🧠Researchers propose the Spectral Sensitivity Theorem to explain hallucinations in large ASR models like Whisper, identifying a phase transition between dispersive and attractor regimes. Analysis of model eigenspectra reveals that intermediate models experience structural breakdown while large models compress information, decoupling from acoustic evidence and increasing hallucination risk.
AIBullisharXiv – CS AI · Apr 107/10
🧠SpecQuant introduces a novel quantization framework using spectral decomposition to compress large language models to 4-bit precision for both weights and activations, achieving only 1.5% accuracy loss on LLaMA-3 8B while enabling 2x faster inference and 3x memory reduction. The technique exploits frequency domain properties to preserve essential signal components while suppressing high-frequency noise, addressing a critical challenge in deploying LLMs on edge devices.
AIBullisharXiv – CS AI · Mar 167/10
🧠Researchers propose a new family of learnable Koopman operators that combine linear dynamical systems theory with deep learning for time series forecasting. The approach integrates with existing transformer architectures like Patchtst and Autoformer, offering improved stability and interpretability in predictive models.
AINeutralarXiv – CS AI · Mar 46/102
🧠Researchers identify the 'Malignant Tail' phenomenon where over-parameterized neural networks segregate signal from noise during training, leading to harmful overfitting. They demonstrate that Stochastic Gradient Descent pushes label noise into high-frequency orthogonal subspaces while preserving semantic features in low-rank subspaces, and propose Explicit Spectral Truncation as a post-hoc solution to recover optimal generalization.
AIBullisharXiv – CS AI · Feb 277/107
🧠Researchers have developed a unified framework using Spectral Geometry and Random Matrix Theory to address reliability and efficiency challenges in large language models. The study introduces EigenTrack for real-time hallucination detection and RMT-KD for model compression while maintaining accuracy.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers introduce a novel predictability-aligned evaluation framework for time series forecasting that separates model performance from data's inherent unpredictability. The framework reveals that complex AI models excel with difficult-to-predict data while linear models perform comparably on more predictable tasks, suggesting current benchmark rankings conflate model capability with task difficulty.
AINeutralarXiv – CS AI · 5d ago6/10
🧠Researchers identify a fundamental weakness in EEG foundation models: reconstruction-based pretraining causes these models to heavily bias toward aperiodic signal components while neglecting high-frequency oscillatory patterns critical for brain-computer interfaces. This spectral mismatch explains why large pretrained models underperform smaller supervised alternatives in low-resource settings.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers empirically validate theoretical predictions about feature repulsion in neural network grokking, discovering that while the mathematical sign structure holds consistently across activation functions, the spectral signature of this mechanism in weight updates depends critically on activation type—appearing sharply in quadratic activations but remaining invisible in ReLU networks.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers introduce a spectral-injection diagnostic method to measure which angular frequencies equivariant neural force fields can preserve, revealing sharp performance cliffs at theoretical capacity boundaries. Testing on aspirin with NequIP backbones shows a dramatic 11.7x performance drop at the predicted boundary, validated across multiple architectures and calibrated through polynomial span theorems.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers introduce a spectral diagnostic method to detect hidden coalitions in multi-agent AI systems by analyzing mutual information patterns in internal neural representations rather than observable behavior. The technique successfully identifies hierarchical and dynamic coalition structures in reinforcement learning and language models, providing a scalable tool for monitoring emergent organization in distributed AI systems.
AINeutralarXiv – CS AI · Mar 126/10
🧠Researchers propose TASER, a new defense framework against backdoor attacks in UAV-based decentralized federated learning systems. The system uses spectral energy analysis rather than traditional outlier detection, achieving below 20% attack success rates while maintaining accuracy within 5% loss.
AIBullisharXiv – CS AI · Mar 96/10
🧠Researchers developed E-AdaPrune, an energy-driven adaptive pruning framework that optimizes Vision-Language Models by dynamically allocating visual tokens based on image information density. The method shows up to 0.6% average improvement across benchmarks, with a notable 5.1% boost on reasoning tasks, while adding only 8ms latency per image.
AINeutralarXiv – CS AI · Mar 37/106
🧠Researchers introduce StaTS, a new diffusion model for time series forecasting that learns adaptive noise schedules and uses frequency-guided denoising. The model addresses limitations of fixed noise schedules in existing diffusion models by incorporating spectral regularization and data-adaptive scheduling for improved structural preservation.
$NEAR
AIBullisharXiv – CS AI · Mar 36/104
🧠Researchers propose Class-Aware Spectral Distribution Matching (CSDM), a new dataset distillation method that addresses performance issues on imbalanced datasets. The technique achieves 14% improvement over existing methods on CIFAR-10-LT with enhanced stability on long-tailed data distributions.