AINeutralarXiv – CS AI · 6h ago6/10
🧠
Feature Starvation as Geometric Instability in Sparse Autoencoders
Researchers propose Adaptive Elastic Net Sparse Autoencoders (AEN-SAEs) to solve feature starvation in neural network interpretability tools. The method combines L2 and adaptive L1 regularization to create a mathematically stable sparse coding system that improves feature extraction in large language models without requiring complex workarounds.
🧠 Llama