AINeutralarXiv – CS AI · 10h ago6/10
🧠
mHC-SSM: Manifold-Constrained Hyper-Connections for State Space Language Models with Stream-Specialized Adapters
Researchers introduce mHC-SSM, a novel architecture combining Manifold-Constrained Hyper-Connections with state space language models using stream-specialized adapters. The approach achieves significant perplexity improvements (572.91 to 461.88) on WikiText-2 benchmarks with predictable efficiency tradeoffs in throughput and memory usage.
🏢 Meta🏢 Perplexity