AIBullisharXiv – CS AI · Jun 57/10
🧠Researchers introduce GeoVR, a framework that enhances multimodal large language models with 3D spatial awareness by learning geometric representations from 2D video sequences. Using four complementary geometric targets including camera pose estimation, depth mapping, and 3D feature distillation, the approach achieves state-of-the-art performance on spatial reasoning benchmarks without requiring large-scale 3D training data.
AIBullisharXiv – CS AI · Jun 47/10
🧠Researchers introduce Platonic Transformers, a novel architecture that adds geometric symmetry constraints to standard Transformers without sacrificing computational efficiency. By leveraging symmetry groups from Platonic solids as reference frames for attention mechanisms, the model achieves equivariance to translations and discrete symmetries while maintaining Transformer performance across vision, 3D point clouds, and molecular prediction tasks.
AIBullisharXiv – CS AI · Mar 167/10
🧠Researchers introduce a novel optimization framework that integrates the Minimum Description Length (MDL) principle directly into deep neural network training dynamics. The method uses geometrically-grounded cognitive manifolds with coupled Ricci flow to create autonomous model simplification while maintaining data fidelity, with theoretical guarantees for convergence and practical O(N log N) complexity.
AINeutralarXiv – CS AI · 12h ago6/10
🧠ParaScale introduces a geometric solution to camera motion transfer in video generation by identifying and preserving the Parallax Number (Pi), a scale-invariant metric that quantifies perceived camera movement independent of scene depth. The method enables creators to transfer cinematic camera movements between videos at vastly different scales without requiring retraining, improving transfer fidelity by over 3x compared to uncalibrated approaches.
AINeutralarXiv – CS AI · Jun 96/10
🧠Researchers propose a geometric framework for machine intelligence where cognitive computation emerges from Riemannian gradient flow on learned latent manifolds, eliminating the need for explicit memory modules. The approach demonstrates superior robustness across reinforcement learning tasks involving partial observability, sensory disruptions, and long-horizon prediction compared to feedforward baselines.
AINeutralarXiv – CS AI · Jun 86/10
🧠Researchers propose a Second-Order Correlation (SOC) layer that improves speech emotion recognition by modeling feature correlations as covariance descriptors rather than treating features independently. Using Log-Euclidean mapping to preserve geometric properties, the method demonstrates superior performance on standard emotion recognition datasets compared to conventional first-order aggregation approaches.
AINeutralarXiv – CS AI · Jun 26/10
🧠Researchers introduce E2M (End-to-End Metric regression), a deep learning framework that predicts non-Euclidean outputs like probability distributions and networks by computing weighted Fréchet means with neural network-learned weights. The method preserves geometric properties of output spaces while achieving state-of-the-art performance across multiple domains without requiring surrogate embeddings.
AINeutralarXiv – CS AI · May 286/10
🧠A reproducibility study of the TRIANGLE framework reveals that geometric alignment on hyperspheres improves multimodal retrieval beyond traditional pairwise approaches, achieving up to 8.7 point gains in zero-shot settings. However, researchers identified critical optimization instabilities when jointly training with data-text matching loss and reduced cross-dataset generalization with fine-tuning, suggesting the method's benefits are context-dependent rather than universally applicable.
AINeutralarXiv – CS AI · May 286/10
🧠Researchers demonstrate that the Muon optimizer significantly outperforms Adam when training equivariant neural networks, which encode geometric symmetries by design. Analysis of trained models reveals Muon produces solutions with more regular loss surfaces, higher weight ranks, and better-conditioned representations, suggesting optimizer choice substantially influences how neural networks learn geometric constraints.
AINeutralarXiv – CS AI · May 96/10
🧠Researchers introduce HilbNets, a novel deep learning framework that handles infinite-dimensional signals (like time series and probability distributions) on irregular domains using Hilbert bundles and cellular sheaves. The work provides theoretical convergence guarantees and demonstrates that discretized networks maintain consistency across different data sampling schemes, advancing geometric deep learning theory.
AIBullisharXiv – CS AI · Feb 275/107
🧠Researchers have developed RepSPD, a novel geometric deep learning model that enhances EEG brain activity decoding using symmetric positive definite manifolds and dynamic graphs. The framework introduces cross-attention mechanisms on Riemannian manifolds and bidirectional alignment strategies to improve brain signal representation and analysis.