AI · Neutral · arXiv – CS AI · Jun 2 · 7/10
🧠Researchers introduce MENTIS, a framework for measuring internal geometric changes in language models during preference alignment training. The study reveals that alignment leaves selective, depth-localized signatures in model computations, with normative concepts showing larger internal reorganization than factual concepts across multiple model architectures.
AI · Bullish · arXiv – CS AI · May 28 · 7/10
🧠Researchers introduce a topological data analysis framework to evaluate reasoning quality in large language models, moving beyond traditional graph-based metrics. The study demonstrates that higher-dimensional geometric structures predict reasoning quality more effectively than standard connectivity measures, offering a practical signal for training optimization.
AI · Bullish · arXiv – CS AI · May 27 · 7/10
🧠Researchers introduce GeoFaith, a framework for detecting and improving faithfulness in chain-of-thought reasoning by LLMs, addressing the problem of plausible-sounding but inaccurate explanations. The method combines geometric latent structures with entropy analysis and includes a reinforcement learning approach that achieves superior performance on faithfulness detection while maintaining accuracy.
🧠 GPT-5
AI · Neutral · arXiv – CS AI · May 9 · 7/10
🧠Researchers have identified a geometric framework explaining how language models fail through two distinct mechanisms: parametric memory conflicting with working memory, and hallucination from absent learned facts. Both failures produce confident outputs despite being mechanistically different, but hidden-state geometry and 'geometric margin' metrics can distinguish them more reliably than traditional entropy-based detection methods.
AI · Neutral · arXiv – CS AI · Apr 6 · 7/10
🧠Researchers analyzed the geometric structure of layer updates in deep language models, finding they decompose into a dominant tokenwise component and a geometrically distinct residual. The study shows that while most updates behave like structured reparameterizations, functionally significant computation occurs in the residual component.
AI · Neutral · arXiv – CS AI · Mar 17 · 7/10
🧠Researchers studied multi-task grokking in Transformers, revealing five key phenomena including staggered generalization order and weight decay phase structures. The study shows how AI models construct compact superposition subspaces in parameter space, with weight decay acting as compression pressure.
AI · Neutral · arXiv – CS AI · Mar 12 · 7/10
🧠Researchers introduce TRACED, a framework that evaluates AI reasoning quality through geometric analysis rather than traditional scalar probabilities. The system identifies correct reasoning as high-progress stable trajectories, while AI hallucinations show low-progress unstable patterns with high curvature fluctuations.
AI · Neutral · arXiv – CS AI · Mar 11 · 7/10
🧠Researchers propose 'Curveball steering', a nonlinear method for controlling large language model behavior that outperforms traditional linear approaches. The study challenges the Linear Representation Hypothesis by showing that LLM activation spaces have substantial geometric distortions that require geometry-aware interventions.
AI · Neutral · arXiv – CS AI · Mar 5 · 7/10
🧠Researchers explain why Graph Neural Networks (GNNs) struggle with hard Boolean satisfiability (SAT) problems through geometric analysis using graph Ricci curvature. They prove that harder SAT instances have more negative curvature, creating connectivity bottlenecks that prevent GNNs from effectively processing long-range dependencies.
AI · Neutral · arXiv – CS AI · 5d ago · 6/10
🧠Researchers demonstrate that Large Language Models encode truth as geometric vectors in their activation space, and these vectors undergo predictable transformations when contextual information is introduced. The study reveals that larger models rely on directional changes to distinguish relevant context while smaller models use magnitude shifts, with conflicting context producing larger geometric shifts than aligned context.
AI · Neutral · arXiv – CS AI · 6d ago · 6/10
🧠Researchers present a geometric framework for understanding activation steering in language models by decomposing interventions into angular and radial components. The study finds that while concepts are primarily encoded in angular structure, the hidden-state norm remains important for steering stability and effectiveness, suggesting that steering methods should be parameterized separately for these two geometric effects rather than as a single additive coefficient.
AI · Neutral · arXiv – CS AI · Jun 4 · 6/10
🧠Researchers characterize the geometric structure of loss landscape plateaus in two-layer neural networks, focusing on how duplicating hidden neurons creates affine sets of stationary points. The study classifies whether these plateau points are local minima or saddles based on an 'inner Hessian' matrix, revealing that splitting a minimum can produce mixed or all-saddle plateaus, while splitting saddles always yields saddle plateaus.
AI · Neutral · arXiv – CS AI · May 29 · 6/10
🧠Researchers propose a novel framework for understanding equilibrium computation in games by mapping the geometric structure of game spaces to solver effectiveness. Rather than studying algorithms in isolation, they develop a learned representation that identifies which solver mechanisms work best across different game regimes, revealing continuous regions of algorithmic validity and suggesting that solvability is governed by underlying structural properties.
AI · Neutral · arXiv – CS AI · Apr 15 · 6/10
🧠Researchers demonstrate that large language models develop attractor-like geometric patterns in their activation space when processing identity documents describing persistent agents. Experiments on Llama 3.1 and Gemma 2 show paraphrased identity descriptions cluster significantly tighter than structural controls, suggesting LLMs encode semantic agent identity as stable attractors independent of linguistic variation.
🧠 Llama
AI · Neutral · arXiv – CS AI · Apr 14 · 6/10
🧠Researchers investigate how large language models represent emotions in their latent spaces, discovering that LLMs develop coherent emotional representations aligned with established psychological models of valence and arousal. The findings support the linear representation hypothesis used in AI transparency methods and demonstrate practical applications for uncertainty quantification in emotion processing tasks.
AI · Neutral · arXiv – CS AI · Mar 16 · 4/10
🧠Researchers developed a framework to improve video-language models' understanding of camera motion through geometric analysis. The study introduces the CameraMotionDataset and the CameraMotionVQA benchmark, revealing that current VideoLLMs struggle with camera motion recognition and proposing a lightweight solution using 3D foundation models.
AI · Neutral · arXiv – CS AI · Mar 5 · 4/10
🧠Researchers propose directional CDNV (decision-axis variance) as a key geometric quantity explaining why self-supervised learning representations transfer well with few labels. The study shows that small variability along class-separating directions enables strong few-shot transfer and low interference across multiple tasks.