#geometric-reasoning News & Analysis

9 articles tagged with #geometric-reasoning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

9 articles

AIBullisharXiv – CS AI · Jun 117/10

🧠

The Art of Interrogation: Consistency Amplifies Factuality in Spatial Reasoning

Researchers propose a self-supervised reinforcement learning framework that improves large language models' spatial reasoning capabilities through consistency verification rather than labeled data. The approach, which uses geometric and semantic consistency checks across image and text transformations, achieves performance comparable to supervised fine-tuning without ground-truth annotations.

AIBullisharXiv – CS AI · Mar 57/10

🧠

TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics

Researchers have developed TIGeR, a framework that enhances Vision-Language Models with precise geometric reasoning capabilities for robotics applications. The system enables VLMs to execute centimeter-level accurate computations by integrating external computational tools, moving beyond qualitative spatial reasoning to quantitative precision required for real-world robotic manipulation.

AIBullisharXiv – CS AI · Mar 37/105

🧠

Vid-LLM: A Compact Video-based 3D Multimodal LLM with Reconstruction-Reasoning Synergy

Researchers propose Vid-LLM, a new video-based 3D multimodal large language model that processes video inputs without requiring external 3D data for scene understanding. The model uses a Cross-Task Adapter module and Metric Depth Model to integrate geometric cues and maintain consistency across 3D tasks like question answering and visual grounding.

AINeutralarXiv – CS AI · Jun 106/10

🧠

What Spatial Memory Must Store: Occlusion as the Test for Language-Agent Memory

Researchers demonstrate that spatial memory systems for language agents must fundamentally separate memory recall from visibility computation, using occlusion testing as a validation method. The study shows that geometry-based weighting outperforms traditional blending approaches, and introduces a ray-casting technique to properly handle occluded spatial information.

AIBullisharXiv – CS AI · Jun 16/10

🧠

Symbolic Intermediaries as a Linguistic-Numerical Interface for LLM-Driven Geometric Reasoning

Researchers propose symbolic intermediaries—compact mathematical expressions derived from symbolic regression—to bridge the gap between Large Language Models and physics simulators by converting continuous numerical outputs into interpretable symbolic forms. LLM-based agents using this interface outperformed genetic algorithms by 19-53% on mechanism synthesis tasks, demonstrating that translating simulator behavior into symbolic language enables grounded geometric reasoning without model retraining.

AIBullisharXiv – CS AI · May 296/10

🧠

Hilbert-Geo: Solving Solid Geometric Problems by Neural-Symbolic Reasoning

Researchers introduce Hilbert-Geo, a neural-symbolic AI framework for solving solid geometry problems by combining formal language representation with theorem-based reasoning. The system achieves 77.3% accuracy on solid geometry tasks, significantly outperforming leading AI models like GPT-4 and Gemini-2.5-pro, demonstrating advances in multimodal geometric reasoning.

🧠 GPT-5🧠 Gemini

AINeutralarXiv – CS AI · Mar 266/10

🧠

GeoSketch: A Neural-Symbolic Approach to Geometric Multimodal Reasoning with Auxiliary Line Construction and Affine Transformation

Researchers introduce GeoSketch, a neural-symbolic AI framework that solves geometric problems through dynamic visual manipulation, including drawing auxiliary lines and applying transformations. The system combines perception, symbolic reasoning, and interactive sketch actions, achieving superior performance on geometric problem-solving benchmarks compared to static image processing methods.

AIBullisharXiv – CS AI · Mar 96/10

🧠

Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views

Researchers introduce 3DThinker, a new framework that enables vision-language models to perform 3D spatial reasoning from limited 2D views without requiring 3D training data. The system uses a two-stage training approach to align 3D representations with foundation models and demonstrates superior performance across multiple benchmarks.

AIBullisharXiv – CS AI · Mar 27/1015

🧠

PointCoT: A Multi-modal Benchmark for Explicit 3D Geometric Reasoning

Researchers introduce PointCoT, a new AI framework that enables multimodal large language models to perform explicit geometric reasoning on 3D point cloud data using Chain-of-Thought methodology. The framework addresses current limitations where AI models suffer from geometric hallucinations by implementing a 'Look, Think, then Answer' paradigm with 86k instruction-tuning samples.