#formal-mathematics News & Analysis

4 articles tagged with #formal-mathematics. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

4 articles

AIBullisharXiv – CS AI · Jun 257/10

🧠

TheoremGraph: Bridging Formal and Informal Mathematics

Researchers introduce TheoremGraph, a unified dependency graph linking 11.7M informal mathematical statements from arXiv with 388,105 formal Lean 4 declarations through semantic embeddings. The infrastructure bridges the historically fragmented landscape of mathematical knowledge representation, enabling improved discovery and reasoning across both informal academic papers and formally verified mathematics.

🏢 Hugging Face

AIBullisharXiv – CS AI · Jun 27/10

🧠

Expected Value Alignment for Generative Reward Modeling in Formal Mathematics Verification

Researchers introduce Expected Value Alignment (EVA), a novel reward-modeling technique that enables Large Language Models to provide continuous numerical scores while maintaining human-readable text output for formal mathematics verification in Lean 4. The method bridges a critical gap between discrete generative outputs and continuous value assessment needed for reinforcement learning in theorem proving systems.

AINeutralarXiv – CS AI · Jun 96/10

🧠

TheoremBench: Evaluating LLMs on Theorem Proving in Formal Mathematics

Researchers introduce TheoremBench, a comprehensive Lean4 benchmark for evaluating large language models on formal mathematics theorem proving. Unlike existing competition-focused benchmarks, TheoremBench assesses how LLMs handle longer, dependency-rich mathematical proofs through both standalone theorems and structured families of related subtasks, revealing that current models remain inefficient and biased toward simpler problems.

AINeutralarXiv – CS AI · Jun 26/10

🧠

Formally Solving Answer-Construction Problems in Lean

Researchers introduce Enumerate-Conjecture-Prove (ECP), a neuro-symbolic framework that combines general LLMs and prover LLMs to formally solve mathematical answer-construction problems in Lean. The approach addresses a critical gap where current AI systems struggle with generating both candidate answers and rigorous formal proofs, achieving higher success rates than baseline LLM approaches on competition mathematics benchmarks.