AI · Neutral · arXiv – CS AI · 2d ago · 7/10
Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability
Researchers introduce TRACED, a framework that evaluates LLM reasoning quality through geometric analysis of reasoning trajectories rather than traditional scalar probabilities. The framework characterizes correct reasoning as high-progress, stable trajectories, while hallucinations show up as low-progress, unstable patterns with large curvature fluctuations.
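
To make "progress" and "curvature fluctuation" concrete, here is a minimal sketch in plain Python. It is not the TRACED implementation: the summary does not give the paper's definitions, so the metrics below are illustrative assumptions. A trajectory is taken to be a list of points (for example, per-step hidden-state vectors), progress is assumed to be net displacement divided by total path length, and curvature fluctuation is assumed to be the standard deviation of the turning angles between consecutive steps. All function names are hypothetical.

```python
import math


def _sub(a, b):
    """Component-wise difference a - b of two equal-length vectors."""
    return [x - y for x, y in zip(a, b)]


def _norm(v):
    """Euclidean length of a vector."""
    return math.sqrt(sum(x * x for x in v))


def progress(traj):
    """Assumed metric: net displacement / total path length.

    1.0 means a perfectly straight path; values near 0 mean the
    trajectory wanders without getting anywhere.
    """
    steps = [_sub(b, a) for a, b in zip(traj, traj[1:])]
    path_len = sum(_norm(s) for s in steps)
    if path_len == 0:
        return 0.0
    return _norm(_sub(traj[-1], traj[0])) / path_len


def turning_angles(traj):
    """Angles (radians) between consecutive steps, skipping zero-length steps."""
    steps = [_sub(b, a) for a, b in zip(traj, traj[1:])]
    angles = []
    for u, v in zip(steps, steps[1:]):
        nu, nv = _norm(u), _norm(v)
        if nu == 0 or nv == 0:
            continue
        cos = sum(x * y for x, y in zip(u, v)) / (nu * nv)
        # Clamp to [-1, 1] to guard against floating-point drift.
        angles.append(math.acos(max(-1.0, min(1.0, cos))))
    return angles


def curvature_fluctuation(traj):
    """Assumed metric: population standard deviation of the turning angles."""
    angles = turning_angles(traj)
    if len(angles) < 2:
        return 0.0
    mean = sum(angles) / len(angles)
    return math.sqrt(sum((a - mean) ** 2 for a in angles) / len(angles))


# Toy 2-D trajectories (made-up data, for illustration only).
steady = [[0, 0], [1, 0], [2, 0], [3, 0]]           # straight, stable
erratic = [[0, 0], [1, 0], [0, 0], [0, 1], [0, 0]]  # back-and-forth

print(progress(steady), curvature_fluctuation(steady))
print(progress(erratic), curvature_fluctuation(erratic))
```

Under these assumed definitions, the straight toy trajectory scores maximal progress with no curvature fluctuation, while the back-and-forth one makes no net progress and has varying turning angles. That matches the qualitative contrast the summary draws between correct reasoning and hallucination.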