AINeutralarXiv โ CS AI ยท 5h ago6/10
๐ง
Evaluating Legal Reasoning Traces with Legal Issue Tree Rubrics
Researchers introduce LEGIT, a 24K-instance legal reasoning dataset with hierarchical argument trees that serve as evaluation rubrics for LLM-generated legal reasoning. The study reveals that LLM legal reasoning performance depends critically on both issue coverage and correctness, with RAG and reinforcement learning offering complementary improvements.