AINeutralarXiv – CS AI · 14h ago6/10
🧠
Multi-Legal-Bench: Evaluating LLMs on Legal Reasoning Across Jurisdictions, Languages, and Legal Traditions
Researchers introduce Multi-Legal-Bench, a cross-jurisdictional benchmark evaluating large language models on legal reasoning tasks across six European countries, four language families, and 134 million court decisions. The study reveals that few-shot transfer effectiveness depends on label-set alignment rather than linguistic proximity, and that model architecture matters more than tokenizer efficiency for cross-lingual legal NLP performance.