AIBullisharXiv โ CS AI ยท 7h ago6/10
๐ง
VLegal-Bench: Cognitively Grounded Benchmark for Vietnamese Legal Reasoning of Large Language Models
Researchers have introduced VLegal-Bench, the first comprehensive benchmark for evaluating large language models on Vietnamese legal tasks, comprising 10,450 expert-annotated samples grounded in real legal documents. The benchmark uses Bloom's cognitive taxonomy to assess LLM performance across practical legal scenarios, establishing a standardized framework for developing more reliable AI-assisted legal systems in Vietnam.