AINeutralarXiv – CS AI · 6h ago7/10
🧠
Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages
Researchers introduce Multi-LCB, an extension of the LiveCodeBench evaluation framework that tests large language models across twelve programming languages instead of just Python. The benchmark reveals significant performance disparities across languages and evidence of Python overfitting in current LLMs, establishing a more rigorous standard for assessing real-world multilingual code generation capabilities.