AINeutralarXiv โ CS AI ยท 14h ago7/10
๐ง
LiveCLKTBench: Towards Reliable Evaluation of Cross-Lingual Knowledge Transfer in Multilingual LLMs
Researchers introduce LiveCLKTBench, an automated benchmark for evaluating how well multilingual large language models transfer knowledge across languages, addressing the challenge of distinguishing genuine cross-lingual transfer from pre-training artifacts. Testing across five languages reveals that transfer effectiveness depends heavily on linguistic distance, model scale, and domain, with improvements plateauing in larger models.