#cross-lingual-transfer News & Analysis

8 articles tagged with #cross-lingual-transfer. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

8 articles

AINeutralarXiv – CS AI · Apr 147/10

🧠

LiveCLKTBench: Towards Reliable Evaluation of Cross-Lingual Knowledge Transfer in Multilingual LLMs

Researchers introduce LiveCLKTBench, an automated benchmark for evaluating how well multilingual large language models transfer knowledge across languages, addressing the challenge of distinguishing genuine cross-lingual transfer from pre-training artifacts. Testing across five languages reveals that transfer effectiveness depends heavily on linguistic distance, model scale, and domain, with improvements plateauing in larger models.

AINeutralarXiv – CS AI · May 296/10

🧠

Beyond Bilingual Transfer: Multilingual Code-Switching in Instruction Tuning

Researchers demonstrate that multilingual code-switching—mixing multiple languages within training data—improves large language model performance across four languages (English, Japanese, Korean, Chinese) simultaneously, extending previous bilingual findings to truly multilingual settings and showing consistent performance gains on cross-lingual benchmarks.

AINeutralarXiv – CS AI · May 296/10

🧠

Source-Grounded Semantic Reinforcement Learning for Low-Resource Target-Language Generation

Researchers introduce Source-Grounded Semantic Reinforcement Learning (SG-SRL), a framework that leverages abundant source-language monolingual data to improve low-resource target-language generation through cross-lingual semantic rewards. The approach demonstrates significant gains in semantic grounding and factual coverage while maintaining fluency through a lightweight recovery stage.

AINeutralarXiv – CS AI · May 296/10

🧠

Multi-Legal-Bench: Evaluating LLMs on Legal Reasoning Across Jurisdictions, Languages, and Legal Traditions

Researchers introduce Multi-Legal-Bench, a cross-jurisdictional benchmark evaluating large language models on legal reasoning tasks across six European countries, four language families, and 134 million court decisions. The study reveals that few-shot transfer effectiveness depends on label-set alignment rather than linguistic proximity, and that model architecture matters more than tokenizer efficiency for cross-lingual legal NLP performance.

AIBullisharXiv – CS AI · May 276/10

🧠

CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations

Researchers demonstrate that cross-lingual contrastive preference tuning (CroCo) enables large language models to improve performance across 14 languages without language-specific annotations by leveraging English-trained reward models. The method shows consistent gains in both structured and open-ended generation tasks across multiple languages while avoiding catastrophic forgetting.

AINeutralarXiv – CS AI · May 276/10

🧠

An In-Vitro Study on Cross-Lingual Generalization in Language Models

Researchers introduce a controlled experimental framework using procedurally generated languages to study cross-lingual transfer in language models, isolating variables like lexical distance and tokenization. Their findings across 700 runs reveal that tokenization preserving reusable substructure—rather than vocabulary size or lexical similarity alone—determines transfer success, with transfer occurring in distinct stages from grammatical competence to masked lexical generalization.

AINeutralarXiv – CS AI · May 116/10

🧠

Multilingual Safety Alignment via Self-Distillation

Researchers propose Multilingual Self-Distillation (MSD), a framework that transfers safety safeguards from high-resource languages like English to vulnerable low-resource languages in large language models. The method eliminates the need for expensive multilingual response data by leveraging an LLM's existing safety capabilities, demonstrating effective cross-lingual protection across diverse jailbreak benchmarks.

AIBullisharXiv – CS AI · Apr 106/10

🧠

FLeX: Fourier-based Low-rank EXpansion for multilingual transfer

Researchers propose FLeX, a parameter-efficient fine-tuning approach combining LoRA, advanced optimizers, and Fourier-based regularization to enable cross-lingual code generation across programming languages. The method achieves 42.1% pass@1 on Java tasks compared to a 34.2% baseline, demonstrating significant improvements in multilingual transfer without full model retraining.

🧠 Llama