y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#cross-lingual News & Analysis

9 articles tagged with #cross-lingual. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

9 articles
AIBullisharXiv – CS AI · May 287/10
🧠

BioELX: Cross-lingual Biomedical Entity Linking via Alias-based Retrieval and LLM Ranking

Researchers introduce BioELX, a two-stage cross-lingual biomedical entity linking system that maps medical mentions across languages to knowledge base identifiers without requiring task-specific training data. The framework combines multilingual alias-enriched retrieval with LLM-based ranking, achieving state-of-the-art results across five benchmarks with substantial improvements for low-resource languages.

AINeutralarXiv – CS AI · May 127/10
🧠

MULTITEXTEDIT: Benchmarking Cross-Lingual Degradation in Text-in-Image Editing

Researchers introduce MULTITEXTEDIT, a benchmark for evaluating text-in-image editing across 12 languages, revealing significant cross-lingual performance degradation in AI models. The study uncovers pronounced accuracy issues in non-English languages, particularly Hebrew and Arabic, highlighting the need for multilingual improvements in visual content creation AI.

AIBullisharXiv – CS AI · Apr 157/10
🧠

AdaMCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Multilingual Chain-of-Thought

Researchers introduce AdaMCoT, a framework that improves multilingual reasoning in large language models by dynamically routing intermediate thoughts through optimal 'thinking languages' before generating target-language responses. The approach achieves significant performance gains in low-resource languages without requiring additional pretraining, addressing a key limitation in current multilingual AI systems.

AINeutralarXiv – CS AI · Jun 95/10
🧠

Decoupling Semantics and Logic: A Training-Free Coarse-to-Fine Pipeline for Video Retrieval-Augmented Generation

Researchers present a training-free Video RAG (Retrieval-Augmented Generation) system that decouples semantic retrieval from logical reasoning to improve cross-lingual video comprehension and reduce hallucinations. The two-stage pipeline uses dense retrieval with clean visual data followed by LLM-powered cognitive reranking, achieving strong precision in information retrieval and persona-conditioned generation.

AINeutralarXiv – CS AI · Jun 16/10
🧠

Shared Doubt: Zero-shot Cross-Lingual Confidence Estimation for Language Models

Researchers demonstrate that multilingual large language models encode shared confidence features that transfer across languages without retraining. A lightweight linear probe trained on English can predict answer correctness in unseen languages with zero-shot generalization, suggesting confidence estimation mechanisms are language-universal in LLMs.

AINeutralarXiv – CS AI · May 286/10
🧠

Sense Representations Are Inducible Interfaces

Researchers introduce ACROS, a method that adds explicit sense representations (per-token meaning decompositions) to frozen pretrained language models without retraining. The technique achieves competitive results in word-sense disambiguation, lexical steering, and cross-lingual adaptation, positioning sense representations as a practical interface for existing models.

AINeutralarXiv – CS AI · May 276/10
🧠

Beyond Transfer Accuracy: Faithful Circuits for Controlled Low-Resource Adaptation

Researchers introduce a counterfactual-free circuit discovery method adapted for unstructured natural text, enabling Circuit-Targeted Supervised Fine-Tuning (CT-SFT) that improves low-resource model adaptation while preserving performance on source tasks and preventing catastrophic forgetting.

AIBullishHugging Face Blog · Feb 216/106
🧠

SigLIP 2: A better multilingual vision language encoder

SigLIP 2 represents an advancement in multilingual vision-language encoding technology, building upon the original SigLIP model. This improved encoder aims to better understand and process visual content across multiple languages, potentially enhancing AI applications that require cross-lingual visual comprehension.

AINeutralarXiv – CS AI · Apr 64/10
🧠

Reliability Gated Multi-Teacher Distillation for Low Resource Abstractive Summarization

Researchers developed EWAD and CPDP techniques for improving multi-teacher knowledge distillation in low-resource abstractive summarization tasks. The study across Bangla and cross-lingual datasets shows logit-level knowledge distillation provides most reliable gains, while complex distillation improves short summaries but degrades longer outputs.