y0news
#low-resource-languages2 articles
2 articles
AINeutralarXiv โ€“ CS AI ยท 4h ago1
๐Ÿง 

Terminology Rarity Predicts Catastrophic Failure in LLM Translation of Low-Resource Ancient Languages: Evidence from Ancient Greek

A study evaluated large language models (Claude, Gemini, ChatGPT) translating Ancient Greek texts, finding high performance on previously translated works (95.2/100) but declining quality on untranslated technical texts (79.9/100). Terminology rarity was identified as a strong predictor of translation failure, with rare terms causing catastrophic performance drops.

AINeutralarXiv โ€“ CS AI ยท 4h ago1
๐Ÿง 

Task-Lens: Cross-Task Utility Based Speech Dataset Profiling for Low-Resource Indian Languages

Researchers propose Task-Lens, a cross-task survey analyzing 50 Indian speech datasets across 26 languages for nine downstream speech tasks. The study reveals untapped metadata in existing datasets that could support multiple AI speech applications and identifies critical gaps in resources for underserved Indian languages.