AIBullisharXiv – CS AI · 6h ago6/10
🧠
Neural Machine Translation for Low-Resource Tangkhul--English
Researchers have developed a neural machine translation system for Tangkhul, a severely under-resourced Tibeto-Burman language spoken in Manipur, India, achieving a BLEU score of 39.97 using a fine-tuned ByT5-large model trained on 38,336 parallel sentences. This work addresses a significant gap in NLP infrastructure for one of India's marginalized linguistic communities and demonstrates practical approaches to machine translation for languages with minimal computational resources.