🧠 AI⚪ NeutralImportance 4/10

No Text Needed: Forecasting MT Quality and Inequity from Fertility and Metadata

arXiv – CS AI|Jessica M. Lundin, Ada Zhang, David Adelani, Cody Carroll|March 4, 2026 at 05:00 AM|3 views

🤖AI Summary

Researchers demonstrate that machine translation quality can be accurately predicted without running translation systems, using only token fertility ratios, token counts, and linguistic metadata. The study achieved R² scores of 0.66-0.72 when forecasting GPT-4o translation performance across 203 languages in the FLORES-200 benchmark.

Key Takeaways

→Translation quality can be predicted with high accuracy using only fertility ratios, token counts, and basic linguistic metadata.
→Gradient boosting models achieved R² scores of 0.66 for translations into English and 0.72 for English translations into other languages.
→Typological factors dominate quality predictions for translations into English, while fertility plays a larger role for diverse target languages.
→The findings suggest translation quality is shaped by both token-level fertility and broader linguistic typology.
→This research offers new insights for multilingual evaluation and quality estimation without running actual translation systems.

Mentioned Tokens

$XX$0.0000▲+0.0%

Let AI manage these →

Non-custodial · Your keys, always