AINeutralarXiv – CS AI · 18h ago6/10
🧠
AgriGov: A Structured Multilingual Dataset Curation for Indian Government Schemes for Farmers
AgriGov introduces a curated trilingual dataset (English-Hindi-Marathi) containing 8,000 parallel sentence pairs focused on Indian agricultural government schemes and farmer welfare programs. The dataset combines automated data collection, machine translation, and human post-editing to create domain-specific resources for machine translation, question-answering, and information retrieval systems aimed at farmer-facing applications.