AINeutralarXiv – CS AI · 14h ago6/10
🧠
MIRA: Mid-training Rubric Anchoring for Source-Aware Data Selection
Researchers introduce MIRA, a framework for optimizing data selection during mid-training of large language models by dynamically discovering and applying source-specific evaluation rubrics. The approach achieves comparable performance to full-corpus training while reducing token usage by 50% on code-oriented tasks across 21 diverse data sources.