AI · Bullish · arXiv – CS AI · 17h ago · 7/10
DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning
Researchers introduce DataChef-32B, an AI system that uses reinforcement learning to generate optimal data processing recipes for training large language models. By designing complete data pipelines automatically, the system removes the need for manual data curation and achieves performance comparable to human experts across six benchmark tasks.