AI · Neutral · arXiv CS AI · 14h ago · 6/10
Domain-Specific Data Generation Framework for RAG Adaptation
RAGen is a new framework for generating domain-specific training data to improve Retrieval-Augmented Generation (RAG) systems. The system creates question-answer-context triples using semantic chunking, concept extraction, and Bloom's Taxonomy principles, enabling faster adaptation of LLMs to specialized domains like scientific research and enterprise knowledge bases.
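To make the pipeline concrete, here is a minimal Python sketch of how question-answer-context triples might be assembled. It is not RAGen's actual code or API: every name (`chunk_text`, `extract_concepts`, `build_triples`, `BLOOM_TEMPLATES`) is hypothetical. Fixed sentence windows stand in for semantic chunking, word frequency stands in for concept extraction, and fixed question templates plus the first sentence mentioning the concept stand in for LLM-generated questions and answers.

```python
import re
from collections import Counter
from dataclasses import dataclass

# Hypothetical question templates, one per Bloom's Taxonomy level.
BLOOM_TEMPLATES = {
    "remember": "What is {concept}?",
    "understand": "How would you explain {concept} in your own words?",
    "apply": "How could {concept} be used in practice?",
    "analyze": "How does {concept} relate to the other ideas in this passage?",
    "evaluate": "What are the strengths and weaknesses of {concept}?",
    "create": "How would you design a new approach based on {concept}?",
}

STOPWORDS = {"that", "this", "with", "from", "have", "which", "their", "there", "these", "those"}
SENTENCE_SPLIT = re.compile(r"(?<=[.!?])\s+")


@dataclass
class Triple:
    question: str
    answer: str
    context: str
    bloom_level: str


def chunk_text(text, max_sentences=2):
    """Group sentences into fixed windows (a crude stand-in for semantic chunking)."""
    sentences = [s.strip() for s in SENTENCE_SPLIT.split(text) if s.strip()]
    return [
        " ".join(sentences[i:i + max_sentences])
        for i in range(0, len(sentences), max_sentences)
    ]


def extract_concepts(chunk, top_k=1):
    """Pick the most frequent non-stopword terms (a stand-in for concept extraction)."""
    words = re.findall(r"[a-z]+", chunk.lower())
    counts = Counter(w for w in words if len(w) > 3 and w not in STOPWORDS)
    return [word for word, _ in counts.most_common(top_k)]


def first_mention(chunk, concept):
    """Return the first sentence mentioning the concept (a placeholder answer)."""
    for sentence in SENTENCE_SPLIT.split(chunk):
        if concept in sentence.lower():
            return sentence
    return chunk


def build_triples(text):
    """Emit one triple per (chunk, concept, Bloom level) combination."""
    triples = []
    for chunk in chunk_text(text):
        for concept in extract_concepts(chunk):
            answer = first_mention(chunk, concept)
            for level, template in BLOOM_TEMPLATES.items():
                question = template.format(concept=concept)
                triples.append(Triple(question, answer, chunk, level))
    return triples


sample = (
    "Retrieval systems fetch documents. Retrieval quality depends on indexing. "
    "Generation models write answers. Generation quality depends on context."
)
triples = build_triples(sample)
```

Calling `build_triples` on a domain document yields records that could be serialized as fine-tuning data. In a realistic setting each stand-in would presumably be replaced by an embedding-based chunker and a language model.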