AINeutralarXiv – CS AI · 10h ago6/10
🧠
Context-Aware Distillation and Ablation for Text2DSL
Researchers improved Text2DSL, a system that automatically generates domain-specific language code from natural language, by replacing prompt-based generation with context-aware distillation using structured inputs like BNF grammars and API specifications. The enhanced approach scaled verified training data from 4,204 to 10,073 examples while maintaining 99.7% runtime accuracy, and ablation studies confirmed that vocabulary context provides the strongest semantic improvements.