AIBullisharXiv – CS AI · 14h ago7/10
🧠
PuzzleClone: A DSL-Powered Framework for Synthesizing Verifiable Data
Researchers introduce PuzzleClone, a DSL-driven framework that automatically synthesizes large-scale, verifiable datasets for training LLMs on mathematical and logical reasoning tasks. The team generates PC-83K, a benchmark of 83,000+ diverse puzzles, and demonstrates that models fine-tuned on this dataset achieve substantial performance improvements across multiple logic and mathematical benchmarks.