AI · Neutral · arXiv – CS AI · 10h ago · 6/10
Teacher-Aware Evolution of Heuristic Programs from Learned Optimization Policies
Researchers propose a teacher-aware evolutionary framework that uses pre-trained learned optimization policies to guide the automatic design of heuristic programs for combinatorial optimization problems. During evolution, the method scores candidates with behavioral feedback from the teacher policies rather than with endpoint performance alone. The reported results beat baseline LLM-driven approaches, and the evolved heuristics need no neural inference at deployment.
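To make the idea concrete, here is a minimal toy sketch of teacher-aware evolution. It is not the paper's implementation. Everything in it is an assumption chosen for illustration: the knapsack instance (`ITEMS`, `CAPACITY`), the value/weight rule standing in for a pre-trained teacher policy, the two-parameter linear heuristic, the Gaussian mutation used in place of LLM-driven program edits, and the weight `lam` that mixes the two fitness terms.

```python
import random

# Hypothetical toy instance: (value, weight) pairs for a 0/1 knapsack.
ITEMS = [(60, 10), (100, 20), (120, 30), (30, 5), (90, 25), (20, 15)]
CAPACITY = 50


def teacher_ranking(items):
    """Stand-in for a pre-trained policy: rank items by value/weight ratio."""
    return sorted(range(len(items)), key=lambda i: -items[i][0] / items[i][1])


def heuristic_ranking(params, items):
    """Candidate heuristic: score = a*value - b*weight, ranked high to low."""
    a, b = params
    return sorted(range(len(items)),
                  key=lambda i: -(a * items[i][0] - b * items[i][1]))


def greedy_value(order, items, capacity):
    """Endpoint performance: total value packed when following the ranking."""
    total, remaining = 0, capacity
    for i in order:
        value, weight = items[i]
        if weight <= remaining:
            total += value
            remaining -= weight
    return total


def agreement(order_a, order_b):
    """Behavioral feedback: fraction of item pairs both rankings order alike."""
    pos_a = {item: rank for rank, item in enumerate(order_a)}
    pos_b = {item: rank for rank, item in enumerate(order_b)}
    n = len(order_a)
    pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
    same = sum((pos_a[i] < pos_a[j]) == (pos_b[i] < pos_b[j]) for i, j in pairs)
    return same / len(pairs)


def fitness(params, lam=0.5):
    """Normalized endpoint value plus lam times agreement with the teacher."""
    order = heuristic_ranking(params, ITEMS)
    endpoint = greedy_value(order, ITEMS, CAPACITY) / sum(v for v, _ in ITEMS)
    behavior = agreement(order, teacher_ranking(ITEMS))
    return endpoint + lam * behavior


def evolve(generations=50, pop_size=8, seed=0):
    """Elitist loop: keep the best half, add one mutated child per parent."""
    rng = random.Random(seed)
    population = [(rng.uniform(0, 1), rng.uniform(0, 1)) for _ in range(pop_size)]
    for _ in range(generations):
        parents = sorted(population, key=fitness, reverse=True)[: pop_size // 2]
        children = [(a + rng.gauss(0, 0.1), b + rng.gauss(0, 0.1))
                    for a, b in parents]
        population = parents + children
    return max(population, key=fitness)


if __name__ == "__main__":
    best = evolve()
    print("evolved parameters:", best)
    print("fitness:", fitness(best))
```

In this sketch the teacher is consulted only inside `fitness`, that is, during evolution. The returned parameters can then rank items on their own through `heuristic_ranking`, which mirrors the claim that no neural inference is needed at deployment.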