AI · Neutral · arXiv – CS AI · 8h ago · 6/10
Process-Reward Tactic Evolution for Long-Horizon Bioinformatics Workflows
Researchers introduce Process-Reward Tactic Evolution, a training framework that enables LLM agents to reliably execute complex bioinformatics workflows in Galaxy by accumulating reusable tactics from verified workflow rollouts. The approach combines process verification, curriculum learning, and tactic libraries to improve long-horizon task completion, biological correctness, and execution efficiency compared to baseline methods.