AI · Neutral · arXiv – CS AI · 8h ago · 7/10
From Question Answering to Task Completion: A Survey on Agent System and Harness Design
A comprehensive survey examines LLM-based agent systems through a model-harness lens, arguing that agent performance depends on the interaction between foundation models, execution infrastructure, and task structure rather than on model capabilities alone. The survey identifies six core runtime responsibilities and maps how different harness configurations affect long-horizon task completion, efficiency, and reliability.