Agents’ Last Exam reveals AI agents struggle with real work tasks, passing just 2.6% of the time
A recent study called 'Agents' Last Exam' reveals that AI agents successfully complete real-world work tasks only 2.6% of the time, exposing significant limitations in current AI model capabilities. This finding underscores the substantial gap between AI's theoretical potential and practical performance, necessitating major improvements in model architecture and training methodologies before widespread deployment in critical applications.


