🧠 AI🟢 BullishImportance 7/10

Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing

arXiv – CS AI|Justin W. Lin, Eliot Krzysztof Jones, Donovan Julian Jasper, Ethan Jun-shen Ho, Anna Wu, Arnold Tianyi Yang, Neil Perry, Andy Zou, Matt Fredrikson, J. Zico Kolter, Percy Liang, Dan Boneh, Daniel E. Ho|March 4, 2026 at 05:00 AM|2 views

🤖AI Summary

Researchers conducted the first comprehensive evaluation comparing AI agents to human cybersecurity professionals in live penetration testing on a university network with 8,000 hosts. The new ARTEMIS AI agent framework placed second overall, discovering 9 vulnerabilities with 82% accuracy and outperforming 9 of 10 human participants while costing significantly less at $18/hour versus $60/hour for human testers.

Key Takeaways

→ARTEMIS AI agent outperformed 9 out of 10 human cybersecurity professionals in real-world penetration testing.
→AI agents demonstrated cost advantages at $18/hour compared to $60/hour for professional penetration testers.
→ARTEMIS achieved an 82% valid submission rate while discovering 9 valid vulnerabilities in the enterprise environment.
→AI agents excelled at systematic enumeration and parallel exploitation but struggled with GUI-based tasks and had higher false-positive rates.
→Existing AI scaffolds like Codex and CyAgent underperformed compared to most human participants, highlighting the advancement of ARTEMIS.

#ai-agents #cybersecurity #penetration-testing #artemis #enterprise-security #automation #vulnerability-assessment #ai-vs-humans #cost-efficiency #research

Read Original →via arXiv – CS AI

Act on this with AI

Stay ahead of the market.

Connect your wallet to an AI agent. It reads balances, proposes swaps and bridges across 15 chains — you keep full control of your keys.

Connect Wallet to AI →How it works

AIMay 6

Your company’s AI could delete everything in 9 seconds. ServiceNow wants to be the kill switch

AIMay 6

Hut 8 (HUT) Stock Soars 37% on Massive $9.8 Billion AI Data Center Agreement

AIMay 6

Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing

Your company’s AI could delete everything in 9 seconds. ServiceNow wants to be the kill switch

Hut 8 (HUT) Stock Soars 37% on Massive $9.8 Billion AI Data Center Agreement

S&P 500 and NASDAQ hit record highs as AI chip stocks surge