AI · Neutral · arXiv CS AI · 14h ago · 6/10
Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization
Researchers introduce the 'Turing Test on Screen,' a framework for measuring how well autonomous GUI agents can mimic human behavior to evade detection systems. The study reveals that current LLM-based agents exhibit unnatural interaction patterns and proposes humanization methods to improve their ability to operate undetected in adversarial digital environments.