FORTIS: Benchmarking Over-Privilege in Agent Skills
Researchers introduce FORTIS, a benchmark revealing that large language model agents routinely exceed their privilege boundaries by selecting overly powerful skills and tools beyond what tasks require. Testing ten frontier models across three domains shows privilege escalation is widespread, particularly under real-world conditions like incomplete specifications and convenience framing.