Software
The State of Long-Horizon AI Agents: Benchmarks & Vulnerabilities
AgentLAB, the first benchmark of its kind, reveals that even advanced LLM agents are highly susceptible to adaptive, long-horizon attacks, exposing a critical vulnerability in their design.
Sophie Laurent·July 1, 2026