UK gov’s Mythos AI tests help separate cybersecurity threat from hype
- The UK government’s AI Security Institute (AISI) has published an initial evaluation of Anthropic’s Mythos Preview model, adding independent public verification to previous reports.
- While Mythos doesn’t differ significantly from previous models on individual cybersecurity-related tasks, AISI highlights its ability to chain those tasks into complex multistep attacks, which may set it apart.
- AISI’s tests show that Mythos can complete over 85 percent of low-level “Apprentice” capture-the-flag (CTF) tasks, reflecting a steady rise in performance since early 2023, when GPT-3.5 Turbo failed to complete any such tasks.
Originally published at arstechnica.com. Curated by AI Maestro.

