The UK Government’s Mythos AI Tests Provide Insight into Cybersecurity Threats
- The UK government’s AI Security Institute (AISI) has published an initial evaluation of Anthropic’s Mythos Preview model, providing independent public verification of Anthropic’s claims.
- On individual cybersecurity tasks, Mythos isn’t significantly different from other recent frontier models, but it could set itself apart through its ability to chain those tasks into the complex multistep attacks needed to infiltrate a system.
- AISI’s evaluation shows Mythos Preview can complete over 85 percent of “Apprentice” level Capture the Flag (CTF) cybersecurity challenges, a significant improvement over earlier AI models such as GPT-3.5 Turbo.
Originally published at arstechnica.com. Curated by AI Maestro.

