The UK Government’s Mythos AI Tests Provide Insight into Cybersecurity Threats
- The UK government’s AI Security Institute (AISI) has conducted initial evaluations of Anthropic‘s Mythos Preview model, adding independent public verification to reports from Anthropic.
- While Mythos isn’t significantly different in cybersecurity-related tasks compared to other recent models, AISI highlights its ability to effectively chain these tasks into multistep series necessary for fully infiltrating some systems.
- AISI’s evaluations show that Mythos can complete over 85 percent of low-level “Apprentice” tasks in Capture the Flag (CTF) challenges, reflecting steady improvement since GPT-3.5 Turbo’s struggles with these tasks early last year.
Originally published at arstechnica.com. Curated by AI Maestro.
Stay ahead of AI. Get the most important stories delivered to your inbox — no spam, no noise.

