UK gov's Mythos AI tests help separate cybersecurity threat from hype

The UK Government’s Mythos AI Tests Provide Insight into Cybersecurity Threats

UK government’s AI Security Institute (AISI) has published an initial evaluation of Anthropic‘s Mythos Preview model, adding independent public verification to Anthropic’s claims.
Mythos isn’t significantly different from other recent frontier models in individual cybersecurity-related tasks but could differentiate itself through its ability to chain these tasks into complex multistep attacks necessary for system infiltration.
AISI’s evaluation shows Mythos Preview can complete over 85 percent of “Apprentice” level Capture the Flag (CTF) cybersecurity challenges, indicating significant improvement from earlier AI models like GPT-3.5 Turbo.

Originally published at arstechnica.com. Curated by AI Maestro.

Stay ahead of AI. Get the most important stories delivered to your inbox — no spam, no noise.