UK gov's Mythos AI tests help separate cybersecurity threat from hype

The UK Government’s Mythos AI Tests Help Separate Cybersecurity Threat from Hype

The UK government’s AI Security Institute (AISI) has evaluated Anthropic’s Mythos Preview model, adding independent public verification to previous reports of its capabilities.
AISI found that while Mythos is not significantly different in tests of individual cybersecurity-related tasks compared to other recent models, it could potentially set itself apart through its ability to chain these tasks into a multistep series necessary for full system infiltration.
Since early 2023, AISI has been evaluating various AI models’ performance on Capture the Flag challenges. Mythos Preview now completes more than 85% of these Apprentice-level tasks, marking significant improvement over earlier iterations.

Originally published at arstechnica.com. Curated by AI Maestro.

Stay ahead of AI. Get the most important stories delivered to your inbox — no spam, no noise.