UK gov’s Mythos AI tests help separate cybersecurity threat from hype

The UK government’s AI Security Institute (AISI) has published an initial evaluation of Anthropic’s Mythos Preview model, adding independent public verification to previous reports.
While Mythos does not differ significantly from previous models on individual cybersecurity-related tasks, AISI highlights its ability to chain those tasks into complex multistep attacks, which may set it apart.
AISI’s tests show that Mythos can complete over 85 percent of low-level “Apprentice” capture-the-flag (CTF) tasks, reflecting a steady rise in performance since early 2023, when GPT-3.5 Turbo failed to complete any such tasks.

Originally published at arstechnica.com. Curated by AI Maestro.
