UK gov's Mythos AI tests help separate cybersecurity threat from hype

The UK Government’s Mythos AI Tests Provide Insight into Cybersecurity Threats

The UK government’s AI Security Institute (AISI) has conducted initial evaluations of Anthropic‘s Mythos Preview model, adding independent public verification to reports from Anthropic.
While Mythos isn’t significantly different in cybersecurity-related tasks compared to other recent models, AISI highlights its ability to effectively chain these tasks into multistep series necessary for fully infiltrating some systems.
AISI’s evaluations show that Mythos can complete over 85 percent of low-level “Apprentice” tasks in Capture the Flag (CTF) challenges, reflecting steady improvement since GPT-3.5 Turbo’s struggles with these tasks early last year.

Originally published at arstechnica.com. Curated by AI Maestro.

Stay ahead of AI. Get the most important stories delivered to your inbox — no spam, no noise.