Latest
View all →Anthropic gets US approval to bring back Claude Mythos 5
The US government has approved Anthropic to redeploy Claude Mythos 5, allowing American organisations to use the model…
Meta’s Astryx Brings a CLI and MCP Server to an Open-Source React Design System Agents Can Read
Meta released Astryx this week. It is an open-source design system currently in Beta. The project grew inside…
ByteDance’s “iLLaDA” is a diffusion language model that keeps up with Qwen2.5
ByteDance’s “iLLaDA” is a diffusion language model that keeps up with Qwen2.5 Researchers from Renmin University and Bytedance…
Anthropic’s Mythos 5 is back
Anthropic has received permission to deploy its Mythos 5 model to a limited number of organisations following two…
Trump Admin releases Anthropic Mythos to be used by more than 100 US companies, agencies
The Trump administration has reversed its initial decision to ban Anthropic‘s cybersecurity models, Mythos 5 and Fable 5,…
Trump Administration Allows Anthropic to Release Mythos to Select US Organizations
The US government has lifted the ban on Anthropic‘s most advanced AI model, Claude Mythos 5, permitting the…
Cursor Study Finds Reward Hacking Inflates Coding-Agent Benchmark Scores on SWE-bench Pro
A recent study by Cursor reveals that many modern coding agents achieve high benchmark scores by retrieving known…
Building Supervised Fine-Tuning Data from NVIDIA Open-SWE-Traces: Trajectory Parsing, Patch Analysis, Token Budgets, and Tool-Use Metrics
The NVIDIA Open-SWE-Traces dataset is now available as a practical resource for building supervised fine-tuning data for agentic…
Quoting Dean W. Ball
Dean W. Ball warns that the current business model for frontier artificial intelligence is financially unsustainable. He argues…
AI Tools & Reviews
View all →OpenAI Previews GPT-5.6 With Sol, Terra, and Luna: Tiered Models, New Reasoning Modes, Limited Access
OpenAI has started a limited preview of GPT-5.6, splitting the release into three specific tiers: Sol, Terra, and…
Quoting OpenAI
OpenAI has started a limited preview of its GPT-5.6 series, introducing three distinct models named Sol, Terra, and…
Evaluating performance and efficiency of the GitHub Copilot agentic harness across models and tasks
While the model provides the raw intelligence, the harness shapes how effectively that intelligence is applied. The GitHub Copilot agentic…
Most major AI chatbots still lean left on political questions, even “anti-woke” models are no exception
A Washington Post investigation finds most major AI chatbots take left-leaning positions on political questions, even models marketed…
Grok AI is reportedly a porn platform now, with over half its traffic tied to adult content
The Information reports that more than half of all traffic to Grok AI now directs users toward pornographic…
Google bakes computer control directly into Gemini 3.5 Flash, letting the model see and operate your screen
Google has integrated Computer Use directly into Gemini 3.5 Flash, allowing the model to observe and manipulate screens…

