Latest
View all →GPT and Claude failed Bridgewater’s finance tests because the right answers were never public
Open-source AI model beats GPT and Claude on Bridgewater finance tests Bridgewater Associates and Thinking Machines Lab claim…
Behind the Blog: With Blogs Like These, Who Needs a Private Jet
The U.S. Supreme Court ruled today in Chatrie v. United States that individuals possess a reasonable expectation of…
Chinese AI video maker Kling raises $2 billion as it gears up for Hong Kong IPO
Kuaishou has secured approximately $2 billion in funding for its artificial intelligence video unit, Kling, bringing the company’s…
Job openings in music tech this month: Sync music, backyard engineering, lab work at major universities, and more
Arturia has released the AstroLab 37, but MusicTech is reporting a separate list of career openings in music…
Meet WebBrain: An Open-Source, Local-First AI Browser Agent That Reads Pages and Automates Tasks in Chrome and Firefox
WebBrain is a free, open-source browser agent for Chrome and Firefox that reads pages, extracts data, and automates…
Interfaze Ships diffusion-gemma-asr-small, an Open-Source Diffusion ASR Model Transcribing Six Languages via DiffusionGemma’s Parallel Denoising Decoder
Interfaze has released diffusion-gemma-asr-small, an open-source speech recognition model that processes audio using a diffusion decoder rather than…
Mark Zuckerberg tells staff that AI agents haven’t progressed as quickly as he’d hoped
Meta CEO Mark Zuckerberg informed staff during an internal town hall that the development of artificial intelligence agents…
llm-coding-agent 0.1a0
Release: llm-coding-agent 0.1a0 Simon Willison has published a new Python library built on his evolving LLM library. This…
RAG-Anything Tutorial: Build a Multimodal Retrieval Pipeline for Text, Tables, Equations, and Images in Colab
The tutorial demonstrates how to build a retrieval pipeline using RAG-Anything that processes text, tables, equations, and images…
AI Tools & Reviews
View all →Using DSPy to evaluate and improve Datasette Agent’s SQL system prompts
Simon Willison applied DSPy to evaluate and improve the system prompts for Datasette Agent, a tool that executes…
Anthropic says it cut 80 percent of Claude Code’s system prompt because Fable 5 models “want a smaller system prompt”
Anthropic has reduced the system prompt for Claude Code by 80 percent following the release of its Fable…
Gemini Spark, Google’s agentic assistant, is now available on Mac
Gemini Spark is now running on Mac. Google added the feature to its existing desktop app on Wednesday.…
Hidden code in Claude Code secretly flagged Chinese users
Anthropic is removing a covert surveillance feature from its coding tool, Claude Code, after it drew anger on…
Google built a great smart speaker, but Gemini isn’t ready for it
Google has launched the Google Home Speaker, its first dedicated smart speaker in six years, to replace the…
Claude Science is Anthropic’s newest flagship product
Anthropic announced Claude Science at a Tuesday event for pharmaceutical executives and biotech founders. This new product is…

