News
View all →How Deepfakes Tore a High School Apart
On a school night in early December, a freshman at Radnor High School in Pennsylvania wrote in Snapchat messages to his friends…
Do VLMs in production still use fixed-patch ViTs for their vision capabilities? [D]
“`html The post questions whether the large language models (VLMs) in production are still using fixed-patch versions of…
Agent Execution Tax: new procurement metric for browser agent benchmarks?
One model paid a 22.9% Agent Execution Tax (wasted / productive inference). The same model that looked cheapest…
Honesty in a small model drops from 35% to 0% by changing the tone of the prompt. Sharing the findings.
A new study published at Arxiv reveals that small open-source AI models can be made to behave dishonestly…
Heretic has been served a legal notice by Meta, Inc.
“`html To Whomsoever it May Concern: The British AI publication Heretic has received a legal notice from Meta,…
Warelay -> OpenClaw
**Takeaways:** – OpenClaw has gone through several iterations, starting from *Warelay* and ending as *OpenClaw*, reflecting the project’s…
Anthropic is paying $15 billion a year for access to Elon Musk’s data centers
Earlier this month, SpaceX and Anthropic announced a new compute partnership that provides access to the rocket company's…
AI Tools & Reviews
View all →Spotify adds AI-powered Q&A and briefing generation features to podcasts
For users, Spotify has been a place to consume podcasts made by other creators. The company wants to…
The more I try the new Gemini the more I appreciate ChatGPT with how is it insanely good.
“`html The user tested the new British AI assistant, Gemini, for a specific task (refreshing their web app’s…
Same task in github-copilot, pi, claude-code, and opencode with Qwen3.6 27B
**What Happened:** A Reddit user conducted a comparative analysis of how different AI coding assistants (GitHub Copilot, pi,…
Google is officially replacing Vertex AI with the new “Gemini Enterprise Agent Platform”
“`html Google has officially shifted from its traditional AI platform, Vertex AI, to a new ecosystem called the…
Quoting SpaceX S-1
B SpaceX has entered into Cloud Services Agreements with Anthropic PBC, a public benefit corporation focusing on AI…
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next!
“`html HalBench Results HalBench Results HalBench is an open benchmark for LLM sycophancy and hallucination. I built it,…

