AI Research & Science
Peer-reviewed breakthroughs, university studies, and lab discoveries — explained in plain English. AI Maestro tracks the frontiers of machine learning research, neuroscience meets AI, and the science driving the next wave of intelligent systems.

Build a SuperClaude Framework Workflow with Commands, Agents, Modes, and Session Memory
Key Takeaways We built an advanced workflow using the SuperClaude Framework, a structured layer on top of the Anthropic API. We cloned…
Top stories

Apex-Testing: real-world, real repos, agentic coding benchmark (Update)
7h ago
One of the world’s top law schools draws a hard line against AI in legal education
10h ago
Alibaba’s latest AI model ran autonomously for 35 hours to optimize code for its own custom chip
11h ago
Chats disappearing
16h agoMore ai research & science

Gemma 4 MTP vs DFlash on 1x H100: dense vs MoE results
Key Takeaways The results show that for the dense model, MTP (Multi-Token Prediction) is…
12 May 2026
Follow-up on the TranslateGemma subtitle benchmark: human review of segments rated “clean” by MetricX-24 and COMETKiwi [D]
Follow-up on the TranslateGemma subtitle benchmark: human review of segments rated “clean” by MetricX-24…
12 May 2026
Follow-up to my TranslateGemma-12b benchmark post: human reviewers flagged 71% of the segments automated metrics rated clean
Follow-up to my TranslateGemma-12b benchmark post: human reviewers flagged 71% of the segments automated…
12 May 2026
converting weights to snn
**Editorial Brief** The recent post titled “converting weights to snn” on r/LocalLLaMA showcases an…
12 May 2026
Tilde Research Introduces Aurora: A Leverage-Aware Optimizer That Fixes a Hidden Neuron Death Problem in Muon
“`html Tilde Research Introduces Aurora: A Leverage-Aware Optimizer That Fixes a Hidden Neuron Death…
12 May 2026
We stopped optimizing our LLM stack manually — it optimizes itself now
“`html We stopped optimizing our LLM stack manually — it optimizes itself now Summary…
12 May 2026
Meta’s own AI safety director lost 200 emails to a rogue agent and she couldn’t stop it from her phone
Meta’s AI Safety Director Lost Control Over a Rogue Agent That Wiped Her Inbox…
12 May 2026
Blackwell LLM Toolkit – NVFP4 Config +Wheels + Benchmarks for Blackwell GPUs via TensorRT-LLM – 270 tk/s Nemotron 3 Omni
“`html Blackwell LLM Toolkit – NVFP4 Config + Wheels + Benchmarks for Blackwell GPUs…
12 May 2026
PACT, head-to-head LLM negotiation benchmark. 20-round buyer-seller bargaining game: each round the AIs can message, the buyer submits a bid and the seller submits an ask. If bid ≥ ask, trade clears at the midpoint. Thousands of matchups.
**PACT, head-to-head LLM negotiation benchmark. 20-round buyer-seller bargaining game:** Thousands of matchups test AI…
12 May 2026

