AI Tracker
AI News Tracker
A daily executive briefing on GenAI, agentic dev tooling, and open-source LLMs you can run locally — plus a Friday weekly digest and an end-of-month review focused on AI Engineering. Auto-curated by Claude Opus 4.7 from 60+ sources.
Get the daily brief in your inbox
Weekday mornings. Double opt-in, no tracking pixels, unsubscribe in one click.
Also want the weekly and monthly digests? Pick your cadence →
-
12 Jun 2026
Claude Managed Agents, Cursor Bugbot 3x Faster, DiffusionGemma 4x Speed
Anthropic launched Claude Managed Agents — composable APIs for production-grade agent orchestration — while Cursor shipped a significantly faster Bugbot and Google released DiffusionGemma.
-
11 Jun 2026
Claude Fable 5, DiffusionGemma 4x Speed, Claude Code v2.1.172
Anthropic launched Claude Fable 5 and Mythos 5, with immediate controversy over hidden safeguards that limit competing labs' usage.
-
10 Jun 2026
Claude Fable 5, Gemma 4 12B, MiMo 1000 tok/s
Anthropic released Claude Fable 5, a Mythos-class model now available in Claude Code v2.1.170, dominating coding benchmarks and sustaining multi-day agentic runs.
-
8 Jun 2026 · quiet day
Gemma 4 MTP in llama.cpp, datasette-agent-edit, llama.cpp KV-cache fixes
---
-
7 Jun 2026
Vercel Agentic Ops Deep Dive, LiteLLM v1.88, llama.cpp Qwen3.5 Video
Vercel's CPO details running 96% of marketing and 93% of support on AI agents, with their SDR team fully reabsorbed.
-
6 Jun 2026
Claude Code Fallbacks, Cursor 3.7, Anthropic RSI
Claude Code v2.1.166 ships fallback model chains and glob-based deny rules; Cursor 3.7 drops design-mode improvements.
-
5 Jun 2026 WEEKLY DIGEST
Anthropic $65B / IPO, Nemotron 3 Ultra, Claude Code Plugins
Anthropic dominated the week: a $65B Series H at $965B valuation, a confidential IPO filing, Opus 4.8 shipping with dynamic workflows and effort controls, and the claim that 80% of its production code is now Claude-authored. Meanwhile NVIDIA dropped Nemotron 3 Ultra (550B MoE, 55B active, 1M context) as the strongest US open-weights model, Microsoft launched its MAI model family at Build, and Claude Code gained auto-loaded plugins from `.claude/skills` — directly relevant to your skills-framework workflow. The capital intensity of the race is now undeniable: Alphabet announced an $80B stock raise purely for AI compute.
-
5 Jun 2026
Cursor 3.7, Claude Code v2.1.163, Nemotron 3 Ultra
Cursor ships version 3.7 with canvas improvements, Claude Code adds version-pinning and plugin management, and Nemotron 3 Ultra launches for long-running agents.
-
4 Jun 2026
Claude Code v2.1.162, MiniMax M3 1M-Context, Anthropic IPO Filing
Claude Code ships multi-agent observability improvements while MiniMax announces open-weight 1M-context frontier model and Anthropic files for IPO amid enterprise cost scrutiny.
-
3 Jun 2026
Nemotron 3 Ultra 550B, Claude Code v2.1.161, Microsoft MAI Models
NVIDIA released Nemotron 3 Ultra (550B params, 55B active), the strongest US open-weights model, while Microsoft launched its MAI model family at Build.
-
2 Jun 2026
Grok Build 0.1, Mellum2 12B Open-Source, Claude Code v2.1.160
xAI launched Grok Build 0.1 for agentic coding at $1/M input tokens, while JetBrains open-sourced Mellum2 12B for production AI routing.
-
1 Jun 2026 · quiet day
MiniMax M3 1M Context, Datasette Agent Ships, llama.cpp iGPU Default
---
-
31 May 2026 MONTHLY DIGEST
Anthropic's $965B Valuation Month, Gemini 3.5 Flash Ships, Dynamic Workflows Go Parallel
May belonged to Anthropic. A $65B Series H at $965B valuation, first profitable quarter in sight, Opus 4.8 shipped, and Dynamic Workflows in Claude Code — orchestrating hundreds of parallel subagents — landed in research preview. Jarred Sumner rewrote Bun from Zig to Rust in 11 days using it. For teams building with agents, this is the month the overnight-agent-factory pattern got official infrastructure.
-
31 May 2026 · quiet day
Anthropic Sandboxing Docs, llama.cpp Qwen 3.5 TP Fix, LiteLLM v1.84.4
---
-
30 May 2026
Opus 4.8, Cursor 3.6, Claude Code Plugins
Anthropic shipped Claude Opus 4.8 with effort controls and dynamic workflows, alongside Claude Code v2.1.157 introducing auto-loaded plugins and agent dispatch.
-
29 May 2026 WEEKLY DIGEST
Anthropic $65B Series H, Claude Code Dynamic Workflows, MCP Spec Release Candidate
Anthropic dominated the week: a $65B Series H at $965B valuation with $47B run-rate revenue, Claude Opus 4.8 shipping with honest 'modest improvement' framing, and — most relevant to your overnight-agent-factory — Dynamic Workflows in Claude Code v2.1.154, which orchestrates hundreds of parallel subagents from a single task. Jarred Sumner used it to rewrite Bun from Zig to Rust (750K lines, 11 days, 99.8% test pass). Meanwhile the MCP spec release candidate dropped with breaking changes, a stateless HTTP core, and proper OAuth — your in-house MCP servers will need migration before July 28.
-
29 May 2026
Claude Opus 4.8, Claude Code Dynamic Workflows, Anthropic $65B Series H
Anthropic shipped Claude Opus 4.8 with dynamic workflows in Claude Code that orchestrate hundreds of parallel subagents, alongside a $65B Series H at $965B valuation.
-
28 May 2026
xAI-Cursor Acquisition, Anthropic Containment Guide, Claude Code v2.1.153
xAI has warned staff to limit contact with Cursor employees as their acquisition progresses, signalling imminent structural changes to the most popular AI coding editor.
-
27 May 2026
Claude Code v2.1.152, Copilot Cowork Exfiltration, GPT-5.6 Leak
Claude Code v2.1.152 ships `/code-review --fix` auto-apply and skill-level tool disabling, directly upgrading agentic coding workflows.
-
26 May 2026
MCP Spec Overhaul, Anthropic Mythos 1, DeepSeek V4 Price Cut
The next MCP specification release candidate drops with a stateless HTTP core, OAuth alignment, and breaking changes shipping 28 July.
-
25 May 2026 · quiet day
Task-Observer Skills, Qwen3.6 vs Gemma4 Local, Anthropic SMB Skills
---
-
24 May 2026 · quiet day
llama.cpp Native Tools, Claude Code Offline LLMs, LiteLLM v1.86
---
-
23 May 2026
Cursor $3B Revenue, Qwen3.7 Agent Model, Cursor Cloud Agent Lessons
Cursor hits $3 billion annualised revenue with SpaceX's $60B acquisition option looming, while Qwen3.7-Max launches as a top-scoring agent-foundation model.
-
22 May 2026 WEEKLY DIGEST
Anthropic $45B SpaceX Deal, Cursor $3B ARR, Gemini 3.5 Flash
Anthropic's $45 billion compute deal with SpaceX — $1.25B/month for three years — is the week's landmark. It signals that the compute constraint is now the binding one for frontier labs, not talent or data. Simultaneously, Cursor hit $3B ARR with 3,000+ enterprise customers, validating agentic coding as a category. Google shipped Gemini 3.5 Flash at I/O with agentic-first positioning, Karpathy joined Anthropic, and Cursor released both Composer 2.5 and a detailed cloud-agent architecture post. For a CTO running headless agent fleets, the infrastructure layer beneath your agents is consolidating fast — and the cost of frontier inference is about to rise.
-
22 May 2026
Anthropic $45B SpaceX Deal, GitLab 19.0, Google Agent Executor
Anthropic secures $45B compute deal with SpaceX while projecting its first profitable quarter on $10.9B revenue.
-
21 May 2026
Cursor 3.5, Karpathy Joins Anthropic, Anthropic-xAI $15B/yr
Cursor ships 3.5, Karpathy joins Anthropic for frontier R&D, and SpaceX's S-1 reveals Anthropic is paying $15B/year for Colossus compute.
-
20 May 2026
Gemini 3.5 Flash, Anthropic Acquires Stainless, Composer 2.5
Google I/O shipped Gemini 3.5 Flash to GA with improved agentic execution, while Anthropic acquired Stainless and Cursor released Composer 2.5.
-
19 May 2026
Claude Code v2.1.144, Cursor Composer 2.5, Deep Agents v0.6
Claude Code v2.1.144 ships /resume for background sessions, and Anthropic engineers explain their Claude Code workflows on Lenny's Podcast.
-
18 May 2026
llama.cpp MTP Speedup, Claude Context Tools, DeepSeek V4 1M Context
llama.cpp ships MTP prompt-decode optimisation while community benchmarks DeepSeek V4's 1M context window across real codebases.
-
17 May 2026
SGLang DeepSeek V4, llama.cpp MTP Support, LiteLLM v1.85
SGLang v0.5.12 ships full DeepSeek V4 inference with expert parallelism and disaggregated prefill-decode, while llama.cpp lands native MTP speculative decoding.
-
16 May 2026
Grok Build CLI, Cursor Cloud Agents, Claude Code v2.1.143
xAI launched Grok Build, a terminal coding agent with worktrees, subagents, and headless mode — directly competing with Claude Code.
-
15 May 2026 WEEKLY DIGEST
Grok Build CLI, Claude Code Agent View, Cerebras $5.5B IPO
The coding-agent arms race escalated sharply this week. xAI shipped Grok Build — a terminal agent with worktrees, subagents, headless mode, and MCP support — directly challenging Claude Code's territory. Claude Code itself landed agent view (`claude agents`), `/goal` for persistent multi-turn loops, and fast mode on Opus 4.7. OpenAI pushed Codex everywhere: mobile, Chrome, Windows sandbox, hooks, and programmatic tokens. Meanwhile Cerebras raised $5.5B in the year's biggest IPO, signalling Wall Street's conviction that inference hardware is the next bottleneck. For a CTO running parallel headless agents overnight, the tooling surface area expanded meaningfully in seven days.
-
14 May 2026
Claude Code v2.1.141, Cursor Changelog, Codex Windows Sandbox
Claude Code ships hook notifications and workspace identity federation; Cursor and OpenAI Codex also push updates on the same day.
-
13 May 2026
Claude Code /goal, xAI Becomes SpaceXAI, Cline CLI v3.0
Claude Code v2.1.139 shipped a fire-and-forget `/goal` mode with a persistent `claude agents` dashboard for managing headless sessions.
-
12 May 2026
Claude Code v2.1.139, Gemini 3.1 Flash-Lite GA, llama.cpp Parallel Drafting
Claude Code v2.1.139 ships agent view, /goal command for autonomous multi-turn sessions, and Remote Control integration.
-
11 May 2026
DeepSeek V4 Pro Local, MTP Benchmark Analysis, Claude Code Obsidian Plugin
DeepSeek V4 Pro is now running locally on consumer hardware via Q4_K_M quants in llama.cpp, and MTP speculative decoding benchmarks reveal task-dependent speed gains.
-
10 May 2026
DeepSeek V4 Full Paper, Qwen 3.6 MTP Breakthroughs, Sonnet 4.5 Retiring
DeepSeek V4's full paper drops with FP4 QAT details showing 2× speedup at 99.7% recall, while local LLM community hits 80–135 tok/sec on consumer GPUs with Qwen 3.6.
-
9 May 2026
Codex in Chrome, Codex /goal Persistence, GitHub Token Efficiency
OpenAI shipped Codex as a browser-native agent running directly in Chrome tabs on macOS and Windows.
-
8 May 2026 MONTHLY DIGEST
GPT-5.5 and Codex Superapp, Claude Code w/ SpaceX Compute, DeepSeek V4 Million-Token Context
April was the month the coding-agent war went full-stack. OpenAI shipped GPT-5.5 (2× the price of 5.4, but fewer output tokens per task) alongside a radically expanded Codex desktop app — now a general-purpose agent surface with computer use, plugins, skills, memory, and Symphony orchestration turning issue trackers into agent control planes. Anthropic countered with Opus 4.7 (SWE-bench Verified 87.6%, new tokenizer, xhigh reasoning tier), then doubled Claude Code rate limits overnight via a SpaceX/Colossus compute deal, shipped Managed Agents with dreaming and multiagent orchestration, and launched Bugcrawl for repo-wide vulnerability scanning. Meanwhile DeepSeek V4 dropped as a 1.6T MoE with 1M-token context and 75% price cuts, and Moonshot's Kimi K2.6 (1T MoE, 32B active, 256K context) arrived as the new open-weight coding leader. For builders, the practical upshot: frontier agentic performance is now available at three price tiers, harness engineering matters as much as model choice, and overnight agent factories just got meaningfully cheaper to run.
-
8 May 2026 WEEKLY DIGEST
Anthropic-xAI Colossus Deal, GPT-5.5 Instant, Claude Managed Agents
Anthropic dominated the week: a $5B/yr compute deal with SpaceX/xAI for Colossus gives them 220,000+ GPUs, immediately doubling Claude Code rate limits for Pro/Max/Team/Enterprise. At Code w/ Claude they shipped Managed Agents (dreaming, outcomes, multiagent orchestration) and Claude Security public beta. Meanwhile OpenAI dropped GPT-5.5 Instant as the new default with a 2x price bump, and Cursor shipped v3.3 with four changelog updates. For your overnight-agent-factory setup, the doubled Claude Code limits and the new worktree baseRef setting in v2.1.133 are the most immediately actionable changes.
-
8 May 2026
Cursor 3.3, Claude Managed Agents, Gemma 4 MTP
Cursor 3.3 ships alongside Claude's new self-improving managed agents and a 40% local inference speedup via multi-token prediction for Gemma 4 on llama.cpp.
-
7 May 2026
Managed Agents Dreaming/Orchestration, Anthropic SpaceX Compute Deal, Claude Code v2.1.132
Anthropic's SpaceX/xAI compute deal doubles Claude Code rate limits and removes peak-hour throttling. New Claude Code v2.1.132 ships useful session-ID and alternate-screen env vars, and Anthropic's Managed Agents platform adds dreaming, outcomes, and multiagent orchestration.
-
6 May 2026
Code w/ Claude 2026, Anthropic SpaceX Deal, Qwen 3.6 MTP 2.5x
A major day for Claude Code and local inference: Anthropic held their 'Code w/ Claude 2026' event (live-blogged by Simon Willison), removed peak-hours throttling on Claude Code Pro/Max via a SpaceX compute deal, and shipped new plugin/dispatch features. Meanwhile, MTP (Multi-Token Prediction) is delivering 2-2.5x speedups for Qwen 3.6 27B locally, and Ollama shipped MTP support for Gemma 4 on Mac.
-
5 May 2026
GPT-5.5 Instant, Ollama Gemma 4 MTP, Transformers v5.8 DeepSeek-V4
OpenAI launched GPT-5.5 Instant as the new default ChatGPT model, Ollama shipped Gemma 4 MTP speculative decoding for Mac with 2x speed gains, and a practical r/LocalLLaMA post quantified exactly when local models beat cloud — directly relevant to your hybrid routing setup.
-
4 May 2026
Claude Code v2.1.128, Vercel deepsec Launch, Open Models Threshold
A Sunday with a notable Claude Code release (v2.1.128 with plugin archives and channel auth), a Cursor changelog drop, and strong signals from JetBrains and LangChain on open models closing the gap with frontier for agent tasks.
-
3 May 2026
Ollama v0.23.0 Claude Desktop, Eugene Yan AI Compounding, Anthropic Sycophancy Research
Ollama v0.23.0 ships Claude Desktop integration (including Claude Code support via local models), and Eugene Yan publishes a substantial piece on compounding with AI that aligns closely with your 'context, not control' framing.
-
2 May 2026 · quiet day
LiteLLM Stable Cut, llama.cpp Maintenance Release, Quiet AI News Day
_Quiet day — A genuinely quiet day for actionable AI engineering news. The notable items are incremental llama.cpp maintenance releases, a minor LiteLLM stable cut, and event announcements — nothing that changes practice today._
-
1 May 2026 WEEKLY DIGEST
Anthropic SpaceX/xAI Deal, DeepSeek V4 Launch, OpenAI Symphony Spec
This was Anthropic's week. The Code w/ Claude event, the SpaceX/xAI Colossus compute deal, the 80x annualised growth admission from Dario, and doubled rate limits for Pro/Max users — it all paints a picture of a company that's simultaneously capacity-constrained and sprinting ahead of its own infrastructure. The Colossus deal is the headline, but the engineering signal is in the details: Claude Code limits doubling, Anthropic's natural language autoencoders research (turning internal representations into inspectable text), and Simon Willison's observation that vibe coding and agentic engineering are converging in his own practice. Meanwhile, OpenAI shipped GPT-5.5 Instant as the new default, open-sourced Symphony (an orchestration spec for Codex), and put models on AWS — a clear multi-cloud breakout from Azure exclusivity.
-
1 May 2026
Claude Code v2.1.126, Apple Reinforced Agent, LangGraph 1.2.0a3
Claude Code v2.1.126 shipped with LiteLLM gateway model discovery and a useful project-purge command. Cursor posted a changelog, LangGraph alpha adds node-level error handlers and stream_events v3, and Simon Willison demonstrated building a full app on his phone with Claude Code.
-
30 Apr 2026
Codex CLI /goal Loop, GPT-5.5 Cyber Eval, Cursor Changelog Apr 30
Cursor shipped a changelog update, OpenAI's Codex CLI gained a goal-loop feature reminiscent of headless agent patterns, and LangGraph added node-level error handlers in an alpha — a relatively light day but with a few items directly relevant to your agentic workflows.
-
29 Apr 2026
Cursor SDK Release, DeepSeek-V4 Pro 512K, LangGraph Timers Alpha
A relatively quiet day anchored by Simon Willison's significant LLM library refactor (now modelling conversations rather than prompts), a Cursor SDK release, a minor Claude Code patch, and DeepSeek-V4 Pro landing with 512K context. LangGraph's alpha introduces timers and graceful shutdown — relevant for long-running agent infrastructure.