AI Tracker
AI News Tracker
Un briefing exécutif quotidien sur la GenAI, l'outillage de développement agentique et les LLMs open-source utilisables en local - plus un digest hebdomadaire du vendredi et une revue de fin de mois centrée sur l'AI Engineering. Curation automatique par Claude Opus 4.7 à partir de plus de 60 sources.
Les briefings sont publiés en anglais, car ils sont générés automatiquement à partir de sources primaires anglophones.
-
28 mai 2026
xAI-Cursor Acquisition, Anthropic Containment Guide, Claude Code v2.1.153
xAI has warned staff to limit contact with Cursor employees as their acquisition progresses, signalling imminent structural changes to the most popular AI coding editor.
-
27 mai 2026
Claude Code v2.1.152, Copilot Cowork Exfiltration, GPT-5.6 Leak
Claude Code v2.1.152 ships `/code-review --fix` auto-apply and skill-level tool disabling, directly upgrading agentic coding workflows.
-
26 mai 2026
MCP Spec Overhaul, Anthropic Mythos 1, DeepSeek V4 Price Cut
The next MCP specification release candidate drops with a stateless HTTP core, OAuth alignment, and breaking changes shipping 28 July.
-
25 mai 2026 · journée calme
Task-Observer Skills, Qwen3.6 vs Gemma4 Local, Anthropic SMB Skills
---
-
24 mai 2026 · journée calme
llama.cpp Native Tools, Claude Code Offline LLMs, LiteLLM v1.86
---
-
23 mai 2026
Cursor $3B Revenue, Qwen3.7 Agent Model, Cursor Cloud Agent Lessons
Cursor hits $3 billion annualised revenue with SpaceX's $60B acquisition option looming, while Qwen3.7-Max launches as a top-scoring agent-foundation model.
-
22 mai 2026 DIGEST HEBDO
Anthropic $45B SpaceX Deal, Cursor $3B ARR, Gemini 3.5 Flash
Anthropic's $45 billion compute deal with SpaceX — $1.25B/month for three years — is the week's landmark. It signals that the compute constraint is now the binding one for frontier labs, not talent or data. Simultaneously, Cursor hit $3B ARR with 3,000+ enterprise customers, validating agentic coding as a category. Google shipped Gemini 3.5 Flash at I/O with agentic-first positioning, Karpathy joined Anthropic, and Cursor released both Composer 2.5 and a detailed cloud-agent architecture post. For a CTO running headless agent fleets, the infrastructure layer beneath your agents is consolidating fast — and the cost of frontier inference is about to rise.
-
22 mai 2026
Anthropic $45B SpaceX Deal, GitLab 19.0, Google Agent Executor
Anthropic secures $45B compute deal with SpaceX while projecting its first profitable quarter on $10.9B revenue.
-
21 mai 2026
Cursor 3.5, Karpathy Joins Anthropic, Anthropic-xAI $15B/yr
Cursor ships 3.5, Karpathy joins Anthropic for frontier R&D, and SpaceX's S-1 reveals Anthropic is paying $15B/year for Colossus compute.
-
20 mai 2026
Gemini 3.5 Flash, Anthropic Acquires Stainless, Composer 2.5
Google I/O shipped Gemini 3.5 Flash to GA with improved agentic execution, while Anthropic acquired Stainless and Cursor released Composer 2.5.
-
19 mai 2026
Claude Code v2.1.144, Cursor Composer 2.5, Deep Agents v0.6
Claude Code v2.1.144 ships /resume for background sessions, and Anthropic engineers explain their Claude Code workflows on Lenny's Podcast.
-
18 mai 2026
llama.cpp MTP Speedup, Claude Context Tools, DeepSeek V4 1M Context
llama.cpp ships MTP prompt-decode optimisation while community benchmarks DeepSeek V4's 1M context window across real codebases.
-
17 mai 2026
SGLang DeepSeek V4, llama.cpp MTP Support, LiteLLM v1.85
SGLang v0.5.12 ships full DeepSeek V4 inference with expert parallelism and disaggregated prefill-decode, while llama.cpp lands native MTP speculative decoding.
-
16 mai 2026
Grok Build CLI, Cursor Cloud Agents, Claude Code v2.1.143
xAI launched Grok Build, a terminal coding agent with worktrees, subagents, and headless mode — directly competing with Claude Code.
-
15 mai 2026 DIGEST HEBDO
Grok Build CLI, Claude Code Agent View, Cerebras $5.5B IPO
The coding-agent arms race escalated sharply this week. xAI shipped Grok Build — a terminal agent with worktrees, subagents, headless mode, and MCP support — directly challenging Claude Code's territory. Claude Code itself landed agent view (`claude agents`), `/goal` for persistent multi-turn loops, and fast mode on Opus 4.7. OpenAI pushed Codex everywhere: mobile, Chrome, Windows sandbox, hooks, and programmatic tokens. Meanwhile Cerebras raised $5.5B in the year's biggest IPO, signalling Wall Street's conviction that inference hardware is the next bottleneck. For a CTO running parallel headless agents overnight, the tooling surface area expanded meaningfully in seven days.
-
14 mai 2026
Claude Code v2.1.141, Cursor Changelog, Codex Windows Sandbox
Claude Code ships hook notifications and workspace identity federation; Cursor and OpenAI Codex also push updates on the same day.
-
13 mai 2026
Claude Code /goal, xAI Becomes SpaceXAI, Cline CLI v3.0
Claude Code v2.1.139 shipped a fire-and-forget `/goal` mode with a persistent `claude agents` dashboard for managing headless sessions.
-
12 mai 2026
Claude Code v2.1.139, Gemini 3.1 Flash-Lite GA, llama.cpp Parallel Drafting
Claude Code v2.1.139 ships agent view, /goal command for autonomous multi-turn sessions, and Remote Control integration.
-
11 mai 2026
DeepSeek V4 Pro Local, MTP Benchmark Analysis, Claude Code Obsidian Plugin
DeepSeek V4 Pro is now running locally on consumer hardware via Q4_K_M quants in llama.cpp, and MTP speculative decoding benchmarks reveal task-dependent speed gains.
-
10 mai 2026
DeepSeek V4 Full Paper, Qwen 3.6 MTP Breakthroughs, Sonnet 4.5 Retiring
DeepSeek V4's full paper drops with FP4 QAT details showing 2× speedup at 99.7% recall, while local LLM community hits 80–135 tok/sec on consumer GPUs with Qwen 3.6.
-
9 mai 2026
Codex in Chrome, Codex /goal Persistence, GitHub Token Efficiency
OpenAI shipped Codex as a browser-native agent running directly in Chrome tabs on macOS and Windows.
-
8 mai 2026 DIGEST HEBDO
Anthropic-xAI Colossus Deal, GPT-5.5 Instant, Claude Managed Agents
Anthropic dominated the week: a $5B/yr compute deal with SpaceX/xAI for Colossus gives them 220,000+ GPUs, immediately doubling Claude Code rate limits for Pro/Max/Team/Enterprise. At Code w/ Claude they shipped Managed Agents (dreaming, outcomes, multiagent orchestration) and Claude Security public beta. Meanwhile OpenAI dropped GPT-5.5 Instant as the new default with a 2x price bump, and Cursor shipped v3.3 with four changelog updates. For your overnight-agent-factory setup, the doubled Claude Code limits and the new worktree baseRef setting in v2.1.133 are the most immediately actionable changes.
-
8 mai 2026 DIGEST MENSUEL
GPT-5.5 and Codex Superapp, Claude Code w/ SpaceX Compute, DeepSeek V4 Million-Token Context
April was the month the coding-agent war went full-stack. OpenAI shipped GPT-5.5 (2× the price of 5.4, but fewer output tokens per task) alongside a radically expanded Codex desktop app — now a general-purpose agent surface with computer use, plugins, skills, memory, and Symphony orchestration turning issue trackers into agent control planes. Anthropic countered with Opus 4.7 (SWE-bench Verified 87.6%, new tokenizer, xhigh reasoning tier), then doubled Claude Code rate limits overnight via a SpaceX/Colossus compute deal, shipped Managed Agents with dreaming and multiagent orchestration, and launched Bugcrawl for repo-wide vulnerability scanning. Meanwhile DeepSeek V4 dropped as a 1.6T MoE with 1M-token context and 75% price cuts, and Moonshot's Kimi K2.6 (1T MoE, 32B active, 256K context) arrived as the new open-weight coding leader. For builders, the practical upshot: frontier agentic performance is now available at three price tiers, harness engineering matters as much as model choice, and overnight agent factories just got meaningfully cheaper to run.
-
8 mai 2026
Cursor 3.3, Claude Managed Agents, Gemma 4 MTP
Cursor 3.3 ships alongside Claude's new self-improving managed agents and a 40% local inference speedup via multi-token prediction for Gemma 4 on llama.cpp.
-
7 mai 2026
Managed Agents Dreaming/Orchestration, Anthropic SpaceX Compute Deal, Claude Code v2.1.132
Anthropic's SpaceX/xAI compute deal doubles Claude Code rate limits and removes peak-hour throttling. New Claude Code v2.1.132 ships useful session-ID and alternate-screen env vars, and Anthropic's Managed Agents platform adds dreaming, outcomes, and multiagent orchestration.
-
6 mai 2026
Code w/ Claude 2026, Anthropic SpaceX Deal, Qwen 3.6 MTP 2.5x
A major day for Claude Code and local inference: Anthropic held their 'Code w/ Claude 2026' event (live-blogged by Simon Willison), removed peak-hours throttling on Claude Code Pro/Max via a SpaceX compute deal, and shipped new plugin/dispatch features. Meanwhile, MTP (Multi-Token Prediction) is delivering 2-2.5x speedups for Qwen 3.6 27B locally, and Ollama shipped MTP support for Gemma 4 on Mac.
-
5 mai 2026
GPT-5.5 Instant, Ollama Gemma 4 MTP, Transformers v5.8 DeepSeek-V4
OpenAI launched GPT-5.5 Instant as the new default ChatGPT model, Ollama shipped Gemma 4 MTP speculative decoding for Mac with 2x speed gains, and a practical r/LocalLLaMA post quantified exactly when local models beat cloud — directly relevant to your hybrid routing setup.
-
4 mai 2026
Claude Code v2.1.128, Vercel deepsec Launch, Open Models Threshold
A Sunday with a notable Claude Code release (v2.1.128 with plugin archives and channel auth), a Cursor changelog drop, and strong signals from JetBrains and LangChain on open models closing the gap with frontier for agent tasks.
-
3 mai 2026
Ollama v0.23.0 Claude Desktop, Eugene Yan AI Compounding, Anthropic Sycophancy Research
Ollama v0.23.0 ships Claude Desktop integration (including Claude Code support via local models), and Eugene Yan publishes a substantial piece on compounding with AI that aligns closely with your 'context, not control' framing.
-
2 mai 2026 · journée calme
LiteLLM Stable Cut, llama.cpp Maintenance Release, Quiet AI News Day
_Quiet day — A genuinely quiet day for actionable AI engineering news. The notable items are incremental llama.cpp maintenance releases, a minor LiteLLM stable cut, and event announcements — nothing that changes practice today._
-
1 mai 2026 DIGEST HEBDO
Anthropic SpaceX/xAI Deal, DeepSeek V4 Launch, OpenAI Symphony Spec
This was Anthropic's week. The Code w/ Claude event, the SpaceX/xAI Colossus compute deal, the 80x annualised growth admission from Dario, and doubled rate limits for Pro/Max users — it all paints a picture of a company that's simultaneously capacity-constrained and sprinting ahead of its own infrastructure. The Colossus deal is the headline, but the engineering signal is in the details: Claude Code limits doubling, Anthropic's natural language autoencoders research (turning internal representations into inspectable text), and Simon Willison's observation that vibe coding and agentic engineering are converging in his own practice. Meanwhile, OpenAI shipped GPT-5.5 Instant as the new default, open-sourced Symphony (an orchestration spec for Codex), and put models on AWS — a clear multi-cloud breakout from Azure exclusivity.
-
1 mai 2026
Claude Code v2.1.126, Apple Reinforced Agent, LangGraph 1.2.0a3
Claude Code v2.1.126 shipped with LiteLLM gateway model discovery and a useful project-purge command. Cursor posted a changelog, LangGraph alpha adds node-level error handlers and stream_events v3, and Simon Willison demonstrated building a full app on his phone with Claude Code.
-
30 avr. 2026
Codex CLI /goal Loop, GPT-5.5 Cyber Eval, Cursor Changelog Apr 30
Cursor shipped a changelog update, OpenAI's Codex CLI gained a goal-loop feature reminiscent of headless agent patterns, and LangGraph added node-level error handlers in an alpha — a relatively light day but with a few items directly relevant to your agentic workflows.
-
29 avr. 2026
Cursor SDK Release, DeepSeek-V4 Pro 512K, LangGraph Timers Alpha
A relatively quiet day anchored by Simon Willison's significant LLM library refactor (now modelling conversations rather than prompts), a Cursor SDK release, a minor Claude Code patch, and DeepSeek-V4 Pro landing with 512K context. LangGraph's alpha introduces timers and graceful shutdown — relevant for long-running agent infrastructure.