Anthropic $45B SpaceX Deal, GitLab 19.0, Google Agent Executor
Freitag, 22. Mai 2026 - AI News · (letzte 24h)
Anthropic secures $45B compute deal with SpaceX while projecting its first profitable quarter on $10.9B revenue.
Must read
- Anthropic to Pay SpaceX Nearly $45 Billion for Computing Deal — $1.25B/month for 300MW+ capacity signals Anthropic is scaling infrastructure aggressively — expect model capability jumps.
- Anthropic Nears First Profitable Quarter on $10.9B Revenue — Revenue doubling quarterly validates Claude as your primary stack bet; profitability reduces platform risk.
- Google Agent Executor: Distributed Agent Runtime — Open-source durable execution, session consistency, and trajectory branching — directly relevant to your overnight-agent-factory pattern.
- GitLab 19.0 Released — MR-lifecycle AI agent, secrets manager beta, and self-hosted Duo model expansion — ships the Act 2 vision into product.
- OpenAI Model Disproves 80-Year-Old Geometry Conjecture — First autonomous AI disproof of a prominent open maths problem — a concrete frontier-capability milestone worth tracking.
Tools & Frameworks
Datasette Agent 0.1a3
Simon Willison’s new extensible AI assistant merges LLM and Datasette into a conversational data-query interface with plugin support and sandboxed execution via Fly Sprites.
Why this matters: Interesting pattern for agent-augmented data exploration over Postgres-like stores.
LangChain: From Token Streams to Agent Streams
New streaming primitives in LangGraph enable typed events, scoped subscriptions, and subagent visibility for production agent UIs.
Why this matters: Relevant if you’re building frontend observability for multi-agent workflows.
LangSmith Auth Proxy for Agent Sandboxes
Infrastructure-level egress control keeps secrets out of agent runtimes while constraining network access per sandbox.
Why this matters: Security pattern directly applicable to your in-house MCP server sandboxing.
vibe-skill: Delegate Coding from Claude Code to Mistral
A Claude Code skill delegates coding to Mistral Vibe while keeping Claude for planning — 57M tokens saved, 90%+ cost reduction over 10 days.
Why this matters: Directly implements the hybrid routing pattern you use with LiteLLM; worth testing.
Vercel CLI: Anomaly Alerts with —ai Flag
New vercel alerts --ai command surfaces AI investigation results for anomalies, enabling agents to act on production alerts without the dashboard.
Why this matters: Your team deploys on Vercel — agents can now triage prod issues headlessly.
Building Agents From First Principles
Strips TRL/Unsloth abstractions to show every agent-training system reduces to prompt→action→environment→reward→gradient in pure Python.
Why this matters: Useful mental model for understanding RL-based agent training without framework lock-in.
Open Models & Local
Qwen 3.7 Open Weight Hype
Qwen 3.7 announced with open weights imminent; already available as Max variant on Vercel AI Gateway for agentic workloads.
Why this matters: If weights drop at the 35B-A3B tier, this could be your next local coding model.
110 tok/s on 12GB VRAM with ik_llama.cpp + Qwen3.6 35B-A3B
ik_llama.cpp fork achieves 110 tok/s with MTP on RTX 4070 Super 12GB — significantly faster than mainline llama.cpp post-MTP merge.
Why this matters: Concrete local-inference benchmark; relevant if you’re evaluating MoE models on consumer GPUs.
llama.cpp b9274 Fixes MTP VRAM Leak
New release fixes VRAM creep in MTP models by properly freeing draft resources on server sleep.
Why this matters: If you run llama.cpp with MTP for local coding, update to avoid OOM crashes.
Qwen3.6 35B-A3B Workflow Transformation
User runs Qwen3.6 locally via ‘pi’ agent, feeding Claude-generated skills for devops, Playwright testing, and code tickets — full local-plus-cloud hybrid.
Why this matters: Real-world validation of the skills-framework + local-model pattern you write about.
Meta Sends Legal Notice to Heretic Project
Meta’s legal team targets the Heretic Free Software Project — details unclear but signals tightening enforcement around Llama licence terms.
Why this matters: Watch-but-don’t-act: could affect how you redistribute or modify Llama-family weights.
Industry & Trends
Cheap AI Threatens OpenAI and Anthropic IPO Valuations
Falling inference costs and ‘advisor model’ patterns let enterprises cut spend, pressuring frontier labs’ pricing power ahead of IPOs.
Why this matters: Validates your LiteLLM routing strategy — cost arbitrage is becoming table stakes.
OpenAI Moves Toward September IPO
OpenAI preparing IPO as early as September following Musk lawsuit dismissal.
Why this matters: Context for platform-risk assessment across your model gateway.
Spotify: LLM Evals as a Funnel, Not a Fork
Spotify pairs offline LLM evals with online experiments to create a feedback loop that improves both over time.
Why this matters: Concrete eval methodology from a ~500-eng org — directly applicable to your team’s eval framework.
Daytona: 850K Daily Agent Runs on Bare Metal
Daytona reports 74% MoM growth and 850K daily sandbox runs, positioning as the compute layer for headless coding agents.
Why this matters: Alternative to Fly Sprites for your overnight-agent-factory sandboxing needs.
Google Adds llms.txt Check to Lighthouse
Chrome Lighthouse now audits for llms.txt under a new ‘Agentic Browsing’ category, signalling machine-readable site metadata becoming standard.
Why this matters: If your products expose APIs or docs, adding llms.txt improves agent discoverability.
AI Pricing: The Unsustainable Subsidy Ends
AI model pricing is rising as labs prioritise margins — the era of below-cost inference is closing.
Why this matters: Budget planning signal: expect your Claude/OpenAI spend to increase.
Org & Leadership
GitLab 19.0: AI Agent Automates Full MR Lifecycle
Developer Flow agent now handles reviewer feedback, conflict resolution, and rebasing — shipping the Act 2 ‘if an agent can do it, automate it’ principle into product.
Why this matters: Concrete implementation of the Act 2 blueprint you track; test against your GitHub Actions workflows.
Sources unavailable today: GitHub: Aider-AI/aider, GitHub: All-Hands-AI/OpenHands, GitHub: BerriAI/litellm, GitHub: anthropics/claude-code, GitHub: cline/cline, GitHub: crewAIInc/crewAI, GitHub: ggml-org/llama.cpp, GitHub: huggingface/text-generation-inference, GitHub: huggingface/transformers, GitHub: langchain-ai/langchain, GitHub: langchain-ai/langgraph, GitHub: microsoft/autogen, GitHub: ml-explore/mlx, GitHub: ollama/ollama, GitHub: princeton-nlp/SWE-agent, GitHub: sgl-project/sglang, GitHub: simonw/llm, GitHub: vllm-project/vllm
Auto-curated daily by Claude Opus 4.7 from Ben’s Bites, Don’t Worry About the Vase (Zvi), GitLab blog, Google DeepMind blog, LangChain blog, Latent Space, NVIDIA developer blog, OpenAI blog, Simon Willison, TLDR AI, The Pragmatic Engineer (Gergely Orosz), Vercel blog, r/ClaudeAI top, r/LocalLLaMA top. Source list and editorial profile maintained by Daniel.