Anthropic $45B SpaceX Deal, GitLab 19.0, Google Agent Executor

Anthropic secures $45B compute deal with SpaceX while projecting its first profitable quarter on $10.9B revenue.

Must read

Anthropic to Pay SpaceX Nearly $45 Billion for Computing Deal — $1.25B/month for 300MW+ capacity signals Anthropic is scaling infrastructure aggressively — expect model capability jumps.
Anthropic Nears First Profitable Quarter on $10.9B Revenue — Revenue doubling quarterly validates Claude as your primary stack bet; profitability reduces platform risk.
Google Agent Executor: Distributed Agent Runtime — Open-source durable execution, session consistency, and trajectory branching — directly relevant to your overnight-agent-factory pattern.
GitLab 19.0 Released — MR-lifecycle AI agent, secrets manager beta, and self-hosted Duo model expansion — ships the Act 2 vision into product.
OpenAI Model Disproves 80-Year-Old Geometry Conjecture — First autonomous AI disproof of a prominent open maths problem — a concrete frontier-capability milestone worth tracking.

Tools & Frameworks

Datasette Agent 0.1a3

Simon Willison’s new extensible AI assistant merges LLM and Datasette into a conversational data-query interface with plugin support and sandboxed execution via Fly Sprites.

Why this matters: Interesting pattern for agent-augmented data exploration over Postgres-like stores.

LangChain: From Token Streams to Agent Streams

New streaming primitives in LangGraph enable typed events, scoped subscriptions, and subagent visibility for production agent UIs.

Why this matters: Relevant if you’re building frontend observability for multi-agent workflows.

LangSmith Auth Proxy for Agent Sandboxes

Infrastructure-level egress control keeps secrets out of agent runtimes while constraining network access per sandbox.

Why this matters: Security pattern directly applicable to your in-house MCP server sandboxing.

vibe-skill: Delegate Coding from Claude Code to Mistral

A Claude Code skill delegates coding to Mistral Vibe while keeping Claude for planning — 57M tokens saved, 90%+ cost reduction over 10 days.

Why this matters: Directly implements the hybrid routing pattern you use with LiteLLM; worth testing.

Vercel CLI: Anomaly Alerts with —ai Flag

New vercel alerts --ai command surfaces AI investigation results for anomalies, enabling agents to act on production alerts without the dashboard.

Why this matters: Your team deploys on Vercel — agents can now triage prod issues headlessly.

Building Agents From First Principles

Strips TRL/Unsloth abstractions to show every agent-training system reduces to prompt→action→environment→reward→gradient in pure Python.

Why this matters: Useful mental model for understanding RL-based agent training without framework lock-in.

Open Models & Local

Qwen 3.7 Open Weight Hype

Qwen 3.7 announced with open weights imminent; already available as Max variant on Vercel AI Gateway for agentic workloads.

Why this matters: If weights drop at the 35B-A3B tier, this could be your next local coding model.

110 tok/s on 12GB VRAM with ik_llama.cpp + Qwen3.6 35B-A3B

ik_llama.cpp fork achieves 110 tok/s with MTP on RTX 4070 Super 12GB — significantly faster than mainline llama.cpp post-MTP merge.

Why this matters: Concrete local-inference benchmark; relevant if you’re evaluating MoE models on consumer GPUs.

llama.cpp b9274 Fixes MTP VRAM Leak

New release fixes VRAM creep in MTP models by properly freeing draft resources on server sleep.

Why this matters: If you run llama.cpp with MTP for local coding, update to avoid OOM crashes.

Qwen3.6 35B-A3B Workflow Transformation

User runs Qwen3.6 locally via ‘pi’ agent, feeding Claude-generated skills for devops, Playwright testing, and code tickets — full local-plus-cloud hybrid.

Why this matters: Real-world validation of the skills-framework + local-model pattern you write about.

Meta Sends Legal Notice to Heretic Project

Meta’s legal team targets the Heretic Free Software Project — details unclear but signals tightening enforcement around Llama licence terms.

Why this matters: Watch-but-don’t-act: could affect how you redistribute or modify Llama-family weights.

Industry & Trends

Cheap AI Threatens OpenAI and Anthropic IPO Valuations

Falling inference costs and ‘advisor model’ patterns let enterprises cut spend, pressuring frontier labs’ pricing power ahead of IPOs.

Why this matters: Validates your LiteLLM routing strategy — cost arbitrage is becoming table stakes.

OpenAI Moves Toward September IPO

OpenAI preparing IPO as early as September following Musk lawsuit dismissal.

Why this matters: Context for platform-risk assessment across your model gateway.

Spotify: LLM Evals as a Funnel, Not a Fork

Spotify pairs offline LLM evals with online experiments to create a feedback loop that improves both over time.

Why this matters: Concrete eval methodology from a ~500-eng org — directly applicable to your team’s eval framework.

Daytona: 850K Daily Agent Runs on Bare Metal

Daytona reports 74% MoM growth and 850K daily sandbox runs, positioning as the compute layer for headless coding agents.

Why this matters: Alternative to Fly Sprites for your overnight-agent-factory sandboxing needs.

Google Adds llms.txt Check to Lighthouse

Chrome Lighthouse now audits for llms.txt under a new ‘Agentic Browsing’ category, signalling machine-readable site metadata becoming standard.

Why this matters: If your products expose APIs or docs, adding llms.txt improves agent discoverability.

AI Pricing: The Unsustainable Subsidy Ends

AI model pricing is rising as labs prioritise margins — the era of below-cost inference is closing.

Why this matters: Budget planning signal: expect your Claude/OpenAI spend to increase.

Org & Leadership

GitLab 19.0: AI Agent Automates Full MR Lifecycle

Developer Flow agent now handles reviewer feedback, conflict resolution, and rebasing — shipping the Act 2 ‘if an agent can do it, automate it’ principle into product.

Why this matters: Concrete implementation of the Act 2 blueprint you track; test against your GitHub Actions workflows.

Sources unavailable today: GitHub: Aider-AI/aider, GitHub: All-Hands-AI/OpenHands, GitHub: BerriAI/litellm, GitHub: anthropics/claude-code, GitHub: cline/cline, GitHub: crewAIInc/crewAI, GitHub: ggml-org/llama.cpp, GitHub: huggingface/text-generation-inference, GitHub: huggingface/transformers, GitHub: langchain-ai/langchain, GitHub: langchain-ai/langgraph, GitHub: microsoft/autogen, GitHub: ml-explore/mlx, GitHub: ollama/ollama, GitHub: princeton-nlp/SWE-agent, GitHub: sgl-project/sglang, GitHub: simonw/llm, GitHub: vllm-project/vllm

Auto-curated daily by Claude Opus 4.7 from Ben’s Bites, Don’t Worry About the Vase (Zvi), GitLab blog, Google DeepMind blog, LangChain blog, Latent Space, NVIDIA developer blog, OpenAI blog, Simon Willison, TLDR AI, The Pragmatic Engineer (Gergely Orosz), Vercel blog, r/ClaudeAI top, r/LocalLLaMA top. Source list and editorial profile maintained by Daniel.