Claude Code Fallbacks, Cursor 3.7, Anthropic RSI

Claude Code v2.1.166 ships fallback model chains and glob-based deny rules; Cursor 3.7 brings design-mode improvements.

Must read

Claude Code v2.1.166: fallback models, glob deny rules, hardened cross-session messaging. The fallbackModel setting lets your overnight agent factory degrade gracefully across models when the primary is overloaded, which is directly relevant to headless dispatch.
Cursor 3.7: Design Mode Improvements. New Cursor release; design mode is maturing into a viable React prototyping path for your frontend stack.
Anthropic: When AI Builds Itself (Recursive Self-Improvement). Internal benchmarks claim 8× code volume per engineer, which frames the ceiling for what your own Claude Code-heavy workflow could approach.
ChatGPT Dreaming V3: Memory Synthesis. New persistent-memory architecture for long-horizon continuity; relevant if you route any workflows through OpenAI via LiteLLM.
Simon Willison: Python Code Sandbox via MicroPython + WASM. Lightweight sandboxing pattern for agent code execution, directly applicable to your in-house MCP servers that need safe eval.
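
For the fallback-model item above, here is a rough sketch of the graceful-degradation pattern in generic Python. It is not Claude Code's implementation or its settings format. `call_with_fallback`, `OverloadedError`, `fake_client`, and the model names are all hypothetical, made up for illustration; the only idea taken from the release note is "try the primary, fall through to the next model when it is overloaded".

```python
from typing import Callable, Sequence


class OverloadedError(Exception):
    """Hypothetical error a client raises when a model is overloaded."""


def call_with_fallback(
    prompt: str,
    models: Sequence[str],
    call_model: Callable[[str, str], str],
) -> tuple[str, str]:
    """Try each model in order and return (model_used, response).

    Only OverloadedError triggers a fallback; any other exception
    propagates so real bugs are not silently masked.
    """
    last_error = None
    for model in models:
        try:
            return model, call_model(model, prompt)
        except OverloadedError as exc:
            last_error = exc  # this model is busy, try the next one
    raise RuntimeError("every model in the chain was overloaded") from last_error


def fake_client(model: str, prompt: str) -> str:
    """Stand-in client: the (made-up) primary model is always overloaded."""
    if model == "primary-model":
        raise OverloadedError(model)
    return f"[{model}] {prompt}"


used, reply = call_with_fallback(
    "summarise the diff", ["primary-model", "backup-model"], fake_client
)
print(used, reply)
```

Passing the client in as a callable keeps the chain logic testable without any network access; in a real headless run you would swap `fake_client` for whatever client you already use.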

Tools & Frameworks

Anthropic: Defending Code Reference Harness

Reference implementation for autonomous vulnerability discovery and remediation with Claude; ships as a pipeline you can customise.

Why this matters: Plug it into CI to run security scans on your identity/fraud codebase.

Braintrust Topics: Trace Intelligence at Scale

Pipeline for clustering and classifying million-token agent traces with hundreds of spans, inspired by Anthropic’s Clio paper.

Why this matters: Useful eval layer for your overnight agent factory’s production traces.

Vercel skills.sh API — 600K+ Open-Source Skills

Searchable API of 600,000+ open-source skills with security audits, authenticated via Vercel OIDC tokens.

Why this matters: Potential skills-registry integration for your agent-skills framework.

Vercel Sandbox Drives — Persistent Storage (Private Beta)

Drives provide persistent, attachable storage independent of the sandbox lifecycle; available in the beta SDK.

Why this matters: Enables stateful agent sandboxes on Vercel if you deploy there.

Ollama Model Tester CLI

CLI tool for running the same prompt across multiple local Ollama models and saving responses for comparison.

Why this matters: Quick local model eval for your hybrid routing decisions.
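
The same idea can be sketched in a few lines of Python. This is not the CLI tool's code, only a minimal illustration of "one prompt, several local models, responses saved for comparison". It assumes a local Ollama server on its default port and uses Ollama's `/api/generate` endpoint; `compare_models`, `ollama_generate`, and the file-naming scheme are hypothetical names chosen here.

```python
import json
import urllib.request
from pathlib import Path
from typing import Callable, Sequence

# Default address of a local Ollama server (assumed to be running).
OLLAMA_URL = "http://localhost:11434/api/generate"


def ollama_generate(model: str, prompt: str) -> str:
    """Send one non-streaming generate request to the local Ollama server."""
    payload = json.dumps({"model": model, "prompt": prompt, "stream": False}).encode()
    request = urllib.request.Request(
        OLLAMA_URL, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]


def compare_models(
    prompt: str,
    models: Sequence[str],
    out_dir: str,
    generate: Callable[[str, str], str] = ollama_generate,
) -> dict[str, str]:
    """Run the same prompt on each model and save one text file per model."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    results: dict[str, str] = {}
    for model in models:
        results[model] = generate(model, prompt)
        # Model tags contain ":" and sometimes "/", which are awkward in filenames.
        safe_name = model.replace(":", "_").replace("/", "_")
        (out / f"{safe_name}.txt").write_text(results[model], encoding="utf-8")
    return results
```

Usage would look like `compare_models("Explain CRDTs in two sentences", ["model-a", "model-b"], "runs/crdt")`, with the model tags replaced by whatever you have pulled locally. The `generate` parameter is there so you can dry-run the loop with a stub before hitting real models.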

LangChain: Give Your Agent Its Own Computer

Architecture guide for provisioning isolated compute environments (filesystem, shell, packages) per agent task at scale.

Why this matters: Pattern reference for scaling your headless agent infrastructure.

Open Models & Local

Ollama v0.30.6 — NVFP4 MLX Embeddings, Oh My Pi Integration

MLX embedding layers now use NVFP4 global scale for improved quantisation on Apple Silicon; adds Oh My Pi coding agent launcher.

Why this matters: Better local quantisation quality on your Mac — directly improves hybrid routing viability.

Tom Tunguz: 78% of AI Work Now Runs Locally

Skill-distillation workflow routes easy tasks to local models, complex ones to cloud; 78% handled on-device.

Why this matters: Real-world local-plus-cloud hybrid playbook matching your routing architecture.
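
As a toy illustration of the easy-to-local, complex-to-cloud split, the sketch below routes on two crude signals. It is not Tunguz's workflow: the keyword list, the word budget, and the names `route_task`, `local_share`, and `HARD_MARKERS` are assumptions invented here, and a real router would more likely use a classifier or past success rates.

```python
from dataclasses import dataclass
from typing import Sequence

# Assumed signals that a task is too complex for a small local model.
HARD_MARKERS = ("refactor", "architecture", "prove", "multi-step")


@dataclass
class Route:
    target: str  # "local" or "cloud"
    reason: str


def route_task(prompt: str, max_local_words: int = 200) -> Route:
    """Send a task to the cloud if it looks hard or long, otherwise keep it local."""
    lowered = prompt.lower()
    for marker in HARD_MARKERS:
        if marker in lowered:
            return Route("cloud", f"contains hard marker '{marker}'")
    if len(prompt.split()) > max_local_words:
        return Route("cloud", "prompt exceeds the local word budget")
    return Route("local", "short prompt with no hard markers")


def local_share(prompts: Sequence[str]) -> float:
    """Fraction of prompts routed locally (0.0 for an empty batch)."""
    if not prompts:
        return 0.0
    routed = [route_task(p) for p in prompts]
    return sum(r.target == "local" for r in routed) / len(routed)
```

Tracking `local_share` over your own task log is the quickest way to see how your split compares with the on-device figure quoted above.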

vLLM v0.22.1 — DeepSeek-V4 Fix, Mellum v2 Support

Patch release adds JetBrains Mellum v2 MoE model support, fixes DeepSeek-V4 init and multi-node Ray serving.

Why this matters: Relevant if you self-host inference behind LiteLLM.

Industry & Trends

Anthropic: 80% of Production Code Now Authored by Claude

Anthropic reports 80% AI-authored production code and 8× code volume per engineer; details internal workflow patterns.

Why this matters: Benchmark for your own Claude Code adoption metrics.

Claude Oceanus (Mythos Successor) in Red Team

New Anthropic model checkpoint ‘claude-oceanus-v1-p’ is in red-team testing; public launch likely within a week.

Why this matters: Potential model upgrade for your Claude Code and API workflows — watch for launch.

Ladybird Stops Accepting Public PRs Due to AI-Generated Code

Ladybird browser project closes public PRs; AI-generated patches broke the assumption that effort implies good faith.

Why this matters: Concrete example of the 22,000-line PR verification problem you write about.

How to Stop Shipping Low-Quality RL Environments

Practical guide to spotting and fixing broken RL harnesses that actively degrade model training; covers data composition and teacher guidance.

Why this matters: Actionable if your team fine-tunes or evaluates models internally.

OpenAI Lockdown Mode Now Live

Lockdown Mode prevents data exfiltration from ChatGPT; rolling out to Free, Plus, Pro, and Business accounts.

Why this matters: Security hardening relevant if team members use ChatGPT with sensitive context.

Sources unavailable today: r/ChatGPTCoding top, r/ClaudeAI top, r/LocalLLaMA top, r/MachineLearning top

Auto-curated daily by Claude Opus 4.7 from Cursor changelog, Don’t Worry About the Vase (Zvi), GitHub: anthropics/claude-code, GitHub: cline/cline, GitHub: ggml-org/llama.cpp, GitHub: langchain-ai/langchain, GitHub: ollama/ollama, GitHub: vllm-project/vllm, Hugging Face blog, LangChain blog, Latent Space, Not Boring (Packy McCormick), SaaStr (Jason Lemkin), Simon Willison, TLDR AI, Tomasz Tunguz, Vercel blog, smol.ai news. Source list and editorial profile maintained by Daniel.