Gemma 4 MTP in llama.cpp, datasette-agent-edit, llama.cpp KV-cache fixes
Monday, June 8, 2026 - AI News · (last 24 hours)
Quiet day: a quiet Saturday, and the notable ship is llama.cpp b9549, which adds Gemma 4 multi-token prediction (MTP) support for local inference on Apple Silicon.
Sources unavailable today: r/ChatGPTCoding top, r/ClaudeAI top, r/LocalLLaMA top, r/MachineLearning top
Auto-curated daily by Claude Opus 4.7 from GitHub: ggml-org/llama.cpp, Hugging Face blog, Lenny’s Newsletter, SaaStr (Jason Lemkin), Simon Willison, The Algorithmic Bridge (Alberto Romero). Source list and editorial profile maintained by Daniel.