photo: Unsplash contributor · Unsplash
Signal & Noise · 2026-06-16

The one-way door swung this week

2026-06-16 9-day window 2026-06-07 → 2026-06-16

Anthropic shipped Fable 5 and Mythos 5 — biggest model jump since Opus 4.5.

Within days, US Commerce Department effectively suspended access.

Lambert called it the AGI era of governance.

Meanwhile a sharper skeptic counter-bloc tightened around LeCun's post-LLM bet and Narayanan's normal-technology frame.

This week
26 voices · 9-day window
№ 01 · Lead

Fable 5 / Mythos 5 release and the 96-hour governance collision

Capability arrived; constraint arrived faster. The substrate for shipping AI products materially shifted between Tuesday and Friday.

On Monday June 9, Anthropic released Claude Fable 5 and Mythos 5 — the largest model-capability jump anyone had felt in seven months. Mollick called it a genuine jump. Willison ran 5.5 hours of hands-on and named it relentlessly proactive. Boris Cherny, inside Anthropic, said the biggest step up since Opus 4.5 back in November. Karpathy — who joined Anthropic earlier this quarter — tweeted super exciting release. The capability story was clean and the buyer-side reaction near-unanimous.

By Friday, the US Commerce Department had issued a directive that effectively suspended access to both models. Marcus called it the nuclear option after two years of underregulating AI. Axios reported personality clashes inside Anthropic had taken the models offline. Wired reported Anthropic walked back a separate policy that allegedly let Fable throttle competitor AI research. Helen Toner read Trump's national security presidential memorandum on AI like — and didn't finish the sentence. The capability went into a regulatory wall in 96 hours.

Welcome to the AGI era of AI governance. It's a one-way door and we weren't ready for it.Nathan Lambert (@natolambert)

Nathan Lambert named the underlying shape: welcome to the AGI era of AI governance, a one-way door we weren't ready for. The Bezos two-way-door frame applied to regulatory state — the directive sets a precedent that isn't easily reversed. For anyone building a product on top of Anthropic API, the practical implication landed this week, not next quarter: the regulatory state-from-this-week-forward is materially different from the state-from-last-month. Plan accordingly.

Underneath the headline, the skeptic counter-bloc tightened. Yann LeCun left Meta to found AMI, his post-LLM company, saying the change of paradigm will be obvious by early 2027. Arvind Narayanan published why AI hasn't replaced software engineers, and won't, framing coding agents as normal technology. François Chollet kept hitting the same calibration note: near-term AI is the newest form of digital infrastructure, not a different kind of thing. Gary Marcus said the industry is propped up by math that is insane and pointed at the Burn-Murdoch FT productivity graph as data. None of these is a stop-shipping signal. All of them are a calibrate-your-claims signal.

The harder question isn't whether the skeptics are right about scaling. It's whether the operational substrate — eval workspaces, verification loops, write-scope discipline — survives the next architectural shift, or whether it has to be rebuilt against world-models or JEPA-class architectures within 18 months. The bet shipped this week wasn't on Fable 5. It was on which substrate to invest the next year of craft on.

AnthropicMollickWillisonChernyKarpathyMarcusLambertTonerRaschkaSchluntz
Also this week
№ 02

The skeptic counter-bloc converges on calibration

Four voices, four independent threads, one underlying shape. LeCun leaves Meta, founds AMI, predicts paradigm shift obvious by early 2027 — LLMs totally suck at continuous high-dimensional noisy data; language is the easy case. Marcus: industry propped up by math that is insane, Burn-Murdoch FT productivity graph as confirming evidence, German court hallucination liability eroding Section 230 shield. Narayanan + Kapoor: why AI hasn't replaced software engineers, and won't — coding agents as normal technology, augmentation not replacement. Chollet: when an AI tells me I'm absolutely right, my trust drops; near-term AI is the newest form of digital infrastructure, not a different kind. Mitchell's Yale Review jagged-intelligence piece supplies the calibration frame in canonical form. None of this is a stop-shipping signal. All of it is a scope-by-demonstrated-capability signal.

LeCun · Marcus · Narayanan · Chollet · Mitchell
№ 03

The Claude Code substrate matures: nested subagents, self-verification loops, teams of Mythos

bcherny ships nested subagent support — agents kicking off agents is now a first-class primitive. Schluntz: teams of Claude Mythos agents 3x faster than single; head of Claude Code hasn't written code by hand in months; 49 features shipped in 2 days, 100% AI-written. The bcherny / __catwu one-year-after-GA retrospective surfaces the self-verification loops doctrine — powerful models need their own output verification inside the loop, not after. Cross-link to JC-OS perfect-commit Move 2: same shape, different vocabulary. Karpathy's Jevon's-paradox quote about working software coming out on a tap is the macro frame this lands on.

Cherny · Wu · Schluntz · Karpathy
№ 04

The architectural substrate underneath: GEPA, DiffusionGemma, MoE-vs-dense, JEPA

Quiet but recurring substrate signals. Berkeley's GEPA team ships optimize_anything (arXiv 2605.19633) — Pareto-frontier text-artifact evolution with Actionable Side Information as the central concept; results across CUDA kernels, scheduling, circle packing. Google's DiffusionGemma — text via diffusion, faster than autoregressive. Local-inference decision tree sharpens around MoE-vs-dense and expert offload (Qwen3.6-35B-A3B hits 33.5 tok/s on 8GB VRAM via llama.cpp `-ngl 99 -ncmoe 99`). Tri Dao reframes — transformer is gemm + epilogue; kernel-level tuning beats architecture-swap at inference scale; NVIDIA 90% market share. The post-LLM bets — JEPA (LeCun) for robotics, World Labs (Fei-Fei Li) for spatial-physical — sit underneath. Today's architecture ships; tomorrow's may be different in kind. The eval-workspace + verification-loop discipline survives architecture swaps; harness-level scaffolding may not.

Agrawal · Willison · LeCun · Fei-Fei Li · Tri Dao · Raschka
04 / How to Do It This Week

The practitioner synthesis

Prompting & inference 03
  • Author lead_body and briefs as explicit editorial artifacts — fallback rendering is for outages, not defaults
    via Episode 7 retro / modes/digest.md Phase 2f-bis
  • Treat agreement-first language as data about the model, not the proposal — sycophancy heuristic
  • Stress-test Fable 5's relentlessly proactive mode on tasks where over-action costs more than under-action
Tools, repos, libraries 08
  • gepa + optimize_anything
    Pareto-frontier prompt and artifact evolution with Actionable Side Information — pip install git+https://github.com/gepa-ai/gepa.git
    via Lakshya Agrawal · github.com/gepa-ai/gepa
  • dspy.GEPA
    Drop-in GEPA optimizer for DSPy pipelines
    via Lakshya Agrawal · dspy.ai/tutorials/gepa_ai_program/
  • datasette-agent 0.3a0
    Adds execute_write_sql tool; almost entirely written by Claude Fable 5
  • llm 0.32a3
    Simon Willison's CLI, Fable-5-authored release
    via Simon Willison · github.com/simonw/llm
  • luau-wasm 0.1a0
    Lua VM compiled to WASM, distributed as PyPI wheel
    via Simon Willison · github.com/simonw/luau-wasm
  • asyncinject 0.7
    asyncio dependency-injection utility, Fable-rewritten
    via Simon Willison · github.com/simonw/asyncinject
  • cua + cua-driver
    Computer-use skill that works with Fable + any model; integrates with Ghostty-based IDEs
    via Cat Wu (retweet) · github.com/trycua/cua
  • LLMs-from-scratch (DeepSeek Sparse Attention)
    From-scratch DSA implementation in the canonical pedagogical repo
    via Sebastian Raschka · github.com/rasbt/LLMs-from-scratch
Architectural & model-selection 04
  • MoE vs dense decision tree for local inference
    8GB VRAM + 64GB system RAM → MoE w/ expert offload (Qwen3.6-35B-A3B at 33.5 tok/s via llama.cpp); 16-24GB VRAM → dense Qwen3.6-27B for agentic; 128GB+ unified memory → MoE at top tier (Qwen3-235B-A22B)
  • Nested subagent support in Claude Code
    Agents-spawning-agents is first-class — revisit any flat-parallel agent design
  • Diffusion-based text generation (DiffusionGemma)
    Faster than autoregressive — re-evaluate fallback model choice if latency-bound
    via Simon Willison (via Google) · simonwillison.net/2026/Jun/10/diffusiongemma/
  • Transformer = gemm + epilogue
    At inference scale, kernel-level tuning beats architecture-swap for perf wins
Methodological frames 05
  • One-way door framing for governance
    Assume the Commerce directive's regulatory posture is the new baseline; design from there, not from last month
  • Self-verification loops doctrine
    Powerful models need their own output verification inside the loop, not after — pairs with perfect-commit Move 2 (triadic spec)
  • Jagged intelligence calibration
    Scope capabilities task-by-task; don't extrapolate from headline benchmarks
    via Melanie Mitchell (Yale Review) · yalereview.org/article/melanie-mitchell-jagged-…
  • Coding agents as normal technology
    Design for augmentation (engineer + agent), not replacement (agent-only autonomy at production scale)
    via Arvind Narayanan + Sayash Kapoor · www.normaltech.ai/p/why-ai-hasnt-replaced-softw…
  • Actionable Side Information (ASI)
    Evaluators return diagnostic feedback (errors, traces, profiler output) as text-gradient — required for any eval-driven optimization loop to converge
    via GEPA / Berkeley team · arxiv.org/abs/2605.19633
Papers worth a closer read 05
  • optimize_anything: A Universal API for Optimizing any Text Parameter
    Berkeley GEPA team — single LLM-based optimizer hits SOTA across 6 domains (CUDA kernels, scheduling, circle packing, agent architectures). Accepted CAIS 2026. Actionable Side Information is the central contribution.
  • Less Context, More Accuracy: A Bi-Temporal Memory Engine for LLM Agents
    Lean retrieved context beats full history; long-term memory layer for agents that doesn't replay everything — relevant to JC-OS memory + /jc-handoff
  • KVEraser: Learning to Steer KV Cache for Efficient Localized Context Erasing
    Post-hoc context erasing in the KV cache without breaking subsequent token states — relevant for any mid-session forget primitive
  • SING: Synthetic Intention Graph for Scalable Active Tool Discovery
    As tool ecosystems expand, tool discovery becomes the bottleneck — maps to harness deferred-tool / ToolSearch pattern
  • TokenPilot: Cache-Efficient Context Management for LLM Agents
    Constrains sequence mutations to preserve cache locality — makes nested-subagent patterns affordable at scale
05 / Quotes

Lines worth carrying

Welcome to the AGI era of AI governance. It's a one-way door and we weren't ready for it.
Nathan Lambert (Interconnects)
After two years of underregulating AI, the US government suddenly takes the nuclear option.
Gary Marcus (Substack)
Whenever an AI tells me I'm absolutely right, my trust in it drops by a bit.
François Chollet (@fchollet)
Fable 5 is the biggest step up I've felt in our models since Opus 4.5 back in November.
Boris Cherny (@bcherny)
After two days with Claude Fable 5 the best way I can describe it is relentlessly proactive.
Simon Willison (@simonw)
06 / Position shifts

What changed in the stance map

PersonThemeShiftNote
Karpathy software_economics_post_capability_jump NEW THEME Working software increasingly comes out on a tap; Jevon's paradox kicks in. First time tying Anthropic move to public macro-economics thesis.
Willison ai_code_quality ESCALATED From 'ships anyway' to 'the model writes the library; I review.' Three releases this period explicitly Fable-5-authored.
Willison agentic_security ESCALATED New failure mode named — silent competitor throttling. 'If Claude Fable stops helping you, you'll never know.'
Marcus ai_policy ESCALATED From 'clumsy mess' to concrete event reporting — US Commerce shutdown + What Washington must do.
Marcus legal_liability NEW THEME German court ruling holds Google liable for hallucinations. Section 230 erosion frame.
Mollick model_quality_read ESCALATED POSITIVE Unqualified positive read on Fable from calibrated voice; visible distress about the directive.
Clark legal_at_anthropic NEW THEME Anthropic team for AI and rule of law — first institutional AI-legal posture surface.
Lambert regulatory_one_way_door NEW THEME First explicit Bezos-style irreversibility framing applied to AI governance.
Chollet sycophancy_trust_dynamics NEW THEME Canonical short-form for candor-over-deference discipline.
Toner capability_gap_governance NEW THEME Bad to have large gap between internal and external capabilities — governance KPI re-framing.
Narayanan lab_claim_audit SHARPENED Why AI hasn't replaced software engineers, and won't — coding agents as normal technology.
bcherny agent_orchestration_topology NEW THEME Nested subagent support shipped — topology now tree-shaped, not flat-parallel.
agrawal prompt_optimization_paradigm SHARPENED optimize_anything paper accepted CAIS 2026; results broaden to 6 domains; used in MAI-Thinking-1 pretraining.
anthropic_institutional governance_collision NEW THEME Commerce directive + Axios scoop on internal clashes + Wired-driven policy walk-back. Three negative institutional events one week.
LeCun post_llm_paradigm ESCALATED Leaves Meta, founds AMI for post-LLM architectures. Change of paradigm obvious by early 2027.
Mitchell jagged_intelligence SHARPENED Yale Review piece gives the frame canonical citation.
07 / Cross-references

Who built on whom

08 / Source registry

The voices

Andrej Karpathy
@karpathy
['x', 'rss']
Simon Willison
@simonw
['x', 'rss']
Ethan Mollick
@emollick
['x', 'rss']
Gary Marcus
@GaryMarcus
['x', 'rss']
Nathan Lambert
['rss']
Arvind Narayanan
['rss']
Jack Clark
@jackclarkSF
['x', 'rss']
Helen Toner
@hlntnr
['x']
François Chollet
@fchollet
['x']
Yann LeCun
@ylecun
['x', 'video']
Melanie Mitchell
@MelMitchell1
['x', 'rss']
Sebastian Raschka
@rasbt
['x', 'web']
Tri Dao
@tri_dao
['x', 'web']
Christopher Olah
@ch402
['x']
Boris Cherny
@bcherny
['x']
Cat Wu
@__catwu
['x']
Erik Schluntz
@ErikSchluntz
['x']
Garry Tan
@garrytan
['x']
Fei-Fei Li
@drfeifei
['x', 'web']
Sara Hooker
@sarahookr
['x']
Lakshya Agrawal
@LakshyAAAgrawal
['x', 'arxiv']
Benedict Evans
@benedictevans
['x', 'rss']
Jason Wei
@_jasonwei
['x']
Cassie Kozyrkov
['rss']
Tyler Cowen
['rss']
MIT Technology Review
['web']