Anvil
The Briefing / Today

All things AI,
every day.

The model race is slowing. The inference race is just starting. Everyone is about to relearn what a margin is.
The Top Stories
8 picks
Economics11 MIN
01
Stratechery

Aggregation, inference, and the return of unit economics

Ben Thompson argues the AI wave has hit the same wall every platform shift does: subsidies end, aggregators form, and the middle gets squeezed. The question is which layer becomes the aggregator this time — the model, the application, or the inference runtime.

Next Up
Tools7 MIN
02
Simon Willison

Running Claude 5 Sonnet on a MacBook via MLX

Local inference with a 200B-class model on consumer silicon was a 2027 prediction six months ago. It just happened. Worth reading for anyone who thought OpenAI and Anthropic had a durable API moat.

Craft9 MIN
03
One Useful Thing

Most AI writing is bad. Here is why and what to do

Mollick makes a case that LLM-default prose is a style, not a capability ceiling. The fix is not a better prompt — it is being specific about whose writing you want the model to sound like, and doing the work to show it.

Models4 MIN
04
Anthropic News

Extended thinking leaves beta

Ships with pricing, a tuned budget parameter, and a surprise: the public leaderboard numbers for the same budget went up after the beta run. Good news if you were holding off until the contract was stable.

Also Worth Reading
Research13 MIN
05
Interconnects

Open weights are not enough — reproducible post-training is

Nathan Lambert on why "open" is now a gradient, not a binary. Weights without training data, without preference data, without eval sets is a reproducibility crisis in motion.

Building8 MIN
06
Latent Space

DevRel for agents: the new developer experience

Swyx argues documentation is getting read by agents before humans, and the best docs are now structured for the agent reader first. Implications for anyone shipping an API in 2026.

Economics15 MIN
07
Import AI

The quiet economics of running inference at scale

Jack Clark walks through the internal spreadsheet math of a frontier lab — why 70% of compute is inference, why that ratio flips annually, and what it means for who funds what.

Models3 MIN
08
OpenAI Blog

GPT-5.5 Turbo is 60% cheaper at the same quality

Another step down the inference cost curve. The parts of your product that you argued were too expensive to AI-augment three months ago are probably not anymore.

Ask Anvil

The briefing answers what it can. For everything else, ask.

Or start with
From the Archive
The Builder Weekly ↗
WEEKLY · VOL XII

AI Alone Is Fragile

Your demo works. Your production is fragile. Build the system, then build the product. AI executes within a system. Without one, you ship fragile. With one, you ship with confidence.

Mike Molinet & Govind Kavaturi10 minRead on TBW ↗
Being Asked Right Now
Live · updated every minute