The Briefing / Today

All things AI,
every day.

The model race is slowing. The inference race is just starting. Everyone is about to relearn what a margin is.

The Top Stories

8 picks

Economics · 11 MIN

01

Stratechery

Aggregation, inference, and the return of unit economics

Ben Thompson argues the AI wave has hit the same wall every platform shift does: subsidies end, aggregators form, and the middle gets squeezed. The question is which layer becomes the aggregator this time — the model, the application, or the inference runtime.

9H AGO

Next Up

Tools · 7 MIN

02

Simon Willison

Running Claude 5 Sonnet on a MacBook via MLX

Local inference with a 200B-class model on consumer silicon was a 2027 prediction six months ago. It just happened. Worth reading for anyone who thought OpenAI and Anthropic had a durable API moat.

10H AGO

Craft · 9 MIN

03

One Useful Thing

Most AI writing is bad. Here is why and what to do

Mollick makes a case that LLM-default prose is a style, not a capability ceiling. The fix is not a better prompt — it is being specific about whose writing you want the model to sound like, and doing the work to show it.

16H AGO

Models · 4 MIN

04

Anthropic News

Extended thinking leaves beta

Ships with pricing, a tuned budget parameter, and a surprise: the public leaderboard numbers for the same budget went up after the beta run. Good news if you were holding off until the contract was stable.

2H AGO

Also Worth Reading

Research · 13 MIN

05

Interconnects

Open weights are not enough — reproducible post-training is

Nathan Lambert on why "open" is now a gradient, not a binary. Weights released without training data, preference data, or eval sets are a reproducibility crisis in motion.

1D AGO

Building · 8 MIN

06

Latent Space

DevRel for agents: the new developer experience

Swyx argues documentation is getting read by agents before humans, and the best docs are now structured for the agent reader first. Implications for anyone shipping an API in 2026.

1D AGO

Economics · 15 MIN

07

Import AI

The quiet economics of running inference at scale

Jack Clark walks through the internal spreadsheet math of a frontier lab — why 70% of compute is inference, why that ratio flips annually, and what it means for who funds what.

1D AGO

Models · 3 MIN

08

OpenAI Blog

GPT-5.5 Turbo is 60% cheaper at the same quality

Another step down the inference cost curve. The parts of your product that you argued were too expensive to AI-augment three months ago are probably not anymore.

5H AGO

From the Archive

The Builder Weekly ↗

WEEKLY · VOL XII

AI Alone Is Fragile

Your demo works. Your production is fragile. Build the system, then build the product. AI executes within a system. Without one, you ship fragile. With one, you ship with confidence.

Mike Molinet & Govind Kavaturi · 10 min

The briefing answers what it can. For everything else, ask.