Source: robinsloan.com

Dragoncatcher: The secret of lightness

Robin Sloan argues that language models are fundamentally different from brains and that scaling laws alone may not lead to superintelligence, suggesting that smaller, more efficient models could achieve high performance. He challenges the assumption that massive parameter counts are necessary, advocating for a focus on lightness and efficiency in AI development.

Published Jun 17, 2026

We do not yet understand how to train language models! This seems obvious to me, because it ought to be possible —

The famous Scaling Laws only describe transformer models —

A fair objection goes like this: Robin, remember that the human brain has hundreds of trillions of “parameters”, in the form of synapses. Our largest models haven’t even approached that scale yet. Do you want us to architect a beetle’s brain, or SUPERINTELLIGENCE?

(Before proceeding, Robin replies: well, I wouldn’t mind starting with the beetle … )

The obvious response to this objection is that language models aren’t brains. Contra the brain, they operate with both handicaps (e.g. power consumption) and advantages (e.g. speed). More than linearly “better” or “worse”, though, they are just different! And so we should expect different properties, different capabilities … different numbers.

Hanging over all this, the recognition: the day that this level of intelligence moves out to the edge —

A true believer in the Scaling Laws doesn’t think such an idea can exist — I’m with Calvino:

Were I to choose an auspicious image for the new millennium, I would choose [ … ] the sudden agile leap of the poet-philosopher who raises himself above the weight of the world, showing that with all his gravity he has the secret of lightness, and that what many consider to be the vitality of the times — noisy, aggressive, revving and roaring — belongs to the realm of death, like a cemetery for rusty old cars.

Of course, this is just a post by a child of the 20th century, to whom the prefix “giga-” still sounds unspeakably plush. Even so: if you tell me you can’t fit a supercapable model, one poised comfortably on today’s performance frontier, into 30 billion parameters, I will tell you, try harder!
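
For a rough sense of the numbers in play, here is a back-of-the-envelope sketch in Python. It estimates the storage needed for the weights of a 30-billion-parameter model, and the gap between that count and the synapse figure raised in the objection. The bit widths and the 100 trillion synapse count (taken as the low end of "hundreds of trillions") are illustrative assumptions, not figures from the post, and the estimate ignores activations, caches, and runtime overhead.

```python
# Back-of-the-envelope arithmetic; every constant except PARAMS is an assumption.

PARAMS = 30e9  # the 30 billion parameters named in the post

# Assumed low end of "hundreds of trillions" of synapses (hypothetical figure).
SYNAPSES_ASSUMED = 100e12

# Hypothetical storage formats: bits used per parameter.
BITS_PER_PARAM = {"16-bit": 16, "8-bit": 8, "4-bit": 4}


def weight_gigabytes(n_params: float, bits: int) -> float:
    """Storage for the weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits / 8 / 1e9


def footprint_table(n_params: float = PARAMS) -> dict[str, float]:
    """Map each assumed storage format to its weight footprint in GB."""
    return {
        name: weight_gigabytes(n_params, bits)
        for name, bits in BITS_PER_PARAM.items()
    }


def scale_gap(n_params: float = PARAMS, synapses: float = SYNAPSES_ASSUMED) -> float:
    """How many times larger the assumed synapse count is than the parameter count."""
    return synapses / n_params


if __name__ == "__main__":
    for name, gb in footprint_table().items():
        print(f"{name}: about {gb:.0f} GB of weights")
    print(f"assumed synapse count is about {scale_gap():.0f}x the parameter count")
```

Under these assumptions the weights alone come to tens of gigabytes, which is the kind of figure that decides whether a model can plausibly run at the edge.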

