Subscribe to First Token.
One production AI deep dive every Tuesday — for backend engineers who ship the systems behind the demo. Confirm and we'll send the RAG eval cheatsheet right after.
One more step — check your inbox.
We just sent a confirmation link. Click it to lock in your subscription and we'll send the cheatsheet right after.
Tactics from real shipping, not blog posts about blog posts.
Every issue starts with a real failure mode, debug story, or architectural decision, drawn from systems serving actual users. If it only works in a demo, it doesn't ship here.
Written for engineers who ship the systems behind the demo.
APIs, queues, retrieval, caching, observability, evals. The boring infrastructure that makes LLM products actually work in production, not prompt-engineering hot takes.
A single deep dive every Tuesday. No filler, no daily noise.
Long enough to be useful, short enough to read on the commute. Roughly 1,800 words, one diagram, one runnable snippet per issue. That's the contract.