A publication for engineers shipping inference

The engineering layer of AI.

Deep technical writing on LLM, GPU, and ML systems internals. Decoded from silicon to system to algorithm — for the engineers who already know what RAG is.

/03 Editorial

We decode AI one layer at a time. Silicon tells you what the hardware can do. System tells you how inference actually runs. Algorithm tells you why the math works. All three, in depth, without the vendor gloss.

Written for the engineers building inference infrastructure — not the engineers explaining what inference is. Dense. Verifiable. No filler.

fp4 editorial desk