The 2024 article on production_transformer.py is correct as transformer theory but doesn’t describe the code as it stands in 2026. Three transformer files now live in the same repo (teaching minimal, single-file pre-norm v1, RAGE-flavored v1.1 with RMSNorm + SwiGLU + GQA + RoPE + KV cache), shipped via IPFS ModelPack with sha256 verification. Here is the operational ground truth.