I made this interactive reference for transformer models, showing everything down to elementary math. I intentionally avoided matrix multiplication and used explicit sums and indices instead. Covers models from GPT-2 to Qwen 3.6, with MLA, MoE, RoPE, MTP, etc togglable. It's best viewed on desktop or tablet.
Show HN: Transformer Math Explorer
(simonramstedt.com)4 points by rmst 2 hours ago | 1 comments
Comments