References

  1. kipply, Transformer Inference Arithmetic

  2. kipply, Transformer Param Count

  3. Jay Alammar, The Illustrated Transformer

  4. Horace He, Making Deep Learning Go Brrrr From First Principles

  5. Sean Goedecke, Fast LLM Inference From Scratch

  6. DeepSeek-V3 Technical Report

  7. Baseten, Kimi K2 Thinking at 140 tok/s on NVIDIA Blackwell

  8. Baseten, Inference Engineering

  9. SemiAnalysis, InferenceX v2: NVIDIA Blackwell vs AMD MI355X

  10. swyx (Latent Space), Reasoning Price War

  11. Cline, A Practical Guide to Hill Climbing (Evals)

  12. How to land a job at frontier lab