- Preface
- Setting Up the Environment
- 1. Week 1: From Matmul to Text
- 1.1. Attention and Multi-Head Attention
- 1.2. Positional Encodings and RoPE
- 1.3. Grouped/Multi Query Attention
- 1.4. RMSNorm and MLP
- 1.5. The Qwen2 Model
- 1.6. Generating the Response
- 1.7. Sampling and Preparing for Week 2
2. Week 2: Optimizing
3. Week 3: Serving
- Glossary Index