learning
CS336
Language Modeling from Scratch — click a topic to see notes.
intro → model architecture → training → parallelism → inference → data → post training