Notes, zines, and illustrations on things I'm figuring out.
Short-form write-ups on GPU architecture, kernels, and systems.
Notes from Stanford's Language Modeling from Scratch course.
Using TMA in the Hopper architecture to load data from global to shared memory.
Comics and illustrations on topics I find interesting.
A visual explainer on MoE in transformers.