home

Notes

i'll be dumping my notes about everything I learn here

Tensor memory accelerator CuTe layouts GPU occupancy Inside Hopper arch sparse attention