I am a Senior Deep Learning Performance Architect at NVIDIA, working on ML workload performance on current and next-gen NVIDIA GPUs. Previously, I spent four years at AMD working on performance-modeling GPU platforms and accelerating ML workloads (Llama, GPT, Stable Diffusion) on MI300X and earlier Instinct GPUs. I graduated with a Ph.D. in Computer Science from William & Mary in May 2021.

Recent Posts