C Kaustubh
I build reliable distributed systems, ML pipelines, and performance-oriented developer tools with Rust, Go, Python, CUDA, and PyTorch.
Now
Teaching Assistant at PES University and Tech Lead at Shunya PESU.
Research
RLHF/RLVR for ASR models, agentic memory, and video anomaly detection.
Open Source
LanceDB, Firecracker, and core contributor to Vektori.
Focus
Distributed systems, ML systems, cloud-native infrastructure, and low-level performance.
Proof
Research intern at C-ISFCR, Deployfest 2026 runner-up, and multiple from-scratch systems projects.
Open To
Fall 2026 internships in ML, MLOps, systems, backend engineering, and applied AI R&D.
Featured: qwen_cuda
From scratch implementation of a CUDA backend for the Qwen model family, optimizing memory usage and computational efficiency. Improved benchmark scores by ~3 tok/s.