Selected Publications
ICML 2026
Breaking the Query Barrier: Efficient Verbatim Regurgitation from Large Language Models
NeurIPS 2025
SLMs as Compiler Experts: Auto-Parallelization for Heterogeneous Systems
EACL 2025 Workshop
Can LLMs Learn Tulu? Teaching Low-Resource Languages Through Hard Constraint Prompting
App. Soft Comp.
PDF Malware Detection with Adversarial Robustness (KASPER)
Projects
Production LLM training + serving system with Mixture-of-Refusals (MoR) safety routing, FSDP/DDP, vLLM, and Kubernetes deployment
Decentralized content-addressable file storage system in Go for streaming large files across distributed nodes
Qwen2.5-7B fine-tuned on Indian legal datasets with LoRA + 4-bit quantization, deployed on Modal
Distributed 3D printer management using the Raft consensus algorithm for persistence without a centralized database