Publications
Incoming CS PhD student at Dartmouth College (fall 2026), advised by Prof. Shawn Shan. Research spans ML security, LLM privacy, low-resource NLP, and production AI systems. Also on Google Scholar.
Published & Accepted
5SLMs as Compiler Experts: Auto-Parallelization for Heterogeneous Systems
PublishedNeurIPS 2025 · Workshop on Machine Learning for Systems
Small Language Models as compiler experts for auto-parallelization in heterogeneous systems. Achieves 6.81× average speedup (43.25× peak) against LLVM Polly, TVM, and Triton baselines.
Can LLMs Learn Tulu? Teaching Low-Resource Languages Through Hard Constraint Prompting
PublishedEACL 2025 · LoResLM Workshop
Hard negative constraints reduce catastrophic language leakage from 80% to 5% in low-resource language modeling for Tulu (~0.001% of typical training data). Explicit prohibitions outperform positive instructions for maintaining language integrity.
GUARDIAN: Multi-Agent Defense System for Large Language Model Security
PublishedICIAI · Waseda University
Multi-agent defense system achieving 99.5% prompt injection and jailbreak detection accuracy through coordinated multi-agent analysis.
PyraFuseNet: Dual-Path Network for Resource-Constrained Vision
PublishedICIAI · NTU Singapore
Dual-path architecture for resource-constrained vision. Achieves 55% computational reduction vs. ResNet-18 while maintaining competitive performance.
PDF Malware Detection with Adversarial Robustness
PublishedApplied Soft Computing · Q1 Journal
KASPER framework for PDF malware detection: 99.5% accuracy under FGSM and PGD adversarial attacks. Explainability via Kolmogorov-Arnold Networks.
Under Review
1Breaking the Query Barrier: Efficient Verbatim Regurgitation from Large Language Models
Under ReviewICML 2026 · Main Conference
Hierarchical Extraction Search (HES): a query-efficient framework extracting memorized training data from LLMs under black-box constraints. Reduces extraction cost by 10–100× vs. brute-force, demonstrated across 12 models (7B–123B). Models appearing robust to high-volume attacks remain vulnerable to low-footprint extraction.
Preprints
3SAMVAD: Multi-Agent Framework for Modeling Judicial Reasoning in Indian Jurisprudence
PreprintarXiv · 2509.03793
Multi-agent framework for modeling judicial reasoning in Indian jurisprudence.
RegimeNAS: Regime-Aware Neural Architecture Search for Financial Trading
PreprintarXiv · 2508.11338
Regime-aware NAS for financial trading, adapting architecture selection to market conditions.
MorphNAS: Differentiable Neural Architecture Search for Multilingual NER
PreprintarXiv · 2508.15836
Differentiable NAS for multilingual named entity recognition across multiple languages.