Applied ML researcher and engineer with 5+ years closing the loop from experiment to production — multimodal learning, post-training LLMs (GRPO, SFT, QLoRA), large-scale retrieval & ranking, and document-understanding systems at scale. 2 US patents filed. Published in IEEE, Semantic Web Journal, and Alexa Prize. Strong collaborations with product, stakeholder engineering teams and leadership.
At BILL, I've shipped production systems spanning a 27M-entity multimodal search system (patent pending), various deep learning and traditional ML solutions and end-to-end LLM agent orchestration for customer support and invoice understanding.
Before this, I spent two years as a Research Assistant at USC's Information Sciences Institute—working on multi-agent RL for the Diplomacy game (Dr. Jon May) and knowledge graph quality & embeddings (Dr. Filip Ilievski). I completed my M.S. in Computer Science with Honors and a 4.0 GPA at University of Southern California and my B.Tech with Rank 1 from K. J. Somaiya College of Engineering, Mumbai.
Outside work: Guitar, snowboarding, hiking and painting. Find my resume here.
End-to-end RAG pipeline (ChromaDB, LangChain, FastAPI, OpenAI APIs, PHI guardrails via Presidio Analyzer) to promote HPV vaccination to Los Angeles General Hospital patients as a pilot. Presented as a poster at USC ShowCAIS 2026. Demo · GitHub
USC team advanced as a semifinalist in Amazon's Alexa Prize Socialbot Grand Challenge 4. Contributed to few components of the generate-and-rank pipeline: DialoGPT generation, SNIPS NLU intent classification, FSM dialogue management, BERT re-ranking, and AI safety guardrails. Article
Fine-tuned BART-large for dense retrieval + answer generation, achieving 92.6% top-1k retrieval accuracy and 43.9% EM accuracy on the Natural Questions dataset.
Siamese network (CNN + Pooling + BatchNorm + Dropout) for one-shot signature verification— determining whether two signatures belong to the same person. GitHub →
CNN trained on chest X-rays to detect pneumonia. Achieved 94.56% test accuracy and a recall of 0.97.
Android app translating Indian Sign Language gestures into voice using image processing, segmentation, KNN & HMM algorithms. Published at IEEE ICCNT 2018.
Building and shipping ML systems from research prototype to production at scale.
Peer-reviewed papers, workshop proceedings, and intellectual property.
Layout-aware field extraction (US 19/410,970) — System, method, and computer program product for extracting fields from vendor documents using layout detection and user behavior patterns.
Multimodal embedding system for business recommendations (US 19/043,191) — multimodal embeddings for entity search at scale.
First-authored empirical audit of Wikidata's data quality across completeness, consistency, and schema conformance at scale. Developed automated assessment pipelines and surfaced systematic gaps — providing the community with actionable quality metrics and a reproducible framework adopted by downstream KG research.
First-authored undergraduate capstone turned IEEE publication. Designed and built an Android application using image processing, colour-based hand segmentation, Hidden Markov Models and KNN classifiers to translate Indian Sign Language gestures into synthesised voice output for hearing- and speech-impaired users in real time.
Contributed the graph-embedding retrofitting pipeline — injecting BERT embeddings and structural features from Wikidata, Probase, and DBPedia into node representations — improving Spearman correlation from 0.66 → 0.73 on WordSim353. The study showed that pairing language models with rich structural knowledge achieves best-in-class concept similarity performance.
Ran evaluation experiments across multiple Wikidata relation categories using text embeddings to test whether the knowledge graph's relational structure supports analogical reasoning. Found that relevant analogical information is frequently absent or inconsistently modelled — establishing desiderata for future automated analogy extraction.
USC's entry in Amazon's Alexa Prize Socialbot Grand Challenge 4, advancing as a semifinalist from a competitive field of university teams. Contributed to few components of the generate-and-rank pipeline: DialoGPT for candidate generation, SNIPS NLU intent classification, FSM dialogue management, BERT-based response re-ranking, and AI safety guardrails. (3rd author, large team)
Open to research collaborations, applied ML roles, and interesting problems in foundation model post-training, multimodal systems, large-scale retrieval and agentic AI.
Senior ML Engineer at BILL, building scalable production LLM and multimodal systems. USC MS CS, GPA 4.0/4.0. Based in San Jose, CA.