I currently work as a Senior Machine Learning Engineer at BILL (formerly Bill.com) where I lead designing of the intelligence layer of LLM agents for various use cases.
I also design and implement deep learning models to improve the features of our software.
I completed my Masters in Computer Science at University of Southern
California with Honors. (Graduation Date: December 2022).
At USC, I was working in Information Sciences
Institute as a research assistant where my work was centered around language models and various aspects of knowledge graphs.
At the same time, I was a Teaching Assistant for CSCI 561 and CSCI 570.
I have previously worked as a Software Developer at Barclays Global Service Centre, India where I had worked on a wide
variety of tech stacks including Natural Language Processing and Web Development.
I had completed my Bachelors with Rank 1 from K. J. Somaiya College of Engineering, Mumbai majoring in Computer Engineering. My final year project - Sign Language Translation was aimed at recognizing the signs and gestures demonstrated by the hearing- and speech-impaired and translate into voice. Apart from my academic endeavours, I play the Guitar, love to hike and snowboard. You can find my Resume here and papers or reports for all the above work in the publications section.
MS in Computer Science, Dec 2022
University of Southern California
BTech in Computer Engineering, 2018
K. J. Somaiya College of Engineering, Mumbai
For a complete list, kindly see my CV
Built an end-to-end RAG pipeline and chat assistant to promote HPV vaccination to USC patients using ChromaDB, FastAPI, OpenAI (text model and TTS model). Github Link
Developed a Siamese network of a combination of Convolutional, Max Pooling, Batch Norm and Dropout layers, to verify whether the signatures are signed by the same person or not using one-shot learning. Github Link
CNN was trained for detecting Pneumonia given Chest X-Rays. After training the model, test accuracy of 94.56% and a recall score of 0.97 was achieved.
An Android application designed to translate the signs and gestures performed by the hearing- and speech-impaired into voice using Image Processing, Segmentation and Machine Learning algorithms KNN and HMMs. Paper published in IEEE.
Contributed to the development of Machine Learning Lab on behalf of K J Somaiya College of Engineering, Mumbai and hosted on IIT Bombay's Virtual Labs website. As part of this, I developed simulations of neural networks, optical character recognition using Tesseract JS, Hebbian, Perceptron Learning Rules.
Spearheaded the company's first entity search system using document embeddings by working on
pointwise LTR (Learning to Rank) algorithm and XGB ranker that achieves a top-1 accuracy of
91% using invoices and customer features. Retrieval is done against 27M embeddings
and the RTT is 280ms on the OpenSearch VectorDB backend
Led the design of the intelligence layer of the LLM agents for various use cases
including customer support, invoice field extraction while mentoring
junior engineers and guiding other non-technical stakeholders on the use of the
toolkit such as n8n, Braintrust, ElevenLabs, RetellAI
and various prompt engineering techniques, evaluations
Contributed to a POC of multi-task learning on documents for field extraction
using LayoutLM (2D attention transformer) embedding layer and
Siamese network-based classification head achieving 89% classification accuracy
and NER heads achieving 76% document field extraction accuracy
Contributed to the development of an end-to-end pipeline for field extraction for invoices using a rules-based engine to get low-latency and high-quality results. This pipeline achieves a p90 latency of under 350ms and a precision of 95%
Designed a proprietary logo extraction model from invoices using YOLOv8 model with a mAP of 0.9 and a similarity model - ConvNext with 87% accuracy to assist in identifying similar invoices with logos
Enhanced a document similarity model to improve de-duplication of merchants in the database using Siamese-BERT model which has increased the eligible transaction payment volume by 10% through enhanced payment rails
Worked on designing bots capable of understanding communication and strategizing for the board game – diplomacy under the supervision of Dr. Jon May
Increased the win rates by an average of 5% for the powers in the game by generating and understanding DAIDE messages and using rules on top of the pre-trained reinforcement learning based Dipnet bot
Generated a dataset of 1 million invoices and statements from user data after sanity checks for improving the amount OCR model
Leveraged an ensemble of document classification models to fix the imbalanced data distribution of statements in training dataset
Improved the amount prediction test accuracy from 75.5 to 77.6% with 50% confidence threshold
Constructed a framework for identifying low quality statements in Wikidata knowledge graph amongst 1.1 billion statements on the basis of deleted, deprecated statements and constraint violations
Enhanced the graph embeddings of nodes using retrofitting based on BERT embeddings and structural, textual properties extracted from Wikidata, Probase and DBPedia datasets increasing Spearman correlation from 0.66 to 0.73 on WordSim353 benchmark
Devised a prototype fraud detection pipeline using Kafka queues, Cassandra DB and PySpark servers having an ensemble of ML models
Designed a real-time tweets sentiment analysis engine to enable quick customer service response achieving an accuracy of around 90 % in pilot runs
Created a classifier application utilizing ML algorithm LDA to extract insights from iOS and Android application reviews and customer complaints
Deployed a system that helps in connecting the colleagues with available bandwidth and skillsets with the colleagues needing assistance in their work, using AngularJS, Java, MySQL, saving more than 900 man-hours annually
Implemented dashboards for automated generation of real-time delivery metrics of more than 30 teams from Agile Central and Jira data sources which have been saving around 150 man-hours annually. Bagged the Barclays Award of Stewardship for this initiative
Led a team of three to develop a Virtual Lab for the online demonstration of machine learning concepts such as neural networks, learning rules and optical character recognition
This lab has won the Global Online Laboratory Consortium International Lab Award
Courses Included: 1) Neural Networks and Deep Learning, 2) Improving Deep Neural Networks:
Hyperparameter tuning, Regularization and Optimization
3) Structuring Machine Learning Projects, 4) Convolutional Neural Networks, 5) Sequence
Models.
Links to All Courses: Intro to Machine Learning Intermediate Machine Learning Feature Engineering Pandas
Topics Covered: Logistic Regression, Artificial Neural Network, Machine Learning Algorithms,
Principal Component Analysis, Collaborative Filtering
Please feel free to reach out to me for any query