AI-Powered Resume & Job Description Matching Assistant
Agentic RAG system using CrewAI and LangChain to automate context-aware resume scoring and tailored cover letter generation.
View on GitHubHi, I'm
ML Pipelines · GenAI · MLOps
I build production ML systems — from AWS-native AutoML pipelines to agentic RAG applications — that turn messy data into measurable business impact.
A data scientist turning models into measurable outcomes — from AutoML pipelines to GenAI-powered assistants.
I'm a Data Scientist at Squark AI in Boston, where I architect AWS-native ML pipelines that train 35% faster on diverse structured and unstructured datasets. My work blends classical ML with modern GenAI — engineering custom preprocessing frameworks with BERT and Word2Vec embeddings, and validating prediction granularity through clustering and rigorous A/B testing.
Before grad school, I spent two years at Tredence Analytics in Bengaluru building predictive churn models and end-to-end MLOps pipelines in Databricks for a top-tier retail client with a 10M+ user base. That experience taught me that the fastest model rarely wins — the one that's deployed, monitored, and trusted does.
I hold an M.S. in Data Analytics Engineering from Northeastern University (GPA 3.8/4.0) with coursework in Data Mining, MLOps, and Data Management. I'm passionate about agentic AI, drift monitoring, and the unglamorous engineering work that keeps models alive in production.
3+ years across AI startups, enterprise analytics consulting, and public-sector data — shipping ML that moves business metrics.
Squark AI · Remote, United States
Tredence Analytics Solutions · Bengaluru, India
SJVN Ltd. · Shimla, India
A full-stack data toolkit — from SQL and PySpark to BERT embeddings, MLflow, and AWS.
A snapshot of recent work — agentic GenAI, predictive modeling, and recommendation engines.
Agentic RAG system using CrewAI and LangChain to automate context-aware resume scoring and tailored cover letter generation.
View on GitHub85%-accurate XGBoost model for identifying high-risk customer segments. Surfaces key retention drivers through statistical analysis to help businesses act before churn happens.
View on GitHub91.4%-accurate neural network for sentiment classification on unstructured Yelp reviews, paired with a hybrid recommendation engine using KNN, SVD, and sentiment embeddings.
View on GitHubA job-notification agent that crawls Big Tech career pages every 15 minutes and pushes instant, personalized alerts. Reduces noise — finds roles that actually match.
View on GitHubSurvival classification model for liver-cirrhosis patients — predicts outcomes from clinical features using a tuned classification pipeline. Healthcare ML in practice.
View on GitHubA working reference for productionizing ML models — covers the operational glue around training, serving, and monitoring that keeps models reliable post-deployment.
View on GitHubFormal training at the intersection of analytics, machine learning, and engineering.
Master of Science in Data Analytics Engineering
GPA: 3.8 / 4.0
Coursework: Data Management in Analytics, Data Mining in Engineering, Machine Learning Operations
Bachelor of Technology in Computer Science and Engineering
Open to data science roles, collaborations, and conversations about ML, GenAI, and MLOps. Let's build something.