Transforming raw data into decisions that matter. Specializing in scalable machine learning models and predictive analytics.
I am a Data Scientist and Machine Learning Engineer passionate about bridging the gap between complex algorithms and real-world business value. With a strong foundation in statistical modeling and distributed computing, I architect end-to-end data pipelines that empower decision-makers. My focus is on deploying scalable, interpretable AI solutions that drive measurable impact.
A full-stack Django-based automatic subjective grading platform that uses BERT-driven semantic similarity (Sentence Transformers, cosine scoring) with role-based dashboards to evaluate unstructured answers at scale, achieving a 0.87 F1 score and retaining educator control via a human-in-the-loop override.
A context-aware conversational AI for preliminary symptom analysis, integrating LangChain and Gemini LLMs with Pinecone vector databases to perform high-speed semantic search across chunks of clinical text from the Gale Encyclopedia of Medicine
Developed a reproducible, end-to-end ML pipeline for estimating biological age from DNA methylation data using interpretable models and a calibrated stacked ensemble, with robust cross-study validation demonstrating strong generalization on an external cohort
Built an end-to-end, interpretable diabetes prediction pipeline on a 100K-record clinical dataset using Lasso-based feature selection and ensemble modeling, achieving a 0.97 AUC with a tuned XGBoost model while translating predictions into clinically actionable risk insights.
Currently open for new opportunities. Whether you have a question, a project proposal, or just want to say hi, I'll try my best to get back to you!
patel.h.kush@gmail.com
linkedin.com/in/kush-patel2416
Chicago, IL (Remote OK)