Hello, I'm

Kush Patel

|

Building end-to-end clinical AI pipelines, from predictive modeling and NLP to LLM-driven agentic systems, that transform health data into actionable decisions.

Kush Patel

About Me

01

I'm a Data Scientist with a Master's in Data Science from Illinois Tech, focused on predictive modeling, NLP, and LLM-driven agentic systems. I build end-to-end clinical AI pipelines on AWS, from post-discharge voice triage to hypertension adherence agents, applying robust, interpretable AI to real-world health system challenges.

๐Ÿง  ML Research ๐Ÿ“Š Data Viz โšก Scalable APIs โ˜• Coffee-Driven Dev
0 Projects
0 Research Paper
Currently Focusing on : Advanced LLM fine-tuning, Agentic AI & Model Context Protocol (MCP)

Arsenal

02

Core Competencies

Python & R95%
SQL & Databases90%
Machine Learning 85%
Deep Learning (PyTorch, TF)80%
Data Visualization (Plotly, Tableau, Power BI)85%

Technologies & Tools

AWS
Docker
Git
Spark
FastAPI
Pandas
NLP
BERT
LangChain
LLMs
MySQL
MongoDB
JavaScript
Excel
Django
RAG
Pinecone
GCP
Amazon Bedrock
TypeScript
Multi-Agent AI

Projects

03

ARIA: Adherence Risk Intelligence Agent

Architected a three-layer AI pipeline: deterministic rule engine, weighted risk scoring (0-100), and LLM narrative generation with 11 validation checks, for hypertension management across a full patient panel. Built 5 nightly clinical detectors with patient-adaptive thresholds, an Ask ARIA chatbot with three-layer guardrails, and a full-stack system with a 12-table PostgreSQL schema and Next.js clinician dashboard.

LLM PostgreSQL Next.js AWS

Sentinel Care: Post-Discharge Voice Triage

Production-grade AI voice triage system targeting the post-discharge follow-up gap, with outbound calls, natural speech clinical assessments, and real-time SBAR reports delivered to a nurse dashboard with tiered risk classification. Designed an 8-Lambda serverless architecture integrating Amazon Bedrock Nova Pro/Lite with LACE readmission risk scoring and SNS escalation for high-risk patients.

Amazon Bedrock Serverless NLP AWS

AegisGuard: Antibiotic Stewardship AI

AI-powered point-of-care antibiotic prescribing tool combining facility antibiogram data, patient-specific lab values, and validated clinical scoring algorithms to reduce the ~50% inappropriate antibiotic use rate prevalent in US hospitals. Integrates real-time medication safety analysis with local pathogen resistance patterns for EHR-integrated workflows.

LLM Clinical AI Python

Auto-Grading Subjective Test Platform

A full-stack Django-based automatic subjective grading platform that uses BERT-driven semantic similarity (Sentence Transformers, cosine scoring) with role-based dashboards to evaluate unstructured answers at scale, achieving a 0.87 F1 score and retaining educator control via a human-in-the-loop override.

Django NLP SQLite

CareBot: Clinical-Support Medical Chatbot

A context-aware conversational AI for preliminary symptom analysis, integrating LangChain and Gemini LLMs with Pinecone vector databases to perform high-speed semantic search across chunks of clinical text from the Gale Encyclopedia of Medicine

RAG Pinecone LLM

Biological Age Prediction

Developed a reproducible, end-to-end ML pipeline for estimating biological age from DNA methylation data using interpretable models and a calibrated stacked ensemble, with robust cross-study validation demonstrating strong generalization on an external cohort

Python Pandas ML Pipeline

Diabetes Prediction

Built an end-to-end, interpretable diabetes prediction pipeline on a 100K-record clinical dataset using Lasso-based feature selection and ensemble modeling, achieving a 0.97 AUC with a tuned XGBoost model while translating predictions into clinically actionable risk insights.

Python Regression XGBoost

Experience

04

ARC Math Tutor

Illinois Institute of Technology

Nov 2025 - Present
  • Translated complex mathematical and statistical concepts into intuitive, accessible frameworks for undergraduate students, demonstrating the ability to communicate highly technical logic to non-technical audiences.

Machine Learning Intern

InternshipStudio

May 2023 - Jun 2023
  • Engineered predictive regression models (Random Forest, Keras/TensorFlow) to forecast digital asset monetization based on audience interaction signals and content metadata in Google BigQuery, outperforming the baseline decision-tree model by 15% in Mean Absolute Error (MAE) to support targeted promotional strategies.
  • Architected a data preprocessing pipeline in Python using Pandas, NumPy, and Spark to handle corrupted data flags, encode complex categorical taxonomies, and filter statistical anomalies in engagement metrics while managing multiple projects to streamline data preparation and enable downstream models to train on clean data without manual intervention
  • Developed interpretability frameworks using Random Forest feature importance to demystify complex predictive outputs, identifying the primary drivers of user engagement and translating model behavior into actionable content strategies for stakeholders.

Data Analyst Intern

Trainity

Nov 2022 - Dec 2022
  • Engineered data sanitization workflows using SQL and Excel on a 300,000+ row loan portfolio, eliminating 41 high-null features and imputing missing financial data to establish a reliable baseline for risk modeling.
  • Conducted segmented bivariate analysis and designed KPI-driven dashboards in Power BI and other Microsoft applications for a highly imbalanced credit dataset (92% non-default vs. 8% default), visualizing demographic distributions to uncover hidden risk correlations in income and employment.
  • Presented a data-driven risk mitigation strategy to business stakeholders, specifically highlighting the Transport sector's 16% default rate, coordinating multiple project work streams to propose dynamic interest rate adjustments and optimize the underwriting process.

Education

05

Illinois Institute of Technology

Master of Applied Science, Data Science

2024 - 2026 GPA: 3.8/4.0
ARC MATH Tutor Indian Student Association

University of Mumbai

B.Tech in Computer Science and Engineering

2019 - 2023
Data Structures Big Data Technologies Python Database Managements

Want the full picture?

My resume has everything, from specific project architectures to publications.

Download Resume PDF ยท Last updated May 2026

Let's Connect

06

Currently open for new opportunities. Whether you have a question, a project proposal, or just want to say hi, I'll try my best to get back to you!

Email

patel.h.kush@gmail.com

LinkedIn

kush-patel2416

Location

Chicago, IL (Remote OK)

Message sent successfully!