Predictive Modeling for Hospital Readmission Risk Among Cardiovascular Patients
This project presents a comprehensive data-driven framework to predict six-month hospital readmission risk for cardiovascular patients, specifically addressing the challenges of data scarcity in low-resource environments. It documents a complete machine learning workflow, beginning with the manual collection of a novel dataset from 3,867 patients that integrates demographic, clinical, and behavioral variables. The project develops, compares, and validates several predictive models, including gradient boosting (XGBoost), deep neural networks (TabNet), and stacked ensemble classifiers. A strong emphasis is placed on clinical interpretability and transparency through the integration of Explainable AI (XAI) tools like SHAP and LIME, which are used to identify the key factors driving readmission risk. The result is a replicable and interpretable framework designed to aid healthcare providers in identifying high-risk individuals and enabling early intervention.
11
Tech Stack
7
Key Features
Technologies Used

Key Features
- ●Based on a novel, manually collected dataset of 3,867 cardiovascular patients, integrating demographic, clinical, and behavioral data.
- ●Implements a robust data preprocessing pipeline, including MICE-inspired KNN imputation for missing values and SMOTE to address class imbalance.
- ●Develops and compares multiple machine learning models, including XGBoost, TabNet, and advanced stacking ensembles.
- ●Employs advanced feature engineering techniques, such as Principal Component Analysis (PCA) and the generation of interaction features.
- ●Integrates Explainable AI (XAI) tools (SHAP and LIME) to provide both global (feature importance) and local (patient-level) model interpretability.
- ●Provides a framework for risk stratification to segment patients into low, medium, and high-risk groups to support clinical decision-making.
- ●Includes analysis of model fairness and generalizability, with evaluations of performance across different patient subgroups.