1 Introduction
Cardiovascular disease (CVD) is the leading global cause of death, claiming nearly 18 million lives each year. Traditional risk assessments focus on hypertension, diabetes, obesity and smoking, but often omit sleep—which emerging evidence links to metabolic dysregulation, inflammation and elevated blood pressure. Our study bridges this gap by integrating sleep quality, duration and efficiency into an explainable machine‑learning framework for CVD risk prediction.
2 Dataset
Wisconsin Sleep Cohort (WSC)
– 2,570 participants
– 230 variables covering demographics, clinical measures and polysomnography‑derived sleep metrics
– Key fields: age, sex, BMI, LDL cholesterol, total sleep time, sleep efficiency, onset latency
3 Methods
-
Preprocessing & Cleaning
- Removed sparse (> 99 % unique or > 90 % single‑value) and highly missing (> 70 %) columns
- Aggregated redundant lifestyle variables
- Imputed continuous values using the median; one‑hot encoded categorical fields
-
Feature Selection
- Statistical significance (p < 0.05)
- Collinearity (r < 0.75)
- Random Forest feature‑importance ranking
-
Modeling
- Algorithm: Logistic regression
- Optimization: Grid search over regularization strength and penalty
- Validation: Stratified cross‑validation to ensure generalizability
-
Explainability
- SHAP (SHapley Additive exPlanations) to quantify each feature’s impact on predicted risk
4 Results
-
Overall performance
– Accuracy: 88.9 %
– Precision: 85.8 %
– Recall: 88.9 %
– F1 score: 86.9 % -
Top predictors
- LDL cholesterol
- Age
- Total non‑REM sleep time
- Sleep efficiency
- Sleep onset latency
Including sleep metrics improved high‑risk detection and offered actionable insights through transparent model explanations.
5 Conclusion
By combining traditional cardiovascular factors with sleep characteristics in an interpretable AI model, we demonstrate:
- Enhanced accuracy in identifying individuals at high CVD risk
- Clinical transparency, fostering practitioner trust
- A paradigm shift advocating routine inclusion of sleep assessment in CVD screening
Future directions: incorporate wearable‑based sleep monitoring, expand to diverse cohorts and explore deep‑learning approaches for richer pattern discovery.