Deakin University
Browse
islam-machinelearning-inpress-2022.pdf (4.73 MB)

Machine learning models for prediction of co-occurrence of diabetes and cardiovascular diseases: a retrospective cohort study

Download (4.73 MB)
journal contribution
posted on 2022-01-01, 00:00 authored by A S Abdalrada, Jemal AbawajyJemal Abawajy, T Al-Quraishi, Shariful IslamShariful Islam
Abstract
Background
Diabetic mellitus (DM) and cardiovascular diseases (CVD) cause significant healthcare burden globally and often co-exists. Current approaches often fail to identify many people with co-occurrence of DM and CVD, leading to delay in healthcare seeking, increased complications and morbidity. In this paper, we aimed to develop and evaluate a two-stage machine learning (ML) model to predict the co-occurrence of DM and CVD.

Methods
We used the diabetes complications screening research initiative (DiScRi) dataset containing >200 variables from >2000 participants. In the first stage, we used two ML models (logistic regression and Evimp functions) implemented in multivariate adaptive regression splines model to infer the significant common risk factors for DM and CVD and applied the correlation matrix to reduce redundancy. In the second stage, we used classification and regression algorithm to develop our model. We evaluated the prediction models using prediction accuracy, sensitivity and specificity as performance metrics.

Results
Common risk factors for DM and CVD co-occurrence was family history of the diseases, gender, deep breathing heart rate change, lying to standing blood pressure change, HbA1c, HDL and TCHDL ratio. The predictive model showed that the participants with HbA1c >6.45 and TCHDL ratio > 5.5 were at risk of developing both diseases (97.9% probability). In contrast, participants with HbA1c >6.45 and TCHDL ratio ≤ 5.5 were more likely to have only DM (84.5% probability) and those with HbA1c ≤5.45 and HDL >1.45 were likely to be healthy (82.4%. probability). Further, participants with HbA1c ≤5.45 and HDL <1.45 were at risk of only CVD (100% probability). The predictive accuracy of the ML model to detect co-occurrence of DM and CVD is 94.09%, sensitivity 93.5%, and specificity 95.8%.

Conclusions
Our ML model can significantly predict with high accuracy the co-occurrence of DM and CVD in people attending a screening program. This might help in early detection of patients with DM and CVD who could benefit from preventive treatment and reduce future healthcare burden.

History

Journal

Journal of Diabetes and Metabolic Disorders

Pagination

1 - 11

Publisher

Springer

Location

Berlin, Germany

eISSN

2251-6581

Language

English

Publication classification

C1 Refereed article in a scholarly journal