Applying data mining and machine learning (ML) techniques to clinical data might identify predictive biomarkers for diabetic nephropathy (DN), a common complication of type 2 diabetes mellitus (T2DM). A retrospective analysis of the Action to Control Cardiovascular Risk in Diabetes (ACCORD) trial was intended to identify such factors using ML. The longitudinal data were stratified by time after patient enrollment to differentiate early and late predictors. Our results showed that Random Forest and Simple Logistic Regression methods exhibited the best performance among the evaluated algorithms. Baseline values for glomerular filtration rate (GFR), urinary creatinine, urinary albumin, potassium, cholesterol, low-density lipoprotein, and urinary albumin to creatinine ratio were identified as DN predictors. Early predictors were the baseline values of GFR, systolic blood pressure, as well as fasting plasma glucose (FPG) and potassium at month 4. Changes per year in GFR, FPG, and triglycerides were recognized as predictors of late development. In conclusion, ML-based methods successfully identified predictive factors for DN among patients with T2DM.
ASJC Scopus subject areas
- Biochemistry, Genetics and Molecular Biology(all)
- Pharmacology, Toxicology and Pharmaceutics(all)