1
|
Li Y, Jin N, Zhan Q, Huang Y, Sun A, Yin F, Li Z, Hu J, Liu Z. Machine learning-based risk predictive models for diabetic kidney disease in type 2 diabetes mellitus patients: a systematic review and meta-analysis. Front Endocrinol (Lausanne) 2025; 16:1495306. [PMID: 40099258 PMCID: PMC11911190 DOI: 10.3389/fendo.2025.1495306] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Accepted: 02/13/2025] [Indexed: 03/19/2025] Open
Abstract
Background Machine learning (ML) models are being increasingly employed to predict the risk of developing and progressing diabetic kidney disease (DKD) in patients with type 2 diabetes mellitus (T2DM). However, the performance of these models still varies, which limits their widespread adoption and practical application. Therefore, we conducted a systematic review and meta-analysis to summarize and evaluate the performance and clinical applicability of these risk predictive models and to identify key research gaps. Methods We conducted a systematic review and meta-analysis to compare the performance of ML predictive models. We searched PubMed, Embase, the Cochrane Library, and Web of Science for English-language studies using ML algorithms to predict the risk of DKD in patients with T2DM, covering the period from database inception to April 18, 2024. The primary performance metric for the models was the area under the receiver operating characteristic curve (AUC) with a 95% confidence interval (CI). The risk of bias was assessed using the Prediction Model Risk of Bias Assessment Tool (PROBAST) checklist. Results 26 studies that met the eligibility criteria were included into the meta-analysis. 25 studies performed internal validation, but only 8 studies conducted external validation. A total of 94 ML models were developed, with 81 models evaluated in the internal validation sets and 13 in the external validation sets. The pooled AUC was 0.839 (95% CI 0.787-0.890) in the internal validation and 0.830 (95% CI 0.784-0.877) in the external validation sets. Subgroup analysis based on the type of ML showed that the pooled AUC for traditional regression ML was 0.797 (95% CI 0.777-0.816), for ML was 0.811 (95% CI 0.785-0.836), and for deep learning was 0.863 (95% CI 0.825-0.900). A total of 26 ML models were included, and the AUCs of models that were used three or more times were pooled. Among them, the random forest (RF) models demonstrated the best performance with a pooled AUC of 0.848 (95% CI 0.785-0.911). Conclusion This meta-analysis demonstrates that ML exhibit high performance in predicting DKD risk in T2DM patients. However, challenges related to data bias during model development and validation still need to be addressed. Future research should focus on enhancing data transparency and standardization, as well as validating these models' generalizability through multicenter studies. Systematic Review Registration https://inplasy.com/inplasy-2024-9-0038/, identifier INPLASY202490038.
Collapse
Affiliation(s)
- Yihan Li
- Department of Geriatrics, Xiyuan Hospital, China Academy of Traditional Chinese Medicine, Beijing, China
| | - Nan Jin
- Department of Geriatrics, Xiyuan Hospital, China Academy of Traditional Chinese Medicine, Beijing, China
| | - Qiuzhong Zhan
- Faculty of Chinese Medicine, Macau University of Science and Technology, Macao, Macao SAR, China
| | - Yue Huang
- Department of Geriatrics, Xiyuan Hospital, China Academy of Traditional Chinese Medicine, Beijing, China
| | - Aochuan Sun
- Department of Geriatrics, Xiyuan Hospital, China Academy of Traditional Chinese Medicine, Beijing, China
- Graduate School of Beijing University of Chinese Medicine, Beijing, China
| | - Fen Yin
- Department of Geriatrics, Xiyuan Hospital, China Academy of Traditional Chinese Medicine, Beijing, China
- Graduate School of Beijing University of Chinese Medicine, Beijing, China
| | - Zhuangzhuang Li
- Department of Geriatrics, Xiyuan Hospital, China Academy of Traditional Chinese Medicine, Beijing, China
| | - Jiayu Hu
- Department of Geriatrics, Xiyuan Hospital, China Academy of Traditional Chinese Medicine, Beijing, China
| | - Zhengtang Liu
- Department of Geriatrics, Xiyuan Hospital, China Academy of Traditional Chinese Medicine, Beijing, China
| |
Collapse
|
2
|
Khamis A, Abdul F, Dsouza S, Sulaiman F, Farooqi M, Al Awadi F, Hassanein M, Ahmed FS, Alsharhan M, AlOlama A, Ali N, Abdulaziz A, Rafie AM, Goswami N, Bayoumi R. Risk of Microvascular Complications in Newly Diagnosed Type 2 Diabetes Patients Using Automated Machine Learning Prediction Models. J Clin Med 2024; 13:7422. [PMID: 39685880 DOI: 10.3390/jcm13237422] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2024] [Revised: 11/27/2024] [Accepted: 12/03/2024] [Indexed: 12/18/2024] Open
Abstract
Background/Objectives: In type 2 diabetes (T2D), collective damage to the eyes, kidneys, and peripheral nerves constitutes microvascular complications, which significantly affect patients' quality of life. This study aimed to prospectively evaluate the risk of microvascular complications in newly diagnosed T2D patients in Dubai, UAE. Methods: Supervised automated machine learning in the Auto-Classifier model of the IBM SPSS Modeler package was used to predict microvascular complications in a training data set of 348 long-term T2D patients with complications using 24 independent variables as predictors and complications as targets. Three automated model scenarios were tested: Full All-Variable Model; Univariate-Selected Model, and Backward Stepwise Logistic Regression Model. An independent cohort of 338 newly diagnosed T2D patients with no complications was used for the model validation. Results: Long-term T2D patients with complications (duration = ~14.5 years) were significantly older (mean age = 56.3 ± 10.9 years) than the newly diagnosed patients without complications (duration = ~2.5 years; mean age = 48.9 ± 9.6 years). The Bayesian Network was the most reliable algorithm for predicting microvascular complications in all three scenarios with an area under the curve (AUC) of 77-87%, accuracy of 68-75%, sensitivity of 86-95%, and specificity of 53-75%. Among newly diagnosed T2D patients, 22.5% were predicted positive and 49.1% negative across all models. Logistic regression applied to the 16 significant predictors between the two sub-groups showed that BMI, HDL, adjusted for age at diagnosis of T2D, age at visit, and urine albumin explained >90% of the variation in microvascular measures. Conclusions: the Bayesian Network model effectively predicts microvascular complications in newly diagnosed T2D patients, highlighting the significant roles of BMI, HDL, age at diagnosis, age at visit, and urine albumin.
Collapse
Affiliation(s)
- Amar Khamis
- Hamdan Bin Mohammed College of Dental Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai P.O. Box 505055, United Arab Emirates
| | - Fatima Abdul
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai P.O. Box 505055, United Arab Emirates
| | - Stafny Dsouza
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai P.O. Box 505055, United Arab Emirates
| | - Fatima Sulaiman
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai P.O. Box 505055, United Arab Emirates
| | - Muhammad Farooqi
- Dubai Diabetes Center, Dubai Health, Dubai P.O. Box 7272, United Arab Emirates
| | - Fatheya Al Awadi
- Endocrinology Department, Dubai Hospital, Dubai Health, Dubai P.O. Box 7272, United Arab Emirates
| | - Mohammed Hassanein
- Endocrinology Department, Dubai Hospital, Dubai Health, Dubai P.O. Box 7272, United Arab Emirates
| | - Fayha Salah Ahmed
- Pathology and Genetics Department, Dubai Hospital, Dubai Health, Dubai P.O. Box 7272, United Arab Emirates
| | - Mouza Alsharhan
- Pathology and Genetics Department, Dubai Hospital, Dubai Health, Dubai P.O. Box 7272, United Arab Emirates
| | - Ayesha AlOlama
- Primary Healthcare Centre, Dubai Health, Dubai P.O. Box 7272, United Arab Emirates
| | - Noorah Ali
- Primary Healthcare Centre, Dubai Health, Dubai P.O. Box 7272, United Arab Emirates
| | - Aaesha Abdulaziz
- Primary Healthcare Centre, Dubai Health, Dubai P.O. Box 7272, United Arab Emirates
| | - Alia Mohammad Rafie
- Primary Healthcare Centre, Dubai Health, Dubai P.O. Box 7272, United Arab Emirates
| | - Nandu Goswami
- Center for Space and Aviation Health, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai P.O. Box 505055, United Arab Emirates
| | - Riad Bayoumi
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai P.O. Box 505055, United Arab Emirates
| |
Collapse
|
3
|
Khosravi M, Mojtabaeian SM, Demiray EKD, Sayar B. A Systematic Review of the Outcomes of Utilization of Artificial Intelligence Within the Healthcare Systems of the Middle East: A Thematic Analysis of Findings. Health Sci Rep 2024; 7:e70300. [PMID: 39720235 PMCID: PMC11667773 DOI: 10.1002/hsr2.70300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Revised: 12/03/2024] [Accepted: 12/08/2024] [Indexed: 12/26/2024] Open
Abstract
Background and Aims The rapid expansion of artificial intelligence (AI) within worldwide healthcare systems is occurring at a significant rate. In this context, the Middle East has demonstrated distinctive characteristics in the application of AI within the healthcare sector, particularly shaped by regional policies. This study examined the outcomes resulting from the utilization of AI within healthcare systems in the Middle East. Methods A systematic review was conducted across several databases, including PubMed, Scopus, ProQuest, and the Cochrane Database of Systematic Reviews in 2024. The quality assessment of the included studies was conducted using the Authority, Accuracy, Coverage, Objectivity, Date, Significance checklist. Following this, a thematic analysis was carried out on the acquired data, adhering to the Boyatzis approach. Results 100 papers were included. The quality and bias risk of the included studies were delineated to be within an acceptable range. Multiple themes were derived from the thematic analysis including: "Prediction of diseases, their diagnosis, and outcomes," "Prediction of organizational issues and attributes," "Prediction of mental health issues and attributes," "Prediction of polypharmacy and emotional analysis of texts," "Prediction of climate change issues and attributes," and "Prediction and identification of success and satisfaction among healthcare individuals." Conclusion The findings emphasized AI's significant potential in addressing prevalent healthcare challenges in the Middle East, such as cancer, diabetes, and climate change. AI has the potential to overhaul the healthcare systems. The findings also highlighted the need for policymakers and administrators to develop a concrete plan to effectively integrate AI into healthcare systems.
Collapse
Affiliation(s)
- Mohsen Khosravi
- Imam Hossein Hospital Shahroud University of Medical Sciences Shahroud Iran
| | - Seyyed Morteza Mojtabaeian
- Department of Healthcare Services Management, School of Management and Medical Informatics Shiraz University of Medical Sciences Shiraz Iran
| | | | - Burak Sayar
- Bitlis Eren University Vocational School of Social Sciences Bitlis Türkiye
| |
Collapse
|
4
|
Dholariya S, Dutta S, Sonagra A, Kaliya M, Singh R, Parchwani D, Motiani A. Unveiling the utility of artificial intelligence for prediction, diagnosis, and progression of diabetic kidney disease: an evidence-based systematic review and meta-analysis. Curr Med Res Opin 2024; 40:2025-2055. [PMID: 39474800 DOI: 10.1080/03007995.2024.2423737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/05/2024] [Revised: 10/25/2024] [Accepted: 10/28/2024] [Indexed: 11/14/2024]
Abstract
OBJECTIVE The purpose of this study was to conduct a systematic investigation of the potential of artificial intelligence (AI) models in the prediction, detection of diagnostic biomarkers, and progression of diabetic kidney disease (DKD). In addition, we compared the performance of non-logistic regression (LR) machine learning (ML) models to conventional LR prediction models. METHODS Until January 30, 2024, a comprehensive literature review was conducted by investigating databases such as Medline (via PubMed) and Cochrane. Research that is inclusive of AI or ML models for the prediction, diagnosis, and progression of DKD was incorporated. The area under the Receiver Operating Characteristic Curve (AUROC) served as the principal outcome metric for assessing model performance. A meta-analysis was performed utilizing MedCalc statistical software to calculate pooled AUROC and assess the performance differences between LR and non-LR models. RESULTS A total of 57 studies were included in the meta-analysis. The pooled AUROC of AI or ML model was 0.84 (95% CI = 0.81-0.86, p < 0.0001) for analyzing prediction of DKD, 0.88 (95%CI = 0.84-0.92, p < 0.0001) for detecting diagnostic biomarkers, and 0.80 (95% CI = 0.77-0.82, p < 0.0001) for analyzing progression of DKD. The pooled AUROC of LR and non-LR ML models exhibited no significant differences across all categories (p > 0.05), except for the random forest (RF) model, which displayed a statistically significant increase in predictive accuracy compared to LR for DKD occurrence (p < 0.04). CONCLUSION ML models showed solid DKD prediction effectiveness, with pooled AUROC values over 0.8, suggesting good performance. These data demonstrated that non-LR and LR models perform similarly in overall CKD management, but the RF model outperforms the LR model, particularly in predicting the occurrence of DKD. These findings highlight the promise of AI technologies for better DKD management. To improve model reliability, future study should include extended follow-up periods as well as external validation.
Collapse
Affiliation(s)
- Sagar Dholariya
- Department of Biochemistry, All India Institute of Medical Sciences, Rajkot, India
| | - Siddhartha Dutta
- Department of Pharmacology, All India Institute of Medical Sciences, Rajkot, India
| | - Amit Sonagra
- Department of Biochemistry, All India Institute of Medical Sciences, Rajkot, India
| | - Mehul Kaliya
- General Medicine, Department of General Medicine, All India Institute of Medical Sciences, Rajkot, India
| | - Ragini Singh
- Department of Biochemistry, All India Institute of Medical Sciences, Rajkot, India
| | - Deepak Parchwani
- Department of Biochemistry, All India Institute of Medical Sciences, Rajkot, India
| | - Anita Motiani
- Department of Biochemistry, All India Institute of Medical Sciences, Rajkot, India
| |
Collapse
|
5
|
Zhang X, Qiu W, Huang J, Pang X, Su Y, Ye J, Zhou S, Tang Z, Wang R, Su R. Insulin combined with N-acetylcysteine attenuates type 1 diabetes-induced splenic inflammatory injury in canines by inhibiting the MAPKs-NF-κB signaling pathway and pyroptosis. J Diabetes Complications 2024; 38:108805. [PMID: 39089052 DOI: 10.1016/j.jdiacomp.2024.108805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Revised: 07/05/2024] [Accepted: 07/06/2024] [Indexed: 08/03/2024]
Abstract
PURPOSE Type 1 diabetes (T1DM) is a chronic metabolic disorder that can cause damage to multiple organs including the spleen. Sole insulin therapy is not satisfactory. This study aims to investigate the effects and mechanisms of combined treatment with insulin and N-acetylcysteine (NAC) on spleen damage in T1DM canines, in order to identify drugs that may better assist patients in the management of diabetes and its complications. METHODS The canine model of T1DM was established by intravenous injection of alloxan (ALX) and streptozotocin (STZ). The therapeutic effects of insulin and NAC were evaluated by clinical manifestations, spleen protein and mRNA expression. RESULTS The results indicate that the combined treatment of insulin and NAC can alleviate hyperglycemia and hematologic abnormalities, improve splenic histopathological changes, prevent fibrous tissue proliferation, and glycogen deposition. In addition, we observed that this combination treatment significantly suppressed the protein expression of p-P65/P65 (17.6 %, P < 0.05), NLRP3 (46.8 %, P < 0.05), and p-P38/P38 (37.1 %, P < 0.05) induced by T1DM when compared to insulin treatment alone. Moreover, it also significantly decreased the mRNA expression of TLR4 (45.0 %, P < 0.01), TNF-α (30.3 %, P < 0.05), and NLRP3 (43.3 %, P < 0.05). CONCLUSIONS This combination has the potential to mitigate splenic inflammatory injury in T1DM canines by suppressing the activation of MAPKs-NF-κB pathway and pyroptosis. These findings provide a reference for the treatment strategies of diabetes and its complications.
Collapse
Affiliation(s)
- Xinting Zhang
- College of Veterinary Medicine, South China Agricultural University, 483 Wushan road, Tianhe district, Guangzhou, 510642, People's Republic of China
| | - Wenyue Qiu
- College of Veterinary Medicine, South China Agricultural University, 483 Wushan road, Tianhe district, Guangzhou, 510642, People's Republic of China
| | - Jianjia Huang
- College of Veterinary Medicine, South China Agricultural University, 483 Wushan road, Tianhe district, Guangzhou, 510642, People's Republic of China
| | - Xiaoyue Pang
- College of Veterinary Medicine, South China Agricultural University, 483 Wushan road, Tianhe district, Guangzhou, 510642, People's Republic of China
| | - Yiman Su
- College of Veterinary Medicine, South China Agricultural University, 483 Wushan road, Tianhe district, Guangzhou, 510642, People's Republic of China
| | - Jiali Ye
- College of Veterinary Medicine, South China Agricultural University, 483 Wushan road, Tianhe district, Guangzhou, 510642, People's Republic of China
| | - Shuilian Zhou
- College of Veterinary Medicine, South China Agricultural University, 483 Wushan road, Tianhe district, Guangzhou, 510642, People's Republic of China
| | - Zhaoxin Tang
- College of Veterinary Medicine, South China Agricultural University, 483 Wushan road, Tianhe district, Guangzhou, 510642, People's Republic of China
| | - Rongmei Wang
- Henry Fok College of Biology and Agriculture, Shaoguan University, No. 288, Daxue Road, Zhenjiang District, Shaoguan, 512005, People's Republic of China
| | - Rongsheng Su
- College of Veterinary Medicine, South China Agricultural University, 483 Wushan road, Tianhe district, Guangzhou, 510642, People's Republic of China.
| |
Collapse
|
6
|
Chen L, Shao X, Yu P. Machine learning prediction models for diabetic kidney disease: systematic review and meta-analysis. Endocrine 2024; 84:890-902. [PMID: 38141061 DOI: 10.1007/s12020-023-03637-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Accepted: 11/28/2023] [Indexed: 12/24/2023]
Abstract
BACKGROUND Machine learning is increasingly recognized as a viable approach for identifying risk factors associated with diabetic kidney disease (DKD). However, the current state of real-world research lacks a comprehensive systematic analysis of the predictive performance of machine learning (ML) models for DKD. OBJECTIVES The objectives of this study were to systematically summarize the predictive capabilities of various ML methods in forecasting the onset and the advancement of DKD, and to provide a basic outline for ML methods in DKD. METHODS We have searched mainstream databases, including PubMed, Web of Science, Embase, and MEDLINE databases to obtain the eligible studies. Subsequently, we categorized various ML techniques and analyzed the differences in their performance in predicting DKD. RESULTS Logistic regression (LR) was the prevailing ML method, yielding an overall pooled area under the receiver operating characteristic curve (AUROC) of 0.83. On the other hand, the non-LR models also performed well with an overall pooled AUROC of 0.80. Our t-tests showed no statistically significant difference in predicting ability between LR and non-LR models (t = 1.6767, p > 0.05). CONCLUSION All ML predicting models yielded relatively satisfied DKD predicting ability with their AUROCs greater than 0.7. However, we found no evidence that non-LR models outperformed the LR model. LR exhibits high performance or accuracy in practice, while it is known for algorithmic simplicity and computational efficiency compared to others. Thus, LR may be considered a cost-effective ML model in practice.
Collapse
Affiliation(s)
- Lianqin Chen
- NHC Key Laboratory of Hormones and Development, Tianjin Key Laboratory of Metabolic Diseases, Chu Hsien-I Memorial Hospital & Tianjin Institute of Endocrinology, Tianjin Medical University, Tianjin, 300134, China
| | - Xian Shao
- NHC Key Laboratory of Hormones and Development, Tianjin Key Laboratory of Metabolic Diseases, Chu Hsien-I Memorial Hospital & Tianjin Institute of Endocrinology, Tianjin Medical University, Tianjin, 300134, China
| | - Pei Yu
- NHC Key Laboratory of Hormones and Development, Tianjin Key Laboratory of Metabolic Diseases, Chu Hsien-I Memorial Hospital & Tianjin Institute of Endocrinology, Tianjin Medical University, Tianjin, 300134, China.
| |
Collapse
|
7
|
Oliullah K, Rasel MH, Islam MM, Islam MR, Wadud MAH, Whaiduzzaman M. A stacked ensemble machine learning approach for the prediction of diabetes. J Diabetes Metab Disord 2024; 23:603-617. [PMID: 38932863 PMCID: PMC11196524 DOI: 10.1007/s40200-023-01321-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Accepted: 09/22/2023] [Indexed: 06/28/2024]
Abstract
Objectives Diabetes has become a leading cause of mortality in both developed and developing countries, impacting a growing number of individuals worldwide. As the prevalence of the disease continues to rise, researchers have diligently worked towards developing accurate diabetes prediction models. The primary aim of this study is to utilize a diverse set of machine learning algorithms to detect the presence of diabetes, particularly in females, at an early stage. By leveraging these methods, this research seeks to provide physicians with valuable tools to identify the disease early, enabling timely interventions and improving patient outcomes. Methods In this study, some state-of-the-art machine learning techniques, such as random forest classifiers with gridsearchCV, XGBoost, NGBoost, Bagging, LightGBM, and AdaBoost classifiers, were employed. These models were chosen as the base layer of our proposed stacked ensemble model because of their high accuracy. Before feeding the data into the models, the dataset was preprocessed to ensure optimal performance and obtain improved results. Results The accuracy achieved in this study was 92.91%, which demonstrates its competitiveness with the existing approaches. Moreover, the utilization of the Shapley additive explanation (SHAP) facilitated the interpretation of machine learning models. Conclusion We anticipate that these findings will be beneficial to healthcare providers, stakeholders, students, and researchers involved in diabetes prediction research and development.
Collapse
Affiliation(s)
- Khondokar Oliullah
- Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka, Bangladesh
| | - Mahedi Hasan Rasel
- Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka, Bangladesh
| | - Md. Manzurul Islam
- Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka, Bangladesh
| | - Md. Reazul Islam
- Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka, Bangladesh
| | - Md. Anwar Hussen Wadud
- Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka, Bangladesh
| | - Md. Whaiduzzaman
- School of Information Systems, Queensland University of Technology, Brisbane, Australia
| |
Collapse
|
8
|
Wu S, Wang H, Pan D, Guo J, Zhang F, Ning Y, Gu Y, Guo L. Navigating the future of diabetes: innovative nomogram models for predicting all-cause mortality risk in diabetic nephropathy. BMC Nephrol 2024; 25:127. [PMID: 38600468 PMCID: PMC11008048 DOI: 10.1186/s12882-024-03563-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Accepted: 03/26/2024] [Indexed: 04/12/2024] Open
Abstract
OBJECTIVE This study aims to establish and validate a nomogram model for the all-cause mortality rate in patients with diabetic nephropathy (DN). METHODS We analyzed data from the National Health and Nutrition Examination Survey (NHANES) spanning from 2007 to 2016. A random split of 7:3 was performed between the training and validation sets. Utilizing follow-up data until December 31, 2019, we examined the all-cause mortality rate. Cox regression models and Least Absolute Shrinkage and Selection Operator (LASSO) regression models were employed in the training cohort to develop a nomogram for predicting all-cause mortality in the studied population. Finally, various validation methods were employed to assess the predictive performance of the nomogram, and Decision Curve Analysis (DCA) was conducted to evaluate the clinical utility of the nomogram. RESULTS After the results of LASSO regression models and Cox multivariate analyses, a total of 8 variables were selected, gender, age, poverty income ratio, heart failure, body mass index, albumin, blood urea nitrogen and serum uric acid. A nomogram model was built based on these predictors. The C-index values in training cohort of 3-year, 5-year, 10-year mortality rates were 0.820, 0.807, and 0.798. In the validation cohort, the C-index values of 3-year, 5-year, 10-year mortality rates were 0.773, 0.788, and 0.817, respectively. The calibration curve demonstrates satisfactory consistency between the two cohorts. CONCLUSION The newly developed nomogram proves to be effective in predicting the all-cause mortality risk in patients with diabetic nephropathy, and it has undergone robust internal validation.
Collapse
Affiliation(s)
- Sensen Wu
- Department of Vascular Surgery, Xuanwu Hospital, Capital Medical University, 45 Changchun Street, Xicheng District, Beijing, China
| | - Hui Wang
- Department of Vascular Surgery, Xuanwu Hospital, Capital Medical University, 45 Changchun Street, Xicheng District, Beijing, China
| | - Dikang Pan
- Department of Vascular Surgery, Xuanwu Hospital, Capital Medical University, 45 Changchun Street, Xicheng District, Beijing, China
| | - Julong Guo
- Department of Vascular Surgery, Xuanwu Hospital, Capital Medical University, 45 Changchun Street, Xicheng District, Beijing, China
| | - Fan Zhang
- Department of Vascular Surgery, Xuanwu Hospital, Capital Medical University, 45 Changchun Street, Xicheng District, Beijing, China
| | - Yachan Ning
- Department of Intensive Care Medicine, Xuanwu Hospital, Capital Medical University, Beijing, China
| | - Yongquan Gu
- Department of Vascular Surgery, Xuanwu Hospital, Capital Medical University, 45 Changchun Street, Xicheng District, Beijing, China.
| | - Lianrui Guo
- Department of Vascular Surgery, Xuanwu Hospital, Capital Medical University, 45 Changchun Street, Xicheng District, Beijing, China.
| |
Collapse
|
9
|
Kanbour S, Harris C, Lalani B, Wolf RM, Fitipaldi H, Gomez MF, Mathioudakis N. Machine Learning Models for Prediction of Diabetic Microvascular Complications. J Diabetes Sci Technol 2024; 18:273-286. [PMID: 38189280 PMCID: PMC10973856 DOI: 10.1177/19322968231223726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/09/2024]
Abstract
IMPORTANCE AND AIMS Diabetic microvascular complications significantly impact morbidity and mortality. This review focuses on machine learning/artificial intelligence (ML/AI) in predicting diabetic retinopathy (DR), diabetic kidney disease (DKD), and diabetic neuropathy (DN). METHODS A comprehensive PubMed search from 1990 to 2023 identified studies on ML/AI models for diabetic microvascular complications. The review analyzed study design, cohorts, predictors, ML techniques, prediction horizon, and performance metrics. RESULTS Among the 74 identified studies, 256 featured internally validated ML models and 124 had externally validated models, with about half being retrospective. Since 2010, there has been a rise in the use of ML for predicting microvascular complications, mainly driven by DKD research across 27 countries. A more modest increase in ML research on DR and DN was observed, with publications from fewer countries. For all microvascular complications, predictive models achieved a mean (standard deviation) c-statistic of 0.79 (0.09) on internal validation and 0.72 (0.12) on external validation. Diabetic kidney disease models had the highest discrimination, with c-statistics of 0.81 (0.09) on internal validation and 0.74 (0.13) on external validation, respectively. Few studies externally validated prediction of DN. The prediction horizon, outcome definitions, number and type of predictors, and ML technique significantly influenced model performance. CONCLUSIONS AND RELEVANCE There is growing global interest in using ML for predicting diabetic microvascular complications. Research on DKD is the most advanced in terms of publication volume and overall prediction performance. Both DR and DN require more research. External validation and adherence to recommended guidelines are crucial.
Collapse
Affiliation(s)
| | - Catharine Harris
- Division of Endocrinology, Diabetes,
& Metabolism, Johns Hopkins University School of Medicine, Baltimore, MD,
USA
| | - Benjamin Lalani
- Division of Endocrinology, Diabetes,
& Metabolism, Johns Hopkins University School of Medicine, Baltimore, MD,
USA
| | - Risa M. Wolf
- Division of Pediatric Endocrinology,
Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Hugo Fitipaldi
- Department of Clinical Sciences, Lund
University Diabetes Centre, Lund University, Malmö, Sweden
| | - Maria F. Gomez
- Department of Clinical Sciences, Lund
University Diabetes Centre, Lund University, Malmö, Sweden
| | - Nestoras Mathioudakis
- Division of Endocrinology, Diabetes,
& Metabolism, Johns Hopkins University School of Medicine, Baltimore, MD,
USA
| |
Collapse
|
10
|
Huang AA, Huang SY. Increasing transparency in machine learning through bootstrap simulation and shapely additive explanations. PLoS One 2023; 18:e0281922. [PMID: 36821544 PMCID: PMC9949629 DOI: 10.1371/journal.pone.0281922] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Accepted: 02/05/2023] [Indexed: 02/24/2023] Open
Abstract
Machine learning methods are widely used within the medical field. However, the reliability and efficacy of these models is difficult to assess, making it difficult for researchers to identify which machine-learning model to apply to their dataset. We assessed whether variance calculations of model metrics (e.g., AUROC, Sensitivity, Specificity) through bootstrap simulation and SHapely Additive exPlanations (SHAP) could increase model transparency and improve model selection. Data from the England National Health Services Heart Disease Prediction Cohort was used. After comparison of model metrics for XGBoost, Random Forest, Artificial Neural Network, and Adaptive Boosting, XGBoost was used as the machine-learning model of choice in this study. Boost-strap simulation (N = 10,000) was used to empirically derive the distribution of model metrics and covariate Gain statistics. SHapely Additive exPlanations (SHAP) to provide explanations to machine-learning output and simulation to evaluate the variance of model accuracy metrics. For the XGBoost modeling method, we observed (through 10,000 completed simulations) that the AUROC ranged from 0.771 to 0.947, a difference of 0.176, the balanced accuracy ranged from 0.688 to 0.894, a 0.205 difference, the sensitivity ranged from 0.632 to 0.939, a 0.307 difference, and the specificity ranged from 0.595 to 0.944, a 0.394 difference. Among 10,000 simulations completed, we observed that the gain for Angina ranged from 0.225 to 0.456, a difference of 0.231, for Cholesterol ranged from 0.148 to 0.326, a difference of 0.178, for maximum heart rate (MaxHR) ranged from 0.081 to 0.200, a range of 0.119, and for Age ranged from 0.059 to 0.157, difference of 0.098. Use of simulations to empirically evaluate the variability of model metrics and explanatory algorithms to observe if covariates match the literature are necessary for increased transparency, reliability, and utility of machine learning methods. These variance statistics, combined with model accuracy statistics can help researchers identify the best model for a given dataset.
Collapse
Affiliation(s)
- Alexander A. Huang
- Department of Statistics and Data Science, Cornell University, Ithaca, New York, United States of America
- Department of MD Education, Northwestern University Feinberg School of Medicine, Chicago, Illinois, United States of America
| | - Samuel Y. Huang
- Department of Statistics and Data Science, Cornell University, Ithaca, New York, United States of America
- Department of Internal Medicine, Virginia Commonwealth University School of Medicine, Richmond, Virginia, United States of America
| |
Collapse
|