1
|
Xi H, Kang Q, Jiang X. Machine learning-based risk assessment for cardiovascular diseases in patients with chronic lung diseases. Medicine (Baltimore) 2025; 104:e41672. [PMID: 40068071 PMCID: PMC11902955 DOI: 10.1097/md.0000000000041672] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/27/2024] [Revised: 02/06/2025] [Accepted: 02/07/2025] [Indexed: 03/14/2025] Open
Abstract
The association between chronic lung diseases (CLDs) and the risk of cardiovascular diseases (CVDs) has been extensively recognized. Nevertheless, conventional approaches for CVD risk evaluation cannot fully capture the risk factors (RFs) related to CLDs. This research sought to construct a CLD-specific CVD risk prediction model based on machine learning models and evaluate the prediction performance. The cross-sectional study design was adopted with data retrieved from Waves 1 and 3 of the China Health and Retirement Longitudinal Study, including 1357 participants. Multiple RFs were integrated into the models, including conventional RFs for CVDs, pulmonary function indicators, physical features, and measures of quality of life and psychological state. Four machine learning algorithms, including extreme gradient boosting (XGBoost), logistic regression, random forest, and support vector machine, were evaluated for prediction performance. The XGBoost model displayed superior performance to machine learning algorithms for predictive accuracy (area under the receiver operating characteristic curve [AUC]: 0.788, accuracy: 0.716, sensitivity: 0.615, specificity: 0.803). This model pinpointed the top 5 RFs for CLD-specific CVD RFs: body mass index, age, C-reactive protein, uric acid, and grip strength. Moreover, the prediction performance of the random forest model (AUC: 0.709, accuracy: 0.633) was higher relative to the logistic regression (AUC: 0.619, accuracy: 0.584) and support vector machine (AUC: 0.584, accuracy: 0.548) models. Nonetheless, these models performed less favorably compared to the XGBoost model. The XGBoost model presented the most accurate predictions for CLD-specific CVD risk. This multidimensional risk assessment approach offers a promising avenue for the establishment of personalized prevention strategies targeting CVD in patients with CLDs.
Collapse
Affiliation(s)
- Huiming Xi
- Department of Pulmonary and Critical Care Medicine, Nanchang People's Hospital, Nanchang, China
| | - Qingxin Kang
- Department of Pulmonary and Critical Care Medicine, Nanchang People's Hospital, Nanchang, China
| | - Xunsheng Jiang
- Department of Pulmonary and Critical Care Medicine, Nanchang People's Hospital, Nanchang, China
| |
Collapse
|
2
|
Zhu B, Dai L, Wang H, Zhang K, Zhang C, Wang Y, Yin F, Li J, Ning E, Wang Q, Yang L, Yang H, Li R, Li J, Hu C, Wu H, Jiang H, Bai Y. Machine learning discrimination of Gleason scores below GG3 and above GG4 for HSPC patients diagnosis. Sci Rep 2024; 14:25641. [PMID: 39465343 PMCID: PMC11514210 DOI: 10.1038/s41598-024-77033-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Accepted: 10/18/2024] [Indexed: 10/29/2024] Open
Abstract
This study aims to develop machine learning (ML)-assisted models for analyzing datasets related to Gleason scores in prostate cancer, conducting statistical analyses on the datasets, and identifying meaningful features. We retrospectively collected data from 717 hormone-sensitive prostate cancer (HSPC) patients at Yunnan Cancer Hospital. Of these, data from 526 patients were used for modeling. Seven auxiliary models were established using Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), Decision Tree (DT), Extreme gradient boosting tree (XGBoost), Adaptive Boosting (Adaboost), and artificial neural network (ANN) based on 21 clinical biochemical indicators and features. Evaluation metrics included accuracy (ACC), precision (PRE), specificity (SPE), sensitivity (SEN) or regression rate(Recall), and f1 score. Evaluation metrics for the models primarily included ACC, PRE, SPE, SEN or Recall, f1 score, and area under the curve(AUC). Evaluation metrics were visualized using confusion matrices and ROC curves. Among the ensemble learning methods, RF, XGBoost, and Adaboost performed the best. RF achieved a training dataset score of 0.769 (95% CI: 0.759-0.835) and a testing dataset score of 0.755 (95% CI: 0.660-0.760) (AUC: 0.786, 95%CI: 0.722-0.803), while XGBoost achieved a training dataset score of 0.755 (95% CI: 95%CI: 0.711-0.809) and a testing dataset score of 0.745 (95% CI: 0.660-0.764) (AUC: 0.777, 95% CI: 0.726-0.798). Adaboost scored 0.789 on the training dataset (95% CI: 0.782-0.857) and 0.774 on the testing dataset (95% CI: 0.651-0.774) (AUC: 0.799, 95% CI: 0.703-0.802). In terms of feature importance (FI) in ensemble learning, Bone metastases at first visit, prostatic volume, age, and T1-T2 have significant proportions in RF's FI. fPSA, TPSA, and tumor burden have significant proportions in Adaboost's FI, while f/TPSA, LDH, and testosterone have the highest proportions in XGBoost. Our findings indicate that ensemble learning methods demonstrate good performance in classifying HSPC patient data, with TNM staging and fPSA being important classification indicators. These discoveries provide valuable references for distinguishing different Gleason scores, facilitating more accurate patient assessments and personalized treatment plans.
Collapse
Affiliation(s)
- Bingyu Zhu
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Longguo Dai
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Huijian Wang
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Kun Zhang
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Chongjian Zhang
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Yang Wang
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Feiyu Yin
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Ji Li
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Enfa Ning
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Qilin Wang
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Libo Yang
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Hong Yang
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Ruiqian Li
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Jun Li
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Chen Hu
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Hongyi Wu
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China
| | - Haiyang Jiang
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China.
| | - Yu Bai
- Department of Urology I, The Third Affiliated Hospital of Kunming Medical University (Peking University Cancer Hospital Yunnan, Yunnan Cancer Hospital, Cancer Center of Yunnan Province), 519 Kunzhou Road, Kunming, 650199, Yunnan, China.
| |
Collapse
|
3
|
Tang Y, Liu Y, Du Z, Wang Z, Pan S. Prediction of coronary artery lesions in children with Kawasaki syndrome based on machine learning. BMC Pediatr 2024; 24:158. [PMID: 38443868 PMCID: PMC10916227 DOI: 10.1186/s12887-024-04608-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Accepted: 01/31/2024] [Indexed: 03/07/2024] Open
Abstract
OBJECTIVE Kawasaki syndrome (KS) is an acute vasculitis that affects children < 5 years of age and leads to coronary artery lesions (CAL) in about 20-25% of untreated cases. Machine learning (ML) is a branch of artificial intelligence (AI) that integrates complex data sets on a large scale and uses huge data to predict future events. The purpose of the present study was to use ML to present the model for early risk assessment of CAL in children with KS by different algorithms. METHODS A total of 158 children were enrolled from Women and Children's Hospital, Qingdao University, and divided into 70-30% as the training sets and the test sets for modeling and validation studies. There are several classifiers are constructed for models including the random forest (RF), the logistic regression (LR), and the eXtreme Gradient Boosting (XGBoost). Data preprocessing is analyzed before applying the classifiers to modeling. To avoid the problem of overfitting, the 5-fold cross validation method was used throughout all the data. RESULTS The area under the curve (AUC) of the RF model was 0.925 according to the validation of the test set. The average accuracy was 0.930 (95% CI, 0.905 to 0.956). The AUC of the LG model was 0.888 and the average accuracy was 0.893 (95% CI, 0,837 to 0.950). The AUC of the XGBoost model was 0.879 and the average accuracy was 0.935 (95% CI, 0.891 to 0.980). CONCLUSION The RF algorithm was used in the present study to construct a prediction model for CAL effectively, with an accuracy of 0.930 and AUC of 0.925. The novel model established by ML may help guide clinicians in the initial decision to make a more aggressive initial anti-inflammatory therapy. Due to the limitations of external validation and regional population characteristics, additional research is required to initiate a further application in the clinic.
Collapse
Affiliation(s)
- Yaqi Tang
- Heart Center, Qingdao Women and Children's Hospital, Qingdao University, Qingdao, China
| | - Yuhai Liu
- Dawning International Information Industry Co., Ltd., No. 78 Zhuzhou Road, Laoshan District, Qingdao, China
- Sugon Nanjing Institute, Co., Ltd., No. 519 Chengxin Avenue, Fangyuan Road, Jiangning District, Nanjing, China
| | - Zhanhui Du
- Heart Center, Qingdao Women and Children's Hospital, Qingdao University, Qingdao, China
| | - Zheqi Wang
- School of Mathematics, Jilin University, Changchun, China
| | - Silin Pan
- Heart Center, Qingdao Women and Children's Hospital, Qingdao University, Qingdao, China.
| |
Collapse
|