1
|
Naskar S, Sharma S, Kuotsu K, Halder S, Pal G, Saha S, Mondal S, Biswas UK, Jana M, Bhattacharjee S. The biomedical applications of artificial intelligence: an overview of decades of research. J Drug Target 2025; 33:717-748. [PMID: 39744873 DOI: 10.1080/1061186x.2024.2448711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2024] [Revised: 12/13/2024] [Accepted: 12/26/2024] [Indexed: 01/11/2025]
Abstract
A significant area of computer science called artificial intelligence (AI) is successfully applied to the analysis of intricate biological data and the extraction of substantial associations from datasets for a variety of biomedical uses. AI has attracted significant interest in biomedical research due to its features: (i) better patient care through early diagnosis and detection; (ii) enhanced workflow; (iii) lowering medical errors; (v) lowering medical costs; (vi) reducing morbidity and mortality; (vii) enhancing performance; (viii) enhancing precision; and (ix) time efficiency. Quantitative metrics are crucial for evaluating AI implementations, providing insights, enabling informed decisions, and measuring the impact of AI-driven initiatives, thereby enhancing transparency, accountability, and overall impact. The implementation of AI in biomedical fields faces challenges such as ethical and privacy concerns, lack of awareness, technology unreliability, and professional liability. A brief discussion is given of the AI techniques, which include Virtual screening (VS), DL, ML, Hidden Markov models (HMMs), Neural networks (NNs), Generative models (GMs), Molecular dynamics (MD), and Structure-activity relationship (SAR) models. The study explores the application of AI in biomedical fields, highlighting its enhanced predictive accuracy, treatment efficacy, diagnostic efficiency, faster decision-making, personalised treatment strategies, and precise medical interventions.
Collapse
Affiliation(s)
- Sweet Naskar
- Department of Pharmaceutics, Institute of Pharmacy, Kalyani, West Bengal, India
| | - Suraj Sharma
- Department of Pharmaceutics, Sikkim Professional College of Pharmaceutical Sciences, Sikkim, India
| | - Ketousetuo Kuotsu
- Department of Pharmaceutical Technology, Jadavpur University, Kolkata, West Bengal, India
| | - Suman Halder
- Medical Department, Department of Indian Railway, Kharagpur Division, Kharagpur, West Bengal, India
| | - Goutam Pal
- Service Dispensary, ESI Hospital, Hoogly, West Bengal, India
| | - Subhankar Saha
- Department of Pharmaceutical Technology, Jadavpur University, Kolkata, West Bengal, India
| | - Shubhadeep Mondal
- Department of Pharmacology, Momtaz Begum Pharmacy College, Rajarhat, West Bengal, India
| | - Ujjwal Kumar Biswas
- School of Pharmaceutical Science (SPS), Siksha O Anusandhan (SOA) University, Bhubaneswar, Odisha, India
| | - Mayukh Jana
- School of Pharmacy, Centurion University of Technology and Management, Centurion University, Bhubaneswar, Odisha, India
| | - Sunirmal Bhattacharjee
- Department of Pharmaceutics, Bharat Pharmaceutical Technology, Amtali, Agartala, Tripura, India
| |
Collapse
|
2
|
Yang Q, Liu C, Wang Y, Dong G, Sun J. Construction of risk prediction model of sentinel lymph node metastasis in breast cancer patients based on machine learning algorithm. Discov Oncol 2025; 16:704. [PMID: 40341608 PMCID: PMC12061823 DOI: 10.1007/s12672-025-02493-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/02/2025] [Accepted: 04/24/2025] [Indexed: 05/10/2025] Open
Abstract
PURPOSE The aim of this study was to develop and validate a machine learning (ML) based prediction model for sentinel lymph node metastasis in breast cancer to identify patients with a high risk of sentinel lymph node metastasis. METHODS In this machine learning study, we retrospectively collected 225 female breast cancer patients who underwent sentinel lymph node biopsy (SLNB). Feature screening was performed using the logistic regression analysis. Subsequently, five ML algorithms, namely LOGIT, LASSO, XGBOOST, RANDOM FOREST model and GBM model were employed to train and develop an ML model. In addition, model interpretation was performed by the Shapley Additive Explanations (SHAP) analysis to clarify the importance of each feature of the model and its decision basis. RESULTS Combined univariate and multivariate logistic regression analysis, identified Multifocal, LVI, Maximum Diameter, Shape US, Maximum Cortical Thickness as significant predictors. We than successfully leveraged machine learning algorithms, particularly the RANDOM FOREST model, to develop a predictive model for sentinel lymph node metastasis in breast cancer. Finally, the SHAP method identified Maximum Diameter and Maximum Cortical Thickness as the primary decision factors influencing the ML model's predictions. CONCLUSION With the integration of pathological and imaging characteristics, ML algorithm can accurately predict sentinel lymph node metastasis in breast cancer patients. The RANDOM FOREST model showed ideal performance. With the incorporation of these models in the clinic, can helpful for clinicians to identify patients at risk of sentinel lymph node metastasis of breast cancer and make more reasonable treatment decisions.
Collapse
Affiliation(s)
- Qianmei Yang
- Department of Ultrasound, The First Affiliated Hospital of Chongqing University of Chinese Medicine, Chongqing Hospital of Traditional Chinese Medicine, Chongqing, 400021, China
| | - Cuifang Liu
- Department of Radiology, The First Affiliated Hospital of Chongqing University of Chinese Medicine, Chongqing Hospital of Traditional Chinese Medicine, Chongqing, 400021, China
| | - Yongyue Wang
- Department of Mammary Gland, The First Affiliated Hospital of Chongqing University of Chinese Medicine, Chongqing Hospital of Traditional Chinese Medicine, Chongqing, 400021, China
| | - Guifang Dong
- Department of Ultrasound, The First Affiliated Hospital of Chongqing University of Chinese Medicine, Chongqing Hospital of Traditional Chinese Medicine, Chongqing, 400021, China
| | - Jinghuan Sun
- Department of Traditional Chinese Medicine, ChongQing JiangJin District Hospital of Chinese Medicine (Jiangjin Hospital, Chongqing University of Chinese Medicin), Chongqing, 402260, China.
| |
Collapse
|
3
|
Zhang W, Song X, Wang X, Jiang Z, Zhang Y, Cui Y. Network analysis of core factors related to non-suicidal self-injury in adolescents with mood disorders. Front Psychiatry 2025; 16:1557351. [PMID: 40364995 PMCID: PMC12069291 DOI: 10.3389/fpsyt.2025.1557351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/08/2025] [Accepted: 04/07/2025] [Indexed: 05/15/2025] Open
Abstract
Introduction Adolescents with mood disorders are at an exceptionally high risk for non-suicidal self-injury (NSSI); however, the understanding of the core factors underlying this vulnerability remains limited. This knowledge gap significantly hampers the effectiveness of targeted prevention and intervention strategies. Methods A total of 263 adolescents with mood disorders completed a series of self-report surveys, covering demographic, personal, and social factors related to NSSI. We first used least absolute shrinkage and selection operator (LASSO) regression to identify the core related factors. Then, we employed network analysis to construct the network structure of these core factors. Results Our findings indicate that depressive and anxiety symptoms are the strongest influencing factors for NSSI among adolescents with mood disorders. Life events and the specific functions of NSSI are identified as personalized factors within this group. Additionally, objective social support and education level emerged as potential protective factors against NSSI. These factors are not independent but interact with each other. Conclusion By identifying and intervening in these key factors, more effective prevention strategies and personalized treatment plans can be developed, ultimately improving the quality of life and psychological well-being of adolescents with mood disorders.
Collapse
Affiliation(s)
- Wenyan Zhang
- Department of Psychiatry, Beijing Children’s Hospital, Capital Medical University, National Center for Children Healthy, Beijing, China
| | - Xiaohui Song
- Out-patient Department, Jining NO.2 People’s Hospital, Jingning, Shandong, China
| | - Xianbin Wang
- Department of Psychiatry, Beijing Children’s Hospital, Capital Medical University, National Center for Children Healthy, Beijing, China
| | - Zhongliang Jiang
- Department of Psychiatry, Beijing Children’s Hospital, Capital Medical University, National Center for Children Healthy, Beijing, China
| | - Yuebing Zhang
- Department of Psychiatry, Shandong Daizhuang Hospital, Jingning, Shandong, China
| | - Yonghua Cui
- Department of Psychiatry, Beijing Children’s Hospital, Capital Medical University, National Center for Children Healthy, Beijing, China
| |
Collapse
|
4
|
Sugimoto H, Hironaka KI, Nakamura T, Yamada T, Miura H, Otowa-Suematsu N, Fujii M, Hirota Y, Sakaguchi K, Ogawa W, Kuroda S. Improved detection of decreased glucose handling capacities via continuous glucose monitoring-derived indices. COMMUNICATIONS MEDICINE 2025; 5:103. [PMID: 40263561 PMCID: PMC12015487 DOI: 10.1038/s43856-025-00819-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Accepted: 03/24/2025] [Indexed: 04/24/2025] Open
Abstract
BACKGROUND Efficiently assessing glucose handling capacity is a critical public health challenge. This study assessed the utility of relatively easy-to-measure continuous glucose monitoring (CGM)-derived indices in estimating glucose handling capacities calculated from resource-intensive clamp tests. METHODS We conducted a prospective study of 64 individuals without prior diabetes diagnosis. The study performed CGM, oral glucose tolerance tests (OGTT), and hyperglycemic and hyperinsulinemic-euglycemic clamp tests. We validated CGM-derived indices characteristics using an independent dataset from another country and mathematical models with simulated data. RESULTS A CGM-derived index reflecting the autocorrelation function of glucose levels (AC_Var) is significantly correlated with clamp-derived disposition index (DI), a well-established measure of glucose handling capacity and predictor of diabetes onset. Multivariate and machine learning models indicate AC_Var's contribution to predicting clamp-derived DI independent from other CGM-derived indices. The model using CGM-measured glucose standard deviation and AC_Var outperforms models using commonly used diabetes diagnostic indices, such as fasting blood glucose, HbA1c, and OGTT measures, in predicting clamp-derived DI. Mathematical simulations also demonstrate the association of AC_Var with DI. CONCLUSIONS CGM-derived indices, including AC_Var, serve as valuable tools for predicting glucose handling capacities in populations without prior diabetes diagnosis. We develop a web application that calculates these CGM-derived indices ( https://cgm-ac-mean-std.streamlit.app/ ).
Collapse
Affiliation(s)
- Hikaru Sugimoto
- Department of Biochemistry and Molecular Biology, Graduate School of Medicine, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-0033, Japan
| | - Ken-Ichi Hironaka
- Department of Biological Sciences, Graduate School of Science, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-0033, Japan
| | - Tomoaki Nakamura
- Department of Diabetes and Endocrinology, Akashi Medical Center, 743-33 Okubo-cho Yagi, Akashi, Hyogo, 674-0063, Japan
| | - Tomoko Yamada
- Division of Diabetes and Endocrinology, Department of Internal Medicine, Kobe University Graduate School of Medicine, 7-5-1 Kusunoki-cho, Chuo-ku, Kobe, Hyogo, 650-0017, Japan
| | - Hiroshi Miura
- Department of Diabetes and Endocrinology, Takatsuki General Hospital, 1-3-13 Kosobe-cho, Takatsuki, Osaka, 569-1192, Japan
| | - Natsu Otowa-Suematsu
- Division of Diabetes and Endocrinology, Department of Internal Medicine, Kobe University Graduate School of Medicine, 7-5-1 Kusunoki-cho, Chuo-ku, Kobe, Hyogo, 650-0017, Japan
| | - Masashi Fujii
- Department of Biological Sciences, Graduate School of Science, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-0033, Japan
- Department of Mathematical and Life Sciences, Graduate School of Integrated Sciences for Life, Hiroshima University, 1-3-1 Kagamiyama, Higashi-hiroshima City, Hiroshima, 739-8526, Japan
| | - Yushi Hirota
- Division of Diabetes and Endocrinology, Department of Internal Medicine, Kobe University Graduate School of Medicine, 7-5-1 Kusunoki-cho, Chuo-ku, Kobe, Hyogo, 650-0017, Japan
| | - Kazuhiko Sakaguchi
- Division of Diabetes and Endocrinology, Department of Internal Medicine, Kobe University Graduate School of Medicine, 7-5-1 Kusunoki-cho, Chuo-ku, Kobe, Hyogo, 650-0017, Japan
| | - Wataru Ogawa
- Division of Diabetes and Endocrinology, Department of Internal Medicine, Kobe University Graduate School of Medicine, 7-5-1 Kusunoki-cho, Chuo-ku, Kobe, Hyogo, 650-0017, Japan.
| | - Shinya Kuroda
- Department of Biochemistry and Molecular Biology, Graduate School of Medicine, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-0033, Japan.
- Department of Biological Sciences, Graduate School of Science, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-0033, Japan.
| |
Collapse
|
5
|
You F, Zhang H, Meng L, Li C, Yang Y, Wang Y, Zhao R, Chao L. Mechanistic investigation of Shuanghuanglian against infectious bronchitis in chickens: a network pharmacology and molecular dynamics study. Front Vet Sci 2025; 12:1557850. [PMID: 40144526 PMCID: PMC11936991 DOI: 10.3389/fvets.2025.1557850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2025] [Accepted: 02/24/2025] [Indexed: 03/28/2025] Open
Abstract
Introduction Infectious bronchitis (IB) poses a major challenge to global poultry production, causing substantial economic burdens and underscoring the necessity for novel therapeutic interventions given the limitations of current vaccines and conventional antiviral agents. The purpose of this study is to comprehensively explore the active components in Shuanghuanglian and their interaction with the key pathological targets of IBV (Infectious bronchitis virus) infection. By using advanced computational methods, this study aims not only to identify the therapeutic potential of active ingredients, but also to reveal their mechanism of action against IBV. Methods Through integrative systems pharmacology approaches, we systematically investigated Shuanghuanglian and its phytochemical constituents against IB, employing multi-omics analysis, ensemble machine learning, and all-atom molecular dynamics (MD) simulations. Network pharmacology revealed 65 target genes associated with Shuanghuanglian's primary bioactive components (quercetin, kaempferol, wogonin, and luteolin), exhibiting high network centrality. Results Using the TCMSP database, we found 65 target genes associated with key active components, such as quercetin and kaempferol, which exhibited strong connectivity in our network analysis. The GeneCards database also identified 40 common target genes shared by Shuanghuanglian and IB. Importantly, BCL2 and IL6 were recognized as key targets in the protein-protein interaction (PPI) network analysis, highlighting their roles in apoptosis and inflammation. Furthermore, analyses using Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways revealed significant roles in regulating the cell cycle and inflammatory responses. Machine learning techniques identified BCL2 and IL6 as critical genes for therapeutic intervention, supported by molecular docking results that showed strong binding energies. Furthermore, molecular dynamics simulations confirm the stability of the complexes, underscoring the importance of these interactions for treatment efficacy. Conclusion We used a variety of analytical methods, and finally identified the potential active ingredients of Shuanghuanglian as kaempferol, quercetin, wogonin, and luteolin. The active ingredients target BCL2 and IL6 and play a therapeutic role in avian infectious bronchitis by inhibiting apoptosis and reducing inflammatory response.
Collapse
Affiliation(s)
- Fuming You
- College of Animal Science and Technology, Inner Mongolia MINZU University, Tongliao, China
| | - Hanzhao Zhang
- College of Animal Science and Technology, Inner Mongolia MINZU University, Tongliao, China
| | - Linghao Meng
- College of Animal Science and Technology, Inner Mongolia MINZU University, Tongliao, China
| | - Chuanhong Li
- College of Computer Science and Technology, Inner Mongolia MINZU University, Tongliao, China
| | - Yuxia Yang
- College of Computer Science and Technology, Inner Mongolia MINZU University, Tongliao, China
| | | | - Rigetu Zhao
- Chifeng Academy of Agricultural and Animal Husbandry Sciences, Chifeng, China
| | - Luomeng Chao
- College of Animal Science and Technology, Inner Mongolia MINZU University, Tongliao, China
- Inner Mongolia Rambo Testing Technology Limited Company, Tongliao, China
- Inner Mongolia Engineering Technology Research Center for Prevention and Control of Beef Cattle Diseases, Tongliao, China
| |
Collapse
|
6
|
Zhang B, Chen L, Li T. Unveiling the effect of urinary xenoestrogens on chronic kidney disease in adults: A machine learning model. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2025; 292:117945. [PMID: 39987685 DOI: 10.1016/j.ecoenv.2025.117945] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2024] [Revised: 02/07/2025] [Accepted: 02/20/2025] [Indexed: 02/25/2025]
Abstract
Exposure to three primary xenoestrogens (XEs), including phthalates, parabens, and phenols, has been strongly associated with chronic kidney disease (CKD). An interpretable machine learning (ML) model was developed to predict CKD using data from the National Health and Nutrition Examination Survey (NHANES) database spanning from 2007 to 2016. Four ML algorithms-random forest classifier (RF), XGBoost (XGB), k-nearest neighbors (KNN), and support vector machine (SVM)-were used alongside traditional logistic regression to predict CKD. The study included 6910 U.S. adults, with XGB showing the highest predictive accuracy, achieving an area under the curve (AUC) of 0.817 (95 % CI: 0.789, 0.844). The selected model was interpreted using Shapley additive explanations (SHAP) and partial dependence plot (PDP). The SHAP method identified key predictive features for CKD in urinary metabolites of XEs-methyl paraben (MeP), mono-(carboxynonyl) phthalate (MCNP), and triclosan (TCS)-and suggested personalized CKD care should focus on XE control. PDP results confirmed that, within certain ranges, MeP levels positively impacted the model, MCNP levels negatively impacted it, and TCS had a mixed effect. The synergistic effects suggested that managing urinary MeP levels could be essential for the effective control of CKD. In summary, our research highlights the significant predictive potential of XEs for CKD, especially MeP, MCNP, and TCS.
Collapse
Affiliation(s)
- Bowen Zhang
- Department of Anesthesiology, National Clinical Research Center for Geriatrics, West China Hospital, Sichuan University, Chengdu 610041, China; Laboratory of Mitochondrial Metabolism and Perioperative Medicine, National-Local Joint Engineering Research Centre of Translational Medicine of Anesthesiology, West China Hospital, Sichuan University, Chengdu 610041, China; Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Liang Chen
- Department of Ophthalmology, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Tao Li
- Department of Anesthesiology, National Clinical Research Center for Geriatrics, West China Hospital, Sichuan University, Chengdu 610041, China; Laboratory of Mitochondrial Metabolism and Perioperative Medicine, National-Local Joint Engineering Research Centre of Translational Medicine of Anesthesiology, West China Hospital, Sichuan University, Chengdu 610041, China; Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610041, China.
| |
Collapse
|
7
|
Yang P, Yang B. Development and validation of predictive models for diabetic retinopathy using machine learning. PLoS One 2025; 20:e0318226. [PMID: 39992900 PMCID: PMC11849896 DOI: 10.1371/journal.pone.0318226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2024] [Accepted: 01/13/2025] [Indexed: 02/26/2025] Open
Abstract
OBJECTIVE This study aimed to develop and compare machine learning models for predicting diabetic retinopathy (DR) using clinical and biochemical data, specifically logistic regression, random forest, XGBoost, and neural networks. METHODS A dataset of 3,000 diabetic patients, including 1,500 with DR, was obtained from the National Population Health Science Data Center. Significant predictors were identified, and four predictive models were developed. Model performance was assessed using accuracy, precision, recall, F1-score, and area under the curve (AUC). RESULTS Random forest and XGBoost demonstrated superior performance, achieving accuracies of 95.67% and 94.67%, respectively, with AUC values of 0.991 and 0.989. Logistic regression yielded an accuracy of 76.50% (AUC: 0.828), while neural networks achieved 82.67% accuracy (AUC: 0.927). Key predictors included 24-hour urinary microalbumin, HbA1c, and serum creatinine. CONCLUSION The study highlights random forest and XGBoost as effective tools for early DR detection, emphasizing the importance of renal and glycemic markers in risk assessment. These findings support the integration of machine learning models into clinical decision-making for improved patient outcomes in diabetes management.
Collapse
Affiliation(s)
- Penglu Yang
- The First Clinical School & Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Bin Yang
- Health Management Center, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
| |
Collapse
|
8
|
Cao JY, Zhang LX, Zhou XJ. Construction and Verification of a Frailty Risk Prediction Model for Elderly Patients with Coronary Heart Disease Based on a Machine Learning Algorithm. Rev Cardiovasc Med 2025; 26:26225. [PMID: 40026519 PMCID: PMC11868882 DOI: 10.31083/rcm26225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2024] [Revised: 10/31/2024] [Accepted: 11/12/2024] [Indexed: 03/05/2025] Open
Abstract
Background This study aimed to develop a machine learning-based predictive model for assessing frailty risk among elderly patients with coronary heart disease (CHD). Methods From November 2020 to May 2023, a cohort of 1170 elderly patients diagnosed with CHD were enrolled from the Department of Cardiology of a tier-3 hospital in Anhui Province, China. Participants were randomly divided into a development group and a validation group, each containing 585 patients in a 1:1 ratio. Least absolute shrinkage and selection operator (LASSO) regression was employed in the development group to identify key variables influencing frailty among patients with CHD. These variables informed the creation of a machine learning prediction model, with the most accurate model selected. Predictive accuracy was subsequently evaluated in the validation group through receiver operating characteristic (ROC) curve analysis. Results LASSO regression identified the activities of daily living (ADL) score, hemoglobin, low-density lipoprotein cholesterol (LDL-C), total cholesterol (TC), depression, cardiac function classification, cerebrovascular disease, diabetes, solitary living, and age as significant predictors of frailty among elderly patients with CHD in the development group. These variables were incorporated into a logistic regression model and four machine learning models: extreme gradient boosting (XGBoost), random forest (RF), light gradient boosting machine (LightGBM), and adaptive boosting (AdaBoost). AdaBoost demonstrated the highest accuracy in the development group, achieving an area under the ROC curve (AUC) of 0.803 in the validation group, indicating strong predictive capability. Conclusions By leveraging key frailty determinants in elderly patients with CHD, the AdaBoost machine learning model developed in this study has shown robust predictive performance through validated indicators and offers a reliable tool for assessing frailty risk in this patient population.
Collapse
Affiliation(s)
- Jiao-yu Cao
- Department of Cardiology, The First Affiliated Hospital of USTC, Division of Life Science and Medicine, University of Science and Technology of China, 230001 Hefei, Anhui, China
| | - Li-xiang Zhang
- Department of Cardiology, The First Affiliated Hospital of USTC, Division of Life Science and Medicine, University of Science and Technology of China, 230001 Hefei, Anhui, China
| | - Xiao-juan Zhou
- Department of Cardiology, The First Affiliated Hospital of USTC, Division of Life Science and Medicine, University of Science and Technology of China, 230001 Hefei, Anhui, China
| |
Collapse
|
9
|
Du X, Liu P, Xiang D, Zhang C, Du J, Jin H, Liao Y. Nomogram Based on HRV for Predicting the Therapeutic Effects of Orthostatic Training in Children with Vasovagal Syncope. CHILDREN (BASEL, SWITZERLAND) 2024; 11:1467. [PMID: 39767896 PMCID: PMC11674548 DOI: 10.3390/children11121467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/12/2024] [Revised: 11/21/2024] [Accepted: 11/27/2024] [Indexed: 01/11/2025]
Abstract
BACKGROUND This study intended to find out whether the parameters of heart rate variability (HRV) can predict the treatment efficacy of orthostatic training among pediatric cases of vasovagal syncope (VVS). METHODS Patients with VVS who underwent orthostatic training were retrospectively enrolled. Lasso and logistic regression were used to sift through variables and build the model, which is visualized using a nomogram. The model's performance was evaluated through calibration plots, a receiver operating characteristic (ROC) curve, and decision curve analysis (DCA) for both datasets. RESULTS In total, 119 participants were included in the analysis, and 73 and 46 were assigned to the training and validation datasets, respectively. Five factors with nonzero coefficients were chosen based on lasso regression: age, the root means square of successive differences between normal sinus beats (rMSSD), standard deviation of the averages normal-to-normal intervals in all 5-min segments, minimum heart rate, and high frequency. Drawing from the logistic regression analysis results, the visual predictive model incorporated two variables, namely age and rMSSD. For the training dataset, the sensitivity was 0.686 and the specificity was 0.868 with an area under the curve (AUC) of 0.81 (95% CI, 0.71-0.91) for the ROC curve. For the validation dataset, the AUC of the ROC was 0.80 (95% CI, 0.66-0.93), while sensitivity and specificity were recorded at 0.625 and 0.909, respectively. In the calibration plots for both datasets, the predicted probabilities correlated well with the actual probabilities. According to the DCA, the visual predictive model gained a significant net benefit across a wide threshold range. CONCLUSIONS Pediatric patients with VVS can benefit from orthostatic training using a visual predictive model comprising age and rMSSD.
Collapse
Affiliation(s)
- Xiaojuan Du
- Department of Pediatrics, Peking University First Hospital, Beijing 100034, China; (X.D.); (P.L.); (D.X.); (C.Z.); (J.D.); (H.J.)
| | - Ping Liu
- Department of Pediatrics, Peking University First Hospital, Beijing 100034, China; (X.D.); (P.L.); (D.X.); (C.Z.); (J.D.); (H.J.)
| | - Dandan Xiang
- Department of Pediatrics, Peking University First Hospital, Beijing 100034, China; (X.D.); (P.L.); (D.X.); (C.Z.); (J.D.); (H.J.)
| | - Chunyu Zhang
- Department of Pediatrics, Peking University First Hospital, Beijing 100034, China; (X.D.); (P.L.); (D.X.); (C.Z.); (J.D.); (H.J.)
| | - Junbao Du
- Department of Pediatrics, Peking University First Hospital, Beijing 100034, China; (X.D.); (P.L.); (D.X.); (C.Z.); (J.D.); (H.J.)
- State Key Laboratory of Vascular Homeostasis and Remodeling, Peking University, Beijing 100191, China
| | - Hongfang Jin
- Department of Pediatrics, Peking University First Hospital, Beijing 100034, China; (X.D.); (P.L.); (D.X.); (C.Z.); (J.D.); (H.J.)
| | - Ying Liao
- Department of Pediatrics, Peking University First Hospital, Beijing 100034, China; (X.D.); (P.L.); (D.X.); (C.Z.); (J.D.); (H.J.)
| |
Collapse
|
10
|
Xiao H, Liang X, Li H, Chen X, Li Y. Trends in the prevalence of osteoporosis and effects of heavy metal exposure using interpretable machine learning. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2024; 286:117238. [PMID: 39490102 DOI: 10.1016/j.ecoenv.2024.117238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Revised: 09/30/2024] [Accepted: 10/19/2024] [Indexed: 11/05/2024]
Abstract
There is limited evidence that heavy metals exposure contributes to osteoporosis. Multi-parameter scoring machine learning (ML) techniques were developed using National Health and Nutrition Examination Survey data to predict osteoporosis based on heavy metal exposure levels. For generating an optimal predictive model for osteoporosis, 12 ML models were used. Identification was carried out using the model that performed the best. For interpretation of models, Shapley additive explanation (SHAP) methods and partial dependence plots (PDP) were integrated into a pipeline and incorporated into the ML pipeline. By regressing osteoporosis on survey cycles, logistic regression was used to evaluate linear trends in osteoporosis over time. For the purpose of training and validating predictive models, 5745 eligible participants were randomly selected into training and testing set. It was evident from the results that the gradient boosting decision tree model performed the best among the predictive models, attributing to an accuracy rate of 89.40 % in the testing set. Based on the model results, the area under the curve and F1 score were 0.88 and 0.39, respectively. As a result of the SHAP analysis, urinary Co, urinary Tu, blood Cd, and urinary Hg levels were identified as the most influential factors influencing osteoporosis. Urinary Co (0.20-6.10 μg/mg creatinine), urinary Tu (0.06-1.93 μg/mg creatinine), blood Cd (0.07-0.50 μg/L), and urinary Hg (0.06-0.75 μg/mg creatinine) levels displayed a distinctive upward trend with risk of osteoporosis as values increased. Our analysis revealed that urinary Co, urinary Tu, blood Cd, and urinary Hg played a significant role in predictability.
Collapse
Affiliation(s)
- Hewei Xiao
- Department of Scientific Research, Guangxi Academy of Medical Sciences and the People's Hospital of Guangxi Zhuang Autonomous Region, Nanning, Guangxi, China
| | - Xueyan Liang
- Phase 1 Clinical Trial Laboratory, Guangxi Academy of Medical Sciences and the People's Hospital of Guangxi Zhuang Autonomous Region, Nanning, Guangxi, China
| | - Huijuan Li
- Phase 1 Clinical Trial Laboratory, Guangxi Academy of Medical Sciences and the People's Hospital of Guangxi Zhuang Autonomous Region, Nanning, Guangxi, China; Department of Pharmacy, Guangxi Academy of Medical Sciences and the People's Hospital of Guangxi Zhuang Autonomous Region, Nanning, Guangxi, China
| | - Xiaoyu Chen
- Phase 1 Clinical Trial Laboratory, Guangxi Academy of Medical Sciences and the People's Hospital of Guangxi Zhuang Autonomous Region, Nanning, Guangxi, China; Department of Pharmacy, Guangxi Academy of Medical Sciences and the People's Hospital of Guangxi Zhuang Autonomous Region, Nanning, Guangxi, China.
| | - Yan Li
- Department of Pharmacy, Guangxi Academy of Medical Sciences and the People's Hospital of Guangxi Zhuang Autonomous Region, Nanning, Guangxi, China.
| |
Collapse
|
11
|
Deng Y, Yi S, Liu W, Yang L, Zhu L, Zhang Q, Jin H, Yang R, Wang R, Tang NJ. Identification of Primary Organophosphate Esters Contributing to Enhanced Risk of Gestational Diabetes Mellitus Based on a Case-Control Study. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2024; 58:17532-17542. [PMID: 39315849 DOI: 10.1021/acs.est.4c04180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/25/2024]
Abstract
Epidemiological studies on associations of organophosphate ester (OPE) exposure and gestational diabetes mellitus (GDM) risk, which remain rare and inconclusive, were carried out with a case-control population comprising 287 GDM and 313 non-GDM pregnant women recruited from Tianjin. The GDM group suffered distinctly higher serum concentrations of tri-n-butyl phosphate (TNBP), tri(2-butoxyethyl) phosphate (TBOEP), triphenyl phosphate (TPHP), tri-iso-propyl phosphate (TIPP), and tri(1-chloro-2-propyl) phosphate (TCIPP) than the healthy control group (p < 0.001). Traditional analysis methods employed for either individual or mixture effects found positive correlations (p < 0.05) between the concentrations of five OPEs (i.e., TNBP, TBOEP, TPHP, TIPP, and TCIPP) and the incidence of GDM, while 2-ethylhexyl diphenyl phosphate, tri(1-chloro-2-propyl) phosphate, and bis(2-ethylhexyl) phosphate exhibited opposite effects. Three machine learning methods considering the concurrence of OPE mixture exposure and population characteristics were applied to clarify their relative importance to GDM risk, among which random forest performed the best. Several OPEs, particularly TNBP and TBOEP ranking at the top, made greater contributions than some demographical characteristics, such as prepregnancy body mass index and family history of diabetes, to the occurrence of GDM. This was further validated by another independent case-control population obtained from Hangzhou.
Collapse
Affiliation(s)
- Yun Deng
- Key Laboratory of Pollution Processes and Environmental Criteria, Ministry of Education, Tianjin Key Laboratory of Environmental Remediation and Pollution Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, PR China
| | - Shujun Yi
- Key Laboratory of Pollution Processes and Environmental Criteria, Ministry of Education, Tianjin Key Laboratory of Environmental Remediation and Pollution Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, PR China
| | - Wenya Liu
- Key Laboratory of Pollution Processes and Environmental Criteria, Ministry of Education, Tianjin Key Laboratory of Environmental Remediation and Pollution Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, PR China
| | - Liping Yang
- Key Laboratory of Pollution Processes and Environmental Criteria, Ministry of Education, Tianjin Key Laboratory of Environmental Remediation and Pollution Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, PR China
| | - Lingyan Zhu
- Key Laboratory of Pollution Processes and Environmental Criteria, Ministry of Education, Tianjin Key Laboratory of Environmental Remediation and Pollution Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, PR China
| | - Qiang Zhang
- Department of Occupational and Environmental Health, School of Public Health, Tianjin Medical University, Tianjin 300070, PR China
| | - Hangbiao Jin
- Key Laboratory of Microbial Technology for Industrial Pollution Control of Zhejiang Province, College of Environment, Zhejiang University of Technology, Hangzhou 310032, Zhejiang, PR China
| | - Rongyan Yang
- Key Laboratory of Pollution Processes and Environmental Criteria, Ministry of Education, Tianjin Key Laboratory of Environmental Remediation and Pollution Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, PR China
| | - Rouyi Wang
- Key Laboratory of Pollution Processes and Environmental Criteria, Ministry of Education, Tianjin Key Laboratory of Environmental Remediation and Pollution Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, PR China
| | - Nai-Jun Tang
- Department of Occupational and Environmental Health, School of Public Health, Tianjin Medical University, Tianjin 300070, PR China
| |
Collapse
|
12
|
He S, Zhang X, Wang Z, Zhang Q, Yao Y, Pang J, Chen Y. Classification and functional analysis of disulfidptosis-associated genes in sepsis. J Cell Mol Med 2024; 28:e70020. [PMID: 39400961 PMCID: PMC11472650 DOI: 10.1111/jcmm.70020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2024] [Revised: 07/26/2024] [Accepted: 08/07/2024] [Indexed: 10/15/2024] Open
Abstract
Sepsis represents a critical condition characterized by multiple-organ dysfunction resulting from inflammatory response to infection. Disulfidptosis is a newly identified type of programmed cell death that is intimately associated with the actin cytoskeleton collapse caused by glucose starvation and disulfide stress, but its role in sepsis is largely unknown. The study was to adopt a diagnostic and prognostic signature for sepsis with disulfidptosis based on the differentially expressed genes (DEGs) between sepsis and healthy people from GEO database. The disulfidptosis hub genes associated with sepsis were identified, and then developed consensus clustering and immune infiltration characteristics. Next, we evaluated disulfidptosis-related risk genes by using LASSO and Random Forest algorithms, and constructed the diagnostic sepsis model by nomogram. Finally, immune infiltration, GSVA analysis and mRNA-miRNA networks based on disulfidptosis-related DEGs were screened. There are five upregulated disulfidptosis-related genes and seven downregulated genes were filtered out. The six intersection disulfidptosis-related genes including LRPPRC, SLC7A11, GLUT, MYH9, NUBPL and GYS1 exhibited higher predictive ability for sepsis with an accuracy of 99.7%. In addition, the expression patterns of the critical genes were validated. The study provided a comprehensive view of disulfidptosis-based signatures to predict the prognosis, biological features and potential treatment directions for sepsis.
Collapse
Affiliation(s)
- Simeng He
- Department of Emergency MedicineQilu Hospital of Shandong UniversityJinanChina
- Shandong Provincial Clinical Research Center for Emergency and Critical Care Medicine, Institute of Emergency and Critical Care Medicine of Shandong University, Chest Pain CenterQilu Hospital of Shandong UniversityJinanChina
- Key Laboratory of Emergency and Critical Care Medicine of Shandong Province, Key Laboratory of Cardiopulmonary‐Cerebral Resuscitation Research of Shandong Province, Shandong Provincial Engineering Laboratory for Emergency and Critical Care Medicine, Shandong Key Laboratory: Magnetic Field‐free Medicine and Functional ImagingQilu Hospital of Shandong UniversityJinanChina
- NMPA Key Laboratory for Clinical Research and Evaluation of Innovative DrugQilu Hospital of Shandong UniversityJinanChina
- The Key Laboratory of Cardiovascular Remodeling and Function Research, Chinese Ministry of Education, Chinese Ministry of Health and Chinese Academy of Medical Sciences, The State and Shandong Province Joint Key Laboratory of Translational Cardiovascular MedicineQilu Hospital of Shandong UniversityJinanChina
| | - Xiangxin Zhang
- Department of Emergency MedicineQilu Hospital of Shandong UniversityJinanChina
- Shandong Provincial Clinical Research Center for Emergency and Critical Care Medicine, Institute of Emergency and Critical Care Medicine of Shandong University, Chest Pain CenterQilu Hospital of Shandong UniversityJinanChina
- Key Laboratory of Emergency and Critical Care Medicine of Shandong Province, Key Laboratory of Cardiopulmonary‐Cerebral Resuscitation Research of Shandong Province, Shandong Provincial Engineering Laboratory for Emergency and Critical Care Medicine, Shandong Key Laboratory: Magnetic Field‐free Medicine and Functional ImagingQilu Hospital of Shandong UniversityJinanChina
- NMPA Key Laboratory for Clinical Research and Evaluation of Innovative DrugQilu Hospital of Shandong UniversityJinanChina
- The Key Laboratory of Cardiovascular Remodeling and Function Research, Chinese Ministry of Education, Chinese Ministry of Health and Chinese Academy of Medical Sciences, The State and Shandong Province Joint Key Laboratory of Translational Cardiovascular MedicineQilu Hospital of Shandong UniversityJinanChina
| | - Zichen Wang
- Department of Emergency MedicineQilu Hospital of Shandong UniversityJinanChina
- Shandong Provincial Clinical Research Center for Emergency and Critical Care Medicine, Institute of Emergency and Critical Care Medicine of Shandong University, Chest Pain CenterQilu Hospital of Shandong UniversityJinanChina
- Key Laboratory of Emergency and Critical Care Medicine of Shandong Province, Key Laboratory of Cardiopulmonary‐Cerebral Resuscitation Research of Shandong Province, Shandong Provincial Engineering Laboratory for Emergency and Critical Care Medicine, Shandong Key Laboratory: Magnetic Field‐free Medicine and Functional ImagingQilu Hospital of Shandong UniversityJinanChina
- NMPA Key Laboratory for Clinical Research and Evaluation of Innovative DrugQilu Hospital of Shandong UniversityJinanChina
- The Key Laboratory of Cardiovascular Remodeling and Function Research, Chinese Ministry of Education, Chinese Ministry of Health and Chinese Academy of Medical Sciences, The State and Shandong Province Joint Key Laboratory of Translational Cardiovascular MedicineQilu Hospital of Shandong UniversityJinanChina
| | - Qingju Zhang
- Department of Emergency MedicineQilu Hospital of Shandong UniversityJinanChina
- Shandong Provincial Clinical Research Center for Emergency and Critical Care Medicine, Institute of Emergency and Critical Care Medicine of Shandong University, Chest Pain CenterQilu Hospital of Shandong UniversityJinanChina
- Key Laboratory of Emergency and Critical Care Medicine of Shandong Province, Key Laboratory of Cardiopulmonary‐Cerebral Resuscitation Research of Shandong Province, Shandong Provincial Engineering Laboratory for Emergency and Critical Care Medicine, Shandong Key Laboratory: Magnetic Field‐free Medicine and Functional ImagingQilu Hospital of Shandong UniversityJinanChina
- NMPA Key Laboratory for Clinical Research and Evaluation of Innovative DrugQilu Hospital of Shandong UniversityJinanChina
- The Key Laboratory of Cardiovascular Remodeling and Function Research, Chinese Ministry of Education, Chinese Ministry of Health and Chinese Academy of Medical Sciences, The State and Shandong Province Joint Key Laboratory of Translational Cardiovascular MedicineQilu Hospital of Shandong UniversityJinanChina
| | - Yu Yao
- Department of Emergency MedicineQilu Hospital of Shandong UniversityJinanChina
- Shandong Provincial Clinical Research Center for Emergency and Critical Care Medicine, Institute of Emergency and Critical Care Medicine of Shandong University, Chest Pain CenterQilu Hospital of Shandong UniversityJinanChina
- Key Laboratory of Emergency and Critical Care Medicine of Shandong Province, Key Laboratory of Cardiopulmonary‐Cerebral Resuscitation Research of Shandong Province, Shandong Provincial Engineering Laboratory for Emergency and Critical Care Medicine, Shandong Key Laboratory: Magnetic Field‐free Medicine and Functional ImagingQilu Hospital of Shandong UniversityJinanChina
- NMPA Key Laboratory for Clinical Research and Evaluation of Innovative DrugQilu Hospital of Shandong UniversityJinanChina
- The Key Laboratory of Cardiovascular Remodeling and Function Research, Chinese Ministry of Education, Chinese Ministry of Health and Chinese Academy of Medical Sciences, The State and Shandong Province Joint Key Laboratory of Translational Cardiovascular MedicineQilu Hospital of Shandong UniversityJinanChina
| | - Jiaojiao Pang
- Department of Emergency MedicineQilu Hospital of Shandong UniversityJinanChina
- Shandong Provincial Clinical Research Center for Emergency and Critical Care Medicine, Institute of Emergency and Critical Care Medicine of Shandong University, Chest Pain CenterQilu Hospital of Shandong UniversityJinanChina
- Key Laboratory of Emergency and Critical Care Medicine of Shandong Province, Key Laboratory of Cardiopulmonary‐Cerebral Resuscitation Research of Shandong Province, Shandong Provincial Engineering Laboratory for Emergency and Critical Care Medicine, Shandong Key Laboratory: Magnetic Field‐free Medicine and Functional ImagingQilu Hospital of Shandong UniversityJinanChina
- NMPA Key Laboratory for Clinical Research and Evaluation of Innovative DrugQilu Hospital of Shandong UniversityJinanChina
- The Key Laboratory of Cardiovascular Remodeling and Function Research, Chinese Ministry of Education, Chinese Ministry of Health and Chinese Academy of Medical Sciences, The State and Shandong Province Joint Key Laboratory of Translational Cardiovascular MedicineQilu Hospital of Shandong UniversityJinanChina
| | - Yuguo Chen
- Department of Emergency MedicineQilu Hospital of Shandong UniversityJinanChina
- Shandong Provincial Clinical Research Center for Emergency and Critical Care Medicine, Institute of Emergency and Critical Care Medicine of Shandong University, Chest Pain CenterQilu Hospital of Shandong UniversityJinanChina
- Key Laboratory of Emergency and Critical Care Medicine of Shandong Province, Key Laboratory of Cardiopulmonary‐Cerebral Resuscitation Research of Shandong Province, Shandong Provincial Engineering Laboratory for Emergency and Critical Care Medicine, Shandong Key Laboratory: Magnetic Field‐free Medicine and Functional ImagingQilu Hospital of Shandong UniversityJinanChina
- NMPA Key Laboratory for Clinical Research and Evaluation of Innovative DrugQilu Hospital of Shandong UniversityJinanChina
- The Key Laboratory of Cardiovascular Remodeling and Function Research, Chinese Ministry of Education, Chinese Ministry of Health and Chinese Academy of Medical Sciences, The State and Shandong Province Joint Key Laboratory of Translational Cardiovascular MedicineQilu Hospital of Shandong UniversityJinanChina
| |
Collapse
|
13
|
Liu L, Zhou H, Wang X, Wen F, Zhang G, Yu J, Shen H, Huang R. Effects of environmental phenols on eGFR: machine learning modeling methods applied to cross-sectional studies. Front Public Health 2024; 12:1405533. [PMID: 39148651 PMCID: PMC11324456 DOI: 10.3389/fpubh.2024.1405533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2024] [Accepted: 07/26/2024] [Indexed: 08/17/2024] Open
Abstract
Purpose Limited investigation is available on the correlation between environmental phenols' exposure and estimated glomerular filtration rate (eGFR). Our target is established a robust and explainable machine learning (ML) model that associates environmental phenols' exposure with eGFR. Methods Our datasets for constructing the associations between environmental phenols' and eGFR were collected from the National Health and Nutrition Examination Survey (NHANES, 2013-2016). Five ML models were contained and fine-tuned to eGFR regression by phenols' exposure. Regression evaluation metrics were used to extract the limitation of the models. The most effective model was then utilized for regression, with interpretation of its features carried out using shapley additive explanations (SHAP) and the game theory python package to represent the model's regression capacity. Results The study identified the top-performing random forest (RF) regressor with a mean absolute error of 0.621 and a coefficient of determination of 0.998 among 3,371 participants. Six environmental phenols with eGFR in linear regression models revealed that the concentrations of triclosan (TCS) and bisphenol S (BPS) in urine were positively correlated with eGFR, and the correlation coefficients were β = 0.010 (p = 0.026) and β = 0.007 (p = 0.004) respectively. SHAP values indicate that BPS (1.38), bisphenol F (BPF) (0.97), 2,5-dichlorophenol (0.87), TCS (0.78), BP3 (0.60), bisphenol A (BPA) (0.59) and 2,4-dichlorophenol (0.47) in urinary contributed to the model. Conclusion The RF model was efficient in identifying a correlation between phenols' exposure and eGFR among United States NHANES 2013-2016 participants. The findings indicate that BPA, BPF, and BPS are inversely associated with eGFR.
Collapse
Affiliation(s)
- Lei Liu
- Department of Pathology, Affiliated Hospital of Nantong University, Nantong, China
| | - Hao Zhou
- Department of Thoracic Surgery, Affiliated Hospital of Nantong University, Nantong, China
| | - Xueli Wang
- Department of Pathology, Qingdao Eighth People's Hospital, Qingdao, China
| | - Fukang Wen
- Institute of Computer Science and Engineering, Sun Yat-Sen University, Guangzhou, China
| | - Guibin Zhang
- College of Electronic and Information Engineering, Tongji University, Shanghai, China
| | - Jinao Yu
- Institute of Computer Science and Engineering, University of Wisconsin-Madison, Madison, WI, United States
| | - Hui Shen
- Department of Computer Science and Engineering, The Ohio State University, Columbus, OH, United States
| | - Rongrong Huang
- Department of Pharmacy, Affiliated Hospital of Nantong University, Nantong, China
| |
Collapse
|
14
|
Liu J, Wang L, Shen B, Gong Y, Guo X, Shen Q, Yang M, Dong Y, Liu Y, Chen H, Yang Z, Liu Y, Zhu X, Ma H, Jin G, Qian Y. Association of serum metal levels with type 2 diabetes: A prospective cohort and mediating effects of metabolites analysis in Chinese population. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2024; 279:116470. [PMID: 38772147 DOI: 10.1016/j.ecoenv.2024.116470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 05/13/2024] [Accepted: 05/14/2024] [Indexed: 05/23/2024]
Abstract
Several studies have suggested an association between exposure to various metals and the onset of type 2 diabetes (T2D). However, the results vary across different studies. We aimed to investigate the associations between serum metal concentrations and the risk of developing T2D among 8734 participants using a prospective cohort study design. We utilized inductively coupled plasmamass spectrometry (ICP-MS) to assess the serum concentrations of 27 metals. Cox regression was applied to calculate the hazard ratios (HRs) for the associations between serum metal concentrations on the risk of developing T2D. Additionally, 196 incident T2D cases and 208 healthy control participants were randomly selected for serum metabolite measurement using an untargeted metabolomics approach to evaluate the mediating role of serum metabolite in the relationship between serum metal concentrations and the risk of developing T2D with a nested casecontrol study design. In the cohort study, after Bonferroni correction, the serum concentrations of zinc (Zn), mercury (Hg), and thallium (Tl) were positively associated with the risk of developing T2D, whereas the serum concentrations of manganese (Mn), molybdenum (Mo), barium (Ba), lutetium (Lu), and lead (Pb) were negatively associated with the risk of developing T2D. After adding these eight metals, the predictive ability increased significantly compared with that of the traditional clinical model (AUC: 0.791 vs. 0.772, P=8.85×10-5). In the nested casecontrol study, a machine learning analysis revealed that the serum concentrations of 14 out of 1579 detected metabolites were associated with the risk of developing T2D. According to generalized linear regression models, 7 of these metabolites were significantly associated with the serum concentrations of the identified metals. The mediation analysis showed that two metabolites (2-methyl-1,2-dihydrophthalazin-1-one and mestranol) mediated 46.81% and 58.70%, respectively, of the association between the serum Pb concentration and the risk of developing T2D. Our study suggested that serum Mn, Zn, Mo, Ba, Lu, Hg, Tl, and Pb were associated with T2D risk. Two metabolites mediated the associations between the serum Pb concentration and the risk of developing T2D.
Collapse
Affiliation(s)
- Jia Liu
- Department of Chronic Non-Communicable Disease Control, The Affiliated Wuxi Center for Disease Control and Prevention of Nanjing Medical University (Wuxi Center for Disease Control and Prevention), Wuxi, Jiangsu 214023, China
| | - Lu Wang
- Department of Chronic Non-Communicable Disease Control, The Affiliated Wuxi Center for Disease Control and Prevention of Nanjing Medical University (Wuxi Center for Disease Control and Prevention), Wuxi, Jiangsu 214023, China
| | - Bohui Shen
- Department of Chronic Non-Communicable Disease Control, The Affiliated Wuxi Center for Disease Control and Prevention of Nanjing Medical University (Wuxi Center for Disease Control and Prevention), Wuxi, Jiangsu 214023, China
| | - Yan Gong
- Department of Chronic Non-Communicable Disease Control, The Affiliated Wuxi Center for Disease Control and Prevention of Nanjing Medical University (Wuxi Center for Disease Control and Prevention), Wuxi, Jiangsu 214023, China
| | - Xiangxin Guo
- Department of Epidemiology, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China
| | - Qian Shen
- Department of Chronic Non-Communicable Disease Control, The Affiliated Wuxi Center for Disease Control and Prevention of Nanjing Medical University (Wuxi Center for Disease Control and Prevention), Wuxi, Jiangsu 214023, China
| | - Man Yang
- Department of Chronic Non-Communicable Disease Control, The Affiliated Wuxi Center for Disease Control and Prevention of Nanjing Medical University (Wuxi Center for Disease Control and Prevention), Wuxi, Jiangsu 214023, China
| | - Yunqiu Dong
- Department of Chronic Non-Communicable Disease Control, The Affiliated Wuxi Center for Disease Control and Prevention of Nanjing Medical University (Wuxi Center for Disease Control and Prevention), Wuxi, Jiangsu 214023, China
| | - Yongchao Liu
- Department of Chronic Non-Communicable Disease Control, The Affiliated Wuxi Center for Disease Control and Prevention of Nanjing Medical University (Wuxi Center for Disease Control and Prevention), Wuxi, Jiangsu 214023, China
| | - Hai Chen
- Department of Chronic Non-Communicable Disease Control, The Affiliated Wuxi Center for Disease Control and Prevention of Nanjing Medical University (Wuxi Center for Disease Control and Prevention), Wuxi, Jiangsu 214023, China
| | - Zhijie Yang
- Department of Chronic Non-Communicable Disease Control, The Affiliated Wuxi Center for Disease Control and Prevention of Nanjing Medical University (Wuxi Center for Disease Control and Prevention), Wuxi, Jiangsu 214023, China
| | - Yaqi Liu
- Department of Chronic Non-Communicable Disease Control, The Affiliated Wuxi Center for Disease Control and Prevention of Nanjing Medical University (Wuxi Center for Disease Control and Prevention), Wuxi, Jiangsu 214023, China
| | - Xiaowei Zhu
- Department of Chronic Non-Communicable Disease Control, The Affiliated Wuxi Center for Disease Control and Prevention of Nanjing Medical University (Wuxi Center for Disease Control and Prevention), Wuxi, Jiangsu 214023, China
| | - Hongxia Ma
- Department of Epidemiology, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China
| | - Guangfu Jin
- Department of Epidemiology, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China
| | - Yun Qian
- Department of Chronic Non-Communicable Disease Control, The Affiliated Wuxi Center for Disease Control and Prevention of Nanjing Medical University (Wuxi Center for Disease Control and Prevention), Wuxi, Jiangsu 214023, China.
| |
Collapse
|
15
|
Zuo W, Yang X. A machine learning model predicts stroke associated with blood cadmium level. Sci Rep 2024; 14:14739. [PMID: 38926494 PMCID: PMC11208606 DOI: 10.1038/s41598-024-65633-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Accepted: 06/21/2024] [Indexed: 06/28/2024] Open
Abstract
Stroke is the leading cause of death and disability worldwide. Cadmium is a prevalent environmental toxicant that may contribute to cardiovascular disease, including stroke. We aimed to build an effective and interpretable machine learning (ML) model that links blood cadmium to the identification of stroke. Our data exploring the association between blood cadmium and stroke came from the National Health and Nutrition Examination Survey (NHANES, 2013-2014). In total, 2664 participants were eligible for this study. We divided these data into a training set (80%) and a test set (20%). To analyze the relationship between blood cadmium and stroke, a multivariate logistic regression analysis was performed. We constructed and tested five ML algorithms including K-nearest neighbor (KNN), decision tree (DT), logistic regression (LR), multilayer perceptron (MLP), and random forest (RF). The best-performing model was selected to identify stroke in US adults. Finally, the features were interpreted using the Shapley Additive exPlanations (SHAP) tool. In the total population, participants in the second, third, and fourth quartiles had an odds ratio of 1.32 (95% CI 0.55, 3.14), 1.65 (95% CI 0.71, 3.83), and 2.67 (95% CI 1.10, 6.49) for stroke compared with the lowest reference group for blood cadmium, respectively. This blood cadmium-based LR approach demonstrated the greatest performance in identifying stroke (area under the operator curve: 0.800, accuracy: 0.966). Employing interpretable methods, we found blood cadmium to be a notable contributor to the predictive model. We found that blood cadmium was positively correlated with stroke risk and that stroke risk from cadmium exposure could be effectively predicted by using ML modeling.
Collapse
Affiliation(s)
- Wenwei Zuo
- School of Gongli Hospital Medical Technology, University of Shanghai for Science and Technology, No. 516, Jungong Road, Yangpu Area, Shanghai, 200093, China
| | - Xuelian Yang
- Department of Neurology, Shanghai Pudong New Area Gongli Hospital, No. 219 Miaopu Road, Pudong New Area, Shanghai, 200135, China.
| |
Collapse
|
16
|
Labib SM. Greenness, air pollution, and temperature exposure effects in predicting premature mortality and morbidity: A small-area study using spatial random forest model. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 928:172387. [PMID: 38608883 DOI: 10.1016/j.scitotenv.2024.172387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Revised: 04/08/2024] [Accepted: 04/08/2024] [Indexed: 04/14/2024]
Abstract
BACKGROUND Although studies have provided negative impacts of air pollution, heat or cold exposure on mortality and morbidity, and positive effects of increased greenness on reducing them, a few studies have focused on exploring combined and synergetic effects of these exposures in predicting these health outcomes, and most had ignored the spatial autocorrelation in analyzing their health effects. This study aims to investigate the health effects of air pollution, greenness, and temperature exposure on premature mortality and morbidity within a spatial machine-learning modeling framework. METHODS Years of potential life lost reflecting premature mortality and comparative illness and disability ratio reflecting chronic morbidity from 1673 small areas covering Greater Manchester for the year 2008-2013 obtained. Average annual levels of NO2 concentration, normalized difference vegetation index (NDVI) representing greenness, and annual average air temperature were utilized to assess exposure in each area. These exposures were linked to health outcomes using non-spatial and spatial random forest (RF) models while accounting for spatial autocorrelation. RESULTS Spatial-RF models provided the best predictive accuracy when accounted for spatial autocorrelation. Among the exposures considered, air pollution emerged as the most influential in predicting mortality and morbidity, followed by NDVI and temperature exposure. Nonlinear exposure-response relations were observed, and interactions between exposures illustrated specific ranges or sweet and sour spots of exposure thresholds where combined effects either exacerbate or moderate health conditions. CONCLUSION Air pollution exposure had a greater negative impact on health compared to greenness and temperature exposure. Combined exposure effects may indicate the highest influence of premature mortality and morbidity burden.
Collapse
Affiliation(s)
- S M Labib
- Department of Human Geography and Spatial Planning, Faculty of Geosciences, Utrecht University, the Netherlands.
| |
Collapse
|
17
|
Gui Y, Gui S, Wang X, Li Y, Xu Y, Zhang J. Exploring the relationship between heavy metals and diabetic retinopathy: a machine learning modeling approach. Sci Rep 2024; 14:13049. [PMID: 38844504 PMCID: PMC11156935 DOI: 10.1038/s41598-024-63916-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Accepted: 06/03/2024] [Indexed: 06/09/2024] Open
Abstract
Diabetic retinopathy (DR) is one of the leading causes of adult blindness in the United States. Although studies applying traditional statistical methods have revealed that heavy metals may be essential environmental risk factors for diabetic retinopathy, there is a lack of analyses based on machine learning (ML) methods to adequately explain the complex relationship between heavy metals and DR and the interactions between variables. Based on characteristic variables of participants with and without DR and heavy metal exposure data obtained from the NHANES database (2003-2010), a ML model was developed for effective prediction of DR. The best predictive model for DR was selected from 11 models by receiver operating characteristic curve (ROC) analysis. Further permutation feature importance (PFI) analysis, partial dependence plots (PDP) analysis, and SHapley Additive exPlanations (SHAP) analysis were used to assess the model capability and key influencing factors. A total of 1042 eligible individuals were randomly assigned to two groups for training and testing set of the prediction model. ROC analysis showed that the k-nearest neighbour (KNN) model had the highest prediction performance, achieving close to 100% accuracy in the testing set. Urinary Sb level was identified as the critical heavy metal affecting the predicted risk of DR, with a contribution weight of 1.730632 ± 1.791722, which was much higher than that of other heavy metals and baseline variables. The results of the PDP analysis and the SHAP analysis also indicated that antimony (Sb) had a more significant effect on DR. The interaction between age and Sb was more significant compared to other variables and metal pairs. We found that Sb could serve as a potential predictor of DR and that Sb may influence the development of DR by mediating cellular and systemic senescence. The study revealed that monitoring urinary Sb levels can be useful for early non-invasive screening and intervention in DR development, and also highlighted the important role of constructed ML models in explaining the effects of heavy metal exposure on DR.
Collapse
Affiliation(s)
- Yanchao Gui
- Department of Ophthalmology, Anqing Second People's Hospital, 79 Guanyuemiao Street, Anqing, 246004, China
| | - Siyu Gui
- Department of Ophthalmology, The Second Affiliated Hospital of Anhui Medical University, 678 Furong Road, Hefei, 230601, China
| | - Xinchen Wang
- Department of Ophthalmology, The Second Affiliated Hospital of Anhui Medical University, 678 Furong Road, Hefei, 230601, China
| | - Yiran Li
- Department of Clinical Medicine, The Second School of Clinical Medicine, Anhui Medical University, 81 Meishan Road, Hefei, 230032, China
| | - Yueyang Xu
- Department of Clinical Medicine, The First School of Clinical Medicine, Anhui Medical University, 81 Meishan Road, Hefei, 230032, China
| | - Jinsong Zhang
- Department of Ophthalmology, Anqing Second People's Hospital, 79 Guanyuemiao Street, Anqing, 246004, China.
| |
Collapse
|
18
|
Wang F, Lin Y, Xu J, Wei F, Huang S, Wen S, Zhou H, Jiang Y, Wang H, Ling W, Li X, Yang X. Risk of papillary thyroid carcinoma and nodular goiter associated with exposure to semi-volatile organic compounds: A multi-pollutant assessment based on machine learning algorithms. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 915:169962. [PMID: 38219999 DOI: 10.1016/j.scitotenv.2024.169962] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 12/30/2023] [Accepted: 01/04/2024] [Indexed: 01/16/2024]
Abstract
BACKGROUND Exposure to semi-volatile organic compounds (SVOCs) may link to thyroid nodule risk, but studies of mixed-SVOCs exposure effects are lacking. Traditional analytical methods are inadequate for dealing with mixed exposures, while machine learning (ML) seems to be a good way to fill the gaps in the field of environmental epidemiology research. OBJECTIVES Different ML algorithms were used to explore the relationship between mixed-SVOCs exposure and thyroid nodule. METHODS A 1:1:1 age- and gender-matched case-control study was conducted in which 96 serum SVOCs were measured in 50 papillary thyroid carcinoma (PTC), 50 nodular goiters (NG), and 50 controls. Different ML techniques such as Random Forest, AdaBoost were selected based on their predictive power, and variables were selected based on their weights in the models. Weighted quantile sum (WQS) regression and Bayesian kernel machine regression (BKMR) were used to assess the mixed effects of the SVOCs exposure on thyroid nodule. RESULTS Forty-three of 96 SVOCs with detection rate >80 % were included in the analysis. ML algorithms showed a consistent selection of SVOCs associated with thyroid nodule. Fluazifop-butyl and fenpropathrin are positively associated with PTC and NG in single compound models (all P < 0.05). WQS model shows that exposure to mixed-SVOCs was associated with an increased risk of PTC and NG, with the mixture dominated by fenpropathrin, followed by fluazifop-butyl and propham. In the BKMR model, mixtures showed a significant positive association with thyroid nodule risk at high exposure levels, and fluazifop-butyl showed positive effects associated with PTC and NG. CONCLUSION This study confirms the feasibility of ML methods for variable selection in high-dimensional complex data and showed that mixed exposure to SVOCs was associated with increased risk of PTC and NG. The observed association was primarily driven by fluazifop-butyl and fenpropathrin. The findings warranted further investigation.
Collapse
Affiliation(s)
- Fei Wang
- Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China; Guangxi Key Laboratory on Precise Prevention and Treatment for Thyroid Tumor, The Second Affiliated Hospital, Guangxi University of Science and Technology, Liuzhou, Guangxi, China
| | - Yuanxin Lin
- Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China; Guangxi Key Laboratory on Precise Prevention and Treatment for Thyroid Tumor, The Second Affiliated Hospital, Guangxi University of Science and Technology, Liuzhou, Guangxi, China
| | - Jianing Xu
- Guangxi Key Laboratory on Precise Prevention and Treatment for Thyroid Tumor, The Second Affiliated Hospital, Guangxi University of Science and Technology, Liuzhou, Guangxi, China; School of Electronic Engineering, Guangxi University of Science and Technology, Liuzhou, Guangxi, China
| | - Fugui Wei
- Department of Head and Neck Surgery, The Second Affiliated Hospital of Guangxi University of Science and Technology, Liuzhou, Guangxi, China
| | - Simei Huang
- School of Science, Guangxi University of Science and Technology, Liuzhou, Guangxi, China
| | - Shifeng Wen
- Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China; Guangxi Key Laboratory on Precise Prevention and Treatment for Thyroid Tumor, The Second Affiliated Hospital, Guangxi University of Science and Technology, Liuzhou, Guangxi, China
| | - Huijiao Zhou
- Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China; Guangxi Key Laboratory on Precise Prevention and Treatment for Thyroid Tumor, The Second Affiliated Hospital, Guangxi University of Science and Technology, Liuzhou, Guangxi, China
| | - Yuwei Jiang
- Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China; Guangxi Key Laboratory on Precise Prevention and Treatment for Thyroid Tumor, The Second Affiliated Hospital, Guangxi University of Science and Technology, Liuzhou, Guangxi, China
| | - Haoyu Wang
- Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China; Guangxi Key Laboratory on Precise Prevention and Treatment for Thyroid Tumor, The Second Affiliated Hospital, Guangxi University of Science and Technology, Liuzhou, Guangxi, China
| | - Wenlong Ling
- Department of Thyroid Surgery, The Second Affiliated Hospital of Guangxi University of Science and Technology, Liuzhou, Guangxi, China
| | - Xiangzhi Li
- Guangxi Key Laboratory on Precise Prevention and Treatment for Thyroid Tumor, The Second Affiliated Hospital, Guangxi University of Science and Technology, Liuzhou, Guangxi, China; Department of Public Health, School of Medicine, Guangxi University of Science and Technology, Liuzhou, Guangxi, China
| | - Xiaobo Yang
- Department of Occupational Health and Environmental Health, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China; Guangxi Key Laboratory on Precise Prevention and Treatment for Thyroid Tumor, The Second Affiliated Hospital, Guangxi University of Science and Technology, Liuzhou, Guangxi, China.
| |
Collapse
|
19
|
Lin W, Shi S, Lan H, Wang N, Huang H, Wen J, Chen G. Identification of influence factors in overweight population through an interpretable risk model based on machine learning: a large retrospective cohort. Endocrine 2024; 83:604-614. [PMID: 37776483 DOI: 10.1007/s12020-023-03536-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 09/12/2023] [Indexed: 10/02/2023]
Abstract
BACKGROUND The identification of associated overweight risk factors is crucial to future health risk predictions and behavioral interventions. Several consensus problems remain in machine learning, such as cross-validation, and the resulting model may suffer from overfitting or poor interpretability. METHODS This study employed nine commonly used machine learning methods to construct overweight risk models. The general community are the target of this study, and a total of 10,905 Chinese subjects from Ningde City in Fujian province, southeast China, participated. The best model was selected through appropriate verification and validation and was suitably explained. RESULTS The overweight risk models employing machine learning exhibited good performance. It was concluded that CatBoost, which is used in the construction of clinical risk models, may surpass previous machine learning methods. The visual display of the Shapley additive explanation value for the machine model variables accurately represented the influence of each variable in the model. CONCLUSIONS The construction of an overweight risk model using machine learning may currently be the best approach. Moreover, CatBoost may be the best machine learning method. Furthermore, combining Shapley's additive explanation and machine learning methods can be effective in identifying disease risk factors for prevention and control.
Collapse
Affiliation(s)
- Wei Lin
- Department of Endocrinology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, FuZhou, 350001, PR China.
| | - Songchang Shi
- Department of Critical Care Medicine, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital South Branch, Fujian Provincial Hospital Jinshan Branch, Fujian Provincial Hospital, Fuzhou, 350001, PR China
| | - Huiyu Lan
- Department of Endocrinology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, FuZhou, 350001, PR China
| | - Nengying Wang
- Department of Endocrinology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, FuZhou, 350001, PR China
| | - Huibin Huang
- Department of Endocrinology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, FuZhou, 350001, PR China
| | - Junping Wen
- Department of Endocrinology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, FuZhou, 350001, PR China
| | - Gang Chen
- Department of Endocrinology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, FuZhou, 350001, PR China.
| |
Collapse
|
20
|
Chen Q, Hu H, She Y, He Q, Huang X, Shi H, Cao X, Zhang X, Xu Y. An artificial neural network model for evaluating the risk of hyperuricaemia in type 2 diabetes mellitus. Sci Rep 2024; 14:2197. [PMID: 38273015 PMCID: PMC10810925 DOI: 10.1038/s41598-024-52550-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2023] [Accepted: 01/19/2024] [Indexed: 01/27/2024] Open
Abstract
Type 2 diabetes with hyperuricaemia may lead to gout, kidney damage, hypertension, coronary heart disease, etc., further aggravating the condition of diabetes as well as adding to the medical and financial burden. To construct a risk model for hyperuricaemia in patients with type 2 diabetes mellitus based on artificial neural network, and to evaluate the effectiveness of the risk model to provide directions for the prevention and control of the disease in this population. From June to December 2022, 8243 patients with type 2 diabetes were recruited from six community service centers for questionnaire and physical examination. Secondly, the collected data were used to select suitable variables and based on the comparison results, logistic regression was used to screen the variable characteristics. Finally, three risk models for evaluating the risk of hyperuricaemia in type 2 diabetes mellitus were developed using an artificial neural network algorithm and evaluated for performance. A total of eleven factors affecting the development of hyperuricaemia in patients with type 2 diabetes mellitus in this study, including gender, waist circumference, diabetes medication use, diastolic blood pressure, γ-glutamyl transferase, blood urea nitrogen, triglycerides, low-density lipoprotein cholesterol, high-density lipoprotein cholesterol, fasting glucose and estimated glomerular filtration rate. Among the generated models, baseline & biochemical risk model had the best performance with cutoff, area under the curve, accuracy, recall, specificity, positive likelihood ratio, negative likelihood ratio, precision, negative predictive value, KAPPA and F1-score were 0.488, 0.744, 0.689, 0.625, 0.749, 2.489, 0.501, 0.697, 0.684, 0.375 and 0.659. In addition, its Brier score was 0.169 and the calibration curve also showed good agreement between fitting and observation. The constructed artificial neural network model has better efficacy and facilitates the reduction of the harm caused by type 2 diabetes mellitus combined with hyperuricaemia.
Collapse
Affiliation(s)
- Qingquan Chen
- The Affiliated Fuzhou Center for Disease Control and Prevention of Fujian Medical University, Fuzhou, China
- School of Public Health, Fujian Medical University, Fuzhou, China
| | - Haiping Hu
- The Affiliated Fuzhou Center for Disease Control and Prevention of Fujian Medical University, Fuzhou, China
- School of Public Health, Fujian Medical University, Fuzhou, China
| | - Yuanyu She
- The Affiliated Fuzhou Center for Disease Control and Prevention of Fujian Medical University, Fuzhou, China
- School of Public Health, Fujian Medical University, Fuzhou, China
| | - Qing He
- The Affiliated Fuzhou Center for Disease Control and Prevention of Fujian Medical University, Fuzhou, China
- School of Public Health, Fujian Medical University, Fuzhou, China
| | - Xinfeng Huang
- The Affiliated Fuzhou Center for Disease Control and Prevention of Fujian Medical University, Fuzhou, China
- School of Public Health, Fujian Medical University, Fuzhou, China
| | - Huanhuan Shi
- The Affiliated Fuzhou Center for Disease Control and Prevention of Fujian Medical University, Fuzhou, China
- School of Public Health, Fujian Medical University, Fuzhou, China
| | - Xiangyu Cao
- The Affiliated Fuzhou Center for Disease Control and Prevention of Fujian Medical University, Fuzhou, China
- School of Public Health, Fujian Medical University, Fuzhou, China
| | - Xiaoyang Zhang
- The Affiliated Fuzhou Center for Disease Control and Prevention of Fujian Medical University, Fuzhou, China.
- School of Public Health, Fujian Medical University, Fuzhou, China.
| | - Youqiong Xu
- The Affiliated Fuzhou Center for Disease Control and Prevention of Fujian Medical University, Fuzhou, China.
- School of Public Health, Fujian Medical University, Fuzhou, China.
| |
Collapse
|
21
|
Talukder MA, Islam MM, Uddin MA, Kazi M, Khalid M, Akhter A, Ali Moni M. Toward reliable diabetes prediction: Innovations in data engineering and machine learning applications. Digit Health 2024; 10:20552076241271867. [PMID: 39175924 PMCID: PMC11339751 DOI: 10.1177/20552076241271867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Accepted: 06/27/2024] [Indexed: 08/24/2024] Open
Abstract
OBJECTIVE Diabetes is a metabolic disorder that causes the risk of stroke, heart disease, kidney failure, and other long-term complications because diabetes generates excess sugar in the blood. Machine learning (ML) models can aid in diagnosing diabetes at the primary stage. So, we need an efficient ML model to diagnose diabetes accurately. METHODS In this paper, an effective data preprocessing pipeline has been implemented to process the data and random oversampling to balance the data, handling the imbalance distributions of the observational data more sophisticatedly. We used four different diabetes datasets to conduct our experiments. Several ML algorithms were used to determine the best models to predict diabetes faultlessly. RESULTS The performance analysis demonstrates that among all ML algorithms, random forest surpasses the current works with an accuracy rate of 86% and 98.48% for Dataset 1 and Dataset 2; extreme gradient boosting and decision tree surpass with an accuracy rate of 99.27% and 100% for Dataset 3 and Dataset 4, respectively. Our proposal can increase accuracy by 12.15% compared to the model without preprocessing. CONCLUSIONS This excellent research finding indicates that the proposed models might be employed to produce more accurate diabetes predictions to supplement current preventative interventions to reduce the incidence of diabetes and its associated costs.
Collapse
Affiliation(s)
- Md. Alamin Talukder
- Department of Computer Science and Engineering, International University of Business Agriculture and Technology, Dhaka, Bangladesh
| | - Md. Manowarul Islam
- Department of Computer Science and Engineering, Jagannath University, Dhaka, Bangladesh
| | - Md Ashraf Uddin
- School of Information Technology, Deakin University, Waurn Ponds Campus, Geelong, Australia
| | - Mohsin Kazi
- Department of Pharmaceutics, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia
| | - Majdi Khalid
- Department of Computer Science and Artificial Intelligence, College of Computing, Umm Al-Qura University, Makkah, Saudi Arabia
| | - Arnisha Akhter
- Department of Computer Science and Engineering, Jagannath University, Dhaka, Bangladesh
| | - Mohammad Ali Moni
- Artificial Intelligence & Data Science, Faculty of Health and Behavioural Sciences, The University of Queensland, Brisbane, Australia
| |
Collapse
|
22
|
Li S, Zhang W, Hu X. Comprehensive analysis of necroptosis-related genes in renal ischemia-reperfusion injury. Front Immunol 2023; 14:1279603. [PMID: 37965311 PMCID: PMC10641517 DOI: 10.3389/fimmu.2023.1279603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Accepted: 10/16/2023] [Indexed: 11/16/2023] Open
Abstract
Background Oxidative stress is the primary cause of ischemia-reperfusion injury (IRI) in kidney transplantation, leading to delayed graft function (DGF) and implications on patient health. Necroptosis is believed to play a role in renal IRI. This research presents a comprehensive analysis of necroptosis-related genes and their functional implications in the context of IRI in renal transplantation. Methods The necroptosis-related differentially expressed genes (NR-DEGs) were identified using gene expression data from pre- and post-reperfusion renal biopsies, and consensus clustering analysis was performed to distinguish necroptosis-related clusters. A predictive model for DGF was developed based on the NR-DEGs and patients were divided into high- and low-risk groups. We investigated the differences in functional enrichment and immune infiltration between different clusters and risk groups and further validated them in single-cell RNA-sequencing (scRNA-seq) data. Finally, we verified the expression changes of NR-DEGs in an IRI mouse model. Results Five NR-DEGs were identified and were involved in various biological processes. The renal samples were further stratified into two necroptosis-related clusters (C1 and C2) showing different occurrences of DGF. The predictive model had a reliable performance in identifying patients at higher risk of DGF with the area under the curve as 0.798. Additionally, immune infiltration analysis indicated more abundant proinflammatory cells in the high-risk group, which was also found in C2 cluster with more DGF patients. Validation of NR-DEG in scRNA-seq data further supported their involvement in immune cells. Lastly, the mouse model validated the up-regulation of NR-DEGs after IR and indicated the correlations with kidney function markers. Conclusions Our research provides valuable insights into the identification and functional characterization of NR-DEGs in the context of renal transplantation and sheds light on their involvement in immune responses and the progression of IRI and DGF.
Collapse
Affiliation(s)
- Shuai Li
- Department of Urology, Beijing Chao-Yang Hospital, Capital Medical University, Beijing, China
- Institute of Urology, Capital Medical University, Beijing, China
| | - Weixun Zhang
- Department of Urology, Beijing Chao-Yang Hospital, Capital Medical University, Beijing, China
- Institute of Urology, Capital Medical University, Beijing, China
| | - Xiaopeng Hu
- Department of Urology, Beijing Chao-Yang Hospital, Capital Medical University, Beijing, China
- Institute of Urology, Capital Medical University, Beijing, China
| |
Collapse
|
23
|
Li W, Huang G, Tang N, Lu P, Jiang L, Lv J, Qin Y, Lin Y, Xu F, Lei D. Association between co-exposure to phenols, phthalates, and polycyclic aromatic hydrocarbons with the risk of frailty. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:105181-105193. [PMID: 37713077 DOI: 10.1007/s11356-023-29887-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Accepted: 09/11/2023] [Indexed: 09/16/2023]
Abstract
The phenomenon of population aging has brought forth the challenge of frailty. Nevertheless, the contribution of environmental exposure to frailty remains ambiguous. Our objective was to investigate the association between phenols, phthalates (PAEs), and polycyclic aromatic hydrocarbons (PAHs) with frailty. We constructed a 48-item frailty index using data from the National Health and Nutrition Examination Survey (NHANES). The exposure levels of 20 organic contaminants were obtained from the survey circle between 2005 and 2016. The association between individual organic contaminants and the frailty index was assessed using negative binomial regression models. The combined effect of organic contaminants was examined using weighted quantile sum (WQS) regression. Dose-response patterns were modeled using generalized additive models (GAMs). Additionally, an interpretable machine learning approach was employed to develop a predictive model for the frailty index. A total of 1566 participants were included in the analysis. Positive associations were observed between exposure to MIB, P02, ECP, MBP, MHH, MOH, MZP, MC1, and P01 with the frailty index. WQS regression analysis revealed a significant increase in the frailty index with higher levels of the mixture of organic contaminants (aOR, 1.12; 95% CI, 1.05-1.20; p < 0.001), with MIB, ECP, COP, MBP, P02, and P01 identified as the major contributors. Dose-response relationships were observed between MIB, ECP, MBP, P02, and P01 exposure with an increased risk of frailty (both with p < 0.05). The developed predictive model based on organic contaminants exposure demonstrated high performance, with an R2 of 0.9634 and 0.9611 in the training and testing sets, respectively. Furthermore, the predictive model suggested potential synergistic effects in the MIB-MBP and P01-P02 pairs. Taken together, these findings suggest a significant association between exposure to phthalates and PAHs with an increased susceptibility to frailty.
Collapse
Affiliation(s)
- Wenxiang Li
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences, Nanning, 530021, People's Republic of China
| | - Guangyi Huang
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences, Nanning, 530021, People's Republic of China
| | - Ningning Tang
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences, Nanning, 530021, People's Republic of China
| | - Peng Lu
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences, Nanning, 530021, People's Republic of China
| | - Li Jiang
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences, Nanning, 530021, People's Republic of China
| | - Jian Lv
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences, Nanning, 530021, People's Republic of China
| | - Yuanjun Qin
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences, Nanning, 530021, People's Republic of China
| | - Yunru Lin
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences, Nanning, 530021, People's Republic of China
| | - Fan Xu
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences, Nanning, 530021, People's Republic of China
| | - Daizai Lei
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences, Nanning, 530021, People's Republic of China.
- Department of Ophthalmology, The People's Hospital of Guangxi Zhuang Autonomous Region, 6 Taoyuan Road, Qingxiu District, Nanning, 530000, China.
| |
Collapse
|
24
|
Li W, Huang G, Tang N, Lu P, Jiang L, Lv J, Qin Y, Lin Y, Xu F, Lei D. Identification of dietary components in association with abdominal aortic calcification. Food Funct 2023; 14:8383-8395. [PMID: 37609915 DOI: 10.1039/d3fo02920d] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
The precise impact of dietary components on vascular health remains incompletely understood. To identify the dietary components and their associations with abdominal aortic calcification (AAC), the data from NHANES was employed in this cross-sectional study. The LASSO method and logistic regression were utilized to identify dietary components that exhibited the strongest association with AAC. Grouped WQS regression analysis was employed to evaluate the combined effects of dietary components on AAC. Furthermore, principal component analysis was employed to identify the primary dietary patterns in the study population. The present analysis included 1862 participants, from whom information on 35 dietary macro- and micronutrient components was obtained through 24-hour dietary recall interviews. The assessment of AAC was performed utilizing dual-energy X-ray absorptiometry. The LASSO method identified 10 dietary components that were associated with AAC. Total protein, total fiber, vitamin A, and β-cryptoxanthin exhibited a negative association with AAC. Compared to the first quartile, the adjusted odds ratios (95% CIs) for the highest quartile were 0.59 (0.38, 0.93), 0.63 (0.42, 0.93), 0.59 (0.41, 0.84), and 0.68 (0.48, 0.94), respectively. Grouped WQS regression demonstrated a positive association between the lipid group and AAC (aOR: 1.29; 95% CI: 1.12, 1.50), while the proteins and phytochemical group exhibited a negative association with AAC (aOR: 0.69; 95% CI: 0.58, 0.82). For the dietary pattern analysis, high adherence to the plant-based pattern (aOR: 0.62; 95% CI: 0.44, 0.88) was associated with a lower risk of AAC, whereas the caffeine and theobromine pattern (aOR: 1.73; 95% CI: 1.25, 2.41) was associated with a higher risk of AAC. The findings of this study indicate that adopting a dietary pattern characterized by high levels of protein and plant-based foods, as well as reduced levels of fat, may offers potential advantages for the prevention of AAC.
Collapse
Affiliation(s)
- Wenxiang Li
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, People's Republic of China.
| | - Guangyi Huang
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, People's Republic of China.
| | - Ningning Tang
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, People's Republic of China.
| | - Peng Lu
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, People's Republic of China.
| | - Li Jiang
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, People's Republic of China.
| | - Jian Lv
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, People's Republic of China.
| | - Yuanjun Qin
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, People's Republic of China.
| | - Yunru Lin
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, People's Republic of China.
| | - Fan Xu
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, People's Republic of China.
| | - Daizai Lei
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, People's Republic of China.
| |
Collapse
|
25
|
Iparraguirre-Villanueva O, Espinola-Linares K, Flores Castañeda RO, Cabanillas-Carbonell M. Application of Machine Learning Models for Early Detection and Accurate Classification of Type 2 Diabetes. Diagnostics (Basel) 2023; 13:2383. [PMID: 37510127 PMCID: PMC10378239 DOI: 10.3390/diagnostics13142383] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 06/23/2023] [Accepted: 07/07/2023] [Indexed: 07/30/2023] Open
Abstract
Early detection of diabetes is essential to prevent serious complications in patients. The purpose of this work is to detect and classify type 2 diabetes in patients using machine learning (ML) models, and to select the most optimal model to predict the risk of diabetes. In this paper, five ML models, including K-nearest neighbor (K-NN), Bernoulli Naïve Bayes (BNB), decision tree (DT), logistic regression (LR), and support vector machine (SVM), are investigated to predict diabetic patients. A Kaggle-hosted Pima Indian dataset containing 768 patients with and without diabetes was used, including variables such as number of pregnancies the patient has had, blood glucose concentration, diastolic blood pressure, skinfold thickness, body insulin levels, body mass index (BMI), genetic background, diabetes in the family tree, age, and outcome (with/without diabetes). The results show that the K-NN and BNB models outperform the other models. The K-NN model obtained the best accuracy in detecting diabetes, with 79.6% accuracy, while the BNB model obtained 77.2% accuracy in detecting diabetes. Finally, it can be stated that the use of ML models for the early detection of diabetes is very promising.
Collapse
|
26
|
Li W, Huang G, Tang N, Lu P, Jiang L, Lv J, Qin Y, Lin Y, Xu F, Lei D. Effects of heavy metal exposure on hypertension: A machine learning modeling approach. CHEMOSPHERE 2023; 337:139435. [PMID: 37422210 DOI: 10.1016/j.chemosphere.2023.139435] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 07/04/2023] [Accepted: 07/05/2023] [Indexed: 07/10/2023]
Abstract
Heavy metal exposure is a common risk factor for hypertension. To develop an interpretable predictive machine learning (ML) model for hypertension based on levels of heavy metal exposure, data from the NHANES (2003-2016) were employed. Random forest (RF), support vector machine (SVM), decision tree (DT), multilayer perceptron (MLP), ridge regression (RR), AdaBoost (AB), gradient boosting decision tree (GBDT), voting classifier (VC), and K-nearest neighbour (KNN) algorithms were utilized to generate an optimal predictive model for hypertension. Three interpretable methods, the permutation feature importance analysis, partial dependence plot (PDP), and Shapley additive explanations (SHAP) methods, were integrated into a pipeline and embedded in ML for model interpretation. A total of 9005 eligible individuals were randomly allocated into two distinct sets for predictive model training and validation. The results showed that among the predictive models, the RF model demonstrated the highest performance, achieving an accuracy rate of 77.40% in the validation set. The AUC and F1 score for the model were 0.84 and 0.76, respectively. Blood Pb, urinary Cd, urinary Tl, and urinary Co levels were identified as the main influencers of hypertension, and their contribution weights were 0.0504 ± 0.0482, 0.0389 ± 0.0256, 0.0307 ± 0.0179, and 0.0296 ± 0.0162, respectively. Blood Pb (0.55-2.93 μg/dL) and urinary Cd (0.06-0.15 μg/L) levels exhibited the most pronounced upwards trend with the risk of hypertension within a specific value range, while urinary Tl (0.06-0.26 μg/L) and urinary Co (0.02-0.32 μg/L) levels demonstrated a declining trend with hypertension. The findings on the synergistic effects indicated that Pb and Cd were the primary determinants of hypertension. Our findings underscore the predictive value of heavy metals for hypertension. By utilizing interpretable methods, we discerned that Pb, Cd, Tl, and Co emerged as noteworthy contributors within the predictive model.
Collapse
Affiliation(s)
- Wenxiang Li
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, China.
| | - Guangyi Huang
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, China
| | - Ningning Tang
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, China
| | - Peng Lu
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, China
| | - Li Jiang
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, China
| | - Jian Lv
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, China
| | - Yuanjun Qin
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, China
| | - Yunru Lin
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, China
| | - Fan Xu
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, China.
| | - Daizai Lei
- Department of Ophthalmology, the People's Hospital of Guangxi Zhuang Autonomous Region & Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences & Guangxi Key Laboratory of Eye Health & Guangxi Health Commission Key Laboratory of Ophthalmology and Related Systemic Diseases Artificial Intelligence Screening Technology, Nanning, 530021, China.
| |
Collapse
|
27
|
Zhao M, Wan J, Qin W, Huang X, Chen G, Zhao X. A machine learning-based diagnosis modelling of type 2 diabetes mellitus with environmental metal exposure. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023; 235:107537. [PMID: 37037162 DOI: 10.1016/j.cmpb.2023.107537] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Revised: 03/02/2023] [Accepted: 04/04/2023] [Indexed: 05/08/2023]
Abstract
BACKGROUND AND OBJECTIVE Increasing and compelling evidence has been proved that urinary and dietary metal exposure are underappreciated but potentially modifiable biomarkers for type 2 diabetes mellitus (T2DM). The aims of this study were (1) to identify the key potential biomarkers which contributed to T2DM with effective and parsimonious features and (2) to assess the utility of baseline variables and metal exposure in the diagnosis of T2DM. METHODS Based on the National Health and Nutrition Examination Survey (NHANES), we selected 9822 screening records with 82 significant variables covering demographics, lifestyle, anthropometric measures, diet and metal exposure for this study. Combining extreme gradient boosting (XGBoost), random forest and light gradient boosting machine (lightGBM), a soft voting ensemble model was proposed to measure the importance of 82 features. With this soft voting ensemble model and variance inflation factor (VIF), strong multicollinear features with low importance scores were further removed from candidate biomarkers. Then, a soft voting ensemble classifier was adopted to demonstrate the efficiency of the proposed feature selection method. RESULTS With the novel feature selection method, 12 baseline variables and 3 metal variables were selected to detect patients at risk for T2DM in our study. For metal variables, the dietary copper (Cu), urinary cadmium (Cd) and urinary mercury (Hg) metals were selected as the most remarkable metal exposure and the corresponding P-values were all less than 0.05. In a classification model of T2DM with 12 baseline biomarkers, the addition of 3 metal exposure improved the classification accuracy of T2DM from a traditional area under the curve (AUC) 0.792 of the receiver operating characteristic (ROC) to an AUC 0.847. CONCLUSIONS This was the first demonstration of T2DM classification with machine learning under urinary and dietary metal exposure. Improved prediction precision illustrated the effectiveness of the proposed machine learning-based diagnosis model facilitated lifestyle/dietary intervention for T2DM prevention.
Collapse
Affiliation(s)
- Min Zhao
- School of Science, Nantong University, Nantong, 226019, China
| | - Jin Wan
- School of Science, Nantong University, Nantong, 226019, China
| | - Wenzhi Qin
- School of Science, Nantong University, Nantong, 226019, China
| | - Xin Huang
- School of Science, Nantong University, Nantong, 226019, China
| | - Guangdi Chen
- Bioelectromagnetics Laboratory, and Department of Reproductive Endocrinology of Women's Hospital, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Xinyuan Zhao
- Department of occupational Medicine and Environmental Toxicology, School of Public Health, Nantong University, Nantong, 226019, China.
| |
Collapse
|
28
|
Afsaneh E, Sharifdini A, Ghazzaghi H, Ghobadi MZ. Recent applications of machine learning and deep learning models in the prediction, diagnosis, and management of diabetes: a comprehensive review. Diabetol Metab Syndr 2022; 14:196. [PMID: 36572938 PMCID: PMC9793536 DOI: 10.1186/s13098-022-00969-9] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 12/16/2022] [Indexed: 12/28/2022] Open
Abstract
Diabetes as a metabolic illness can be characterized by increased amounts of blood glucose. This abnormal increase can lead to critical detriment to the other organs such as the kidneys, eyes, heart, nerves, and blood vessels. Therefore, its prediction, prognosis, and management are essential to prevent harmful effects and also recommend more useful treatments. For these goals, machine learning algorithms have found considerable attention and have been developed successfully. This review surveys the recently proposed machine learning (ML) and deep learning (DL) models for the objectives mentioned earlier. The reported results disclose that the ML and DL algorithms are promising approaches for controlling blood glucose and diabetes. However, they should be improved and employed in large datasets to affirm their applicability.
Collapse
|
29
|
Unnikrishnan PS, Animish A, Madhumitha G, Suthindhiran K, Jayasri MA. Bioactivity Guided Study for the Isolation and Identification of Antidiabetic Compounds from Edible Seaweed- Ulva reticulata. Molecules 2022; 27:molecules27248827. [PMID: 36557959 PMCID: PMC9783910 DOI: 10.3390/molecules27248827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 12/01/2022] [Accepted: 12/06/2022] [Indexed: 12/15/2022] Open
Abstract
Managing diabetes is challenging due to the complex physiology of the disease and the numerous complications associated with it. As part of the ongoing search for antidiabetic chemicals, marine algae have been demonstrated to be an excellent source due to their medicinal properties. In this study, Ulva reticulata extracts were investigated for their anti-diabetic effect by examining its inhibitory effects on α-amylase, α-glucosidase, and DPP-IV and antioxidant (DPPH) potential in vitro and its purified fraction using animal models. Among the various solvents used, the Methanolic extract of Ulva reticulata (MEUR) displayed the highest antidiabetic activity in both in vitro and in vivo; it showed no cytotoxicity and hence was subjected to bioassay-guided chromatographic separation. Among the seven isolated fractions (F1 to F7), the F4 (chloroform) fraction exhibited substantial total phenolic content (65.19 μg mL-1) and total flavonoid content (20.33 μg mL-1), which showed the promising inhibition against α-amylase (71.67%) and α-glucosidase (38.01%). Active fraction (F4) was further purified using column chromatography, subjected to thin-layer chromatography (TLC), and characterized by spectroscopy techniques. Upon structural elucidation, five distinct compounds, namely, Nonane, Hexadecanoic acid, 1-dodecanol, Cyclodecane methyl, and phenol, phenol, 3,5-bis(1,1-dimethylethyl) were identified. The antidiabetic mechanism of active fraction (F4) was further investigated using various in vitro and in vivo models. The results displayed that in in vitro both 1 and 24 h in vitro cultures, the active fraction (F4) at a concentration of 100 μg mL-1 demonstrated maximum glucose-induced insulin secretion at 4 mM (0.357 and 0.582 μg mL-1) and 20 mM (0.848 and 1.032 μg mL-1). The active fraction (F4) reduces blood glucose levels in normoglycaemic animals and produces effects similar to that of standard acarbose. Active fraction (F4) also demonstrated outstanding hypoglycaemic activity in hyperglycemic animals at a dose of 10 mg/kg B.wt. In the STZ-induced diabetic rat model, the active fraction (F4) showed a (61%) reduction in blood glucose level when compared to the standard drug glibenclamide (68%). The results indicate that the marine algae Ulva reticulata is a promising candidate for managing diabetes by inhibiting carbohydrate metabolizing enzymes and promoting insulin secretion.
Collapse
Affiliation(s)
| | - Andhere Animish
- Marine Biotechnology and Bioproducts Laboratory, Vellore Institute of Technology, School of Biosciences and Technology, Vellore 632014, India
| | - Gunabalan Madhumitha
- Chemistry of Heterocycles and Natural Products Research Laboratory, Vellore Institute of Technology, School of Advanced Sciences, Vellore 632014, India
| | - Krishnamurthy Suthindhiran
- Marine Biotechnology and Bioproducts Laboratory, Vellore Institute of Technology, School of Biosciences and Technology, Vellore 632014, India
| | - Mangalam Achuthananthan Jayasri
- Marine Biotechnology and Bioproducts Laboratory, Vellore Institute of Technology, School of Biosciences and Technology, Vellore 632014, India
- Correspondence:
| |
Collapse
|
30
|
Morgan-Benita J, Sánchez-Reyna AG, Espino-Salinas CH, Oropeza-Valdez JJ, Luna-García H, Galván-Tejada CE, Galván-Tejada JI, Gamboa-Rosales H, Enciso-Moreno JA, Celaya-Padilla J. Metabolomic Selection in the Progression of Type 2 Diabetes Mellitus: A Genetic Algorithm Approach. Diagnostics (Basel) 2022; 12:diagnostics12112803. [PMID: 36428864 PMCID: PMC9689091 DOI: 10.3390/diagnostics12112803] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2022] [Revised: 11/04/2022] [Accepted: 11/07/2022] [Indexed: 11/18/2022] Open
Abstract
According to the World Health Organization (WHO), type 2 diabetes mellitus (T2DM) is a result of the inefficient use of insulin by the body. More than 95% of people with diabetes have T2DM, which is largely due to excess weight and physical inactivity. This study proposes an intelligent feature selection of metabolites related to different stages of diabetes, with the use of genetic algorithms (GA) and the implementation of support vector machines (SVMs), K-Nearest Neighbors (KNNs) and Nearest Centroid (NEARCENT) and with a dataset obtained from the Instituto Mexicano del Seguro Social with the protocol name of the following: "Análisis metabolómico y transcriptómico diferencial en orina y suero de pacientes pre diabéticos, diabéticos y con nefropatía diabética para identificar potenciales biomarcadores pronósticos de daño renal" (differential metabolomic and transcriptomic analyses in the urine and serum of pre-diabetic, diabetic and diabetic nephropathy patients to identify potential prognostic biomarkers of kidney damage). In order to analyze which machine learning (ML) model is the most optimal for classifying patients with some stage of T2DM, the novelty of this work is to provide a genetic algorithm approach that detects significant metabolites in each stage of progression. More than 100 metabolites were identified as significant between all stages; with the data analyzed, the average accuracies obtained in each of the five most-accurate implementations of genetic algorithms were in the range of 0.8214-0.9893 with respect to average accuracy, providing a precise tool to use in detections and backing up a diagnosis constructed entirely with metabolomics. By providing five potential biomarkers for progression, these extremely significant metabolites are as follows: "Cer(d18:1/24:1) i2", "PC(20:3-OH/P-18:1)", "Ganoderic acid C2", "TG(16:0/17:1/18:1)" and "GPEtn(18:0/20:4)".
Collapse
Affiliation(s)
- Jorge Morgan-Benita
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico
| | - Ana G. Sánchez-Reyna
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico
| | - Carlos H. Espino-Salinas
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico
| | - Juan José Oropeza-Valdez
- Metabolomics and Proteomics Laboratory, Autonomous University of Zacatecas, Zacatecas 98000, Mexico
| | - Huizilopoztli Luna-García
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico
| | - Carlos E. Galván-Tejada
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico
| | - Jorge I. Galván-Tejada
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico
| | - Hamurabi Gamboa-Rosales
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico
| | | | - José Celaya-Padilla
- Unidad Académica de Ingeniería Eléctrica, Universidad Autónoma de Zacatecas, Jardín Juarez 147, Centro, Zacatecas 98000, Mexico
- Correspondence:
| |
Collapse
|
31
|
Guo X, Wang H, Song Q, Li N, Liang Q, Su W, Liang M, Ding X, Sun C, Lowe S, Sun Y. Association between exposure to organophosphorus pesticides and the risk of diabetes among US Adults: Cross-sectional findings from the National Health and Nutrition Examination Survey. CHEMOSPHERE 2022; 301:134471. [PMID: 35367493 DOI: 10.1016/j.chemosphere.2022.134471] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Revised: 03/26/2022] [Accepted: 03/28/2022] [Indexed: 06/14/2023]
Abstract
OBJECTIVE Organophosphorus pesticides (OPPs) are commonly used pesticides across the world, however there is little epidemiological evidence linking their exposure to diabetes. Hence, this study aimed at investigating the effect of OPP exposure on the prevalence of diabetes in American adults. METHODS Adults (≥20 years old) were eligible for this study from the National Health and Nutrition Examination Survey (NHANES). Multivariate logistic regression model was employed to explore the associations of six main urinary OPPs metabolites with diabetes. Subgroup analyses were performed by age and gender. Combined effect of OPPs metabolites on the overall association with diabetes was evaluated by weighted quantile sum regression (WQS). Furthermore, Bayesian kernel machine regression (BKMR) model was implemented to explore joint effect of multiple OPPs metabolites on diabetes. RESULTS Ultimately, 6,593 adults were included in our analysis. Of them, 1,044 participants were determined as diabetes patients. The results of logistic regression shown that urinary OPPs metabolites concentrations, whether taken as continuous variables or quantiles, were in positive correlation with diabetes. Notably, the p for trend of diethylphosphate (DEP), a kind of OPPs metabolites, was less than 0.05 indicated that a linear trend may exist between levels of DEP and prevalence of diabetes among adults while this trend was not obversed in other OPPs metabolites. In the WQS model, combined exposure of OPPs metabolites had a significantly positive association with diabetes (OR: 1.057; 95% CI: 1.002, 1.114) and diethylphosphate (36.84%) made the largest contributor to the WQS index. The result of BKMR also suggested a positive trend of association between mixed OPPs metabolites and diabetes. CONCLUSION Our results add credibility to the argument that OPP exposure might trigger diabetes. Certainly, prospective data are required to corroborate our findings.
Collapse
Affiliation(s)
- Xianwei Guo
- Department of Epidemiology and Health Statistics, School of Public Health, Anhui Medical University, Hefei, 230032, Anhui, PR China
| | - Hao Wang
- Department of Epidemiology and Health Statistics, School of Public Health, Anhui Medical University, Hefei, 230032, Anhui, PR China
| | - Qiuxia Song
- Department of Epidemiology and Health Statistics, School of Public Health, Anhui Medical University, Hefei, 230032, Anhui, PR China
| | - Ning Li
- Department of Epidemiology and Health Statistics, School of Public Health, Anhui Medical University, Hefei, 230032, Anhui, PR China
| | - Qiwei Liang
- Department of Epidemiology and Health Statistics, School of Public Health, Anhui Medical University, Hefei, 230032, Anhui, PR China
| | - Wanying Su
- Department of Epidemiology and Health Statistics, School of Public Health, Anhui Medical University, Hefei, 230032, Anhui, PR China
| | - Mingming Liang
- Department of Epidemiology and Health Statistics, School of Public Health, Anhui Medical University, Hefei, 230032, Anhui, PR China
| | - Xiuxiu Ding
- Department of Epidemiology and Health Statistics, School of Public Health, Anhui Medical University, Hefei, 230032, Anhui, PR China
| | - Chenyu Sun
- Internal Medicine, AMITA Health Saint Joseph Hospital Chicago, 2900 N. Lake Shore Drive, Chicago, IL, 60657, USA
| | - Scott Lowe
- College of Osteopathic Medicine, Kansas City University, 1750 Independence Ave, Kansas City, MO, 64106, USA
| | - Yehuan Sun
- Department of Epidemiology and Health Statistics, School of Public Health, Anhui Medical University, Hefei, 230032, Anhui, PR China; Chaohu Hospital, Anhui Medical University, Hefei, 238000, Anhui, PR China.
| |
Collapse
|
32
|
Liu Q, Zhang M, He Y, Zhang L, Zou J, Yan Y, Guo Y. Predicting the Risk of Incident Type 2 Diabetes Mellitus in Chinese Elderly Using Machine Learning Techniques. J Pers Med 2022; 12:jpm12060905. [PMID: 35743691 PMCID: PMC9224915 DOI: 10.3390/jpm12060905] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2022] [Revised: 05/21/2022] [Accepted: 05/27/2022] [Indexed: 02/04/2023] Open
Abstract
Early identification of individuals at high risk of diabetes is crucial for implementing early intervention strategies. However, algorithms specific to elderly Chinese adults are lacking. The aim of this study is to build effective prediction models based on machine learning (ML) for the risk of type 2 diabetes mellitus (T2DM) in Chinese elderly. A retrospective cohort study was conducted using the health screening data of adults older than 65 years in Wuhan, China from 2018 to 2020. With a strict data filtration, 127,031 records from the eligible participants were utilized. Overall, 8298 participants were diagnosed with incident T2DM during the 2-year follow-up (2019–2020). The dataset was randomly split into training set (n = 101,625) and test set (n = 25,406). We developed prediction models based on four ML algorithms: logistic regression (LR), decision tree (DT), random forest (RF), and extreme gradient boosting (XGBoost). Using LASSO regression, 21 prediction features were selected. The Random under-sampling (RUS) was applied to address the class imbalance, and the Shapley Additive Explanations (SHAP) was used to calculate and visualize feature importance. Model performance was evaluated by the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and accuracy. The XGBoost model achieved the best performance (AUC = 0.7805, sensitivity = 0.6452, specificity = 0.7577, accuracy = 0.7503). Fasting plasma glucose (FPG), education, exercise, gender, and waist circumference (WC) were the top five important predictors. This study showed that XGBoost model can be applied to screen individuals at high risk of T2DM in the early phrase, which has the strong potential for intelligent prevention and control of diabetes. The key features could also be useful for developing targeted diabetes prevention interventions.
Collapse
Affiliation(s)
- Qing Liu
- Department of Epidemiology, School of Public Health, Wuhan University, Wuhan 430071, China; (Q.L.); (M.Z.)
| | - Miao Zhang
- Department of Epidemiology, School of Public Health, Wuhan University, Wuhan 430071, China; (Q.L.); (M.Z.)
| | - Yifeng He
- School of Geodesy and Geomatics, Wuhan University, Wuhan 430079, China; (Y.H.); (J.Z.)
| | - Lei Zhang
- School of Mathematics and Statistics, Wuhan University, Wuhan 430070, China;
| | - Jingui Zou
- School of Geodesy and Geomatics, Wuhan University, Wuhan 430079, China; (Y.H.); (J.Z.)
| | - Yaqiong Yan
- Wuhan Center for Disease Control and Prevention, Wuhan 430015, China;
| | - Yan Guo
- Wuhan Center for Disease Control and Prevention, Wuhan 430015, China;
- Correspondence:
| |
Collapse
|
33
|
Oh R, Lee HK, Pak YK, Oh MS. An Interactive Online App for Predicting Diabetes via Machine Learning from Environment-Polluting Chemical Exposure Data. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:ijerph19105800. [PMID: 35627338 PMCID: PMC9142138 DOI: 10.3390/ijerph19105800] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/09/2022] [Revised: 05/04/2022] [Accepted: 05/07/2022] [Indexed: 02/04/2023]
Abstract
The early prediction and identification of risk factors for diabetes may prevent or delay diabetes progression. In this study, we developed an interactive online application that provides the predictive probabilities of prediabetes and diabetes in 4 years based on a Bayesian network (BN) classifier, which is an interpretable machine learning technique. The BN was trained using a dataset from the Ansung cohort of the Korean Genome and Epidemiological Study (KoGES) in 2008, with a follow-up in 2012. The dataset contained not only traditional risk factors (current diabetes status, sex, age, etc.) for future diabetes, but it also contained serum biomarkers, which quantified the individual level of exposure to environment-polluting chemicals (EPC). Based on accuracy and the area under the curve (AUC), a tree-augmented BN with 11 variables derived from feature selection was used as our prediction model. The online application that implemented our BN prediction system provided a tool that performs customized diabetes prediction and allows users to simulate the effects of controlling risk factors for the future development of diabetes. The prediction results of our method demonstrated that the EPC biomarkers had interactive effects on diabetes progression and that the use of the EPC biomarkers contributed to a substantial improvement in prediction performance.
Collapse
Affiliation(s)
- Rosy Oh
- Department of Mathematics, Korea Military Academy, Seoul 01805, Korea;
| | - Hong Kyu Lee
- Department of Internal Medicine, College of Medicine, Seoul National University, Seoul 03080, Korea;
| | - Youngmi Kim Pak
- Department of Physiology, College of Medicine, Kyung Hee University, Seoul 02447, Korea
- Correspondence: (Y.K.P.); (M.-S.O.); Tel.: +82-2-961-0908 (Y.K.P.); +82-2-3277-2374 (M.-S.O.)
| | - Man-Suk Oh
- Department of Statistics, Ewha Womans University, Seoul 03760, Korea
- Correspondence: (Y.K.P.); (M.-S.O.); Tel.: +82-2-961-0908 (Y.K.P.); +82-2-3277-2374 (M.-S.O.)
| |
Collapse
|
34
|
Liu X, Zhang W, Zhang Q, Chen L, Zeng T, Zhang J, Min J, Tian S, Zhang H, Huang H, Wang P, Hu X, Chen L. Development and validation of a machine learning-augmented algorithm for diabetes screening in community and primary care settings: A population-based study. Front Endocrinol (Lausanne) 2022; 13:1043919. [PMID: 36518245 PMCID: PMC9742532 DOI: 10.3389/fendo.2022.1043919] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 11/11/2022] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND Opportunely screening for diabetes is crucial to reduce its related morbidity, mortality, and socioeconomic burden. Machine learning (ML) has excellent capability to maximize predictive accuracy. We aim to develop ML-augmented models for diabetes screening in community and primary care settings. METHODS 8425 participants were involved from a population-based study in Hubei, China since 2011. The dataset was split into a development set and a testing set. Seven different ML algorithms were compared to generate predictive models. Non-laboratory features were employed in the ML model for community settings, and laboratory test features were further introduced in the ML+lab models for primary care. The area under the receiver operating characteristic curve (AUC), area under the precision-recall curve (auPR), and the average detection costs per participant of these models were compared with their counterparts based on the New China Diabetes Risk Score (NCDRS) currently recommended for diabetes screening. RESULTS The AUC and auPR of the ML model were 0·697and 0·303 in the testing set, seemingly outperforming those of NCDRS by 10·99% and 64·67%, respectively. The average detection cost of the ML model was 12·81% lower than that of NCDRS with the same sensitivity (0·72). Moreover, the average detection cost of the ML+FPG model is the lowest among the ML+lab models and less than that of the ML model and NCDRS+FPG model. CONCLUSION The ML model and the ML+FPG model achieved higher predictive accuracy and lower detection costs than their counterpart based on NCDRS. Thus, the ML-augmented algorithm is potential to be employed for diabetes screening in community and primary care settings.
Collapse
Affiliation(s)
- XiaoHuan Liu
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | - Weiyue Zhang
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | - Qiao Zhang
- Department of Cardiovascular Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Long Chen
- Department of Computer Science and Technology, Tsinghua University, Beijing, China
| | - TianShu Zeng
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | - JiaoYue Zhang
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | - Jie Min
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | - ShengHua Tian
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | - Hao Zhang
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | | | - Ping Wang
- Precision Health Program, Department of Radiology, College of Human Medicine, Michigan State University, East Lansing, MI, United States
| | - Xiang Hu
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
- *Correspondence: LuLu Chen, ; Xiang Hu,
| | - LuLu Chen
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Hubei provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
- *Correspondence: LuLu Chen, ; Xiang Hu,
| |
Collapse
|
35
|
Fregoso-Aparicio L, Noguez J, Montesinos L, García-García JA. Machine learning and deep learning predictive models for type 2 diabetes: a systematic review. Diabetol Metab Syndr 2021; 13:148. [PMID: 34930452 PMCID: PMC8686642 DOI: 10.1186/s13098-021-00767-9] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Accepted: 12/07/2021] [Indexed: 12/12/2022] Open
Abstract
Diabetes Mellitus is a severe, chronic disease that occurs when blood glucose levels rise above certain limits. Over the last years, machine and deep learning techniques have been used to predict diabetes and its complications. However, researchers and developers still face two main challenges when building type 2 diabetes predictive models. First, there is considerable heterogeneity in previous studies regarding techniques used, making it challenging to identify the optimal one. Second, there is a lack of transparency about the features used in the models, which reduces their interpretability. This systematic review aimed at providing answers to the above challenges. The review followed the PRISMA methodology primarily, enriched with the one proposed by Keele and Durham Universities. Ninety studies were included, and the type of model, complementary techniques, dataset, and performance parameters reported were extracted. Eighteen different types of models were compared, with tree-based algorithms showing top performances. Deep Neural Networks proved suboptimal, despite their ability to deal with big and dirty data. Balancing data and feature selection techniques proved helpful to increase the model's efficiency. Models trained on tidy datasets achieved almost perfect models.
Collapse
Affiliation(s)
- Luis Fregoso-Aparicio
- School of Engineering and Sciences, Tecnologico de Monterrey, Av Lago de Guadalupe KM 3.5, Margarita Maza de Juarez, 52926 Cd Lopez Mateos, Mexico
| | - Julieta Noguez
- School of Engineering and Sciences, Tecnologico de Monterrey, Ave. Eugenio Garza Sada 2501, 64849 Monterrey, Nuevo Leon Mexico
| | - Luis Montesinos
- School of Engineering and Sciences, Tecnologico de Monterrey, Ave. Eugenio Garza Sada 2501, 64849 Monterrey, Nuevo Leon Mexico
| | - José A. García-García
- Hospital General de Mexico Dr. Eduardo Liceaga, Dr. Balmis 148, Doctores, Cuauhtemoc, 06720 Mexico City, Mexico
| |
Collapse
|