1
|
Park JH, Jeong I, Ko GJ, Jeong S, Lee H. Development of a Predictive Model for Metabolic Syndrome Using Noninvasive Data and its Cardiovascular Disease Risk Assessments: Multicohort Validation Study. J Med Internet Res 2025; 27:e67525. [PMID: 40315452 DOI: 10.2196/67525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2024] [Revised: 12/31/2024] [Accepted: 04/08/2025] [Indexed: 05/04/2025] Open
Abstract
BACKGROUND Metabolic syndrome is a cluster of metabolic abnormalities, including obesity, hypertension, dyslipidemia, and insulin resistance, that significantly increase the risk of cardiovascular disease (CVD) and other chronic conditions. Its global prevalence is rising, particularly in aging and urban populations. Traditional screening methods rely on laboratory tests and specialized assessments, which may not be readily accessible in routine primary care and community settings. Limited resources, time constraints, and inconsistent screening practices hinder early identification and intervention. Developing a noninvasive and scalable predictive model could enhance accessibility and improve early detection. OBJECTIVE This study aimed to develop and validate a predictive model for metabolic syndrome using noninvasive body composition data. Additionally, we evaluated the model's ability to predict long-term CVD risk, supporting its application in clinical and public health settings for early intervention and preventive strategies. METHODS We developed a machine learning-based predictive model using noninvasive data from two nationally representative cohorts: the Korea National Health and Nutrition Examination Survey (KNHANES) and the Korean Genome and Epidemiology Study. The model was trained using dual-energy x-ray absorptiometry data from KNHANES (2008-2011) and validated internally with bioelectrical impedance analysis data from KNHANES 2022. External validation was conducted using Korean Genome and Epidemiology Study follow-up datasets. Five machine learning algorithms were compared, and the best-performing model was selected based on the area under the receiver operating characteristic curve. Cox proportional hazards regression was used to assess the model's ability to predict long-term CVD risk. RESULTS The model demonstrated strong predictive performance across validation cohorts. Area under the receiver operating characteristic curve values for metabolic syndrome prediction ranged from 0.8338 to 0.8447 in internal validation, 0.8066 to 0.8138 in external validation 1, and 0.8039 to 0.8123 in external validation 2. The model's predictions were significantly associated with future cardiovascular risk, with Cox regression analysis indicating that individuals classified as having metabolic syndrome had a 1.51-fold higher risk of developing CVD (hazard ratio 1.51, 95% CI 1.32-1.73; P<.001). The ability to predict long-term CVD risk highlights the potential utility of this model for guiding early interventions. CONCLUSIONS This study developed a noninvasive predictive model for metabolic syndrome with strong performance across diverse validation cohorts. By enabling early risk identification without laboratory tests, the model enhances accessibility in primary care and large-scale screenings. Its ability to predict long-term CVD risk supports proactive intervention strategies, potentially reducing the burden of cardiometabolic diseases. Further research should refine the model with additional clinical factors and broader population validation to maximize its clinical impact.
Collapse
Affiliation(s)
- Jin-Hyun Park
- Korea University College of Medicine, Seoul, Republic of Korea
| | - Inyong Jeong
- Korea University College of Medicine, Seoul, Republic of Korea
| | - Gang-Jee Ko
- Korea University Guro Hospital, Seoul, Republic of Korea
| | - Seogsong Jeong
- Korea University College of Medicine, Seoul, Republic of Korea
| | - Hwamin Lee
- Korea University College of Medicine, Seoul, Republic of Korea
| |
Collapse
|
2
|
Lee TK, Kim SY, Choi HJ, Choe EK, Sohn KA. Vision transformer based interpretable metabolic syndrome classification using retinal Images. NPJ Digit Med 2025; 8:205. [PMID: 40216912 PMCID: PMC11992118 DOI: 10.1038/s41746-025-01588-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Accepted: 03/25/2025] [Indexed: 04/14/2025] Open
Abstract
Metabolic syndrome is leading to an increased risk of diabetes and cardiovascular disease. Our study developed a model using retinal image data from fundus photographs taken during comprehensive health check-ups to classify metabolic syndrome. The model achieved an AUC of 0.7752 (95% CI: 0.7719-0.7786) using retinal images, and an AUC of 0.8725 (95% CI: 0.8669-0.8781) when combining retinal images with basic clinical features. Furthermore, we propose a method to improve the interpretability of the relationship between retinal image features and metabolic syndrome by visualizing metabolic syndrome-related areas in retinal images. The results highlight the potential of retinal images in classifying metabolic syndrome.
Collapse
Affiliation(s)
- Tae Kwan Lee
- Department of Artificial Intelligence, Ajou University, Suwon, South Korea
| | - So Yeon Kim
- Department of Artificial Intelligence, Ajou University, Suwon, South Korea
- Department of Software and Computer Engineering, Ajou University, Suwon, South Korea
| | - Hyuk Jin Choi
- Department of Ophthalmology, Seoul National University College of Medicine, Seoul, South Korea
- Department of Ophthalmology, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, South Korea
| | - Eun Kyung Choe
- Department of Surgery, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, South Korea.
- Department of Surgery, Seoul National University College of Medicine, Seoul, South Korea.
| | - Kyung-Ah Sohn
- Department of Artificial Intelligence, Ajou University, Suwon, South Korea.
- Department of Software and Computer Engineering, Ajou University, Suwon, South Korea.
| |
Collapse
|
3
|
Wu YS, Tzeng WC, Wu CW, Wu HY, Kang CY, Wang WY. Gender Differences in Predicting Metabolic Syndrome Among Hospital Employees Using Machine Learning Models: A Population-Based Study. J Nurs Res 2025; 33:e381. [PMID: 40162697 DOI: 10.1097/jnr.0000000000000668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/02/2025] Open
Abstract
BACKGROUND Metabolic syndrome (MetS) is a complex condition that captures several markers of dysregulation, including obesity, elevated blood glucose levels, dyslipidemia and hypertension. Using an approach to early prediction of MetS risk in hospital employees that takes into account the differing effects of gender may be expected to improve cardiovascular disease-related health outcomes. PURPOSE In this study, machine learning techniques were applied to construct an optimized MetS prediction model for use on hospital employees. METHODS This population-based study survey included 3,537 participants aged 20 to 65 years old. Participant demographic, anthropometric data, medical history, lifestyle-related factor, and biochemical data were collected from the hospital's Health Management Information System from 2018 to 2020. MetS prediction and the investigation of gender differences were performed using six machine learning models based on the following algorithms: K-nearest neighbor, random forest, logistic regression, support vector machine, neural network, and Naïve Bayes. All analyses were performed by sequentially inputting the features in three steps according to their characteristics. RESULTS MetS was detected in 8.91% of the participants. Among the MetS prediction models, Naïve Bayes showed the best performance, with a sensitivity of 0.825, an accuracy of 0.859 and an area under the receiver operating characteristic curve of 0.936. Body mass index and alanine transaminase were identified as important predictive factors for MetS in participants of both genders. Age, uric acid, and aspartate transaminase were identified as important predictive factors in men, while chronic disease and phosphorous were identified as important predictive factors in women. CONCLUSIONS The results indicate Naïve Bayes model to be useful and accurate in identifying MetS in hospital employees independent of gender. The early prediction of MetS using a model that accounts for gender differences is an important part of routine health screening and requires a multidimensional approach, including self-administered questionnaires and anthropometric and biochemical measurements.
Collapse
Affiliation(s)
- Yi-Syuan Wu
- Department of Computer Science and Information Engineering, National Taitung University, Taitung, Taiwan
| | - Wen-Chii Tzeng
- School of Nursing, National Defense Medical Center, Taipei, Taiwan
| | - Cheng-Wei Wu
- Department of Computer Science and Information Engineering, National Ilan University, Yilan, Taiwan
| | - Hao-Yi Wu
- Department of Nursing, Tri-Service General Hospital, Taipei, Taiwan
| | - Chih-Yun Kang
- Department of Nursing, Tri-Service General Hospital, Taipei, Taiwan
| | | |
Collapse
|
4
|
Kawakita T, Greenland P, Pemberton VL, Grobman WA, Silver RM, Bairey Merz CN, McNeil RB, Haas DM, Reddy UM, Simhan H, Saade GR. Prediction of metabolic syndrome following a first pregnancy. Am J Obstet Gynecol 2024; 231:649.e1-649.e19. [PMID: 38527600 PMCID: PMC11424779 DOI: 10.1016/j.ajog.2024.03.031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Revised: 03/09/2024] [Accepted: 03/21/2024] [Indexed: 03/27/2024]
Abstract
BACKGROUND The prevalence of metabolic syndrome is rapidly increasing in the United States. We hypothesized that prediction models using data obtained during pregnancy can accurately predict the future development of metabolic syndrome. OBJECTIVE This study aimed to develop machine learning models to predict the development of metabolic syndrome using factors ascertained in nulliparous pregnant individuals. STUDY DESIGN This was a secondary analysis of a prospective cohort study (Nulliparous Pregnancy Outcomes Study: Monitoring Mothers-to-Be Heart Health Study [nuMoM2b-HHS]). Data were collected from October 2010 to October 2020, and analyzed from July 2023 to October 2023. Participants had in-person visits 2 to 7 years after their first delivery. The primary outcome was metabolic syndrome, defined by the National Cholesterol Education Program Adult Treatment Panel III criteria, which was measured within 2 to 7 years after delivery. A total of 127 variables that were obtained during pregnancy were evaluated. The data set was randomly split into a training set (70%) and a test set (30%). We developed a random forest model and a lasso regression model using variables obtained during pregnancy. We compared the area under the receiver operating characteristic curve for both models. Using the model with the better area under the receiver operating characteristic curve, we developed models that included fewer variables based on SHAP (SHapley Additive exPlanations) values and compared them with the original model. The final model chosen would have fewer variables and noninferior areas under the receiver operating characteristic curve. RESULTS A total of 4225 individuals met the inclusion criteria; the mean (standard deviation) age was 27.0 (5.6) years. Of these, 754 (17.8%) developed metabolic syndrome. The area under the receiver operating characteristic curve of the random forest model was 0.878 (95% confidence interval, 0.846-0.909), which was higher than the 0.850 of the lasso model (95% confidence interval, 0.811-0.888; P<.001). Therefore, random forest models using fewer variables were developed. The random forest model with the top 3 variables (high-density lipoprotein, insulin, and high-sensitivity C-reactive protein) was chosen as the final model because it had the area under the receiver operating characteristic curve of 0.867 (95% confidence interval, 0.839-0.895), which was not inferior to the original model (P=.08). The area under the receiver operating characteristic curve of the final model in the test set was 0.847 (95% confidence interval, 0.821-0.873). An online application of the final model was developed (https://kawakita.shinyapps.io/metabolic/). CONCLUSION We developed a model that can accurately predict the development of metabolic syndrome in 2 to 7 years after delivery.
Collapse
Affiliation(s)
- Tetsuya Kawakita
- Department of Obstetrics and Gynecology, Eastern Virginia Medical School, Norfolk, VA.
| | - Philip Greenland
- Departments of Preventive Medicine and Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL
| | - Victoria L Pemberton
- Division of Cardiovascular Sciences, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD
| | - William A Grobman
- Department of Obstetrics and Gynecology, The Ohio State University, Columbus, OH
| | - Robert M Silver
- Department of Obstetrics and Gynecology, University of Utah, Salt Lake City, UT
| | - C Noel Bairey Merz
- Barbra Streisand Women's Heart Center, Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, CA
| | | | - David M Haas
- Department of Obstetrics and Gynecology, Indiana University School of Medicine, Indianapolis, IN
| | - Uma M Reddy
- Department of Obstetrics and Gynecology, Columbia University, New York, NY
| | - Hyagriv Simhan
- Department of Obstetrics, Gynecology, and Reproductive Sciences, University of Pittsburgh, Pittsburgh, PA
| | - George R Saade
- Department of Obstetrics and Gynecology, Eastern Virginia Medical School, Norfolk, VA
| |
Collapse
|
5
|
Mariam A, Javidi H, Zabor EC, Zhao R, Radivoyevitch T, Rotroff DM. Unsupervised clustering of longitudinal clinical measurements in electronic health records. PLOS DIGITAL HEALTH 2024; 3:e0000628. [PMID: 39405315 PMCID: PMC11478862 DOI: 10.1371/journal.pdig.0000628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Accepted: 08/30/2024] [Indexed: 10/19/2024]
Abstract
Longitudinal electronic health records (EHR) can be utilized to identify patterns of disease development and progression in real-world settings. Unsupervised temporal matching algorithms are being repurposed to EHR from signal processing- and protein-sequence alignment tasks where they have shown immense promise for gaining insight into disease. The robustness of these algorithms for classifying EHR clinical data remains to be determined. Timeseries compiled from clinical measurements, such as blood pressure, have far more irregularity in sampling and missingness than the data for which these algorithms were developed, necessitating a systematic evaluation of these methods. We applied 30 state-of-the-art unsupervised machine learning algorithms to 6,912 systematically generated simulated clinical datasets across five parameters. These algorithms included eight temporal matching algorithms with fourteen partitional and eight fuzzy clustering methods. Nemenyi tests were used to determine differences in accuracy using the Adjusted Rand Index (ARI). Dynamic time warping and its lower-bound variants had the highest accuracies across all cohorts (median ARI>0.70). All 30 methods were better at discriminating classes with differences in magnitude compared to differences in trajectory shapes. Missingness impacted accuracies only when classes were different by trajectory shape. The method with the highest ARI was then used to cluster a large pediatric metabolic syndrome (MetS) cohort (N = 43,426). We identified three unique childhood BMI patterns with high average cluster consensus (>70%). The algorithm identified a cluster with consistently high BMI which had the greatest risk of MetS, consistent with prior literature (OR = 4.87, 95% CI: 3.93-6.12). While these algorithms have been shown to have similar accuracies for regular timeseries, their accuracies in clinical applications vary substantially in discriminating differences in shape and especially with moderate to high missingness (>10%). This systematic assessment also shows that the most robust algorithms tested here can derive meaningful insights from longitudinal clinical data.
Collapse
Affiliation(s)
- Arshiya Mariam
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
- Center for Quantitative Metabolic Research, Cleveland Clinic, Cleveland, Ohio, United States of America
| | - Hamed Javidi
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
- Center for Quantitative Metabolic Research, Cleveland Clinic, Cleveland, Ohio, United States of America
- Department of Electrical Engineering and Computer Science, Cleveland State University, Cleveland, Ohio, United States of America
| | - Emily C. Zabor
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
- Taussig Cancer Institute, Cleveland Clinic, Cleveland, Ohio United States of America
| | - Ran Zhao
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
| | - Tomas Radivoyevitch
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
| | - Daniel M. Rotroff
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
- Center for Quantitative Metabolic Research, Cleveland Clinic, Cleveland, Ohio, United States of America
- Department of Electrical Engineering and Computer Science, Cleveland State University, Cleveland, Ohio, United States of America
- Endocrinology and Metabolism Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
| |
Collapse
|
6
|
Shin D. Prediction of metabolic syndrome using machine learning approaches based on genetic and nutritional factors: a 14-year prospective-based cohort study. BMC Med Genomics 2024; 17:224. [PMID: 39232768 PMCID: PMC11373243 DOI: 10.1186/s12920-024-01998-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2024] [Accepted: 08/28/2024] [Indexed: 09/06/2024] Open
Abstract
INTRODUCTION Metabolic syndrome is a chronic disease associated with multiple comorbidities. Over the last few years, machine learning techniques have been used to predict metabolic syndrome. However, studies incorporating demographic, clinical, laboratory, dietary, and genetic factors to predict the incidence of metabolic syndrome in Koreans are limited. In the present study, we propose a genome-wide polygenic risk score for the prediction of metabolic syndrome, along with other factors, to improve the prediction accuracy of metabolic syndrome. METHODS We developed 7 machine learning-based models and used Cox multivariable regression, deep neural network (DNN), support vector machine (SVM), stochastic gradient descent (SGD), random forest (RAF), Naïve Bayes (NBA) classifier, and AdaBoost (ADB) to predict the incidence of metabolic syndrome at year 14 using the dataset from the Korean Genome and Epidemiology Study (KoGES) Ansan and Ansung. RESULTS Of the 5440 patients, 2,120 were considered to have new-onset metabolic syndrome. The AUC values of model, which included sex, age, alcohol intake, energy intake, marital status, education status, income status, smoking status, dried laver intake, and genome-wide polygenic risk score (gPRS) Z-score based on 344,447 SNPs (p-value < 1.0), were the highest for RAF (0.994 [95% CI 0.985, 1.000]) and ADB (0.994 [95% CI 0.986, 1.000]). CONCLUSIONS Incorporating both gPRS and demographic, clinical, laboratory, and seaweed data led to enhanced metabolic syndrome risk prediction by capturing the distinct etiologies of metabolic syndrome development. The RAF- and ADB-based models predicted metabolic syndrome more accurately than the NBA-based model for the Korean population.
Collapse
Affiliation(s)
- Dayeon Shin
- Department of Food and Nutrition, Inha University, Incheon, 22212, Republic of Korea.
| |
Collapse
|
7
|
Lee M, Park T, Shin JY, Park M. A comprehensive multi-task deep learning approach for predicting metabolic syndrome with genetic, nutritional, and clinical data. Sci Rep 2024; 14:17851. [PMID: 39090161 PMCID: PMC11294629 DOI: 10.1038/s41598-024-68541-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Accepted: 07/24/2024] [Indexed: 08/04/2024] Open
Abstract
Metabolic syndrome (MetS) is a complex disorder characterized by a cluster of metabolic abnormalities, including abdominal obesity, hypertension, elevated triglycerides, reduced high-density lipoprotein cholesterol, and impaired glucose tolerance. It poses a significant public health concern, as individuals with MetS are at an increased risk of developing cardiovascular diseases and type 2 diabetes. Early and accurate identification of individuals at risk for MetS is essential. Various machine learning approaches have been employed to predict MetS, such as logistic regression, support vector machines, and several boosting techniques. However, these methods use MetS as a binary status and do not consider that MetS comprises five components. Therefore, a method that focuses on these characteristics of MetS is needed. In this study, we propose a multi-task deep learning model designed to predict MetS and its five components simultaneously. The benefit of multi-task learning is that it can manage multiple tasks with a single model, and learning related tasks may enhance the model's predictive performance. To assess the efficacy of our proposed method, we compared its performance with that of several single-task approaches, including logistic regression, support vector machine, CatBoost, LightGBM, XGBoost and one-dimensional convolutional neural network. For the construction of our multi-task deep learning model, we utilized data from the Korean Association Resource (KARE) project, which includes 352,228 single nucleotide polymorphisms (SNPs) from 7729 individuals. We also considered lifestyle, dietary, and socio-economic factors that affect chronic diseases, in addition to genomic data. By evaluating metrics such as accuracy, precision, F1-score, and the area under the receiver operating characteristic curve, we demonstrate that our multi-task learning model surpasses traditional single-task machine learning models in predicting MetS.
Collapse
Affiliation(s)
- Minhyuk Lee
- Department of Statistics, Korea University, Seoul, Republic of Korea
| | - Taesung Park
- Department of Statistics, Seoul National University, Seoul, Republic of Korea
| | - Ji-Yeon Shin
- Department of Preventive Medicine, School of Medicine, Kyungpook National University, Daegu, Republic of Korea.
| | - Mira Park
- Department of Preventive Medicine, School of Medicine, Eulji University, Daejeon, Republic of Korea.
| |
Collapse
|
8
|
Mohseni-Takalloo S, Mohseni H, Mozaffari-Khosravi H, Mirzaei M, Hosseinzadeh M. The effect of data balancing approaches on the prediction of metabolic syndrome using non-invasive parameters based on random forest. BMC Bioinformatics 2024; 25:18. [PMID: 38212697 PMCID: PMC10782700 DOI: 10.1186/s12859-024-05633-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Accepted: 01/02/2024] [Indexed: 01/13/2024] Open
Abstract
BACKGROUND Metabolic syndrome (MetS) is a cluster of metabolic abnormalities (including obesity, insulin resistance, hypertension, and dyslipidemia), which can be used to identify at-risk populations for diabetes and cardiovascular diseases, the main causes of morbidity and mortality worldwide. The achievement of a simple approach for diagnosing MetS without needing biochemical tests is so valuable. The present study aimed to predict MetS using non-invasive features based on a successful random forest learning algorithm. Also, to deal with the problem of data imbalance that naturally exists in this type of data, the effect of two different data balancing approaches, including the Synthetic Minority Over-sampling Technique (SMOTE) and Random Splitting data balancing (SplitBal), on model performance is investigated. RESULTS The most important determinant for MetS prediction was waist circumference. Applying a random forest learning algorithm to imbalanced data, the trained models reach 86.9% and 79.4% accuracies and 37.1% and 38.2% sensitivities in men and women, respectively. However, by applying the SplitBal data balancing technique, the best results were obtained, and despite that the accuracy of the trained models decreased by 7.8% and 11.3%, but their sensitivity improved significantly to 82.3% and 73.7% in men and women, respectively. CONCLUSIONS The random forest learning method, along with data balancing techniques, especially SplitBal, could create MetS prediction models with promising results that can be applied as a useful prognostic tool in health screening programs.
Collapse
Affiliation(s)
- Sahar Mohseni-Takalloo
- School of Public Health, Bam University of Medical Sciences, Bam, Iran
- Research Center for Food Hygiene and Safety, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd, Iran
- Department of Nutrition, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd, Iran
| | - Hadis Mohseni
- Computer Engineering Department, Shahid Bahonar University of Kerman, Kerman, Iran
| | - Hassan Mozaffari-Khosravi
- Research Center for Food Hygiene and Safety, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd, Iran
- Department of Nutrition, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd, Iran
| | - Masoud Mirzaei
- Yazd Cardiovascular Research Centre, Non-Communicable Diseases Research Institute, Shahid Sadoughi University of Medical Sciences, Yazd, Iran
| | - Mahdieh Hosseinzadeh
- Research Center for Food Hygiene and Safety, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd, Iran.
- Department of Nutrition, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd, Iran.
| |
Collapse
|
9
|
Mohseni-Takalloo S, Mozaffari-Khosravi H, Mohseni H, Mirzaei M, Hosseinzadeh M. Metabolic syndrome prediction using non-invasive and dietary parameters based on a support vector machine. Nutr Metab Cardiovasc Dis 2024; 34:126-135. [PMID: 37949713 DOI: 10.1016/j.numecd.2023.08.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/18/2023] [Revised: 07/14/2023] [Accepted: 08/21/2023] [Indexed: 11/12/2023]
Abstract
BACKGROUND AND AIMS Metabolic syndrome (MetS) is a widely used index for finding people at risk for chronic diseases, including cardiovascular disease and diabetes. Early detection of MetS is especially important in prevention programs. Relying on previous studies that suggest machine learning methods as a valuable approach for diagnosing MetS, this study aimed to develop MetS prediction models based on support vector machine (SVM) algorithms, applying non-invasive and low-cost (NI&LC), and also dietary parameters. METHODS AND RESULTS This population-based research was conducted on a large dataset of 4596 participants within the framework of the Shahedieh cohort study. An Extremely Randomized Trees Classifier was used to select the most effective features among NI&LC and dietary data. The prediction models were developed based on SVM algorithms, and their performance was assessed by accuracy, sensitivity, specificity, positive prediction value, negative prediction value, f1-score, and receiver operating characteristic curve. MetS was diagnosed in 14% of men and 22% of women. Among NI&LC features, waist circumference, body mass index, waist-to-height ratio, waist-to-hip ratio, systolic blood pressure, and diastolic blood pressure were the most predictive variables. By using NI&LC features, models with 78.4% and 63.5% accuracy and 81.2% and 75.3% sensitivity were yielded for men and women, respectively. By incorporating NI&LC and dietary features, the accuracy of the model in women improved by 3.7%. CONCLUSIONS SVM algorithms had promising potential for early detection of MetS relying on NI&LC parameters. These models can be used in prevention programs, clinical practice, and personal applications.
Collapse
Affiliation(s)
- Sahar Mohseni-Takalloo
- Research Center for Food Hygiene and Safety, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd, Iran; Department of Nutrition, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd, Iran; School of Public Health, Bam University of Medical Sciences, Bam, Iran
| | - Hassan Mozaffari-Khosravi
- Research Center for Food Hygiene and Safety, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd, Iran; Department of Nutrition, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd, Iran
| | - Hadis Mohseni
- Computer Engineering Department, Shahid Bahonar University of Kerman, Kerman, Iran
| | - Masoud Mirzaei
- Yazd Cardiovascular Research Centre, Non-communicable Disease Institute, Shahid Sadoughi University of Medical Sciences, Yazd, Iran
| | - Mahdieh Hosseinzadeh
- Research Center for Food Hygiene and Safety, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd, Iran; Department of Nutrition, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd, Iran.
| |
Collapse
|
10
|
Zheng J, Zhang Z, Wang J, Zhao R, Liu S, Yang G, Liu Z, Deng Z. Metabolic syndrome prediction model using Bayesian optimization and XGBoost based on traditional Chinese medicine features. Heliyon 2023; 9:e22727. [PMID: 38125549 PMCID: PMC10730568 DOI: 10.1016/j.heliyon.2023.e22727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 11/16/2023] [Accepted: 11/17/2023] [Indexed: 12/23/2023] Open
Abstract
Metabolic syndrome (MetS) has a high prevalence and is prone to many complications. However, current MetS diagnostic methods require blood tests that are not conducive to self-testing, so a user-friendly and accurate method for predicting MetS is needed to facilitate early detection and treatment. In this study, a MetS prediction model based on a simple, small number of Traditional Chinese Medicine (TCM) clinical indicators and biological indicators combined with machine learning algorithms is investigated. Electronic medical record data from 2040 patients who visited outpatient clinics at Guangdong Chinese medicine hospitals from 2020 to 2021 were used to investigate the fusion of Bayesian optimization (BO) and eXtreme gradient boosting (XGBoost) in order to create a BO-XGBoost model for screening nineteen key features in three categories: individual bio-information, TCM indicators, and TCM habits that influence MetS prediction. Subsequently, the predictive diagnostic model for MetS was developed. The experimental results revealed that the model proposed in this paper achieved values of 93.35 %, 90.67 %, 80.40 %, and 0.920 for the F1, sensitivity, FRS, and AUC metrics, respectively. These values outperformed those of the seven other tested machine learning models. Finally, this study developed an intelligent prediction application for MetS based on the proposed model, which can be utilized by ordinary users to perform self-diagnosis through a web-based questionnaire, thereby accomplishing the objective of early detection and intervention for MetS.
Collapse
Affiliation(s)
- Jianhua Zheng
- College of Information Science and Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, China
- Guangdong Provincial Key Laboratory of Traditional Chinese Medicine Informatization, Guangzhou, 510630, China
| | - Zihao Zhang
- College of Information Science and Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, China
| | - Jinhe Wang
- Xiyuan Hospital of China Academy of Chinese Medical Sciences, Beijing, 100091, China
| | - Ruolin Zhao
- College of Information Science and Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, China
| | - Shuangyin Liu
- College of Information Science and Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, China
- Guangdong Provincial Key Laboratory of Traditional Chinese Medicine Informatization, Guangzhou, 510630, China
| | - Gaolin Yang
- College of Information Science and Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, China
| | - Zhengjie Liu
- Guangdong Provincial Hospital of Chinese Medicine, Guangzhou, 510120, China
- The Second Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangzhou, 510120, China
| | - Zhengyuan Deng
- College of Information Science and Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, China
- Network and Educational Technology Center, Jinan University, Guangzhou, 510630, China
| |
Collapse
|
11
|
Zou G, Zhong Q, OUYang P, Li X, Lai X, Zhang H. Predictive analysis of metabolic syndrome based on 5-years continuous physical examination data. Sci Rep 2023; 13:9132. [PMID: 37277414 DOI: 10.1038/s41598-023-35604-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2022] [Accepted: 05/20/2023] [Indexed: 06/07/2023] Open
Abstract
Metabolic syndrome (MetS) represents a complex group of metabolic disorders. As MetS poses a significant challenge to global public health, predicting the occurrence of MetS and the development of related risk factors is important. In this study, we conducted a predictive analysis of MetS based on machine learning algorithms using datasets of 15,661 individuals. Five consecutive years of medical examination records were provided by Nanfang Hospital, Southern Medical University, China. The specific risk factors used included WC, WHR, TG, HDL-C, BMI, FGLU, etc. We proposed a feature construction method using the examination records over the past four consecutive years, combining the differences between the annual value and the normal limits of each risk factor and the year-to-year variation. The results showed that the feature set, which contained the original features of the inspection record and new features proposed in this study yielded the highest AUC of 0.944, implying that the new features could help identify risk factors for MetS and provide more targeted diagnostic advice for physicians.
Collapse
Affiliation(s)
- Guohan Zou
- School of Physics and Telecommunication Engineering, South China Normal University (SCNU), Guangzhou, 510000, China
- Guangdong Provincial Engineering Technology Research Center of Cardiovascular Individual Medicine & Big Data, SCNU, Guangzhou, 510006, China
| | - Qinghua Zhong
- School of Electronics and Information Engineering, SCNU, Foshan, 528225, China.
- Guangdong Provincial Engineering Technology Research Center of Cardiovascular Individual Medicine & Big Data, SCNU, Guangzhou, 510006, China.
| | - Ping OUYang
- Department of Health Management, Nanfang Hospital, Southern Medical University, Guangzhou, 510515, China
| | - Xiaoxi Li
- Department of Health Management, Nanfang Hospital, Southern Medical University, Guangzhou, 510515, China
| | - Xiaoying Lai
- Department of Health Management, Nanfang Hospital, Southern Medical University, Guangzhou, 510515, China
| | - Han Zhang
- School of Electronics and Information Engineering, SCNU, Foshan, 528225, China.
- Guangdong Provincial Engineering Technology Research Center of Cardiovascular Individual Medicine & Big Data, SCNU, Guangzhou, 510006, China.
| |
Collapse
|
12
|
Kim H, Heo JH, Lim DH, Kim Y. Development of a Metabolic Syndrome Classification and Prediction Model for Koreans Using Deep Learning Technology: The Korea National Health and Nutrition Examination Survey (KNHANES) (2013-2018). Clin Nutr Res 2023; 12:138-153. [PMID: 37214780 PMCID: PMC10193438 DOI: 10.7762/cnr.2023.12.2.138] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 03/21/2023] [Accepted: 03/27/2023] [Indexed: 05/24/2023] Open
Abstract
The prevalence of metabolic syndrome (MetS) and its cost are increasing due to lifestyle changes and aging. This study aimed to develop a deep neural network model for prediction and classification of MetS according to nutrient intake and other MetS-related factors. This study included 17,848 individuals aged 40-69 years from the Korea National Health and Nutrition Examination Survey (2013-2018). We set MetS (3-5 risk factors present) as the dependent variable and 52 MetS-related factors and nutrient intake variables as independent variables in a regression analysis. The analysis compared and analyzed model accuracy, precision and recall by conventional logistic regression, machine learning-based logistic regression and deep learning. The accuracy of train data was 81.2089, and the accuracy of test data was 81.1485 in a MetS classification and prediction model developed in this study. These accuracies were higher than those obtained by conventional logistic regression or machine learning-based logistic regression. Precision, recall, and F1-score also showed the high accuracy in the deep learning model. Blood alanine aminotransferase (β = 12.2035) level showed the highest regression coefficient followed by blood aspartate aminotransferase (β = 11.771) level, waist circumference (β = 10.8555), body mass index (β = 10.3842), and blood glycated hemoglobin (β = 10.1802) level. Fats (cholesterol [β = -2.0545] and saturated fatty acid [β = -2.0483]) showed high regression coefficients among nutrient intakes. The deep learning model for classification and prediction on MetS showed a higher accuracy than conventional logistic regression or machine learning-based logistic regression.
Collapse
Affiliation(s)
- Hyerim Kim
- Department of Food and Nutrition, Gyeongsang National University, Jinju 52828, Korea
| | - Ji Hye Heo
- Department of Information & Statistics, Gyeongsang National University, Jinju 52828, Korea
| | - Dong Hoon Lim
- Department of Information & Statistics, Research Institute of Natural Science (RINS), Gyeongsang National University, Jinju 52828, Korea
| | - Yoona Kim
- Department of Food and Nutrition, Institute of Agriculture and Life Science, Gyeongsang National University, Jinju 52828, Korea
| |
Collapse
|
13
|
Hu X, Li XK, Wen S, Li X, Zeng TS, Zhang JY, Wang W, Bi Y, Zhang Q, Tian SH, Min J, Wang Y, Liu G, Huang H, Peng M, Zhang J, Wu C, Li YM, Sun H, Ning G, Chen LL. Predictive modeling the probability of suffering from metabolic syndrome using machine learning: A population-based study. Heliyon 2022; 8:e12343. [PMID: 36643319 PMCID: PMC9834713 DOI: 10.1016/j.heliyon.2022.e12343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2022] [Revised: 06/16/2022] [Accepted: 12/06/2022] [Indexed: 12/14/2022] Open
Abstract
Background There is an increasing trend of Metabolic syndrome (MetS) prevalence, which has been considered as an important contributor for cardiovascular disease (CVD), cancers and diabetes. However, there is often a long asymptomatic phase of MetS, resulting in not diagnosed and intervened so timely as needed. It would be very helpful to explore tools to predict the probability of suffering from MetS in daily life or routinely clinical practice. Objective To develop models that predict individuals' probability of suffering from MetS timely with high efficacy in general population. Methods The present study enrolled 8964 individuals aged 40-75 years without severe diseases, which was a part of the REACTION study from October 2011 to February 2012. We developed three prediction models for different scenarios in hospital (Model 1, 2) or at home (Model 3) based on LightGBM (LGBM) technique and corresponding logistic regression (LR) models were also constructed for comparison. Model 1 included variables of laboratory tests, lifestyles and anthropometric measurements while model 2 was built with components of MetS excluded based on model 1, and model 3 was constructed with blood biochemical indexes removed based on model 2. Additionally, we also investigated the strength of association between the predictive factors and MetS, as well as that between the predictors and each component of MetS. Results In this study, 2714 (30.3%) participants suffer from MetS accordingly. The performances of the LGBM models in predicting the probability of suffering from MetS produced good results and were presented as follows: model 1 had an area under the curve (AUC) value of 0.993 while model 2 indicated an AUC value of 0.885. Model 3 had an AUC value of 0.859, which is close to that of model 2. The AUC values of LR model 1 and 2 for the scenario in hospital and model 3 at home were 0.938, 0.839 and 0.820 respectively, which seemed lower than that of their corresponding machine learning models, respectively. In both LGBM and logistic models, gender, height and resting pulse rate (RPR) were predictors for MetS. Women had higher risk of MetS than men (OR 8.84, CI: 6.70-11.66), and each 1-cm increase in height indicated 3.8% higher risk of suffering from MetS in people over 58 years, whereas each 1- Beat Per Minute (bpm) increase in RPR showed 1.0% higher risk in individuals younger than 62 years. Conclusion The present study showed that the prediction models developed by machine learning demonstrated effective in evaluating the probability of suffering from MetS, and presented prominent predicting efficacies and accuracies. Additionally, we found that women showed a higher risk of MetS than men, and height in individuals over 58 years was important factor in predicting the probability of suffering from MetS while RPR was of vital importance in people aged 40-62 years.
Collapse
Affiliation(s)
- Xiang Hu
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China,Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | - Xue-Ke Li
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China,Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | - Shiping Wen
- Centre for Artificial Intelligence, Faculty of Engineering Information Technology, University of Technology Sydney, Ultimo, NSW, 2007, Australia
| | - Xingyu Li
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China
| | - Tian-Shu Zeng
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China,Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | - Jiao-Yue Zhang
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China,Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | - Weiqing Wang
- Department of Endocrinology and Metabolism, State Key Laboratory of Medical Genomes, National Clinical Research Center for Metabolic Diseases, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao-Tong University School of Medicine, Shanghai, China
| | - Yufang Bi
- Department of Endocrinology and Metabolism, State Key Laboratory of Medical Genomes, National Clinical Research Center for Metabolic Diseases, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao-Tong University School of Medicine, Shanghai, China
| | - Qiao Zhang
- Department of Cardiovascular Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Sheng-Hua Tian
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China,Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | - Jie Min
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China,Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | - Ying Wang
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China,Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | - Geng Liu
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China,Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | | | - Miaomiao Peng
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China,Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | | | - Chaodong Wu
- Department of Nutrition and Food Science, Texas A&M University, College Station, TX, USA
| | - Yu-Ming Li
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China,Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | - Hui Sun
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China,Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
| | - Guang Ning
- Department of Endocrinology and Metabolism, State Key Laboratory of Medical Genomes, National Clinical Research Center for Metabolic Diseases, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao-Tong University School of Medicine, Shanghai, China
| | - Lu-Lu Chen
- Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China,Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China,Corresponding author.
| |
Collapse
|
14
|
Machine Learning Approach for Metabolic Syndrome Diagnosis Using Explainable Data-Augmentation-Based Classification. Diagnostics (Basel) 2022; 12:diagnostics12123117. [PMID: 36553124 PMCID: PMC9777696 DOI: 10.3390/diagnostics12123117] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 12/02/2022] [Accepted: 12/03/2022] [Indexed: 12/14/2022] Open
Abstract
Metabolic syndrome (MetS) is a cluster of risk factors including hypertension, hyperglycemia, dyslipidemia, and abdominal obesity. Metabolism-related risk factors include diabetes and heart disease. MetS is also linked to numerous cancers and chronic kidney disease. All of these variables raise medical costs. Developing a prediction model that can quickly identify persons at high risk of MetS and offer them a treatment plan is crucial. Early prediction of metabolic syndrome will highly impact the quality of life of patients as it gives them a chance for making a change to the bad habit and preventing a serious illness in the future. In this paper, we aimed to assess the performance of various algorithms of machine learning in order to decrease the cost of predictive diagnoses of metabolic syndrome. We employed ten machine learning algorithms along with different metaheuristics for feature selection. Moreover, we examined the effects of data augmentation in the prediction accuracy. The statistics show that the augmentation of data after applying feature selection on the data highly improves the performance of the classifiers.
Collapse
|
15
|
Kim H, Hwang S, Lee S, Kim Y. Classification and Prediction on Hypertension with Blood Pressure Determinants in a Deep Learning Algorithm. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:15301. [PMID: 36430024 PMCID: PMC9690260 DOI: 10.3390/ijerph192215301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 11/16/2022] [Accepted: 11/17/2022] [Indexed: 06/16/2023]
Abstract
Few studies classified and predicted hypertension using blood pressure (BP)-related determinants in a deep learning algorithm. The objective of this study is to develop a deep learning algorithm for the classification and prediction of hypertension with BP-related factors based on the Korean Genome and Epidemiology Study-Ansan and Ansung baseline survey. We also investigated whether energy intake adjustment is adequate for deep learning algorithms. We constructed a deep neural network (DNN) in which the number of hidden layers and the number of nodes in each hidden layer are experimentally selected, and we trained the DNN to diagnose hypertension using the dataset while varying the energy intake adjustment method in four ways. For comparison, we trained a decision tree in the same way. Experimental results showed that the DNN performs better than the decision tree in all aspects, such as having higher sensitivity, specificity, F1-score, and accuracy. In addition, we found that unlike general machine learning algorithms, including the decision tree, the DNNs perform best when energy intake is not adjusted. The result indicates that energy intake adjustment is not required when using a deep learning algorithm to classify and predict hypertension with BP-related factors.
Collapse
Affiliation(s)
- Hyerim Kim
- Department of Food and Nutrition, Gyeongsang National University, Jinju 52828, Republic of Korea
| | - Seunghyeon Hwang
- Department of Computer Science, Gyeongsang National University, Jinju 52828, Republic of Korea
| | - Suwon Lee
- Department of Computer Science, The Research Institute of Natural Science, Gyeongsang National University, Jinju 52828, Republic of Korea
| | - Yoona Kim
- Department of Food and Nutrition, Institute of Agriculture and Life Science, Gyeongsang National University, Jinju 52828, Republic of Korea
| |
Collapse
|
16
|
Daniel Tavares L, Manoel A, Henrique Rizzi Donato T, Cesena F, André Minanni C, Miwa Kashiwagi N, Paiva da Silva L, Amaro E, Szlejf C. Prediction of metabolic syndrome: A machine learning approach to help primary prevention. Diabetes Res Clin Pract 2022; 191:110047. [PMID: 36029889 DOI: 10.1016/j.diabres.2022.110047] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Revised: 08/17/2022] [Accepted: 08/19/2022] [Indexed: 11/29/2022]
Abstract
AIMS To describe the performance of machine learning (ML) applied to predict future metabolic syndrome (MS), and to estimate lifestyle changes effects in MS predictions. METHODS We analyzed data from 17,182 adults attending a checkup program sequentially (37,999 visit pairs) over 17 years. Variables on sociodemographic attributes, clinical, laboratory, and lifestyle characteristics were used to develop ML models to predict MS [logistic regression, linear discriminant analysis, k-nearest neighbors, decision trees, Light Gradient Boosting Machine (LGBM), Extreme Gradient Boosting]. We have tested the effects of changes in lifestyle in MS prediction at individual levels. RESULTS All models showed adequate calibration and good discrimination, but the LGBM showed better performance (Sensitivity = 87.8 %, Specificity = 70.2 %, AUC-ROC = 0.86). Causal inference analysis showed that increasing physical activity level and reducing BMI by at least 2 % had an effect of reducing the predicted probability of MS by 3.8 % (95 % CI = -4.8 %; -2.7 %). CONCLUSION ML models based on data from a checkup program showed good performance to predict MS and allowed testing for effects of lifestyle changes in this prediction. External validation is recommended to verify models' ability to identify at-risk individuals, and potentially increase their engagement in preventive measures.
Collapse
Affiliation(s)
| | - Andre Manoel
- Hospital Israelita Albert Einstein, São Paulo, Brazil
| | | | | | | | | | | | - Edson Amaro
- Hospital Israelita Albert Einstein, São Paulo, Brazil
| | | |
Collapse
|
17
|
Hsu NW, Chou KC, Wang YTT, Hung CL, Kuo CF, Tsai SY. Building a model for predicting metabolic syndrome using artificial intelligence based on an investigation of whole-genome sequencing. J Transl Med 2022; 20:190. [PMID: 35484552 PMCID: PMC9052619 DOI: 10.1186/s12967-022-03379-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Accepted: 04/04/2022] [Indexed: 12/02/2022] Open
Abstract
Background The circadian system is responsible for regulating various physiological activities and behaviors and has been gaining recognition. The circadian rhythm is adjusted in a 24-h cycle and has transcriptional–translational feedback loops. When the circadian rhythm is interrupted, affecting the expression of circadian genes, the phenotypes of diseases could amplify. For example, the importance of maintaining the internal temporal homeostasis conferred by the circadian system is revealed as mutations in genes coding for core components of the clock result in diseases. This study will investigate the association between circadian genes and metabolic syndromes in a Taiwanese population. Methods We performed analysis using whole-genome sequencing, read vcf files and set target circadian genes to determine if there were variants on target genes. In this study, we have investigated genetic contribution of circadian-related diseases using population-based next generation whole genome sequencing. We also used significant SNPs to create a metabolic syndrome prediction model. Logistic regression, random forest, adaboost, and neural network were used to predict metabolic syndrome. In addition, we used random forest model variables importance matrix to select 40 more significant SNPs, which were subsequently incorporated to create new prediction models and to compare with previous models. The data was then utilized for training set and testing set using five-fold cross validation. Each model was evaluated with the following criteria: area under the receiver operating characteristics curve (AUC), precision, F1 score, and average precision (the area under the precision recall curve). Results After searching significant variants, we used Chi-Square tests to find some variants. We found 186 significant SNPs, and four predicting models which used 186 SNPs (logistic regression, random forest, adaboost and neural network), AUC were 0.68, 0.8, 0.82, 0.81 respectively. The F1 scores were 0.412, 0.078, 0.295, 0.552, respectively. The other three models which used the 40 SNPs (logistic regression, adaboost and neural network), AUC were 0.82, 0.81, 0.81 respectively. The F1 scores were 0.584, 0.395, 0.574, respectively. Conclusions Circadian gene defect may also contribute to metabolic syndrome. Our study found several related genes and building a simple model to predict metabolic syndrome. Supplementary Information The online version contains supplementary material available at 10.1186/s12967-022-03379-7.
Collapse
Affiliation(s)
- Nai-Wei Hsu
- Department of Medicine, Mackay Medical College, New Taipei City, Taiwan
| | - Kai-Chen Chou
- Department of Laboratory Medicine, MacKay Memorial Hospital, Taipei City, Taiwan
| | - Yu-Ting Tina Wang
- Department of Laboratory Medicine, MacKay Memorial Hospital, Taipei City, Taiwan
| | - Chung-Lieh Hung
- Department of Medicine, Mackay Medical College, New Taipei City, Taiwan.,Institute of Biomedical Sciences, Mackay Medical College, New Taipei City, Taiwan
| | - Chien-Feng Kuo
- Department of Medicine, Mackay Medical College, New Taipei City, Taiwan.,Department of Nursing, MacKay Junior College of Medicine, Nursing and Management, New Taipei City, Taiwan.,Division of Infectious Diseases, Department of Internal Medicine, Mackay Memorial Hospital, Taipei, Taiwan
| | - Shin-Yi Tsai
- Department of Medicine, Mackay Medical College, New Taipei City, Taiwan. .,Department of Laboratory Medicine, MacKay Memorial Hospital, Taipei City, Taiwan. .,Department of Health Policy and Management, Johns Hopkins Bloomberg School of Public Health, Johns Hopkins University, Baltimore, Maryland, 21205, USA. .,Institute of Biomedical Sciences, Mackay Medical College, New Taipei City, Taiwan. .,Institute of Long-Term Care, Mackay Medical College, New Taipei City, Taiwan.
| |
Collapse
|
18
|
Single-nucleotide polymorphisms in medical nutritional weight loss: Challenges and future directions. J Transl Int Med 2022; 10:1-4. [PMID: 35702183 PMCID: PMC8997798 DOI: 10.2478/jtim-2022-0002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
|
19
|
Ibrahim MS, Pang D, Randhawa G, Pappas Y. Development and Validation of a Simple Risk Model for Predicting Metabolic Syndrome (MetS) in Midlife: A Cohort Study. Diabetes Metab Syndr Obes 2022; 15:1051-1075. [PMID: 35418767 PMCID: PMC8995775 DOI: 10.2147/dmso.s336384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/24/2021] [Accepted: 01/15/2022] [Indexed: 11/23/2022] Open
Abstract
Purpose To develop and validate a simple risk model for predicting metabolic syndrome in midlife using a prospective cohort data. Design Prospective cohort study. Participants A total of 7626 members of the 1958 British birth cohort (individuals born in the first week of March 1958) participated in the biomedical survey at age 45 and have completed information on metabolic syndrome. Methods Variables utilised were obtained prospectively at birth, 7, 16, 23 and 45 years. Multivariable logistic regression was used to develop a total of ten (10) MetS risk prediction models taking the life course approach. Measures of discrimination and calibration were used to evaluate the performance of the models. A pragmatic criteria developed was used to select one model with the most potential to be useful. The internal validity (overfitting) of the selected model was assessed using bootstrap technique of Stata. Main Outcome Measure Metabolic syndrome was defined based on the NCEP-ATP III clinical criteria. Results There is high prevalence of MetS among the cohort members (19.6%), with males having higher risk as compared to females (22.8% vs 16.4%, P < 0.001). Individuals with MetS are more likely to have higher levels of HbA1c and low HDL-cholesterol. Similarly, regarding the individual components of MetS, male cohort members are more likely to have higher levels of glycaemia (HbA1c), BP and serum triglycerides. In contrast, female cohort members have lower levels of HDL-cholesterol and higher levels of waist circumference. Furthermore, a total of ten (10) MetS risk prediction models were developed taking the life course approach. Of these, one model with the most potential to be applied in practical setting was selected. The model has good accuracy (AUROC 0.91 (0.90, 0.92)), is well calibrated (Hosmer-Lemeshow 6.47 (0.595)) and has good internal validity. Conclusion Early life factors could be included in a risk model to predict MetS in midlife. The developed model has been shown to be accurate and has good internal validity. Therefore, interventions targeting socioeconomic inequality could help in the wider prevention of MetS. However, the validity of the developed model needs to be further established in an external population.
Collapse
Affiliation(s)
- Musa S Ibrahim
- Institute for Health Research, University of Bedfordshire, Putteridge Bury Luton, Bedfordshire, LU2 8LE, England
| | - Dong Pang
- Institute for Health Research, University of Bedfordshire, Putteridge Bury Luton, Bedfordshire, LU2 8LE, England
| | - Gurch Randhawa
- Institute for Health Research, University of Bedfordshire, Putteridge Bury Luton, Bedfordshire, LU2 8LE, England
| | - Yannis Pappas
- Institute for Health Research, University of Bedfordshire, Putteridge Bury Luton, Bedfordshire, LU2 8LE, England
| |
Collapse
|
20
|
Kim J, Mun S, Lee S, Jeong K, Baek Y. Prediction of metabolic and pre-metabolic syndromes using machine learning models with anthropometric, lifestyle, and biochemical factors from a middle-aged population in Korea. BMC Public Health 2022; 22:664. [PMID: 35387629 PMCID: PMC8985311 DOI: 10.1186/s12889-022-13131-x] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Accepted: 03/30/2022] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND Metabolic syndrome (MetS) is a complex condition that appears as a cluster of metabolic abnormalities, and is closely associated with the prevalence of various diseases. Early prediction of the risk of MetS in the middle-aged population provides greater benefits for cardiovascular disease-related health outcomes. This study aimed to apply the latest machine learning techniques to find the optimal MetS prediction model for the middle-aged Korean population. METHODS We retrieved 20 data types from the Korean Medicine Daejeon Citizen Cohort, a cohort study on a community-based population of adults aged 30-55 years. The data included sex, age, anthropometric data, lifestyle-related data, and blood indicators of 1991 individuals. Participants satisfying two (pre-MetS) or ≥ 3 (MetS) of the five NECP-ATP III criteria were included in the MetS group. MetS prediction used nine machine learning models based on the following algorithms: Decision tree, Gaussian Naïve Bayes, K-nearest neighbor, eXtreme gradient boosting (XGBoost), random forest, logistic regression, support vector machine, multi-layer perceptron, and 1D convolutional neural network. All analyses were performed by sequentially inputting the features in three steps according to their characteristics. The models' performances were compared after applying the synthetic minority oversampling technique (SMOTE) to resolve data imbalance. RESULTS MetS was detected in 33.85% of the subjects. Among the MetS prediction models, the tree-based random forest and XGBoost models showed the best performance, which improved with the number of features used. As a measure of the models' performance, the area under the receiver operating characteristic curve (AUC) increased by up to 0.091 when the SMOTE was applied, with XGBoost showing the highest AUC of 0.851. Body mass index and waist-to-hip ratio were identified as the most important features in the MetS prediction models for this population. CONCLUSIONS Tree-based machine learning models were useful in identifying MetS with high accuracy in middle-aged Koreans. Early diagnosis of MetS is important and requires a multidimensional approach that includes self-administered questionnaire, anthropometric, and biochemical measurements.
Collapse
Affiliation(s)
- Junho Kim
- KM Data Division, Korea Institute of Oriental Medicine, 1672 Yuseongdae-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Sujeong Mun
- KM Data Division, Korea Institute of Oriental Medicine, 1672 Yuseongdae-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Siwoo Lee
- KM Data Division, Korea Institute of Oriental Medicine, 1672 Yuseongdae-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Kyoungsik Jeong
- KM Data Division, Korea Institute of Oriental Medicine, 1672 Yuseongdae-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Younghwa Baek
- KM Data Division, Korea Institute of Oriental Medicine, 1672 Yuseongdae-ro, Yuseong-gu, Daejeon, Republic of Korea.
| |
Collapse
|
21
|
Yang H, Yu B, OUYang P, Li X, Lai X, Zhang G, Zhang H. Machine learning-aided risk prediction for metabolic syndrome based on 3 years study. Sci Rep 2022; 12:2248. [PMID: 35145200 PMCID: PMC8831522 DOI: 10.1038/s41598-022-06235-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Accepted: 01/20/2022] [Indexed: 11/21/2022] Open
Abstract
Metabolic syndrome (MetS) is a group of physiological states of metabolic disorders, which may increase the risk of diabetes, cardiovascular and other diseases. Therefore, it is of great significance to predict the onset of MetS and the corresponding risk factors. In this study, we investigate the risk prediction for MetS using a data set of 67,730 samples with physical examination records of three consecutive years provided by the Department of Health Management, Nanfang Hospital, Southern Medical University, P.R. China. Specifically, the prediction for MetS takes the numerical features of examination records as well as the differential features by using the examination records over the past two consecutive years, namely, the differential numerical feature (DNF) and the differential state feature (DSF), and the risk factors of the above features w.r.t different ages and genders are statistically analyzed. From numerical results, it is shown that the proposed DSF in addition to the numerical feature of examination records, significantly contributes to the risk prediction of MetS. Additionally, the proposed scheme, by using the proposed features, yields a superior performance to the state-of-the-art MetS prediction model, which provides the potential of effective prescreening the occurrence of MetS.
Collapse
Affiliation(s)
- Haizhen Yang
- School of Physics and Telecommunication Engineering, South China Normal University (SCNU), Guangzhou, 510006, China.,School of Electronics and Information Engineering, SCNU, Foshan, 528225, China.,Guangdong Provincial Engineering Technology Research Center of Cardiovascular Individual Medicine & Big Data, SCNU, Guangzhou, 510006, China
| | - Baoxian Yu
- School of Physics and Telecommunication Engineering, South China Normal University (SCNU), Guangzhou, 510006, China. .,School of Electronics and Information Engineering, SCNU, Foshan, 528225, China. .,Guangdong Provincial Engineering Technology Research Center of Cardiovascular Individual Medicine & Big Data, SCNU, Guangzhou, 510006, China.
| | - Ping OUYang
- Department of Health Management, Nanfang Hospital, Southern Medical University, Guangzhou, 510515, China.
| | - Xiaoxi Li
- Department of Health Management, Nanfang Hospital, Southern Medical University, Guangzhou, 510515, China
| | - Xiaoying Lai
- Department of Health Management, Nanfang Hospital, Southern Medical University, Guangzhou, 510515, China
| | - Guishan Zhang
- Key Laboratory of Digital Signal and Image Processing of Guangdong Provincial, College of Engineering, Shantou University, Shantou, 515063, China
| | - Han Zhang
- School of Physics and Telecommunication Engineering, South China Normal University (SCNU), Guangzhou, 510006, China. .,School of Electronics and Information Engineering, SCNU, Foshan, 528225, China. .,Guangdong Provincial Engineering Technology Research Center of Cardiovascular Individual Medicine & Big Data, SCNU, Guangzhou, 510006, China.
| |
Collapse
|
22
|
Zhang Q, Wan NJ. Simple Method to Predict Insulin Resistance in Children Aged 6-12 Years by Using Machine Learning. Diabetes Metab Syndr Obes 2022; 15:2963-2975. [PMID: 36193541 PMCID: PMC9526431 DOI: 10.2147/dmso.s380772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Accepted: 09/13/2022] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND Due to the increasing insulin resistance (IR) in childhood, rates of diabetes and cardiovascular disease may rise in the future and seriously threaten the healthy development of children. Finding an easy way to predict IR in children can help pediatricians to identify these children in time and intervene appropriately, which is particularly important for practitioners in primary health care. PATIENTS AND METHODS Seventeen features from 503 children 6-12 years old were collected. We defined IR by HOMA-IR greater than 3.0, thus classifying children with IR and those without IR. Data were preprocessed by multivariate imputation and oversampling to resolve missing values and data imbalances; then, recursive feature elimination was applied to further select features of interest, and 5 machine learning methods-namely, logistic regression (LR), support vector machine (SVM), random forest (RF), extreme gradient boosting (XGBoost), and gradient boosting with categorical features support (CatBoost)-were used for model training. We tested the trained models on an external test set containing information from 133 children, from which performance metrics were extracted and the optimal model was selected. RESULTS After feature selection, the numbers of chosen features for the LR, SVM, RF, XGBoost, and CatBoost models were 6, 9, 10, 14, and 6, respectively. Among them, glucose, waist circumference, and age were chosen as predictors by most of the models. Finally, all 5 models achieved good performance on the external test set. Both XGBoost and CatBoost had the same AUC (0.85), which was highest among those of all models. Their accuracy, sensitivity, precision, and F1 scores were also close, but the specificity of XGBoost reached 0.79, which was significantly higher than that of CatBoost, so XGBoost was chosen as the optimal model. CONCLUSION The model developed herein has a good predictive ability for IR in children 6-12 years old and can be clinically applied to help pediatricians identify children with IR in a simple and inexpensive way.
Collapse
Affiliation(s)
- Qian Zhang
- Department of Pediatrics, Beijing Jishuitan Hospital, Beijing, People’s Republic of China
| | - Nai-jun Wan
- Department of Pediatrics, Beijing Jishuitan Hospital, Beijing, People’s Republic of China
- Correspondence: Nai-jun Wan, Department of Pediatrics, Beijing Jishuitan Hospital, 31# Xinjiekou Dongjie, West District, Beijing, 100035, People’s Republic of China, Tel +86-10-58398102, Email
| |
Collapse
|
23
|
Classification and Prediction on the Effects of Nutritional Intake on Overweight/Obesity, Dyslipidemia, Hypertension and Type 2 Diabetes Mellitus Using Deep Learning Model: 4-7th Korea National Health and Nutrition Examination Survey. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18115597. [PMID: 34073854 PMCID: PMC8197245 DOI: 10.3390/ijerph18115597] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 05/17/2021] [Accepted: 05/21/2021] [Indexed: 11/17/2022]
Abstract
Few studies have been conducted to classify and predict the influence of nutritional intake on overweight/obesity, dyslipidemia, hypertension and type 2 diabetes mellitus (T2DM) based on deep learning such as deep neural network (DNN). The present study aims to classify and predict associations between nutritional intake and risk of overweight/obesity, dyslipidemia, hypertension and T2DM by developing a DNN model, and to compare a DNN model with the most popular machine learning models such as logistic regression and decision tree. Subjects aged from 40 to 69 years in the 4-7th (from 2007 through 2018) Korea National Health and Nutrition Examination Survey (KNHANES) were included. Diagnostic criteria of dyslipidemia (n = 10,731), hypertension (n = 10,991), T2DM (n = 3889) and overweight/obesity (n = 10,980) were set as dependent variables. Nutritional intakes were set as independent variables. A DNN model comprising one input layer with 7 nodes, three hidden layers with 30 nodes, 12 nodes, 8 nodes in each layer and one output layer with one node were implemented in Python programming language using Keras with tensorflow backend. In DNN, binary cross-entropy loss function for binary classification was used with Adam optimizer. For avoiding overfitting, dropout was applied to each hidden layer. Structural equation modelling (SEM) was also performed to simultaneously estimate multivariate causal association between nutritional intake and overweight/obesity, dyslipidemia, hypertension and T2DM. The DNN model showed the higher prediction accuracy with 0.58654 for dyslipidemia, 0.79958 for hypertension, 0.80896 for T2DM and 0.62496 for overweight/obesity compared with two other machine leaning models with five-folds cross-validation. Prediction accuracy for dyslipidemia, hypertension, T2DM and overweight/obesity were 0.58448, 0.79929, 0.80818 and 0.62486, respectively, when analyzed by a logistic regression, also were 0.52148, 0.66773, 0.71587 and 0.54026, respectively, when analyzed by a decision tree. This study observed a DNN model with three hidden layers with 30 nodes, 12 nodes, 8 nodes in each layer had better prediction accuracy than two conventional machine learning models of a logistic regression and decision tree.
Collapse
|
24
|
Park JE, Mun S, Lee S. Metabolic Syndrome Prediction Models Using Machine Learning and Sasang Constitution Type. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE : ECAM 2021; 2021:8315047. [PMID: 33628316 PMCID: PMC7886522 DOI: 10.1155/2021/8315047] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 12/11/2020] [Accepted: 01/21/2021] [Indexed: 11/23/2022]
Abstract
BACKGROUND Machine learning may be a useful tool for predicting metabolic syndrome (MetS), and previous studies also suggest that the risk of MetS differs according to Sasang constitution type. The present study investigated the development of MetS prediction models utilizing machine learning methods and whether the incorporation of Sasang constitution type could improve the performance of those prediction models. METHODS Participants visiting a medical center for a health check-up were recruited in 2005 and 2006. Six kinds of machine learning were utilized (K-nearest neighbor, naive Bayes, random forest, decision tree, multilayer perceptron, and support vector machine), as was conventional logistic regression. Machine learning-derived MetS prediction models with and without the incorporation of Sasang constitution type were compared to investigate whether the former would predict MetS with higher sensitivity. Age, sex, education level, marital status, body mass index, stress, physical activity, alcohol consumption, and smoking were included as potentially predictive factors. RESULTS A total of 750/2,871 participants had MetS. Among the six types of machine learning methods investigated, multiplayer perceptron and support vector machine exhibited the same performance as the conventional regression method, based on the areas under the receiver operating characteristic curves. The naive-Bayes method exhibited the highest sensitivity (0.49), which was higher than that of the conventional regression method (0.39). The incorporation of Sasang constitution type improved the sensitivity of all of the machine learning methods investigated except for the K-nearest neighbor method. CONCLUSION Machine learning-derived models may be useful for MetS prediction, and the incorporation of Sasang constitution type may increase the sensitivity of such models.
Collapse
Affiliation(s)
- Ji-Eun Park
- Future Medicine Division, Korea Institute of Oriental Medicine, Daejeon, Republic of Korea
| | - Sujeong Mun
- Future Medicine Division, Korea Institute of Oriental Medicine, Daejeon, Republic of Korea
| | - Siwoo Lee
- Future Medicine Division, Korea Institute of Oriental Medicine, Daejeon, Republic of Korea
| |
Collapse
|
25
|
Wang J, Li C, Li J, Qin S, Liu C, Wang J, Chen Z, Wu J, Wang G. Development and internal validation of risk prediction model of metabolic syndrome in oil workers. BMC Public Health 2020; 20:1828. [PMID: 33256679 PMCID: PMC7706262 DOI: 10.1186/s12889-020-09921-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Accepted: 11/18/2020] [Indexed: 01/28/2023] Open
Abstract
Background The prevalence of metabolic syndrome continues to rise sharply worldwide, seriously threatening people’s health. The optimal model can be used to identify people at high risk of metabolic syndrome as early as possible, to predict their risk, and to persuade them to change their adverse lifestyle so as to slow down and reduce the incidence of metabolic syndrome. Methods Design existing circumstances research. A total of 1468 workers from an oil company who participated in occupational health physical examination from April 2017 to October 2018 were included in this study. We established the Logistic regression model, the random forest model and the convolutional neural network model, and compared the prediction performance of the models according to the F1 score, sensitivity, accuracy and other indicators of the three models. Results The results showed that the accuracy of the three models was 82.49,95.98 and 92.03%, the sensitivity was 87.94,95.52 and 90.59%, the specificity was 74.54, 96.65 and 94.14%, the F1 score was 0.86,0.97 and 0.93, and the area under ROC curve was 0.88,0.96 and 0.92, respectively. The Brier score of the three models was 0.15, 0.08 and 0.12, Observed-expected ratio was 0.83, 0.97 and 1.13, and the Integrated Calibration Index was 0.075,0.073 and 0.074, respectively, and explained how the random forest model was used for individual disease risk score. Conclusions The study showed that the prediction performance of random forest model is better than other models, and the model has higher application value, which can better predict the risk of metabolic syndrome in oil workers, and provide corresponding theoretical basis for the health management of oil workers. Supplementary Information The online version contains supplementary material available at 10.1186/s12889-020-09921-w.
Collapse
Affiliation(s)
- Jie Wang
- School of Public Health, North China University of Science and Technology, No.21 Bohai Avenue, Caofeidian New Town, Tangshan City, Hebei Province, 063210, P.R. China
| | - Chao Li
- School of Public Health, North China University of Science and Technology, No.21 Bohai Avenue, Caofeidian New Town, Tangshan City, Hebei Province, 063210, P.R. China
| | - Jing Li
- School of Public Health, North China University of Science and Technology, No.21 Bohai Avenue, Caofeidian New Town, Tangshan City, Hebei Province, 063210, P.R. China
| | - Sheng Qin
- School of Public Health, North China University of Science and Technology, No.21 Bohai Avenue, Caofeidian New Town, Tangshan City, Hebei Province, 063210, P.R. China
| | - Chunlei Liu
- College of Science, North China University of Science and Technology, Tangshan, Hebei, P.R. China
| | - Jiaojiao Wang
- School of Public Health, North China University of Science and Technology, No.21 Bohai Avenue, Caofeidian New Town, Tangshan City, Hebei Province, 063210, P.R. China
| | - Zhe Chen
- School of Public Health, North China University of Science and Technology, No.21 Bohai Avenue, Caofeidian New Town, Tangshan City, Hebei Province, 063210, P.R. China
| | - Jianhui Wu
- School of Public Health, North China University of Science and Technology, No.21 Bohai Avenue, Caofeidian New Town, Tangshan City, Hebei Province, 063210, P.R. China. .,Hebei Province Key Laboratory of Occupational Health and Safety for Coal Industry, North China University of Science and Technology, Tangshan, Hebei, P.R. China.
| | - Guoli Wang
- School of Public Health, North China University of Science and Technology, No.21 Bohai Avenue, Caofeidian New Town, Tangshan City, Hebei Province, 063210, P.R. China.,Hebei Province Key Laboratory of Occupational Health and Safety for Coal Industry, North China University of Science and Technology, Tangshan, Hebei, P.R. China
| |
Collapse
|
26
|
Perry BI, Upthegrove R, Crawford O, Jang S, Lau E, McGill I, Carver E, Jones PB, Khandaker GM. Cardiometabolic risk prediction algorithms for young people with psychosis: a systematic review and exploratory analysis. Acta Psychiatr Scand 2020; 142:215-232. [PMID: 32654119 DOI: 10.1111/acps.13212] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Accepted: 07/06/2020] [Indexed: 12/25/2022]
Abstract
OBJECTIVE Cardiometabolic risk prediction algorithms are common in clinical practice. Young people with psychosis are at high risk for developing cardiometabolic disorders. We aimed to examine whether existing cardiometabolic risk prediction algorithms are suitable for young people with psychosis. METHODS We conducted a systematic review and narrative synthesis of studies reporting the development and validation of cardiometabolic risk prediction algorithms for general or psychiatric populations. Furthermore, we used data from 505 participants with or at risk of psychosis at age 18 years in the ALSPAC birth cohort, to explore the performance of three algorithms (QDiabetes, QRISK3 and PRIMROSE) highlighted as potentially suitable. We repeated analyses after artificially increasing participant age to the mean age of the original algorithm studies to examine the impact of age on predictive performance. RESULTS We screened 7820 results, including 110 studies. All algorithms were developed in relatively older participants, and most were at high risk of bias. Three studies (QDiabetes, QRISK3 and PRIMROSE) featured psychiatric predictors. Age was more strongly weighted than other risk factors in each algorithm. In our exploratory analysis, calibration plots for all three algorithms implied a consistent systematic underprediction of cardiometabolic risk in the younger sample. After increasing participant age, calibration plots were markedly improved. CONCLUSION Existing cardiometabolic risk prediction algorithms cannot be recommended for young people with or at risk of psychosis. Existing algorithms may underpredict risk in young people, even in the face of other high-risk features. Recalibration of existing algorithms or a new tailored algorithm for the population is required.
Collapse
Affiliation(s)
- B I Perry
- Department of Psychiatry, University of Cambridge, Cambridge, UK
- Cambridgeshire and Peterborough NHS Foundation Trust, Cambridge, UK
| | - R Upthegrove
- Institute for Mental Health, University of Birmingham, Birmingham, UK
| | - O Crawford
- University of Cambridge School of Clinical Medicine, Cambridge, UK
| | - S Jang
- University of Cambridge School of Clinical Medicine, Cambridge, UK
| | - E Lau
- University of Cambridge School of Clinical Medicine, Cambridge, UK
| | - I McGill
- University of Cambridge School of Clinical Medicine, Cambridge, UK
| | - E Carver
- University of Cambridge School of Clinical Medicine, Cambridge, UK
| | - P B Jones
- Department of Psychiatry, University of Cambridge, Cambridge, UK
- Cambridgeshire and Peterborough NHS Foundation Trust, Cambridge, UK
| | - G M Khandaker
- Department of Psychiatry, University of Cambridge, Cambridge, UK
- Cambridgeshire and Peterborough NHS Foundation Trust, Cambridge, UK
| |
Collapse
|
27
|
Genetic markers and continuity of healthy metabolic status: Tehran cardio-metabolic genetic study (TCGS). Sci Rep 2020; 10:13600. [PMID: 32788640 PMCID: PMC7423921 DOI: 10.1038/s41598-020-70627-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Accepted: 07/23/2020] [Indexed: 12/29/2022] Open
Abstract
Obese individuals can be categorized as “healthy obese” (MHO) and “unhealthy obese” (MUO) based on the presence or absence of metabolic abnormality. This study sets out to assess potential genetic causes behind persistence of healthy metabolic status in individuals categorized as “healthy obese”. This study was conducted in the framework of the Tehran cardio-metabolic genetic study (TCGS). 766 MHO subjects at the start of the study followed up 15 years for occurrence of metabolic unhealthy status. These two groups (persistent MHO, MUO) were compared regarding the presence or absence of 16 single nucleotide polymorphisms (SNPs) identified as being associated with obesity phenotype in previous studies. We used logistic regression model for assessing the association between MHO/MUO with candidate SNPs. By the end of the follow up, 206 (27%) were categorized as the persistent MHO and 560 (73%) as MUO groups. Considering interaction effect between some SNP and sex, a sex stratification analysis was applied. When the analysis was performed by gender, rs1121980 associated with a decrease, and rs7903146 with an increase in the likelihood of persistent MHO individuals. Another analysis was separately performed on postmenopausal women from both groups; it showed that rs13107325 was associated with an increase in the likelihood of persistent MHO status in this subgroup of woman. In all cases, the markers had dominant inheritance. This findings suggest that the expression of some genetic markers are associated with persistence of healthy metabolic status, in female obese individuals.
Collapse
|