1
|
Abdel Hady DA, Mabrouk OM, Abd El-Hafeez T. Employing machine learning for enhanced abdominal fat prediction in cavitation post-treatment. Sci Rep 2024; 14:11004. [PMID: 38744923 PMCID: PMC11094079 DOI: 10.1038/s41598-024-60387-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 04/22/2024] [Indexed: 05/16/2024] Open
Abstract
This study investigates the application of cavitation in non-invasive abdominal fat reduction and body contouring, a topic of considerable interest in the medical and aesthetic fields. We explore the potential of cavitation to alter abdominal fat composition and delve into the optimization of fat prediction models using advanced hyperparameter optimization techniques, Hyperopt and Optuna. Our objective is to enhance the predictive accuracy of abdominal fat dynamics post-cavitation treatment. Employing a robust dataset with abdominal fat measurements and cavitation treatment parameters, we evaluate the efficacy of our approach through regression analysis. The performance of Hyperopt and Optuna regression models is assessed using metrics such as mean squared error, mean absolute error, and R-squared score. Our results reveal that both models exhibit strong predictive capabilities, with R-squared scores reaching 94.12% and 94.11% for post-treatment visceral fat, and 71.15% and 70.48% for post-treatment subcutaneous fat predictions, respectively. Additionally, we investigate feature selection techniques to pinpoint critical predictors within the fat prediction models. Techniques including F-value selection, mutual information, recursive feature elimination with logistic regression and random forests, variance thresholding, and feature importance evaluation are utilized. The analysis identifies key features such as BMI, waist circumference, and pretreatment fat levels as significant predictors of post-treatment fat outcomes. Our findings underscore the effectiveness of hyperparameter optimization in refining fat prediction models and offer valuable insights for the advancement of non-invasive fat reduction methods. This research holds important implications for both the scientific community and clinical practitioners, paving the way for improved treatment strategies in the realm of body contouring.
Collapse
Affiliation(s)
- Doaa A Abdel Hady
- Department of Physical Therapy for Women's Health, Faculty of Physiotherapy, Deraya University, EL-Minia, Egypt.
| | - Omar M Mabrouk
- MSK Sonographer, Physical Therapy for Basic Science, Deraya University, EL-Minia, Egypt
| | - Tarek Abd El-Hafeez
- Department of Computer Science, Faculty of Science, Minia University, EL-Minia, Egypt.
- Computer Science Unit, Deraya University, EL-Minia, Egypt.
| |
Collapse
|
2
|
Lin W, Shi S, Lan H, Wang N, Huang H, Wen J, Chen G. Identification of influence factors in overweight population through an interpretable risk model based on machine learning: a large retrospective cohort. Endocrine 2024; 83:604-614. [PMID: 37776483 DOI: 10.1007/s12020-023-03536-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 09/12/2023] [Indexed: 10/02/2023]
Abstract
BACKGROUND The identification of associated overweight risk factors is crucial to future health risk predictions and behavioral interventions. Several consensus problems remain in machine learning, such as cross-validation, and the resulting model may suffer from overfitting or poor interpretability. METHODS This study employed nine commonly used machine learning methods to construct overweight risk models. The general community are the target of this study, and a total of 10,905 Chinese subjects from Ningde City in Fujian province, southeast China, participated. The best model was selected through appropriate verification and validation and was suitably explained. RESULTS The overweight risk models employing machine learning exhibited good performance. It was concluded that CatBoost, which is used in the construction of clinical risk models, may surpass previous machine learning methods. The visual display of the Shapley additive explanation value for the machine model variables accurately represented the influence of each variable in the model. CONCLUSIONS The construction of an overweight risk model using machine learning may currently be the best approach. Moreover, CatBoost may be the best machine learning method. Furthermore, combining Shapley's additive explanation and machine learning methods can be effective in identifying disease risk factors for prevention and control.
Collapse
Affiliation(s)
- Wei Lin
- Department of Endocrinology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, FuZhou, 350001, PR China.
| | - Songchang Shi
- Department of Critical Care Medicine, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital South Branch, Fujian Provincial Hospital Jinshan Branch, Fujian Provincial Hospital, Fuzhou, 350001, PR China
| | - Huiyu Lan
- Department of Endocrinology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, FuZhou, 350001, PR China
| | - Nengying Wang
- Department of Endocrinology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, FuZhou, 350001, PR China
| | - Huibin Huang
- Department of Endocrinology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, FuZhou, 350001, PR China
| | - Junping Wen
- Department of Endocrinology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, FuZhou, 350001, PR China
| | - Gang Chen
- Department of Endocrinology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, FuZhou, 350001, PR China.
| |
Collapse
|
3
|
Gou H, Song H, Tian Z, Liu Y. Prediction models for children/adolescents with obesity/overweight: A systematic review and meta-analysis. Prev Med 2024; 179:107823. [PMID: 38103795 DOI: 10.1016/j.ypmed.2023.107823] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 11/12/2023] [Accepted: 12/11/2023] [Indexed: 12/19/2023]
Abstract
The incidence of obesity and overweight in children and adolescents is increasing worldwide and becomes a global health concern. This study aims to evaluate the accuracy of available prediction models in early identification of obesity and overweight in general children or adolescents and identify predictive factors for the models, thus provide a reference for subsequent development of risk prediction tools for obesity and overweight in children or adolescents. Related publications were obtained from several databases such as PubMed, Embase, Cochrane Library, and Web of Science from their inception to September 18th, 2022. The novel Prediction Model Risk of Bias Assessment Tool (PROBAST) was employed to assess the bias risk of the included studies. R4.2.0 and Stata15.1 softwares were used to conduct meta-analysis. This study involved 45 cross-sectional and/or prospective studies with 126 models. Meta-analyses showed that the overall pooled index of concordance (c-index) of prediction models for children/adolescents with obesity and overweight in the training set was 0.769 (95% CI 0.754-0.785) and 0.835(95% CI 0.792-0.879), respectively. Additionally, a large number of predictors were found to be related to children's lifestyles, such as sleep duration, sleep quality, and eating speed. In conclusions, prediction models can be employed to predict obesity/overweight in children and adolescents. Most predictors are controllable factors and are associated with lifestyle. Therefore, the prediction model serves as an excellent tool to formulate effective strategies for combating obesity/overweight in pediatric patients.
Collapse
Affiliation(s)
- Hao Gou
- Department of Pediatrics, Hospital of Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan Province, China
| | - Huiling Song
- Department of Emergency, West China Second University Hospital, Sichuan University, Chengdu, China; Key Laboratory of Birth Defects and Related Diseases of Women and Children, Ministry of Education, Sichuan University, Chengdu, China
| | - Zhiqing Tian
- Department of Emergency, West China Second University Hospital, Sichuan University, Chengdu, China; Key Laboratory of Birth Defects and Related Diseases of Women and Children, Ministry of Education, Sichuan University, Chengdu, China
| | - Yan Liu
- Department of Emergency, West China Second University Hospital, Sichuan University, Chengdu, China; Key Laboratory of Birth Defects and Related Diseases of Women and Children, Ministry of Education, Sichuan University, Chengdu, China.
| |
Collapse
|
4
|
Suissa K, Wyss R, Lu Z, Bessette LG, York C, Tsacogianis TN, Lin KJ. Development and Validation of a Claims-Based Model to Predict Categories of Obesity. Am J Epidemiol 2024; 193:203-213. [PMID: 37650647 DOI: 10.1093/aje/kwad178] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 05/23/2023] [Accepted: 08/24/2023] [Indexed: 09/01/2023] Open
Abstract
We developed and validated a claims-based algorithm that classifies patients into obesity categories. Using Medicare (2007-2017) and Medicaid (2000-2014) claims data linked to 2 electronic health record (EHR) systems in Boston, Massachusetts, we identified a cohort of patients with an EHR-based body mass index (BMI) measurement (calculated as weight (kg)/height (m)2). We used regularized regression to select from 137 variables and built generalized linear models to classify patients with BMIs of ≥25, ≥30, and ≥40. We developed the prediction model using EHR system 1 (training set) and validated it in EHR system 2 (validation set). The cohort contained 123,432 patients in the Medicare population and 40,736 patients in the Medicaid population. The model comprised 97 variables in the Medicare set and 95 in the Medicaid set, including BMI-related diagnosis codes, cardiovascular and antidiabetic drugs, and obesity-related comorbidities. The areas under the receiver-operating-characteristic curve in the validation set were 0.72, 0.75, and 0.83 (Medicare) and 0.66, 0.66, and 0.70 (Medicaid) for BMIs of ≥25, ≥30, and ≥40, respectively. The positive predictive values were 81.5%, 80.6%, and 64.7% (Medicare) and 81.6%, 77.5%, and 62.5% (Medicaid), for BMIs of ≥25, ≥30, and ≥40, respectively. The proposed model can identify obesity categories in claims databases when BMI measurements are missing and can be used for confounding adjustment, defining subgroups, or probabilistic bias analysis.
Collapse
|
5
|
Sewpaul R, Awe OO, Dogbey DM, Sekgala MD, Dukhi N. Classification of Obesity among South African Female Adolescents: Comparative Analysis of Logistic Regression and Random Forest Algorithms. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 21:2. [PMID: 38276791 PMCID: PMC10815679 DOI: 10.3390/ijerph21010002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 12/14/2023] [Accepted: 12/15/2023] [Indexed: 01/27/2024]
Abstract
BACKGROUND This study evaluates the performance of logistic regression (LR) and random forest (RF) algorithms to model obesity among female adolescents in South Africa. METHODS Data was analysed on 375 females aged 15-17 from the South African National Health and Nutrition Examination Survey 2011/2012. The primary outcome was obesity, defined as body mass index (BMI) ≥ 30 kg/m2. A total of 31 explanatory variables were included, ranging from socio-economic, demographic, family history, dietary and health behaviour. RF and LR models were run using imbalanced data as well as after oversampling, undersampling, and hybrid sampling of the data. RESULTS Using the imbalanced data, the RF model performed better with higher precision, recall, F1 score, and balanced accuracy. Balanced accuracy was highest with the hybrid data (0.618 for RF and 0.668 for LR). Using the hybrid balanced data, the RF model performed better (F1-score = 0.940 for RF vs. 0.798 for LR). CONCLUSION The model with the highest overall performance metrics was the RF model both before balancing the data and after applying hybrid balancing. Future work would benefit from using larger datasets on adolescent female obesity to assess the robustness of the models.
Collapse
Affiliation(s)
- Ronel Sewpaul
- Public Health, Societies and Belonging, Human Sciences Research Council, Merchant House, 2 Dock Rail Road, Cape Town 8001, South Africa
| | - Olushina Olawale Awe
- Institute of Mathematics, Statistics and Scientific Computing (IMECC), University of Campinas, Campinas 13083-859, Brazil
| | - Dennis Makafui Dogbey
- Medical Biotechnology and Immunotherapy Research Unit, Institute of Infectious Diseases and Molecular Medicine, Faculty of Health Sciences, University of Cape Town, Cape Town 7700, South Africa
| | | | - Natisha Dukhi
- Public Health, Societies and Belonging, Human Sciences Research Council, Merchant House, 2 Dock Rail Road, Cape Town 8001, South Africa
| |
Collapse
|
6
|
Alghalyini B. Applications of artificial intelligence in the management of childhood obesity. J Family Med Prim Care 2023; 12:2558-2564. [PMID: 38186810 PMCID: PMC10771175 DOI: 10.4103/jfmpc.jfmpc_469_23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 07/14/2023] [Accepted: 07/24/2023] [Indexed: 01/09/2024] Open
Abstract
Background Childhood obesity has emerged as a significant public health challenge, with long-term implications that often extend into adulthood, increasing the susceptibility to chronic health conditions. Objective The objective of this review is to elucidate the applications of artificial intelligence (AI) in the prevention and treatment of pediatric obesity, emphasizing its potential to complement and enhance traditional management methods. Methods We undertook a comprehensive examination of existing literature to understand the integration of machine learning and other AI techniques in childhood obesity management strategies. Results The findings from numerous studies suggest a strong endorsement for AI's role in addressing childhood obesity. Particularly, machine learning techniques have shown considerable efficacy in augmenting current therapeutic and preventive approaches. Conclusion The intersection of AI with conventional obesity management practices presents a novel and promising approach to fortify interventions targeting pediatric obesity. This review accentuates the transformative capacity of AI, thereby advocating for continued research and innovation in this rapidly evolving domain.
Collapse
Affiliation(s)
- Baraa Alghalyini
- Department of Family and Community Medicine, College of Medicine, Alfaisal University, Riyadh, Saudi Arabia
| |
Collapse
|
7
|
Murtha JA, Birstler J, Stalter L, Jawara D, Hanlon BM, Hanrahan LP, Churpek MM, Funk LM. Identifying Young Adults at High Risk for Weight Gain Using Machine Learning. J Surg Res 2023; 291:7-16. [PMID: 37329635 PMCID: PMC10524852 DOI: 10.1016/j.jss.2023.05.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 04/25/2023] [Accepted: 05/16/2023] [Indexed: 06/19/2023]
Abstract
INTRODUCTION Weight gain among young adults continues to increase. Identifying adults at high risk for weight gain and intervening before they gain weight could have a major public health impact. Our objective was to develop and test electronic health record-based machine learning models to predict weight gain in young adults with overweight/class 1 obesity. METHODS Seven machine learning models were assessed, including three regression models, random forest, single-layer neural network, gradient-boosted decision trees, and support vector machine (SVM) models. Four categories of predictors were included: 1) demographics; 2) obesity-related health conditions; 3) laboratory data and vital signs; and 4) neighborhood-level variables. The cohort was split 60:40 for model training and validation. Area under the receiver operating characteristic curves (AUC) were calculated to determine model accuracy at predicting high-risk individuals, defined by ≥ 10% total body weight gain within 2 y. Variable importance was measured via generalized analysis of variance procedures. RESULTS Of the 24,183 patients (mean [SD] age, 32.0 [6.3] y; 55.1% females) in the study, 14.2% gained ≥10% total body weight. Area under the receiver operating characteristic curves varied from 0.557 (SVM) to 0.675 (gradient-boosted decision trees). Age, sex, and baseline body mass index were the most important predictors among the models except SVM and neural network. CONCLUSIONS Our machine learning models performed similarly and had modest accuracy for identifying young adults at risk of weight gain. Future models may need to incorporate behavioral and/or genetic information to enhance model accuracy.
Collapse
Affiliation(s)
| | - Jen Birstler
- Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, Wisconsin
| | - Lily Stalter
- Department of Surgery, University of Wisconsin, Madison, Wisconsin
| | - Dawda Jawara
- Department of Surgery, University of Wisconsin, Madison, Wisconsin
| | - Bret M Hanlon
- Department of Surgery, University of Wisconsin, Madison, Wisconsin; Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, Wisconsin
| | - Lawrence P Hanrahan
- Department of Family Medicine and Community Health, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin
| | - Matthew M Churpek
- Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, Wisconsin; Department of Medicine, University of Wisconsin, Madison, Wisconsin
| | - Luke M Funk
- Department of Surgery, University of Wisconsin, Madison, Wisconsin; Department of Surgery, William S. Middleton Memorial VA, Madison, Wisconsin.
| |
Collapse
|
8
|
Jeong S, Yun SB, Park SY, Mun S. Understanding cross-data dynamics of individual and social/environmental factors through a public health lens: explainable machine learning approaches. Front Public Health 2023; 11:1257861. [PMID: 37954048 PMCID: PMC10639162 DOI: 10.3389/fpubh.2023.1257861] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 10/09/2023] [Indexed: 11/14/2023] Open
Abstract
Introduction The rising prevalence of obesity has become a public health concern, requiring efficient and comprehensive prevention strategies. Methods This study innovatively investigated the combined influence of individual and social/environmental factors on obesity within the urban landscape of Seoul, by employing advanced machine learning approaches. We collected 'Community Health Surveys' and credit card usage data to represent individual factors. In parallel, we utilized 'Seoul Open Data' to encapsulate social/environmental factors contributing to obesity. A Random Forest model was used to predict obesity based on individual factors. The model was further subjected to Shapley Additive Explanations (SHAP) algorithms to determine each factor's relative importance in obesity prediction. For social/environmental factors, we used the Geographically Weighted Least Absolute Shrinkage and Selection Operator (GWLASSO) to calculate the regression coefficients. Results The Random Forest model predicted obesity with an accuracy of >90%. The SHAP revealed diverse influential individual obesity-related factors in each Gu district, although 'self-awareness of obesity', 'weight control experience', and 'high blood pressure experience' were among the top five influential factors across all Gu districts. The GWLASSO indicated variations in regression coefficients between social/environmental factors across different districts. Conclusion Our findings provide valuable insights for designing targeted obesity prevention programs that integrate different individual and social/environmental factors within the context of urban design, even within the same city. This study enhances the efficient development and application of explainable machine learning in devising urban health strategies. We recommend that each autonomous district consider these differential influential factors in designing their budget plans to tackle obesity effectively.
Collapse
Affiliation(s)
- Siwoo Jeong
- Convergence Institute of Human Data Technology, Jeonju University, Jeonju, Republic of Korea
- Department of Sports Rehabilitation Medicine, Kyungil University, Gyeongsan, Republic of Korea
| | - Sung Bum Yun
- Urban Strategy Research Division, Seoul Institute of Technology, Seoul, Republic of Korea
| | - Soon Yong Park
- Urban Strategy Research Division, Seoul Institute of Technology, Seoul, Republic of Korea
| | - Sungchul Mun
- Convergence Institute of Human Data Technology, Jeonju University, Jeonju, Republic of Korea
- Department of Industrial Engineering, Jeonju University, Jeonju, Republic of Korea
| |
Collapse
|
9
|
Winkelman J, Nguyen D, vanSonnenberg E, Kirk A, Lieberman S. Artificial Intelligence (AI) in pediatric endocrinology. J Pediatr Endocrinol Metab 2023; 36:903-908. [PMID: 37589444 DOI: 10.1515/jpem-2023-0287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Accepted: 08/03/2023] [Indexed: 08/18/2023]
Abstract
Artificial Intelligence (AI) is integrating itself throughout the medical community. AI's ability to analyze complex patterns and interpret large amounts of data will have considerable impact on all areas of medicine, including pediatric endocrinology. In this paper, we review and update the current studies of AI in pediatric endocrinology. Specific topics that are addressed include: diabetes management, bone growth, metabolism, obesity, and puberty. Becoming knowledgeable and comfortable with AI will assist pediatric endocrinologists, the goal of the paper.
Collapse
Affiliation(s)
| | - Diep Nguyen
- University of Arizona College of Medicine Phoenix, Phoenix, USA
| | - Eric vanSonnenberg
- University of Arizona College of Medicine Phoenix, Phoenix, USA
- From the Departments of Radiology, University of Arizona College of Medicine Phoenix, Phoenix, USA
- Student Affairs, University of Arizona College of Medicine Phoenix, Phoenix, USA
| | - Alison Kirk
- University of Arizona College of Medicine Phoenix, Phoenix, USA
- Student Affairs, University of Arizona College of Medicine Phoenix, Phoenix, USA
- Pediatrics, University of Arizona College of Medicine Phoenix, Phoenix, USA
| | - Steven Lieberman
- University of Arizona College of Medicine Phoenix, Phoenix, USA
- Internal Medicine (Division of Endocrinology), University of Arizona College of Medicine Phoenix, Phoenix, USA
| |
Collapse
|
10
|
Abdulazeem H, Whitelaw S, Schauberger G, Klug SJ. A systematic review of clinical health conditions predicted by machine learning diagnostic and prognostic models trained or validated using real-world primary health care data. PLoS One 2023; 18:e0274276. [PMID: 37682909 PMCID: PMC10491005 DOI: 10.1371/journal.pone.0274276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Accepted: 08/29/2023] [Indexed: 09/10/2023] Open
Abstract
With the advances in technology and data science, machine learning (ML) is being rapidly adopted by the health care sector. However, there is a lack of literature addressing the health conditions targeted by the ML prediction models within primary health care (PHC) to date. To fill this gap in knowledge, we conducted a systematic review following the PRISMA guidelines to identify health conditions targeted by ML in PHC. We searched the Cochrane Library, Web of Science, PubMed, Elsevier, BioRxiv, Association of Computing Machinery (ACM), and IEEE Xplore databases for studies published from January 1990 to January 2022. We included primary studies addressing ML diagnostic or prognostic predictive models that were supplied completely or partially by real-world PHC data. Studies selection, data extraction, and risk of bias assessment using the prediction model study risk of bias assessment tool were performed by two investigators. Health conditions were categorized according to international classification of diseases (ICD-10). Extracted data were analyzed quantitatively. We identified 106 studies investigating 42 health conditions. These studies included 207 ML prediction models supplied by the PHC data of 24.2 million participants from 19 countries. We found that 92.4% of the studies were retrospective and 77.3% of the studies reported diagnostic predictive ML models. A majority (76.4%) of all the studies were for models' development without conducting external validation. Risk of bias assessment revealed that 90.8% of the studies were of high or unclear risk of bias. The most frequently reported health conditions were diabetes mellitus (19.8%) and Alzheimer's disease (11.3%). Our study provides a summary on the presently available ML prediction models within PHC. We draw the attention of digital health policy makers, ML models developer, and health care professionals for more future interdisciplinary research collaboration in this regard.
Collapse
Affiliation(s)
- Hebatullah Abdulazeem
- Chair of Epidemiology, Department of Sport and Health Sciences, Technical University of Munich (TUM), Munich, Germany
| | - Sera Whitelaw
- Faculty of Medicine and Health Sciences, McGill University, Montreal, Quebec, Canada
| | - Gunther Schauberger
- Chair of Epidemiology, Department of Sport and Health Sciences, Technical University of Munich (TUM), Munich, Germany
| | - Stefanie J. Klug
- Chair of Epidemiology, Department of Sport and Health Sciences, Technical University of Munich (TUM), Munich, Germany
| |
Collapse
|
11
|
Solomon DD, Khan S, Garg S, Gupta G, Almjally A, Alabduallah BI, Alsagri HS, Ibrahim MM, Abdallah AMA. Hybrid Majority Voting: Prediction and Classification Model for Obesity. Diagnostics (Basel) 2023; 13:2610. [PMID: 37568973 PMCID: PMC10417773 DOI: 10.3390/diagnostics13152610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Revised: 07/26/2023] [Accepted: 07/31/2023] [Indexed: 08/13/2023] Open
Abstract
Because it is associated with most multifactorial inherited diseases like heart disease, hypertension, diabetes, and other serious medical conditions, obesity is a major global health concern. Obesity is caused by hereditary, physiological, and environmental factors, as well as poor nutrition and a lack of exercise. Weight loss can be difficult for various reasons, and it is diagnosed via BMI, which is used to estimate body fat for most people. Muscular athletes, for example, may have a BMI in the obesity range even when they are not obese. Researchers from a variety of backgrounds and institutions devised different hypotheses and models for the prediction and classification of obesity using different approaches and various machine learning techniques. In this study, a majority voting-based hybrid modeling approach using a gradient boosting classifier, extreme gradient boosting, and a multilayer perceptron was developed. Seven distinct machine learning algorithms were used on open datasets from the UCI machine learning repository, and their respective accuracy levels were compared before the combined approaches were chosen. The proposed majority voting-based hybrid model for prediction and classification of obesity that was achieved has an accuracy of 97.16%, which is greater than both the individual models and the other hybrid models that have been developed.
Collapse
Affiliation(s)
- Dahlak Daniel Solomon
- Yogananda School of AI Computers and Data Sciences, Shoolini University, Solan 173229, India
| | - Shakir Khan
- College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh 11432, Saudi Arabia (A.M.A.A.)
- Department of Computer Science and Engineering, University Centre for Research and Development, Chandigarh University, Mohali 140413, India
| | - Sonia Garg
- Yogananda School of AI Computers and Data Sciences, Shoolini University, Solan 173229, India
| | - Gaurav Gupta
- Yogananda School of AI Computers and Data Sciences, Shoolini University, Solan 173229, India
| | - Abrar Almjally
- College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh 11432, Saudi Arabia (A.M.A.A.)
| | - Bayan Ibrahimm Alabduallah
- Department of Information System, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, Riyadh 11432, Saudi Arabia
| | - Hatoon S. Alsagri
- College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh 11432, Saudi Arabia (A.M.A.A.)
| | - Mandour Mohamed Ibrahim
- College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh 11432, Saudi Arabia (A.M.A.A.)
| | - Alsadig Mohammed Adam Abdallah
- College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh 11432, Saudi Arabia (A.M.A.A.)
| |
Collapse
|
12
|
Salinari A, Machì M, Armas Diaz Y, Cianciosi D, Qi Z, Yang B, Ferreiro Cotorruelo MS, Villar SG, Dzul Lopez LA, Battino M, Giampieri F. The Application of Digital Technologies and Artificial Intelligence in Healthcare: An Overview on Nutrition Assessment. Diseases 2023; 11:97. [PMID: 37489449 PMCID: PMC10366918 DOI: 10.3390/diseases11030097] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Revised: 07/11/2023] [Accepted: 07/11/2023] [Indexed: 07/26/2023] Open
Abstract
In the last decade, artificial intelligence (AI) and AI-mediated technologies have undergone rapid evolution in healthcare and medicine, from apps to computer software able to analyze medical images, robotic surgery and advanced data storage system. The main aim of the present commentary is to briefly describe the evolution of AI and its applications in healthcare, particularly in nutrition and clinical biochemistry. Indeed, AI is revealing itself to be an important tool in clinical nutrition by using telematic means to self-monitor various health metrics, including blood glucose levels, body weight, heart rate, fat percentage, blood pressure, activity tracking and calorie intake trackers. In particular, the application of the most common digital technologies used in the field of nutrition as well as the employment of AI in the management of diabetes and obesity, two of the most common nutrition-related pathologies worldwide, will be presented.
Collapse
Affiliation(s)
- Alessia Salinari
- Department of Clinical Sciences, Faculty of Medicine, Polytechnic University of Marche, 60131 Ancona, Italy
| | - Michele Machì
- Department of Clinical Sciences, Faculty of Medicine, Polytechnic University of Marche, 60131 Ancona, Italy
| | - Yasmany Armas Diaz
- Department of Clinical Sciences, Faculty of Medicine, Polytechnic University of Marche, 60131 Ancona, Italy
| | - Danila Cianciosi
- Department of Clinical Sciences, Faculty of Medicine, Polytechnic University of Marche, 60131 Ancona, Italy
| | - Zexiu Qi
- Department of Clinical Sciences, Faculty of Medicine, Polytechnic University of Marche, 60131 Ancona, Italy
| | - Bei Yang
- Department of Clinical Sciences, Faculty of Medicine, Polytechnic University of Marche, 60131 Ancona, Italy
| | | | - Santos Gracia Villar
- Research Group on Food, Nutritional Biochemistry and Health, Universidad Europea del Atlántico, 39011 Santander, Spain
- Department of Projects, Universidad Internacional Iberoamericana, Campeche 24560, Mexico
- Department of Extension, Universidad Internacional do Cuanza, Cuito P.O. Box 841, Angola
| | - Luis Alonso Dzul Lopez
- Research Group on Food, Nutritional Biochemistry and Health, Universidad Europea del Atlántico, 39011 Santander, Spain
- Department of Projects, Universidad Internacional Iberoamericana, Campeche 24560, Mexico
- Department of Projects, Universidad Internacional Iberoamericana, Arecibo, PR 00613, USA
| | - Maurizio Battino
- International Research Center for Food Nutrition and Safety, Jiangsu University, Zhenjiang 212013, China
- Department of Clinical Sciences, Faculty of Medicine, Polytechnic University of Marche, 60131 Ancona, Italy
- Research Group on Food, Nutritional Biochemistry and Health, Universidad Europea del Atlántico, 39011 Santander, Spain
| | - Francesca Giampieri
- Research Group on Food, Nutritional Biochemistry and Health, Universidad Europea del Atlántico, 39011 Santander, Spain
| |
Collapse
|
13
|
Lim H, Lee H, Kim J. A prediction model for childhood obesity risk using the machine learning method: a panel study on Korean children. Sci Rep 2023; 13:10122. [PMID: 37344518 DOI: 10.1038/s41598-023-37171-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 06/17/2023] [Indexed: 06/23/2023] Open
Abstract
Young children are increasingly exposed to an obesogenic environment through increased intake of processed food and decreased physical activity. Mothers' perceptions of obesity and parenting styles influence children's abilities to maintain a healthy weight. This study developed a prediction model for childhood obesity in 10-year-olds, and identify relevant risk factors using a machine learning method. Data on 1185 children and their mothers were obtained from the Korean National Panel Study. A prediction model for obesity was developed based on ten factors related to children (gender, eating habits, activity, and previous body mass index) and their mothers (education level, self-esteem, and body mass index). These factors were selected based on the least absolute shrinkage and selection operator. The prediction model was validated with an Area Under the Receiver Operator Characteristic Curve of 0.82 and an accuracy of 76%. Other than body mass index for both children and mothers, significant risk factors for childhood obesity were less physical activity among children and higher self-esteem among mothers. This study adds new evidence demonstrating that maternal self-esteem is related to children's body mass index. Future studies are needed to develop effective strategies for screening young children at risk for obesity, along with their mothers.
Collapse
Affiliation(s)
- Heemoon Lim
- College of Nursing, Yonsei University, Seoul, South Korea
| | - Hyejung Lee
- College of Nursing, Yonsei University, Mo-Im Kim Nursing Research Institute, Seoul, South Korea.
| | - Joungyoun Kim
- Department of Artificial Intelligence, University of Seoul, Seoul, South Korea
| |
Collapse
|
14
|
Cheng ER, Cengiz AY, Miled ZB. Predicting body mass index in early childhood using data from the first 1000 days. Sci Rep 2023; 13:8781. [PMID: 37258628 PMCID: PMC10232444 DOI: 10.1038/s41598-023-35935-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 05/25/2023] [Indexed: 06/02/2023] Open
Abstract
Few existing efforts to predict childhood obesity have included risk factors across the prenatal and early infancy periods, despite evidence that the first 1000 days is critical for obesity prevention. In this study, we employed machine learning techniques to understand the influence of factors in the first 1000 days on body mass index (BMI) values during childhood. We used LASSO regression to identify 13 features in addition to historical weight, height, and BMI that were relevant to childhood obesity. We then developed prediction models based on support vector regression with fivefold cross validation, estimating BMI for three time periods: 30-36 (N = 4204), 36-42 (N = 4130), and 42-48 (N = 2880) months. Our models were developed using 80% of the patients from each period. When tested on the remaining 20% of the patients, the models predicted children's BMI with high accuracy (mean average error [standard deviation] = 0.96[0.02] at 30-36 months, 0.98 [0.03] at 36-42 months, and 1.00 [0.02] at 42-48 months) and can be used to support clinical and public health efforts focused on obesity prevention in early life.
Collapse
Affiliation(s)
- Erika R Cheng
- Division of Children's Health Services Research, Department of Pediatrics, Indiana University School of Medicine, 410 W. 10th Street, Indianapolis, IN, 46220, USA.
| | - Ahmet Yahya Cengiz
- Department of Computer Science, Purdue School of Science, IUPUI, Indianapolis, IN, USA
| | - Zina Ben Miled
- Department of Electrical and Computer Engineering, School of Engineering and Technology, Indiana University Purdue University at Indianapolis, Indianapolis, IN, USA
- Regenstrief Institute, Inc., Indianapolis, IN, USA
| |
Collapse
|
15
|
Awe OO, Dukhi N, Dias R. Shrinkage heteroscedastic discriminant algorithms for classifying multi-class high-dimensional data: Insights from a national health survey. MACHINE LEARNING WITH APPLICATIONS 2023. [DOI: 10.1016/j.mlwa.2023.100459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/08/2023] Open
|
16
|
Ferreras A, Sumalla-Cano S, Martínez-Licort R, Elío I, Tutusaus K, Prola T, Vidal-Mazón JL, Sahelices B, de la Torre Díez I. Systematic Review of Machine Learning applied to the Prediction of Obesity and Overweight. J Med Syst 2023; 47:8. [PMID: 36637549 DOI: 10.1007/s10916-022-01904-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2022] [Accepted: 12/15/2022] [Indexed: 01/14/2023]
Abstract
Obesity and overweight has increased in the last year and has become a pandemic disease, the result of sedentary lifestyles and unhealthy diets rich in sugars, refined starches, fats and calories. Machine learning (ML) has proven to be very useful in the scientific community, especially in the health sector. With the aim of providing useful tools to help nutritionists and dieticians, research focused on the development of ML and Deep Learning (DL) algorithms and models is searched in the literature. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) protocol has been used, a very common technique applied to carry out revisions. In our proposal, 17 articles have been filtered in which ML and DL are applied in the prediction of diseases, in the delineation of treatment strategies, in the improvement of personalized nutrition and more. Despite expecting better results with the use of DL, according to the selected investigations, the traditional methods are still the most used and the yields in both cases fluctuate around positive values, conditioned by the databases (transformed in each case) to a greater extent than by the artificial intelligence paradigm used. Conclusions: An important compilation is provided for the literature in this area. ML models are time-consuming to clean data, but (like DL) they allow automatic modeling of large volumes of data which makes them superior to traditional statistics.
Collapse
Affiliation(s)
- Antonio Ferreras
- Department of Signal Theory and Communications and Telematics Engineering, University of Valladolid, Paseo de Belén 15, Valladolid, 47011, Spain
| | - Sandra Sumalla-Cano
- Research Group on Foods, Nutritional Biochemistry and Health, European University of the Atlantic, Santander, 39011, Spain
- Department of Health, Nutrition and Sport, Iberoamerican International University, Campeche, 24560, Mexico
| | - Rosmeri Martínez-Licort
- Telemedicine and eHealth Research Group, Department of Signal Theory and Communications and Telematics Engineering, University of Valladolid, Paseo de Belén 15, Valladolid, 47011, Spain.
- Department of Telecommunications, University of Pinar del Río, Pinar del Río, Cuba.
| | - Iñaki Elío
- Research Group on Foods, Nutritional Biochemistry and Health, European University of the Atlantic, Santander, 39011, Spain
- Department of Health, Nutrition and Sport, Iberoamerican International University, Campeche, 24560, Mexico
| | - Kilian Tutusaus
- Higher Polytechnic School, European University of the Atlantic, Santander, 39011, Spain
- Higher Polytechnic School, Iberoamerican International University, Campeche, 24560, Mexico
| | - Thomas Prola
- Faculty of Social Sciences and Humanites, European University of the Atlantic, Santander, Spain
| | - Juan Luís Vidal-Mazón
- Higher Polytechnic School, European University of the Atlantic, Santander, 39011, Spain
- Higher Polytechnic School, International University of Cuanza, Estrada nacional 250, Cuito-Bié, Angola
- Higher Polytechnic School, Iberoamerican International University, Arecibo, 00613, Puerto Rico
| | - Benjamín Sahelices
- Research group GCME, Department of Computer Science, University of Valladolid, Paseo de Belén 15, Valladolid, 47011, Spain
| | - Isabel de la Torre Díez
- Department of Signal Theory and Communications and Telematics Engineering, University of Valladolid, Paseo de Belén 15, Valladolid, 47011, Spain
| |
Collapse
|
17
|
Mondal PK, Foysal KH, Norman BA, Gittner LS. Predicting Childhood Obesity Based on Single and Multiple Well-Child Visit Data Using Machine Learning Classifiers. SENSORS (BASEL, SWITZERLAND) 2023; 23:759. [PMID: 36679555 PMCID: PMC9865403 DOI: 10.3390/s23020759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/19/2022] [Revised: 01/05/2023] [Accepted: 01/05/2023] [Indexed: 06/17/2023]
Abstract
Childhood obesity is a public health concern in the United States. Consequences of childhood obesity include metabolic disease and heart, lung, kidney, and other health-related comorbidities. Therefore, the early determination of obesity risk is needed and predicting the trend of a child's body mass index (BMI) at an early age is crucial. Early identification of obesity can lead to early prevention. Multiple methods have been tested and evaluated to assess obesity trends in children. Available growth charts help determine a child's current obesity level but do not predict future obesity risk. The present methods of predicting obesity include regression analysis and machine learning-based classifications and risk factor (threshold)-based categorizations based on specific criteria. All the present techniques, especially current machine learning-based methods, require longitudinal data and information on a large number of variables related to a child's growth (e.g., socioeconomic, family-related factors) in order to predict future obesity-risk. In this paper, we propose three different techniques for three different scenarios to predict childhood obesity based on machine learning approaches and apply them to real data. Our proposed methods predict obesity for children at five years of age using the following three data sets: (1) a single well-child visit, (2) multiple well-child visits under the age of two, and (3) multiple random well-child visits under the age of five. Our models are especially important for situations where only the current patient information is available rather than having multiple data points from regular spaced well-child visits. Our models predict obesity using basic information such as birth BMI, gestational age, BMI measures from well-child visits, and gender. Our models can predict a child's obesity category (normal, overweight, or obese) at five years of age with an accuracy of 89%, 77%, and 89%, for the three application scenarios, respectively. Therefore, our proposed models can assist healthcare professionals by acting as a decision support tool to aid in predicting childhood obesity early in order to reduce obesity-related complications, and in turn, improve healthcare.
Collapse
Affiliation(s)
- Pritom Kumar Mondal
- Department of Industrial, Manufacturing & Systems Engineering, Texas Tech University, Lubbock, TX 79409, USA
| | - Kamrul H. Foysal
- Department of Electrical and Computer Engineering, Texas Tech University, Lubbock, TX 79409, USA
| | - Bryan A. Norman
- Department of Industrial, Manufacturing & Systems Engineering, Texas Tech University, Lubbock, TX 79409, USA
| | - Lisaann S. Gittner
- Department of Public Health, Texas Tech University Health Sciences Center, Lubbock, TX 79430, USA
| |
Collapse
|
18
|
An R, Shen J, Xiao Y. Applications of Artificial Intelligence to Obesity Research: Scoping Review of Methodologies. J Med Internet Res 2022; 24:e40589. [PMID: 36476515 PMCID: PMC9856437 DOI: 10.2196/40589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Revised: 10/05/2022] [Accepted: 11/01/2022] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Obesity is a leading cause of preventable death worldwide. Artificial intelligence (AI), characterized by machine learning (ML) and deep learning (DL), has become an indispensable tool in obesity research. OBJECTIVE This scoping review aimed to provide researchers and practitioners with an overview of the AI applications to obesity research, familiarize them with popular ML and DL models, and facilitate the adoption of AI applications. METHODS We conducted a scoping review in PubMed and Web of Science on the applications of AI to measure, predict, and treat obesity. We summarized and categorized the AI methodologies used in the hope of identifying synergies, patterns, and trends to inform future investigations. We also provided a high-level, beginner-friendly introduction to the core methodologies to facilitate the dissemination and adoption of various AI techniques. RESULTS We identified 46 studies that used diverse ML and DL models to assess obesity-related outcomes. The studies found AI models helpful in detecting clinically meaningful patterns of obesity or relationships between specific covariates and weight outcomes. The majority (18/22, 82%) of the studies comparing AI models with conventional statistical approaches found that the AI models achieved higher prediction accuracy on test data. Some (5/46, 11%) of the studies comparing the performances of different AI models revealed mixed results, indicating the high contingency of model performance on the data set and task it was applied to. An accelerating trend of adopting state-of-the-art DL models over standard ML models was observed to address challenging computer vision and natural language processing tasks. We concisely introduced the popular ML and DL models and summarized their specific applications in the studies included in the review. CONCLUSIONS This study reviewed AI-related methodologies adopted in the obesity literature, particularly ML and DL models applied to tabular, image, and text data. The review also discussed emerging trends such as multimodal or multitask AI models, synthetic data generation, and human-in-the-loop that may witness increasing applications in obesity research.
Collapse
Affiliation(s)
- Ruopeng An
- Brown School, Washington University in St. Louis, St. Louis, MO, United States
| | - Jing Shen
- Department of Physical Education, China University of Geosciences, Beijing, China
| | - Yunyu Xiao
- Weill Cornell Medical College, Cornell University, Ithaca, NY, United States
| |
Collapse
|
19
|
Kaur R, Kumar R, Gupta M. Predicting risk of obesity and meal planning to reduce the obese in adulthood using artificial intelligence. Endocrine 2022; 78:458-469. [PMID: 36224505 PMCID: PMC9555702 DOI: 10.1007/s12020-022-03215-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 09/29/2022] [Indexed: 11/26/2022]
Abstract
BACKGROUND An unhealthy diet or excessive amount of food intake creates obesity issues in human beings that further may cause several diseases such as Polycystic Ovary Syndrome (PCOS), Cardiovascular disease, Diabetes, Cancers, etc. Obesity is a major risk factor for PCOS, which is a common disease in women and is significantly correlated with weight gain. METHODS This study is providing a one-step solution for predicting the risk of obesity using different Machine Learning (ML) algorithms such as Gradient Boosting (GB), Bagging meta-estimator (BME), XG Boost (XGB), Random Forest (RF), Support Vector Machine (SVM), and K Nearest Neighbour (KNN). A dataset is collected from the UCI ML repository having features of physical description and eating habits of individuals to train the proposed model. RESULTS The model has been experimented with different training and testing data ratios such as (90:10, 80:20, 70:30,60:40). At a data ratio of 90:10, the GB classifier achieved the highest accuracy i.e., 98.11%. Further, at the 80:20 ratio, the GB and XGB provide the same result i.e., 97.87%. For the 70:30 data ratio, XGB achieves the highest accuracy i.e., 97.79%. Further, the Nearest Neighbour (NN) learning method is applied to meal planning to overcome obesity. CONCLUSION This method predicts the meal which includes breakfast, morning snacks, lunch, evening snacks, and dinner for the individual as per caloric and macronutrient requirements. The proposed research work can be used by practitioners to check obesity levels and to suggest meals to reduce the obese in adulthood.
Collapse
Affiliation(s)
- Rajdeep Kaur
- Department of Computer Science & Engineering, Chandigarh University, Chandigarh, Punjab, India
| | - Rakesh Kumar
- Department of Computer Science & Engineering, Chandigarh University, Chandigarh, Punjab, India
| | - Meenu Gupta
- Department of Computer Science & Engineering, Chandigarh University, Chandigarh, Punjab, India.
| |
Collapse
|
20
|
Ji W, Zhang Y, Cheng Y, Wang Y, Zhou Y. Development and validation of prediction models for hypertension risks: A cross-sectional study based on 4,287,407 participants. Front Cardiovasc Med 2022; 9:928948. [PMID: 36225955 PMCID: PMC9548597 DOI: 10.3389/fcvm.2022.928948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 08/29/2022] [Indexed: 11/13/2022] Open
Abstract
ObjectiveTo develop an optimal screening model to identify the individuals with a high risk of hypertension in China by comparing tree-based machine learning models, such as classification and regression tree, random forest, adaboost with a decision tree, extreme gradient boosting decision tree, and other machine learning models like an artificial neural network, naive Bayes, and traditional logistic regression models.MethodsA total of 4,287,407 adults participating in the national physical examination were included in the study. Features were selected using the least absolute shrinkage and selection operator regression. The Borderline synthetic minority over-sampling technique was used for data balance. Non-laboratory and semi-laboratory analyses were carried out in combination with the selected features. The tree-based machine learning models, other machine learning models, and traditional logistic regression models were constructed to identify individuals with hypertension, respectively. Top features selected using the best algorithm and the corresponding variable importance score were visualized.ResultsA total of 24 variables were finally included for analyses after the least absolute shrinkage and selection operator regression model. The sample size of hypertensive patients in the training set was expanded from 689,025 to 2,312,160 using the borderline synthetic minority over-sampling technique algorithm. The extreme gradient boosting decision tree algorithm showed the best results (area under the receiver operating characteristic curve of non-laboratory: 0.893 and area under the receiver operating characteristic curve of semi-laboratory: 0.894). This study found that age, systolic blood pressure, waist circumference, diastolic blood pressure, albumin, drinking frequency, electrocardiogram, ethnicity (uyghur, hui, and other), body mass index, sex (female), exercise frequency, diabetes mellitus, and total bilirubin are important factors reflecting hypertension. Besides, some algorithms included in the semi-laboratory analyses showed less improvement in the predictive performance compared to the non-laboratory analyses.ConclusionUsing multiple methods, a more significant prediction model can be built, which discovers risk factors and provides new insights into the prediction and prevention of hypertension.
Collapse
Affiliation(s)
- Weidong Ji
- Department of Medical Information, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Yushan Zhang
- Department of Maternal and Child Health, School of Public Health, Sun Yat-sen University, Guangzhou, China
| | - Yinlin Cheng
- Department of Medical Information, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Yushan Wang
- Center of Health Management, The First Affiliated Hospital of Xinjiang Medical University, Urumqi, China
- *Correspondence: Yushan Wang
| | - Yi Zhou
- Department of Medical Information, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
- Yi Zhou
| |
Collapse
|
21
|
Alsareii SA, Shaf A, Ali T, Zafar M, Alamri AM, AlAsmari MY, Irfan M, Awais M. IoT Framework for a Decision-Making System of Obesity and Overweight Extrapolation among Children, Youths, and Adults. Life (Basel) 2022; 12:life12091414. [PMID: 36143450 PMCID: PMC9500775 DOI: 10.3390/life12091414] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Revised: 08/28/2022] [Accepted: 09/05/2022] [Indexed: 01/16/2023] Open
Abstract
Approximately 30% of the global population is suffering from obesity and being overweight, which is approximately 2.1 billion people worldwide. The ratio is expected to surpass 40% by 2030 if the current balance continues to grow. The global pandemic due to COVID-19 will also impact the predicted obesity rates. It will cause a significant increase in morbidity and mortality worldwide. Multiple chronic diseases are associated with obesity and several threat elements are associated with obesity. Various challenges are involved in the understanding of risk factors and the ratio of obesity. Therefore, diagnosing obesity in its initial stages might significantly increase the patient’s chances of effective treatment. The Internet of Things (IoT) has attained an evolving stage in the development of the contemporary environment of healthcare thanks to advancements in information and communication technologies. Therefore, in this paper, we thoroughly investigated machine learning techniques for making an IoT-enabled system. In the first phase, the proposed system analyzed the performances of random forest (RF), K-nearest neighbor (KNN), support vector machine (SVM), decision tree (DT), logistic regression (LR), and naïve Bayes (NB) algorithms on the obesity dataset. The second phase, on the other hand, introduced an IoT-based framework that adopts a multi-user request system by uploading the data to the cloud for the early diagnosis of obesity. The IoT framework makes the system available to anyone (and everywhere) for precise obesity categorization. This research will help the reader understand the relationships among risk factors with weight changes and their visualizations. Furthermore, it also focuses on how existing datasets can help one study the obesity nature and which classification and regression models perform well in correspondence to others.
Collapse
Affiliation(s)
- Saeed Ali Alsareii
- Department of Surgery, College of Medicine, Najran University Saudi Arabia, Najran 11001, Saudi Arabia
- Correspondence:
| | - Ahmad Shaf
- Department of Computer Science, COMSATS University Islamabad, Sahiwal Campus, Sahiwal 57000, Pakistan
| | - Tariq Ali
- Department of Computer Science, COMSATS University Islamabad, Sahiwal Campus, Sahiwal 57000, Pakistan
| | - Maryam Zafar
- Department of Computer Science, COMSATS University Islamabad, Sahiwal Campus, Sahiwal 57000, Pakistan
| | - Abdulrahman Manaa Alamri
- Department of Surgery, College of Medicine, Najran University Saudi Arabia, Najran 11001, Saudi Arabia
| | - Mansour Yousef AlAsmari
- Department of Surgery, College of Medicine, Najran University Saudi Arabia, Najran 11001, Saudi Arabia
| | - Muhammad Irfan
- Electrical Engineering Department, College of Engineering, Najran University Saudi Arabia, Najran 11001, Saudi Arabia
| | - Muhammad Awais
- Department of Computer Science, Edge Hill University, St Helens Rd, Ormskirk L39 4QP, UK
| |
Collapse
|
22
|
Matsushita FY, Krebs VLJ, Carvalho WBD. Artificial intelligence and machine learning in pediatrics and neonatology healthcare. Rev Assoc Med Bras (1992) 2022; 68:745-750. [PMID: 35766685 PMCID: PMC9575899 DOI: 10.1590/1806-9282.20220177] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 02/09/2022] [Indexed: 11/23/2022] Open
Affiliation(s)
- Felipe Yu Matsushita
- Universidade de São Paulo, Faculty of Medicine, Department of Pediatrics - São Paulo (SP), Brazil
| | - Vera Lucia Jornada Krebs
- Universidade de São Paulo, Faculty of Medicine, Department of Pediatrics - São Paulo (SP), Brazil
| | | |
Collapse
|
23
|
Platikanova M, Yordanova A, Hristova P. Dependence of Body Mass Index on Some Dietary Habits: An Application of Classification and Regression Tree. IRANIAN JOURNAL OF PUBLIC HEALTH 2022; 51:1283-1294. [PMID: 36447974 PMCID: PMC9659541 DOI: 10.18502/ijph.v51i6.9672] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/17/2021] [Accepted: 09/21/2021] [Indexed: 06/16/2023]
Abstract
BACKGROUND The purpose of this study was to determine the influence of some eating habits on body mass index (BMI) using a regression model created via the classification and regression tree method (CART). METHODS The study was conducted using a questionnaire specially developed for the study, evaluated for reliability and validity. In addition to demographics (age and sex), the questions concern the timing of the meals and the type of food consumed. The data contains records for 533 people (322 women and 211 men) aged 18 to 65 years. The survey was conducted in the period 2019-2021 in Stara Zagora, Bulgaria. Data were processed using descriptive statistics, and regression and classification data mining method CART. RESULTS A CART model with a dependent variable BMI and predictors Sex, Age, Breakfast type, Breakfast time, Lunchtime, Lunch type, Dinner time, Dinner type have been created. The obtained model is statistically significant at a significance level of P<0.0001 and a coefficient of determination R2 = 0.495. The normalized importance of the factors that affect the BMI is as follows: Sex (100%), Age (61.4%), Lunch type (26. 0%), Lunchtime (18.8%), Dinner time (13.9%), and Breakfast type (13.2%). Women have a lower BMI than men. BMI increases with age. CONCLUSION The CART method allows to make a classification by the predictors used and gives opportunities for a more in-depth analysis of the reasons for the increase in BMI. The level of influence of diet and eating habits (type of food, time of consumption) on BMI was determined.
Collapse
Affiliation(s)
- Magdalena Platikanova
- Department of Hygiene, Epidemiology, Microbiology, Parasitology and Infectious Diseases, Medical Faculty, Trakia University, Stara Zagora 6000, Bulgaria
| | | | - Petya Hristova
- Department of Hygiene, Epidemiology, Microbiology, Parasitology and Infectious Diseases, Medical Faculty, Trakia University, Stara Zagora 6000, Bulgaria
| |
Collapse
|
24
|
Alotaibi M, Alnajjar F, Cappuccio M, Khalid S, Alhmiedat T, Mubin O. Efficacy of Emerging Technologies to Manage Childhood Obesity. Diabetes Metab Syndr Obes 2022; 15:1227-1244. [PMID: 35480851 PMCID: PMC9037732 DOI: 10.2147/dmso.s357176] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Accepted: 04/13/2022] [Indexed: 11/23/2022] Open
Abstract
Childhood obesity is a widespread medical condition and presents a formidable challenge for public health. Long-term treatment strategies and early prevention strategies are required because obese children are more likely to carry this condition into adulthood, increasing their risk of developing other major health disorders. The present review analyses various technological interventions available for childhood obesity prevention and treatment. It also examines whether machine learning and technological interventions can play vital roles in its management. Twenty-six studies were shortlisted for the review using various technological strategies and analysed regarding their efficacy. While most of the selected studies showed positive outcomes, there was a lack of studies using robots and artificial intelligence to manage obesity in children. The use of machine learning was observed in various studies, and the integration of social robots and other efficacious strategies may be effective for treating childhood obesity in the future.
Collapse
Affiliation(s)
- Mohammad Alotaibi
- Faculty of Computers and Information Technology, University of Tabuk, Tabuk, Saudi Arabia
| | - Fady Alnajjar
- College of Information Technology, United Arab Emirates University, Abu Dhabi, United Arab Emirates
| | - Massimiliano Cappuccio
- School of Engineering and IT, University of New South Wales Canberra, Canberra, Australia
| | - Sumaya Khalid
- College of Information Technology, United Arab Emirates University, Abu Dhabi, United Arab Emirates
| | - Tareq Alhmiedat
- Faculty of Computers and Information Technology, University of Tabuk, Tabuk, Saudi Arabia
- Industrial Innovation & Robotics Center, University of Tabuk, Tabuk, Saudi Arabia
| | - Omar Mubin
- School of Computer, Data and Mathematical Sciences, Western Sydney University, Sydney, Australia
| |
Collapse
|
25
|
Marcos-Pasero H, Aguilar-Aguilar E, de la Iglesia R, Espinosa-Salinas I, Molina S, Colmenarejo G, Martínez JA, Ramírez de Molina A, Reglero G, Loria-Kohen V. "GENYAL" Study to Childhood Obesity Prevention: Methodology and Preliminary Results. Front Nutr 2022; 9:777384. [PMID: 35350411 PMCID: PMC8957940 DOI: 10.3389/fnut.2022.777384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Accepted: 02/04/2022] [Indexed: 11/13/2022] Open
Abstract
Objective This article describes the methodology and summarizes some preliminary results of the GENYAL study aiming to design and validate a predictive model, considering both environmental and genetic factors, that identifies children who would benefit most from actions aimed at reducing the risk of obesity and its complications. Design The study is a cluster randomized clinical trial with 5-year follow-up. The initial evaluation was carried out in 2017. The schools were randomly split into intervention (nutritional education) and control schools. Anthropometric measurements, social and health as well as dietary and physical activity data of schoolchildren and their families are annually collected. A total of 26 single nucleotide polymorphisms (SNPs) were assessed. Machine Learning models are being designed to predict obesity phenotypes after the 5-year follow-up. Settings Six schools in Madrid. Participants A total of 221 schoolchildren (6-8 years old). Results Collected results show that the prevalence of excess weight was 19.0, 25.4, and 32.2% (according to World Health Organization, International Obesity Task Force and Orbegozo Foundation criteria, respectively). Associations between the nutritional state of children with mother BMI [β = 0.21 (0.13-0.3), p (adjusted) <0.001], geographical location of the school [OR = 2.74 (1.24-6.22), p (adjusted) = 0.06], dairy servings per day [OR = 0.48 (0.29-0.75), p (adjusted) = 0.05] and 8 SNPs [rs1260326, rs780094, rs10913469, rs328, rs7647305, rs3101336, rs2568958, rs925946; p (not adjusted) <0.05] were found. Conclusions These baseline data support the evidence that environmental and genetic factors play a role in the development of childhood obesity. After 5-year follow-up, the GENYAL study pretends to validate the predictive model as a new strategy to fight against obesity. Clinical Trial Registration This study has been registered in ClinicalTrials.gov with the identifier NCT03419520, https://clinicaltrials.gov/ct2/show/NCT03419520.
Collapse
Affiliation(s)
- Helena Marcos-Pasero
- Nutrition and Clinical Trials Unit, GENYAL Platform, IMDEA-Food Institute, CEI UAM + CSIC, Madrid, Spain
- Faculty of Health Sciences, Valencian International University (VIU), Valencia, Spain
| | - Elena Aguilar-Aguilar
- Nutrition and Clinical Trials Unit, GENYAL Platform, IMDEA-Food Institute, CEI UAM + CSIC, Madrid, Spain
| | - Rocío de la Iglesia
- Departamento de Ciencias Farmaceúticas y de la Salud, Facultad de Farmacia, Universidad San Pablo-CEU, CEU Universities, Madrid, Spain
| | - Isabel Espinosa-Salinas
- Nutritional Genomics and Health Unit, GENYAL Platform, IMDEA-Food Institute, CEI UAM + CSIC, Madrid, Spain
| | - Susana Molina
- GenyalLab, GENYAL Platform, IMDEA-Food Institute, CEI UAM + CSIC, Madrid, Spain
| | - Gonzalo Colmenarejo
- Biostatistics and Bioinformatics Unit, IMDEA-Food Institute, CEI UAM + CSIC, Madrid, Spain
| | - J. Alfredo Martínez
- Precision Nutrition and Cardiometabolic Health, IMDEA-Food Institute, CEI UAM + CSIC, Madrid, Spain
- IdisNA, Navarra Institute for Health Research, Pamplona, Spain
- Center of Biomedical Research in Physiopathology of Obesity and Nutrition, Institute of Health Carlos III, Madrid, Spain
| | - Ana Ramírez de Molina
- Molecular Oncology and Nutritional Genomics of Cancer, IMDEA-Food Institute, CEI UAM + CSIC, Madrid, Spain
| | - Guillermo Reglero
- Production and Development of Foods for Health, IMDEA-Food Institute, CEI UAM + CSIC, Madrid, Spain
- Department of Production and Characterization of Novel Foods, Institute of Food Science Research (CIAL), CEI UAM+CSIC, Madrid, Spain
| | - Viviana Loria-Kohen
- Nutrition and Clinical Trials Unit, GENYAL Platform, IMDEA-Food Institute, CEI UAM + CSIC, Madrid, Spain
- Departamento de Nutrición y Ciencia de los Alimentos, Facultad de Farmacia, Universidad Complutense de Madrid, Grupo de Investigación VALORNUT-UCM, Madrid, Spain
| |
Collapse
|
26
|
Santisteban Quiroz JP. Estimation of obesity levels based on dietary habits and condition physical using computational intelligence. INFORMATICS IN MEDICINE UNLOCKED 2022. [DOI: 10.1016/j.imu.2022.100901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022] Open
|
27
|
Masi D, Risi R, Biagi F, Vasquez Barahona D, Watanabe M, Zilich R, Gabrielli G, Santin P, Mariani S, Lubrano C, Gnessi L. Application of a Machine Learning Technology in the Definition of Metabolically Healthy and Unhealthy Status: A Retrospective Study of 2567 Subjects Suffering from Obesity with or without Metabolic Syndrome. Nutrients 2022; 14:nu14020373. [PMID: 35057554 PMCID: PMC8779369 DOI: 10.3390/nu14020373] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Revised: 12/31/2021] [Accepted: 01/13/2022] [Indexed: 12/13/2022] Open
Abstract
The key factors playing a role in the pathogenesis of metabolic alterations observed in many patients with obesity have not been fully characterized. Their identification is crucial, and it would represent a fundamental step towards better management of this urgent public health issue. This aim could be accomplished by exploiting the potential of machine learning (ML) technology. In a single-centre study (n = 2567), we used an ML analysis to cluster patients with metabolically healthy (MHO) or metabolically unhealthy (MUO) obesity, based on several clinical and biochemical variables. The first model provided by ML was able to predict the presence/absence of MHO with an accuracy of 66.67% and 72.15%, respectively, and included the following parameters: HOMA-IR, upper body fat/lower body fat, glycosylated haemoglobin, red blood cells, age, alanine aminotransferase, uric acid, white blood cells, insulin-like growth factor 1 (IGF-1) and gamma-glutamyl transferase. For each of these parameters, ML provided threshold values identifying either MUO or MHO. A second model including IGF-1 zSDS, a surrogate marker of IGF-1 normalized by age and sex, was even more accurate with a 71.84% and 72.3% precision, respectively. Our results demonstrated high IGF-1 levels in MHO patients, thus highlighting a possible role of IGF-1 as a novel metabolic health parameter to effectively predict the development of MUO using ML technology.
Collapse
Affiliation(s)
- Davide Masi
- Department of Experimental Medicine, Section of Medical Pathophysiology, Food Science and Endocrinology, Sapienza University of Rome, 00161 Rome, Italy; (R.R.); (F.B.); (D.V.B.); (M.W.); (S.M.); (C.L.); (L.G.)
- Correspondence: ; Tel.: +39-06-499-707-16
| | - Renata Risi
- Department of Experimental Medicine, Section of Medical Pathophysiology, Food Science and Endocrinology, Sapienza University of Rome, 00161 Rome, Italy; (R.R.); (F.B.); (D.V.B.); (M.W.); (S.M.); (C.L.); (L.G.)
- MRC Metabolic Diseases Unit, MRC Institute of Metabolic Science, University of Cambridge, Cambridge CB2 1TN, UK
| | - Filippo Biagi
- Department of Experimental Medicine, Section of Medical Pathophysiology, Food Science and Endocrinology, Sapienza University of Rome, 00161 Rome, Italy; (R.R.); (F.B.); (D.V.B.); (M.W.); (S.M.); (C.L.); (L.G.)
| | - Daniel Vasquez Barahona
- Department of Experimental Medicine, Section of Medical Pathophysiology, Food Science and Endocrinology, Sapienza University of Rome, 00161 Rome, Italy; (R.R.); (F.B.); (D.V.B.); (M.W.); (S.M.); (C.L.); (L.G.)
| | - Mikiko Watanabe
- Department of Experimental Medicine, Section of Medical Pathophysiology, Food Science and Endocrinology, Sapienza University of Rome, 00161 Rome, Italy; (R.R.); (F.B.); (D.V.B.); (M.W.); (S.M.); (C.L.); (L.G.)
| | | | | | | | - Stefania Mariani
- Department of Experimental Medicine, Section of Medical Pathophysiology, Food Science and Endocrinology, Sapienza University of Rome, 00161 Rome, Italy; (R.R.); (F.B.); (D.V.B.); (M.W.); (S.M.); (C.L.); (L.G.)
| | - Carla Lubrano
- Department of Experimental Medicine, Section of Medical Pathophysiology, Food Science and Endocrinology, Sapienza University of Rome, 00161 Rome, Italy; (R.R.); (F.B.); (D.V.B.); (M.W.); (S.M.); (C.L.); (L.G.)
| | - Lucio Gnessi
- Department of Experimental Medicine, Section of Medical Pathophysiology, Food Science and Endocrinology, Sapienza University of Rome, 00161 Rome, Italy; (R.R.); (F.B.); (D.V.B.); (M.W.); (S.M.); (C.L.); (L.G.)
| |
Collapse
|
28
|
Lee YC, Christensen JJ, Parnell LD, Smith CE, Shao J, McKeown NM, Ordovás JM, Lai CQ. Using Machine Learning to Predict Obesity Based on Genome-Wide and Epigenome-Wide Gene-Gene and Gene-Diet Interactions. Front Genet 2022; 12:783845. [PMID: 35047011 PMCID: PMC8763388 DOI: 10.3389/fgene.2021.783845] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Accepted: 11/29/2021] [Indexed: 12/15/2022] Open
Abstract
Obesity is associated with many chronic diseases that impair healthy aging and is governed by genetic, epigenetic, and environmental factors and their complex interactions. This study aimed to develop a model that predicts an individual's risk of obesity by better characterizing these complex relations and interactions focusing on dietary factors. For this purpose, we conducted a combined genome-wide and epigenome-wide scan for body mass index (BMI) and up to three-way interactions among 402,793 single nucleotide polymorphisms (SNPs), 415,202 DNA methylation sites (DMSs), and 397 dietary and lifestyle factors using the generalized multifactor dimensionality reduction (GMDR) method. The training set consisted of 1,573 participants in exam 8 of the Framingham Offspring Study (FOS) cohort. After identifying genetic, epigenetic, and dietary factors that passed statistical significance, we applied machine learning (ML) algorithms to predict participants' obesity status in the test set, taken as a subset of independent samples (n = 394) from the same cohort. The quality and accuracy of prediction models were evaluated using the area under the receiver operating characteristic curve (ROC-AUC). GMDR identified 213 SNPs, 530 DMSs, and 49 dietary and lifestyle factors as significant predictors of obesity. Comparing several ML algorithms, we found that the stochastic gradient boosting model provided the best prediction accuracy for obesity with an overall accuracy of 70%, with ROC-AUC of 0.72 in test set samples. Top predictors of the best-fit model were 21 SNPs, 230 DMSs in genes such as CPT1A, ABCG1, SLC7A11, RNF145, and SREBF1, and 26 dietary factors, including processed meat, diet soda, French fries, high-fat dairy, artificial sweeteners, alcohol intake, and specific nutrients and food components, such as calcium and flavonols. In conclusion, we developed an integrated approach with ML to predict obesity using omics and dietary data. This extends our knowledge of the drivers of obesity, which can inform precision nutrition strategies for the prevention and treatment of obesity. Clinical Trial Registration: [www.ClinicalTrials.gov], the Framingham Heart Study (FHS), [NCT00005121].
Collapse
Affiliation(s)
- Yu-Chi Lee
- USDA ARS, Nutrition and Genomics Laboratory, JM-USDA Human Nutrition Research Center on Aging at Tufts University, Boston, MA, United States
| | - Jacob J. Christensen
- Department of Nutrition, Norwegian National Advisory Unit on FH, Oslo University Hospital, University of Oslo, Oslo, Norway
| | - Laurence D. Parnell
- USDA ARS, Nutrition and Genomics Laboratory, JM-USDA Human Nutrition Research Center on Aging at Tufts University, Boston, MA, United States
| | - Caren E. Smith
- Nutrition and Genomics Laboratory, JM-USDA Human Nutrition Research Center on Aging at Tufts University, Boston, MA, United States
| | - Jonathan Shao
- Statistical and Bioinformatics Group, Northeast Area, USDA ARS, Beltsville, MD, United States
| | - Nicola M. McKeown
- Nutritional Epidemiology Laboratory, JM-USDA Human Nutrition Research Center on Aging at Tufts University, Boston, MA, United States
- Friedman School of Nutrition Science and Policy, Tufts University, Boston, MA, United States
| | - José M. Ordovás
- Nutrition and Genomics Laboratory, JM-USDA Human Nutrition Research Center on Aging at Tufts University, Boston, MA, United States
- CEI UAM + CSIC, IMDEA Food Institute, Madrid, Spain
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Madrid, Spain
| | - Chao-Qiang Lai
- USDA ARS, Nutrition and Genomics Laboratory, JM-USDA Human Nutrition Research Center on Aging at Tufts University, Boston, MA, United States
| |
Collapse
|
29
|
Alanazi EM, Abdou A, Luo J. Predicting Risk of Stroke From Lab Tests Using Machine Learning Algorithms: Development and Evaluation of Prediction Models. JMIR Form Res 2021; 5:e23440. [PMID: 34860663 PMCID: PMC8686476 DOI: 10.2196/23440] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Revised: 11/14/2020] [Accepted: 10/15/2021] [Indexed: 12/22/2022] Open
Abstract
Background Stroke, a cerebrovascular disease, is one of the major causes of death. It causes significant health and financial burdens for both patients and health care systems. One of the important risk factors for stroke is health-related behavior, which is becoming an increasingly important focus of prevention. Many machine learning models have been built to predict the risk of stroke or to automatically diagnose stroke, using predictors such as lifestyle factors or radiological imaging. However, there have been no models built using data from lab tests. Objective The aim of this study was to apply computational methods using machine learning techniques to predict stroke from lab test data. Methods We used the National Health and Nutrition Examination Survey data sets with three different data selection methods (ie, without data resampling, with data imputation, and with data resampling) to develop predictive models. We used four machine learning classifiers and six performance measures to evaluate the performance of the models. Results We found that accurate and sensitive machine learning models can be created to predict stroke from lab test data. Our results show that the data resampling approach performed the best compared to the other two data selection techniques. Prediction with the random forest algorithm, which was the best algorithm tested, achieved an accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and area under the curve of 0.96, 0.97, 0.96, 0.75, 0.99, and 0.97, respectively, when all of the attributes were used. Conclusions The predictive model, built using data from lab tests, was easy to use and had high accuracy. In future studies, we aim to use data that reflect different types of stroke and to explore the data to build a prediction model for each type.
Collapse
Affiliation(s)
- Eman M Alanazi
- Department of Health Informatics, College of Health Sciences, Saudi Electronic University, Riyadh, Saudi Arabia.,Department of Biomedical and Health Informatics, College of Engineering, University of Wisconsin-Milwaukee, Milwaukee, WI, United States
| | - Aalaa Abdou
- Department of Radiotherapy, Children's Cancer Hospital 57357, Cairo, Egypt
| | - Jake Luo
- Department of Health Informatics and Administration, College of Health Sciences, University of Wisconsin-Milwaukee, Milwaukee, WI, United States
| |
Collapse
|
30
|
Höskuldsdóttir G, Engström M, Rawshani A, Wallenius V, Lenér F, Fändriks L, Mossberg K, Eliasson B. The BAriatic surgery SUbstitution and nutrition (BASUN) population: a data-driven exploration of predictors for obesity. BMC Endocr Disord 2021; 21:183. [PMID: 34507573 PMCID: PMC8431862 DOI: 10.1186/s12902-021-00849-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Accepted: 08/18/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The development of obesity is most likely due to a combination of biological and environmental factors some of which might still be unidentified. We used a machine learning technique to examine the relative importance of more than 100 clinical variables as predictors for BMI. METHODS BASUN is a prospective non-randomized cohort study of 971 individuals that received medical or surgical treatment (treatment choice was based on patient's preferences and clinical criteria, not randomization) for obesity in the Västra Götaland county in Sweden between 2015 and 2017 with planned follow-up for 10 years. This study includes demographic data, BMI, blood tests, and questionnaires before obesity treatment that cover three main areas: gastrointestinal symptoms and eating habits, physical activity and quality of life, and psychological health. We used random forest, with conditional variable importance, to study the relative importance of roughly 100 predictors of BMI, covering 15 domains. We quantified the predictive value of each individual predictor, as well as each domain. RESULTS The participants received medical (n = 382) or surgical treatment for obesity (Roux-en-Y gastric bypass, n = 388; sleeve gastrectomy, n = 201). There were minor differences between these groups before treatment with regard to anthropometrics, laboratory measures and results from questionnaires. The 10 individual variables with the strongest predictive value, in order of decreasing strength, were country of birth, marital status, sex, calcium levels, age, levels of TSH and HbA1c, AUDIT score, BE tendencies according to QEWPR, and TG levels. The strongest domains predicting BMI were: Socioeconomic status, Demographics, Biomarkers (notably TSH), Lifestyle/habits, Biomarkers for cardiovascular disease and diabetes, and Potential anxiety and depression. CONCLUSIONS Lifestyle, habits, age, sex and socioeconomic status are some of the strongest predictors for BMI levels. Potential anxiety and / or depression and other characteristics captured using questionnaires have strong predictive value. These results confirm previously suggested associations and advocate prospective studies to examine the value of better characterization of patients eligible for obesity treatment, and consequently to evaluate the treatment effects in groups of patients. TRIAL REGISTRATION March 03, 2015; NCT03152617 .
Collapse
Affiliation(s)
- Gudrún Höskuldsdóttir
- Institute of Medicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden.
- Department of Medicine, Sahlgrenska University Hospital, 413 45, Gothenburg, Sweden.
| | - My Engström
- Institute of Health and Care Sciences, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
- Department of Surgery, Region Västra Götaland, Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Araz Rawshani
- Institute of Medicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
| | - Ville Wallenius
- Institute of Clinical Sciences, University of Gothenburg, Gothenburg, Sweden
- Department of Surgery, Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Frida Lenér
- Institute of Medicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
- Department of Public Health and Community Medicine, Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Lars Fändriks
- Institute of Clinical Sciences, University of Gothenburg, Gothenburg, Sweden
- Department of Surgery, Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Karin Mossberg
- Institute of Medicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
- Department of Public Health and Community Medicine, Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Björn Eliasson
- Institute of Medicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
- Department of Medicine, Sahlgrenska University Hospital, 413 45, Gothenburg, Sweden
| |
Collapse
|
31
|
Safaei M, Sundararajan EA, Driss M, Boulila W, Shapi'i A. A systematic literature review on obesity: Understanding the causes & consequences of obesity and reviewing various machine learning approaches used to predict obesity. Comput Biol Med 2021; 136:104754. [PMID: 34426171 DOI: 10.1016/j.compbiomed.2021.104754] [Citation(s) in RCA: 124] [Impact Index Per Article: 41.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Revised: 08/05/2021] [Accepted: 08/05/2021] [Indexed: 01/02/2023]
Abstract
Obesity is considered a principal public health concern and ranked as the fifth foremost reason for death globally. Overweight and obesity are one of the main lifestyle illnesses that leads to further health concerns and contributes to numerous chronic diseases, including cancers, diabetes, metabolic syndrome, and cardiovascular diseases. The World Health Organization also predicted that 30% of death in the world will be initiated with lifestyle diseases in 2030 and can be stopped through the suitable identification and addressing of associated risk factors and behavioral involvement policies. Thus, detecting and diagnosing obesity as early as possible is crucial. Therefore, the machine learning approach is a promising solution to early predictions of obesity and the risk of overweight because it can offer quick, immediate, and accurate identification of risk factors and condition likelihoods. The present study conducted a systematic literature review to examine obesity research and machine learning techniques for the prevention and treatment of obesity from 2010 to 2020. Accordingly, 93 papers are identified from the review articles as primary studies from an initial pool of over 700 papers addressing obesity. Consequently, this study initially recognized the significant potential factors that influence and cause adult obesity. Next, the main diseases and health consequences of obesity and overweight are investigated. Ultimately, this study recognized the machine learning methods that can be used for the prediction of obesity. Finally, this study seeks to support decision-makers looking to understand the impact of obesity on health in the general population and identify outcomes that can be used to guide health authorities and public health to further mitigate threats and effectively guide obese people globally.
Collapse
Affiliation(s)
- Mahmood Safaei
- Centre for Software Technology and Management, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia (UKM), Bangi, 43600, Selangor, Malaysia
| | - Elankovan A Sundararajan
- Centre for Software Technology and Management, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia (UKM), Bangi, 43600, Selangor, Malaysia.
| | - Maha Driss
- RIADI Laboratory, University of Manouba, Manouba, Tunisia; College of Computer Science and Engineering, Taibah University, Medina, Saudi Arabia
| | - Wadii Boulila
- RIADI Laboratory, University of Manouba, Manouba, Tunisia; College of Computer Science and Engineering, Taibah University, Medina, Saudi Arabia
| | - Azrulhizam Shapi'i
- Center for Artificial Intelligence Technology, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia (UKM), Bangi, 43600, Selangor, Malaysia
| |
Collapse
|
32
|
Qasrawi R, Vicuna Polo SP, Abu Al-Halawa D, Hallaq S, Abdeen Z. Schoolchildren’ Depression and Anxiety Risk Factors Assessment and Prediction: Machine Learning Techniques Performance Analysis (Preprint). JMIR Form Res 2021; 6:e32736. [PMID: 35665695 PMCID: PMC9475423 DOI: 10.2196/32736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2021] [Revised: 02/03/2022] [Accepted: 04/29/2022] [Indexed: 11/13/2022] Open
Abstract
Background Depression and anxiety symptoms in early childhood have a major effect on children’s mental health growth and cognitive development. The effect of mental health problems on cognitive development has been studied by researchers for the last 2 decades. Objective In this paper, we sought to use machine learning techniques to predict the risk factors associated with schoolchildren’s depression and anxiety. Methods The study sample consisted of 3984 students in fifth to ninth grades, aged 10-15 years, studying at public and refugee schools in the West Bank. The data were collected using the health behaviors schoolchildren questionnaire in the 2013-2014 academic year and analyzed using machine learning to predict the risk factors associated with student mental health symptoms. We used 5 machine learning techniques (random forest [RF], neural network, decision tree, support vector machine [SVM], and naive Bayes) for prediction. Results The results indicated that the SVM and RF models had the highest accuracy levels for depression (SVM: 92.5%; RF: 76.4%) and anxiety (SVM: 92.4%; RF: 78.6%). Thus, the SVM and RF models had the best performance in classifying and predicting the students’ depression and anxiety. The results showed that school violence and bullying, home violence, academic performance, and family income were the most important factors affecting the depression and anxiety scales. Conclusions Overall, machine learning proved to be an efficient tool for identifying and predicting the associated factors that influence student depression and anxiety. The machine learning techniques seem to be a good model for predicting abnormal depression and anxiety symptoms among schoolchildren, so the deployment of machine learning within the school information systems might facilitate the development of health prevention and intervention programs that will enhance students’ mental health and cognitive development.
Collapse
Affiliation(s)
- Radwan Qasrawi
- Department of Computer Science, Al-Quds University, Ramallah, Occupied Palestinian Territory
- Department of Computer Engineering, Istinye University, Istanbul, Turkey
| | - Stephanny Paola Vicuna Polo
- Center for Business Innovation and Technology, Al-Quds University, Jerusalem, Occupied Palestinian Territory
| | - Diala Abu Al-Halawa
- Faculty of Medicine, Al-Quds University, Jerusalem, Occupied Palestinian Territory
| | - Sameh Hallaq
- Al-Quds Bard College for Arts and Sciences, Al-Quds University, Jerusalem, Occupied Palestinian Territory
| | - Ziad Abdeen
- Faculty of Medicine, Al-Quds University, Jerusalem, Occupied Palestinian Territory
| |
Collapse
|
33
|
Nyrup R. From General Principles to Procedural Values: Responsible Digital Health Meets Public Health Ethics. Front Digit Health 2021; 3:690417. [PMID: 34713166 PMCID: PMC8521828 DOI: 10.3389/fdgth.2021.690417] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Accepted: 06/14/2021] [Indexed: 11/13/2022] Open
Abstract
Most existing work in digital ethics is modeled on the "principlist" approach to medical ethics, seeking to articulate a small set of general principles to guide ethical decision-making. Critics have highlighted several limitations of such principles, including (1) that they mask ethical disagreements between and within stakeholder communities, and (2) that they provide little guidance for how to resolve trade-offs between different values. This paper argues that efforts to develop responsible digital health practices could benefit from paying closer attention to a different branch of medical ethics, namely public health ethics. In particular, I argue that the influential "accountability for reasonableness" (A4R) approach to public health ethics can help overcome some of the limitations of existing digital ethics principles. A4R seeks to resolve trade-offs through decision-procedures designed according to certain shared procedural values. This allows stakeholders to recognize decisions reached through these procedures as legitimate, despite their underlying disagreements. I discuss the prospects for adapting A4R to the context of responsible digital health and suggest questions for further research.
Collapse
Affiliation(s)
- Rune Nyrup
- Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
34
|
Thamrin SA, Arsyad DS, Kuswanto H, Lawi A, Nasir S. Predicting Obesity in Adults Using Machine Learning Techniques: An Analysis of Indonesian Basic Health Research 2018. Front Nutr 2021; 8:669155. [PMID: 34235168 PMCID: PMC8255629 DOI: 10.3389/fnut.2021.669155] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Accepted: 04/27/2021] [Indexed: 12/22/2022] Open
Abstract
Obesity is strongly associated with multiple risk factors. It is significantly contributing to an increased risk of chronic disease morbidity and mortality worldwide. There are various challenges to better understand the association between risk factors and the occurrence of obesity. The traditional regression approach limits analysis to a small number of predictors and imposes assumptions of independence and linearity. Machine Learning (ML) methods are an alternative that provide information with a unique approach to the application stage of data analysis on obesity. This study aims to assess the ability of ML methods, namely Logistic Regression, Classification and Regression Trees (CART), and Naïve Bayes to identify the presence of obesity using publicly available health data, using a novel approach with sophisticated ML methods to predict obesity as an attempt to go beyond traditional prediction models, and to compare the performance of three different methods. Meanwhile, the main objective of this study is to establish a set of risk factors for obesity in adults among the available study variables. Furthermore, we address data imbalance using Synthetic Minority Oversampling Technique (SMOTE) to predict obesity status based on risk factors available in the dataset. This study indicates that the Logistic Regression method shows the highest performance. Nevertheless, kappa coefficients show only moderate concordance between predicted and measured obesity. Location, marital status, age groups, education, sweet drinks, fatty/oily foods, grilled foods, preserved foods, seasoning powders, soft/carbonated drinks, alcoholic drinks, mental emotional disorders, diagnosed hypertension, physical activity, smoking, and fruit and vegetables consumptions are significant in predicting obesity status in adults. Identifying these risk factors could inform health authorities in designing or modifying existing policies for better controlling chronic diseases especially in relation to risk factors associated with obesity. Moreover, applying ML methods on publicly available health data, such as Indonesian Basic Health Research (RISKESDAS) is a promising strategy to fill the gap for a more robust understanding of the associations of multiple risk factors in predicting health outcomes.
Collapse
Affiliation(s)
- Sri Astuti Thamrin
- Department of Statistics, Faculty of Mathematics and Natural Science, Hasanuddin University, Makassar, Indonesia
| | - Dian Sidik Arsyad
- Department of Epidemiology, Faculty of Public Health, Hasanuddin University, Makassar, Indonesia
| | - Hedi Kuswanto
- Department of Statistics, Faculty of Mathematics and Natural Science, Hasanuddin University, Makassar, Indonesia
| | - Armin Lawi
- Department of Mathematics, Faculty of Mathematics and Natural Sciences, Hasanuddin University, Makassar, Indonesia
| | - Sudirman Nasir
- Department of Health Promotion, Faculty of Public Health, Hasanuddin University, Makassar, Indonesia
| |
Collapse
|
35
|
Pang X, Forrest CB, Lê-Scherban F, Masino AJ. Prediction of early childhood obesity with machine learning and electronic health record data. Int J Med Inform 2021; 150:104454. [PMID: 33866231 DOI: 10.1016/j.ijmedinf.2021.104454] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 03/19/2021] [Accepted: 04/05/2021] [Indexed: 12/27/2022]
Abstract
OBJECTIVE This study compares seven machine learning models developed to predict childhood obesity from age > 2 to ≤ 7 years using Electronic Healthcare Record (EHR) data up to age 2 years. MATERIALS AND METHODS EHR data from of 860,510 patients with 11,194,579 healthcare encounters were obtained from the Children's Hospital of Philadelphia. After applying stringent quality control to remove implausible growth values and including only individuals with all recommended wellness visits by age 7 years, 27,203 (50.78 % male) patients remained for model development. Seven machine learning models were developed to predict obesity incidence as defined by the Centers for Disease Control and Prevention (age/sex adjusted BMI>95th percentile). Model performance was evaluated by multiple standard classifier metrics and the differences among seven models were compared using the Cochran's Q test and post-hoc pairwise testing. RESULTS XGBoost yielded 0.81 (0.001) AUC, which outperformed all other models. It also achieved statistically significant better performance than all other models on standard classifier metrics (sensitivity fixed at 80 %): precision 30.90 % (0.22 %), F1-socre 44.60 % (0.26 %), accuracy 66.14 % (0.41 %), and specificity 63.27 % (0.41 %). DISCUSSION AND CONCLUSION Early childhood obesity prediction models were developed from the largest cohort reported to date. Relative to prior research, our models generalize to include males and females in a single model and extend the time frame for obesity incidence prediction to 7 years of age. The presented machine learning model development workflow can be adapted to various EHR-based studies and may be valuable for developing other clinical prediction models.
Collapse
Affiliation(s)
- Xueqin Pang
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, USA.
| | - Christopher B Forrest
- Applied Clinical Research Center, Children's Hospital of Philadelphia, Philadelphia, USA; Department of Anesthesiology & Critical Care Medicine, Perelman School of Medicine University of Pennsylvania, Philadelphia, USA
| | - Félice Lê-Scherban
- Department of Epidemiology and Biostatistics, Dornsife School of Public Health, Drexel University, Philadelphia, USA; Drexel Urban Health Collaborative, Drexel University, Philadelphia, USA
| | - Aaron J Masino
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, USA; Department of Anesthesiology & Critical Care Medicine, Perelman School of Medicine University of Pennsylvania, Philadelphia, USA
| |
Collapse
|
36
|
LeCroy MN, Kim RS, Stevens J, Hanna DB, Isasi CR. Identifying Key Determinants of Childhood Obesity: A Narrative Review of Machine Learning Studies. Child Obes 2021; 17:153-159. [PMID: 33661719 PMCID: PMC8418446 DOI: 10.1089/chi.2020.0324] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Machine learning is a class of algorithms able to handle a large number of predictors with potentially nonlinear relationships. By applying machine learning to obesity, researchers can examine how risk factors across multiple settings (e.g., school and home) interact to best predict childhood obesity risk. In this narrative review, we provide an overview of studies that have applied machine learning to predict childhood obesity using a combination of sociodemographic and behavioral risk factors. The objective is to summarize the key determinants of obesity identified in existing machine learning studies and highlight opportunities for future machine learning applications in the field. Of 15 peer-reviewed studies, approximately half examined early childhood (0-24 months of age) determinants. These studies identified child's weight history (e.g., history of overweight/obesity or large increases in weight-related measures between birth and 24 months of age) and parental overweight/obesity (current or prior) as key risk factors, whereas the remaining studies indicated that social factors and physical inactivity were important in middle childhood and late childhood/adolescence. Across age groups, findings suggested that race/ethnic-specific models may be needed to accurately predict obesity from middle childhood onward. Future studies should consider using existing large data sets to take advantage of the benefits of machine learning and should collect a wider range of novel risk factors (e.g., psychosocial and sociocultural determinants of health) to better predict childhood obesity. Ultimately, such research can aid in the development of effective obesity prevention interventions, particularly ones that address the disproportionate burden of obesity experienced by racial/ethnic minorities.
Collapse
Affiliation(s)
- Madison N. LeCroy
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA.,Address correspondence to: Madison N. LeCroy, PhD, Department of Epidemiology and Population Health, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA
| | - Ryung S. Kim
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
| | - June Stevens
- Department of Nutrition and Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - David B. Hanna
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Carmen R. Isasi
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
| |
Collapse
|
37
|
Kim HH, Kim Y, Park YR. Interpretable Conditional Recurrent Neural Network for Weight Change Prediction: Algorithm Development and Validation Study. JMIR Mhealth Uhealth 2021; 9:e22183. [PMID: 33779574 PMCID: PMC8088842 DOI: 10.2196/22183] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Revised: 10/15/2020] [Accepted: 01/17/2021] [Indexed: 01/19/2023] Open
Abstract
Background In recent years, mobile-based interventions have received more attention as an alternative to on-site obesity management. Despite increased mobile interventions for obesity, there are lost opportunities to achieve better outcomes due to the lack of a predictive model using current existing longitudinal and cross-sectional health data. Noom (Noom Inc) is a mobile app that provides various lifestyle-related logs including food logging, exercise logging, and weight logging. Objective The aim of this study was to develop a weight change predictive model using an interpretable artificial intelligence algorithm for mobile-based interventions and to explore contributing factors to weight loss. Methods Lifelog mobile app (Noom) user data of individuals who used the weight loss program for 16 weeks in the United States were used to develop an interpretable recurrent neural network algorithm for weight prediction that considers both time-variant and time-fixed variables. From a total of 93,696 users in the coaching program, we excluded users who did not take part in the 16-week weight loss program or who were not overweight or obese or had not entered weight or meal records for the entire 16-week program. This interpretable model was trained and validated with 5-fold cross-validation (training set: 70%; testing: 30%) using the lifelog data. Mean absolute percentage error between actual weight loss and predicted weight was used to measure model performance. To better understand the behavior factors contributing to weight loss or gain, we calculated contribution coefficients in test sets. Results A total of 17,867 users’ data were included in the analysis. The overall mean absolute percentage error of the model was 3.50%, and the error of the model declined from 3.78% to 3.45% by the end of the program. The time-level attention weighting was shown to be equally distributed at 0.0625 each week, but this gradually decreased (from 0.0626 to 0.0624) as it approached 16 weeks. Factors such as usage pattern, weight input frequency, meal input adherence, exercise, and sharp decreases in weight trajectories had negative contribution coefficients of –0.021, –0.032, –0.015, and –0.066, respectively. For time-fixed variables, being male had a contribution coefficient of –0.091. Conclusions An interpretable algorithm, with both time-variant and time-fixed data, was used to precisely predict weight loss while preserving model transparency. This week-to-week prediction model is expected to improve weight loss and provide a global explanation of contributing factors, leading to better outcomes.
Collapse
Affiliation(s)
- Ho Heon Kim
- Department of Biomedical Systems Informatics, College of Medicine, Yonsei University, Seoul, Republic of Korea
| | | | - Yu Rang Park
- Department of Biomedical Systems Informatics, College of Medicine, Yonsei University, Seoul, Republic of Korea
| |
Collapse
|
38
|
Saberi-Karimian M, Khorasanchi Z, Ghazizadeh H, Tayefi M, Saffar S, Ferns GA, Ghayour-Mobarhan M. Potential value and impact of data mining and machine learning in clinical diagnostics. Crit Rev Clin Lab Sci 2021; 58:275-296. [PMID: 33739235 DOI: 10.1080/10408363.2020.1857681] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Data mining involves the use of mathematical sciences, statistics, artificial intelligence, and machine learning to determine the relationships between variables from a large sample of data. It has previously been shown that data mining can improve the prediction and diagnostic precision of type 2 diabetes mellitus. A few studies have applied machine learning to assess hypertension and metabolic syndrome-related biomarkers, as well as refine the assessment of cardiovascular disease risk. Machine learning methods have also been applied to assess new biomarkers and survival outcomes in patients with renal diseases to predict the development of chronic kidney disease, disease progression, and renal graft survival. In the latter, random forest methods were found to be the best for the prediction of chronic kidney disease. Some studies have investigated the prognosis of nonalcoholic fatty liver disease and acute liver failure, as well as therapy response prediction in patients with viral disorders, using decision tree models. Machine learning techniques, such as Sparse High-Order Interaction Model with Rejection Option, have been used for diagnosing Alzheimer's disease. Data mining techniques have also been applied to identify the risk factors for serious mental illness, such as depression and dementia, and help to diagnose and predict the quality of life of such patients. In relation to child health, some studies have determined the best algorithms for predicting obesity and malnutrition. Machine learning has determined the important risk factors for preterm birth and low birth weight. Published studies of patients with cancer and bacterial diseases are limited and should perhaps be addressed more comprehensively in future studies. Herein, we provide an in-depth review of studies in which biochemical biomarker data were analyzed using machine learning methods to assess the risk of several common diseases, in order to summarize the potential applications of data mining methods in clinical diagnosis. Data mining techniques have now been increasingly applied to clinical diagnostics, and they have the potential to support this field.
Collapse
Affiliation(s)
- Maryam Saberi-Karimian
- International UNESCO Center for Health Related Basic Sciences and Human Nutrition, Mashhad University of Medical Sciences, Mashhad, Iran.,Student Research Committee, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Zahra Khorasanchi
- Department of Nutrition, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Hamideh Ghazizadeh
- International UNESCO Center for Health Related Basic Sciences and Human Nutrition, Mashhad University of Medical Sciences, Mashhad, Iran.,Student Research Committee, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Maryam Tayefi
- Norwegian Center for e-health Research, University Hospital of North Norway, Tromsø, Norway
| | - Sara Saffar
- International UNESCO Center for Health Related Basic Sciences and Human Nutrition, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Gordon A Ferns
- Division of Medical Education, Brighton and Sussex Medical School, Falmer, UK
| | - Majid Ghayour-Mobarhan
- International UNESCO Center for Health Related Basic Sciences and Human Nutrition, Mashhad University of Medical Sciences, Mashhad, Iran
| |
Collapse
|
39
|
Use of Machine Learning to Determine the Information Value of a BMI Screening Program. Am J Prev Med 2021; 60:425-433. [PMID: 33483154 PMCID: PMC8610445 DOI: 10.1016/j.amepre.2020.10.016] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Revised: 10/13/2020] [Accepted: 10/14/2020] [Indexed: 12/12/2022]
Abstract
INTRODUCTION Childhood obesity continues to be a significant public health issue in the U.S. and is associated with short- and long-term adverse health outcomes. A number of states have implemented school-based BMI screening programs. However, these programs have been criticized for not being effective in improving students' BMI or reducing childhood obesity. One potential benefit, however, of screening programs is the identification of younger children at risk of obesity as they age. METHODS This study used a unique panel data set from the BMI screening program for public school children in the state of Arkansas collected from 2003 to 2004 through the 2018-2019 academic years and analyzed in 2020. Machine learning algorithms were applied to understand the informational value of BMI screening. Specifically, this study evaluated the importance of BMI information during kindergarten to the accurate prediction of childhood obesity by the 4th grade. RESULTS Kindergarten BMI z-score is the most important predictor of obesity by the 4th grade and is much more important to prediction than sociodemographic and socioeconomic variables that would otherwise be available to policymakers in the absence of the screening program. Including the kindergarten BMI z-score of students in the model meaningfully increases the accuracy of the prediction. CONCLUSIONS Data from the Arkansas BMI screening program greatly improve the ability to identify children at greatest risk of future obesity to the extent that better prediction can be translated into more effective policy and better health outcomes. This is a heretofore unexamined benefit of school-based BMI screening.
Collapse
|
40
|
Rose S. Intersections of machine learning and epidemiological methods for health services research. Int J Epidemiol 2021; 49:1763-1770. [PMID: 32236476 PMCID: PMC7825941 DOI: 10.1093/ije/dyaa035] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/17/2020] [Indexed: 12/15/2022] Open
Abstract
The field of health services research is broad and seeks to answer questions about the health care system. It is inherently interdisciplinary, and epidemiologists have made crucial contributions. Parametric regression techniques remain standard practice in health services research with machine learning techniques currently having low penetrance in comparison. However, studies in several prominent areas, including health care spending, outcomes and quality, have begun deploying machine learning tools for these applications. Nevertheless, major advances in epidemiological methods are also as yet underleveraged in health services research. This article summarizes the current state of machine learning in key areas of health services research, and discusses important future directions at the intersection of machine learning and epidemiological methods for health services research.
Collapse
Affiliation(s)
- Sherri Rose
- Department of Health Care Policy, Harvard Medical School, 180 Longwood Ave, Boston, MA, 02115, USA
| |
Collapse
|
41
|
Ranking of a wide multidomain set of predictor variables of children obesity by machine learning variable importance techniques. Sci Rep 2021; 11:1910. [PMID: 33479310 PMCID: PMC7820584 DOI: 10.1038/s41598-021-81205-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2020] [Accepted: 01/04/2021] [Indexed: 12/14/2022] Open
Abstract
The increased prevalence of childhood obesity is expected to translate in the near future into a concomitant soaring of multiple cardio-metabolic diseases. Obesity has a complex, multifactorial etiology, that includes multiple and multidomain potential risk factors: genetics, dietary and physical activity habits, socio-economic environment, lifestyle, etc. In addition, all these factors are expected to exert their influence through a specific and especially convoluted way during childhood, given the fast growth along this period. Machine Learning methods are the appropriate tools to model this complexity, given their ability to cope with high-dimensional, non-linear data. Here, we have analyzed by Machine Learning a sample of 221 children (6–9 years) from Madrid, Spain. Both Random Forest and Gradient Boosting Machine models have been derived to predict the body mass index from a wide set of 190 multidomain variables (including age, sex, genetic polymorphisms, lifestyle, socio-economic, diet, exercise, and gestation ones). A consensus relative importance of the predictors has been estimated through variable importance measures, implemented robustly through an iterative process that included permutation and multiple imputation. We expect this analysis will help to shed light on the most important variables associated to childhood obesity, in order to choose better treatments for its prevention.
Collapse
|
42
|
Khondaker MTI, Khan JY, Refaee MA, Hajj NE, Rahman MS, Alam T. Obesity in Qatar: A Case-Control Study on the Identification of Associated Risk Factors. Diagnostics (Basel) 2020; 10:diagnostics10110883. [PMID: 33138081 PMCID: PMC7693222 DOI: 10.3390/diagnostics10110883] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Revised: 10/20/2020] [Accepted: 10/23/2020] [Indexed: 12/11/2022] Open
Abstract
Obesity is an emerging public health problem in the Western world as well as in the Gulf region. Qatar, a tiny wealthy county, is among the top-ranked obese countries with a high obesity rate among its population. Compared to Qatar's severity of this health crisis, only a limited number of studies focused on the systematic identification of potential risk factors using multimodal datasets. This study aims to develop machine learning (ML) models to distinguish healthy from obese individuals and reveal potential risk factors associated with obesity in Qatar. We designed a case-control study focused on 500 Qatari subjects, comprising 250 obese and 250 healthy individuals- the later forming the control group. We obtained the most extensive collection of clinical measurements for the Qatari population from the Qatar Biobank (QBB) repertoire, including (i) Physio-clinical Biomarkers, (ii) Spirometry, (iii) VICORDER, (iv) DXA scan composition, and (v) DXA scan densitometry readings. We developed several machine learning (ML) models to distinguish healthy from obese individuals and applied multiple feature selection techniques to identify potential risk factors associated with obesity. The proposed ML model achieved over 90% accuracy, thereby outperforming the existing state of the art models. The outcome from the ablation study on multimodal clinical datasets revealed physio-clinical measurements as the most influential risk factors in distinguishing healthy versus obese subjects. Furthermore, multiple feature ranking techniques confirmed known obesity risk factors (c-peptide, insulin, albumin, uric acid) and identified potential risk factors linked to obesity-related comorbidities such as diabetes (e.g., HbA1c, glucose), liver function (e.g., alkaline phosphatase, gamma-glutamyl transferase), lipid profile (e.g., triglyceride, low density lipoprotein cholesterol, high density lipoprotein cholesterol), etc. Most of the DXA measurements (e.g., bone area, bone mineral composition, bone mineral density, etc.) were significantly (p-value < 0.05) higher in the obese group. Overall, the net effect of hypothesized protective factors of obesity on bone mass seems to have surpassed the hypothesized harmful factors. All the identified factors warrant further investigation in a clinical setup to understand their role in obesity.
Collapse
Affiliation(s)
- Md. Tawkat Islam Khondaker
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh; (M.T.I.K.); (J.Y.K.); (M.S.R.)
| | - Junaed Younus Khan
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh; (M.T.I.K.); (J.Y.K.); (M.S.R.)
| | - Mahmoud Ahmed Refaee
- College of Science and Engineering, Hamad Bin Khalifa University, Doha 34110, Qatar;
- Geriatric Department, Hamad Medical Corporation, Doha 3050, Qatar
- Faculty of Medicine, Ain Shams University, Alabasia 38, Cairo, Egypt
| | - Nady El Hajj
- College of Health and Life Sciences, Hamad Bin Khalifa University, Doha 34110, Qatar;
| | - M. Sohel Rahman
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh; (M.T.I.K.); (J.Y.K.); (M.S.R.)
| | - Tanvir Alam
- College of Science and Engineering, Hamad Bin Khalifa University, Doha 34110, Qatar;
- Correspondence:
| |
Collapse
|
43
|
Colmenarejo G. Machine Learning Models to Predict Childhood and Adolescent Obesity: A Review. Nutrients 2020; 12:E2466. [PMID: 32824342 PMCID: PMC7469049 DOI: 10.3390/nu12082466] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Revised: 08/09/2020] [Accepted: 08/13/2020] [Indexed: 12/19/2022] Open
Abstract
The prevalence of childhood and adolescence overweight an obesity is raising at an alarming rate in many countries. This poses a serious threat to the current and near-future health systems, given the association of these conditions with different comorbidities (cardiovascular diseases, type II diabetes, and metabolic syndrome) and even death. In order to design appropriate strategies for its prevention, as well as understand its origins, the development of predictive models for childhood/adolescent overweight/obesity and related outcomes is of extreme value. Obesity has a complex etiology, and in the case of childhood and adolescence obesity, this etiology includes also specific factors like (pre)-gestational ones; weaning; and the huge anthropometric, metabolic, and hormonal changes that during this period the body suffers. In this way, Machine Learning models are becoming extremely useful tools in this area, given their excellent predictive power; ability to model complex, nonlinear relationships between variables; and capacity to deal with high-dimensional data typical in this area. This is especially important given the recent appearance of large repositories of Electronic Health Records (EHR) that allow the development of models using datasets with many instances and predictor variables, from which Deep Learning variants can generate extremely accurate predictions. In the current work, the area of Machine Learning models to predict childhood and adolescent obesity and related outcomes is comprehensively and critically reviewed, including the latest ones using Deep Learning with EHR. These models are compared with the traditional statistical ones that used mainly logistic regression. The main features and applications appearing from these models are described, and the future opportunities are discussed.
Collapse
Affiliation(s)
- Gonzalo Colmenarejo
- Biostatistics and Bioinformatics Unit, IMDEA Food, CEI UAM+CSIC, E28049 Madrid, Spain
| |
Collapse
|
44
|
Computerized decision support and machine learning applications for the prevention and treatment of childhood obesity: A systematic review of the literature. Artif Intell Med 2020; 104:101844. [DOI: 10.1016/j.artmed.2020.101844] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2019] [Revised: 03/09/2020] [Accepted: 03/12/2020] [Indexed: 12/20/2022]
|
45
|
Williams A, Turer C, Smith J, Nievera I, McCulloch L, Wareg N, Clary M, Rajagopalan A, Brownson RC, Koopman RJ, Hampl S. Adoption of an Electronic Medical Record Tool for Childhood Obesity by Primary Care Providers. Appl Clin Inform 2020; 11:210-217. [PMID: 32187633 DOI: 10.1055/s-0040-1705106] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Primary care providers are tasked with the increasingly difficult job of addressing childhood obesity during clinic visits. Electronic medical record (EMR)-enabled decision-support tools may aid providers in this task; however, information is needed regarding whether providers perceive such tools to be useful for addressing nutrition and physical activity lifestyle behaviors. OBJECTIVES This study aimed to evaluate the usefulness and usability of FitTastic, an EMR-enabled tool to support prevention and management of childhood obesity in primary care. METHODS In this mixed-method study, we implemented the FitTastic tool in two primary-care clinics, then surveyed and conducted focused interviews with providers. Validated Technology Acceptance Model perceived usefulness and National Aeronautics and Space Administration (NASA) perceived usability survey questions were e-mailed to 60 providers. In-depth provider interviews with family medicine and pediatric physicians (n = 12) were used to further probe adoption of FitTastic. RESULTS Surveys were completed by 73% of providers (n = 44). The mean score for FitTastic's usefulness was 3.3 (standard deviation [SD] = 0.54, scale 1-5, where 5 is strongly agree) and usability, 4.8 (SD = 0.86, scale 1-7, where 7 is strongly agree). Usefulness and usability scores were associated with intention to use FitTastic (correlation for both, p < 0.05). Data from provider interviews indicated that useful features of FitTastic included: standardizing the approach to childhood obesity, and facilitating conversations about weight management, without increasing cognitive workload. However, use of FitTastic required more time from nurses to input lifestyle data. CONCLUSION FitTastic is perceived as a useful and usable EMR-based lifestyle behavior tool that standardizes, facilitates, and streamlines healthy lifestyle conversations with families. Perceived usability and usefulness scores correlated with provider intention-to-use the technology. These data suggest that EMR-based child obesity prevention and management tools can be feasible to use in the clinic setting, with potential for scalability. Usefulness can be optimized by limiting amount of time needed by staff to input data.
Collapse
Affiliation(s)
- Amy Williams
- Department of Family and Community Medicine, University of Missouri, Columbia, Missouri, United States
| | - Christy Turer
- Department of Internal Medicine-Pediatrics, University of Texas Southwestern, Dallas, Texas, United States
| | - Jamie Smith
- Department of Family and Community Medicine, University of Missouri, Columbia, Missouri, United States
| | - Isabelle Nievera
- University of Missouri School of Medicine, Columbia, Missouri, United States
| | - Laura McCulloch
- Columbia/Boone County Public Health and Human Services, Columbia, Missouri, United States
| | - Nuha Wareg
- Department of Family and Community Medicine, University of Missouri, Columbia, Missouri, United States
| | - Megan Clary
- Department of Child Health, University of Missouri, Columbia, Missouri, United States
| | - Anuradha Rajagopalan
- Department of Child Health, University of Missouri, Columbia, Missouri, United States
| | - Ross C Brownson
- Department of Surgery and Alvin J. Siteman Cancer Center, Washington University in St. Louis, St. Louis, Missouri, United States.,Division of Public Health Sciences, Department of Surgery, Alvin J. Siteman Cancer Center, Washington University School of Medicine, Washington University in St. Louis, St. Louis, Missouri, United States
| | - Richelle J Koopman
- Department of Family and Community Medicine, University of Missouri, Columbia, Missouri, United States
| | - Sarah Hampl
- General Pediatrics and Weight Management, Children's Mercy Hospital Center for Children's Healthy Lifestyles and Nutrition, Kansas City, Missouri, United States.,Department of Pediatrics, University of MO-Kansas City School of Medicine, Kansas City, Missouri, United States
| |
Collapse
|
46
|
Goulooze SC, Zwep LB, Vogt JE, Krekels EHJ, Hankemeier T, van den Anker JN, Knibbe CAJ. Beyond the Randomized Clinical Trial: Innovative Data Science to Close the Pediatric Evidence Gap. Clin Pharmacol Ther 2020; 107:786-795. [PMID: 31863465 DOI: 10.1002/cpt.1744] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Accepted: 11/22/2019] [Indexed: 12/13/2022]
Abstract
Despite the application of advanced statistical and pharmacometric approaches to pediatric trial data, a large pediatric evidence gap still remains. Here, we discuss how to collect more data from children by using real-world data from electronic health records, mobile applications, wearables, and social media. The large datasets collected with these approaches enable and may demand the use of artificial intelligence and machine learning to allow the data to be analyzed for decision making. Applications of this approach are presented, which include the prediction of future clinical complications, medical image analysis, identification of new pediatric end points and biomarkers, the prediction of treatment nonresponders, and the prediction of placebo-responders for trial enrichment. Finally, we discuss how to bring machine learning from science to pediatric clinical practice. We conclude that advantage should be taken of the current opportunities offered by innovations in data science and machine learning to close the pediatric evidence gap.
Collapse
Affiliation(s)
- Sebastiaan C Goulooze
- Division of Systems Biomedicine and Pharmacology, Leiden Academic Centre for Drug Research, Leiden University, Leiden, The Netherlands
| | - Laura B Zwep
- Division of Systems Biomedicine and Pharmacology, Leiden Academic Centre for Drug Research, Leiden University, Leiden, The Netherlands.,Mathematical Institute, Leiden University, Leiden, The Netherlands
| | - Julia E Vogt
- Medical Data Science Group, Department of Computer Science, ETH Zurich, Zurich, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Elke H J Krekels
- Division of Systems Biomedicine and Pharmacology, Leiden Academic Centre for Drug Research, Leiden University, Leiden, The Netherlands
| | - Thomas Hankemeier
- Division of Systems Biomedicine and Pharmacology, Leiden Academic Centre for Drug Research, Leiden University, Leiden, The Netherlands
| | - John N van den Anker
- Division of Clinical Pharmacology, Children's National Health System, Washington, District of Columbia, USA.,Paediatric Pharmacology and Pharmacometrics Research Program, University of Basel Children's Hospital, Basel, Switzerland
| | - Catherijne A J Knibbe
- Division of Systems Biomedicine and Pharmacology, Leiden Academic Centre for Drug Research, Leiden University, Leiden, The Netherlands.,Department of Clinical Pharmacy, St. Antonius Hospital, Nieuwegein, The Netherlands
| |
Collapse
|
47
|
Krzhizhanovskaya VV, Závodszky G, Lees MH, Dongarra JJ, Sloot PMA, Brissos S, Teixeira J. Machine Learning Approach for the Early Prediction of the Risk of Overweight and Obesity in Young People. LECTURE NOTES IN COMPUTER SCIENCE 2020. [PMCID: PMC7303691 DOI: 10.1007/978-3-030-50423-6_39] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
48
|
Cervantes RC, Palacio UM. Estimation of obesity levels based on computational intelligence. INFORMATICS IN MEDICINE UNLOCKED 2020. [DOI: 10.1016/j.imu.2020.100472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
|
49
|
Butler ÉM, Derraik JGB, Taylor RW, Cutfield WS. Prediction Models for Early Childhood Obesity: Applicability and Existing Issues. Horm Res Paediatr 2019; 90:358-367. [PMID: 30739117 DOI: 10.1159/000496563] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/03/2018] [Accepted: 01/03/2019] [Indexed: 11/19/2022] Open
Abstract
Statistical models have been developed for the prediction or diagnosis of a wide range of outcomes. However, to our knowledge, only 7 published studies have reported models to specifically predict overweight and/or obesity in early childhood. These models were developed using known risk factors and vary greatly in terms of their discrimination and predictive capacities. There are currently no established guidelines on what constitutes an acceptable level of risk (i.e., risk threshold) for childhood obesity prediction models, but these should be set following consideration of the consequences of false-positive and false-negative predictions, as well as any relevant clinical guidelines. To date, no studies have examined the impact of using early childhood obesity prediction models as intervention tools. While these are potentially valuable to inform targeted interventions, the heterogeneity of the existing models and the lack of consensus on adequate thresholds limit their usefulness in practice.
Collapse
Affiliation(s)
- Éadaoin M Butler
- A Better Start - National Science Challenge, New Zealand.,Liggins Institute, University of Auckland, Auckland, New Zealand
| | - José G B Derraik
- A Better Start - National Science Challenge, New Zealand, .,Liggins Institute, University of Auckland, Auckland, New Zealand, .,Department of Women's and Children's Health, Uppsala University, Uppsala, Sweden,
| | - Rachael W Taylor
- A Better Start - National Science Challenge, New Zealand.,Department of Medicine, University of Otago, Dunedin, New Zealand
| | - Wayne S Cutfield
- A Better Start - National Science Challenge, New Zealand.,Liggins Institute, University of Auckland, Auckland, New Zealand
| |
Collapse
|
50
|
Palechor FM, Manotas ADLH. Dataset for estimation of obesity levels based on eating habits and physical condition in individuals from Colombia, Peru and Mexico. Data Brief 2019; 25:104344. [PMID: 31467953 PMCID: PMC6710633 DOI: 10.1016/j.dib.2019.104344] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2019] [Revised: 07/23/2019] [Accepted: 07/25/2019] [Indexed: 11/24/2022] Open
Abstract
This paper presents data for the estimation of obesity levels in individuals from the countries of Mexico, Peru and Colombia, based on their eating habits and physical condition. The data contains 17 attributes and 2111 records, the records are labeled with the class variable NObesity (Obesity Level), that allows classification of the data using the values of Insufficient Weight, Normal Weight, Overweight Level I, Overweight Level II, Obesity Type I, Obesity Type II and Obesity Type III. 77% of the data was generated synthetically using the Weka tool and the SMOTE filter, 23% of the data was collected directly from users through a web platform. This data can be used to generate intelligent computational tools to identify the obesity level of an individual and to build recommender systems that monitor obesity levels. For discussion and more information of the dataset creation, please refer to the full-length article “Obesity Level Estimation Software based on Decision Trees” (De-La-Hoz-Correa et al., 2019).
Collapse
|