1
|
Tamanna T, Mahmud S, Salma N, Hossain MM, Karim MR. Identifying determinants of malnutrition in under-five children in Bangladesh: insights from the BDHS-2022 cross-sectional study. Sci Rep 2025; 15:14336. [PMID: 40274916 DOI: 10.1038/s41598-025-99288-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2025] [Accepted: 04/18/2025] [Indexed: 04/26/2025] Open
Abstract
Malnutrition remains one of the most pressing global health challenges, particularly in developing countries like Bangladesh, where it continues to significantly impact child health and contribute to chronic illness and high child mortality. Despite the potential of machine learning to improve malnutrition predictions, research in this area remains limited in the country. This study utilizes Bangladesh Demographic and Health Survey (BDHS) 2022 data to identify and quantify key determinants of under-five malnutrition (underweight, wasting, stunting) and evaluates various machine learning models for predicting malnutrition. By addressing a critical gap, this research provides deeper insights into the root causes of malnutrition in Bangladesh. Descriptive statistics were conducted to summarize the key characteristics of the dataset. Boruta algorithm was employed to identify important features related to malnutrition which were then used to evaluate several machine learning models, including K-Nearest Neighbors (KNN), Neural Networks (NN), Classification and Regression Trees (CART), XGBoost (XGBM), Support Vector Machines (SVM), and Random Forest (RF), in addition to the traditional logistic regression (LR) model. The best-performing model was selected to identify the most important factors contributing to malnutrition. The significance of these variables was further assessed using Feature Importance plot (Based on Gini Importance) and Shapley Additive Explanation (SHAP) values. Model performance was evaluated through various metrics, including accuracy, 95% Confidence Interval (CI), Cohen's kappa, sensitivity, specificity, F1 score and precision. The study examined a cohort of 7,910 children, reporting prevalence rates of 19% for stunting, 8% for wasting, and 17% for underweight. The Boruta algorithm identified 18 confirmed features for stunting, 22 for wasting, and 19 for underweight. For stunting, the Random Forest (RF) model outperformed other methods with an accuracy of 64.19%, 95% CI of (0.623, 0.666), Cohen's kappa of 0.158, sensitivity of 56.25%, specificity of 66.00%, F1 score of 0.750 and precision of 0.60. In wasting prediction, RF achieved the highest accuracy at 76.68%, 95% CI of (0.743, 0.787), Cohen's kappa of 0.049, sensitivity of 27.22%, specificity of 80.98%, F1 score of 0.865 and precision of 0.810. Similarly, for underweight, RF demonstrated superior performance with an accuracy of 68.18%, 95% CI of (0.662, 0.703), Cohen's kappa of 0.130, sensitivity of 43.02%, specificity of 73.48%, F1 score of 0.792 and precision of 0.735. Across all malnutrition types, the RF model consistently outperformed traditional logistic regression (LR) and other ML techniques in terms of accuracy, sensitivity, specificity, and other performance metrics. For stunting, key predictors identified in both the Shapley and Gini importance plots included mother's education, father's occupation, place of delivery, wealth index, birth order, and toilet facility; for wasting, significant predictors were antenatal care, unmet family planning, mother's BMI, birth interval, father's occupation, and television ownership; and for underweight, important factors included father's occupation, mother's education, child's age, birth order, wealth index, and place of delivery. This study highlights the effectiveness of Random Forest (RF) in predicting malnutrition outcomes-stunting, wasting, and underweight-using key features identified by the Boruta algorithm. While RF demonstrates moderate performance in predicting stunting and underweight, it shows strong predictive ability for wasting. This underscores RF's potential in guiding targeted interventions for wasting, though further improvements are needed for stunting and underweight predictions. Moreover, the study identifies key contributors for each malnutrition outcome. By pinpointing these determinants, the study provides actionable insights for designing targeted interventions to combat malnutrition more effectively. These findings align with the global development agenda, particularly Sustainable Development Goal (SDG) 2: Zero Hunger and SDG 3: Good Health and Well-being, reinforcing efforts to reduce malnutrition and improve child health outcomes in Bangladesh.
Collapse
Affiliation(s)
- Tanzila Tamanna
- Department of Statistics and Data Science, Jahangirnagar University, Savar, Dhaka, 1342, Bangladesh
| | - Shohel Mahmud
- Department of Statistics and Data Science, Jahangirnagar University, Savar, Dhaka, 1342, Bangladesh
- Department of Statistics, Noakhali Science and Technology University, Noakhali, Bangladesh
| | - Nahid Salma
- Department of Statistics and Data Science, Jahangirnagar University, Savar, Dhaka, 1342, Bangladesh.
| | - Md Musharraf Hossain
- Department of Pediatrics, Netrokona District Hospital, Netrokona, 2400, Bangladesh
| | - Md Rezaul Karim
- Department of Statistics and Data Science, Jahangirnagar University, Savar, Dhaka, 1342, Bangladesh
| |
Collapse
|
2
|
de Luis Román D, López Gómez JJ, Barajas Galindo DE, García García C. Role of artificial intelligence in predicting disease-related malnutrition - A narrative review. NUTR HOSP 2025; 42:173-183. [PMID: 39873467 DOI: 10.20960/nh.05672] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2025] Open
Abstract
Introduction Background: disease-related malnutrition (DRM) affects 30-50 % of hospitalized patients and is often underdiagnosed, increasing risks of complications and healthcare costs. Traditional DRM detection has relied on manual methods that lack accuracy and efficiency. Objective: this narrative review explores how artificial intelligence (AI), specifically machine learning (ML) and deep learning (DL), can transform the prediction and management of DRM in clinical settings. Methods: we examine widely used ML and DL models, assessing their clinical applicability, advantages, and limitations. The integration of these models into electronic health record systems allows for automated risk detection and optimizes real-time patient management. Results: ML and DL models show significant potential for accurate assessment of nutritional status and prediction of complications in patients with DRM. These models facilitate improved clinical decision-making and more efficient resource management, although their implementation faces challenges related to the need for large volumes of standardized data and integration with existing systems. Conclusion: AI offers promising prospects for proactive DRM management, highlighting the need for interdisciplinary collaboration to overcome existing barriers and maximize its positive impact on patient care.
Collapse
Affiliation(s)
- Daniel de Luis Román
- Servicio de Endocrinología y Nutrición. Hospital Clínico Universitario de Valladolid. Centro de Investigación en Endocrinología y Nutrición Clínica (IENVA). Facultad de Medicina. Universidad de Valladolid
| | - Juan José López Gómez
- Servicio de Endocrinología y Nutrición. Hospital Clínico Universitario de Valladolid. Centro de Investigación Endocrinología y Nutrición. Universidad de Valladolid
| | | | | |
Collapse
|
3
|
Arya PK, Sur K, Kundu T, Dhote S, Singh SK. Unveiling predictive factors for household-level stunting in India: A machine learning approach using NFHS-5 and satellite-driven data. Nutrition 2025; 132:112674. [PMID: 39848008 DOI: 10.1016/j.nut.2024.112674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2024] [Revised: 12/12/2024] [Accepted: 12/17/2024] [Indexed: 01/25/2025]
Abstract
OBJECTIVES Childhood stunting remains a significant public health issue in India, affecting approximately 35% of children under 5. Despite extensive research, existing prediction models often fail to incorporate diverse data sources and address the complex interplay of socioeconomic, demographic, and environmental factors. This study bridges this gap by employing machine learning methods to predict stunting at the household level, using data from the National Family Health Survey combined with satellite-driven datasets. METHODS We used four machine learning models-random forest regression, support vector machine regression, K-nearest neighbors regression, and regularized linear regression-to examine the impact of various factors on stunting. The random forest regression model demonstrated the highest predictive accuracy and robustness. RESULTS The proportion of households below the poverty line and the dependency ratio consistently predicted stunting across all models, underscoring the importance of economic status and household structure. Moreover, the educational level of the household head and environmental variables such as average temperature and leaf area index were significant contributors. Spatial analysis revealed significant geographic clustering of high-stunting districts, notably in central and eastern India, further emphasizing the role of regional socioeconomic and environmental factors. Notably, environmental variables like average temperature and leaf area index emerged as strong predictors of stunting, highlighting how regional climate and vegetation conditions shape nutritional outcomes. CONCLUSIONS These findings underline the importance of comprehensive interventions that not only address socioeconomic inequities but also consider environmental factors, such as climate and vegetation, to effectively combat childhood stunting in India.
Collapse
Affiliation(s)
- Prashant Kumar Arya
- Institute for Human Development, Delhi, India; ICSSR Post-Doctoral Fellow, Central University of Jharkhand, Ranchi, India.
| | - Koyel Sur
- Geospatial Resource Mapping and Application Group, Punjab Remote Sensing Centre, Punjab, India.
| | | | | | | |
Collapse
|
4
|
Rao B, Rashid M, Hasan MG, Thunga G. Machine Learning in Predicting Child Malnutrition: A Meta-Analysis of Demographic and Health Surveys Data. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2025; 22:449. [PMID: 40238576 PMCID: PMC11941938 DOI: 10.3390/ijerph22030449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/12/2025] [Revised: 02/25/2025] [Accepted: 02/25/2025] [Indexed: 04/18/2025]
Abstract
BACKGROUND Childhood malnutrition remains a significant global public health concern. The Demographic and Health Surveys (DHS) program provides specific data on child health across numerous countries. This meta-analysis aims to comprehensively assess machine learning (ML) applications in DHS data to predict malnutrition in children. METHODS A comprehensive search of the peer-reviewed literature in PubMed, Embase, and Scopus databases was conducted in January 2024. Studies employing ML algorithms on DHS data to predict malnutrition in children under 5 years were included. Using PROBAST (Prediction model Risk Of Bias Assessment Tool), the quality of the listed studies was evaluated. To conduct meta-analyses, Review Manager 5.4 was used. RESULTS A total of 11 out of 789 studies were included in this review. The studies were published between 2019 and 2023, with the major contribution from Bangladesh (n = 6, 55%). Of these, ten studies reported stunting, three reported wasting, and four reported underweight. A meta-analysis of ten studies reported a pooled accuracy of 68.92% (95% CI: 66.04, 71.80; I2 = 100%) among ML models for predicting stunting in children. Three studies indicated a pooled accuracy of 84.39% (95% CI: 80.90, 87.87; I2 = 100%) in predicting wasting. A meta-analysis of four studies indicated a pooled accuracy of 73.60% (95% CI: 70.01, 77.20; I2 = 100%) for ML models predicting underweight status in children. CONCLUSIONS This meta-analysis indicated that ML models were observed to have moderate to good performance metrics in predicting malnutrition using DHS data among children under five years.
Collapse
Affiliation(s)
- Bhagyajyothi Rao
- Department of Applied Statistics & Data Science, Prasanna School of Public Health, Manipal Academy of Higher Education, Manipal 576104, India;
| | - Muhammad Rashid
- Department of Pharmacy Practice, Manipal College of Pharmaceutical Sciences, Manipal Academy of Higher Education, Manipal 576104, India; (M.R.); (G.T.)
- Department of Pharmacotherapy, College of Pharmacy, University of Utah, Salt Lake City, UT 84112, USA
| | - Md Gulzarull Hasan
- Department of Applied Statistics & Data Science, Prasanna School of Public Health, Manipal Academy of Higher Education, Manipal 576104, India;
| | - Girish Thunga
- Department of Pharmacy Practice, Manipal College of Pharmaceutical Sciences, Manipal Academy of Higher Education, Manipal 576104, India; (M.R.); (G.T.)
- Centre for Toxicovigilance and Drug Safety, Manipal College of Pharmaceutical Sciences, Manipal Academy of Higher Education, Manipal 576104, India
| |
Collapse
|
5
|
Aanjankumar S, Sathyamoorthy M, Dhanaraj RK, Surjit Kumar SR, Poonkuntran S, Khadidos AO, Selvarajan S. Prediction of malnutrition in kids by integrating ResNet-50-based deep learning technique using facial images. Sci Rep 2025; 15:7871. [PMID: 40050339 PMCID: PMC11885806 DOI: 10.1038/s41598-025-91825-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2025] [Accepted: 02/24/2025] [Indexed: 03/09/2025] Open
Abstract
In recent times, severe acute malnutrition (SAM) in India is considered a serious issue as per UNICEF 2022 records. In that record, 35.5% of children under age 5 are stunted, 19.3% are wasted, and 32% are underweight. Malnutrition, defined as these three conditions, affects 5.7 million children globally. This research utilizes an artificial intelligence-based image segmentation technique to predict malnutrition in children. The primary goal of this research is to use a deep learning model to eliminate the need for multiple manual diagnostic tests and simplify the prediction of malnutrition in kids. The traditional model uses text-based data and takes more time with continuous monitoring of kids by analysing body mass index (BMI) over different periods. Children in rural areas often miss medical expert appointments, and a lack of knowledge among parents can lead to severe malnutrition. The aim of the proposed system is to eliminate the need for manual blood tests and regular visits to medical experts. This study uses the ResNet-50 deep learning model's built-in shortcut connection to solve the image-based vanishing gradient problem. This makes training more efficient for image segmentation tasks in predicting malnutrition. The model is 98.49% accurate in predicting the kids who are malnourished among the kids who are healthy. It is evident from the results that the proposed system serves better than other deep learning models, such as XG Boost (75.29% accuracy), VGG 16 (94% accuracy), Xception (95.41% accuracy), and MobileNet (92.42% accuracy). Hence, the proposed technique is effective in detecting malnutrition and diagnose it earlier, without using predictive analysis function or advice from the medical experts.
Collapse
Affiliation(s)
- S Aanjankumar
- School of Computing Science and Engineering, VIT Bhopal University, Bhopal-Indore Highway, Kothrikalan, 466114, Sehore, Madhya Pradesh, India
| | - Malathy Sathyamoorthy
- Department of Information Technology, KPR Institute of Engineering and Technology, Coimbatore, India
| | - Rajesh Kumar Dhanaraj
- Symbiosis Institute of Computer Studies and Research (SICSR), Symbiosis International (Deemed University), Pune, India
| | - S R Surjit Kumar
- School of Computing Science and Engineering, VIT Bhopal University, Bhopal-Indore Highway, Kothrikalan, 466114, Sehore, Madhya Pradesh, India
| | - S Poonkuntran
- School of Computing Science and Engineering, VIT Bhopal University, Bhopal-Indore Highway, Kothrikalan, 466114, Sehore, Madhya Pradesh, India
| | - Adil O Khadidos
- Department of Information Technology, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Shitharth Selvarajan
- Department of Computer Science, Kebri Dehar University, Kebri Dehar, Ethiopia.
- Department of Computer Science and Engineering, Chennai Institute of Technology, Chennai, India.
- Centre for Research Impact & Outcome, Chitkara University, Punjab, India.
| |
Collapse
|
6
|
Begashaw GB, Zewotir T, Fenta HM. A deep learning approach for classifying and predicting children's nutritional status in Ethiopia using LSTM-FC neural networks. BioData Min 2025; 18:11. [PMID: 39885567 PMCID: PMC11783927 DOI: 10.1186/s13040-025-00425-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2024] [Accepted: 01/17/2025] [Indexed: 02/01/2025] Open
Abstract
BACKGROUND This study employs a LSTM-FC neural networks to address the critical public health issue of child undernutrition in Ethiopia. By employing this method, the study aims classify children's nutritional status and predict transitions between different undernutrition states over time. This analysis is based on longitudinal data extracted from the Young Lives cohort study, which tracked 1,997 Ethiopian children across five survey rounds conducted from 2002 to 2016. This paper applies rigorous data preprocessing, including handling missing values, normalization, and balancing, to ensure optimal model performance. Feature selection was performed using SHapley Additive exPlanations to identify key factors influencing nutritional status predictions. Hyperparameter tuning was thoroughly applied during model training to optimize performance. Furthermore, this paper compares the performance of LSTM-FC with existing baseline models to demonstrate its superiority. We used Python's TensorFlow and Keras libraries on a GPU-equipped system for model training. RESULTS LSTM-FC demonstrated superior predictive accuracy and long-term forecasting compared to baseline models for assessing child nutritional status. The classification and prediction performance of the model showed high accuracy rates above 93%, with perfect predictions for Normal (N) and Stunted & Wasted (SW) categories, minimal errors in most other nutritional statuses, and slight over- or underestimations in a few instances. The LSTM-FC model demonstrates strong generalization performance across multiple folds, with high recall and consistent F1-scores, indicating its robustness in predicting nutritional status. We analyzed the prevalence of children's nutritional status during their transition from late adolescence to early adulthood. The results show a notable decline in normal nutritional status among males, decreasing from 58.3% at age 5 to 33.5% by age 25. At the same time, the risk of severe undernutrition, including conditions of being underweight, stunted, and wasted (USW), increased from 1.3% to 9.4%. CONCLUSIONS The LSTM-FC model outperforms baseline methods in classifying and predicting Ethiopian children's nutritional statuses. The findings reveal a critical rise in undernutrition, emphasizing the need for urgent public health interventions.
Collapse
Affiliation(s)
- Getnet Bogale Begashaw
- Department of Statistics, College of Science, Bahir Dar University, P.O. Box 79, Bahir Dar, Ethiopia.
- Department of Data Science, College of Natural and Computational Science, Debre Berhan University, P.O. Box 445, Debre Berhan, Ethiopia.
| | - Temesgen Zewotir
- School of Mathematics, Statistics and Computer Science, College of Agriculture, Engineering and Science, University of KwaZulu-Natal, Durban, South Africa
| | - Haile Mekonnen Fenta
- Department of Statistics, College of Science, Bahir Dar University, P.O. Box 79, Bahir Dar, Ethiopia
- Center for Environmental and Respiratory Health Research, Population Health, University of Oulu, Oulu, Finland
- Biocenter Oulu, University of Oulu, Oulu, Finland
| |
Collapse
|
7
|
Zemariam AB, Abate BB, Alamaw AW, Lake ES, Yilak G, Ayele M, Tilahun BD, Ngusie HS. Prediction of stunting and its socioeconomic determinants among adolescent girls in Ethiopia using machine learning algorithms. PLoS One 2025; 20:e0316452. [PMID: 39854425 PMCID: PMC11760002 DOI: 10.1371/journal.pone.0316452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Accepted: 12/11/2024] [Indexed: 01/26/2025] Open
Abstract
BACKGROUND Stunting is a vital indicator of chronic undernutrition that reveals a failure to reach linear growth. Investigating growth and nutrition status during adolescence, in addition to infancy and childhood is very crucial. However, the available studies in Ethiopia have been usually focused in early childhood and they used the traditional stastical methods. Therefore, this study aimed to employ multiple machine learning algorithms to identify the most effective model for the prediction of stunting among adolescent girls in Ethiopia. METHODS A total of 3156 weighted samples of adolescent girls aged 15-19 years were used from the 2016 Ethiopian Demographic and Health Survey dataset. The data was pre-processed, and 80% and 20% of the observations were used for training, and testing the model, respectively. Eight machine learning algorithms were included for consideration of model building and comparison. The performance of the predictive model was evaluated using evaluation metrics value through Python software. The synthetic minority oversampling technique was used for data balancing and Boruta algorithm was used to identify best features. Association rule mining using an Apriori algorithm was employed to generate the best rule for the association between the independent feature and the targeted feature using R software. RESULTS The random forest classifier (sensitivity = 81%, accuracy = 77%, precision = 75%, f1-score = 78%, AUC = 85%) outperformed in predicting stunting compared to other ML algorithms considered in this study. Region, poor wealth index, no formal education, unimproved toilet facility, rural residence, not used contraceptive method, religion, age, no media exposure, occupation, and having one or more children were the top attributes to predict stunting. Association rule mining was identified the top seven best rules that most frequently associated with stunting among adolescent girls in Ethiopia. CONCLUSION The random forest classifier outperformed in predicting and identifying the relevant predictors of stunting. Results have shown that machine learning algorithms can accurately predict stunting, making them potentially valuable as decision-support tools for the relevant stakeholders and giving emphasis for the identified predictors could be an important intervention to halt stunting among adolescent girls.
Collapse
Affiliation(s)
- Alemu Birara Zemariam
- Department of Pediatrics and Child Health Nursing, School of Nursing, College of Medicine and Health Science, Woldia University, Woldia, Ethiopia
| | - Biruk Beletew Abate
- Department of Pediatrics and Child Health Nursing, School of Nursing, College of Medicine and Health Science, Woldia University, Woldia, Ethiopia
| | - Addis Wondmagegn Alamaw
- Department of Emergency and Critical Care Nursing, School of Nursing, College of Medicine and Health Science, Woldia University, Woldia, Ethiopia
| | - Eyob shitie Lake
- Department of Midwifery, School of Midwifery, School of Midwifery, College of Medicine and Health Science, Woldia University, Woldia, Ethiopia
| | - Gizachew Yilak
- Department of Nursing, School of Nursing, College of Medicine and Health Science, Woldia University, Woldia, Ethiopia
| | - Mulat Ayele
- Department of Midwifery, School of Midwifery, School of Midwifery, College of Medicine and Health Science, Woldia University, Woldia, Ethiopia
| | - Befkad Derese Tilahun
- Department of Nursing, School of Nursing, College of Medicine and Health Science, Woldia University, Woldia, Ethiopia
| | - Habtamu Setegn Ngusie
- Department of Health Informatics, School of Public Health, College of Medicine and Health Science, Woldia University, Woldia, Ethiopia
| |
Collapse
|
8
|
Islam MM, Kibria NMSJ, Kumar S, Roy DC, Karim MR. Prediction of undernutrition and identification of its influencing predictors among under-five children in Bangladesh using explainable machine learning algorithms. PLoS One 2024; 19:e0315393. [PMID: 39642130 PMCID: PMC11623790 DOI: 10.1371/journal.pone.0315393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Accepted: 11/25/2024] [Indexed: 12/08/2024] Open
Abstract
BACKGROUND AND OBJECTIVES Child undernutrition is a leading global health concern, especially in low and middle-income developing countries, including Bangladesh. Thus, the objectives of this study are to develop an appropriate model for predicting the risk of undernutrition and identify its influencing predictors among under-five children in Bangladesh using explainable machine learning algorithms. MATERIALS AND METHODS This study used the latest nationally representative cross-sectional Bangladesh demographic health survey (BDHS), 2017-18 data. The Boruta technique was implemented to identify the important predictors of undernutrition, and logistic regression, artificial neural network, random forest, and extreme gradient boosting (XGB) were adopted to predict undernutrition (stunting, wasting, and underweight) risk. The models' performance was evaluated through accuracy and area under the curve (AUC). Additionally, SHapley Additive exPlanations (SHAP) were employed to illustrate the influencing predictors of undernutrition. RESULTS The XGB-based model outperformed the other models, with the accuracy and AUC respectively 81.73% and 0.802 for stunting, 76.15% and 0.622 for wasting, and 79.13% and 0.712 for underweight. Moreover, the SHAP method demonstrated that the father's education, wealth, mother's education, BMI, birth interval, vitamin A, watching television, toilet facility, residence, and water source are the influential predictors of stunting. While, BMI, mother education, and BCG of wasting; and father education, wealth, mother education, BMI, birth interval, toilet facility, breastfeeding, birth order, and residence of underweight. CONCLUSION The proposed integrating framework will be supportive as a method for selecting important predictors and predicting children who are at high risk of stunting, wasting, and underweight in Bangladesh.
Collapse
Affiliation(s)
- Md. Merajul Islam
- Department of Statistics, Jatiya Kabi Kazi Nazrul Islam University, Trishal, Mymensingh, Bangladesh
- Department of Statistics, University of Rajshahi, Rajshahi, Bangladesh
| | | | - Sujit Kumar
- Department of Statistics, Jatiya Kabi Kazi Nazrul Islam University, Trishal, Mymensingh, Bangladesh
| | - Dulal Chandra Roy
- Department of Statistics, University of Rajshahi, Rajshahi, Bangladesh
| | - Md. Rezaul Karim
- Department of Statistics, University of Rajshahi, Rajshahi, Bangladesh
| |
Collapse
|
9
|
Hussain Z, Borah MD. A computational model to analyze the impact of birth weight-nutritional status pair on disease development and disease recovery. Health Inf Sci Syst 2024; 12:10. [PMID: 38375133 PMCID: PMC10874357 DOI: 10.1007/s13755-024-00272-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2023] [Accepted: 01/08/2024] [Indexed: 02/21/2024] Open
Abstract
Purpose The purpose of this work is to analyse the combined impacts of birth weight and nutritional status on development and recovery of various types of diseases. This work aims to computationally establish the facts about the effects of individual birth weight-nutritional status pairs on disease development and disease recovery. Methods This work designs a computational model to analyze the impact of birth weight-nutritional status pairs on disease development and disease recovery. Our model works in two phases. The first phase finds the best machine learning model to predict birth weight from "Child Birth Weight Dataset" available at IEEE Dataport (https://dx.doi.org/10.21227/dvd4-3232). The second phase combines the predicted birth weight labels with nutritional status labels and establishes the effects using differential equations. Results The experimental results find Gradient boosting (GB) to work the best with Information gain (IGT) and Support Vector Machine (SVM) with Chi-square test (CST) for predicting the birth weights. The simulated results establish that "normal birth weight and normal nutritional status" is the best pair for resisting disease development as well as enhancing disease recovery. The results also depict that "low birth weight and malnutrition" is the worst pair for disease development while "high birth weight and malnutrition" is the worst combination for disease recovery. Conclusion The findings computationally establish the facts about the effects of birth weight-nutritional status pairs on disease development and disease recovery. As a social implication, this study can spread awareness about the importance of birth weight and nutritional status. The outcome can be helpful for the concerned authority in making decisions on healthcare cost and expenditure.
Collapse
Affiliation(s)
- Zakir Hussain
- Department of Computer Science and Engineering, National Institute of Technology Silchar, NIT Road, Cachar, Silchar, Assam 788010 India
| | - Malaya Dutta Borah
- Department of Computer Science and Engineering, National Institute of Technology Silchar, NIT Road, Cachar, Silchar, Assam 788010 India
| |
Collapse
|
10
|
Alam MA, Sajib MRUZ, Rahman F, Ether S, Hanson M, Sayeed A, Akter E, Nusrat N, Islam TT, Raza S, Tanvir KM, Chisti MJ, Rahman QSU, Hossain A, Layek MA, Zaman A, Rana J, Rahman SM, Arifeen SE, Rahman AE, Ahmed A. Implications of Big Data Analytics, AI, Machine Learning, and Deep Learning in the Health Care System of Bangladesh: Scoping Review. J Med Internet Res 2024; 26:e54710. [PMID: 39466315 PMCID: PMC11555453 DOI: 10.2196/54710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 05/14/2024] [Accepted: 09/12/2024] [Indexed: 10/29/2024] Open
Abstract
BACKGROUND The rapid advancement of digital technologies, particularly in big data analytics (BDA), artificial intelligence (AI), machine learning (ML), and deep learning (DL), is reshaping the global health care system, including in Bangladesh. The increased adoption of these technologies in health care delivery within Bangladesh has sparked their integration into health care and public health research, resulting in a noticeable surge in related studies. However, a critical gap exists, as there is a lack of comprehensive evidence regarding the research landscape; regulatory challenges; use cases; and the application and adoption of BDA, AI, ML, and DL in the health care system of Bangladesh. This gap impedes the attainment of optimal results. As Bangladesh is a leading implementer of digital technologies, bridging this gap is urgent for the effective use of these advancing technologies. OBJECTIVE This scoping review aims to collate (1) the existing research in Bangladesh's health care system, using the aforementioned technologies and synthesizing their findings, and (2) the limitations faced by researchers in integrating the aforementioned technologies into health care research. METHODS MEDLINE (via PubMed), IEEE Xplore, Scopus, and Embase databases were searched to identify published research articles between January 1, 2000, and September 10, 2023, meeting the following inclusion criteria: (1) any study using any of the BDA, AI, ML, and DL technologies and health care and public health datasets for predicting health issues and forecasting any kind of outbreak; (2) studies primarily focusing on health care and public health issues in Bangladesh; and (3) original research articles published in peer-reviewed journals and conference proceedings written in English. RESULTS With the initial search, we identified 1653 studies. Following the inclusion and exclusion criteria and full-text review, 4.66% (77/1653) of the articles were finally included in this review. There was a substantial increase in studies over the last 5 years (2017-2023). Among the 77 studies, the majority (n=65, 84%) used ML models. A smaller proportion of studies incorporated AI (4/77, 5%), DL (7/77, 9%), and BDA (1/77, 1%) technologies. Among the reviewed articles, 52% (40/77) relied on primary data, while the remaining 48% (37/77) used secondary data. The primary research areas of focus were infectious diseases (15/77, 19%), noncommunicable diseases (23/77, 30%), child health (11/77, 14%), and mental health (9/77, 12%). CONCLUSIONS This scoping review highlights remarkable progress in leveraging BDA, AI, ML, and DL within Bangladesh's health care system. The observed surge in studies over the last 5 years underscores the increasing significance of AI and related technologies in health care research. Notably, most (65/77, 84%) studies focused on ML models, unveiling opportunities for advancements in predictive modeling. This review encapsulates the current state of technological integration and propels us into a promising era for the future of digital Bangladesh.
Collapse
Affiliation(s)
- Md Ashraful Alam
- Maternal and Child Health Division, International Centre for Diarrheal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Md Refat Uz Zaman Sajib
- Department of Health and Kinesiology, University of Illinois, Champaign and Urbana, IL, United States
| | - Fariya Rahman
- Maternal and Child Health Division, International Centre for Diarrheal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Saraban Ether
- Maternal and Child Health Division, International Centre for Diarrheal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Molly Hanson
- Department of Women's and Children's Health, Uppsala University, Uppsala, Sweden
| | - Abu Sayeed
- Maternal and Child Health Division, International Centre for Diarrheal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Ema Akter
- Maternal and Child Health Division, International Centre for Diarrheal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Nowrin Nusrat
- Maternal and Child Health Division, International Centre for Diarrheal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Tanjeena Tahrin Islam
- Maternal and Child Health Division, International Centre for Diarrheal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Sahar Raza
- Maternal and Child Health Division, International Centre for Diarrheal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - K M Tanvir
- Maternal and Child Health Division, International Centre for Diarrheal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Mohammod Jobayer Chisti
- Maternal and Child Health Division, International Centre for Diarrheal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Qazi Sadeq-Ur Rahman
- Maternal and Child Health Division, International Centre for Diarrheal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Akm Hossain
- Maternal and Child Health Division, International Centre for Diarrheal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - M A Layek
- Department of Computer Science and Engineering, Jagannath University, Dhaka, Bangladesh
| | - Asaduz Zaman
- Faculty of Information Technology, Monash University, Melbourne, Australia
| | - Juwel Rana
- Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, QC, Canada
- Research and Innovation Division, South Asian Institute for Social Transformation, Dhaka, Bangladesh
| | | | - Shams El Arifeen
- Maternal and Child Health Division, International Centre for Diarrheal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Ahmed Ehsanur Rahman
- Maternal and Child Health Division, International Centre for Diarrheal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Anisuddin Ahmed
- Maternal and Child Health Division, International Centre for Diarrheal Disease Research, Bangladesh, Dhaka, Bangladesh
- Department of Women's and Children's Health, Uppsala University, Uppsala, Sweden
| |
Collapse
|
11
|
Deng L, Zhao J, Wang T, Liu B, Jiang J, Jia P, Liu D, Li G. Construction and validation of predictive models for intravenous immunoglobulin-resistant Kawasaki disease using an interpretable machine learning approach. Clin Exp Pediatr 2024; 67:405-414. [PMID: 39048087 PMCID: PMC11298769 DOI: 10.3345/cep.2024.00549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Revised: 04/27/2024] [Accepted: 05/10/2024] [Indexed: 07/27/2024] Open
Abstract
BACKGROUND Intravenous immunoglobulin (IVIG)-resistant Kawasaki disease is associated with coronary artery lesion development. PURPOSE This study aimed to explore the factors associated with IVIG-resistance and construct and validate an interpretable machine learning (ML) prediction model in clinical practice. METHODS Between December 2014 and November 2022, 602 patients were screened and risk factors for IVIG-resistance investigated. Five ML models are used to establish an optimal prediction model. The SHapley Additive exPlanations (SHAP) method was used to interpret the ML model. RESULTS Na+, hemoglobin (Hb), C-reactive protein (CRP), and globulin were independent risk factors for IVIG-resistance. A nonlinear relationship was identified between globulin level and IVIG-resistance. The XGBoost model exhibited excellent performance, with an area under the receiver operating characteristic curve of 0.821, accuracy of 0.748, sensitivity of 0.889, and specificity of 0.683 in the testing set. The XGBoost model was interpreted globally and locally using the SHAP method. CONCLUSION Na+, Hb, CRP, and globulin levels were independently associated with IVIG-resistance. Our findings demonstrate that ML models can reliably predict IVIG-resistance. Moreover, use of the SHAP method to interpret the established XGBoost model's findings would provide evidence of IVIG-resistance and guide the individualized treatment of Kawasaki disease.
Collapse
Affiliation(s)
- Linfan Deng
- Department of Pediatrics, The Affiliated Hospital of Southwest Medical University, Luzhou, China
- Sichuan Clinical Research Center for Birth Defects, Luzhou, China
- Mianyang Central Hospital, School of Medicine, University of Electronic Science and Technology of China, Mianyang, China
| | - Jian Zhao
- Department of Pediatrics, The Affiliated Hospital of Southwest Medical University, Luzhou, China
- Sichuan Clinical Research Center for Birth Defects, Luzhou, China
| | - Ting Wang
- Department of Pediatrics, The Affiliated Hospital of Southwest Medical University, Luzhou, China
- Sichuan Clinical Research Center for Birth Defects, Luzhou, China
| | - Bin Liu
- Department of Pediatrics, The Affiliated Hospital of Southwest Medical University, Luzhou, China
- Sichuan Clinical Research Center for Birth Defects, Luzhou, China
| | - Jun Jiang
- Department of General Surgery (Thyroid Surgery), The Affiliated Hospital of Southwest Medical University, Luzhou, China
- Metabolic Vascular Diseases Key Laboratory of Sichuan Province, Luzhou, China
| | - Peng Jia
- Department of Pediatrics, Zigong First People’s Hospital, Zigong, China
| | - Dong Liu
- Department of Pediatrics, The Affiliated Hospital of Southwest Medical University, Luzhou, China
- Sichuan Clinical Research Center for Birth Defects, Luzhou, China
| | - Gang Li
- Department of Pediatrics, The Affiliated Hospital of Southwest Medical University, Luzhou, China
- Sichuan Clinical Research Center for Birth Defects, Luzhou, China
| |
Collapse
|
12
|
Qasrawi R, Sgahir S, Nemer M, Halaikah M, Badrasawi M, Amro M, Vicuna Polo S, Abu Al-Halawa D, Mujahed D, Nasreddine L, Elmadfa I, Atari S, Al-Jawaldeh A. Machine Learning Approach for Predicting the Impact of Food Insecurity on Nutrient Consumption and Malnutrition in Children Aged 6 Months to 5 Years. CHILDREN (BASEL, SWITZERLAND) 2024; 11:810. [PMID: 39062259 PMCID: PMC11274836 DOI: 10.3390/children11070810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 06/10/2024] [Accepted: 06/21/2024] [Indexed: 07/28/2024]
Abstract
BACKGROUND Food insecurity significantly impacts children's health, affecting their development across cognitive, physical, and socio-emotional dimensions. This study explores the impact of food insecurity among children aged 6 months to 5 years, focusing on nutrient intake and its relationship with various forms of malnutrition. METHODS Utilizing machine learning algorithms, this study analyzed data from 819 children in the West Bank to investigate sociodemographic and health factors associated with food insecurity and its effects on nutritional status. The average age of the children was 33 months, with 52% boys and 48% girls. RESULTS The analysis revealed that 18.1% of children faced food insecurity, with household education, family income, locality, district, and age emerging as significant determinants. Children from food-insecure environments exhibited lower average weight, height, and mid-upper arm circumference compared to their food-secure counterparts, indicating a direct correlation between food insecurity and reduced nutritional and growth metrics. Moreover, the machine learning models observed vitamin B1 as a key indicator of all forms of malnutrition, alongside vitamin K1, vitamin A, and zinc. Specific nutrients like choline in the "underweight" category and carbohydrates in the "wasting" category were identified as unique nutritional priorities. CONCLUSION This study provides insights into the differential risks for growth issues among children, offering valuable information for targeted interventions and policymaking.
Collapse
Affiliation(s)
- Radwan Qasrawi
- Department of Computer Sciences, Al Quds University, Jerusalem P.O. Box 20002, Palestine
- Department of Computer Engineering, Istinye University, 34010 Istanbul, Turkey
| | - Sabri Sgahir
- Department of Nutrition and Food Technology, College of Agriculture, Hebron University, Hebron P.O. Box 40, Palestine
| | - Maysaa Nemer
- Institute of Community and Public Health, Birzeit University, Ramallah P.O. Box 14, Palestine
| | - Mousa Halaikah
- Nutrition Department, Ministry of Health, Ramallah P.O. Box 4284, Palestine
| | - Manal Badrasawi
- Nutrition and Food Technology Department, Faculty of Agriculture and Veterinary Medicine, An-Najah National University, Nablus P.O. Box 7, Palestine
| | - Malak Amro
- Department of Computer Sciences, Al Quds University, Jerusalem P.O. Box 20002, Palestine
| | - Stephanny Vicuna Polo
- Department of Computer Sciences, Al Quds University, Jerusalem P.O. Box 20002, Palestine
| | - Diala Abu Al-Halawa
- Faculty of Medicine, Al-Quds University, Jerusalem P.O. Box 20002, Palestine
| | - Doa’a Mujahed
- Institute of Community and Public Health, Birzeit University, Ramallah P.O. Box 14, Palestine
| | - Lara Nasreddine
- Nutrition and Food Sciences Department, Faculty of Agriculture and Food Sciences, American University of Beirut, Beirut 1107 2020, Lebanon
| | - Ibrahim Elmadfa
- Department of Nutrition, Faculty of Life Sciences, University of Vienna, 1090 Vienna, Austria
| | - Siham Atari
- Department of Computer Sciences, Al Quds University, Jerusalem P.O. Box 20002, Palestine
| | - Ayoub Al-Jawaldeh
- Regional Office for the Eastern Mediterranean, World Health Organization, Cairo 7608, Egypt
| |
Collapse
|
13
|
Liu P, Liu Y, Liu H, Xiong L, Mei C, Yuan L. A Random Forest Algorithm for Assessing Risk Factors Associated With Chronic Kidney Disease: Observational Study. Asian Pac Isl Nurs J 2024; 8:e48378. [PMID: 38830204 PMCID: PMC11184270 DOI: 10.2196/48378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 02/02/2024] [Accepted: 04/16/2024] [Indexed: 06/05/2024] Open
Abstract
BACKGROUND The prevalence and mortality rate of chronic kidney disease (CKD) are increasing year by year, and it has become a global public health issue. The economic burden caused by CKD is increasing at a rate of 1% per year. CKD is highly prevalent and its treatment cost is high but unfortunately remains unknown. Therefore, early detection and intervention are vital means to mitigate the treatment burden on patients and decrease disease progression. OBJECTIVE In this study, we investigated the advantages of using the random forest (RF) algorithm for assessing risk factors associated with CKD. METHODS We included 40,686 people with complete screening records who underwent screening between January 1, 2015, and December 22, 2020, in Jing'an District, Shanghai, China. We grouped the participants into those with and those without CKD by staging based on the glomerular filtration rate staging and grouping based on albuminuria. Using a logistic regression model, we determined the relationship between CKD and risk factors. The RF machine learning algorithm was used to score the predictive variables and rank them based on their importance to construct a prediction model. RESULTS The logistic regression model revealed that gender, older age, obesity, abnormal index estimated glomerular filtration rate, retirement status, and participation in urban employee medical insurance were significantly associated with the risk of CKD. On RF algorithm-based screening, the top 4 factors influencing CKD were age, albuminuria, working status, and urinary albumin-creatinine ratio. The RF model predicted an area under the receiver operating characteristic curve of 93.15%. CONCLUSIONS Our findings reveal that the RF algorithm has significant predictive value for assessing risk factors associated with CKD and allows the screening of individuals with risk factors. This has crucial implications for early intervention and prevention of CKD.
Collapse
Affiliation(s)
- Pei Liu
- Department of Mathematics and Physics, Second Military Medical University, Shanghai, China
| | - Yijun Liu
- Department of Health Management, Second Military Medical University, Shanghai, China
| | - Hao Liu
- Faculty of Health Service, Second Military Medical University, Shanghai, China
| | - Linping Xiong
- Department of Health Management, Second Military Medical University, Shanghai, China
| | - Changlin Mei
- Nephrology Department, Shanghai Changzheng Hospital, Shanghai, China
| | - Lei Yuan
- Department of Health Management, Second Military Medical University, Shanghai, China
| |
Collapse
|
14
|
Begum N, Rahman MM, Omar Faruk M. Machine learning prediction of nutritional status among pregnant women in Bangladesh: Evidence from Bangladesh demographic and health survey 2017-18. PLoS One 2024; 19:e0304389. [PMID: 38820295 PMCID: PMC11142495 DOI: 10.1371/journal.pone.0304389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Accepted: 05/12/2024] [Indexed: 06/02/2024] Open
Abstract
AIM Malnutrition in pregnant women significantly affects both mother and child health. This research aims to identify the best machine learning (ML) techniques for predicting the nutritional status of pregnant women in Bangladesh and detect the most essential features based on the best-performed algorithm. METHODS This study used retrospective cross-sectional data from the Bangladeshi Demographic and Health Survey 2017-18. Different feature transformations and machine learning classifiers were applied to find the best transformation and classification model. RESULTS This investigation found that robust scaling outperformed all feature transformation methods. The result shows that the Random Forest algorithm with robust scaling outperforms all other machine learning algorithms with 74.75% accuracy, 57.91% kappa statistics, 73.36% precision, 73.08% recall, and 73.09% f1 score. In addition, the Random Forest algorithm had the highest precision (76.76%) and f1 score (71.71%) for predicting the underweight class, as well as an expected precision of 82.01% and f1 score of 83.78% for the overweight/obese class when compared to other algorithms with a robust scaling method. The respondent's age, wealth index, region, husband's education level, husband's age, and occupation were crucial features for predicting the nutritional status of pregnant women in Bangladesh. CONCLUSION The proposed classifier could help predict the expected outcome and reduce the burden of malnutrition among pregnant women in Bangladesh.
Collapse
Affiliation(s)
- Najma Begum
- Department of Statistics, Noakhali Science and Technology University, Noakhali, Bangladesh
| | | | - Mohammad Omar Faruk
- Department of Statistics, Noakhali Science and Technology University, Noakhali, Bangladesh
| |
Collapse
|
15
|
Anku EK, Duah HO. Predicting and identifying factors associated with undernutrition among children under five years in Ghana using machine learning algorithms. PLoS One 2024; 19:e0296625. [PMID: 38349921 PMCID: PMC10863846 DOI: 10.1371/journal.pone.0296625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2023] [Accepted: 11/13/2023] [Indexed: 02/15/2024] Open
Abstract
BACKGROUND Undernutrition among children under the age of five is a major public health concern, especially in developing countries. This study aimed to use machine learning (ML) algorithms to predict undernutrition and identify its associated factors. METHODS Secondary data analysis of the 2017 Multiple Indicator Cluster Survey (MICS) was performed using R and Python. The main outcomes of interest were undernutrition (stunting: height-for-age (HAZ) < -2 SD; wasting: weight-for-height (WHZ) < -2 SD; and underweight: weight-for-age (WAZ) < -2 SD). Seven ML algorithms were trained and tested: linear discriminant analysis (LDA), logistic model, support vector machine (SVM), random forest (RF), least absolute shrinkage and selection operator (LASSO), ridge regression, and extreme gradient boosting (XGBoost). The ML models were evaluated using the accuracy, confusion matrix, and area under the curve (AUC) receiver operating characteristics (ROC). RESULTS In total, 8564 children were included in the final analysis. The average age of the children was 926 days, and the majority were females. The weighted prevalence rates of stunting, wasting, and underweight were 17%, 7%, and 12%, respectively. The accuracies of all the ML models for wasting were (LDA: 84%; Logistic: 95%; SVM: 92%; RF: 94%; LASSO: 96%; Ridge: 84%, XGBoost: 98%), stunting (LDA: 86%; Logistic: 86%; SVM: 98%; RF: 88%; LASSO: 86%; Ridge: 86%, XGBoost: 98%), and for underweight were (LDA: 90%; Logistic: 92%; SVM: 98%; RF: 89%; LASSO: 92%; Ridge: 88%, XGBoost: 98%). The AUC values of the wasting models were (LDA: 99%; Logistic: 100%; SVM: 72%; RF: 94%; LASSO: 99%; Ridge: 59%, XGBoost: 100%), for stunting were (LDA: 89%; Logistic: 90%; SVM: 100%; RF: 92%; LASSO: 90%; Ridge: 89%, XGBoost: 100%), and for underweight were (LDA: 95%; Logistic: 96%; SVM: 100%; RF: 94%; LASSO: 96%; Ridge: 82%, XGBoost: 82%). Age, weight, length/height, sex, region of residence and ethnicity were important predictors of wasting, stunting and underweight. CONCLUSION The XGBoost model was the best model for predicting wasting, stunting, and underweight. The findings showed that different ML algorithms could be useful for predicting undernutrition and identifying important predictors for targeted interventions among children under five years in Ghana.
Collapse
Affiliation(s)
- Eric Komla Anku
- Dietherapy and Nutrition, Cape Coast Teaching Hospital, Cape Coast, Ghana
| | - Henry Ofori Duah
- University of Cincinnati College of Nursing, Cincinnati, Ohio, United States of America
| |
Collapse
|
16
|
Turjo EA, Rahman MH. Assessing risk factors for malnutrition among women in Bangladesh and forecasting malnutrition using machine learning approaches. BMC Nutr 2024; 10:22. [PMID: 38303093 PMCID: PMC10832135 DOI: 10.1186/s40795-023-00808-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Accepted: 12/04/2023] [Indexed: 02/03/2024] Open
Abstract
BACKGROUND This paper presents an in-depth examination of malnutrition in women in Bangladesh. Malnutrition in women is a major public health issue related to different diseases and has negative repercussions for children, such as premature birth, decreased infection resistance, and an increased risk of death. Moreover, malnutrition is a severe problem in Bangladesh. Data from the Bangladesh Demographic Health Survey (BDHS) conducted in 2017-18 was used to identify risk factors for malnourished women and to create a machine learning-based strategy to detect their nutritional status. METHODS A total of 17022 women participants are taken to conduct the research. All the participants are from different regions and different ages. A chi-square test with a five percent significance level is used to identify possible risk variables for malnutrition in women and six machine learning-based classifiers (Naïve Bayes, two types of Decision Tree, Logistic Regression, Random Forest, and Gradient Boosting Machine) were used to predict the malnutrition of women. The models are being evaluated using different parameters like accuracy, sensitivity, specificity, positive predictive value, negative predictive value, [Formula: see text] score, and area under the curve (AUC). RESULTS Descriptive data showed that 45% of the population studied were malnourished women, and the chi-square test illustrated that all fourteen variables are significantly associated with malnutrition in women and among them, age and wealth index had the most influence on their nutritional status, while water source had the least impact. Random Forest had an accuracy of 60% and 60.2% for training and test data sets, respectively. CART and Gradient Boosting Machine also had close accuracy like Random Forest but based on other performance metrics such as kappa and [Formula: see text] scores Random Forest got the highest rank among others. Also, it had the highest accuracy and [Formula: see text] scores in k-fold validation along with the highest AUC (0.604). CONCLUSION The Random Forest (RF) approach is a reasonably superior machine learning-based algorithm for forecasting women's nutritional status in Bangladesh in comparison to other ML algorithms investigated in this work. The suggested approach will aid in forecasting which women are at high susceptibility to malnutrition, hence decreasing the strain on the healthcare system.
Collapse
Affiliation(s)
- Estiyak Ahmed Turjo
- Department of Statistics and Data Science, Jahangirnagar University, Dhaka, Bangladesh
| | - Md Habibur Rahman
- Department of Statistics and Data Science, Jahangirnagar University, Dhaka, Bangladesh.
| |
Collapse
|
17
|
Demsash AW, Chereka AA, Walle AD, Kassie SY, Bekele F, Bekana T. Machine learning algorithms' application to predict childhood vaccination among children aged 12-23 months in Ethiopia: Evidence 2016 Ethiopian Demographic and Health Survey dataset. PLoS One 2023; 18:e0288867. [PMID: 37851705 PMCID: PMC10584162 DOI: 10.1371/journal.pone.0288867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 07/06/2023] [Indexed: 10/20/2023] Open
Abstract
INTRODUCTION Childhood vaccination is a cost-effective public health intervention to reduce child mortality and morbidity. But, vaccination coverage remains low, and previous similar studies have not focused on machine learning algorithms to predict childhood vaccination. Therefore, knowledge extraction, association rule formulation, and discovering insights from hidden patterns in vaccination data are limited. Therefore, this study aimed to predict childhood vaccination among children aged 12-23 months using the best machine learning algorithm. METHODS A cross-sectional study design with a two-stage sampling technique was used. A total of 1617 samples of living children aged 12-23 months were used from the 2016 Ethiopian Demographic and Health Survey dataset. The data was pre-processed, and 70% and 30% of the observations were used for training, and evaluating the model, respectively. Eight machine learning algorithms were included for consideration of model building and comparison. All the included algorithms were evaluated using confusion matrix elements. The synthetic minority oversampling technique was used for imbalanced data management. Informational gain value was used to select important attributes to predict childhood vaccination. The If/ then logical association was used to generate rules based on relationships among attributes, and Weka version 3.8.6 software was used to perform all the prediction analyses. RESULTS PART was the first best machine learning algorithm to predict childhood vaccination with 95.53% accuracy. J48, multilayer perceptron, and random forest models were the consecutively best machine learning algorithms to predict childhood vaccination with 89.24%, 87.20%, and 82.37% accuracy, respectively. ANC visits, institutional delivery, health facility visits, higher education, and being rich were the top five attributes to predict childhood vaccination. A total of seven rules were generated that could jointly determine the magnitude of childhood vaccination. Of these, if wealth status = 3 (Rich), adequate ANC visits = 1 (yes), and residency = 2 (Urban), then the probability of childhood vaccination would be 86.73%. CONCLUSIONS The PART, J48, multilayer perceptron, and random forest algorithms were important algorithms for predicting childhood vaccination. The findings would provide insight into childhood vaccination and serve as a framework for further studies. Strengthening mothers' ANC visits, institutional delivery, improving maternal education, and creating income opportunities for mothers could be important interventions to enhance childhood vaccination.
Collapse
Affiliation(s)
| | - Alex Ayenew Chereka
- Department of Health Informatics, College of Health Science, Mettu University, Mettu, Ethiopia
| | - Agmasie Damtew Walle
- Department of Health Informatics, College of Health Science, Mettu University, Mettu, Ethiopia
| | - Sisay Yitayih Kassie
- Department of Health Informatics, College of Health Science, Mettu University, Mettu, Ethiopia
| | - Firomsa Bekele
- Department of Pharmacy, College of Health Science, Mettu University, Mettu, Ethiopia
| | - Teshome Bekana
- Biomedical Science Department, College of Health Science, Mettu University, Mettu, Ethiopia
| |
Collapse
|
18
|
Shen H, Zhao H, Jiang Y. Machine Learning Algorithms for Predicting Stunting among Under-Five Children in Papua New Guinea. CHILDREN (BASEL, SWITZERLAND) 2023; 10:1638. [PMID: 37892302 PMCID: PMC10605317 DOI: 10.3390/children10101638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 09/27/2023] [Accepted: 09/28/2023] [Indexed: 10/29/2023]
Abstract
Preventing stunting is particularly important for healthy development across the life course. In Papua New Guinea (PNG), the prevalence of stunting in children under five years old has consistently not improved. Therefore, the primary objective of this study was to employ multiple machine learning algorithms to identify the most effective model and key predictors for stunting prediction in children in PNG. The study used data from the 2016-2018 Papua New Guinea Demographic Health Survey, including from 3380 children with complete height-for-age data. The least absolute shrinkage and selection operator (LASSO) and random-forest-recursive feature elimination were used for feature selection. Logistic regression, a conditional decision tree, a support vector machine with a radial basis function kernel, and an extreme gradient boosting machine (XGBoost) were employed to construct the prediction model. The performance of the final model was evaluated using accuracy, precision, recall, F1 score, and area under the curve (AUC). The results of the study showed that LASSO-XGBoost has the best performance for predicting stunting in PNG (AUC: 0.765; 95% CI: 0.714-0.819) with accuracy, precision, recall, and F1 scores of 0.728, 0.715, 0.628, and 0.669, respectively. Combined with the SHAP value method, the optimal prediction model identified living in the Highlands Region, the age of the child, being in the richest family, and having a larger or smaller birth size as the top five important characteristics for predicting stunting. Based on the model, the findings support the necessity of preventing stunting early in life. Emphasizing the nutritional status of vulnerable maternal and child populations in PNG is recommended to promote maternal and child health and overall well-being.
Collapse
Affiliation(s)
| | | | - Yi Jiang
- School of Public Health, Chongqing Medical University, Chongqing 400016, China; (H.S.); (H.Z.)
| |
Collapse
|
19
|
Salinari A, Machì M, Armas Diaz Y, Cianciosi D, Qi Z, Yang B, Ferreiro Cotorruelo MS, Villar SG, Dzul Lopez LA, Battino M, Giampieri F. The Application of Digital Technologies and Artificial Intelligence in Healthcare: An Overview on Nutrition Assessment. Diseases 2023; 11:97. [PMID: 37489449 PMCID: PMC10366918 DOI: 10.3390/diseases11030097] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Revised: 07/11/2023] [Accepted: 07/11/2023] [Indexed: 07/26/2023] Open
Abstract
In the last decade, artificial intelligence (AI) and AI-mediated technologies have undergone rapid evolution in healthcare and medicine, from apps to computer software able to analyze medical images, robotic surgery and advanced data storage system. The main aim of the present commentary is to briefly describe the evolution of AI and its applications in healthcare, particularly in nutrition and clinical biochemistry. Indeed, AI is revealing itself to be an important tool in clinical nutrition by using telematic means to self-monitor various health metrics, including blood glucose levels, body weight, heart rate, fat percentage, blood pressure, activity tracking and calorie intake trackers. In particular, the application of the most common digital technologies used in the field of nutrition as well as the employment of AI in the management of diabetes and obesity, two of the most common nutrition-related pathologies worldwide, will be presented.
Collapse
Affiliation(s)
- Alessia Salinari
- Department of Clinical Sciences, Faculty of Medicine, Polytechnic University of Marche, 60131 Ancona, Italy
| | - Michele Machì
- Department of Clinical Sciences, Faculty of Medicine, Polytechnic University of Marche, 60131 Ancona, Italy
| | - Yasmany Armas Diaz
- Department of Clinical Sciences, Faculty of Medicine, Polytechnic University of Marche, 60131 Ancona, Italy
| | - Danila Cianciosi
- Department of Clinical Sciences, Faculty of Medicine, Polytechnic University of Marche, 60131 Ancona, Italy
| | - Zexiu Qi
- Department of Clinical Sciences, Faculty of Medicine, Polytechnic University of Marche, 60131 Ancona, Italy
| | - Bei Yang
- Department of Clinical Sciences, Faculty of Medicine, Polytechnic University of Marche, 60131 Ancona, Italy
| | | | - Santos Gracia Villar
- Research Group on Food, Nutritional Biochemistry and Health, Universidad Europea del Atlántico, 39011 Santander, Spain
- Department of Projects, Universidad Internacional Iberoamericana, Campeche 24560, Mexico
- Department of Extension, Universidad Internacional do Cuanza, Cuito P.O. Box 841, Angola
| | - Luis Alonso Dzul Lopez
- Research Group on Food, Nutritional Biochemistry and Health, Universidad Europea del Atlántico, 39011 Santander, Spain
- Department of Projects, Universidad Internacional Iberoamericana, Campeche 24560, Mexico
- Department of Projects, Universidad Internacional Iberoamericana, Arecibo, PR 00613, USA
| | - Maurizio Battino
- International Research Center for Food Nutrition and Safety, Jiangsu University, Zhenjiang 212013, China
- Department of Clinical Sciences, Faculty of Medicine, Polytechnic University of Marche, 60131 Ancona, Italy
- Research Group on Food, Nutritional Biochemistry and Health, Universidad Europea del Atlántico, 39011 Santander, Spain
| | - Francesca Giampieri
- Research Group on Food, Nutritional Biochemistry and Health, Universidad Europea del Atlántico, 39011 Santander, Spain
| |
Collapse
|
20
|
A hybrid ensemble approach to accelerate the classification accuracy for predicting malnutrition among under-five children in sub-Saharan African countries. Nutrition 2023; 108:111947. [PMID: 36641887 DOI: 10.1016/j.nut.2022.111947] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Revised: 11/29/2022] [Accepted: 12/20/2022] [Indexed: 12/25/2022]
Abstract
BACKGROUND The proper intake of nutrients is essential to the growth and maturation of youngsters. In sub-Saharan Africa, 1 in 7 children dies before age 5 y, and more than a third of these deaths are attributed to malnutrition. The main purpose of this study was to develop a majority voting-based hybrid ensemble (MVBHE) learning model to accelerate the prediction accuracy of malnutrition data of under-five children in sub-Saharan Africa. METHODS This study used available under-five nutritional secondary data from the Demographic and Health Surveys performed in sub-Saharan African countries. The research used bagging, boosting, and voting algorithms, such as random forest, decision tree, eXtreme Gradient Boosting, and k-nearest neighbors machine learning methods, to generate the MVBHE model. RESULTS We evaluated the model performances in contrast to each other using different measures, including accuracy, precision, recall, and the F1 score. The results of the experiment showed that the MVBHE model (96%) was better at predicting malnutrition than the random forest (81%), decision tree (60%), eXtreme Gradient Boosting (79%), and k-nearest neighbors (74%). CONCLUSIONS The random forest algorithm demonstrated the highest prediction accuracy (81%) compared with the decision tree, eXtreme Gradient Boosting, and k-nearest neighbors algorithms. The accuracy was then enhanced to 96% using the MVBHE model. The MVBHE model is recommended by the present study as the best way to predict malnutrition in under-five children.
Collapse
|
21
|
Aybar-Flores A, Talavera A, Espinoza-Portilla E. Predicting the HIV/AIDS Knowledge among the Adolescent and Young Adult Population in Peru: Application of Quasi-Binomial Logistic Regression and Machine Learning Algorithms. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:5318. [PMID: 37047934 PMCID: PMC10093875 DOI: 10.3390/ijerph20075318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 03/19/2023] [Accepted: 03/27/2023] [Indexed: 06/19/2023]
Abstract
Inadequate knowledge is one of the principal obstacles for preventing HIV/AIDS spread. Worldwide, it is reported that adolescents and young people have a higher vulnerability of being infected. Thus, the need to understand youths' knowledge towards HIV/AIDS becomes crucial. This study aimed to identify the determinants and develop a predictive model to estimate HIV/AIDS knowledge among this target population in Peru. Data from the 2019 DHS Survey were used. The software RStudio and RapidMiner were used for quasi-binomial logistic regression and computational model building, respectively. Five classification algorithms were considered for model development and their performance was assessed using accuracy, sensitivity, specificity, FPR, FNR, Cohen's kappa, F1 score and AUC. The results revealed an association between 14 socio-demographic, economic and health factors and HIV/AIDS knowledge. The accuracy levels were estimated between 59.47 and 64.30%, with the random forest model showing the best performance (64.30%). Additionally, the best classifier showed that the gender of the respondent, area of residence, wealth index, region of residence, interviewee's age, highest educational level, ethnic self-perception, having heard about HIV/AIDS in the past, the performance of an HIV/AIDS screening test and mass media access have a major influence on HIV/AIDS knowledge prediction. The results suggest the usefulness of the associations found and the random forest model as a predictor of knowledge of HIV/AIDS and may aid policy makers to guide and reinforce the planning and implementation of healthcare strategies.
Collapse
Affiliation(s)
- Alejandro Aybar-Flores
- Department of Engineering, Universidad del Pacífico, Lima 15072, Peru; (A.A.-F.); (A.T.)
| | - Alvaro Talavera
- Department of Engineering, Universidad del Pacífico, Lima 15072, Peru; (A.A.-F.); (A.T.)
| | | |
Collapse
|
22
|
Data harnessing to nurture the human mind for a tailored approach to the child. Pediatr Res 2023; 93:357-365. [PMID: 36180585 DOI: 10.1038/s41390-022-02320-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 07/06/2022] [Accepted: 09/12/2022] [Indexed: 11/08/2022]
Abstract
Big data in pediatrics is an ocean of structured and unstructured data. Big data analysis helps to dive into the ocean of data to filter out information that can guide pediatricians in their decision making, precision diagnosis, and targeted therapy. In addition, big data and its analysis have helped in the surveillance, prevention, and performance of the health system. There has been a considerable amount of work in pediatrics that we have tried to highlight in this review and some of it has been already incorporated into the health system. Work in specialties of pediatrics is still forthcoming with the creation of a common data model and amalgamation of the huge "omics" database. The physicians entrusted with the care of children must be aware of the outcome so that they can play a role to ensure that big data algorithms have a clinically relevant effect in improving the health of their patients. They will apply the outcome of big data and its analysis in patient care through clinical algorithms or with the help of embedded clinical support alerts from the electronic medical records. IMPACT: Big data in pediatrics include structured, unstructured data, waveform data, biological, and social data. Big data analytics has unraveled significant information from these databases. This is changing how pediatricians will look at the body of available evidence and translate it into their clinical practice. Data harnessed so far is implemented in certain fields while in others it is in the process of development to become a clinical adjunct to the physician. Common databases are being prepared for future work. Diagnostic and prediction models when incorporated into the health system will guide the pediatrician to a targeted approach to diagnosis and therapy.
Collapse
|
23
|
Kirk D, Kok E, Tufano M, Tekinerdogan B, Feskens EJM, Camps G. Machine Learning in Nutrition Research. Adv Nutr 2022; 13:2573-2589. [PMID: 36166846 PMCID: PMC9776646 DOI: 10.1093/advances/nmac103] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Revised: 08/02/2022] [Accepted: 09/22/2022] [Indexed: 01/29/2023] Open
Abstract
Data currently generated in the field of nutrition are becoming increasingly complex and high-dimensional, bringing with them new methods of data analysis. The characteristics of machine learning (ML) make it suitable for such analysis and thus lend itself as an alternative tool to deal with data of this nature. ML has already been applied in important problem areas in nutrition, such as obesity, metabolic health, and malnutrition. Despite this, experts in nutrition are often without an understanding of ML, which limits its application and therefore potential to solve currently open questions. The current article aims to bridge this knowledge gap by supplying nutrition researchers with a resource to facilitate the use of ML in their research. ML is first explained and distinguished from existing solutions, with key examples of applications in the nutrition literature provided. Two case studies of domains in which ML is particularly applicable, precision nutrition and metabolomics, are then presented. Finally, a framework is outlined to guide interested researchers in integrating ML into their work. By acting as a resource to which researchers can refer, we hope to support the integration of ML in the field of nutrition to facilitate modern research.
Collapse
Affiliation(s)
- Daniel Kirk
- Division of Human Nutrition and Health, Wageningen University and Research, Wageningen, The Netherlands
| | - Esther Kok
- Division of Human Nutrition and Health, Wageningen University and Research, Wageningen, The Netherlands
| | - Michele Tufano
- Division of Human Nutrition and Health, Wageningen University and Research, Wageningen, The Netherlands
| | - Bedir Tekinerdogan
- Information Technology Group, Wageningen University and Research, Wageningen, The Netherlands
| | - Edith J M Feskens
- Division of Human Nutrition and Health, Wageningen University and Research, Wageningen, The Netherlands
| | - Guido Camps
- Division of Human Nutrition and Health, Wageningen University and Research, Wageningen, The Netherlands.,OnePlanet Research Center, Wageningen, The Netherlands
| |
Collapse
|
24
|
Predicting malnutrition from longitudinal patient trajectories with deep learning. PLoS One 2022; 17:e0271487. [PMID: 35901027 PMCID: PMC9333236 DOI: 10.1371/journal.pone.0271487] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Accepted: 07/04/2022] [Indexed: 11/19/2022] Open
Abstract
Malnutrition is common, morbid, and often correctable, but subject to missed and delayed diagnosis. Better screening and prediction could improve clinical, functional, and economic outcomes. This study aimed to assess the predictability of malnutrition from longitudinal patient records, and the external generalizability of a predictive model. Predictive models were developed and validated on statewide emergency department (ED) and hospital admission databases for California, Florida and New York, including visits from October 1, 2015 to December 31, 2018. Visit features included patient demographics, diagnosis codes, and procedure categories. Models included long short-term memory (LSTM) recurrent neural networks trained on longitudinal trajectories, and gradient-boosted tree and logistic regression models trained on cross-sectional patient data. The dataset used for model training and internal validation (California and Florida) included 62,811 patient trajectories (266,951 visits). Test sets included 63,997 (California), 63,112 (Florida), and 62,472 (New York) trajectories, such that each cohort’s composition was proportional to the prevalence of malnutrition in that state. Trajectories contained seven patient characteristics and up to 2,008 diagnosis categories. Area under the receiver-operating characteristic (AUROC) and precision-recall curves (AUPRC) were used to characterize prediction of first malnutrition diagnoses in the test sets. Data analysis was performed from September 2020 to May 2021. Between 4.0% (New York) and 6.2% (California) of patients received malnutrition diagnoses. The longitudinal LSTM model produced the most accurate predictions of malnutrition, with comparable predictive performance in California (AUROC 0.854, AUPRC 0.258), Florida (AUROC 0.869, AUPRC 0.234), and New York (AUROC 0.869, AUPRC 0.190). Deep learning models can reliably predict malnutrition from existing longitudinal patient records, with better predictive performance and lower data-collection requirements than existing instruments. This approach may facilitate early nutritional intervention via automated screening at the point of care.
Collapse
|
25
|
Performance of Machine Learning Classifiers in Classifying Stunting among Under-Five Children in Zambia. CHILDREN 2022; 9:children9071082. [PMID: 35884066 PMCID: PMC9320245 DOI: 10.3390/children9071082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Revised: 06/22/2022] [Accepted: 06/24/2022] [Indexed: 12/01/2022]
Abstract
Stunting is a global public health issue. We sought to train and evaluate machine learning (ML) classification algorithms on the Zambia Demographic Health Survey (ZDHS) dataset to predict stunting among children under the age of five in Zambia. We applied Logistic regression (LR), Random Forest (RF), SV classification (SVC), XG Boost (XgB) and Naïve Bayes (NB) algorithms to predict the probability of stunting among children under five years of age, on the 2018 ZDHS dataset. We calibrated predicted probabilities and plotted the calibration curves to compare model performance. We computed accuracy, recall, precision and F1 for each machine learning algorithm. About 2327 (34.2%) children were stunted. Thirteen of fifty-eight features were selected for inclusion in the model using random forest. Calibrating the predicted probabilities improved the performance of machine learning algorithms when evaluated using calibration curves. RF was the most accurate algorithm, with an accuracy score of 79% in the testing and 61.6% in the training data while Naïve Bayesian was the worst performing algorithm for predicting stunting among children under five in Zambia using the 2018 ZDHS dataset. ML models aids quick diagnosis of stunting and the timely development of interventions aimed at preventing stunting.
Collapse
|
26
|
Nel S, Feucht UD, Nel AL, Becker PJ, Wenhold FAM. A novel screening tool to predict severe acute malnutrition through automated monitoring of weight‐for‐age growth curves. MATERNAL & CHILD NUTRITION 2022; 18:e13364. [PMID: 35586991 PMCID: PMC9218329 DOI: 10.1111/mcn.13364] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Revised: 03/24/2022] [Accepted: 03/30/2022] [Indexed: 12/02/2022]
Abstract
Weight‐for‐age (WFA) growth faltering often precedes severe acute malnutrition (SAM) in children, yet it is often missed during routine growth monitoring. Automated interpretation of WFA growth within electronic health records could expedite the identification of children at risk of SAM. This study aimed to develop an automated screening tool to predict SAM risk from WFA growth, and to determine its predictive ability compared with simple changes in weight or WFA z‐score. To develop the screening tool, South African child growth experts (n = 30) rated SAM risk on 100 WFA growth curves, which were then used to train an artificial neural network (ANN) to assess SAM risk from consecutive WFA z‐scores. The ANN was validated in 185 children under five (63 SAM cases; 122 controls) using diagnostic accuracy methodology. The ANN's performance was compared with that of changes in weight or WFA z‐score. Even though experts' SAM risk ratings of the WFA growth curves differed considerably, the ANN achieved a sensitivity of 73.0% (95% confidence interval [CI]: 60.3; 83.4), specificity of 86.1% (95% CI: 78.6; 91.7) and receiver‐operating characteristic curve area of 0.795 (95% CI: 0.732; 0.859) during validation with real cases, outperforming changes in weight or WFA z‐scores. The ANN, as an automated screening tool, could markedly improve the identification of children at risk of SAM using routinely collected WFA growth information. Growth faltering preceding severe acute malnutrition (SAM) is poorly identified during routine growth monitoring, and opportunities for preventative interventions are missed. Consistently identifying growth faltering is challenging, even for experts in child growth. Digitized growth monitoring tools incorporating automated pattern recognition by an artificial neural network (ANN) can assist clinicians in identifying growth faltering. In this study, an ANN was trained to identify children at risk of SAM based on weight‐for‐age growth curves, with promising results. With further refinement, the ANN could greatly improve the identification of children with growth faltering and facilitate earlier interventions to prevent SAM.
Collapse
Affiliation(s)
- Sanja Nel
- Department of Human Nutrition, Faculty of Health Sciences University of Pretoria Pretoria South Africa
- Research Centre for Maternal, Fetal, Newborn & Child Health Care Strategies University of Pretoria Pretoria South Africa
- Maternal and Infant Health Care Strategies Unit South African Medical Research Council (SA MRC) Pretoria South Africa
| | - Ute D. Feucht
- Research Centre for Maternal, Fetal, Newborn & Child Health Care Strategies University of Pretoria Pretoria South Africa
- Maternal and Infant Health Care Strategies Unit South African Medical Research Council (SA MRC) Pretoria South Africa
- Department of Paediatrics University of Pretoria Pretoria South Africa
- District Clinical Specialist Team, Tshwane District Health Services Gauteng Department of Health Pretoria South Africa
| | - André L. Nel
- Department of Mechanical Engineering, Faculty of Engineering and Built Environment University of Johannesburg Johannesburg South Africa
| | - Piet J. Becker
- Research Office, Faculty of Health Sciences University of Pretoria Pretoria South Africa
| | - Friedeburg A. M. Wenhold
- Department of Human Nutrition, Faculty of Health Sciences University of Pretoria Pretoria South Africa
- Research Centre for Maternal, Fetal, Newborn & Child Health Care Strategies University of Pretoria Pretoria South Africa
- Maternal and Infant Health Care Strategies Unit South African Medical Research Council (SA MRC) Pretoria South Africa
| |
Collapse
|
27
|
Hossain MI, Habib MJ, Saleheen AAS, Kamruzzaman M, Rahman A, Roy S, Amit Hasan M, Haq I, Methun MIH, Nayan MIH, Rukonozzaman Rukon M. Performance Evaluation of Machine Learning Algorithm for Classification of Unintended Pregnancy among Married Women in Bangladesh. JOURNAL OF HEALTHCARE ENGINEERING 2022; 2022:1460908. [PMID: 35669979 PMCID: PMC9167128 DOI: 10.1155/2022/1460908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Accepted: 05/13/2022] [Indexed: 11/30/2022]
Abstract
Intended pregnancy is one of the significant indicators of women's well-being. Globally, 74 million women become pregnant every year without planning. Unintended pregnancies account for 28% of all pregnancies among married women in Bangladesh. This study aimed to investigate the performance of six different machine learning (ML) algorithms applied to predict unintended pregnancies among married women in Bangladesh. From BDHS 2017-18, only 1129 pregnant women aged 15-49 were eligible for this study. An independent χ 2 test had performed before we considered six popular ML algorithms, such as logistic regression (LR), random forest (RF), support vector machine (SVM), k-nearest neighbor (KNN), naïve Bayes (NB), and elastic net regression (ENR) to predict the unintended pregnancy. Accuracy, sensitivity, specificity, Cohen's Kappa statistic, and area under curve (AUC) value were used as model evaluation. The bivariate analysis result showed that women aged 30-49 years, poor, not educated, and living in male-headed households had a higher percentage of unintended pregnancy. We found various performance parameters for the classification of unintended pregnancy: LR accuracy = 79.29%, LR AUC = 72.12%; RF accuracy = 77.81%, RF AUC = 72.17%; SVM accuracy = 76.92%, SVM AUC = 70.90%; KNN accuracy = 77.22%, KNN AUC = 70.27%; NB accuracy = 78%, NB AUC = 73.06%; and ENR accuracy = 77.51%, ENR AUC = 74.67%. Based on the AUC value, we can conclude that of all the ML algorithms we investigated, the ENR algorithm provides the most accurate classification for predicting unwanted pregnancy among Bangladeshi women. Our findings contribute to a better understanding of how to categorize pregnancy intentions among Bangladeshi women. As a result, the government can initiate an effective campaign to raise contraception awareness.
Collapse
Affiliation(s)
| | - Md. Jakaria Habib
- Department of Statistics, Jagannath University, Dhaka 1100, Bangladesh
| | | | - Md. Kamruzzaman
- Department of Statistics, Jagannath University, Dhaka 1100, Bangladesh
| | - Azizur Rahman
- Department of Statistics, Jahangirnagar University, Savar, Dhaka, Bangladesh
| | - Sutopa Roy
- Department of Statistics, Jagannath University, Dhaka 1100, Bangladesh
| | - Md. Amit Hasan
- Department of Statistics, Jagannath University, Dhaka 1100, Bangladesh
| | - Iqramul Haq
- Department of Agricultural Statistics, Sher-e-Bangla Agricultural University, Dhaka 1207, Bangladesh
| | | | | | | |
Collapse
|
28
|
Siy Van VT, Antonio VA, Siguin CP, Gordoncillo NP, Sescon JT, Go CC, Miro EP. Predicting undernutrition among elementary schoolchildren in the Philippines using machine learning algorithms. Nutrition 2022; 96:111571. [DOI: 10.1016/j.nut.2021.111571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Revised: 11/23/2021] [Accepted: 12/05/2021] [Indexed: 11/30/2022]
|
29
|
Meitei AJ, Saini A, Mohapatra BB, Singh KJ. Predicting child anaemia in the North-Eastern states of India: a machine learning approach. INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT 2022; 13:2949-2962. [PMCID: PMC9441193 DOI: 10.1007/s13198-022-01765-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 07/14/2022] [Accepted: 08/08/2022] [Indexed: 01/07/2024]
Abstract
Child anaemia is a serious global health issue and India is one of the highest contributors among the developing nations. Researchers identify many harmful effects of anaemia, which include psychomotor retardation, which in turn decreases the learning ability and causes low intelligence among pre-school children. The effects also include behavioural delays, low immunity, and susceptibility to frequent infections, increased mortality, and disability. The present study aims to predict anaemia among children in North-East India by applying Machine Learning (ML) algorithms to latest available National Family Health Survey (NFHS)-4 data. Out of the total 29,312 eligible children (6–59 months) in North-East India, a total of 21,000 children with demographic variables without any missing observations, wherein 10,460 are anaemic, is considered for this study. Machine learning (ML) algorithms have been applied through 3 different types of penalized regression methods—ridge, least absolute shrinkage and selection operator, and elastic net for predicting anaemia. A systematic assessment of algorithms is performed in terms of accuracy, sensitivity, specificity, F1-Score, and Cohen’s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k$$\end{document} k -Statistics. Having achieved the receiver operating characteristic value of over 70% in training and accuracy of above 64% while testing, it can be safely asserted that factors like mother’s anaemic status, age of the child, social status, mother’s age, mother’s education, religion are important in identifying the child as anaemic.
Collapse
Affiliation(s)
- A. Jiran Meitei
- Department of Mathematics, Maharaja Agrasen College, University of Delhi, New Delhi, India
| | - Akanksha Saini
- Department of Operational Research, University of Delhi, New Delhi, Delhi 110007 India
| | | | | |
Collapse
|
30
|
Shi H, Yang D, Tang K, Hu C, Li L, Zhang L, Gong T, Cui Y. Explainable machine learning model for predicting the occurrence of postoperative malnutrition in children with congenital heart disease. Clin Nutr 2021; 41:202-210. [PMID: 34906845 DOI: 10.1016/j.clnu.2021.11.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Revised: 10/25/2021] [Accepted: 11/05/2021] [Indexed: 01/25/2023]
Abstract
BACKGROUND & AIMS Malnutrition is persistent in 50%-75% of children with congenital heart disease (CHD) after surgery, and early prediction is crucial for nutritional intervention. The aim of this study was to develop and validate machine learning (ML) models to predict the malnutrition status of children with CHD. We used explainable ML methods to provide insight into the model's predictions and outcomes. METHODS This prospective cohort study included consecutive children with CHD admitted to the hospital from December 2017 to May 2020. The cohort data were divided into the training and test data sets based on the follow-up time. The outcome of the study was CHD child malnutrition 1 year after surgery, the primary outcome was an underweight status, and the secondary outcomes were stunted and wasting status. We used five ML algorithms with multiple features to construct prediction models, and the performance of these ML models was measured by an area under the receiver operating characteristic curve (AUC) analysis. We also used the permutation importance and SHapley Additive exPlanations (SHAP) to determine the importance of the selected features and interpret the ML models. RESULTS We enrolled 536 children with CHD who underwent complete repair. The proportions of children with an underweight, stunted, or wasting status 1 year after surgery were 18.1% (97/536), 12.1% (65/536), and 17.5% (94/536), respectively. All patients contributed to the generation of 115 useable features, which allowed us to build models to predict malnutrition. Five prediction algorithms were used, and the XGBoost model achieved the greatest AUC in all outcomes. The results obtained from the permutation importance and SHAP analyses showed that the 1-month postoperative WAZ-score, discharge WAZ score and preoperative WAZ score were the top 3 important features in predicting an underweight status in the XGBoost algorithm. Regarding the stunted status, the top 3 important features were the 1-month postoperative HAZ score, discharge HAZ score, and aortic clamping time. Regarding the wasting status, the top 3 important features were the hospital length of stay, formula intake, and discharge WHZ-score. We also used a narrative case report as an example to describe the clinical manifestations and predicted the primary outcomes of two children. CONCLUSIONS We developed an ML model (XGBoost) that provides accurate early predictions of malnutrition 1-year postoperatively in children with CHD. Because the ML model is explainable, it may better enable clinicians to better understand the reasoning underlying the outcome. Our study could aid in determining individual treatment and nutritional follow-up strategies for children with CHD.
Collapse
Affiliation(s)
- Hui Shi
- Guangzhou Women and Children's Medical Center, Institute of Pediatrics, Guangzhou Medical University, No.9 Jinsui Road, Zhujiang Newtown, Tianhe District, Guangzhou, 510623, China; Department of Biostatistics and Epidemiology, School of Public Health, Sun Yat-sen University, Guangzhou, China
| | - Dong Yang
- Guangzhou AID Cloud Technology, No. 68 Huacheng Avenue, Tianhe District, Guangzhou, China
| | - Kaichen Tang
- Guangzhou AID Cloud Technology, No. 68 Huacheng Avenue, Tianhe District, Guangzhou, China
| | - Chunmei Hu
- Cardiac Intensive Care Unit, The Heart Center, Guangzhou Women and Children Medical Center, Guangzhou Medical University, No.9 Jinsui Road, Zhujiang Newtown, Tianhe District, Guangzhou 510623, China
| | - Lijuan Li
- Cardiac Intensive Care Unit, The Heart Center, Guangzhou Women and Children Medical Center, Guangzhou Medical University, No.9 Jinsui Road, Zhujiang Newtown, Tianhe District, Guangzhou 510623, China
| | - Linfang Zhang
- Cardiac Intensive Care Unit, The Heart Center, Guangzhou Women and Children Medical Center, Guangzhou Medical University, No.9 Jinsui Road, Zhujiang Newtown, Tianhe District, Guangzhou 510623, China
| | - Ting Gong
- Cardiac Intensive Care Unit, The Heart Center, Guangzhou Women and Children Medical Center, Guangzhou Medical University, No.9 Jinsui Road, Zhujiang Newtown, Tianhe District, Guangzhou 510623, China
| | - Yanqin Cui
- Cardiac Intensive Care Unit, The Heart Center, Guangzhou Women and Children Medical Center, Guangzhou Medical University, No.9 Jinsui Road, Zhujiang Newtown, Tianhe District, Guangzhou 510623, China; Department of Pediatric Surgery, Guangdong Provincial Key Laboratory of Research in Structural Birth Defect Disease, Guangzhou Women and Children's Medical Center, Guangzhou Medical University, Guangzhou, 510623, Guangdong, China.
| |
Collapse
|
31
|
Fenta HM, Zewotir T, Muluneh EK. A machine learning classifier approach for identifying the determinants of under-five child undernutrition in Ethiopian administrative zones. BMC Med Inform Decis Mak 2021; 21:291. [PMID: 34689769 PMCID: PMC8542294 DOI: 10.1186/s12911-021-01652-1] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Accepted: 10/04/2021] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Undernutrition is the main cause of child death in developing countries. This paper aimed to explore the efficacy of machine learning (ML) approaches in predicting under-five undernutrition in Ethiopian administrative zones and to identify the most important predictors. METHOD The study employed ML techniques using retrospective cross-sectional survey data from Ethiopia, a national-representative data collected in the year (2000, 2005, 2011, and 2016). We explored six commonly used ML algorithms; Logistic regression, Least Absolute Shrinkage and Selection Operator (L-1 regularization logistic regression), L-2 regularization (Ridge), Elastic net, neural network, and random forest (RF). Sensitivity, specificity, accuracy, and area under the curve were used to evaluate the performance of those models. RESULTS Based on different performance evaluations, the RF algorithm was selected as the best ML model. In the order of importance; urban-rural settlement, literacy rate of parents, and place of residence were the major determinants of disparities of nutritional status for under-five children among Ethiopian administrative zones. CONCLUSION Our results showed that the considered machine learning classification algorithms can effectively predict the under-five undernutrition status in Ethiopian administrative zones. Persistent under-five undernutrition status was found in the northern part of Ethiopia. The identification of such high-risk zones could provide useful information to decision-makers trying to reduce child undernutrition.
Collapse
Affiliation(s)
- Haile Mekonnen Fenta
- Department of Statistics, College of Science, Bahir Dar University, Bahir Dar, Ethiopia
| | - Temesgen Zewotir
- School of Mathematics, Statistics and Computer Science, College of Agriculture Engineering and Science, University of KwaZulu-Natal, Durban, South Africa
| | - Essey Kebede Muluneh
- School of Public Health, College of Medicine and Health Sciences, Bahir Dar University, Bahir Dar, Ethiopia
| |
Collapse
|
32
|
Abstract
OBJECTIVE Child undernutrition is a global public health problem with serious implications. In this study, we estimate predictive algorithms for the determinants of childhood stunting by using various machine learning (ML) algorithms. DESIGN This study draws on data from the Ethiopian Demographic and Health Survey of 2016. Five ML algorithms including eXtreme gradient boosting, k-nearest neighbours (k-NN), random forest, neural network and the generalised linear models were considered to predict the socio-demographic risk factors for undernutrition in Ethiopia. SETTING Households in Ethiopia. PARTICIPANTS A total of 9471 children below 5 years of age participated in this study. RESULTS The descriptive results show substantial regional variations in child stunting, wasting and underweight in Ethiopia. Also, among the five ML algorithms, xgbTree algorithm shows a better prediction ability than the generalised linear mixed algorithm. The best predicting algorithm (xgbTree) shows diverse important predictors of undernutrition across the three outcomes which include time to water source, anaemia history, child age greater than 30 months, small birth size and maternal underweight, among others. CONCLUSIONS The xgbTree algorithm was a reasonably superior ML algorithm for predicting childhood undernutrition in Ethiopia compared to other ML algorithms considered in this study. The findings support improvement in access to water supply, food security and fertility regulation, among others, in the quest to considerably improve childhood nutrition in Ethiopia.
Collapse
|
33
|
Côté M, Lamarche B. Artificial intelligence in nutrition research: perspectives on current and future applications. Appl Physiol Nutr Metab 2021; 47:1-8. [PMID: 34525321 DOI: 10.1139/apnm-2021-0448] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Artificial intelligence (AI) is a rapidly evolving area that offers unparalleled opportunities of progress and applications in many healthcare fields. In this review, we provide an overview of the main and latest applications of AI in nutrition research and identify gaps to address to potentialize this emerging field. AI algorithms may help better understand and predict the complex and non-linear interactions between nutrition-related data and health outcomes, particularly when large amounts of data need to be structured and integrated, such as in metabolomics. AI-based approaches, including image recognition, may also improve dietary assessment by maximizing efficiency and addressing systematic and random errors associated with self-reported measurements of dietary intakes. Finally, AI applications can extract, structure and analyze large amounts of data from social media platforms to better understand dietary behaviours and perceptions among the population. In summary, AI-based approaches will likely improve and advance nutrition research as well as help explore new applications. However, further research is needed to identify areas where AI does deliver added value compared with traditional approaches, and other areas where AI is simply not likely to advance the field. Novelty: Artificial intelligence offers unparalleled opportunities of progress and applications in nutrition. There remain gaps to address to potentialize this emerging field.
Collapse
Affiliation(s)
- Mélina Côté
- Centre de recherche Nutrition, santé et société (NUTRISS), INAF, Université Laval, Québec, QC, Canada
- School of Nutrition, Université Laval, Québec, QC, Canada
| | - Benoît Lamarche
- Centre de recherche Nutrition, santé et société (NUTRISS), INAF, Université Laval, Québec, QC, Canada
- School of Nutrition, Université Laval, Québec, QC, Canada
| |
Collapse
|
34
|
An Innovative Machine Learning Approach to Predict the Dietary Fiber Content of Packaged Foods. Nutrients 2021; 13:nu13093195. [PMID: 34579072 PMCID: PMC8470168 DOI: 10.3390/nu13093195] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2021] [Revised: 09/08/2021] [Accepted: 09/09/2021] [Indexed: 01/23/2023] Open
Abstract
Underconsumption of dietary fiber is prevalent worldwide and is associated with multiple adverse health conditions. Despite the importance of fiber, the labeling of fiber content on packaged foods and beverages is voluntary in most countries, making it challenging for consumers and policy makers to monitor fiber consumption. Here, we developed a machine learning approach for automated and systematic prediction of fiber content using nutrient information commonly available on packaged products. An Australian packaged food dataset with known fiber content information was divided into training (n = 8986) and test datasets (n = 2455). Utilization of a k-nearest neighbors machine learning algorithm explained a greater proportion of variance in fiber content than an existing manual fiber prediction approach (R2 = 0.84 vs. R2 = 0.68). Our findings highlight the opportunity to use machine learning to efficiently predict the fiber content of packaged products on a large scale.
Collapse
|
35
|
Kim K, Park H. Machine-learning models predicting osteoarthritis associated with the lead blood level. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2021; 28:44079-44084. [PMID: 33846921 DOI: 10.1007/s11356-021-13887-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 04/07/2021] [Indexed: 06/12/2023]
Abstract
Lead is one of the most hazardous environmental pollutants in industrialized countries; lead exposure is a risk factor for osteoarthritis (OA) in older women. Here, the performance of several machine-learning (ML) algorithms in terms of predicting the prevalence of OA associated with lead exposure was compared. A total of 2224 women aged 50 years and older who participated in the Korea National Health and Nutrition Examination Surveys from 2005 to 2017 were divided into a training dataset (70%) for generation of ML models, and a test dataset (30%). We built and tested five ML algorithms, including logistic regression (LR), a k-nearest neighbor model, a decision tree, a random forest, and a support vector machine. All afforded acceptable predictive accuracy; the LR model was the most accurate and yielded the greatest area under the receiver operating characteristic curve. We found that various ML models can be used to predict the risk of OA associated with lead exposure effectively, using data from population-based survey.
Collapse
Affiliation(s)
- Kisok Kim
- College of Pharmacy, Keimyung University, Daegu, 42601, Republic of Korea.
| | - Hyejin Park
- Department of International Healthcare Administration, Daegu Catholic University, Gyeongsan, 38430, Republic of Korea
| |
Collapse
|
36
|
Rahman SMJ, Ahmed NAMF, Abedin MM, Ahammed B, Ali M, Rahman MJ, Maniruzzaman M. Investigate the risk factors of stunting, wasting, and underweight among under-five Bangladeshi children and its prediction based on machine learning approach. PLoS One 2021; 16:e0253172. [PMID: 34138925 PMCID: PMC8211236 DOI: 10.1371/journal.pone.0253172] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Accepted: 05/28/2021] [Indexed: 11/23/2022] Open
Abstract
Aims Malnutrition is a major health issue among Bangladeshi under-five (U5) children. Children are malnourished if the calories and proteins they take through their diet are not sufficient for their growth and maintenance. The goal of the research was to use machine learning (ML) algorithms to detect the risk factors of malnutrition (stunted, wasted, and underweight) as well as their prediction. Methods This work utilized malnutrition data that was derived from Bangladesh Demographic and Health Survey which was conducted in 2014. The selected dataset consisted of 7079 children with 13 factors. The potential risks of malnutrition have been identified by logistic regression (LR). Moreover, 3 ML classifiers (support vector machine (SVM), random forest (RF), and LR) have been implemented for predicting malnutrition and the performance of these ML algorithms were assessed on the basis of accuracy. Results The average prevalence of stunted, wasted, and underweight was 35.4%, 15.4%, and 32.8%, respectively. It was noted that LR identified five risk factors for stunting and underweight, as well as four factors for wasting. Results illustrated that RF can be accurately classified as stunted, wasted, and underweight children and obtained the highest accuracy of 88.3% for stunted, 87.7% for wasted, and 85.7% for underweight. Conclusion This research focused on the identification and prediction of major risk factors for stunting, wasting, and underweight using ML algorithms which will aid policymakers in reducing malnutrition among Bangladesh’s U5 children.
Collapse
Affiliation(s)
| | | | | | - Benojir Ahammed
- Statistics Discipline, Khulna University, Khulna, Bangladesh
| | - Mohammad Ali
- Statistics Discipline, Khulna University, Khulna, Bangladesh
| | | | - Md. Maniruzzaman
- Statistics Discipline, Khulna University, Khulna, Bangladesh
- * E-mail:
| |
Collapse
|