1
|
Xu Q, Cai X, Yu R, Zheng Y, Chen G, Sun H, Gao T, Xu C, Sun J. Machine Learning-Based Risk Factor Analysis and Prediction Model Construction for the Occurrence of Chronic Heart Failure: Health Ecologic Study. JMIR Med Inform 2025; 13:e64972. [PMID: 39889299 PMCID: PMC11829185 DOI: 10.2196/64972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2024] [Revised: 12/04/2024] [Accepted: 12/25/2024] [Indexed: 02/02/2025] Open
Abstract
BACKGROUND Chronic heart failure (CHF) is a serious threat to human health, with high morbidity and mortality rates, imposing a heavy burden on the health care system and society. With the abundance of medical data and the rapid development of machine learning (ML) technologies, new opportunities are provided for in-depth investigation of the mechanisms of CHF and the construction of predictive models. The introduction of health ecology research methodology enables a comprehensive dissection of CHF risk factors from a wider range of environmental, social, and individual factors. This not only helps to identify high-risk groups at an early stage but also provides a scientific basis for the development of precise prevention and intervention strategies. OBJECTIVE This study aims to use ML to construct a predictive model of the risk of occurrence of CHF and analyze the risk of CHF from a health ecology perspective. METHODS This study sourced data from the Jackson Heart Study database. Stringent data preprocessing procedures were implemented, which included meticulous management of missing values and the standardization of data. Principal component analysis and random forest (RF) were used as feature selection techniques. Subsequently, several ML models, namely decision tree, RF, extreme gradient boosting, adaptive boosting (AdaBoost), support vector machine, naive Bayes model, multilayer perceptron, and bootstrap forest, were constructed, and their performance was evaluated. The effectiveness of the models was validated through internal validation using a 10-fold cross-validation approach on the training and validation sets. In addition, the performance metrics of each model, including accuracy, precision, sensitivity, F1-score, and area under the curve (AUC), were compared. After selecting the best model, we used hyperparameter optimization to construct a better model. RESULTS RF-selected features (21 in total) had an average root mean square error of 0.30, outperforming principal component analysis. Synthetic Minority Oversampling Technique and Edited Nearest Neighbors showed better accuracy in data balancing. The AdaBoost model was most effective with an AUC of 0.86, accuracy of 75.30%, precision of 0.86, sensitivity of 0.69, and F1-score of 0.76. Validation on the training and validation sets through 10-fold cross-validation gave an AUC of 0.97, an accuracy of 91.27%, a precision of 0.94, a sensitivity of 0.92, and an F1-score of 0.94. After random search processing, the accuracy and AUC of AdaBoost improved. Its accuracy was 77.68% and its AUC was 0.86. CONCLUSIONS This study offered insights into CHF risk prediction. Future research should focus on prospective studies, diverse data, advanced techniques, longitudinal studies, and exploring factor interactions for better CHF prevention and management.
Collapse
Affiliation(s)
- Qian Xu
- School of Medicine, Southeast University, Nanjing, China
| | - Xue Cai
- Department of Respiratory and Critical Care, Zhongda Hospital Southeast University, Nanjing, China
| | - Ruicong Yu
- School of Medicine, Southeast University, Nanjing, China
| | - Yueyue Zheng
- Department of Geriatrics, Zhongda Hospital Southeast University, Nanjing, China
| | - Guanjie Chen
- Department of Intensive Care, Zhongda Hospital Southeast University, Nanjing, China
| | - Hui Sun
- School of Medicine, Southeast University, Nanjing, China
| | - Tianyun Gao
- School of Medicine, Southeast University, Nanjing, China
| | - Cuirong Xu
- Department of Nursing, Zhongda Hospital Southeast University, Nanjing, China
| | - Jing Sun
- Rural Health Research Institute, Charles Sturt University, Orange, Australia
| |
Collapse
|
2
|
Yang G, Zhou Z, Ding A, Cai Y, Kong F, Xi Y, Liu N. MAPRS: An intelligent approach for post-prescription review based on multi-label learning. Artif Intell Med 2024; 157:102971. [PMID: 39265507 DOI: 10.1016/j.artmed.2024.102971] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Revised: 05/20/2024] [Accepted: 08/28/2024] [Indexed: 09/14/2024]
Abstract
Antimicrobial resistance (AMR) is a major threat to public health worldwide. It is a promising way to improve appropriate prescription by the review and stewardship of antimicrobials, and Post-Prescription Review (PPR) is currently the main tool used in hospitals. Existing methods of PPR typically focus on the dichotomy of antimicrobial prescription based on binary classification which, however, is usually a multi-label classification problem. Moreover, previous research did not explain the causes beneath the inappropriate antimicrobial used in the clinical setting, which could be practically important for problem location and decision improvement. In this paper, we collected antimicrobial prescriptions and related data from clean surgery in a hospital in northeastern China, and proposed a Multi-label Antimicrobial Post-Prescription Review System (MAPRS). MAPRS first uses NLP techniques to process unstructured data in prescriptions and explores the value of clinical record text for solving medical problems. Then, Classifier Chains are used to deal with multi-label problems and fused with machine learning algorithms to construct a classifier. At last, a SHAP explanation module is introduced to explain the inappropriate prescriptions. The experimental results show that MAPRS could achieve great performance in a challenging six-category multi-label task, with a subset accuracy of 90.7 % and an average AUROC of 94.3 %. Our results can help hospitals to perform intelligent prescription review and improve the antimicrobial stewardship.
Collapse
Affiliation(s)
- Guangfei Yang
- Central Hospital of Dalian University of Technology (Dalian Municipal Central Hospital), Dalian 116033, China; Institute of Systems Engineering, Dalian University of Technology, Dalian 116024, China.
| | - Ziyao Zhou
- Institute of Systems Engineering, Dalian University of Technology, Dalian 116024, China
| | - Aili Ding
- Central Hospital of Dalian University of Technology (Dalian Municipal Central Hospital), Dalian 116033, China.
| | - Yuanfeng Cai
- Zicklin School of Business, City University of New York--Baruch College, New York 10010, USA.
| | - Fanli Kong
- Central Hospital of Dalian University of Technology (Dalian Municipal Central Hospital), Dalian 116033, China
| | - Yalin Xi
- Central Hospital of Dalian University of Technology (Dalian Municipal Central Hospital), Dalian 116033, China
| | - Nannan Liu
- Central Hospital of Dalian University of Technology (Dalian Municipal Central Hospital), Dalian 116033, China
| |
Collapse
|
3
|
Zhang L, Yang L, Zhou Z. Data-based modeling for hypoglycemia prediction: Importance, trends, and implications for clinical practice. Front Public Health 2023; 11:1044059. [PMID: 36778566 PMCID: PMC9910805 DOI: 10.3389/fpubh.2023.1044059] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 01/10/2023] [Indexed: 01/27/2023] Open
Abstract
Background and objective Hypoglycemia is a key barrier to achieving optimal glycemic control in people with diabetes, which has been proven to cause a set of deleterious outcomes, such as impaired cognition, increased cardiovascular disease, and mortality. Hypoglycemia prediction has come to play a role in diabetes management as big data analysis and machine learning (ML) approaches have become increasingly prevalent in recent years. As a result, a review is needed to summarize the existing prediction algorithms and models to guide better clinical practice in hypoglycemia prevention. Materials and methods PubMed, EMBASE, and the Cochrane Library were searched for relevant studies published between 1 January 2015 and 8 December 2022. Five hypoglycemia prediction aspects were covered: real-time hypoglycemia, mild and severe hypoglycemia, nocturnal hypoglycemia, inpatient hypoglycemia, and other hypoglycemia (postprandial, exercise-related). Results From the 5,042 records retrieved, we included 79 studies in our analysis. Two major categories of prediction models are identified by an overview of the chosen studies: simple or logistic regression models based on clinical data and data-based ML models (continuous glucose monitoring data is most commonly used). Models utilizing clinical data have identified a variety of risk factors that can lead to hypoglycemic events. Data-driven models based on various techniques such as neural networks, autoregressive, ensemble learning, supervised learning, and mathematical formulas have also revealed suggestive features in cases of hypoglycemia prediction. Conclusion In this study, we looked deep into the currently established hypoglycemia prediction models and identified hypoglycemia risk factors from various perspectives, which may provide readers with a better understanding of future trends in this topic.
Collapse
|
4
|
Ren Z, Zhao Y, Han X, Yue M, Wang B, Zhao Z, Wen B, Hong Y, Wang Q, Hong Y, Zhao T, Wang N, Zhao P. An objective model for diagnosing comorbid cognitive impairment in patients with epilepsy based on the clinical-EEG functional connectivity features. Front Neurosci 2023; 16:1060814. [PMID: 36711136 PMCID: PMC9878185 DOI: 10.3389/fnins.2022.1060814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2022] [Accepted: 12/28/2022] [Indexed: 01/15/2023] Open
Abstract
Objective Cognitive impairment (CI) is a common disorder in patients with epilepsy (PWEs). Objective assessment method for diagnosing CI in PWEs would be beneficial in reality. This study proposed to construct a diagnostic model for CI in PWEs using the clinical and the phase locking value (PLV) functional connectivity features of the electroencephalogram (EEG). Methods PWEs who met the inclusion and exclusion criteria were divided into a cognitively normal (CON) group (n = 55) and a CI group (n = 76). The 23 clinical features and 684 PLV EEG features at the time of patient visit were screened and ranked using the Fisher score. Adaptive Boosting (AdaBoost) and Gradient Boosting Decision Tree (GBDT) were used as algorithms to construct diagnostic models of CI in PWEs either with pure clinical features, pure PLV EEG features, or combined clinical and PLV EEG features. The performance of these models was assessed using a five-fold cross-validation method. Results GBDT-built model with combined clinical and PLV EEG features performed the best with accuracy, precision, recall, F1-score, and an area under the curve (AUC) of 90.11, 93.40, 89.50, 91.39, and 0.95%. The top 5 features found to influence the model performance based on the Fisher scores were the magnetic resonance imaging (MRI) findings of the head for abnormalities, educational attainment, PLV EEG in the beta (β)-band C3-F4, seizure frequency, and PLV EEG in theta (θ)-band Fp1-Fz. A total of 12 of the top 5% of features exhibited statistically different PLV EEG features, while eight of which were PLV EEG features in the θ band. Conclusion The model constructed from the combined clinical and PLV EEG features could effectively identify CI in PWEs and possess the potential as a useful objective evaluation method. The PLV EEG in the θ band could be a potential biomarker for the complementary diagnosis of CI comorbid with epilepsy.
Collapse
Affiliation(s)
- Zhe Ren
- Department of Neurology, Zhengzhou University People’s Hospital, Zhengzhou, Henan, China
| | - Yibo Zhao
- Department of Neurology, Zhengzhou University People’s Hospital, Zhengzhou, Henan, China
| | - Xiong Han
- Department of Neurology, Henan Provincial People’s Hospital, People’s Hospital of Zhengzhou University, Zhengzhou, Henan, China,*Correspondence: Xiong Han,
| | - Mengyan Yue
- Department of Rehabilitation, The First Hospital of Shanxi Medical University, Taiyuan, Shanxi, China
| | - Bin Wang
- Department of Neurology, Henan Provincial People’s Hospital, People’s Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Zongya Zhao
- School of Medical Engineering, Xinxiang Medical University, Xinxiang, Henan, China
| | - Bin Wen
- School of Life Sciences and Technology, Xi’an Jiaotong University, Xi’an, Shaanxi, China
| | - Yang Hong
- Department of Neurology, People’s Hospital of Henan University, Zhengzhou, Henan, China
| | - Qi Wang
- Department of Neurology, Zhengzhou University People’s Hospital, Zhengzhou, Henan, China
| | - Yingxing Hong
- Department of Neurology, People’s Hospital of Henan University, Zhengzhou, Henan, China
| | - Ting Zhao
- Department of Neurology, Henan Provincial People’s Hospital, People’s Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Na Wang
- Department of Neurology, Henan Provincial People’s Hospital, People’s Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Pan Zhao
- Department of Neurology, Henan Provincial People’s Hospital, People’s Hospital of Zhengzhou University, Zhengzhou, Henan, China
| |
Collapse
|
5
|
Afsaneh E, Sharifdini A, Ghazzaghi H, Ghobadi MZ. Recent applications of machine learning and deep learning models in the prediction, diagnosis, and management of diabetes: a comprehensive review. Diabetol Metab Syndr 2022; 14:196. [PMID: 36572938 PMCID: PMC9793536 DOI: 10.1186/s13098-022-00969-9] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 12/16/2022] [Indexed: 12/28/2022] Open
Abstract
Diabetes as a metabolic illness can be characterized by increased amounts of blood glucose. This abnormal increase can lead to critical detriment to the other organs such as the kidneys, eyes, heart, nerves, and blood vessels. Therefore, its prediction, prognosis, and management are essential to prevent harmful effects and also recommend more useful treatments. For these goals, machine learning algorithms have found considerable attention and have been developed successfully. This review surveys the recently proposed machine learning (ML) and deep learning (DL) models for the objectives mentioned earlier. The reported results disclose that the ML and DL algorithms are promising approaches for controlling blood glucose and diabetes. However, they should be improved and employed in large datasets to affirm their applicability.
Collapse
|
6
|
Naveena S, Bharathi A. Weighted entropy deep features on hybrid RNN with LSTM for glucose level and diabetes prediction. Comput Methods Biomech Biomed Engin 2022; 26:1-25. [PMID: 36448678 DOI: 10.1080/10255842.2022.2149263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Accepted: 11/15/2022] [Indexed: 12/05/2022]
Abstract
Glucose level regulation with essential advice regarding diabetes must be provided to the patients to maintain their diet for diabetes treatment. Therefore, the academic community has focused on implementing novel glucose prediction techniques for decision support systems. Recent computational techniques for diagnosing diabetes have certain limitations, and also they are not evaluated under various datasets obtained from the different people of various countries. This generates inefficiency in the prediction systems to apply it in real-time applications. This paper plans to suggest a hybrid deep learning model for diabetes prediction and glucose level classification. Two benchmark datasets are used in the data collection process for experimenting. Initially, the deep selected features were extracted by the Convolutional Neural Network (CNN). Further, weighted entropy deep features are extracted, where the tuning of weight is taken place by the Modified Escaping Energy-based Harris Hawks Optimization. These features are processed in the glucose level classification using the modified Fuzzy classifier for classifying the high-level and low-level glucose. Further, glucose prediction is done by the Hybrid Recurrent Neural Network (RNN), and Long Short Term Memory (LSTM) termed R-LSTM with parameter optimization. From the experimental result, In the dataset 2 analyses on SMAPE, the MEE-HHO-R-LSTM is 12.5%, 87.5%, 50%, 12.5%, and 2.5% better than SVM, LSTM, DNN, RNN, and RNN-LSTM, at the learning percentage of 75%. The analytical results enforce that the suggested methods attain enhanced prediction performance concerning the evaluation metrics compared to conventional prediction models.
Collapse
Affiliation(s)
- Somasundaram Naveena
- Assistant Professor Senior Grade, Information Technology, Bannari Amman Institute of Technology, Sathyamangalam, India
| | - Ayyasamy Bharathi
- Professor, Information Technology, Bannari Amman Institute of Technology, Sathyamangalam, India
| |
Collapse
|
7
|
Survival Risk Prediction of Esophageal Cancer Based on the Kohonen Network Clustering Algorithm and Kernel Extreme Learning Machine. MATHEMATICS 2022. [DOI: 10.3390/math10091367] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Accurate prediction of the survival risk level of patients with esophageal cancer is significant for the selection of appropriate treatment methods. It contributes to improving the living quality and survival chance of patients. However, considering that the characteristics of blood index vary with individuals on the basis of their ages, personal habits and living environment etc., a unified artificial intelligence prediction model is not precisely adequate. In order to enhance the precision of the model on the prediction of esophageal cancer survival risk, this study proposes a different model based on the Kohonen network clustering algorithm and the kernel extreme learning machine (KELM), aiming to classifying the tested population into five catergories and provide better efficiency with the use of machine learning. Firstly, the Kohonen network clustering method was used to cluster the patient samples and five types of samples were obtained. Secondly, patients were divided into two risk levels based on 5-year net survival. Then, the Taylor formula was used to expand the theory to analyze the influence of different activation functions on the KELM modeling effect, and conduct experimental verification. RBF was selected as the activation function of the KELM. Finally, the adaptive mutation sparrow search algorithm (AMSSA) was used to optimize the model parameters. The experimental results were compared with the methods of the artificial bee colony optimized support vector machine (ABC-SVM), the three layers of random forest (TLRF), the gray relational analysis–particle swarm optimization support vector machine (GP-SVM) and the mixed-effects Cox model (Cox-LMM). The results showed that the prediction model proposed in this study had certain advantages in terms of prediction accuracy and running time, and could provide support for medical personnel to choose the treatment mode of esophageal cancer patients.
Collapse
|