1
|
de Oliveira EV, Aragão DP, Gonçalves LMG. A New Auto-Regressive Multi-Variable Modified Auto-Encoder for Multivariate Time-Series Prediction: A Case Study with Application to COVID-19 Pandemics. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2024; 21:497. [PMID: 38673408 PMCID: PMC11049878 DOI: 10.3390/ijerph21040497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Revised: 03/28/2024] [Accepted: 04/05/2024] [Indexed: 04/28/2024]
Abstract
The SARS-CoV-2 global pandemic prompted governments, institutions, and researchers to investigate its impact, developing strategies based on general indicators to make the most precise predictions possible. Approaches based on epidemiological models were used but the outcomes demonstrated forecasting with uncertainty due to insufficient or missing data. Besides the lack of data, machine-learning models including random forest, support vector regression, LSTM, Auto-encoders, and traditional time-series models such as Prophet and ARIMA were employed in the task, achieving remarkable results with limited effectiveness. Some of these methodologies have precision constraints in dealing with multi-variable inputs, which are important for problems like pandemics that require short and long-term forecasting. Given the under-supply in this scenario, we propose a novel approach for time-series prediction based on stacking auto-encoder structures using three variations of the same model for the training step and weight adjustment to evaluate its forecasting performance. We conducted comparison experiments with previously published data on COVID-19 cases, deaths, temperature, humidity, and air quality index (AQI) in São Paulo City, Brazil. Additionally, we used the percentage of COVID-19 cases from the top ten affected countries worldwide until May 4th, 2020. The results show 80.7% and 10.3% decrease in RMSE to entire and test data over the distribution of 50 trial-trained models, respectively, compared to the first experiment comparison. Also, model type#3 achieved 4th better overall ranking performance, overcoming the NBEATS, Prophet, and Glounts time-series models in the second experiment comparison. This model shows promising forecast capacity and versatility across different input dataset lengths, making it a prominent forecasting model for time-series tasks.
Collapse
Affiliation(s)
| | | | - Luiz Marcos Garcia Gonçalves
- Department of Computer Engineering and Automation, Federal University of Rio Grande do Norte, Av. Salgado Filho, 3000, Campus Universitário, Lagoa Nova, Natal 59078-970, RN, Brazil; (E.V.d.O.); (D.P.A.)
| |
Collapse
|
2
|
Rao GM, Ramesh D, Sharma V, Sinha A, Hassan MM, Gandomi AH. AttGRU-HMSI: enhancing heart disease diagnosis using hybrid deep learning approach. Sci Rep 2024; 14:7833. [PMID: 38570560 PMCID: PMC10991318 DOI: 10.1038/s41598-024-56931-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Accepted: 03/12/2024] [Indexed: 04/05/2024] Open
Abstract
Heart disease is a major global cause of mortality and a major public health problem for a large number of individuals. A major issue raised by regular clinical data analysis is the recognition of cardiovascular illnesses, including heart attacks and coronary artery disease, even though early identification of heart disease can save many lives. Accurate forecasting and decision assistance may be achieved in an effective manner with machine learning (ML). Big Data, or the vast amounts of data generated by the health sector, may assist models used to make diagnostic choices by revealing hidden information or intricate patterns. This paper uses a hybrid deep learning algorithm to describe a large data analysis and visualization approach for heart disease detection. The proposed approach is intended for use with big data systems, such as Apache Hadoop. An extensive medical data collection is first subjected to an improved k-means clustering (IKC) method to remove outliers, and the remaining class distribution is then balanced using the synthetic minority over-sampling technique (SMOTE). The next step is to forecast the disease using a bio-inspired hybrid mutation-based swarm intelligence (HMSI) with an attention-based gated recurrent unit network (AttGRU) model after recursive feature elimination (RFE) has determined which features are most important. In our implementation, we compare four machine learning algorithms: SAE + ANN (sparse autoencoder + artificial neural network), LR (logistic regression), KNN (K-nearest neighbour), and naïve Bayes. The experiment results indicate that a 95.42% accuracy rate for the hybrid model's suggested heart disease prediction is attained, which effectively outperforms and overcomes the prescribed research gap in mentioned related work.
Collapse
Affiliation(s)
- G Madhukar Rao
- Department of Computer Science and Engineering, Indian Institute of Technology (Indian School of Mines), Dhanbad, Jharkhand, 826004, India
- Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Hyderabad, 500075, India
| | - Dharavath Ramesh
- Department of Computer Science and Engineering, Indian Institute of Technology (Indian School of Mines), Dhanbad, Jharkhand, 826004, India
- Department of Computer Science, University of Economics and Human Sciences, Warsaw, Poland
| | - Vandana Sharma
- Computer Science Department, Christ University, Delhi NCR Campus, Ghaziabad, Delhi NCR, India
| | - Anurag Sinha
- Department of Computer Science, ICFAI Tech School, ICFAI University, Ranchi, Jharkhand, India
| | - Md Mehedi Hassan
- Computer Science and Engineering, Discipline Khulna University, Khulna, 9208, Bangladesh
| | - Amir H Gandomi
- Faculty of Engineering and Information Technology, University of Technology Sydney, Ultimo, NSW, 2007, Australia.
- University Research and Innovation Center (EKIK), Óbuda University, 1034, Budapest, Hungary.
| |
Collapse
|
3
|
Naseri A, Tax D, van der Harst P, Reinders M, van der Bilt I. Data-efficient machine learning methods in the ME-TIME study: Rationale and design of a longitudinal study to detect atrial fibrillation and heart failure from wearables. CARDIOVASCULAR DIGITAL HEALTH JOURNAL 2023; 4:165-172. [PMID: 38222103 PMCID: PMC10787149 DOI: 10.1016/j.cvdhj.2023.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2024] Open
Abstract
Background Smartwatches enable continuous and noninvasive time series monitoring of cardiovascular biomarkers like heart rate (from photoplethysmograms), step counter, skin temperature, et cetera; as such, they have promise in assisting in early detection and prevention of cardiovascular disease. Although these biomarkers may not be directly useful to physicians, a machine learning (ML) model could find clinically relevant patterns. Unfortunately, ML models typically need supervised (ie, annotated) data, and labeling of large amounts of continuous data is very labor intensive. Therefore, ML methods that are data efficient, ie, needing a low number of labels, are required to detect potential clinical value in patterns found in wearable data. Objective The primary study objective of the ME-TIME (Machine Learning Enabled Time Series Analysis in Medicine) study is to design an ML model that can detect atrial fibrillation (AF) and heart failure (HF) from wearable data in a data-efficient manner. To achieve this, self-supervised and weakly supervised learning techniques are used. Methods Two hundred subjects (100 reference, 50 AF, and 50 HF) are being invited to participate in wearing a Fitbit fitness tracker for 3 months. Interested volunteers are sent a questionnaire to determine their health, in particular cardiovascular health. Volunteers without any (history of) serious illness are assigned to the reference group. Participants with AF and HF are recruited in the Haga teaching hospital in The Hague, The Netherlands. Results Enrollment commenced on May 1, 2022, and as of the time of this report, 62 subjects have been included in the study. Preliminary analysis of the data reveals significant inter-subject variability. Notably, we identified heart rate recovery curves and time-delayed correlations between heart rate and step count as potential strong indicators for heart disease. Conclusion Using self-supervised and multiple-instance learning techniques, we hypothesize that patterns specific to AF and HF can be found in continuous data obtained from smartwatches.
Collapse
Affiliation(s)
- Arman Naseri
- Department of Cardiology, Haga Teaching Hospital, The Hague, The Netherlands
- Pattern Recognition and Bioinformatics, Delft University of Technology, Delft, The Netherlands
| | - David Tax
- Pattern Recognition and Bioinformatics, Delft University of Technology, Delft, The Netherlands
| | - Pim van der Harst
- Department of Cardiology, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Marcel Reinders
- Pattern Recognition and Bioinformatics, Delft University of Technology, Delft, The Netherlands
| | - Ivo van der Bilt
- Department of Cardiology, Haga Teaching Hospital, The Hague, The Netherlands
- Department of Cardiology, University Medical Center Utrecht, Utrecht, The Netherlands
| |
Collapse
|
4
|
Gao W, Shen J, Lin Y, Wang K, Lin Z, Tang H, Chen X. Sequential sparse autoencoder for dynamic heading representation in ventral intraparietal area. Comput Biol Med 2023; 163:107114. [PMID: 37329620 DOI: 10.1016/j.compbiomed.2023.107114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Revised: 05/12/2023] [Accepted: 05/30/2023] [Indexed: 06/19/2023]
Abstract
To navigate in space, it is important to predict headings in real-time from neural responses in the brain to vestibular and visual signals, and the ventral intraparietal area (VIP) is one of the critical brain areas. However, it remains unexplored in the population level how the heading perception is represented in VIP. And there are no commonly used methods suitable for decoding the headings from the population responses in VIP, given the large spatiotemporal dynamics and heterogeneity in the neural responses. Here, responses were recorded from 210 VIP neurons in three rhesus monkeys when they were performing a heading perception task. And by specifically and separately modelling the both dynamics with sparse representation, we built a sequential sparse autoencoder (SSAE) to do the population decoding on the recorded dataset and tried to maximize the decoding performance. The SSAE relies on a three-layer sparse autoencoder to extract temporal and spatial heading features in the dataset via unsupervised learning, and a softmax classifier to decode the headings. Compared with other population decoding methods, the SSAE achieves a leading accuracy of 96.8% ± 2.1%, and shows the advantages of robustness, low storage and computing burden for real-time prediction. Therefore, our SSAE model performs well in learning neurobiologically plausible features comprising dynamic navigational information.
Collapse
Affiliation(s)
- Wei Gao
- Department of Neurology and Psychiatry of the Second Affiliated Hospital, College of Biomedical Engineering and Instrument Science, Interdisciplinary Institute of Neuroscience and Technology, School of Medicine, Zhejiang University, 268 Kaixuan Road, Jianggan District, Hangzhou, 310029, China
| | - Jiangrong Shen
- College of Computer Science and Technology, Zhejiang University, 38 Zheda Road, Xihu District, Hangzhou, 310027, China
| | - Yipeng Lin
- Department of Neurology and Psychiatry of the Second Affiliated Hospital, College of Biomedical Engineering and Instrument Science, Interdisciplinary Institute of Neuroscience and Technology, School of Medicine, Zhejiang University, 268 Kaixuan Road, Jianggan District, Hangzhou, 310029, China
| | - Kejun Wang
- School of Software Technology, Zhejiang University, 38 Zheda Road, Xihu District, Hangzhou, 310027, China
| | - Zheng Lin
- Department of Psychiatry, Second Affiliated Hospital, School of Medicine, Zhejiang University, 88 Jiefang Road, Shangcheng District, Hangzhou, 310009, China
| | - Huajin Tang
- College of Computer Science and Technology, Zhejiang University, 38 Zheda Road, Xihu District, Hangzhou, 310027, China.
| | - Xiaodong Chen
- Department of Neurology and Psychiatry of the Second Affiliated Hospital, College of Biomedical Engineering and Instrument Science, Interdisciplinary Institute of Neuroscience and Technology, School of Medicine, Zhejiang University, 268 Kaixuan Road, Jianggan District, Hangzhou, 310029, China.
| |
Collapse
|
5
|
An intelligent heart disease prediction system using hybrid deep dense Aquila network. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
|
6
|
Le TD, Noumeir R, Rambaud J, Sans G, Jouvet P. Adaptation of Autoencoder for Sparsity Reduction From Clinical Notes Representation Learning. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE 2023; 11:469-478. [PMID: 37817825 PMCID: PMC10561736 DOI: 10.1109/jtehm.2023.3241635] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/01/2022] [Revised: 01/08/2023] [Accepted: 01/30/2023] [Indexed: 10/12/2023]
Abstract
When dealing with clinical text classification on a small dataset, recent studies have confirmed that a well-tuned multilayer perceptron outperforms other generative classifiers, including deep learning ones. To increase the performance of the neural network classifier, feature selection for the learning representation can effectively be used. However, most feature selection methods only estimate the degree of linear dependency between variables and select the best features based on univariate statistical tests. Furthermore, the sparsity of the feature space involved in the learning representation is ignored. GOAL Our aim is, therefore, to access an alternative approach to tackle the sparsity by compressing the clinical representation feature space, where limited French clinical notes can also be dealt with effectively. METHODS This study proposed an autoencoder learning algorithm to take advantage of sparsity reduction in clinical note representation. The motivation was to determine how to compress sparse, high-dimensional data by reducing the dimension of the clinical note representation feature space. The classification performance of the classifiers was then evaluated in the trained and compressed feature space. RESULTS The proposed approach provided overall performance gains of up to 3% for each test set evaluation. Finally, the classifier achieved 92% accuracy, 91% recall, 91% precision, and 91% f1-score in detecting the patient's condition. Furthermore, the compression working mechanism and the autoencoder prediction process were demonstrated by applying the theoretic information bottleneck framework. Clinical and Translational Impact Statement- An autoencoder learning algorithm effectively tackles the problem of sparsity in the representation feature space from a small clinical narrative dataset. Significantly, it can learn the best representation of the training data because of its lossless compression capacity compared to other approaches. Consequently, its downstream classification ability can be significantly improved, which cannot be done using deep learning models.
Collapse
Affiliation(s)
- Thanh-Dung Le
- Biomedical Information Processing Laboratory, Ecole de Technologie SuperieureUniversity of QuebecMontrealQCH3C 1K3Canada
- Research Center at CHU Sainte-JustineUniversity of MontrealMontrealQCH3T 1J4Canada
| | - Rita Noumeir
- Biomedical Information Processing Laboratory, Ecole de Technologie SuperieureUniversity of QuebecMontrealQCH3C 1K3Canada
| | - Jerome Rambaud
- Research Center at CHU Sainte-JustineUniversity of MontrealMontrealQCH3T 1J4Canada
| | - Guillaume Sans
- Research Center at CHU Sainte-JustineUniversity of MontrealMontrealQCH3T 1J4Canada
| | - Philippe Jouvet
- Research Center at CHU Sainte-JustineUniversity of MontrealMontrealQCH3T 1J4Canada
| |
Collapse
|
7
|
Sarra RR, Dinar AM, Mohammed MA, Ghani MKA, Albahar MA. A Robust Framework for Data Generative and Heart Disease Prediction Based on Efficient Deep Learning Models. Diagnostics (Basel) 2022; 12:diagnostics12122899. [PMID: 36552906 PMCID: PMC9777498 DOI: 10.3390/diagnostics12122899] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Revised: 11/15/2022] [Accepted: 11/18/2022] [Indexed: 11/23/2022] Open
Abstract
Biomarkers including fasting blood sugar, heart rate, electrocardiogram (ECG), blood pressure, etc. are essential in the heart disease (HD) diagnosing. Using wearable sensors, these measures are collected and applied as inputs to a deep learning (DL) model for HD diagnosis. However, it is observed that model accuracy weakens when the data gathered are scarce or imbalanced. Therefore, this work proposes two DL-based frameworks, GAN-1D-CNN, and GAN-Bi-LSTM. These frameworks contain: (1) a generative adversarial network (GAN) and (2) a one-dimensional convolutional neural network (1D-CNN) or bi-directional long short-term memory (Bi-LSTM). The GAN model is utilized to augment the small and imbalanced dataset, which is the Cleveland dataset. The 1D-CNN and Bi-LSTM models are then trained using the enlarged dataset to diagnose HD. Unlike previous works, the proposed frameworks increase the dataset first to avoid the prediction bias caused by the limited data. The GAN-1D-CNN achieved 99.1% accuracy, specificity, sensitivity, F1-score, and 100% area under the curve (AUC). Similarly, the GAN-Bi-LSTM obtained 99.3% accuracy, 99.2% specificity, 99.3% sensitivity, 99.2% F1-score, and 100% AUC. Furthermore, time complexity of proposed frameworks is investigated with and without principal component analysis (PCA). The PCA method reduced prediction times for 61 samples using GAN-1D-CNN and GAN-Bi-LSTM to 68.8 and 74.8 ms, respectively. These results show that it is reliable to use our frameworks for augmenting limited data and predicting heart disease.
Collapse
Affiliation(s)
- Raniya R. Sarra
- Computer Engineering Department, University of Technology, Baghdad 00964, Iraq
| | - Ahmed M. Dinar
- Computer Engineering Department, University of Technology, Baghdad 00964, Iraq
- Correspondence: ; Tel.: +964-770-307-2072
| | - Mazin Abed Mohammed
- College of Computer Science and Information Technology, University of Anbar, Ramadi 31001, Iraq
| | - Mohd Khanapi Abd Ghani
- Biomedical Computing and Engineering Technologies (BIOCORE) Applied Research Group, Faculty of Information and Communication Technology, Universiti Teknikal Malaysia Melaka, Durian Tunggal 76100, Malaysia
| | - Marwan Ali Albahar
- Department of Computer Science, Umm Al Qura University, Mecca 24211, Saudi Arabia
| |
Collapse
|
8
|
Mixed Machine Learning Approach for Efficient Prediction of Human Heart Disease by Identifying the Numerical and Categorical Features. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12157449] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Heart disease is a danger to people’s health because of its prevalence and high mortality risk. Predicting cardiac disease early using a few simple physical indications collected from a routine physical examination has become difficult. Clinically, it is critical and sensitive for the signs of heart disease for accurate forecasts and concrete steps for future diagnosis. The manual analysis and prediction of a massive volume of data are challenging and time-consuming. In this paper, a unique heart disease prediction model is proposed to predict heart disease correctly and rapidly using a variety of bodily signs. A heart disease prediction algorithm based on the analysis of the predictive models’ classification performance on combined datasets and the train-test split technique is presented. Finally, the proposed technique’s training results are compared with the previous works. For the Cleveland, Switzerland, Hungarian, and Long Beach VA heart disease datasets, accuracy, precision, recall, F1-score, and ROC-AUC curves are used as the performance indicators. The analytical outcomes for Random Forest Classifiers (RFC) of the combined heart disease datasets are F1-score 100%, accuracy 100%, precision 100%, recall 100%, and the ROC-AUC 100%. The Decision Tree Classifiers for pooled heart disease datasets are F1-score 100%, accuracy 98.80%, precision 98%, recall 99%, ROC-AUC 99%, and for RFC and Gradient Boosting Classifiers (GBC), the ROC-AUC gives 100% performance. The performances of the machine learning algorithms are improved by using five-fold cross validation. Again, the Stacking CV Classifier is also used to improve the performances of the individual machine learning algorithms by combining two and three techniques together. In this paper, several reduction methods are incorporated. It is found that the accuracy of the RFC classification algorithm is high. Moreover, the developed method is efficient and reliable for predicting heart disease.
Collapse
|
9
|
Deepika D, Balaji N. Effective heart disease prediction with Grey-wolf with Firefly algorithm-differential evolution (GF-DE) for feature selection and weighted ANN classification. Comput Methods Biomech Biomed Engin 2022; 25:1409-1427. [PMID: 35652537 DOI: 10.1080/10255842.2022.2078966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
In recent time, heart disease has become common leading to mortality of many individuals. Hence, early and accurate prediction of this disease is vital to reduce death rate and enhance people's lives. Concurrently, Artificial Intelligence has gained more attention at present as it permits deeper understanding of the healthcare data thereby providing accurate prediction results. This efficient prediction will solve complicated queries regarding heart diseases and hence assists clinical practitioners to adopt smart medical decisions. Hence, this study intends to predict heart disease with high accuracy by proposing an improved feature selection and enhanced classification approach. The paper employs Grey-wolf with Firefly algorithm for effective feature selection and using Differential Evolution Algorithm for tuning the hyper parameters of Artificial Neural Network (ANN). Hence, it is named as Grey Wolf Firefly algorithm with Differential Evolution (GF-DE) for better classification of the selected features. This proposed classification model trains the neural network to obtain optimal weights and tunes huge number of hyper parameters in an efficiently. To prove this, the proposed system is comparatively analysed with existing methods in terms of performance metrics like accuracy, precision, recall and F1 score for Cleveland and Statlog dataset. In addition, statistical analysis is also undertaken to analyse the significance of proposed system. Outcomes revealed the efficiency of proposed method which makes it highly suitable for heart disease prediction in an efficient manner.
Collapse
Affiliation(s)
- D Deepika
- Research Scholar, Anna University, Chennai, India
| | - N Balaji
- Professor, Computer Science and Engineering, Velammal Institute of Technology, Chennai, India
| |
Collapse
|
10
|
Aruleba RT, Adekiya TA, Ayawei N, Obaido G, Aruleba K, Mienye ID, Aruleba I, Ogbuokiri B. COVID-19 Diagnosis: A Review of Rapid Antigen, RT-PCR and Artificial Intelligence Methods. Bioengineering (Basel) 2022; 9:153. [PMID: 35447713 PMCID: PMC9024895 DOI: 10.3390/bioengineering9040153] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 03/22/2022] [Accepted: 03/23/2022] [Indexed: 12/15/2022] Open
Abstract
As of 27 December 2021, SARS-CoV-2 has infected over 278 million persons and caused 5.3 million deaths. Since the outbreak of COVID-19, different methods, from medical to artificial intelligence, have been used for its detection, diagnosis, and surveillance. Meanwhile, fast and efficient point-of-care (POC) testing and self-testing kits have become necessary in the fight against COVID-19 and to assist healthcare personnel and governments curb the spread of the virus. This paper presents a review of the various types of COVID-19 detection methods, diagnostic technologies, and surveillance approaches that have been used or proposed. The review provided in this article should be beneficial to researchers in this field and health policymakers at large.
Collapse
Affiliation(s)
- Raphael Taiwo Aruleba
- Department of Molecular and Cell Biology, Faculty of Science, University of Cape Town, Cape Town 7701, South Africa;
| | - Tayo Alex Adekiya
- Department of Pharmacy and Pharmacology, School of Therapeutic Science, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, 7 York Road, Parktown 2193, South Africa;
| | - Nimibofa Ayawei
- Department of Chemistry, Bayelsa Medical University, Yenagoa PMB 178, Bayelsa State, Nigeria;
| | - George Obaido
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA 92093-0404, USA
| | - Kehinde Aruleba
- School of Computing and Mathematical Sciences, University of Leicester, Leicester LE1 7RH, UK
| | - Ibomoiye Domor Mienye
- Department of Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg 2006, South Africa; (I.D.M.); (I.A.)
| | - Idowu Aruleba
- Department of Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg 2006, South Africa; (I.D.M.); (I.A.)
| | - Blessing Ogbuokiri
- Department of Mathematics and Statistics, York University, Toronto, ON M3J 1P3, Canada;
| |
Collapse
|
11
|
Sperti M, Malavolta M, Staunovo Polacco F, Dellavalle A, Ruggieri R, Bergia S, Fazio A, Santoro C, Deriu MA. Cardiovascular risk prediction: from classical statistical methods to machine learning approaches. Minerva Cardiol Angiol 2022; 70:102-122. [PMID: 35261223 DOI: 10.23736/s2724-5683.21.05868-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Nowadays, cardiovascular risk prediction scores are commonly used in primary prevention settings. Estimating the cardiovascular individual risk is of crucial importance for effective patient management and optimal therapy identification, with relevant consequences on secondary prevention settings. To reach this goal, a plethora of risk scores have been developed in the past, most of them assuming that each cardiovascular risk factor is linearly dependent on the outcome. However, the overall accuracy of these methods often remains insufficient to solve the problem at hand. In this scenario, machine learning techniques have repeatedly proved successful in improving cardiovascular risk predictions, being able to capture the non-linearity present in the data. In this concern, we present a detailed discussion concerning the application of classical versus machine learning-based cardiovascular risk scores in the clinical setting. This review aimed to give an overview of the current risk scores based on classical statistical approaches and machine learning techniques applied to predict the risk of several cardiovascular diseases, comparing them, discussing their similarities and differences, and highlighting their main drawbacks to aid the physician having a more critical understanding of these tools.
Collapse
Affiliation(s)
- Michela Sperti
- Department of Mechanical and Aerospace Engineering, PolitoBio MedLab, Polytechnic University of Turin, Turin, Italy
| | - Marta Malavolta
- Department of Mechanical and Aerospace Engineering, PolitoBio MedLab, Polytechnic University of Turin, Turin, Italy
| | - Federica Staunovo Polacco
- Department of Mechanical and Aerospace Engineering, PolitoBio MedLab, Polytechnic University of Turin, Turin, Italy
| | - Annalisa Dellavalle
- Department of Mechanical and Aerospace Engineering, PolitoBio MedLab, Polytechnic University of Turin, Turin, Italy
| | - Rossella Ruggieri
- Department of Mechanical and Aerospace Engineering, PolitoBio MedLab, Polytechnic University of Turin, Turin, Italy
| | - Sara Bergia
- Department of Mechanical and Aerospace Engineering, PolitoBio MedLab, Polytechnic University of Turin, Turin, Italy
| | - Alice Fazio
- Department of Mechanical and Aerospace Engineering, PolitoBio MedLab, Polytechnic University of Turin, Turin, Italy
| | - Carmine Santoro
- Department of Mechanical and Aerospace Engineering, PolitoBio MedLab, Polytechnic University of Turin, Turin, Italy
| | - Marco A Deriu
- Department of Mechanical and Aerospace Engineering, PolitoBio MedLab, Polytechnic University of Turin, Turin, Italy -
| |
Collapse
|
12
|
J L G, Abraham B, M S S, Nair MS. A computer-aided diagnosis system for the classification of COVID-19 and non-COVID-19 pneumonia on chest X-ray images by integrating CNN with sparse autoencoder and feed forward neural network. Comput Biol Med 2021; 141:105134. [PMID: 34971978 PMCID: PMC8668604 DOI: 10.1016/j.compbiomed.2021.105134] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Revised: 11/19/2021] [Accepted: 12/10/2021] [Indexed: 12/15/2022]
Abstract
Several infectious diseases have affected the lives of many people and have caused great dilemmas all over the world. COVID-19 was declared a pandemic caused by a newly discovered virus named Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) by the World Health Organisation in 2019. RT-PCR is considered the golden standard for COVID-19 detection. Due to the limited RT-PCR resources, early diagnosis of the disease has become a challenge. Radiographic images such as Ultrasound, CT scans, X-rays can be used for the detection of the deathly disease. Developing deep learning models using radiographic images for detecting COVID-19 can assist in countering the outbreak of the virus. This paper presents a computer-aided detection model utilizing chest X-ray images for combating the pandemic. Several pre-trained networks and their combinations have been used for developing the model. The method uses features extracted from pre-trained networks along with Sparse autoencoder for dimensionality reduction and a Feed Forward Neural Network (FFNN) for the detection of COVID-19. Two publicly available chest X-ray image datasets, consisting of 504 COVID-19 images and 542 non-COVID-19 images, have been combined to train the model. The method was able to achieve an accuracy of 0.9578 and an AUC of 0.9821, using the combination of InceptionResnetV2 and Xception. Experiments have proved that the accuracy of the model improves with the usage of sparse autoencoder as the dimensionality reduction technique.
Collapse
Affiliation(s)
- Gayathri J L
- Department of Computer Science and Engineering, College of Engineering Perumon, Kollam, 691 601, Kerala, India.
| | - Bejoy Abraham
- Department of Computer Science and Engineering, College of Engineering Perumon, Kollam, 691 601, Kerala, India.
| | - Sujarani M S
- Department of Computer Science and Engineering, College of Engineering Perumon, Kollam, 691 601, Kerala, India
| | - Madhu S Nair
- Artificial Intelligence & Computer Vision Lab, Department of Computer Science, Cochin University of Science and Technology, Kochi, 682 022, Kerala, India
| |
Collapse
|
13
|
Improved Heart Disease Prediction Using Particle Swarm Optimization Based Stacked Sparse Autoencoder. ELECTRONICS 2021. [DOI: 10.3390/electronics10192347] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Heart disease is the leading cause of death globally. The most common type of heart disease is coronary heart disease, which occurs when there is a build-up of plaque inside the arteries that supply blood to the heart, making blood circulation difficult. The prediction of heart disease is a challenge in clinical machine learning. Early detection of people at risk of the disease is vital in preventing its progression. This paper proposes a deep learning approach to achieve improved prediction of heart disease. An enhanced stacked sparse autoencoder network (SSAE) is developed to achieve efficient feature learning. The network consists of multiple sparse autoencoders and a softmax classifier. Additionally, in deep learning models, the algorithm’s parameters need to be optimized appropriately to obtain efficient performance. Hence, we propose a particle swarm optimization (PSO) based technique to tune the parameters of the stacked sparse autoencoder. The optimization by the PSO improves the feature learning and classification performance of the SSAE. Meanwhile, the multilayer architecture of autoencoders usually leads to internal covariate shift, a problem that affects the generalization ability of the network; hence, batch normalization is introduced to prevent this problem. The experimental results show that the proposed method effectively predicts heart disease by obtaining a classification accuracy of 0.973 and 0.961 on the Framingham and Cleveland heart disease datasets, respectively, thereby outperforming other machine learning methods and similar studies.
Collapse
|
14
|
Koumetio Tekouabou SC, Diop EB, Azmi R, Jaligot R, Chenal J. Reviewing the application of machine learning methods to model urban form indicators in planning decision support systems: Potential, issues and challenges. JOURNAL OF KING SAUD UNIVERSITY - COMPUTER AND INFORMATION SCIENCES 2021. [DOI: 10.1016/j.jksuci.2021.08.007] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
|
15
|
Abstract
The severe spread of the COVID-19 pandemic has created a situation of public health emergency and global awareness. In our research, we analyzed the demographical factors affecting the global pandemic spread along with the features that lead to death due to the infection. Modeling results stipulate that the mortality rate increase as the age increase and it is found that most of the death cases belong to the age group 60–80. Cluster-based analysis of age groups is also conducted to analyze the maximum targeted age-groups. An association between positive COVID-19 cases and deceased cases are also presented, with the impact on male and female death cases due to corona. Additionally, we have also presented an artificial intelligence-based statistical approach to predict the survival chances of corona infected people in South Korea with the analysis of the impact on the exploratory factors, including age-groups, gender, temporal evolution, etc. To analyze the coronavirus cases, we applied machine learning with hyperparameters tuning and deep learning models with an autoencoder-based approach for estimating the influence of the disparate features on the spread of the disease and predict the survival possibilities of the quarantined patients in isolation. The model calibrated in the study is based on positive corona infection cases and presents the analysis over different aspects that proven to be impactful to analyze the temporal trends in the current situation along with the exploration of deceased cases due to coronavirus. Analysis delineates key points in the outbreak spreading, indicating that the models driven by machine intelligence and deep learning can be effective in providing a quantitative view of the epidemical outbreak.
Collapse
|
16
|
Ray A, Chaudhuri AK. Smart healthcare disease diagnosis and patient management: Innovation, improvement and skill development. MACHINE LEARNING WITH APPLICATIONS 2021. [DOI: 10.1016/j.mlwa.2020.100011] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
|
17
|
Mehmood A, Iqbal M, Mehmood Z, Irtaza A, Nawaz M, Nazir T, Masood M. Prediction of Heart Disease Using Deep Convolutional Neural Networks. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2021. [DOI: 10.1007/s13369-020-05105-1] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
|
18
|
Performance analysis of cost-sensitive learning methods with application to imbalanced medical data. INFORMATICS IN MEDICINE UNLOCKED 2021. [DOI: 10.1016/j.imu.2021.100690] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
|
19
|
Kumar PR, Ravichandran S, Narayana S. Ensemble classification technique for heart disease prediction with meta-heuristic-enabled training system. BIO-ALGORITHMS AND MED-SYSTEMS 2020. [DOI: 10.1515/bams-2020-0033] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Abstract
Objectives
This research work exclusively aims to develop a novel heart disease prediction framework including three major phases, namely proposed feature extraction, dimensionality reduction, and proposed ensemble-based classification.
Methods
As the novelty, the training of NN is carried out by a new enhanced optimization algorithm referred to as Sea Lion with Canberra Distance (S-CDF) via tuning the optimal weights. The improved S-CDF algorithm is the extended version of the existing “Sea Lion Optimization (SLnO)”. Initially, the statistical and higher-order statistical features are extracted including central tendency, degree of dispersion, and qualitative variation, respectively. However, in this scenario, the “curse of dimensionality” seems to be the greatest issue, such that there is a necessity of dimensionality reduction in the extracted features. Hence, the principal component analysis (PCA)-based feature reduction approach is deployed here. Finally, the dimensional concentrated features are fed as the input to the proposed ensemble technique with “Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN)” with optimized Neural Network (NN) as the final classifier.
Results
An elaborative analyses as well as discussion have been provided by concerning the parameters, like evaluation metrics, year of publication, accuracy, implementation tool, and utilized datasets obtained by various techniques.
Conclusions
From the experiment outcomes, it is proved that the accuracy of the proposed work with the proposed feature set is 5, 42.85, and 10% superior to the performance with other feature sets like central tendency + dispersion feature, central tendency qualitative variation, and dispersion qualitative variation, respectively.
Results
Finally, the comparative evaluation shows that the presented work is appropriate for heart disease prediction as it has high accuracy than the traditional works.
Collapse
Affiliation(s)
- Parvathaneni Rajendra Kumar
- Department of Information Technology, Faculty of Engineering and Technology , Annamalai University Annamalainagar - 608002 , Tamil Nadu , India
| | - Suban Ravichandran
- Department of Information Technology, Faculty of Engineering and Technology , Annamalai University Annamalainagar - 608002 , Tamil Nadu , India
| | | |
Collapse
|
20
|
Integrating Enhanced Sparse Autoencoder-Based Artificial Neural Network Technique and Softmax Regression for Medical Diagnosis. ELECTRONICS 2020. [DOI: 10.3390/electronics9111963] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
In recent times, several machine learning models have been built to aid in the prediction of diverse diseases and to minimize diagnostic errors made by clinicians. However, since most medical datasets seem to be imbalanced, conventional machine learning algorithms tend to underperform when trained with such data, especially in the prediction of the minority class. To address this challenge and proffer a robust model for the prediction of diseases, this paper introduces an approach that comprises of feature learning and classification stages that integrate an enhanced sparse autoencoder (SAE) and Softmax regression, respectively. In the SAE network, sparsity is achieved by penalizing the weights of the network, unlike conventional SAEs that penalize the activations within the hidden layers. For the classification task, the Softmax classifier is further optimized to achieve excellent performance. Hence, the proposed approach has the advantage of effective feature learning and robust classification performance. When employed for the prediction of three diseases, the proposed method obtained test accuracies of 98%, 97%, and 91% for chronic kidney disease, cervical cancer, and heart disease, respectively, which shows superior performance compared to other machine learning algorithms. The proposed approach also achieves comparable performance with other methods available in the recent literature.
Collapse
|
21
|
Shiny Irene D, Sethukarasi T, Vadivelan N. Heart disease prediction using hybrid fuzzy K-medoids attribute weighting method with DBN-KELM based regression model. Med Hypotheses 2020; 143:110072. [PMID: 32721791 DOI: 10.1016/j.mehy.2020.110072] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Revised: 06/20/2020] [Accepted: 06/30/2020] [Indexed: 02/05/2023]
Abstract
Automated prediction can be offered for further treatment to make effective and relieve the difficulties in the diagnosis of heart condition of patient. In this paper, a hybrid method is proposed combining FKMAW and DBNKELM based ensemble method to enhance medical diagnosis process. Firstly, the input attributes are weighed using a fuzzy k-medoids clustering based attribute weighting (FKMAW) method. Subsequently, the medical data classification performance is improved by applying the weighing method and the linearly separable dataset is obtained with the transformation of non-linearly separable dataset. With the weighted attributes, a regression model based heart disease prediction scheme is proposed combining Deep belief Network and Extreme learning machine (DBNKELM), in which Extreme learning machine is the top layer of the deep belief network to work as a regression model. The results demonstrate that FKMAW + DBNKELM achieved good performance in rectifying the problems in medical data classification for all the six datasets.
Collapse
Affiliation(s)
- D Shiny Irene
- Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Chennai, India.
| | - T Sethukarasi
- Department of Computer Science and Engineering, R.M.K. Engineering College, Chennai, India
| | - N Vadivelan
- Department of Computer Science and Engineering, Teegala Krishna Reddy Engineering College, Meerpet, Hyderabad, India
| |
Collapse
|