1
|
Ibrahim M, Beneyto A, Contreras I, Vehi J. An ensemble machine learning approach for the detection of unannounced meals to enhance postprandial glucose control. Comput Biol Med 2024; 171:108154. [PMID: 38382387 DOI: 10.1016/j.compbiomed.2024.108154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 02/02/2024] [Accepted: 02/12/2024] [Indexed: 02/23/2024]
Abstract
BACKGROUND Hybrid automated insulin delivery systems enhance postprandial glucose control in type 1 diabetes, however, meal announcements are burdensome. To overcome this, we propose a machine learning-based automated meal detection approach; METHODS:: A heterogeneous ensemble method combining an artificial neural network, random forest, and logistic regression was employed. Trained and tested on data from two in-silico cohorts comprising 20 and 47 patients. It accounted for various meal sizes (moderate to high) and glucose appearance rates (slow and rapid absorbing). To produce an optimal prediction model, three ensemble configurations were used: logical AND, majority voting, and logical OR. In addition to the in-silico data, the proposed meal detector was also trained and tested using the OhioT1DM dataset. Finally, the meal detector is combined with a bolus insulin compensation scheme; RESULTS:: The ensemble majority voting obtained the best meal detector results for both the in-silico and OhioT1DM cohorts with a sensitivity of 77%, 94%, 61%, precision of 96%, 89%, 72%, F1-score of 85%, 91%, 66%, and with false positives per day values of 0.05, 0.19, 0.17, respectively. Automatic meal detection with insulin compensation has been performed in open-loop insulin therapy using the AND ensemble, chosen for its lower false positive rate. Time-in-range has significantly increased 10.48% and 16.03%, time above range was reduced by 5.16% and 11.85%, with a minimal time below range increase of 0.35% and 2.69% for both in-silico cohorts, respectively, compared to the results without a meal detector; CONCLUSION:: To increase the overall accuracy and robustness of the predictions, this ensemble methodology aims to take advantage of each base model's strengths. All of the results point to the potential application of the proposed meal detector as a separate module for the detection of meals in automated insulin delivery systems to achieve improved glycemic control.
Collapse
Affiliation(s)
- Muhammad Ibrahim
- Modeling, Identification and Control Engineering Laboratory (MICELab), Institut d'Informàtica i Aplicacions, Universitat de Girona, Girona, Spain
| | - Aleix Beneyto
- Modeling, Identification and Control Engineering Laboratory (MICELab), Institut d'Informàtica i Aplicacions, Universitat de Girona, Girona, Spain
| | - Ivan Contreras
- Modeling, Identification and Control Engineering Laboratory (MICELab), Institut d'Informàtica i Aplicacions, Universitat de Girona, Girona, Spain
| | - Josep Vehi
- Modeling, Identification and Control Engineering Laboratory (MICELab), Institut d'Informàtica i Aplicacions, Universitat de Girona, Girona, Spain; Centro de Investigación Biomédica en Red de Diabetes y Enfermedades Metabólicas Asociadas (CIBERDEM), Madrid, Spain.
| |
Collapse
|
2
|
Sulaiman R, Azeman NH, Mokhtar MHH, Mobarak NN, Abu Bakar MH, Bakar AAA. Hybrid ensemble-based machine learning model for predicting phosphorus concentrations in hydroponic solution. Spectrochim Acta A Mol Biomol Spectrosc 2024; 304:123327. [PMID: 37708761 DOI: 10.1016/j.saa.2023.123327] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/13/2023] [Revised: 08/08/2023] [Accepted: 08/31/2023] [Indexed: 09/16/2023]
Abstract
Accurate, label-free, and rapid methods for measuring phosphorus concentrations are essential in a hydroponic system, as excessive or insufficient phosphorus levels can adversely affect plant growth, human health, and environmental sustainability. In this study, we demonstrate the advantages of hybrid machine learning models compared to single machine learning models in predicting phosphorus concentration based on the absorbance dataset. Three machine learning classifiers- Random Forest (RF), Support Vector Machine (SVM), and K-Nearest Neighbors (KNN)- were employed as bases for single and hybrid machine learning models. Three ensemble techniques (voting, bagging, and stacking) were used to hybridize the classifiers. Among the single models, KNN demonstrated the fastest computational time of 18.07 s, while SVM achieved the highest accuracy of 99.6%. The hybrid SVM/KNN model using a voting classifier showed a significant increase in accuracy for KNN with only a slight increase in computational time. Bagging techniques increased the accuracy but at a longer computational time. The stacking technique, which combined SVM, KNN, and RF, achieved the highest accuracy of 99.73% with a short computational time of 36.18 s compared to the bagging and voting technique. This study demonstrates that the machine learning method can effectively distinguish phosphorus concentrations. In contrast, hybrid machine learning techniques can improve accuracy for predicting phosphorus without using labels, despite requiring longer computational time.
Collapse
Affiliation(s)
- Rozita Sulaiman
- Photonics Technology Laboratory, Department of Electrical, Electronic, and Systems Engineering, Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia, Malaysia.
| | - Nur Hidayah Azeman
- Institute of Microengineering and Nanoelectronics (IMEN), Universiti Kebangsaan Malaysia, Malaysia.
| | - Mohd Hadri Hafiz Mokhtar
- Photonics Technology Laboratory, Department of Electrical, Electronic, and Systems Engineering, Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia, Malaysia
| | - Nadhratun Naiim Mobarak
- Department of Chemical Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, Malaysia
| | - Mohd Hafiz Abu Bakar
- Photonics Technology Laboratory, Department of Electrical, Electronic, and Systems Engineering, Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia, Malaysia
| | - Ahmad Ashrif A Bakar
- Photonics Technology Laboratory, Department of Electrical, Electronic, and Systems Engineering, Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia, Malaysia; Institute of Islam Hadhari, Universiti Kebangsaan Malaysia, Malaysia.
| |
Collapse
|
3
|
Nambi Narayanan S, Subbian S. HH model based smart deep brain stimulator to detect, predict and control epilepsy using machine learning algorithm. J Neurosci Methods 2023; 389:109825. [PMID: 36822276 DOI: 10.1016/j.jneumeth.2023.109825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2023] [Revised: 02/15/2023] [Accepted: 02/20/2023] [Indexed: 02/23/2023]
Abstract
BACKGROUND Epilepsy is the most common neurological disorder in the world. To control epilepsy, deep brain stimulation is one of the widely accepted treatment techniques. However, conventional deep brain stimulation technique provides continuous stimulation without optimizing the stimulation parameters, resulting in adverse side effects and unexpected death. Hence, understanding the dynamic behavior of brain neural networks at a cellular level is required for patient-specific epilepsy treatment. Considering the underlying mechanism of a single neuronal shift in the brain neural network, computational model-based techniques have a new face for healthcare, which aims to develop effective medical devices for preclinical investigations. NEW METHOD This paper discusses the design of a Smart Deep Brain Stimulator (SDBS) using the Hodgkin-Huxley (HH) conductance-based cellular model of brain neurons to automatically detect, predict and regulate epilepsy against patient-specific conditions. Epileptic activity is simulated as a spike train of action potential due to sodium and potassium channel conductance variations in the single-neuron HH model. The proposed SDBS consists of three components:- i) seizure detection using bagging and boosting-based ensemble machine learning classifiers, ii) channel conductance prediction using Long Short Term Memory-Recurrent Neural Network (LSTM-RNN) based Deep Neural Network (DNN) for updating model parameters of brain neuron, and iii) model-based intelligent control of epileptic seizure with Nonlinear Autoregressive Moving Average-L2 (NARMA-L2) Controller and Nonlinear Model Predictive Controller (NMPC). RESULTS For effective treatment, improving the overall accuracy and efficiency of SDBS is essential. For epilepsy detection, the ensemble bagging machine learning algorithm provides better accuracy of 92.7% compared to the ensemble boosting algorithm. LSTM-RNN deep neural network model with four layers predicts the variations in channel conductance with Root Mean Square Error (RMSE) of 0.00568 and 0.009081 for sodium and potassium channel conductance, respectively. From the closed-loop performances of SDBS with an intelligent control scheme, it is observed that SDBS with NMPC provides efficient and accurate stimulation with minimum energy consumption. From a stability point of view, SDBS with NMPC provides better stability than SDBS with NARMA-L2 Controller. COMPARISON WITH EXISTING METHOD The proposed SDBS is designed to generate accurate stimulation pulses for epilepsy patients with specific conditions depending on the neuronal activity of a single neuron. Moreover, it will also adapt to the dynamic condition of epilepsy patients. The existing deep brain stimulator continuously provides stimulation pulses without adapting to the patient's conditions. CONCLUSION The proposed SDBS could provide patient-specific treatment based on sodium/potassium channel conductance variations of brain neurons. It will help increase the use of deep brain stimulation techniques and reduce sudden death. Furthermore, the proposed technique will be extended to neural network models with larger neuronal populations to improve the practical feasibility.
Collapse
Affiliation(s)
- S Nambi Narayanan
- Department of Instrumentation Engg, MIT Campus, Anna University, Chennai 44, Tamilnadu, India.
| | - Sutha Subbian
- Department of Instrumentation Engg, MIT Campus, Anna University, Chennai 44, Tamilnadu, India
| |
Collapse
|
4
|
Rai HM, Chatterjee K, Dashkevych S. The prediction of cardiac abnormality and enhancement in minority class accuracy from imbalanced ECG signals using modified deep neural network models. Comput Biol Med 2022; 150:106142. [PMID: 36182760 DOI: 10.1016/j.compbiomed.2022.106142] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 09/04/2022] [Accepted: 09/18/2022] [Indexed: 11/03/2022]
Abstract
Cardiovascular disease (CVD) is the most fatal disease in the world, so its accurate and automated detection in the early stages will certainly support the medical expert in timely diagnosis and treatment, which can save many lives. Many types of research have been carried out in this regard, but due to the problem of data imbalance in the medical and health care sector, it may not provide the desired results in all aspects. To overcome this problem, a sequential ensemble technique has been proposed that detects 6 types of cardiac arrhythmias on large ECG imbalanced datasets, and the data imbalanced issue of the ECG dataset has been addressed by using a hybrid data resampling technique called "Synthetically Minority Oversampling Technique and Tomek Link (SMOTE + Tomek)". The sequential ensemble technique employs two distinct deep learning models: Convolutional Neural Network (CNN) and a hybrid model, CNN with Long Short-Term Memory Network (CNN-LSTM). The two standard datasets "MIT-BIH arrhythmias database" (MITDB) and "PTB diagnostic database" (PTBDB) are combined and extracted 23, 998 ECG beats for the model validation. In this work, the three models CNN, CNN-LSTM, and ensemble approach were tested on four kinds of ECG datasets: the original data (imbalanced), the data sampled using a random oversampled technique, data sampled using SMOTE, and the dataset resampled using SMOTE + Tomek algorithm. The overall highest accuracy was obtained of 99.02% on the SMOTE + Tomek sampled dataset by ensemble technique and the minority class accuracy result (Recall) is improved by 20% as compared to the imbalanced data.
Collapse
Affiliation(s)
- Hari Mohan Rai
- Department of Electrical Engineering, Indian Institute of Technology(ISM), Dhanbad, India; Department of Electronics and Communication Engineering, Dronacharya Group of Institutions, Greater Noida, India.
| | - Kalyan Chatterjee
- Department of Electrical Engineering, Indian Institute of Technology(ISM), Dhanbad, India.
| | - Serhii Dashkevych
- Data Scientist, Polsko-Japońska Akademia Technik Komputerowych, Koszykowa, Warszawa, Poland.
| |
Collapse
|
5
|
Harywanto GN, Veron JS, Suhartono D. A BERTweet-based design for monitoring behaviour change based on five doors theory on coral bleaching campaign. J Big Data 2022; 9:73. [PMID: 35669348 PMCID: PMC9153220 DOI: 10.1186/s40537-022-00615-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Accepted: 04/11/2022] [Indexed: 06/15/2023]
Abstract
Coral reefs are very important ecosystem which are the foundation of all life on this earth, but now they are under threat. Coral bleaching are happening now at a serious rate and the ultimate goal of conservation effort toward this issue is behaviour change. One of the most important parts of conservation effort is monitoring. However, monitoring the success of the coral bleaching campaign on behaviour change requires extensive data collection so traditional methods are not effective because they require resources that may not be met. The goal of this study is to build fast and vast automation in analyzing the stage of behaviour change. Social media data has prospect to become good alternative to be used because social media usage is currently increasing every year, including Twitter. Therefore, an automatic classification model was designed which can identify the stages of behaviour change based on the Five Doors Theory on Twitter. Five Doors Theory define 5 stages of behavior change: Desirability, Enabling Context, Can Do, Buzz, and Invitation. The data was fetched through a trusted repository, Mendeley Data, with title "An Annotated Dataset for Identifying Behaviour Change Based on Five Doors Theory Under Coral Bleaching Phenomenon on Twitter". There are 1,222 tweets with keywords related to coral bleaching that have been annotated according to the behaviour change stages. There are two proposed designs: embedding extraction which utilizes the output of each encoder layer in BERTweet and stacking ensemble which uses several BERTweet models with different hyperparameters that are ensembled using a logistic regression model. The best accuracy of 0.7796 with an f1-score of 0.7945 was obtained in the stacking ensemble design scenario. The classification model created can identify each class at the stage of behaviour change well, even though the dataset is unbalanced in its distribution. The proposed design has a performance that exceeds all baseline models and the standalone BERTweet. In conclusion, the automatic classification model create the process of monitoring the stages of behavior change run effectively and efficiently so that the success of the coral bleaching campaign can be monitored and achieved.
Collapse
Affiliation(s)
- Gabriela Nathania Harywanto
- Computer Science Department, School of Computer Science, Bina Nusantara University, Jakarta, 11480 Indonesia
| | - Juan Sebastian Veron
- Computer Science Department, School of Computer Science, Bina Nusantara University, Jakarta, 11480 Indonesia
| | - Derwin Suhartono
- Computer Science Department, School of Computer Science, Bina Nusantara University, Jakarta, 11480 Indonesia
| |
Collapse
|
6
|
Rahman MS, Abdel-Aty M, Hasan S, Cai Q. Applying machine learning approaches to analyze the vulnerable road-users' crashes at statewide traffic analysis zones. J Safety Res 2019; 70:275-288. [PMID: 31848006 DOI: 10.1016/j.jsr.2019.04.008] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/08/2018] [Revised: 03/30/2019] [Accepted: 04/16/2019] [Indexed: 06/10/2023]
Abstract
INTRODUCTION In this paper, we present machine learning techniques to analyze pedestrian and bicycle crash by developing macro-level crash prediction models. METHODS We collected the 2010-2012 Statewide Traffic Analysis Zone (STAZ) level crash data and developed rigorous machine learning approach (i.e., decision tree regression (DTR) models) for both pedestrian and bicycle crash counts. To our knowledge, this is the first application of DTR models in the burgeoning macro-level traffic safety literature. RESULTS The DTR models uncovered the most significant predictor variables for both response variables (pedestrian and bicycle crash counts) in terms of three broad categories: traffic, roadway, and socio-demographic characteristics. Additionally, spatial predictor variables of neighboring STAZs were considered along with the targeted STAZ in both DTR models. The DTR model considering spatial predictor variables (spatial DTR model) were compared without considering spatial predictor variables (aspatial DTR model) and the model comparison results discovered that the prediction accuracy of the spatial DTR model performed better than the aspatial DTR model. Finally, the current research effort contributed to the safety literature by applying some ensemble techniques (i.e. bagging, random forest, and gradient boosting) in order to improve the prediction accuracy of the DTR models (weak learner) for macro-level crash count. The study revealed that all the ensemble techniques performed slightly better than the DTR model and the gradient boosting technique outperformed other competing ensemble techniques in macro-level crash prediction models.
Collapse
Affiliation(s)
- Md Sharikur Rahman
- Department of Civil, Environmental and Construction Engineering, University of Central Florida, Orlando, FL 32816, USA.
| | - Mohamed Abdel-Aty
- Department of Civil, Environmental and Construction Engineering, University of Central Florida, Orlando, FL 32816, USA
| | - Samiul Hasan
- Department of Civil, Environmental and Construction Engineering, University of Central Florida, Orlando, FL 32816, USA
| | - Qing Cai
- Department of Civil, Environmental and Construction Engineering, University of Central Florida, Orlando, FL 32816, USA
| |
Collapse
|
7
|
Bashir S, Qamar U, Khan FH. IntelliHealth: A medical decision support application using a novel weighted multi-layer classifier ensemble framework. J Biomed Inform 2015; 59:185-200. [PMID: 26703093 DOI: 10.1016/j.jbi.2015.12.001] [Citation(s) in RCA: 82] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2015] [Revised: 11/01/2015] [Accepted: 12/06/2015] [Indexed: 11/30/2022]
Abstract
Accuracy plays a vital role in the medical field as it concerns with the life of an individual. Extensive research has been conducted on disease classification and prediction using machine learning techniques. However, there is no agreement on which classifier produces the best results. A specific classifier may be better than others for a specific dataset, but another classifier could perform better for some other dataset. Ensemble of classifiers has been proved to be an effective way to improve classification accuracy. In this research we present an ensemble framework with multi-layer classification using enhanced bagging and optimized weighting. The proposed model called "HM-BagMoov" overcomes the limitations of conventional performance bottlenecks by utilizing an ensemble of seven heterogeneous classifiers. The framework is evaluated on five different heart disease datasets, four breast cancer datasets, two diabetes datasets, two liver disease datasets and one hepatitis dataset obtained from public repositories. The analysis of the results show that ensemble framework achieved the highest accuracy, sensitivity and F-Measure when compared with individual classifiers for all the diseases. In addition to this, the ensemble framework also achieved the highest accuracy when compared with the state of the art techniques. An application named "IntelliHealth" is also developed based on proposed model that may be used by hospitals/doctors for diagnostic advice.
Collapse
Affiliation(s)
- Saba Bashir
- Computer Engineering Department, College of Electrical and Mechanical Engineering, National University of Sciences and Technology (NUST), Islamabad 44000, Pakistan.
| | - Usman Qamar
- Computer Engineering Department, College of Electrical and Mechanical Engineering, National University of Sciences and Technology (NUST), Islamabad 44000, Pakistan.
| | - Farhan Hassan Khan
- Computer Engineering Department, College of Electrical and Mechanical Engineering, National University of Sciences and Technology (NUST), Islamabad 44000, Pakistan.
| |
Collapse
|