1
|
He H, Zhao H, Li L, Yang H, Yan J, Yuan Y, Hu X, Zhang Y. Non-experimental rapid identification of lower respiratory tract infections in patients with chronic obstructive pulmonary disease using multi-label learning. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2025; 261:108618. [PMID: 39913996 DOI: 10.1016/j.cmpb.2025.108618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2024] [Revised: 12/08/2024] [Accepted: 01/24/2025] [Indexed: 02/21/2025]
Abstract
BACKGROUND AND OBJECTIVE Microbiological culture is a standard diagnostic test that takes a long time to identify lower respiratory tract infections (LRTI) in patients with chronic obstructive pulmonary disease (COPD). This study entailed the development of an interactive decision-support system using multi-label machine learning. It is designed to assist clinical medical staff in the rapid and simultaneous diagnosis of various infections in these patients. METHODS Clinical health record data were collected from inpatients with COPD suspected of having a LRTI. Two major categories of multi-label learning frameworks were integrated with various machine learning algorithms to create 23 predictive models to identify four categories of infection: fungal, gram-negative bacterial, gram-positive bacterial, and multidrug-resistant organism infections. The predictive power of the individual models was tested. Subsequently, the model with the highest comprehensive performance was selected and integrated with SHAP technology to construct a decision support system. RESULTS Three-thousand-eight-hundred-one subjects participated in this study. LP-RF recorded the highest overall performance, with a Hamming loss of 0.158 (95 %CI: 0.157-0.159) and a samples-precision of 0.894 (95 %CI: 0.891-0.896). The developed diagnostic decision support system generates predicted probability output for each infection category in a specific patient and displays the interpreted output results. CONCLUSION The developed multi-label decision support system enables effective prediction of four categories of infections in patients with a history of COPD, and has the potential to curb the overuse of antimicrobial drugs. This system is highly explainable and interactive, providing real-time support in the simultaneous diagnosis of multiple infection categories.
Collapse
Affiliation(s)
- Hangzhi He
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi Province 030001, China
| | - Hui Zhao
- Department of Respiratory and Critical Care Medicine, The Second Hospital of Shanxi Medical University, Taiyuan, Shanxi Province 030001, China
| | - Lifang Li
- Department of Respiratory and Critical Care Medicine, The Second Hospital of Shanxi Medical University, Taiyuan, Shanxi Province 030001, China
| | - Hong Yang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi Province 030001, China
| | - Jingjing Yan
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi Province 030001, China
| | - Yiwei Yuan
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi Province 030001, China
| | - Xiangwen Hu
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi Province 030001, China
| | - Yanbo Zhang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi Province 030001, China; Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Shanxi Medical University, Taiyuan, Shanxi Province 030001, China.
| |
Collapse
|
2
|
Polessa Paula D, Barbosa Aguiar O, Pruner Marques L, Bensenor I, Suemoto CK, Mendes da Fonseca MDJ, Griep RH. Comparing machine learning algorithms for multimorbidity prediction: An example from the Elsa-Brasil study. PLoS One 2022; 17:e0275619. [PMID: 36206287 PMCID: PMC9543987 DOI: 10.1371/journal.pone.0275619] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Accepted: 09/20/2022] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Multimorbidity is a worldwide concern related to greater disability, worse quality of life, and mortality. The early prediction is crucial for preventive strategies design and integrative medical practice. However, knowledge about how to predict multimorbidity is limited, possibly due to the complexity involved in predicting multiple chronic diseases. METHODS In this study, we present the use of a machine learning approach to build cost-effective multimorbidity prediction models. Based on predictors easily obtainable in clinical practice (sociodemographic, clinical, family disease history and lifestyle), we build and compared the performance of seven multilabel classifiers (multivariate random forest, and classifier chain, binary relevance and binary dependence, with random forest and support vector machine as base classifiers), using a sample of 15105 participants from the Brazilian Longitudinal Study of Adult Health (ELSA-Brasil). We developed a web application for the building and use of prediction models. RESULTS Classifier chain with random forest as base classifier performed better (accuracy = 0.34, subset accuracy = 0.15, and Hamming Loss = 0.16). For different feature sets, random forest based classifiers outperformed those based on support vector machine. BMI, blood pressure, sex, and age were the features most relevant to multimorbidity prediction. CONCLUSIONS Our results support the choice of random forest based classifiers for multimorbidity prediction.
Collapse
Affiliation(s)
- Daniela Polessa Paula
- National School of Statistical Sciences, Brazilian Institute of Geography and Statistics, Rio de Janeiro, Brazil
- * E-mail: ,
| | | | - Larissa Pruner Marques
- National School of Public Health, Oswaldo Cruz Foundation, Rio de Janeiro, Rio de Janeiro, Brazil
| | - Isabela Bensenor
- Department of Internal Medicine, Faculdade de Medicina da Universidade de São Paulo & Hospital Universitário, Universidade de São Paulo, São Paulo, Brazil
| | - Claudia Kimie Suemoto
- Division of Geriatrics, Department of Clinical Medicine, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Brazil
| | | | - Rosane Härter Griep
- Health and Environmental Education Laboratory, Oswaldo Cruz Institute (IOC), Rio de Janeiro, Brazil
| |
Collapse
|
3
|
Identification and Prediction of Chronic Diseases Using Machine Learning Approach. JOURNAL OF HEALTHCARE ENGINEERING 2022; 2022:2826127. [PMID: 35251563 PMCID: PMC8896926 DOI: 10.1155/2022/2826127] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Revised: 02/01/2022] [Accepted: 02/07/2022] [Indexed: 01/01/2023]
Abstract
Nowadays, humans face various diseases due to the current environmental condition and their living habits. The identification and prediction of such diseases at their earlier stages are much important, so as to prevent the extremity of it. It is difficult for doctors to manually identify the diseases accurately most of the time. The goal of this paper is to identify and predict the patients with more common chronic illnesses. This could be achieved by using a cutting-edge machine learning technique to ensure that this categorization reliably identifies persons with chronic diseases. The prediction of diseases is also a challenging task. Hence, data mining plays a critical role in disease prediction. The proposed system offers a broad disease prognosis based on patient's symptoms by using the machine learning algorithms such as convolutional neural network (CNN) for automatic feature extraction and disease prediction and K-nearest neighbor (KNN) for distance calculation to find the exact match in the data set and the final disease prediction outcome. A collection of disease symptoms has been performed for the preparation of the data set along with the person's living habits, and details related to doctor consultations are taken into account in this general disease prediction. Finally, a comparative study of the proposed system with various algorithms such as Naïve Bayes, decision tree, and logistic regression has been demonstrated in this paper.
Collapse
|
4
|
Luque C, Luna JM, Ventura S. A semantically enriched text mining system for clinical decision support. Comput Intell 2021. [DOI: 10.1111/coin.12322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Carmen Luque
- Knowledge Discovery and Intelligent Systems in Biomedicine Laboratory, Maimonides Biomedical Research Institute of Cordoba Córdoba Spain
| | - José M. Luna
- Knowledge Discovery and Intelligent Systems in Biomedicine Laboratory, Maimonides Biomedical Research Institute of Cordoba Córdoba Spain
- Department of Computer Science and Numerical Analysis, University of Córdoba Córdoba Spain
| | - Sebastián Ventura
- Knowledge Discovery and Intelligent Systems in Biomedicine Laboratory, Maimonides Biomedical Research Institute of Cordoba Córdoba Spain
- Department of Computer Science and Numerical Analysis, University of Córdoba Córdoba Spain
- Faculty of Computing and Information Technology, King Abdulaziz University Jeddah Saudi Arabia
| |
Collapse
|
5
|
Yuan M, Ren J. Numerical Feature Transformation-based Sequence Generation Model for Multi-disease Diagnosis. INT J PATTERN RECOGN 2021. [DOI: 10.1142/s0218001421590345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The goal of computer-aided diagnosis is to predict patient’s diseases based on patient’s clinical data. The development of deep learning technology provides new help for clinical diagnosis. In this paper, we propose a new sequence generation model for multi-disease diagnosis prediction based on numerical feature transformation. Our model simultaneously uses patient’s laboratory test results and clinical text as input to diagnose and predict the disease that the patient may have. According to medical knowledge, our model can transform numerical features into descriptive text features, thereby enriching the semantic information of clinical texts. Besides, our model uses attention-based sequence generation methods to achieve the diagnosis of multiple diseases and better utilizes the correlation information between multiple diseases. We evaluate our model’s performance on a dataset of respiratory diseases from the real world, and experimental results show that our model’s accuracy reaches 42.75%, and the [Formula: see text] score reaches 65.65%, which is better than many other methods. It is suitable for the accurate diagnosis of multiple diseases.
Collapse
Affiliation(s)
- Ming Yuan
- School of Computer Science and Engineering, Guangdong Province Key Lab of Computational Science, Sun Yat-sen University Guangzhou, Guangdong, P. R. China
| | - Jiangtao Ren
- School of Computer Science and Engineering, Guangdong Province Key Lab of Computational Science, Sun Yat-sen University Guangzhou, Guangdong, P. R. China
| |
Collapse
|
6
|
Stemerman R, Arguello J, Brice J, Krishnamurthy A, Houston M, Kitzmiller R. Identification of social determinants of health using multi-label classification of electronic health record clinical notes. JAMIA Open 2021; 4:ooaa069. [PMID: 34514351 PMCID: PMC8423426 DOI: 10.1093/jamiaopen/ooaa069] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 11/16/2020] [Accepted: 11/20/2020] [Indexed: 11/13/2022] Open
Abstract
OBJECTIVES Social determinants of health (SDH), key contributors to health, are rarely systematically measured and collected in the electronic health record (EHR). We investigate how to leverage clinical notes using novel applications of multi-label learning (MLL) to classify SDH in mental health and substance use disorder patients who frequent the emergency department. METHODS AND MATERIALS We labeled a gold-standard corpus of EHR clinical note sentences (N = 4063) with 6 identified SDH-related domains recommended by the Institute of Medicine for inclusion in the EHR. We then trained 5 classification models: linear-Support Vector Machine, K-Nearest Neighbors, Random Forest, XGBoost, and bidirectional Long Short-Term Memory (BI-LSTM). We adopted 5 common evaluation measures: accuracy, average precision-recall (AP), area under the curve receiver operating characteristic (AUC-ROC), Hamming loss, and log loss to compare the performance of different methods for MLL classification using the F1 score as the primary evaluation metric. RESULTS Our results suggested that, overall, BI-LSTM outperformed the other classification models in terms of AUC-ROC (93.9), AP (0.76), and Hamming loss (0.12). The AUC-ROC values of MLL models of SDH related domains varied between (0.59-1.0). We found that 44.6% of our study population (N = 1119) had at least one positive documentation of SDH. DISCUSSION AND CONCLUSION The proposed approach of training an MLL model on an SDH rich data source can produce a high performing classifier using only unstructured clinical notes. We also provide evidence that model performance is associated with lexical diversity by health professionals and the auto-generation of clinical note sentences to document SDH.
Collapse
Affiliation(s)
- Rachel Stemerman
- Carolina Health Informatics Program, The University of North Carolina, Chapel Hill, North Carolina, USA
| | - Jaime Arguello
- School of Information and Library Sciences, The University of North Carolina, Chapel Hill, North Carolina, USA
| | - Jane Brice
- Department of Emergency Medicine, The University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA
| | - Ashok Krishnamurthy
- Department of Computer Science, The University of North Carolina, Chapel Hill, North Carolina, USA
| | - Mary Houston
- Department of Emergency Medicine, The University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA
| | - Rebecca Kitzmiller
- School of Nursing, The University of North Carolina, Chapel Hill, North Carolina, USA
| |
Collapse
|
7
|
Łazęcka M, Mielniczuk J, Teisseyre P. Estimating the class prior for positive and unlabelled data via logistic regression. ADV DATA ANAL CLASSI 2021. [DOI: 10.1007/s11634-021-00444-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
AbstractIn the paper, we revisit the problem of class prior probability estimation with positive and unlabelled data gathered in a single-sample scenario. The task is important as it is known that in positive unlabelled setting, a classifier can be successfully learned if the class prior is available. We show that without additional assumptions, class prior probability is not identifiable and thus the existing non-parametric estimators are necessarily biased in general if extra assumptions are not imposed. The magnitude of their bias is also investigated. The problem becomes identifiable when the probabilistic structure satisfies mild semi-parametric assumptions. Consequently, we propose a method based on a logistic fit and a concave minorization of its (non-concave) log-likelihood. The experiments conducted on artificial and benchmark datasets as well as on a large clinical database MIMIC indicate that the estimation errors for the proposed method are usually lower than for its competitors and that it is robust against departures from logistic settings.
Collapse
|
8
|
Qiu L, Gorantla S, Rajan V, Tan BCY. Multi-disease Predictive Analytics: A Clinical Knowledge-aware Approach. ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS 2021. [DOI: 10.1145/3447942] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
Multi-Disease Predictive Analytics (MDPA) models simultaneously predict the risks of multiple diseases in patients and are valuable in early diagnoses. Patients tend to have multiple diseases simultaneously or develop multiple complications over time, and MDPA models can learn and effectively utilize such correlations between diseases. Data from large-scale Electronic Health Records (EHR) can be used through Multi-Label Learning (MLL) methods to develop MDPA models. However, data-driven approaches for MDPA face the challenge of data imbalance, because rare diseases tend to have much less data than common diseases. Insufficient data for rare diseases makes it difficult to leverage correlations with other diseases. These correlations are studied and recorded in biomedical literature but are rarely utilized in predictive analytics. This article presents a novel method called Knowledge-Aware Approach (KAA) that learns clinical correlations from the rapidly growing body of clinical knowledge. KAA can be combined with any data-driven MLL model for MDPA to refine the predictions of the model. Our extensive experiments, on real EHR data, show that the use of KAA improves the predictive performance of commonly used MDPA models, particularly for rare diseases. KAA is also found to be superior to existing general approaches of combining clinical knowledge with data-driven models. Further, a counterfactual analysis shows the efficacy of KAA in improving physicians’ ability to prescribe preventive treatments.
Collapse
Affiliation(s)
- Lin Qiu
- Department of Information Systems and Analytics, National University of Singapore, Singapore
| | - Sruthi Gorantla
- Department of Computer Science, National University of Singapore, Karnataka, Bangalore, India
| | - Vaibhav Rajan
- Department of Information Systems and Analytics, National University of Singapore, Singapore
| | - Bernard C. Y. Tan
- Department of Information Systems and Analytics, National University of Singapore, Singapore
| |
Collapse
|
9
|
|
10
|
An empirical analysis of binary transformation strategies and base algorithms for multi-label learning. Mach Learn 2020. [DOI: 10.1007/s10994-020-05879-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
11
|
Multi-label learning for crop leaf diseases recognition and severity estimation based on convolutional neural networks. Soft comput 2020. [DOI: 10.1007/s00500-020-04866-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
12
|
A classification model for prediction of clinical severity level using qSOFA medical score. INFORMATION DISCOVERY AND DELIVERY 2020. [DOI: 10.1108/idd-02-2019-0013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
The purpose of this study is to develop an efficient prediction model using vital signs and standard medical score systems, which predicts the clinical severity level of the patient in advance based on the quick sequential organ failure assessment (qSOFA) medical score method.
Design/methodology/approach
To predict the clinical severity level of the patient in advance, the authors have formulated a training dataset that is constructed based on the qSOFA medical score method. Further, along with the multiple vital signs, different standard medical scores and their correlation features are used to build and improve the accuracy of the prediction model. It is made sure that the constructed training set is suitable for the severity level prediction because the formulated dataset has different clusters each corresponding to different severity levels according to qSOFA score.
Findings
From the experimental result, it is found that the inclusion of the standard medical scores and their correlation along with multiple vital signs improves the accuracy of the clinical severity level prediction model. In addition, the authors showed that the training dataset formulated from the temporal data (which includes vital signs and medical scores) based on the qSOFA medical scoring system has the clusters which correspond to each severity level in qSOFA score. Finally, it is found that RAndom k-labELsets multi-label classification performs better prediction of severity level compared to neural network-based multi-label classification.
Originality/value
This paper helps in identifying patient' clinical status.
Collapse
|
13
|
Shanmathi N, Jagannath M. Computerised Decision Support System for Remote Health Monitoring: A Systematic Review. Ing Rech Biomed 2018. [DOI: 10.1016/j.irbm.2018.09.007] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
14
|
Wosiak A, Glinka K, Zakrzewska D. Multi-label classification methods for improving comorbidities identification. Comput Biol Med 2018; 100:279-288. [PMID: 28705417 DOI: 10.1016/j.compbiomed.2017.07.006] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2017] [Revised: 06/17/2017] [Accepted: 07/05/2017] [Indexed: 10/19/2022]
Abstract
The medical diagnostic process may be supported by computational classification techniques. In many cases, patients are affected by multiple illnesses, and more than one classification label is required to improve medical decision-making. In this paper, we consider a multi-perspective classification problem for medical diagnostics, where cases are described by labels from separate sets. We attempt to improve the identification of comorbidities using multi-label classification techniques. Several investigated methods, which provide label dependencies, are analysed and evaluated. The methods' performances are verified by experiments conducted on four sets of medical data from subject patients. The results were evaluated using several metrics and were statistically verified. We compare the effects of the techniques that do and do not consider label correlations. We demonstrate that multi-label classification methods from the first group outperform the techniques from the second one.
Collapse
Affiliation(s)
- A Wosiak
- Lodz University of Technology, Institute of Information Technology, Wólczańska, 215, Lodz, Poland.
| | - K Glinka
- Lodz University of Technology, Institute of Information Technology, Wólczańska, 215, Lodz, Poland
| | - D Zakrzewska
- Lodz University of Technology, Institute of Information Technology, Wólczańska, 215, Lodz, Poland
| |
Collapse
|
15
|
Guo J, Yuan X, Zheng X, Xu P, Xiao Y, Liu B. RETRACTED: Diagnosis labeling with disease-specific characteristics mining. Artif Intell Med 2018; 90:25-33. [PMID: 30076068 DOI: 10.1016/j.artmed.2018.06.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2017] [Revised: 06/15/2018] [Accepted: 06/26/2018] [Indexed: 11/18/2022]
Abstract
This article has been retracted: please see Elsevier Policy on Article Withdrawal (https://www.elsevier.com/about/our-business/policies/article-withdrawal).
This article has been retracted at the request of the authors; serious errors had been introduced within the data set presented and, as such, the authors have decided to retract the paper.
Error summary is outlined below:
1. The dataset was divided into training set, validation set and testing set. The training set and validation set were used to tune the parameter. According to the authors this is an overfitting and can mislead the readers.
2. The compared baselines are not strong enough for fair comparison. The code was not requested from the original authors, but algorithms have been implemented.
3. The bag of words model is not trained suitably. The model was trained with the used dataset and the performance has been evaluated on this dataset. Authors think this is an overfitting and makes the comparison untrustworthy.
Collapse
Affiliation(s)
- Jun Guo
- School of Information Science and Technology, Northwest University, Xian710127, PR China.
| | - Xuan Yuan
- School of Information Science and Technology, Northwest University, Xian710127, PR China
| | - Xia Zheng
- Department of Culture Heritage and Museology, Zhejiang University, Hangzhou310028, PR China
| | - Pengfei Xu
- School of Information Science and Technology, Northwest University, Xian710127, PR China
| | - Yun Xiao
- School of Information Science and Technology, Northwest University, Xian710127, PR China
| | - Baoying Liu
- School of Information Science and Technology, Northwest University, Xian710127, PR China.
| |
Collapse
|
16
|
Papanikolaou Y, Tsoumakas G, Katakis I. Hierarchical partitioning of the output space in multi-label data. DATA KNOWL ENG 2018. [DOI: 10.1016/j.datak.2018.05.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
17
|
Forkan ARM, Khalil I. A clinical decision-making mechanism for context-aware and patient-specific remote monitoring systems using the correlations of multiple vital signs. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2017; 139:1-16. [PMID: 28187881 DOI: 10.1016/j.cmpb.2016.10.018] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2016] [Revised: 08/11/2016] [Accepted: 10/18/2016] [Indexed: 06/06/2023]
Abstract
BACKGROUND AND OBJECTIVES In home-based context-aware monitoring patient's real-time data of multiple vital signs (e.g. heart rate, blood pressure) are continuously generated from wearable sensors. The changes in such vital parameters are highly correlated. They are also patient-centric and can be either recurrent or can fluctuate. The objective of this study is to develop an intelligent method for personalized monitoring and clinical decision support through early estimation of patient-specific vital sign values, and prediction of anomalies using the interrelation among multiple vital signs. METHODS In this paper, multi-label classification algorithms are applied in classifier design to forecast these values and related abnormalities. We proposed a completely new approach of patient-specific vital sign prediction system using their correlations. The developed technique can guide healthcare professionals to make accurate clinical decisions. Moreover, our model can support many patients with various clinical conditions concurrently by utilizing the power of cloud computing technology. The developed method also reduces the rate of false predictions in remote monitoring centres. RESULTS In the experimental settings, the statistical features and correlations of six vital signs are formulated as multi-label classification problem. Eight multi-label classification algorithms along with three fundamental machine learning algorithms are used and tested on a public dataset of 85 patients. Different multi-label classification evaluation measures such as Hamming score, F1-micro average, and accuracy are used for interpreting the prediction performance of patient-specific situation classifications. We achieved 90-95% Hamming score values across 24 classifier combinations for 85 different patients used in our experiment. The results are compared with single-label classifiers and without considering the correlations among the vitals. The comparisons show that multi-label method is the best technique for this problem domain. CONCLUSIONS The evaluation results reveal that multi-label classification techniques using the correlations among multiple vitals are effective ways for early estimation of future values of those vitals. In context-aware remote monitoring this process can greatly help the doctors in quick diagnostic decision making.
Collapse
Affiliation(s)
- Abdur Rahim Mohammad Forkan
- School of Science (Computer Science), RMIT University, Melbourne, Victoria 3001, Australia; National ICT Australia (NICTA), Australia.
| | - Ibrahim Khalil
- School of Science (Computer Science), RMIT University, Melbourne, Victoria 3001, Australia; National ICT Australia (NICTA), Australia
| |
Collapse
|