1
|
Kuo CY, Su ECY, Yeh HL, Yeh JH, Chiu HC, Chung CC. Predictive modeling and interpretative analysis of risks of instability in patients with Myasthenia Gravis requiring intensive care unit admission. Heliyon 2024; 10:e41084. [PMID: 39759343 PMCID: PMC11700255 DOI: 10.1016/j.heliyon.2024.e41084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Revised: 12/02/2024] [Accepted: 12/08/2024] [Indexed: 01/07/2025] Open
Abstract
Objective Myasthenia gravis (MG), a low-prevalence autoimmune disorder characterized by clinical heterogeneity and unpredictable disease fluctuations, presents significant risks of acute exacerbations requiring intensive care. These crises contribute substantially to patient morbidity and mortality. This study aimed to develop and validate machine-learning models for predicting intensive care unit (ICU) admission risk among patients with MG-related disease instability. Methods In this retrospective analysis of 314 MG patients hospitalized between 2015 and 2018, we implemented four machine learning algorithms, including logistic regression, support vector machine, extreme gradient boosting (XGBoost), and random forest, to predict ICU admission risk. The models incorporated fourteen clinical parameters as predictive features. The SHapley Additive exPlanations method was utilized to assess the importance of factors associated with ICU admission. Results Through 10-fold cross-validation, the XGBoost model demonstrated superior predictive performance (area under the receiver operating characteristic curve: 0.8943, accuracy: 0.8603, sensitivity: 0.7222, and specificity: 0.9125). Among the analyzed features, MG severity, as classified by the Myasthenia Gravis Foundation of America clinical classification, was identified as the most significant factor influencing ICU admission. Additionally, disease duration, a key continuous variable, was inversely correlated with the risk of ICU admission. Conclusion MG severity is the primary determinant of ICU admission, with shorter disease duration increasing the risk, possibly due to greater susceptibility to exacerbations. The XGBoost model exhibited excellent performance and accuracy, effectively identifying critical clinical factors for predicting ICU admission risk in MG patients. This novel, personalized approach to risk stratification elucidates crucial risk factors and has the potential to enhance clinical decision-making, optimize resource allocation, and ultimately improve patient outcomes.
Collapse
Affiliation(s)
- Chao-Yang Kuo
- Graduate Institute of Artificial Intelligence and Big Data in Healthcare, Smart Healthcare Interdisciplinary College, National Taipei University of Nursing and Health Sciences, Taipei, 112, Taiwan
| | - Emily Chia-Yu Su
- Institute of Biomedical Informatics, College of Medicine, National Yang Ming Chiao Tung University, Taipei, 112, Taiwan
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, 110, Taiwan
- Clinical Big Data Research Center, Taipei Medical University Hospital, Taipei, 110, Taiwan
| | - Hsu-Ling Yeh
- Institute of Public Health, National Yang Ming Chiao Tung University, Taipei, 112, Taiwan
- Department of Neurology, Shin Kong Wu Ho-Su Memorial Hospital, Taipei, 111, Taiwan
| | - Jiann-Horng Yeh
- Department of Neurology, Shin Kong Wu Ho-Su Memorial Hospital, Taipei, 111, Taiwan
- School of Medicine, Fu Jen Catholic University, New Taipei City, 242, Taiwan
- Department of Neurology, Kaohsiung Medical University, Kaohsiung, 807, Taiwan
| | - Hou-Chang Chiu
- Department of Neurology, Shin Kong Wu Ho-Su Memorial Hospital, Taipei, 111, Taiwan
- Department of Neurology, Taipei Medical University – Shuang Ho Hospital, New Taipei City, 235, Taiwan
- Taipei Neuroscience Institute, Taipei Medical University – Shuang Ho Hospital, New Taipei City, 235, Taiwan
| | - Chen-Chih Chung
- Department of Neurology, Taipei Medical University – Shuang Ho Hospital, New Taipei City, 235, Taiwan
- Taipei Neuroscience Institute, Taipei Medical University – Shuang Ho Hospital, New Taipei City, 235, Taiwan
- Department of Neurology, School of Medicine, College of Medicine, Taipei Medical University, Taipei, 110, Taiwan
| |
Collapse
|
2
|
Gu B, Bao R, Zhang C, Huang H. New Scalable and Efficient Online Pairwise Learning Algorithm. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:17099-17110. [PMID: 37656641 DOI: 10.1109/tnnls.2023.3299756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/03/2023]
Abstract
Pairwise learning is an important machine-learning topic with many practical applications. An online algorithm is the first choice for processing streaming data and is preferred for handling large-scale pairwise learning problems. However, existing online pairwise learning algorithms are not scalable and efficient enough for large-scale high-dimensional data, because they were designed based on singly stochastic gradients. To address this challenging problem, in this article, we propose a dynamic doubly stochastic gradient algorithm (D2SG) for online pairwise learning. Especially, only the time and space complexities of are needed for incorporating a new sample, where is the dimensionality of data. This means that our D2SG is much faster and more scalable than the existing online pairwise learning algorithms while the statistical accuracy can be guaranteed through our rigorous theoretical analysis under standard assumptions. The experimental results on a variety of real-world datasets not only confirm the theoretical result of our new D2SG algorithm, but also show that D2SG has better efficiency and scalability than the existing online pairwise learning algorithms.
Collapse
|
3
|
Liu B, Tsoumakas G. Integrating Similarities via Local Interaction Consistency and Optimizing Area Under the Curve Measures via Matrix Factorization for Drug-Target Interaction Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:2212-2225. [PMID: 39226198 DOI: 10.1109/tcbb.2024.3453499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2024]
Abstract
In drug discovery, identifying drug-target interactions (DTIs) via experimental approaches is a tedious and expensive procedure. Computational methods efficiently predict DTIs and recommend a small part of potential interacting pairs for further experimental confirmation, accelerating the drug discovery process. Although fusing heterogeneous drug and target similarities can improve the prediction ability, the existing similarity combination methods ignore the interaction consistency for neighbour entities. Furthermore, area under the precision-recall curve (AUPR) and area under the receiver operating characteristic curve (AUC) are two widely used evaluation metrics in DTI prediction. However, the two metrics are seldom considered as losses within existing DTI prediction methods. We propose a local interaction consistency (LIC) aware similarity integration method to fuse vital information from diverse views for DTI prediction models. Furthermore, we propose two matrix factorization (MF) methods that optimize AUPR and AUC using convex surrogate losses respectively, and then develop an ensemble MF approach that takes advantage of the two area under the curve metrics by combining the two single metric based MF models. Experimental results under different prediction settings show that the proposed methods outperform various competitors in terms of the metric(s) they optimize and are reliable in discovering potential new DTIs.
Collapse
|
4
|
Mao Y, Hao Y, Liu W, Lin X, Cao X. Class-Imbalanced-Aware Distantly Supervised Named Entity Recognition. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:12117-12129. [PMID: 37099461 DOI: 10.1109/tnnls.2023.3252084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Distantly supervised named entity recognition (NER), which automatically learns NER models without manually labeling data, has gained much attention recently. In distantly supervised NER, positive unlabeled (PU) learning methods have achieved notable success. However, existing PU learning-based NER methods are unable to automatically handle the class imbalance and further depend on the estimation of the unknown class prior; thus, the class imbalance and imperfect estimation of the class prior degenerate the NER performance. To address these issues, this article proposes a novel PU learning method for distantly supervised NER. The proposed method can automatically handle the class imbalance and does not need to engage in class prior estimation, which enables the proposed methods to achieve the state-of-the-art performance. Extensive experiments support our theoretical analysis and validate the superiority of our method.
Collapse
|
5
|
Bhat S, Mansoor A, Georgescu B, Panambur AB, Ghesu FC, Islam S, Packhäuser K, Rodríguez-Salas D, Grbic S, Maier A. AUCReshaping: improved sensitivity at high-specificity. Sci Rep 2023; 13:21097. [PMID: 38036602 PMCID: PMC10689839 DOI: 10.1038/s41598-023-48482-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 11/27/2023] [Indexed: 12/02/2023] Open
Abstract
The evaluation of deep-learning (DL) systems typically relies on the Area under the Receiver-Operating-Curve (AU-ROC) as a performance metric. However, AU-ROC, in its holistic form, does not sufficiently consider performance within specific ranges of sensitivity and specificity, which are critical for the intended operational context of the system. Consequently, two systems with identical AU-ROC values can exhibit significantly divergent real-world performance. This issue is particularly pronounced in the context of anomaly detection tasks, a commonly employed application of DL systems across various research domains, including medical imaging, industrial automation, manufacturing, cyber security, fraud detection, and drug research, among others. The challenge arises from the heavy class imbalance in training datasets, with the abnormality class often incurring a considerably higher misclassification cost compared to the normal class. Traditional DL systems address this by adjusting the weighting of the cost function or optimizing for specific points along the ROC curve. While these approaches yield reasonable results in many cases, they do not actively seek to maximize performance for the desired operating point. In this study, we introduce a novel technique known as AUCReshaping, designed to reshape the ROC curve exclusively within the specified sensitivity and specificity range, by optimizing sensitivity at a predetermined specificity level. This reshaping is achieved through an adaptive and iterative boosting mechanism that allows the network to focus on pertinent samples during the learning process. We primarily investigated the impact of AUCReshaping in the context of abnormality detection tasks, specifically in Chest X-Ray (CXR) analysis, followed by breast mammogram and credit card fraud detection tasks. The results reveal a substantial improvement, ranging from 2 to 40%, in sensitivity at high-specificity levels for binary classification tasks.
Collapse
Affiliation(s)
- Sheethal Bhat
- Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058, Erlangen, Germany.
- Digital Technology and Innovation, Siemens Healthineers, Erlangen, Germany.
| | - Awais Mansoor
- Digital Technology and Innovation, Siemens Medical Solutions, Princeton, NJ, 08540, USA
| | - Bogdan Georgescu
- Digital Technology and Innovation, Siemens Medical Solutions, Princeton, NJ, 08540, USA
| | - Adarsh B Panambur
- Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058, Erlangen, Germany
- Digital Technology and Innovation, Siemens Healthineers, Erlangen, Germany
| | - Florin C Ghesu
- Digital Technology and Innovation, Siemens Medical Solutions, Princeton, NJ, 08540, USA
| | - Saahil Islam
- Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058, Erlangen, Germany
- Digital Technology and Innovation, Siemens Healthineers, Erlangen, Germany
| | - Kai Packhäuser
- Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058, Erlangen, Germany
| | - Dalia Rodríguez-Salas
- Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058, Erlangen, Germany
| | - Sasa Grbic
- Digital Technology and Innovation, Siemens Medical Solutions, Princeton, NJ, 08540, USA
| | - Andreas Maier
- Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058, Erlangen, Germany
| |
Collapse
|
6
|
Sun Y, Vong CM, Wang S. Fast AUC Maximization Learning Machine With Simultaneous Outlier Detection. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:6843-6857. [PMID: 35476558 DOI: 10.1109/tcyb.2022.3164900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
While AUC maximizing support vector machine (AUCSVM) has been developed to solve imbalanced classification tasks, its huge computational burden will make AUCSVM become impracticable and even computationally forbidden for medium or large-scale imbalanced data. In addition, minority class sometimes means extremely important information for users or is corrupted by noises and/or outliers in practical application scenarios such as medical diagnosis, which actually inspires us to generalize the AUC concept to reflect such importance or upper bound of noises or outliers. In order to address these issues, by means of both the generalized AUC metric and the core vector machine (CVM) technique, a fast AUC maximizing learning machine, called ρ -AUCCVM, with simultaneous outlier detection is proposed in this study. ρ -AUCCVM has its notorious merits: 1) it indeed shares the CVM's advantage, that is, asymptotically linear time complexity with respect to the total number of sample pairs, together with space complexity independent on the total number of sample pairs and 2) it can automatically determine the importance of the minority class (assuming no noise) or the upper bound of noises or outliers. Extensive experimental results about benchmarking imbalanced datasets verify the above advantages of ρ -AUCCVM.
Collapse
|
7
|
Wang C, Wu K, Liu J. Evolutionary Multitasking AUC Optimization [Research Frontier]. IEEE COMPUT INTELL M 2022. [DOI: 10.1109/mci.2022.3155325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
8
|
Dang Z, Li X, Gu B, Deng C, Huang H. Large-Scale Nonlinear AUC Maximization via Triply Stochastic Gradients. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022; 44:1385-1398. [PMID: 32946382 DOI: 10.1109/tpami.2020.3024987] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Learning to improve AUC performance for imbalanced data is an important machine learning research problem. Most methods of AUC maximization assume that the model function is linear in the original feature space. However, this assumption is not suitable for nonlinear separable problems. Although there have been some nonlinear methods of AUC maximization, scaling up nonlinear AUC maximization is still an open question. To address this challenging problem, in this paper, we propose a novel large-scale nonlinear AUC maximization method (named as TSAM) based on the triply stochastic gradient descents. Specifically, we first use the random Fourier feature to approximate the kernel function. After that, we use the triply stochastic gradients w.r.t. the pairwise loss and random feature to iteratively update the solution. Finally, we prove that TSAM converges to the optimal solution with the rate of O(1/t) after t iterations. Experimental results on a variety of benchmark datasets not only confirm the scalability of TSAM, but also show a significant reduction of computational time compared with existing batch learning algorithms, while retaining the similar generalization performance.
Collapse
|