1
|
Du X, Li J, Wang B, Zhang J, Wang T, Wang J. NRGCNMDA: Microbe-Drug Association Prediction Based on Residual Graph Convolutional Networks and Conditional Random Fields. Interdiscip Sci 2025:10.1007/s12539-024-00678-z. [PMID: 39775537 DOI: 10.1007/s12539-024-00678-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2024] [Revised: 11/08/2024] [Accepted: 11/18/2024] [Indexed: 01/11/2025]
Abstract
The process of discovering new drugs related to microbes through traditional biological methods is lengthy and costly. In response to these issues, a new computational model (NRGCNMDA) is proposed to predict microbe-drug associations. First, Node2vec is used to extract potential associations between microorganisms and drugs, and a heterogeneous network of microbes and drugs is constructed. Then, a Graph Convolutional Network incorporating a fusion residual network mechanism (REGCN) is utilized to learn meaningful high-order similarity features. In addition, conditional random fields (CRF) are applied to ensure that microbes and drugs have similar feature embeddings. Finally, unobserved microbe-drug associations are scored based on combined embeddings. The experimental findings demonstrate that the NRGCNMDA approach outperforms several existing deep learning methods, and its AUC and AUPR values are 95.16% and 93.02%, respectively. The case study demonstrates that NRGCNMDA accurately predicts drugs associated with Enterococcus faecalis and Listeria monocytogenes, as well as microbes associated with ibuprofen and tetracycline.
Collapse
Affiliation(s)
- Xiaoxin Du
- Computer and Control Engineering College, Qiqihar University, Qiqihar, 161006, China.
- Heilongjiang Key Laboratory of Big Data Network Security Detection and Analysis, Qiqihar University, Qiqihar, 161006, China.
| | - Jingwei Li
- Computer and Control Engineering College, Qiqihar University, Qiqihar, 161006, China
- Heilongjiang Key Laboratory of Big Data Network Security Detection and Analysis, Qiqihar University, Qiqihar, 161006, China
| | - Bo Wang
- Computer and Control Engineering College, Qiqihar University, Qiqihar, 161006, China
- Heilongjiang Key Laboratory of Big Data Network Security Detection and Analysis, Qiqihar University, Qiqihar, 161006, China
| | - Jianfei Zhang
- Computer and Control Engineering College, Qiqihar University, Qiqihar, 161006, China
- Heilongjiang Key Laboratory of Big Data Network Security Detection and Analysis, Qiqihar University, Qiqihar, 161006, China
| | - Tongxuan Wang
- Computer and Control Engineering College, Qiqihar University, Qiqihar, 161006, China
- Heilongjiang Key Laboratory of Big Data Network Security Detection and Analysis, Qiqihar University, Qiqihar, 161006, China
| | - Junqi Wang
- Computer and Control Engineering College, Qiqihar University, Qiqihar, 161006, China
- Heilongjiang Key Laboratory of Big Data Network Security Detection and Analysis, Qiqihar University, Qiqihar, 161006, China
| |
Collapse
|
2
|
Zhang X, Dong S, Chen J, Tian Q, Gong Y, Hong X. Deep Class-Incremental Learning From Decentralized Data. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:7190-7203. [PMID: 36315536 DOI: 10.1109/tnnls.2022.3214573] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
In this article, we focus on a new and challenging decentralized machine learning paradigm in which there are continuous inflows of data to be addressed and the data are stored in multiple repositories. We initiate the study of data-decentralized class-incremental learning (DCIL) by making the following contributions. First, we formulate the DCIL problem and develop the experimental protocol. Second, we introduce a paradigm to create a basic decentralized counterpart of typical (centralized) CIL approaches, and as a result, establish a benchmark for the DCIL study. Third, we further propose a decentralized composite knowledge incremental distillation (DCID) framework to transfer knowledge from historical models and multiple local sites to the general model continually. DCID consists of three main components, namely, local CIL, collaborated knowledge distillation (KD) among local models, and aggregated KD from local models to the general one. We comprehensively investigate our DCID framework by using a different implementation of the three components. Extensive experimental results demonstrate the effectiveness of our DCID framework. The source code of the baseline methods and the proposed DCIL is available at https://github.com/Vision-Intelligence-and-Robots-Group/DCIL.
Collapse
|
3
|
Cho JW, Kim DJ, Jung Y, Kweon IS. MCDAL: Maximum Classifier Discrepancy for Active Learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:8753-8763. [PMID: 35316194 DOI: 10.1109/tnnls.2022.3152786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Recent state-of-the-art active learning methods have mostly leveraged generative adversarial networks (GANs) for sample acquisition; however, GAN is usually known to suffer from instability and sensitivity to hyperparameters. In contrast to these methods, in this article, we propose a novel active learning framework that we call Maximum Classifier Discrepancy for Active Learning (MCDAL) that takes the prediction discrepancies between multiple classifiers. In particular, we utilize two auxiliary classification layers that learn tighter decision boundaries by maximizing the discrepancies among them. Intuitively, the discrepancies in the auxiliary classification layers' predictions indicate the uncertainty in the prediction. In this regard, we propose a novel method to leverage the classifier discrepancies for the acquisition function for active learning. We also provide an interpretation of our idea in relation to existing GAN-based active learning methods and domain adaptation frameworks. Moreover, we empirically demonstrate the utility of our approach where the performance of our approach exceeds the state-of-the-art methods on several image classification and semantic segmentation datasets in active learning setups.
Collapse
|
4
|
Baik SM, Hong KS, Park DJ. Deep learning approach for early prediction of COVID-19 mortality using chest X-ray and electronic health records. BMC Bioinformatics 2023; 24:190. [PMID: 37161395 PMCID: PMC10169101 DOI: 10.1186/s12859-023-05321-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 05/05/2023] [Indexed: 05/11/2023] Open
Abstract
BACKGROUND An artificial-intelligence (AI) model for predicting the prognosis or mortality of coronavirus disease 2019 (COVID-19) patients will allow efficient allocation of limited medical resources. We developed an early mortality prediction ensemble model for COVID-19 using AI models with initial chest X-ray and electronic health record (EHR) data. RESULTS We used convolutional neural network (CNN) models (Inception-ResNet-V2 and EfficientNet) for chest X-ray analysis and multilayer perceptron (MLP), Extreme Gradient Boosting (XGBoost), and random forest (RF) models for EHR data analysis. The Gradient-weighted Class Activation Mapping and Shapley Additive Explanations (SHAP) methods were used to determine the effects of these features on COVID-19. We developed an ensemble model (Area under the receiver operating characteristic curve of 0.8698) using a soft voting method with weight differences for CNN, XGBoost, MLP, and RF models. To resolve the data imbalance, we conducted F1-score optimization by adjusting the cutoff values to optimize the model performance (F1 score of 0.77). CONCLUSIONS Our study is meaningful in that we developed an early mortality prediction model using only the initial chest X-ray and EHR data of COVID-19 patients. Early prediction of the clinical courses of patients is helpful for not only treatment but also bed management. Our results confirmed the performance improvement of the ensemble model achieved by combining AI models. Through the SHAP method, laboratory tests that indicate the factors affecting COVID-19 mortality were discovered, highlighting the importance of these tests in managing COVID-19 patients.
Collapse
Affiliation(s)
- Seung Min Baik
- Division of Critical Care Medicine, Department of Surgery, Ewha Womans University Mokdong Hospital, Ewha Womans University College of Medicine, Seoul, Korea
| | - Kyung Sook Hong
- Division of Critical Care Medicine, Department of Surgery, Ewha Womans University Seoul Hospital, Ewha Womans University College of Medicine, Seoul, Korea
| | - Dong Jin Park
- Department of Laboratory Medicine, Eunpyeong St. Mary's Hospital, College of Medicine, The Catholic University of Korea, 1021, Tongil-ro, Eunpyeong-gu, Seoul, 03312, Korea.
| |
Collapse
|
5
|
Xu Y, Zhang H. Convergence of deep convolutional neural networks. Neural Netw 2022; 153:553-563. [PMID: 35839599 DOI: 10.1016/j.neunet.2022.06.031] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Revised: 05/16/2022] [Accepted: 06/24/2022] [Indexed: 10/17/2022]
Abstract
Convergence of deep neural networks as the depth of the networks tends to infinity is fundamental in building the mathematical foundation for deep learning. In a previous study, we investigated this question for deep networks with the Rectified Linear Unit (ReLU) activation function and with a fixed width. This does not cover the important convolutional neural networks where the widths are increased from layer to layer. For this reason, we first study convergence of general ReLU networks with increased widths and then apply the results obtained to deep convolutional neural networks. It turns out the convergence reduces to convergence of infinite products of matrices with increased sizes, which has not been considered in the literature. We establish sufficient conditions for convergence of such infinite products of matrices. Based on the conditions, we present sufficient conditions for pointwise convergence of general deep ReLU networks with increasing widths, and as well as pointwise convergence of deep ReLU convolutional neural networks.
Collapse
Affiliation(s)
- Yuesheng Xu
- Department of Mathematics & Statistics, Old Dominion University, Norfolk, VA 23529, USA.
| | - Haizhang Zhang
- School of Mathematics (Zhuhai), Sun Yat-sen University, Zhuhai, PR China.
| |
Collapse
|
6
|
Dai D, Tang X, Liu Y, Xia S, Wang G. Multi-granularity association learning for on-the-fly fine-grained sketch-based image retrieval. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
|
7
|
Development of Machine-Learning Model to Predict COVID-19 Mortality: Application of Ensemble Model and Regarding Feature Impacts. Diagnostics (Basel) 2022; 12:diagnostics12061464. [PMID: 35741274 PMCID: PMC9221552 DOI: 10.3390/diagnostics12061464] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 06/13/2022] [Accepted: 06/13/2022] [Indexed: 11/16/2022] Open
Abstract
This study was designed to develop machine-learning models to predict COVID-19 mortality and identify its key features based on clinical characteristics and laboratory tests. For this, deep-learning (DL) and machine-learning (ML) models were developed using receiver operating characteristic (ROC) area under the curve (AUC) and F1 score optimization of 87 parameters. Of the two, the DL model exhibited better performance (AUC 0.8721, accuracy 0.84, and F1 score 0.76). However, we also blended DL with ML, and the ensemble model performed the best (AUC 0.8811, accuracy 0.85, and F1 score 0.77). The DL model is generally unable to extract feature importance; however, we succeeded by using the Shapley Additive exPlanations method for each model. This study demonstrated both the applicability of DL and ML models for classifying COVID-19 mortality using hospital-structured data and that the ensemble model had the best predictive ability.
Collapse
|
8
|
Yang X, Wang H, Xie D, Deng C, Tao D. Object-Agnostic Transformers for Video Referring Segmentation. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2022; 31:2839-2849. [PMID: 35349441 DOI: 10.1109/tip.2022.3161832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Video referring segmentation focuses on segmenting out the object in a video based on the corresponding textual description. Previous works have primarily tackled this task by devising two crucial parts, an intra-modal module for context modeling and an inter-modal module for heterogeneous alignment. However, there are two essential drawbacks of this approach: (1) it lacks joint learning of context modeling and heterogeneous alignment, leading to insufficient interactions among input elements; (2) both modules require task-specific expert knowledge to design, which severely limits the flexibility and generality of prior methods. To address these problems, we here propose a novel Object-Agnostic Transformer-based Network, called OATNet, that simultaneously conducts intra-modal and inter-modal learning for video referring segmentation, without the aid of object detection or category-specific pixel labeling. More specifically, we first directly feed the sequence of textual tokens and visual tokens (pixels rather than detected object bounding boxes) into a multi-modal encoder, where context and alignment are simultaneously and effectively explored. We then design a novel cascade segmentation network to decouple our task into coarse-grained segmentation and fine-grained refinement. Moreover, considering the difficulty of samples, a more balanced metric is provided to better diagnose the performance of the proposed method. Extensive experiments on two popular datasets, A2D Sentences and J-HMDB Sentences, demonstrate that our proposed approach noticeably outperforms state-of-the-art methods.
Collapse
|