51
|
Ahuja C, Sethia D. Harnessing Few-Shot Learning for EEG signal classification: a survey of state-of-the-art techniques and future directions. Front Hum Neurosci 2024; 18:1421922. [PMID: 39050382 PMCID: PMC11266297 DOI: 10.3389/fnhum.2024.1421922] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Accepted: 05/31/2024] [Indexed: 07/27/2024] Open
Abstract
This paper presents a systematic literature review, providing a comprehensive taxonomy of Data Augmentation (DA), Transfer Learning (TL), and Self-Supervised Learning (SSL) techniques within the context of Few-Shot Learning (FSL) for EEG signal classification. EEG signals have shown significant potential in various paradigms, including Motor Imagery, Emotion Recognition, Visual Evoked Potentials, Steady-State Visually Evoked Potentials, Rapid Serial Visual Presentation, Event-Related Potentials, and Mental Workload. However, challenges such as limited labeled data, noise, and inter/intra-subject variability have impeded the effectiveness of traditional machine learning (ML) and deep learning (DL) models. This review methodically explores how FSL approaches, incorporating DA, TL, and SSL, can address these challenges and enhance classification performance in specific EEG paradigms. It also delves into the open research challenges related to these techniques in EEG signal classification. Specifically, the review examines the identification of DA strategies tailored to various EEG paradigms, the creation of TL architectures for efficient knowledge transfer, and the formulation of SSL methods for unsupervised representation learning from EEG data. Addressing these challenges is crucial for enhancing the efficacy and robustness of FSL-based EEG signal classification. By presenting a structured taxonomy of FSL techniques and discussing the associated research challenges, this systematic review offers valuable insights for future investigations in EEG signal classification. The findings aim to guide and inspire researchers, promoting advancements in applying FSL methodologies for improved EEG signal analysis and classification in real-world settings.
Collapse
Affiliation(s)
- Chirag Ahuja
- Department of Computer Science and Engineering, Delhi Technological University, New Delhi, India
| | - Divyashikha Sethia
- Department of Software Engineering, Delhi Technology University, New Delhi, India
| |
Collapse
|
52
|
Luo Y, Liu W, Li H, Lu Y, Lu BL. A cross-scenario and cross-subject domain adaptation method for driving fatigue detection. J Neural Eng 2024; 21:046004. [PMID: 38838664 DOI: 10.1088/1741-2552/ad546d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Accepted: 06/05/2024] [Indexed: 06/07/2024]
Abstract
Objective.The scarcity of electroencephalogram (EEG) data, coupled with individual and scenario variations, leads to considerable challenges in real-world EEG-based driver fatigue detection. We propose a domain adaptation method that utilizes EEG data collected from a laboratory to supplement real-world EEG data and constructs a cross-scenario and cross-subject driver fatigue detection model for real-world scenarios.Approach.First, we collect EEG data from subjects participating in a driving experiment conducted in both laboratory and real-world scenarios. To address the issue of data scarcity, we build a real-world fatigued driving detection model by integrating the real-world data with the laboratory data. Then, we propose a method named cross-scenario and cross-subject domain adaptation (CS2DA), which aims to eliminate the domain shift problem caused by individual variances and scenario differences. Adversarial learning is adopted to extract the common features observed across different subjects within the same scenario. The multikernel maximum mean discrepancy (MK-MMD) method is applied to further minimize scenario differences. Additionally, we propose a conditional MK-MMD constraint to better utilize label information. Finally, we use seven rules to fuse the predicted labels.Main results.We evaluate the CS2DA method through extensive experiments conducted on the two EEG datasets created in this work: the SEED-VLA and the SEED-VRW datasets. Different domain adaptation methods are used to construct a real-world fatigued driving detection model using data from laboratory and real-world scenarios, as well as a combination of both. Our findings show that the proposed CS2DA method outperforms the existing traditional and adversarial learning-based domain adaptation approaches. We also find that combining data from both laboratory and real-world scenarios improves the performance of the model.Significance.This study contributes two EEG-based fatigue driving datasets and demonstrates that the proposed CS2DA method can effectively enhance the performance of a real-world fatigued driving detection model.
Collapse
Affiliation(s)
- Yun Luo
- RuiJin-Mihoyo Laboratory, Clinical Neuroscience Center, RuiJin Hospital, Shanghai Jiao Tong University School of Medicine, 197 Ruijin 2nd Rd., Shanghai 200020, People's Republic of China
| | - Wei Liu
- RuiJin-Mihoyo Laboratory, Clinical Neuroscience Center, RuiJin Hospital, Shanghai Jiao Tong University School of Medicine, 197 Ruijin 2nd Rd., Shanghai 200020, People's Republic of China
| | - Hanqi Li
- Disaster Information Systems with Information Technology, Disaster Prevention Research Institute, Kyoto University, Gokasho, Uji, Kyoto 611-0011, Japan
| | - Yong Lu
- RuiJin Hospital, Shanghai Jiao Tong University School of Medicine, 197 Ruijin 2nd Rd., Shanghai 200020, People's Republic of China
| | - Bao-Liang Lu
- RuiJin-Mihoyo Laboratory, Clinical Neuroscience Center, RuiJin Hospital, Shanghai Jiao Tong University School of Medicine, 197 Ruijin 2nd Rd., Shanghai 200020, People's Republic of China
- Department of Computer Science and Engineering, Center for Brain-Like Computing and Machine Intelligence, Shanghai Jiao Tong University, 800 Dong Chuan Rd., Shanghai 200240, People's Republic of China
| |
Collapse
|
53
|
Zhang H, Zuo T, Chen Z, Wang X, Sun PZH. Evolutionary Ensemble Learning for EEG-Based Cross-Subject Emotion Recognition. IEEE J Biomed Health Inform 2024; 28:3872-3881. [PMID: 38954558 DOI: 10.1109/jbhi.2024.3384816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/04/2024]
Abstract
Electroencephalogram (EEG) has been widely utilized in emotion recognition due to its high temporal resolution and reliability. However, the individual differences and non-stationary characteristics of EEG, along with the complexity and variability of emotions, pose challenges in generalizing emotion recognition models across subjects. In this paper, an end-to-end framework is proposed to improve the performance of cross-subject emotion recognition. A novel evolutionary programming (EP)-based optimization strategy with neural network (NN) as the base classifier termed NN ensemble with EP (EPNNE) is designed for cross-subject emotion recognition. The effectiveness of the proposed method is evaluated on the publicly available DEAP, FACED, SEED, and SEED-IV datasets. Numerical results demonstrate that the proposed method is superior to state-of-the-art cross-subject emotion recognition methods. The proposed end-to-end framework for cross-subject emotion recognition aids biomedical researchers in effectively assessing individual emotional states, thereby enabling efficient treatment and interventions.
Collapse
|
54
|
Wang Z, Li A, Wang Z, Zhou T, Xu T, Hu H. Bi-Stream Adaptation Network for Motor Imagery Decoding. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2024; 2024:1-4. [PMID: 40031514 DOI: 10.1109/embc53108.2024.10782480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
Abstract
Neural activities in distinct brain regions variably contribute to the formation of motor imagery (MI). Utilizing the hidden contextual information can thereby enhance network performance by having a comprehensive understanding of MI. Besides, due to the non-stationarity of EEG, the global and local distributions of cross-session EEG from an individual vary in applications. Based on these ideas, a novel Bi-Stream Adaptation Network (BSAN) is proposed to generate multi-scale context dependencies and to bridge the cross-session discrepancies in MI classification. Specifically, a Bi-attention module is proposed to cultivate multi-scale temporal dependencies and figure out the predominant brain regions. After features extraction, a Bi-discriminator is trained to implement the task of domain adaptation both globally and locally. To validate the proposed BSAN, extensive experiments were conducted based on two public MI datasets. The results revealed that the proposed BSAN improved the performance and robustness of MI classification and outperformed several state-of-the-art methods.
Collapse
|
55
|
Li J, She Q, Meng M, Du S, Zhang Y. Three-stage transfer learning for motor imagery EEG recognition. Med Biol Eng Comput 2024; 62:1689-1701. [PMID: 38342784 DOI: 10.1007/s11517-024-03036-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 01/17/2024] [Indexed: 02/13/2024]
Abstract
Motor imagery (MI) paradigms have been widely used in neural rehabilitation and drowsiness state assessment. The progress in brain-computer interface (BCI) technology has emphasized the importance of accurately and efficiently detecting motor imagery intentions from electroencephalogram (EEG). Despite the recent breakthroughs made in developing EEG-based algorithms for decoding MI, the accuracy and efficiency of these models remain limited by technical challenges posed by cross-subject heterogeneity in EEG data processing and the scarcity of EEG data for training. Inspired by the optimal transport theory, this study aims to develop a novel three-stage transfer learning (TSTL) method, which uses the existing labeled data from a source domain to improve classification performance on an unlabeled target domain. Notably, the proposed method comprises three components, namely, the Riemannian tangent space mapping (RTSM), source domain transformer (SDT), and optimal subspace mapping (OSM). The RTSM maps a symmetric positive definite matrix from the Riemannian space to the tangent space to minimize the marginal probability distribution drift. The SDT transforms the source domain to a target domain by finding the optimal transport mapping matrix to reduce the joint probability distribution differences. The OSM finally maps the transformed source domain and original target domain to the same subspace to further mitigate the distribution discrepancy. The performance of the proposed method was validated on two public BCI datasets, and the average accuracy of the algorithm on two datasets was 72.24% and 69.29%. Our results demonstrated the improved performance of EEG-based MI detection in comparison with state-of-the-art algorithms.
Collapse
Affiliation(s)
- Junhao Li
- School of Automation, Hangzhou Dianzi University, Hangzhou, 310018, Zhejiang, China
| | - Qingshan She
- School of Automation, Hangzhou Dianzi University, Hangzhou, 310018, Zhejiang, China.
| | - Ming Meng
- School of Automation, Hangzhou Dianzi University, Hangzhou, 310018, Zhejiang, China
| | - Shengzhi Du
- Department of Electrical Engineering, Tshwane University of Technology, Pretoria, 0001, South Africa
| | - Yingchun Zhang
- Department of Biomedical Engineering, University of Houston, Houston, TX, 77204, USA
| |
Collapse
|
56
|
Lu N, Xiao H, Ma Z, Yan T, Han M. Domain Adaptation With Self-Supervised Learning and Feature Clustering for Intelligent Fault Diagnosis. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:7657-7670. [PMID: 36378787 DOI: 10.1109/tnnls.2022.3219896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Domain adaptation indeed promotes the progress of intelligent fault diagnosis in industrial scenarios. The abundant labeled samples are not necessary. The identical distribution between the training and testing datasets is not any more the prerequisite for intelligent fault diagnosis working. However, two issues arise subsequently: Feature learning in domain adaptation framework tends to be biased to the source domain, and unreliable pseudolabeling seriously impacts on the conditional domain adaptation. In this article, a new domain adaptation approach with self-supervised learning and feature clustering (DASSL-FC) is proposed, trying to alleviate the issues by unbiased feature learning and pseudolabels updating strategy. Taking different transformation methods as pretext, the transformed data and its pretext train a neural network in an SSL way. As to pseudolabeling, clusters are taken as the auxiliary information to correct the network predicted labels in terms of the "strong cluster" rule. Then, the updated pseudolabels and their confidence are enforced to further estimate the conditional distribution discrepancy and its confidence weight. To verify the effectiveness of the proposed method, the experiments are implemented on intraplatform and interplatforms for simulating the practical scenarios.
Collapse
|
57
|
Zhu X, Liu C, Zhao L, Wang S. EEG Emotion Recognition Network Based on Attention and Spatiotemporal Convolution. SENSORS (BASEL, SWITZERLAND) 2024; 24:3464. [PMID: 38894254 PMCID: PMC11174415 DOI: 10.3390/s24113464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 05/24/2024] [Accepted: 05/26/2024] [Indexed: 06/21/2024]
Abstract
Human emotions are complex psychological and physiological responses to external stimuli. Correctly identifying and providing feedback on emotions is an important goal in human-computer interaction research. Compared to facial expressions, speech, or other physiological signals, using electroencephalogram (EEG) signals for the task of emotion recognition has advantages in terms of authenticity, objectivity, and high reliability; thus, it is attracting increasing attention from researchers. However, the current methods have significant room for improvement in terms of the combination of information exchange between different brain regions and time-frequency feature extraction. Therefore, this paper proposes an EEG emotion recognition network, namely, self-organized graph pesudo-3D convolution (SOGPCN), based on attention and spatiotemporal convolution. Unlike previous methods that directly construct graph structures for brain channels, the proposed SOGPCN method considers that the spatial relationships between electrodes in each frequency band differ. First, a self-organizing map is constructed for each channel in each frequency band to obtain the 10 most relevant channels to the current channel, and graph convolution is employed to capture the spatial relationships between all channels in the self-organizing map constructed for each channel in each frequency band. Then, pseudo-three-dimensional convolution combined with partial dot product attention is implemented to extract the temporal features of the EEG sequence. Finally, LSTM is employed to learn the contextual information between adjacent time-series data. Subject-dependent and subject-independent experiments are conducted on the SEED dataset to evaluate the performance of the proposed SOGPCN method, which achieves recognition accuracies of 95.26% and 94.22%, respectively, indicating that the proposed method outperforms several baseline methods.
Collapse
Affiliation(s)
| | | | | | - Shengming Wang
- National Engineering Research Center of Educational Big Data, Central China Normal University, Wuhan 430079, China; (X.Z.); (C.L.); (L.Z.)
| |
Collapse
|
58
|
Wang B, Yu A, Wang H, Liu J. Modeling and Optimization of an Enhanced Soft Sensor for the Fermentation Process of Pichia pastoris. SENSORS (BASEL, SWITZERLAND) 2024; 24:3017. [PMID: 38793872 PMCID: PMC11125098 DOI: 10.3390/s24103017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Revised: 05/02/2024] [Accepted: 05/08/2024] [Indexed: 05/26/2024]
Abstract
This paper proposes a novel soft sensor modeling approach, MIC-TCA-INGO-LSSVM, to address the decline in performance of soft sensor models during the fermentation process of Pichia pastoris, caused by changes in working conditions. Initially, the transfer component analysis (TCA) method is utilized to minimize the differences in data distribution across various working conditions. Subsequently, a least squares support vector machine (LSSVM) model is constructed using the dataset adapted by TCA, and strategies for improving the northern goshawk optimization (INGO) algorithm are proposed to optimize the parameters of the LSSVM model. Finally, to further enhance the model's generalization ability and prediction accuracy, considering the transfer of knowledge from multiple-source working conditions, a sub-model weighted ensemble scheme is proposed based on the maximum information coefficient (MIC) algorithm. The proposed soft sensor model is employed to predict cell and product concentrations during the fermentation process of Pichia pastoris. Simulation results indicate that the RMSE of the INGO-LSSVM model in predicting cell and product concentrations is reduced by 47.3% and 42.1%, respectively, compared to the NGO-LSSVM model. Additionally, TCA significantly enhances the model's adaptability when working conditions change. Moreover, the soft sensor model based on TCA and the MIC-weighted ensemble method achieves a reduction of 41.6% and 31.3% in the RMSE for predicting cell and product concentrations, respectively, compared to the single-source condition transfer model TCA-INGO-LSSVM. These results demonstrate the high reliability and predictive performance of the proposed soft sensor method under varying working conditions.
Collapse
Affiliation(s)
| | - Ameng Yu
- Key Laboratory of Agricultural Measurement and Control Technology and Equipment for Mechanical Industrial Facilities, School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, China; (B.W.); (H.W.); (J.L.)
| | | | | |
Collapse
|
59
|
Jin H, Gao Y, Wang T, Gao P. DAST: A Domain-Adaptive Learning Combining Spatio-Temporal Dynamic Attention for Electroencephalography Emotion Recognition. IEEE J Biomed Health Inform 2024; 28:2512-2523. [PMID: 37607151 DOI: 10.1109/jbhi.2023.3307606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
Multimodal emotion recognition with EEG-based have become mainstream in affective computing. However, previous studies mainly focus on perceived emotions (including posture, speech or face expression et al.) of different subjects, while the lack of research on induced emotions (including video or music et al.) limited the development of two-ways emotions. To solve this problem, we propose a multimodal domain adaptive method based on EEG and music called the DAST, which uses spatio-temporal adaptive attention (STA-attention) to globally model the EEG and maps all embeddings dynamically into high-dimensionally space by adaptive space encoder (ASE). Then, adversarial training is performed with domain discriminator and ASE to learn invariant emotion representations. Furthermore, we conduct extensive experiments on the DEAP dataset, and the results show that our method can further explore the relationship between induced and perceived emotions, and provide a reliable reference for exploring the potential correlation between EEG and music stimulation.
Collapse
|
60
|
Shi X, She Q, Fang F, Meng M, Tan T, Zhang Y. Enhancing cross-subject EEG emotion recognition through multi-source manifold metric transfer learning. Comput Biol Med 2024; 174:108445. [PMID: 38603901 DOI: 10.1016/j.compbiomed.2024.108445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2023] [Revised: 03/08/2024] [Accepted: 04/07/2024] [Indexed: 04/13/2024]
Abstract
Transfer learning (TL) has demonstrated its efficacy in addressing the cross-subject domain adaptation challenges in affective brain-computer interfaces (aBCI). However, previous TL methods usually use a stationary distance, such as Euclidean distance, to quantify the distribution dissimilarity between two domains, overlooking the inherent links among similar samples, potentially leading to suboptimal feature mapping. In this study, we introduced a novel algorithm called multi-source manifold metric transfer learning (MSMMTL) to enhance the efficacy of conventional TL. Specifically, we first selected the source domain based on Mahalanobis distance to enhance the quality of the source domains and then used manifold feature mapping approach to map the source and target domains on the Grassmann manifold to mitigate data drift between domains. In this newly established shared space, we optimized the Mahalanobis metric by maximizing the inter-class distances while minimizing the intra-class distances in the target domain. Recognizing that significant distribution discrepancies might persist across different domains even on the manifold, to ensure similar distributions between the source and target domains, we further imposed constraints on both domains under the Mahalanobis metric. This approach aims to reduce distributional disparities and enhance the electroencephalogram (EEG) emotion recognition performance. In cross-subject experiments, the MSMMTL model exhibits average classification accuracies of 88.83 % and 65.04 % for SEED and DEAP, respectively, underscoring the superiority of our proposed MSMMTL over other state-of-the-art methods. MSMMTL can effectively solve the problem of individual differences in EEG-based affective computing.
Collapse
Affiliation(s)
- XinSheng Shi
- School of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, China
| | - Qingshan She
- School of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, China; International Joint Research Laboratory for Autonomous Robotic Systems, Hangzhou, Zhejiang, 310018, China.
| | - Feng Fang
- Department of Biomedical Engineering, University of Miami, Coral Gables, Florida, USA
| | - Ming Meng
- School of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, China; International Joint Research Laboratory for Autonomous Robotic Systems, Hangzhou, Zhejiang, 310018, China
| | - Tongcai Tan
- Department of Rehabilitation, Medicine, Zhejiang Provincial People's Hospital, People's Hospital of Hangzhou Medical College, Hangzhou, Zhejiang, 310014, China
| | - Yingchun Zhang
- Department of Biomedical Engineering, University of Miami, Coral Gables, Florida, USA
| |
Collapse
|
61
|
Yang C, Xue B, Tan KC, Zhang M. A Co-Training Framework for Heterogeneous Heuristic Domain Adaptation. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:6863-6877. [PMID: 36269922 DOI: 10.1109/tnnls.2022.3212924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
The purpose of this article is to address unsupervised domain adaptation (UDA) where a labeled source domain and an unlabeled target domain are given. Recent advanced UDA methods attempt to remove domain-specific properties by separating domain-specific information from domain-invariant representations, which heavily rely on the designed neural network structures. Meanwhile, they do not consider class discriminate representations when learning domain-invariant representations. To this end, this article proposes a co-training framework for heterogeneous heuristic domain adaptation (CO-HHDA) to address the above issues. First, a heterogeneous heuristic network is introduced to model domain-specific characters. It allows structures of heuristic network to be different between domains to avoid underfitting or overfitting. Specially, we initialize a small structure that is shared between domains and increase a subnetwork for the domain which preserves rich specific information. Second, we propose a co-training scheme to train two classifiers, a source classifier and a target classifier, to enhance class discriminate representations. The two classifiers are designed based on domain-invariant representations, where the source classifier learns from the labeled source data, and the target classifier is trained from the generated target pseudolabeled data. The two classifiers teach each other in the training process with high-quality pseudolabeled data. Meanwhile, an adaptive threshold is presented to select reliable pseudolabels in each classifier. Empirical results on three commonly used benchmark datasets demonstrate that the proposed CO-HHDA outperforms the state-of-the-art domain adaptation methods.
Collapse
|
62
|
Zhao H, Qiu S, Bai M, Wang L, Wang Z. Toxicity prediction and classification of Gunqile-7 with small sample based on transfer learning method. Comput Biol Med 2024; 173:108348. [PMID: 38531249 DOI: 10.1016/j.compbiomed.2024.108348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 03/10/2024] [Accepted: 03/17/2024] [Indexed: 03/28/2024]
Abstract
Drug-induced diseases are the most important component of iatrogenic disease. It is the duty of doctors to provide a reasonable and safe dose of medication. Gunqile-7 is a Mongolian medicine with analgesic and anti-inflammatory effects. As a foreign substance in the body, even with reasonable medication, it may produce varying degrees of adverse reactions or toxic side effects. Since the cost of collecting Gunqile-7 for pharmacological animal trials is high and the data sample is small, this paper employs transfer learning and data augmentation methods to study the toxicity of Gunqile-7. More specifically, to reduce the necessary number of training samples, the data augmentation approach is employed to extend the data set. Then, the transfer learning method and one-dimensional convolutional neural network are utilized to train the network. In addition, we use the support vector machine-recursive feature elimination method for feature selection to reduce features that have adverse effects on model predictions. Furthermore, due to the important role of the pre-trained model of transfer learning, we select a quantitative toxicity prediction model as the pre-trained model, which is consistent with the purpose of this paper. Lastly, the experimental results demonstrate the efficiency of the proposed method. Our method can improve accuracy by up to 9 percentage points compared to the method without transfer learning on a small sample set.
Collapse
Affiliation(s)
- Hongkai Zhao
- Key Laboratory of Intelligent Control and Optimization for Industrial Equipment of Ministry of Education, Dalian University of Technology, Dalian 116024, China; School of Control Science and Engineering, Dalian University of Technology, Dalian 116024, China.
| | - Sen Qiu
- Key Laboratory of Intelligent Control and Optimization for Industrial Equipment of Ministry of Education, Dalian University of Technology, Dalian 116024, China; School of Control Science and Engineering, Dalian University of Technology, Dalian 116024, China.
| | - Meirong Bai
- Key Laboratory of Ministry of Education of Mongolian Medicine RD Engineering, Inner Mongolia Minzu University, Tongliao 028000, China.
| | - Luyao Wang
- Key Laboratory of Intelligent Control and Optimization for Industrial Equipment of Ministry of Education, Dalian University of Technology, Dalian 116024, China; School of Control Science and Engineering, Dalian University of Technology, Dalian 116024, China.
| | - Zhelong Wang
- Key Laboratory of Intelligent Control and Optimization for Industrial Equipment of Ministry of Education, Dalian University of Technology, Dalian 116024, China; School of Control Science and Engineering, Dalian University of Technology, Dalian 116024, China.
| |
Collapse
|
63
|
Yang C, Liu Q, Liu Y, Cheung YM. Transfer Dynamic Latent Variable Modeling for Quality Prediction of Multimode Processes. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:6061-6074. [PMID: 37079407 DOI: 10.1109/tnnls.2023.3265762] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Quality prediction is beneficial to intelligent inspection, advanced process control, operation optimization, and product quality improvements of complex industrial processes. Most of the existing work obeys the assumption that training samples and testing samples follow similar data distributions. The assumption is, however, not true for practical multimode processes with dynamics. In practice, traditional approaches mostly establish a prediction model using the samples from the principal operating mode (POM) with abundant samples. The model is inapplicable to other modes with a few samples. In view of this, this article will propose a novel dynamic latent variable (DLV)-based transfer learning approach, called transfer DLV regression (TDLVR), for quality prediction of multimode processes with dynamics. The proposed TDLVR can not only derive the dynamics between process variables and quality variables in the POM but also extract the co-dynamic variations among process variables between the POM and the new mode. This can effectively overcome data marginal distribution discrepancy and enrich the information of the new mode. To make full use of the available labeled samples from the new mode, an error compensation mechanism is incorporated into the established TDLVR, termed compensated TDLVR (CTDLVR), to adapt to the conditional distribution discrepancy. Empirical studies show the efficacy of the proposed TDLVR and CTDLVR methods in several case studies, including numerical simulation examples and two real-industrial process examples.
Collapse
|
64
|
Ran M, Tang B, Sun P, Li Q, Shi T. A gradient aligned domain adversarial network for unsupervised intelligent fault diagnosis of gearboxes. ISA TRANSACTIONS 2024:S0019-0578(24)00140-X. [PMID: 38594162 DOI: 10.1016/j.isatra.2024.03.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Revised: 03/26/2024] [Accepted: 03/26/2024] [Indexed: 04/11/2024]
Abstract
Unsupervised domain adaptation alleviates the dependencies of conventional fault diagnosis methods on sufficient labeled data and strict data distributions. Nonetheless, the current domain adaptation methods only concentrate on the data distributions and ignore the feature gradient distributions, leading to some samples being misclassified due to large gradient discrepancies, thus affecting diagnosis performance. In this paper, a gradient aligned domain adversarial network (GADAN) is proposed. First, the discrepancies of the marginal and conditional distribution between the source and target domain are reduced by minimizing the joint maximum mean discrepancy. Then, a pseudo-labeling approach based on a clustering self-supervised strategy is utilized to attain high-quality pseudo-labels of target domains, and most importantly in the dimension of the data gradient, the feature gradient distributions are aligned by adversarial learning to further reduce the domain shift, even if the distributions of the two domains are close enough. Finally, experiments and engineering applications demonstrate the effectiveness and superiority of GADAN for transfer diagnosis between various working conditions or different machines.
Collapse
Affiliation(s)
- Maoqi Ran
- State Key Laboratory of Mechanical Transmission for Advanced Equipment, Chongqing University, Chongqing 400030, China.
| | - Baoping Tang
- State Key Laboratory of Mechanical Transmission for Advanced Equipment, Chongqing University, Chongqing 400030, China.
| | - Peng Sun
- China Shipbuilding Industry Corporation 703 Research Institute, Harbin 150000, China.
| | - Qikang Li
- State Key Laboratory of Mechanical Transmission for Advanced Equipment, Chongqing University, Chongqing 400030, China.
| | - Tielin Shi
- School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China.
| |
Collapse
|
65
|
Liu J, Yuan Y, Jiang X, Guo Y, Jia F, Dai C. A Robust and Real-Time Framework of Cross-Subject Myoelectric Control Model Calibration via Multi-Source Domain Adaptation. IEEE J Biomed Health Inform 2024; 28:1363-1373. [PMID: 38306264 DOI: 10.1109/jbhi.2024.3354909] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2024]
Abstract
Surface electromyogram (sEMG) has been widely used in hand gesture recognition. However, most previous studies focused on user-personalized models, which require a great amount of data from each new target user to learn the user-specific EMG patterns. In this work, we present a novel real-time gesture recognition framework based on multi-source domain adaptation, which learns extra knowledge from the data of other users, thereby reducing the data collection burdens on the target user. Additionally, compared with conventional domain adaptation methods which treat data from all users in the source domain as a whole, the proposed multi-source method treat data from different users as multiple separate source domains. Therefore, more detailed statistical information on the data distribution from each user can be learned effectively. High-density sEMG (256 channels) from 20 subjects was used to validate the proposed method. Importantly, we evaluated our method with a simulated real-time processing pipeline on continuous sEMG data stream, rather than well-segmented data. The false alarm rate during rest periods in an EMG data stream, which is typically neglected by previous studies performing offline analyses, was also considered. Our results showed that, with only 1 s sEMG data per gesture from the new user, the 10-gesture classification accuracy reached 87.66 % but the false alarm rate was reduced to 1.95 %. Our method can reduce the frustratingly heavy data collection burdens on each new user.
Collapse
|
66
|
Su J, Shen H, Peng L, Hu D. Few-Shot Domain-Adaptive Anomaly Detection for Cross-Site Brain Images. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2024; 46:1819-1835. [PMID: 34748478 DOI: 10.1109/tpami.2021.3125686] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Early screening is essential for effective intervention and treatment of individuals with mental disorders. Functional magnetic resonance imaging (fMRI) is a noninvasive tool for depicting neural activity and has demonstrated strong potential as a technique for identifying mental disorders. Due to the difficulty in data collection and diagnosis, imaging data from patients are rare at a single site, whereas abundant healthy control data are available from public datasets. However, joint use of these data from multiple sites for classification model training is hindered by cross-domain distribution discrepancy and diverse label spaces. Herein, we propose few-shot domain-adaptive anomaly detection (FAAD) to achieve cross-site anomaly detection of brain images based on only a few labeled samples. We introduce domain adaptation to mitigate cross-domain distribution discrepancy and jointly align the general and conditional feature distributions of imaging data across multiple sites. We utilize fMRI data of healthy subjects in the Human Connectome Project (HCP) as the source domain and fMRI images from six independent sites, including patients with mental disorders and demographically matched healthy controls, as target domains. Experiments showed the superiority of the proposed method compared with binary classification, traditional anomaly detection methods, and several recognized domain adaptation methods.
Collapse
|
67
|
Zhang Y, Shangguan C, Zhang X, Ma J, He J, Jia M, Chen N. Computer-Aided Diagnosis of Complications After Liver Transplantation Based on Transfer Learning. Interdiscip Sci 2024; 16:123-140. [PMID: 37875773 DOI: 10.1007/s12539-023-00588-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Revised: 09/20/2023] [Accepted: 09/22/2023] [Indexed: 10/26/2023]
Abstract
Liver transplantation is one of the most effective treatments for acute liver failure, cirrhosis, and even liver cancer. The prediction of postoperative complications is of great significance for liver transplantation. However, the existing prediction methods based on traditional machine learning are often unavailable or unreliable due to the insufficient amount of real liver transplantation data. Therefore, we propose a new framework to increase the accuracy of computer-aided diagnosis of complications after liver transplantation with transfer learning, which can handle small-scale but high-dimensional data problems. Furthermore, since data samples are often high dimensional in the real world, capturing key features that influence postoperative complications can help make the correct diagnosis for patients. So, we also introduce the SHapley Additive exPlanation (SHAP) method into our framework for exploring the key features of postoperative complications. We used data obtained from 425 patients with 456 features in our experiments. Experimental results show that our approach outperforms all compared baseline methods in predicting postoperative complications. In our work, the average precision, the mean recall, and the mean F1 score reach 91.22%, 91.70%, and 91.18%, respectively.
Collapse
Affiliation(s)
- Ying Zhang
- School of Control and Computer Engineering, North China Electric Power University, Beijing, 102206, China.
| | - Chenyuan Shangguan
- School of Control and Computer Engineering, North China Electric Power University, Beijing, 102206, China
| | - Xuena Zhang
- Department of Anesthesiology Medicine, Beijing Chaoyang Hospital, Capital Medical University, Beijing, 100069, China
| | - Jialin Ma
- Tianjin Zhuoman Technology Co., Ltd., Tianjin, 300000, China
| | - Jiyuan He
- School of Control and Computer Engineering, North China Electric Power University, Beijing, 102206, China
| | - Meng Jia
- School of Control and Computer Engineering, North China Electric Power University, Beijing, 102206, China
| | - Na Chen
- Hebei Vocational College of Rail Transportation, Shijiazhuang, 050051, China
| |
Collapse
|
68
|
Zhang R, Guo H, Xu Z, Hu Y, Chen M, Zhang L. MGFKD: A semi-supervised multi-source domain adaptation algorithm for cross-subject EEG emotion recognition. Brain Res Bull 2024; 208:110901. [PMID: 38355058 DOI: 10.1016/j.brainresbull.2024.110901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 12/31/2023] [Accepted: 02/11/2024] [Indexed: 02/16/2024]
Abstract
Currently, most models rarely consider the negative transfer problem in the research field of cross-subject EEG emotion recognition. To solve this problem, this paper proposes a semi-supervised domain adaptive algorithm based on few labeled samples of target subject, which called multi-domain geodesic flow kernel dynamic distribution alignment (MGFKD). It consists of three modules: 1) GFK common feature extractor: projects the feature distribution of source and target subjects to the Grassmann manifold space, and obtains the latent common features of the two feature distributions through GFK method. 2) Source domain selector: obtains pseudo-labels of the target subject through weak classifier, finds "golden source subjects" by using few known labels of target subjects. 3) Label corrector: uses a dynamic distribution balance strategy to correct the pseudo-labels of the target subject. We conducted comparison experiments on the SEED and SEED-IV datasets, and the results show that MGFKD outperforms unsupervised and semi-supervised domain adaptation algorithms, achieving an average accuracy of 87.51±7.68% and 68.79±8.25% on the SEED and SEED-IV datasets with only one labeled sample per video for target subject. Especially when the number of source domains is set as 6 and the number of known labels is set as 5, the accuracy increase to 90.20±7.57% and 69.99±7.38%, respectively. The above results prove that our proposed algorithm can efficiently and quickly improve the cross-subject EEG emotion classification performance. Since it only need a small number of labeled samples of new subjects, making it has strong application value in future EEG-based emotion recognition applications.
Collapse
Affiliation(s)
- Rui Zhang
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou 450001, PR China
| | - Huifeng Guo
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou 450001, PR China
| | - Zongxin Xu
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou 450001, PR China
| | - Yuxia Hu
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou 450001, PR China
| | - Mingming Chen
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou 450001, PR China
| | - Lipeng Zhang
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou 450001, PR China.
| |
Collapse
|
69
|
Gu X, Sun J, Xu Z. Unsupervised and Semi-Supervised Robust Spherical Space Domain Adaptation. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2024; 46:1757-1774. [PMID: 35275811 DOI: 10.1109/tpami.2022.3158637] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Adversarial domain adaptation has been an effective approach for learning domain-invariant features by adversarial training. In this paper, we propose a novel adversarial domain adaptation approach defined in the spherical feature space, in which we define spherical classifier for label prediction and spherical domain discriminator for discriminating domain labels. In the spherical feature space, we develop a spherical robust pseudo-label loss to utilize pseudo-labels robustly, which weights the importance of the estimated labels of target domain data by the posterior probability of correct labeling, modeled by the Gaussian-uniform mixture model in the spherical space. Our proposed approach can be generally applied to both unsupervised and semi-supervised domain adaptation settings. In particular, to tackle the semi-supervised domain adaptation setting where a few labeled target domain data are available for training, we propose a novel reweighted adversarial training strategy for effectively reducing the intra-domain discrepancy within the target domain. We also present theoretical analysis for the proposed method based on the domain adaptation theory. Extensive experiments are conducted on multiple benchmarks for object recognition, digit recognition, and face recognition. The results show that our method either surpasses or is competitive compared with the recent methods for both unsupervised and semi-supervised domain adaptation. Ablation studies also confirm the effectiveness of the spherical classifier, spherical discriminator, spherical robust pseudo-label loss, and reweighted adversarial training strategy.
Collapse
|
70
|
Yu W, Xu N, Huang N, Chen H. Bridging the gap: Geometry-centric discriminative manifold distribution alignment for enhanced classification in colorectal cancer imaging. Comput Biol Med 2024; 170:107998. [PMID: 38266468 DOI: 10.1016/j.compbiomed.2024.107998] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 12/19/2023] [Accepted: 01/13/2024] [Indexed: 01/26/2024]
Abstract
The early detection of colorectal cancer (CRC) through medical image analysis is a pivotal concern in healthcare, with the potential to significantly reduce mortality rates. Current Domain Adaptation (DA) methods strive to mitigate the discrepancies between different imaging modalities that are critical in identifying CRC, yet they often fall short in addressing the complexity of cancer's presentation within these images. These conventional techniques typically overlook the intricate geometrical structures and the local variations within the data, leading to suboptimal diagnostic performance. This study introduces an innovative application of the Discriminative Manifold Distribution Alignment (DMDA) method, which is specifically engineered to enhance the medical image diagnosis of colorectal cancer. DMDA transcends traditional DA approaches by focusing on both local and global distribution alignments and by intricately learning the intrinsic geometrical characteristics present in manifold space. This is achieved without depending on the potentially misleading pseudo-labels, a common pitfall in existing methodologies. Our implementation of DMDA on three distinct datasets, involving several unique DA tasks, has consistently demonstrated superior classification accuracy and computational efficiency. The method adeptly captures the complex morphological and textural nuances of CRC lesions, leading to a significant leap in domain adaptation technology. DMDA's ability to reconcile global and local distributional disparities, coupled with its manifold-based geometrical structure learning, signals a paradigm shift in medical imaging analysis. The results obtained are not only promising in terms of advancing domain adaptation theory but also in their practical implications, offering the prospect of substantially improved diagnostic accuracy and faster clinical workflows. This heralds a transformative approach in personalized oncology care, aligning with the pressing need for early and accurate CRC detection.
Collapse
Affiliation(s)
- Weiwei Yu
- Department of Gastroenterology, Wenzhou TCM Hospital of Zhejiang Chinese Medical University, Wenzhou, Zhejiang, 325000, China.
| | - Nuo Xu
- Department of Medical Oncology, Wenzhou TCM Hospital of Zhejiang Chinese Medical University, Wenzhou, Zhejiang, 325000, China.
| | - Nuanhui Huang
- Department of Medical Oncology, Wenzhou TCM Hospital of Zhejiang Chinese Medical University, Wenzhou, Zhejiang, 325000, China.
| | - Houliang Chen
- Department of Medical Oncology, Wenzhou TCM Hospital of Zhejiang Chinese Medical University, Wenzhou, Zhejiang, 325000, China.
| |
Collapse
|
71
|
Zhu H, Wu Y, Yang G, Song R, Yu J, Zhang J. Electronic Nose Drift Suppression Based on Smooth Conditional Domain Adversarial Networks. SENSORS (BASEL, SWITZERLAND) 2024; 24:1319. [PMID: 38400477 PMCID: PMC10892276 DOI: 10.3390/s24041319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 01/31/2024] [Accepted: 02/12/2024] [Indexed: 02/25/2024]
Abstract
Anti-drift is a new and serious challenge in the field related to gas sensors. Gas sensor drift causes the probability distribution of the measured data to be inconsistent with the probability distribution of the calibrated data, which leads to the failure of the original classification algorithm. In order to make the probability distributions of the drifted data and the regular data consistent, we introduce the Conditional Adversarial Domain Adaptation Network (CDAN)+ Sharpness Aware Minimization (SAM) optimizer-a state-of-the-art deep transfer learning method.The core approach involves the construction of feature extractors and domain discriminators designed to extract shared features from both drift and clean data. These extracted features are subsequently input into a classifier, thereby amplifying the overall model's generalization capabilities. The method boasts three key advantages: (1) Implementation of semi-supervised learning, thereby negating the necessity for labels on drift data. (2) Unlike conventional deep transfer learning methods such as the Domain-adversarial Neural Network (DANN) and Wasserstein Domain-adversarial Neural Network (WDANN), it accommodates inter-class correlations. (3) It exhibits enhanced ease of training and convergence compared to traditional deep transfer learning networks. Through rigorous experimentation on two publicly available datasets, we substantiate the efficiency and effectiveness of our proposed anti-drift methodology when juxtaposed with state-of-the-art techniques.
Collapse
Affiliation(s)
| | | | | | | | | | - Jianwei Zhang
- School of Control Science and Engineering, Dalian University of Technology, Dalian 116000, China; (H.Z.); (Y.W.); (G.Y.); (R.S.); (J.Y.)
| |
Collapse
|
72
|
Guo Z, Wu T, Lockhart TE, Soangra R, Yoon H. Correlation enhanced distribution adaptation for prediction of fall risk. Sci Rep 2024; 14:3477. [PMID: 38347050 PMCID: PMC10861595 DOI: 10.1038/s41598-024-54053-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Accepted: 02/08/2024] [Indexed: 02/15/2024] Open
Abstract
With technological advancements in diagnostic imaging, smart sensing, and wearables, a multitude of heterogeneous sources or modalities are available to proactively monitor the health of the elderly. Due to the increasing risks of falls among older adults, an early diagnosis tool is crucial to prevent future falls. However, during the early stage of diagnosis, there is often limited or no labeled data (expert-confirmed diagnostic information) available in the target domain (new cohort) to determine the proper treatment for older adults. Instead, there are multiple related but non-identical domain data with labels from the existing cohort or different institutions. Integrating different data sources with labeled and unlabeled samples to predict a patient's condition poses a significant challenge. Traditional machine learning models assume that data for new patients follow a similar distribution. If the data does not satisfy this assumption, the trained models do not achieve the expected accuracy, leading to potential misdiagnosing risks. To address this issue, we utilize domain adaptation (DA) techniques, which employ labeled data from one or more related source domains. These DA techniques promise to tackle discrepancies in multiple data sources and achieve a robust diagnosis for new patients. In our research, we have developed an unsupervised DA model to align two domains by creating a domain-invariant feature representation. Subsequently, we have built a robust fall-risk prediction model based on these new feature representations. The results from simulation studies and real-world applications demonstrate that our proposed approach outperforms existing models.
Collapse
Affiliation(s)
- Ziqi Guo
- Department of Systems Science and Industrial Engineering, The State University of New York at Binghamton, Binghamton, USA
| | - Teresa Wu
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, USA
| | - Thurmon E Lockhart
- School of Biological and Health Systems Engineering, Arizona State University, Tempe, USA
| | - Rahul Soangra
- Department of Physical Therapy, Chapman University, Orange, USA
| | - Hyunsoo Yoon
- Department of Industrial Engineering, Yonsei University, Seoul, Korea.
| |
Collapse
|
73
|
Fu W, Xue B, Gao X, Zhang M. Genetic Programming for Document Classification: A Transductive Transfer Learning System. IEEE TRANSACTIONS ON CYBERNETICS 2024; 54:1119-1132. [PMID: 38127617 DOI: 10.1109/tcyb.2023.3338266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
Document classification is a challenging task to the data being high-dimensional and sparse. Many transfer learning methods have been investigated for improving the classification performance by effectively transferring knowledge from a source domain to a target domain, which is similar to but different from the source domain. However, most of the existing methods cannot handle the case that the training data of the target domain does not have labels. In this study, we propose a transductive transfer learning system, utilizing solutions evolved by genetic programming (GP) on a source domain to automatically pseudolabel the training data in the target domain in order to train classifiers. Different from many other transfer learning techniques, the proposed system pseudolabels target-domain training data to retrains classifiers using all target-domain features. The proposed method is examined on nine transfer learning tasks, and the results show that the proposed transductive GP system has better prediction accuracy on the test data in the target domain than existing transfer learning approaches including subspace alignment-domain adaptation methods, feature-level-domain adaptation methods, and one latest pseudolabeling strategy-based method.
Collapse
|
74
|
Zhang M, Wu Q, Guo J, Li Y, Gao X. Heat Transfer-Inspired Network for Image Super-Resolution Reconstruction. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:1810-1820. [PMID: 35776820 DOI: 10.1109/tnnls.2022.3185529] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Image super-resolution (SR) is a critical image preprocessing task for many applications. How to recover features as accurately as possible is the focus of SR algorithms. Most existing SR methods tend to guide the image reconstruction process with gradient maps, frequency perception modules, etc. and improve the quality of recovered images from the perspective of enhancing edges, but rarely optimize the neural network structure from the system level. In this article, we conduct an in- depth exploration for the inner nature of the SR network structure. In light of the consistency between thermal particles in the thermal field and pixels in the image domain, we propose a novel heat-transfer-inspired network (HTI-Net) for image SR reconstruction based on the theoretical basis of heat transfer. With the finite difference theory, we use a second-order mixed-difference equation to redesign the residual network (ResNet), which can fully integrate multiple information to achieve better feature reuse. In addition, according to the thermal conduction differential equation (TCDE) in the thermal field, the pixel value flow equation (PVFE) in the image domain is derived to mine deep potential feature information. The experimental results on multiple standard databases demonstrate that the proposed HTI-Net has superior edge detail reconstruction effect and parameter performance compared with the existing SR methods. The experimental results on the microscope chip image (MCI) database consisting of realistic low-resolution (LR) and high-resolution (HR) images show that the proposed HTI-Net for image SR reconstruction can improve the effectiveness of the hardware Trojan detection system.
Collapse
|
75
|
Zhu L, Yu F, Huang A, Ying N, Zhang J. Instance-representation transfer method based on joint distribution and deep adaptation for EEG emotion recognition. Med Biol Eng Comput 2024; 62:479-493. [PMID: 37914959 DOI: 10.1007/s11517-023-02956-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Accepted: 10/20/2023] [Indexed: 11/03/2023]
Abstract
Electroencephalogram (EEG) emotion recognition technology is essential for improving human-computer interaction. However, the practical application of emotion recognition technology is limited due to the variety of subjects and sessions. Transfer learning has been applied to address this issue and has received extensive research and application. Studies mainly concentrate on either instance transfer or representation transfer methods. This paper proposes an emotion recognition method called Joint Distributed Instances Represent Transfer (JD-IRT), which includes two core components: Joint Distribution Deep Adaptation (JDDA) and Instance-Representation Transfer (I-RT). Specifically, JDDA is different from common representation transfer methods in transfer learning. It bridges the discrepancies of marginal and conditional distributions simultaneously and combines multiple adaptive layers and kernels for deep domain adaptation. On the other hand, I-RT utilizes instance transfer to select source domain data for better representation transfer. We performed experiments and compared them with other representative methods in the SEED, SEED-IV, and SEED-V datasets. In cross-subject experiments, our approach achieved an average accuracy of 83.21% in SEED, 52.12% in SEED-IV, and 60.17% in SEED-V. Similarly, in cross-session experiments, the accuracy was 91.29% in SEED, 59.02% in SEED-IV, and 65.91% in SEED-V. These results demonstrate the improvement in the accuracy of EEG emotion recognition using the proposed approach.
Collapse
Affiliation(s)
- Lei Zhu
- School of Automation, Hangzhou Dianzi University, Hangzhou, 310000, China.
| | - Fei Yu
- School of Automation, Hangzhou Dianzi University, Hangzhou, 310000, China
| | - Aiai Huang
- School of Automation, Hangzhou Dianzi University, Hangzhou, 310000, China
| | - Nanjiao Ying
- School of Automation, Hangzhou Dianzi University, Hangzhou, 310000, China
- Center for Drug Inspection of Zhejiang Province, Hangzhou, 310000, China
| | - Jianhai Zhang
- School of Computer Science, Hangzhou Dianzi University, Hangzhou, 310000, China
- Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province, Hangzhou, 310000, China
| |
Collapse
|
76
|
Zhang L, Xue J, Xie Y, Huang D, Xie Z, Zhu L, Chen X, Cui G, Ali S, Huang G, Chen X. Automatic detection of ischemic necrotic sites in small intestinal tissue using hyperspectral imaging and transfer learning. JOURNAL OF BIOPHOTONICS 2024; 17:e202300315. [PMID: 38018735 DOI: 10.1002/jbio.202300315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 11/25/2023] [Accepted: 11/27/2023] [Indexed: 11/30/2023]
Abstract
Acquiring large amounts of hyperspectral data of small intestinal tissue with real labels in the clinic is difficult, and the data shows inter-patient variability. Building an automatic identification model using a small dataset presents a crucial challenge in obtaining a strong generalization of the model. This study aimed to explore the performance of hyperspectral imaging and transfer learning techniques in the automatic identification of normal and ischemic necrotic sites in small intestinal tissue. Hyperspectral data of small intestinal tissues were collected from eight white rabbit samples. The transfer component analysis (TCA) method was performed to transfer learning on hyperspectral data between different samples and the variability of data distribution between samples was reduced. The results showed that the TCA transfer learning method improved the accuracy of the classification model with less training data. This study provided a reliable method for single-sample modelling to detect necrotic sites in small intestinal tissue .
Collapse
Affiliation(s)
- Lechao Zhang
- College of Optoelectronic Engineering, Changchun University of Science and Technology, Changchun, China
- Zhongshan Research Institute, Changchun University of Science and Technology, Zhongshan, China
| | - Jianxia Xue
- College of Electrical and Electronic Engineering, Wenzhou University, Wenzhou, China
| | - Yi Xie
- College of Optoelectronic Engineering, Changchun University of Science and Technology, Changchun, China
- Zhongshan Research Institute, Changchun University of Science and Technology, Zhongshan, China
| | - Danfei Huang
- College of Optoelectronic Engineering, Changchun University of Science and Technology, Changchun, China
- Zhongshan Research Institute, Changchun University of Science and Technology, Zhongshan, China
| | - Zhonghao Xie
- College of Electrical and Electronic Engineering, Wenzhou University, Wenzhou, China
| | - Libin Zhu
- Pediatric General Surgery, The Second Hospital of Wenzhou Medical University, Wenzhou, China
| | - Xiaoqing Chen
- Pediatric General Surgery, The Second Hospital of Wenzhou Medical University, Wenzhou, China
| | - Guihua Cui
- College of Electrical and Electronic Engineering, Wenzhou University, Wenzhou, China
| | - Shujat Ali
- College of Electrical and Electronic Engineering, Wenzhou University, Wenzhou, China
| | - Guangzao Huang
- College of Electrical and Electronic Engineering, Wenzhou University, Wenzhou, China
| | - Xiaojing Chen
- College of Electrical and Electronic Engineering, Wenzhou University, Wenzhou, China
| |
Collapse
|
77
|
Chu C, Zhu L, Huang A, Xu P, Ying N, Zhang J. Transfer learning with data alignment and optimal transport for EEG based motor imagery classification. J Neural Eng 2024; 21:016015. [PMID: 38232381 DOI: 10.1088/1741-2552/ad1f7a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 01/17/2024] [Indexed: 01/19/2024]
Abstract
Objective. The non-stationarity of electroencephalogram (EEG) signals and the variability among different subjects present significant challenges in current Brain-Computer Interfaces (BCI) research, which requires a time-consuming specific calibration procedure to address. Transfer Learning (TL) offers a potential solution by leveraging data or models from one or more source domains to facilitate learning in the target domain, so as to address these challenges.Approach. In this paper, a novel Multi-source domain Transfer Learning Fusion (MTLF) framework is proposed to address the calibration problem. Firstly, the method transforms the source domain data with the resting state segment data, in order to decrease the differences between the source domain and the target domain. Subsequently, feature extraction is performed using common spatial pattern. Finally, an improved TL classifier is employed to classify the target samples. Notably, this method does not require the label information of target domain samples, while concurrently reducing the calibration workload.Main results. The proposed MTLF is assessed on Datasets 2a and 2b from the BCI Competition IV. Compared with other algorithms, our method performed relatively the best and achieved mean classification accuracy of 73.69% and 70.83% on Datasets 2a and 2b respectively.Significance.Experimental results demonstrate that the MTLF framework effectively reduces the discrepancy between the source and target domains and acquires better classification performance on two motor imagery datasets.
Collapse
Affiliation(s)
- Chao Chu
- School of Automation, Hangzhou Dianzi University, Hangzhou 310018, People's Republic of China
| | - Lei Zhu
- School of Automation, Hangzhou Dianzi University, Hangzhou 310018, People's Republic of China
| | - Aiai Huang
- School of Automation, Hangzhou Dianzi University, Hangzhou 310018, People's Republic of China
| | - Ping Xu
- School of Automation, Hangzhou Dianzi University, Hangzhou 310018, People's Republic of China
| | - Nanjiao Ying
- School of Automation, Hangzhou Dianzi University, Hangzhou 310018, People's Republic of China
- Center for Drug Inspection of Zhejiang Province, Hangzhou 310018, People's Republic of China
| | - Jianhai Zhang
- School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, People's Republic of China
- Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province, Hangzhou 310018, People's Republic of China
| |
Collapse
|
78
|
Huang W, Tian Y. Reservoir parameters prediction based on spatially transferred long short-term memory network. PLoS One 2024; 19:e0296506. [PMID: 38289937 PMCID: PMC10826947 DOI: 10.1371/journal.pone.0296506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Accepted: 12/14/2023] [Indexed: 02/01/2024] Open
Abstract
Reservoir reconstruction, where parameter prediction plays a key role, constitutes an extremely important part in oil and gas reservoir exploration. With the mature development of artificial intelligence, parameter prediction methods are gradually shifting from previous petrophysical models to deep learning models, which bring about obvious improvements in terms of accuracy and efficiency. However, it is difficult to achieve large amount of data acquisition required for deep learning due to the cost of detection, technical difficulties, and the limitations of complex geological parameters. To address the data shortage problem, a transfer learning prediction model based on long short-term memory neural networks has been proposed, and the model structure has been determined by parameter search and optimization methods in this paper. The proposed approach transfers knowledge from historical data to enhance new well prediction by sharing some parameters in the neural network structure. Moreover, the practicality and effectiveness of this method was tested by comparison based on two block datasets. The results showed that this method could significantly improve the prediction accuracy of the reservoir parameters in the event of data shortage.
Collapse
Affiliation(s)
- Wancheng Huang
- Business School, Sichuan University, Chengdu, Sichuan, China
| | - Yuan Tian
- Business School, Sichuan University, Chengdu, Sichuan, China
| |
Collapse
|
79
|
Liu Y, Peng X, Tan Y, Oyemakinde TT, Wang M, Li G, Li X. A novel unsupervised dynamic feature domain adaptation strategy for cross-individual myoelectric gesture recognition. J Neural Eng 2024; 20:066044. [PMID: 38134446 DOI: 10.1088/1741-2552/ad184f] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 12/22/2023] [Indexed: 12/24/2023]
Abstract
Objective.Surface electromyography pattern recognition (sEMG-PR) is considered as a promising control method for human-machine interaction systems. However, the performance of a trained classifier would greatly degrade for novel users since sEMG signals are user-dependent and largely affected by a number of individual factors such as the quantity of subcutaneous fat and the skin impedance.Approach.To solve this issue, we proposed a novel unsupervised cross-individual motion recognition method that aligned sEMG features from different individuals by self-adaptive dimensional dynamic distribution adaptation (SD-DDA) in this study. In the method, both the distances of marginal and conditional distributions between source and target features were minimized through automatically selecting the optimal feature domain dimension by using a small amount of unlabeled target data.Main results.The effectiveness of the proposed method was tested on four different feature sets, and results showed that the average classification accuracy was improved by above 10% on our collected dataset with the best accuracy reached 90.4%. Compared to six kinds of classic transfer learning methods, the proposed method showed an outstanding performance with improvements of 3.2%-13.8%. Additionally, the proposed method achieved an approximate 9% improvement on a publicly available dataset.Significance.These results suggested that the proposed SD-DDA method is feasible for cross-individual motion intention recognition, which would provide help for the application of sEMG-PR based system.
Collapse
Affiliation(s)
- Yan Liu
- CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems, Research Center for Neural Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
- Shenzhen Institute of Artificial Intelligence and Robotics for Society, Research Center for Neural Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
| | - Xinhao Peng
- CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems, Research Center for Neural Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
- Shenzhen Institute of Artificial Intelligence and Robotics for Society, Research Center for Neural Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
- Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
| | - Yingxiao Tan
- CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems, Research Center for Neural Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
- Shenzhen Institute of Artificial Intelligence and Robotics for Society, Research Center for Neural Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
- Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
| | - Tolulope Tofunmi Oyemakinde
- CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems, Research Center for Neural Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
- Shenzhen Institute of Artificial Intelligence and Robotics for Society, Research Center for Neural Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
- Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
| | - Mengtao Wang
- CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems, Research Center for Neural Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
- Shenzhen Institute of Artificial Intelligence and Robotics for Society, Research Center for Neural Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
- Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
| | - Guanglin Li
- CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems, Research Center for Neural Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
- Shenzhen Institute of Artificial Intelligence and Robotics for Society, Research Center for Neural Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
| | - Xiangxin Li
- CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems, Research Center for Neural Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
- Shenzhen Institute of Artificial Intelligence and Robotics for Society, Research Center for Neural Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, People's Republic of China
| |
Collapse
|
80
|
Yang K, Liu F, Liang S, Xiang M, Han P, Liu J, Dong X, Wei Y, Wang B, Shimizu K, Shao X. Data-driven polarimetric imaging: a review. OPTO-ELECTRONIC SCIENCE 2024; 3:230042-230042. [DOI: 10.29026/oes.2024.230042] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/11/2025]
|
81
|
Zhao B, Wang Y, Zeng X, Qing X. Impact monitoring on complex structure using VMD-MPE feature extraction and transfer learning. ULTRASONICS 2024; 136:107141. [PMID: 37659253 DOI: 10.1016/j.ultras.2023.107141] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/22/2023] [Revised: 07/17/2023] [Accepted: 08/18/2023] [Indexed: 09/04/2023]
Abstract
Impacts are common damage events in aviation scenarios that can cause damage to the structural integrity ofan aircraft and pose a threat to its safe operation. Therefore, it is crucial to monitor impact events. A region-to-point monitoring method is proposed to address the challenges posed by the large area of monitored aircraft structures and the long distance between sensors. Firstly, to fully use the information in the original impact signal and reduce the aliasing effect caused by the reinforced structure, the original signal is decomposed into several modes with different frequency bands by Variational Mode Decomposition (VMD). The Multi-scale Permutation Entropy (MPE) value is then calculated to reflect the various characteristics of each mode, which is used as a basis for classification. Secondly, Transfer Component Analysis (TCA) is selected as a transfer learning method to reduce the difference between the features of the source domain and the target domains' features. Thirdly, the TCA-transformed source domain data are used to train the Probabilistic Neural Network model (PNN), and the unfamiliar target domain data are used to verify the impact area identification. Finally, based on regional location, the system identification technology and weighted centroid algorithm can be used to obtain the history of impact force and the precise coordinates of the impact location.
Collapse
Affiliation(s)
- Bowen Zhao
- School of Aerospace Engineering, Xiamen University, Xiamen 361005, China
| | - Yihan Wang
- School of Aerospace Engineering, Xiamen University, Xiamen 361005, China
| | - Xianping Zeng
- School of Aerospace Engineering, Xiamen University, Xiamen 361005, China
| | - Xinlin Qing
- School of Aerospace Engineering, Xiamen University, Xiamen 361005, China.
| |
Collapse
|
82
|
Zhao J, Nie Z, Shen J, He J, Yang X. Rib segmentation in chest x-ray images based on unsupervised domain adaptation. Biomed Phys Eng Express 2023; 10:015021. [PMID: 38104347 DOI: 10.1088/2057-1976/ad1663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 12/17/2023] [Indexed: 12/19/2023]
Abstract
Rib segmentation in 2D chest x-ray images is a crucial and challenging task. On one hand, chest x-ray images serve as the most prevalent form of medical imaging due to their convenience, affordability, and minimal radiation exposure. However, on the other hand, these images present intricate challenges including overlapping anatomical structures, substantial noise and artifacts, inherent anatomical complexity. Currently, most methods employ deep convolutional networks for rib segmentation, necessitating an extensive quantity of accurately labeled data for effective training. Nonetheless, achieving precise pixel-level labeling in chest x-ray images presents a notable difficulty. Additionally, many methods neglect the challenge of predicting fractured results and subsequent post-processing difficulties. In contrast, CT images benefit from being able to directly label as the 3D structure and patterns of organs or tissues. In this paper, we redesign rib segmentation task for chest x-ray images and propose a concise and efficient cross-modal method based on unsupervised domain adaptation with centerline loss function to prevent result discontinuity and address rigorous post-processing. We utilize digital reconstruction radiography images and the labels generated from 3D CT images to guide rib segmentation on unlabeled 2D chest x-ray images. Remarkably, our model achieved a higher dice score on the test samples and the results are highly interpretable, without requiring any annotated rib markings on chest x-ray images. Our code and demo will be released in 'https://github.com/jialin-zhao/RibsegBasedonUDA'.
Collapse
Affiliation(s)
- Jialin Zhao
- Department of Mathematics, Nanjing University, Nanjing 210093, People's Republic of China
| | - Ziwei Nie
- Department of Mathematics, Nanjing University, Nanjing 210093, People's Republic of China
| | - Jie Shen
- Department of radiology, Nanjing Chest Hospital, Nanjing 210093, People's Republic of China
| | - Jian He
- Department of Nuclear Medicine, Nanjing Drum Tower Hospital, Affiliated Hospital of Medical School, Nanjing University, Nanjing 210093, People's Republic of China
| | - Xiaoping Yang
- Department of Mathematics, Nanjing University, Nanjing 210093, People's Republic of China
| |
Collapse
|
83
|
Xie J, Zhong W, Yang R, Wang L, Zhen X. Discriminative fusion of moments-aligned latent representation of multimodality medical data. Phys Med Biol 2023; 69:015015. [PMID: 38052076 DOI: 10.1088/1361-6560/ad1271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 12/05/2023] [Indexed: 12/07/2023]
Abstract
Fusion of multimodal medical data provides multifaceted, disease-relevant information for diagnosis or prognosis prediction modeling. Traditional fusion strategies such as feature concatenation often fail to learn hidden complementary and discriminative manifestations from high-dimensional multimodal data. To this end, we proposed a methodology for the integration of multimodality medical data by matching their moments in a latent space, where the hidden, shared information of multimodal data is gradually learned by optimization with multiple feature collinearity and correlation constrains. We first obtained the multimodal hidden representations by learning mappings between the original domain and shared latent space. Within this shared space, we utilized several relational regularizations, including data attribute preservation, feature collinearity and feature-task correlation, to encourage learning of the underlying associations inherent in multimodal data. The fused multimodal latent features were finally fed to a logistic regression classifier for diagnostic prediction. Extensive evaluations on three independent clinical datasets have demonstrated the effectiveness of the proposed method in fusing multimodal data for medical prediction modeling.
Collapse
Affiliation(s)
- Jincheng Xie
- School of Biomedical Engineering, Southern Medical University, Guangzhou, Guangdong 510515, People's Republic of China
| | - Weixiong Zhong
- School of Biomedical Engineering, Southern Medical University, Guangzhou, Guangdong 510515, People's Republic of China
| | - Ruimeng Yang
- Department of Radiology, the Second Affiliated Hospital, School of Medicine, South China University of Technology, Guangzhou, Guangdong, 510180, People's Republic of China
| | - Linjing Wang
- Radiotherapy Center, Affiliated Cancer Hospital & Institute of Guangzhou Medical University, Guangzhou, Guangdong 510095, People's Republic of China
| | - Xin Zhen
- School of Biomedical Engineering, Southern Medical University, Guangzhou, Guangdong 510515, People's Republic of China
| |
Collapse
|
84
|
Wu S, Shu L, Song Z, Xu X. SFDA: Domain Adaptation With Source Subject Fusion Based on Multi-Source and Single-Target Fall Risk Assessment. IEEE Trans Neural Syst Rehabil Eng 2023; 31:4907-4920. [PMID: 38032785 DOI: 10.1109/tnsre.2023.3337861] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2023]
Abstract
In cross-subject fall risk classification based on plantar pressure, a challenge is that data from different subjects have significant individual information. Thus, the models with insufficient generalization ability can't perform well on new subjects, which limits their application in daily life. To solve this problem, domain adaptation methods are applied to reduce the gap between source and target domain. However, these methods focus on the distribution of the source and the target domain, but ignore the potential correlation among multiple source subjects, which deteriorates domain adaptation performance. In this paper, we proposed a novel method named domain adaptation with subject fusion (SFDA) for fall risk assessment, greatly improving the cross-subject assessment ability. Specifically, SFDA synchronously carries out source target adaptation and multiple source subject fusion by domain adversarial module to reduce source-target gap and distribution distance within source subjects of same class. Consequently, target samples can learn more task-specific features from source subjects to improve the generalization ability. Experiment results show that SFDA achieved mean accuracy of 79.17 % and 73.66 % based on two backbones in a cross-subject classification manner, outperforming the state-of-the-art methods on continuous plantar pressure dataset. This study proves the effectiveness of SFDA and provides a novel tool for implementing cross-subject and few-gait fall risk assessment.
Collapse
|
85
|
Shimizu M, Zhao Y, Avdelidis NP. A Fault Detection Approach Based on One-Sided Domain Adaptation and Generative Adversarial Networks for Railway Door Systems. SENSORS (BASEL, SWITZERLAND) 2023; 23:9688. [PMID: 38139533 PMCID: PMC10747022 DOI: 10.3390/s23249688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 11/30/2023] [Accepted: 12/05/2023] [Indexed: 12/24/2023]
Abstract
Fault detection using the domain adaptation technique is one of the more promising methods of solving the domain shift problem, and has therefore been intensively investigated in recent years. However, the domain adaptation method still has elements of impracticality: firstly, domain-specific decision boundaries are not taken into consideration, which often results in poor performance near the class boundary; and secondly, information on the source domain needs to be exploited with priority over information on the target domain, as the source domain can provide a rich dataset. Thus, the real-world implementations of this approach are still scarce. In order to address these issues, a novel fault detection approach based on one-sided domain adaptation for real-world railway door systems is proposed. An anomaly detector created using label-rich source domain data is used to generate distinctive source latent features, and the target domain features are then aligned toward the source latent features in a one-sided way. The performance and sensitivity analyses show that the proposed method is more accurate than alternative methods, with an F1 score of 97.9%, and is the most robust against variation in the input features. The proposed method also bridges the gap between theoretical domain adaptation research and tangible industrial applications. Furthermore, the proposed approach can be applied to conventional railway components and various electro-mechanical actuators. This is because the motor current signals used in this study are primarily obtained from the controller or motor drive, which eliminates the need for extra sensors.
Collapse
Affiliation(s)
- Minoru Shimizu
- Integrated Vehicle Health Management Centre, Cranfield University, Cranfield MK43 0AL, UK;
| | - Yifan Zhao
- Centre for Life-Cycle Engineering and Management, Cranfield University, Cranfield MK43 0AL, UK;
| | - Nicolas P. Avdelidis
- Integrated Vehicle Health Management Centre, Cranfield University, Cranfield MK43 0AL, UK;
| |
Collapse
|
86
|
Lei B, Zhu Y, Liang E, Yang P, Chen S, Hu H, Xie H, Wei Z, Hao F, Song X, Wang T, Xiao X, Wang S, Han H. Federated Domain Adaptation via Transformer for Multi-Site Alzheimer's Disease Diagnosis. IEEE TRANSACTIONS ON MEDICAL IMAGING 2023; 42:3651-3664. [PMID: 37527297 DOI: 10.1109/tmi.2023.3300725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/03/2023]
Abstract
In multi-site studies of Alzheimer's disease (AD), the difference of data in multi-site datasets leads to the degraded performance of models in the target sites. The traditional domain adaptation method requires sharing data from both source and target domains, which will lead to data privacy issue. To solve it, federated learning is adopted as it can allow models to be trained with multi-site data in a privacy-protected manner. In this paper, we propose a multi-site federated domain adaptation framework via Transformer (FedDAvT), which not only protects data privacy, but also eliminates data heterogeneity. The Transformer network is used as the backbone network to extract the correlation between the multi-template region of interest features, which can capture the brain abundant information. The self-attention maps in the source and target domains are aligned by applying mean squared error for subdomain adaptation. Finally, we evaluate our method on the multi-site databases based on three AD datasets. The experimental results show that the proposed FedDAvT is quite effective, achieving accuracy rates of 88.75%, 69.51%, and 69.88% on the AD vs. NC, MCI vs. NC, and AD vs. MCI two-way classification tasks, respectively.
Collapse
|
87
|
Liu W, Ni Z, Chen Q, Ni L. Attention-Guided Partial Domain Adaptation for Automated Pneumonia Diagnosis From Chest X-Ray Images. IEEE J Biomed Health Inform 2023; 27:5848-5859. [PMID: 37695960 DOI: 10.1109/jbhi.2023.3313886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/13/2023]
Abstract
Deep neural networks (DNN) supported by multicenter large-scale Chest X-Ray (CXR) datasets can efficiently perform tasks such as disease identification, lesion segmentation, and report generation. However, the non-ignorable inter-domain heterogeneity caused by different equipment, ethnic groups, and scanning protocols may lead to dramatic degradation in model performance. Unsupervised domain adaptation (UDA) methods help alleviate the cross-domain discrepancy for subsequent analysis. Nevertheless, they may be prone to: 1) spatial negative transfer: misaligning non-transferable regions which have inadequate knowledge, and 2) semantic negative transfer: failing to extend to scenarios where the label spaces of the source and target domain are partially shared. In this work, we propose a classification-based framework named attention-guided partial domain adaptation (AGPDA) network for overcoming these two negative transfer challenges. AGPDA is composed of two key modules: 1) a region attention discrimination block (RADB) to generate fine-grained attention value via lightweight region-wise multi-adversarial networks. 2) a residual feature recalibration block (RFRB) trained with class-weighted maximum mean discrepancy (MMD) loss for down-weighing the irrelevant source samples. Extensive experiments on two publicly available CXR datasets containing a total of 8598 pneumonia (viral, bacterial, and COVID-19) cases, 7163 non-pneumonia or healthy cases, demonstrate the superior performance of our AGPDA. Especially on three partial transfer tasks, AGPDA significantly increases the accuracy, sensitivity, and F1 score by 4.35%, 4.05%, and 1.78% compared to recently strong baselines.
Collapse
|
88
|
Miao M, Yang Z, Zeng H, Zhang W, Xu B, Hu W. Explainable cross-task adaptive transfer learning for motor imagery EEG classification. J Neural Eng 2023; 20:066021. [PMID: 37963394 DOI: 10.1088/1741-2552/ad0c61] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 11/14/2023] [Indexed: 11/16/2023]
Abstract
Objective. In the field of motor imagery (MI) electroencephalography (EEG)-based brain-computer interfaces, deep transfer learning (TL) has proven to be an effective tool for solving the problem of limited availability in subject-specific data for the training of robust deep learning (DL) models. Although considerable progress has been made in the cross-subject/session and cross-device scenarios, the more challenging problem of cross-task deep TL remains largely unexplored.Approach. We propose a novel explainable cross-task adaptive TL method for MI EEG decoding. Firstly, similarity analysis and data alignment are performed for EEG data of motor execution (ME) and MI tasks. Afterwards, the MI EEG decoding model is obtained via pre-training with extensive ME EEG data and fine-tuning with partial MI EEG data. Finally, expected gradient-based post-hoc explainability analysis is conducted for the visualization of important temporal-spatial features.Main results. Extensive experiments are conducted on one large ME EEG High-Gamma dataset and two large MI EEG datasets (openBMI and GIST). The best average classification accuracy of our method reaches 80.00% and 72.73% for OpenBMI and GIST respectively, which outperforms several state-of-the-art algorithms. In addition, the results of the explainability analysis further validate the correlation between ME and MI EEG data and the effectiveness of ME/MI cross-task adaptation.Significance. This paper confirms that the decoding of MI EEG can be well facilitated by pre-existing ME EEG data, which largely relaxes the constraint of training samples for MI EEG decoding and is important in a practical sense.
Collapse
Affiliation(s)
- Minmin Miao
- School of Information Engineering, Huzhou University, Huzhou, People's Republic of China
- Zhejiang Province Key Laboratory of Smart Management & Application of Modern Agricultural Resources, Huzhou University, Huzhou, People's Republic of China
| | - Zhong Yang
- School of Information Engineering, Huzhou University, Huzhou, People's Republic of China
| | - Hong Zeng
- School of Instrument Science and Engineering, Southeast University, Nanjing, People's Republic of China
| | - Wenbin Zhang
- College of Computer and Information, Hohai University, Nanjing, People's Republic of China
| | - Baoguo Xu
- School of Instrument Science and Engineering, Southeast University, Nanjing, People's Republic of China
| | - Wenjun Hu
- School of Information Engineering, Huzhou University, Huzhou, People's Republic of China
- Zhejiang Province Key Laboratory of Smart Management & Application of Modern Agricultural Resources, Huzhou University, Huzhou, People's Republic of China
| |
Collapse
|
89
|
Tao J, Dan Y, Zhou D. Possibilistic distribution distance metric: a robust domain adaptation learning method. Front Neurosci 2023; 17:1247082. [PMID: 38027506 PMCID: PMC10665527 DOI: 10.3389/fnins.2023.1247082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2023] [Accepted: 10/20/2023] [Indexed: 12/01/2023] Open
Abstract
The affective Brain-Computer Interface (aBCI) systems, which achieve predictions for individual subjects through training on multiple subjects, often cannot achieve satisfactory results due to the differences in Electroencephalogram (EEG) patterns between subjects. One tried to use Subject-specific classifiers, but there was a lack of sufficient labeled data. To solve this problem, Domain Adaptation (DA) has recently received widespread attention in the field of EEG-based emotion recognition. Domain adaptation (DA) learning aims to solve the problem of inconsistent distributions between training and test datasets and has received extensive attention. Most existing methods use Maximum Mean Discrepancy (MMD) or its variants to minimize the problem of domain distribution inconsistency. However, noisy data in the domain can lead to significant drift in domain means, which can affect the adaptability performance of learning methods based on MMD and its variants to some extent. Therefore, we propose a robust domain adaptation learning method with possibilistic distribution distance measure. Firstly, the traditional MMD criterion is transformed into a novel possibilistic clustering model to weaken the influence of noisy data, thereby constructing a robust possibilistic distribution distance metric (P-DDM) criterion. Then the robust effectiveness of domain distribution alignment is further improved by a fuzzy entropy regularization term. The proposed P-DDM is in theory proved which be an upper bound of the traditional distribution distance measure method MMD criterion under certain conditions. Therefore, minimizing P-DDM can effectively optimize the MMD objective. Secondly, based on the P-DDM criterion, a robust domain adaptation classifier based on P-DDM (C-PDDM) is proposed, which adopts the Laplacian matrix to preserve the geometric consistency of instances in the source domain and target domain for improving the label propagation performance. At the same time, by maximizing the use of source domain discriminative information to minimize domain discrimination error, the generalization performance of the learning model is further improved. Finally, a large number of experiments and analyses on multiple EEG datasets (i.e., SEED and SEED-IV) show that the proposed method has superior or comparable robustness performance (i.e., has increased by around 10%) in most cases.
Collapse
Affiliation(s)
- Jianwen Tao
- Institute of Artificial Intelligence Application, Ningbo Polytechnic, Zhejiang, China
| | - Yufang Dan
- Institute of Artificial Intelligence Application, Ningbo Polytechnic, Zhejiang, China
| | - Di Zhou
- Industrial Technological Institute of Intelligent Manufacturing, Sichuan University of Arts and Science, Dazhou, China
| |
Collapse
|
90
|
Wang Z, Zhang W, Li S, Chen X, Wu D. Unsupervised domain adaptation for cross-patient seizure classification. J Neural Eng 2023; 20:066002. [PMID: 37906968 DOI: 10.1088/1741-2552/ad0859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 10/31/2023] [Indexed: 11/02/2023]
Abstract
Objective. Epileptic seizure is a chronic neurological disease affecting millions of patients. Electroencephalogram (EEG) is the gold standard in epileptic seizure classification. However, its low signal-to-noise ratio, strong non-stationarity, and large individual difference nature make it difficult to directly extend the seizure classification model from one patient to another. This paper considers multi-source unsupervised domain adaptation for cross-patient EEG-based seizure classification, i.e. there are multiple source patients with labeled EEG data, which are used to label the EEG trials of a new patient.Approach. We propose an source domain selection (SDS)-global domain adaptation (GDA)-target agent subdomain adaptation (TASA) approach, which includes SDS to filter out dissimilar source domains, GDA to align the overall distributions of the selected source domains and the target domain, and TASA to identify the most similar source domain to the target domain so that its labels can be utilized.Main results. Experiments on two public seizure datasets demonstrated that SDS-GDA-TASA outperformed 13 existing approaches in unsupervised cross-patient seizure classification.Significance. Our approach could save clinicians plenty of time in labeling EEG data for epilepsy patients, greatly increasing the efficiency of seizure diagnostics.
Collapse
Affiliation(s)
- Ziwei Wang
- Key Laboratory of the Ministry of Education for Image Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, People's Republic of China
| | - Wen Zhang
- Key Laboratory of the Ministry of Education for Image Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, People's Republic of China
| | - Siyang Li
- Key Laboratory of the Ministry of Education for Image Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, People's Republic of China
| | - Xinru Chen
- Key Laboratory of the Ministry of Education for Image Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, People's Republic of China
| | - Dongrui Wu
- Key Laboratory of the Ministry of Education for Image Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, People's Republic of China
- Belt and Road Joint Laboratory on Measurement and Control Technology, Huazhong University of Science and Technology, Wuhan 430074, People's Republic of China
| |
Collapse
|
91
|
Tao J, Dan Y, Zhou D. Local domain generalization with low-rank constraint for EEG-based emotion recognition. Front Neurosci 2023; 17:1213099. [PMID: 38027525 PMCID: PMC10662311 DOI: 10.3389/fnins.2023.1213099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Accepted: 10/04/2023] [Indexed: 12/01/2023] Open
Abstract
As an important branch in the field of affective computing, emotion recognition based on electroencephalography (EEG) faces a long-standing challenge due to individual diversities. To conquer this challenge, domain adaptation (DA) or domain generalization (i.e., DA without target domain in the training stage) techniques have been introduced into EEG-based emotion recognition to eliminate the distribution discrepancy between different subjects. The preceding DA or domain generalization (DG) methods mainly focus on aligning the global distribution shift between source and target domains, yet without considering the correlations between the subdomains within the source domain and the target domain of interest. Since the ignorance of the fine-grained distribution information in the source may still bind the DG expectation on EEG datasets with multimodal structures, multiple patches (or subdomains) should be reconstructed from the source domain, on which multi-classifiers could be learned collaboratively. It is expected that accurately aligning relevant subdomains by excavating multiple distribution patterns within the source domain could further boost the learning performance of DG/DA. Therefore, we propose in this work a novel DG method for EEG-based emotion recognition, i.e., Local Domain Generalization with low-rank constraint (LDG). Specifically, the source domain is firstly partitioned into multiple local domains, each of which contains only one positive sample and its positive neighbors and k2 negative neighbors. Multiple subject-invariant classifiers on different subdomains are then co-learned in a unified framework by minimizing local regression loss with low-rank regularization for considering the shared knowledge among local domains. In the inference stage, the learned local classifiers are discriminatively selected according to their importance of adaptation. Extensive experiments are conducted on two benchmark databases (DEAP and SEED) under two cross-validation evaluation protocols, i.e., cross-subject within-dataset and cross-dataset within-session. The experimental results under the 5-fold cross-validation demonstrate the superiority of the proposed method compared with several state-of-the-art methods.
Collapse
Affiliation(s)
- Jianwen Tao
- Institute of Artificial Intelligence Application, Ningbo Polytechnic, Zhejiang, China
| | - Yufang Dan
- Institute of Artificial Intelligence Application, Ningbo Polytechnic, Zhejiang, China
| | - Di Zhou
- Industrial Technological Institute of Intelligent Manufacturing, Sichuan University of Arts and Science, Dazhou, China
| |
Collapse
|
92
|
Ma Z, Wang J, Yue J, Lin Y. A homologous and heterogeneous multi-view inter-patient adaptive network for arrhythmia detection. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023; 241:107740. [PMID: 37567144 DOI: 10.1016/j.cmpb.2023.107740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 07/17/2023] [Accepted: 07/30/2023] [Indexed: 08/13/2023]
Abstract
BACKGROUND AND OBJECTIVE Electrocardiogram (ECG) is a widely used diagnostic tool for arrhythmia assessment in clinical practice. However, current arrhythmia detection algorithms rely heavily on signal-based data, while cardiologists often use image-based data. This discrepancy, combined with individual differences in physiological signals, poses challenges for accurate arrhythmia detection. To address these challenges and improve arrhythmia detection performance, we propose a homologous and heterogeneous multi-view inter-patient adaptive network. METHODS We designed a multi-view representation learning module to capture dynamic and morphological characteristics from ECG signals and electrocardiographic images. Expert knowledge was also elicited to gain internally-invariant characteristics of each category. Finally, we designed a new loss function that aligns the embedding of the source and target domains in the feature space to minimize the negative effects of individual differences. RESULTS Experiments on the MIT-BIH arrhythmia database demonstrate that our proposed method outperforms state-of-the-art baselines in terms of accuracy, positive predictive value, sensitivity and F1-score. These results indicate the effectiveness of our method in accurately detecting arrhythmias. CONCLUSIONS Our homologous and heterogeneous multi-view inter-patient adaptive network successfully addresses the challenges of utilizing both ECG signal and electrocardiographic image data for arrhythmia detection and overcoming individual differences in physiological signals. Our proposed method has the potential to improve early diagnosis and treatment outcomes of arrhythmias in clinical practice.
Collapse
Affiliation(s)
- Zhaoyang Ma
- School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China; Beijing Key Laboratory of Traffic Data Analysis and Mining, Beijing 100044, China.
| | - Jing Wang
- School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China; Beijing Key Laboratory of Traffic Data Analysis and Mining, Beijing 100044, China; CAAC Key Laboratory of Intelligent Passenger Service of Civil Aviation, Beijing 100044, China.
| | - Jinghang Yue
- School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China; Beijing Key Laboratory of Traffic Data Analysis and Mining, Beijing 100044, China.
| | - Youfang Lin
- School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China; Beijing Key Laboratory of Traffic Data Analysis and Mining, Beijing 100044, China; CAAC Key Laboratory of Intelligent Passenger Service of Civil Aviation, Beijing 100044, China.
| |
Collapse
|
93
|
Voina D, Shea-Brown E, Mihalas S. A biologically inspired architecture with switching units can learn to generalize across backgrounds. Neural Netw 2023; 168:615-630. [PMID: 37839332 PMCID: PMC10843013 DOI: 10.1016/j.neunet.2023.09.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Revised: 08/24/2023] [Accepted: 09/07/2023] [Indexed: 10/17/2023]
Abstract
Humans and other animals navigate different environments effortlessly, their brains rapidly and accurately generalizing across contexts. Despite recent progress in deep learning, this flexibility remains a challenge for many artificial systems. Here, we show how a bio-inspired network motif can explicitly address this issue. We do this using a dataset of MNIST digits of varying transparency, set on one of two backgrounds of different statistics that define two contexts: a pixel-wise noise or a more naturalistic background from the CIFAR-10 dataset. After learning digit classification when both contexts are shown sequentially, we find that both shallow and deep networks have sharply decreased performance when returning to the first background - an instance of the catastrophic forgetting phenomenon known from continual learning. To overcome this, we propose the bottleneck-switching network or switching network for short. This is a bio-inspired architecture analogous to a well-studied network motif in the visual cortex, with additional "switching" units that are activated in the presence of a new background, assuming a priori a contextual signal to turn these units on or off. Intriguingly, only a few of these switching units are sufficient to enable the network to learn the new context without catastrophic forgetting through inhibition of redundant background features. Further, the bottleneck-switching network can generalize to novel contexts similar to contexts it has learned. Importantly, we find that - again as in the underlying biological network motif, recurrently connecting the switching units to network layers is advantageous for context generalization.
Collapse
Affiliation(s)
- Doris Voina
- Department of Applied Mathematics, Computational Neuroscience Center, University of Washington, Seattle, WA 98195, USA.
| | - Eric Shea-Brown
- Department of Applied Mathematics, Computational Neuroscience Center, University of Washington, Seattle, WA 98195, USA; Allen Institute for Brain Science, 615 Westlake Ave N, Seattle, WA 98109, USA
| | - Stefan Mihalas
- Department of Applied Mathematics, Computational Neuroscience Center, University of Washington, Seattle, WA 98195, USA; Allen Institute for Brain Science, 615 Westlake Ave N, Seattle, WA 98109, USA
| |
Collapse
|
94
|
He C, Fan X, Zhou K, Ye Z. Unsupervised Domain Adaptation with Asymmetrical Margin Disparity loss and Outlier Sample Extraction. Neural Netw 2023; 168:602-614. [PMID: 37839331 DOI: 10.1016/j.neunet.2023.09.045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2023] [Revised: 09/12/2023] [Accepted: 09/25/2023] [Indexed: 10/17/2023]
Abstract
Unsupervised domain adaptation (UDA) trains models using labeled data from a specific source domain and then transferring the knowledge to certain target domains that have few or no labels. Many prior measurement-based works achieve lots of progress, but their feature distinguishing abilities to classify target samples with similar features are not enough; they do not adequately consider the confusing samples in the target domain that are similar to the source domain; and they don't consider negative transfer of the outlier sample in source domain. We address these issues in our work and propose an UDA method with asymmetrical margin disparity loss and outlier sample extraction, called AMD-Net with OSE. We propose an Asymmetrical Margin Disparity Discrepancy (AMD) method and a training strategy based on sample selection mechanism to make the network have better feature extraction ability and the network gets rid of local optimal. Firstly, in the AMD method, we design a multi-label entropy metric to evaluate the marginal disparity loss of the confusing samples in the target domain. This asymmetric marginal disparity loss designment uses the different entropy measurement algorithms of the two domains to excavate the differences of the two domains as much as possible, so as to find the common features of the two domains. Secondly, A sample selection mechanism is designed to evaluate which part of the sample in target domain is confusable. We define the certainty of the sample in the target domain, adopt a progressive learning scheme, and adopt one-hot marginal disparity loss for most of the samples in the target domain with low uncertainty and easy to distinguish. The multi-label marginal calculation method is used only for the uncertainty samples in the target domain whose certainty is less than the threshold value, so that the network can get rid of the local optimal as much as possible. At last, we further propose an outlier sample extraction algorithm (OSE) based on weighted cosine similarity distance for source domain to reduce the negative migration effect caused by outlier samples in the source domain. Extensive experiments on four datasets Office-31, Office-Home, VisDA-2017 and DomainNet demonstrate that our method works well in various UDA settings and outperforms the state-of-the-art methods.
Collapse
Affiliation(s)
- Chunmei He
- School of Computer Science, School of Cyberspace Science, Xiangtan University, Xiangtan, Hunan 411105, China.
| | - Xianjun Fan
- School of Computer Science, School of Cyberspace Science, Xiangtan University, Xiangtan, Hunan 411105, China.
| | - Kang Zhou
- School of Computer Science, School of Cyberspace Science, Xiangtan University, Xiangtan, Hunan 411105, China.
| | - Zhengchun Ye
- School of Mechanical Engineering, Xiangtan University, Xiangtan, Hunan 411105, China.
| |
Collapse
|
95
|
Mo Z, Siepel A. Domain-adaptive neural networks improve supervised machine learning based on simulated population genetic data. PLoS Genet 2023; 19:e1011032. [PMID: 37934781 PMCID: PMC10655966 DOI: 10.1371/journal.pgen.1011032] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Revised: 11/17/2023] [Accepted: 10/23/2023] [Indexed: 11/09/2023] Open
Abstract
Investigators have recently introduced powerful methods for population genetic inference that rely on supervised machine learning from simulated data. Despite their performance advantages, these methods can fail when the simulated training data does not adequately resemble data from the real world. Here, we show that this "simulation mis-specification" problem can be framed as a "domain adaptation" problem, where a model learned from one data distribution is applied to a dataset drawn from a different distribution. By applying an established domain-adaptation technique based on a gradient reversal layer (GRL), originally introduced for image classification, we show that the effects of simulation mis-specification can be substantially mitigated. We focus our analysis on two state-of-the-art deep-learning population genetic methods-SIA, which infers positive selection from features of the ancestral recombination graph (ARG), and ReLERNN, which infers recombination rates from genotype matrices. In the case of SIA, the domain adaptive framework also compensates for ARG inference error. Using the domain-adaptive SIA (dadaSIA) model, we estimate improved selection coefficients at selected loci in the 1000 Genomes CEU population. We anticipate that domain adaptation will prove to be widely applicable in the growing use of supervised machine learning in population genetics.
Collapse
Affiliation(s)
- Ziyi Mo
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
- School of Biological Sciences, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
| | - Adam Siepel
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
- School of Biological Sciences, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
| |
Collapse
|
96
|
Yuan W, Xiang W, Si K, Yang C, Zhao L, Li J, Liu C. Multi-channel EEG-based sleep staging using brain functional connectivity and domain adaptation. Physiol Meas 2023; 44:105007. [PMID: 37827169 DOI: 10.1088/1361-6579/ad02db] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Accepted: 10/12/2023] [Indexed: 10/14/2023]
Abstract
Objective.Sleep stage recognition has essential clinical value for evaluating human physical/mental condition and diagnosing sleep-related diseases. To conduct a five-class (wake, N1, N2, N3 and rapid eye movement) sleep staging task, twenty subjects with recorded six-channel electroencephalography (EEG) signals from the ISRUC-SLEEP dataset is used.Approach.Unlike the exist methods ignoring the channel coupling relationship and non-stationarity characteristics, we developed a brain functional connectivity method to provide a new insight for multi-channel analysis. Furthermore, we investigated three frequency-domain features: two functional connectivity estimations, i.e. synchronization likelihood (SL) and wavelet-based correlation (WC) among four frequency bands, and energy ratio (ER) related to six frequency bands, respectively. Then, the Gaussian support vector machine (SVM) method was used to predict the five sleep stages. The performance of the applied features is evaluated in both subject dependence experiment by ten-fold cross validation and subject independence experiment by leave-one-subject-out cross-validation, respectively.Main results.In subject dependence experiment, the results showed that the fused feature (fusion of SL, WC and ER features) contributes significant gain the performance of SVM classifier, where the mean of classification accuracy can achieve 83.97% ± 1.04%. However, in subject-independence experiment, the individual differences EEG patterns across subjects leads to inferior accuracy. Five typical domain adaptation (DA) methods were applied to reduce the discrepancy of feature distributions by selecting the optimal subspace dimension. Results showed that four DA methods can significantly improve the mean accuracy by 1.89%-5.22% compared to the baseline accuracy 57.44% in leave-one-subject-out cross-validation.Significance.Compared with traditional time-frequency and nonlinear features, brain functional connectivity features can capture the correlation between different brain regions. For the individual EEG response differences, domain adaptation methods can transform features to improve the performance of sleep staging algorithms.
Collapse
Affiliation(s)
- Wenhao Yuan
- Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, People's Republic of China
| | - Wentao Xiang
- Jiangsu Province Engineering Research Center for Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, 211166, People's Republic of China
| | - Kaiyue Si
- Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, People's Republic of China
| | - Chunfeng Yang
- Key Laboratory of Computer Network and Information Integration of Ministry of Education, Southeast University, Nanjing, 210096, People's Republic of China
| | - Lina Zhao
- Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, People's Republic of China
| | - Jianqing Li
- Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, People's Republic of China
- Jiangsu Province Engineering Research Center for Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, 211166, People's Republic of China
| | - Chengyu Liu
- Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, People's Republic of China
| |
Collapse
|
97
|
Li J, Li G, Yu Y. Adaptive Betweenness Clustering for Semi-Supervised Domain Adaptation. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2023; 32:5580-5594. [PMID: 37782617 DOI: 10.1109/tip.2023.3319274] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/04/2023]
Abstract
Compared to unsupervised domain adaptation, semi-supervised domain adaptation (SSDA) aims to significantly improve the classification performance and generalization capability of the model by leveraging the presence of a small amount of labeled data from the target domain. Several SSDA approaches have been developed to enable semantic-aligned feature confusion between labeled (or pseudo labeled) samples across domains; nevertheless, owing to the scarcity of semantic label information of the target domain, they were arduous to fully realize their potential. In this study, we propose a novel SSDA approach named Graph-based Adaptive Betweenness Clustering (G-ABC) for achieving categorical domain alignment, which enables cross-domain semantic alignment by mandating semantic transfer from labeled data of both the source and target domains to unlabeled target samples. In particular, a heterogeneous graph is initially constructed to reflect the pairwise relationships between labeled samples from both domains and unlabeled ones of the target domain. Then, to degrade the noisy connectivity in the graph, connectivity refinement is conducted by introducing two strategies, namely Confidence Uncertainty based Node Removal and Prediction Dissimilarity based Edge Pruning. Once the graph has been refined, Adaptive Betweenness Clustering is introduced to facilitate semantic transfer by using across-domain betweenness clustering and within-domain betweenness clustering, thereby propagating semantic label information from labeled samples across domains to unlabeled target data. Extensive experiments on three standard benchmark datasets, namely DomainNet, Office-Home, and Office-31, indicated that our method outperforms previous state-of-the-art SSDA approaches, demonstrating the superiority of the proposed G-ABC algorithm.
Collapse
|
98
|
Liu Q, Li G, Baladandayuthapani V. Pan-Cancer Drug Response Prediction Using Integrative Principal Component Regression. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.03.560366. [PMID: 37873111 PMCID: PMC10592913 DOI: 10.1101/2023.10.03.560366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
The pursuit of precision oncology heavily relies on large-scale genomic and pharmacological data garnered from preclinical cancer model systems such as cell lines. While cell lines are instrumental in understanding the interplay between genomic programs and drug response, it well-established that they are not fully representative of patient tumors. Development of integrative methods that can systematically assess the commonalities between patient tumors and cell-lines can help bridge this gap. To this end, we introduce the Integrative Principal Component Regression (iPCR) model which uncovers both joint and model-specific structured variations in the genomic data of cell lines and patient tumors through matrix decompositions. The extracted joint variation is then used to predict patient drug responses based on the pharmacological data from preclinical models. Moreover, the interpretability of our model allows for the identification of key driver genes and pathways associated with the treatment-specific response in patients across multiple cancers. We demonstrate that the outputs of the iPCR model can assist in inferring both model-specific and shared co-expression networks between cell lines and patients. We show that iPCR performs favorably compared to competing approaches in predicting patient drug responses, in both simulation studies and real-world applications, in addition to identifying key genomic drivers of cancer drug responses.
Collapse
|
99
|
Yang C, Cheung YM, Ding J, Tan KC, Xue B, Zhang M. Contrastive Learning Assisted-Alignment for Partial Domain Adaptation. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:7621-7634. [PMID: 35130173 DOI: 10.1109/tnnls.2022.3145034] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
This work addresses unsupervised partial domain adaptation (PDA), in which classes in the target domain are a subset of the source domain. The key challenges of PDA are how to leverage source samples in the shared classes to promote positive transfer and filter out the irrelevant source samples to mitigate negative transfer. Existing PDA methods based on adversarial DA do not consider the loss of class discriminative representation. To this end, this article proposes a contrastive learning-assisted alignment (CLA) approach for PDA to jointly align distributions across domains for better adaptation and to reweight source instances to reduce the contribution of outlier instances. A contrastive learning-assisted conditional alignment (CLCA) strategy is presented for distribution alignment. CLCA first exploits contrastive losses to discover the class discriminative information in both domains. It then employs a contrastive loss to match the clusters across the two domains based on adversarial domain learning. In this respect, CLCA attempts to reduce the domain discrepancy by matching the class-conditional and marginal distributions. Moreover, a new reweighting scheme is developed to improve the quality of weights estimation, which explores information from both the source and the target domains. Empirical results on several benchmark datasets demonstrate that the proposed CLA outperforms the existing state-of-the-art PDA methods.
Collapse
|
100
|
Faraji P, Khodabakhshi MB. CollectiveNet-AltSpec: A collective concurrent CNN architecture of alternate specifications for EEG media perception and emotion tracing aided by multi-domain feature-augmentation. Neural Netw 2023; 167:502-516. [PMID: 37690212 DOI: 10.1016/j.neunet.2023.08.031] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 04/18/2023] [Accepted: 08/18/2023] [Indexed: 09/12/2023]
Abstract
Enhancing computability of cerebral recordings and connections made with human/non-human brain have been on track and are expected to propel in our current era. An effective contribution towards said ends is improving accuracy of attempts at discerning intricate phenomena taking place within human brain. Here and in two different capacities of experiments, we attempt to distinguish cerebral perceptions shaped and affective states surfaced during observation of samples of media incorporating distinct audio-visual and emotional contents, through employing electroencephalograph/EEG recorded sessions of two reputable datasets of DEAP and SEED. Here we introduce AltSpec(E3) the inceptive form of CollectiveNet intelligent computational architectures employing collective and concurrent multi-spec analysis to exploit complex patterns in complex data-structures. This processing technique uses a full array of diversification protocols with multifarious parts enabling surgical levels of optimization while integrating a holistic analysis of patterns. Data-structures designed here contain multi-electrode neuroinformatic and neurocognitive features studying emotion reactions and attentive patterns. These spatially and temporally featured 2D/3D constructs of domain-augmented data are eventually AI-processed and outputs are defragmented forming one definitive judgement. The media-perception tracing is arguably first of its kind, at least when implemented on mentioned datasets. Backed by this multi-directional approach and in subject-independent configurations for perception-tracing on 5-media-class basis, mean accuracies of 81.00% and 68.93% were obtained on DEAP and SEED, respectively. We also managed to classify emotions with accuracies of 61.59% and 66.21% in cross-dataset validation followed by 81.47% and 88.12% in cross-subject validation settings trained on DEAP and SEED, consecutively.
Collapse
Affiliation(s)
- Parham Faraji
- Department of Biomedical Engineering, Hamedan University of Technology, Hamedan 6516913733, Iran.
| | | |
Collapse
|