1
|
Chen X, Li S, Tu Y, Wang Z, Wu D. User-wise perturbations for user identity protection in EEG-based BCIs. J Neural Eng 2025; 22:016040. [PMID: 39423826 DOI: 10.1088/1741-2552/ad88a5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2024] [Accepted: 10/18/2024] [Indexed: 10/21/2024]
Abstract
Objective. An electroencephalogram (EEG)-based brain-computer interface (BCI) is a direct communication pathway between the human brain and a computer. Most research so far studied more accurate BCIs, but much less attention has been paid to the ethics of BCIs. Aside from task-specific information, EEG signals also contain rich private information, e.g. user identity, emotion, disorders, etc which should be protected.Approach. We show for the first time that adding user-wise perturbations can make identity information in EEG unlearnable. We propose four types of user-wise privacy-preserving perturbations, i.e. random noise, synthetic noise, error minimization noise, and error maximization noise. After adding the proposed perturbations to EEG training data, the user identity information in the data becomes unlearnable, while the BCI task information remains unaffected.Main results. Experiments on six EEG datasets using three neural network classifiers and various traditional machine learning models demonstrated the robustness and practicability of the proposed perturbations.Significance. Our research shows the feasibility of hiding user identity information in EEG data without impacting the primary BCI task information.
Collapse
Affiliation(s)
- Xiaoqing Chen
- Huazhong University of Science and Technology, Wuhan, People's Republic of China
- Zhongguancun Academy, Beijing, People's Republic of China
| | - Siyang Li
- Huazhong University of Science and Technology, Wuhan, People's Republic of China
| | - Yunlu Tu
- Huazhong University of Science and Technology, Wuhan, People's Republic of China
| | - Ziwei Wang
- Huazhong University of Science and Technology, Wuhan, People's Republic of China
| | - Dongrui Wu
- Huazhong University of Science and Technology, Wuhan, People's Republic of China
- Zhongguancun Academy, Beijing, People's Republic of China
| |
Collapse
|
2
|
Liang D, Liu A, Wu L, Li C, Qian R, Chen X. Privacy-preserving multi-source semi-supervised domain adaptation for seizure prediction. Cogn Neurodyn 2024; 18:3521-3534. [PMID: 39712093 PMCID: PMC11655995 DOI: 10.1007/s11571-023-10026-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 09/22/2023] [Accepted: 10/23/2023] [Indexed: 12/24/2024] Open
Abstract
Domain adaptation (DA) has been frequently used to solve the inter-patient variability problem in EEG-based seizure prediction. However, existing DA methods require access to the existing patients' data when adapting the model, which leads to privacy concerns. Besides, most of them treat the whole existing patients' data as one single source and attempt to minimize the discrepancy with the target patient. This manner ignores the inter-patient variability among source patients, making the adaptation more difficult. Considering theses issues simultaneously, we present a novel multi-source-free semi-supervised domain adaptive seizure prediction model (MSF-SSDA-SPM). MSF-SSDA-SPM considers each source patient as one single source and generates a pretrained model from each source. Without requiring access to the source data, MSF-SSDA-SPM performs adaptation just using these pretrained source models and limited labeled target data. Specifically, we freeze the classifiers of all the source models and optimize the source feature extractors in a joint manner. Then we design a knowledge distillation strategy to integrate the knowledge of these well-adapted source models into one single target model. On the CHB-MIT dataset, MSF-SSDA-SPM attains a sensitivity of 88.6%, a FPR of 0.182/h and an AUC of 0.856; on the Kaggle dataset, it achieves 78.6%, 0.178/h and 0.784, respectively. Experimental results demonstrate that MSF-SSDA-SPM achieves both high privacy-protection and promising prediction performance.
Collapse
Affiliation(s)
- Deng Liang
- Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei, 230027 Anhui China
| | - Aiping Liu
- Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei, 230027 Anhui China
| | - Le Wu
- Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei, 230027 Anhui China
| | - Chang Li
- Department of Biomedical Engineering, Hefei University of Technology, Hefei, 230009 Anhui China
| | - Ruobing Qian
- Department of Neurosurgery, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230001 Anhui China
| | - Xun Chen
- Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei, 230027 Anhui China
- Department of Neurosurgery, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230001 Anhui China
| |
Collapse
|
3
|
Jia T, Meng L, Li S, Liu J, Wu D. Federated Motor Imagery Classification for Privacy-Preserving Brain-Computer Interfaces. IEEE Trans Neural Syst Rehabil Eng 2024; 32:3442-3451. [PMID: 39255189 DOI: 10.1109/tnsre.2024.3457504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/12/2024]
Abstract
Training an accurate classifier for EEG-based brain-computer interface (BCI) requires EEG data from a large number of users, whereas protecting their data privacy is a critical consideration. Federated learning (FL) is a promising solution to this challenge. This paper proposes Federated classification with local Batch-specific batch normalization and Sharpness-aware minimization (FedBS) for privacy protection in EEG-based motor imagery (MI) classification. FedBS utilizes local batch-specific batch normalization to reduce data discrepancies among different clients, and sharpness-aware minimization optimizer in local training to improve model generalization. Experiments on three public MI datasets using three popular deep learning models demonstrated that FedBS outperformed six state-of-the-art FL approaches. Remarkably, it also outperformed centralized training, which does not consider privacy protection at all. In summary, FedBS protects user EEG data privacy, enabling multiple BCI users to participate in large-scale machine learning model training, which in turn improves the BCI decoding accuracy.
Collapse
|
4
|
Wu H, Ma Z, Guo Z, Wu Y, Zhang J, Zhou G, Long J. Online Privacy-Preserving EEG Classification by Source-Free Transfer Learning. IEEE Trans Neural Syst Rehabil Eng 2024; 32:3059-3070. [PMID: 39150815 DOI: 10.1109/tnsre.2024.3445115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/18/2024]
Abstract
Electroencephalogram (EEG) signals play an important role in brain-computer interface (BCI) applications. Recent studies have utilized transfer learning to assist the learning task in the new subject, i.e., target domain, by leveraging beneficial information from previous subjects, i.e., source domains. Nevertheless, EEG signals involve sensitive personal mental and health information. Thus, privacy concern becomes a critical issue. In addition, existing methods mostly assume that a portion of the new subject's data is available and perform alignment or adaptation between the source and target domains. However, in some practical scenarios, new subjects prefer prompt BCI utilization over the time-consuming process of collecting data for calibration and adaptation, which makes the above assumption difficult to hold. To address the above challenges, we propose Online Source-Free Transfer Learning (OSFTL) for privacy-preserving EEG classification. Specifically, the learning procedure contains offline and online stages. At the offline stage, multiple model parameters are obtained based on the EEG samples from multiple source subjects. OSFTL only needs access to these source model parameters to preserve the privacy of the source subjects. At the online stage, a target classifier is trained based on the online sequence of EEG instances. Subsequently, OSFTL learns a weighted combination of the source and target classifiers to obtain the final prediction for each target instance. Moreover, to ensure good transferability, OSFTL dynamically updates the transferred weight of each source domain based on the similarity between each source classifier and the target classifier. Comprehensive experiments on both simulated and real-world applications demonstrate the effectiveness of the proposed method, indicating the potential of OSFTL to facilitate the deployment of BCI applications outside of controlled laboratory settings.
Collapse
|
5
|
Wang Z, Li S, Luo J, Liu J, Wu D. Channel reflection: Knowledge-driven data augmentation for EEG-based brain-computer interfaces. Neural Netw 2024; 176:106351. [PMID: 38713969 DOI: 10.1016/j.neunet.2024.106351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Revised: 04/04/2024] [Accepted: 04/28/2024] [Indexed: 05/09/2024]
Abstract
A brain-computer interface (BCI) enables direct communication between the human brain and external devices. Electroencephalography (EEG) based BCIs are currently the most popular for able-bodied users. To increase user-friendliness, usually a small amount of user-specific EEG data are used for calibration, which may not be enough to develop a pure data-driven decoding model. To cope with this typical calibration data shortage challenge in EEG-based BCIs, this paper proposes a parameter-free channel reflection (CR) data augmentation approach that incorporates prior knowledge on the channel distributions of different BCI paradigms in data augmentation. Experiments on eight public EEG datasets across four different BCI paradigms (motor imagery, steady-state visual evoked potential, P300, and seizure classifications) using different decoding algorithms demonstrated that: (1) CR is effective, i.e., it can noticeably improve the classification accuracy; (2) CR is robust, i.e., it consistently outperforms existing data augmentation approaches in the literature; and, (3) CR is flexible, i.e., it can be combined with other data augmentation approaches to further improve the performance. We suggest that data augmentation approaches like CR should be an essential step in EEG-based BCIs. Our code is available online.
Collapse
Affiliation(s)
- Ziwei Wang
- Key Laboratory of the Ministry of Education for Image Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China; Shenzhen Huazhong University of Science and Technology Research Institute, Shenzhen 518063, China.
| | - Siyang Li
- Key Laboratory of the Ministry of Education for Image Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China; Shenzhen Huazhong University of Science and Technology Research Institute, Shenzhen 518063, China.
| | - Jingwei Luo
- China Electronic System Technology Co., Ltd., Beijing 100089, China.
| | - Jiajing Liu
- School of Civil and Hydraulic Engineering, Huazhong University of Science and Technology, Wuhan 430074, China.
| | - Dongrui Wu
- Key Laboratory of the Ministry of Education for Image Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China; Shenzhen Huazhong University of Science and Technology Research Institute, Shenzhen 518063, China.
| |
Collapse
|
6
|
Li S, Wang Z, Luo H, Ding L, Wu D. T-TIME: Test-Time Information Maximization Ensemble for Plug-and-Play BCIs. IEEE Trans Biomed Eng 2024; 71:423-432. [PMID: 37552589 DOI: 10.1109/tbme.2023.3303289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/10/2023]
Abstract
OBJECTIVE An electroencephalogram (EEG)-based brain-computer interface (BCI) enables direct communication between the human brain and a computer. Due to individual differences and non-stationarity of EEG signals, such BCIs usually require a subject-specific calibration session before each use, which is time-consuming and user-unfriendly. Transfer learning (TL) has been proposed to shorten or eliminate this calibration, but existing TL approaches mainly consider offline settings, where all unlabeled EEG trials from the new user are available. METHODS This article proposes Test-Time Information Maximization Ensemble (T-TIME) to accommodate the most challenging online TL scenario, where unlabeled EEG data from the new user arrive in a stream, and immediate classification is performed. T-TIME initializes multiple classifiers from the aligned source data. When an unlabeled test EEG trial arrives, T-TIME first predicts its labels using ensemble learning, and then updates each classifier by conditional entropy minimization and adaptive marginal distribution regularization. Our code is publicized. RESULTS Extensive experiments on three public motor imagery based BCI datasets demonstrated that T-TIME outperformed about 20 classical and state-of-the-art TL approaches. SIGNIFICANCE To our knowledge, this is the first work on test time adaptation for calibration-free EEG-based BCIs, making plug-and-play BCIs possible.
Collapse
|
7
|
Kim SJ, Lee DH, Kwak HG, Lee SW. Toward Domain-Free Transformer for Generalized EEG Pre-Training. IEEE Trans Neural Syst Rehabil Eng 2024; 32:482-492. [PMID: 38236672 DOI: 10.1109/tnsre.2024.3355434] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2024]
Abstract
Electroencephalography (EEG) signals are the brain signals acquired using the non-invasive approach. Owing to the high portability and practicality, EEG signals have found extensive application in monitoring human physiological states across various domains. In recent years, deep learning methodologies have been explored to decode the intricate information embedded in EEG signals. However, since EEG signals are acquired from humans, it has issues with acquiring enormous amounts of data for training the deep learning models. Therefore, previous research has attempted to develop pre-trained models that could show significant performance improvement through fine-tuning when data are scarce. Nonetheless, existing pre-trained models often struggle with constraints, such as the necessity to operate within datasets of identical configurations or the need to distort the original data to apply the pre-trained model. In this paper, we proposed the domain-free transformer, called DFformer, for generalizing the EEG pre-trained model. In addition, we presented the pre-trained model based on DFformer, which is capable of seamless integration across diverse datasets without necessitating architectural modification or data distortion. The proposed model achieved competitive performance across motor imagery and sleep stage classification datasets. Notably, even when fine-tuned on datasets distinct from the pre-training phase, DFformer demonstrated marked performance enhancements. Hence, we demonstrate the potential of DFformer to overcome the conventional limitations in pre-trained model development, offering robust applicability across a spectrum of domains.
Collapse
|
8
|
Wang Z, Zhang W, Li S, Chen X, Wu D. Unsupervised domain adaptation for cross-patient seizure classification. J Neural Eng 2023; 20:066002. [PMID: 37906968 DOI: 10.1088/1741-2552/ad0859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 10/31/2023] [Indexed: 11/02/2023]
Abstract
Objective. Epileptic seizure is a chronic neurological disease affecting millions of patients. Electroencephalogram (EEG) is the gold standard in epileptic seizure classification. However, its low signal-to-noise ratio, strong non-stationarity, and large individual difference nature make it difficult to directly extend the seizure classification model from one patient to another. This paper considers multi-source unsupervised domain adaptation for cross-patient EEG-based seizure classification, i.e. there are multiple source patients with labeled EEG data, which are used to label the EEG trials of a new patient.Approach. We propose an source domain selection (SDS)-global domain adaptation (GDA)-target agent subdomain adaptation (TASA) approach, which includes SDS to filter out dissimilar source domains, GDA to align the overall distributions of the selected source domains and the target domain, and TASA to identify the most similar source domain to the target domain so that its labels can be utilized.Main results. Experiments on two public seizure datasets demonstrated that SDS-GDA-TASA outperformed 13 existing approaches in unsupervised cross-patient seizure classification.Significance. Our approach could save clinicians plenty of time in labeling EEG data for epilepsy patients, greatly increasing the efficiency of seizure diagnostics.
Collapse
Affiliation(s)
- Ziwei Wang
- Key Laboratory of the Ministry of Education for Image Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, People's Republic of China
| | - Wen Zhang
- Key Laboratory of the Ministry of Education for Image Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, People's Republic of China
| | - Siyang Li
- Key Laboratory of the Ministry of Education for Image Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, People's Republic of China
| | - Xinru Chen
- Key Laboratory of the Ministry of Education for Image Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, People's Republic of China
| | - Dongrui Wu
- Key Laboratory of the Ministry of Education for Image Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, People's Republic of China
- Belt and Road Joint Laboratory on Measurement and Control Technology, Huazhong University of Science and Technology, Wuhan 430074, People's Republic of China
| |
Collapse
|
9
|
Meng L, Jiang X, Huang J, Li W, Luo H, Wu D. User Identity Protection in EEG-Based Brain-Computer Interfaces. IEEE Trans Neural Syst Rehabil Eng 2023; 31:3576-3586. [PMID: 37651476 DOI: 10.1109/tnsre.2023.3310883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
Abstract
A brain-computer interface (BCI) establishes a direct communication pathway between the brain and an external device. Electroencephalogram (EEG) is the most popular input signal in BCIs, due to its convenience and low cost. Most research on EEG-based BCIs focuses on the accurate decoding of EEG signals; however, EEG signals also contain rich private information, e.g., user identity, emotion, and so on, which should be protected. This paper first exposes a serious privacy problem in EEG-based BCIs, i.e., the user identity in EEG data can be easily learned so that different sessions of EEG data from the same user can be associated together to more reliably mine private information. To address this issue, we further propose two approaches to convert the original EEG data into identity-unlearnable EEG data, i.e., removing the user identity information while maintaining the good performance on the primary BCI task. Experiments on seven EEG datasets from five different BCI paradigms showed that on average the generated identity-unlearnable EEG data can reduce the user identification accuracy from 70.01% to at most 21.36%, greatly facilitating user privacy protection in EEG-based BCIs.
Collapse
|