1
|
Liu H, Jin X, Liu D, Kong W, Tang J, Peng Y. Joint disentangled representation and domain adversarial training for EEG-based cross-session biometric recognition in single-task protocols. Cogn Neurodyn 2025; 19:31. [PMID: 39866660 PMCID: PMC11757832 DOI: 10.1007/s11571-024-10214-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Revised: 09/05/2024] [Accepted: 09/23/2024] [Indexed: 01/28/2025] Open
Abstract
The increasing adoption of wearable technologies highlights the potential of electroencephalogram (EEG) signals for biometric recognition. However, the intrinsic variability in cross-session EEG data presents substantial challenges in maintaining model stability and reliability. Moreover, the diversity within single-task protocols complicates achieving consistent and generalized model performance. To address these issues, we propose the Joint Disentangled Representation with Domain Adversarial Training (JDR-DAT) framework for EEG-based cross-session biometric recognition within single-task protocols. The JDR-DAT framework disentangles identity-specific features through mutual information estimation and incorporates domain adversarial training to enhance longitudinal robustness. Extensive experiments on longitudinal EEG data from two publicly available single-task protocol datasets-RSVP-based (Rapid Serial Visual Presentation) and MI-based (Motor Imagery)-demonstrate the efficacy of the JDR-DAT framework, with the proposed method achieving average accuracies of 85.83% and 96.72%, respectively.
Collapse
Affiliation(s)
- Honggang Liu
- School of Computer Science, Hangzhou Dianzi University, Hangzhou, 310018 Zhejiang China
- Zhejiang Key Laboratory of Brain-Machine Collaborative Intelligence, Hangzhou, 310018 Zhejiang China
| | - Xuanyu Jin
- School of Computer Science, Hangzhou Dianzi University, Hangzhou, 310018 Zhejiang China
- Zhejiang Key Laboratory of Brain-Machine Collaborative Intelligence, Hangzhou, 310018 Zhejiang China
| | - Dongjun Liu
- School of Computer Science, Hangzhou Dianzi University, Hangzhou, 310018 Zhejiang China
- Zhejiang Key Laboratory of Brain-Machine Collaborative Intelligence, Hangzhou, 310018 Zhejiang China
| | - Wanzeng Kong
- School of Computer Science, Hangzhou Dianzi University, Hangzhou, 310018 Zhejiang China
- Zhejiang Key Laboratory of Brain-Machine Collaborative Intelligence, Hangzhou, 310018 Zhejiang China
| | - Jiajia Tang
- School of Computer Science, Hangzhou Dianzi University, Hangzhou, 310018 Zhejiang China
- Zhejiang Key Laboratory of Brain-Machine Collaborative Intelligence, Hangzhou, 310018 Zhejiang China
| | - Yong Peng
- School of Computer Science, Hangzhou Dianzi University, Hangzhou, 310018 Zhejiang China
- Zhejiang Key Laboratory of Brain-Machine Collaborative Intelligence, Hangzhou, 310018 Zhejiang China
| |
Collapse
|
2
|
Yan W, Luo Q, Du C. Channel component correlation analysis for multi-channel EEG feature component extraction. Front Neurosci 2025; 19:1522964. [PMID: 40242456 PMCID: PMC12000010 DOI: 10.3389/fnins.2025.1522964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2024] [Accepted: 02/11/2025] [Indexed: 04/18/2025] Open
Abstract
Introduction Electroencephalogram (EEG) analysis has shown significant research value for brain disease diagnosis, neuromodulation and brain-computer interface (BCI) application. The analysis and processing of EEG signals is complex since EEG are nonstationary, nonlinear, and often contaminated by intense background noise. Principal component analysis (PCA) and independent component analysis (ICA), as the commonly used methods for multi-dimensional signal feature component extraction, still have some limitations in terms of performance and calculation. Methods In this study, channel component correlation analysis (CCCA) method was proposed to extract feature components of multi-channel EEG. Firstly, empirical wavelet transform (EWT) decomposed each channel signal into different frequency bands, and reconstructed them into a multi-dimensional signal. Then the objective optimization function was constructed by maximizing the covariance between multi-dimensional signals. Finally the feature components of multi-channel EEG were extracted using the calculated weight coefficient. Results The results showed that the CCCA method could find the most relevant frequency band between multi-channel EEG. Compared with PCA and ICA methods, CCCA could extract the common components of multi-channel EEG more effectively, which is of great significance for the accurate analysis of EEG. Discussion The CCCA method proposed in this study has shown excellent performance in the feature component extraction of multi-channel EEG and could be considered for practical engineering applications.
Collapse
Affiliation(s)
- Wenqiang Yan
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, China
- State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, China
| | - Qi Luo
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, China
| | - Chenghang Du
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, China
| |
Collapse
|
3
|
Si Y, Wang Z, Xu G, Wang Z, Xu T, Zhou T, Hu H. Group-member selection for RSVP-based collaborative brain-computer interfaces. Front Neurosci 2024; 18:1402154. [PMID: 39234182 PMCID: PMC11371794 DOI: 10.3389/fnins.2024.1402154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2024] [Accepted: 07/30/2024] [Indexed: 09/06/2024] Open
Abstract
Objective The brain-computer interface (BCI) systems based on rapid serial visual presentation (RSVP) have been widely utilized for the detection of target and non-target images. Collaborative brain-computer interface (cBCI) effectively fuses electroencephalogram (EEG) data from multiple users to overcome the limitations of low single-user performance in single-trial event-related potential (ERP) detection in RSVP-based BCI systems. In a multi-user cBCI system, a superior group mode may lead to better collaborative performance and lower system cost. However, the key factors that enhance the collaboration capabilities of multiple users and how to further use these factors to optimize group mode remain unclear. Approach This study proposed a group-member selection strategy to optimize the group mode and improve the system performance for RSVP-based cBCI. In contrast to the conventional grouping of collaborators at random, the group-member selection strategy enabled pairing each user with a better collaborator and allowed tasks to be done with fewer collaborators. Initially, we introduced the maximum individual capability and maximum collaborative capability (MIMC) to select optimal pairs, improving the system classification performance. The sequential forward floating selection (SFFS) combined with MIMC then selected a sub-group, aiming to reduce the hardware and labor expenses in the cBCI system. Moreover, the hierarchical discriminant component analysis (HDCA) was used as a classifier for within-session conditions, and the Euclidean space data alignment (EA) was used to overcome the problem of inter-trial variability for cross-session analysis. Main results In this paper, we verified the effectiveness of the proposed group-member selection strategy on a public RSVP-based cBCI dataset. For the two-user matching task, the proposed MIMC had a significantly higher AUC and TPR and lower FPR than the common random grouping mode and the potential group-member selection method. Moreover, the SFFS with MIMC enabled a trade-off between maintaining performance and reducing the number of system users. Significance The results showed that our proposed MIMC effectively optimized the group mode, enhanced the classification performance in the two-user matching task, and could reduce the redundant information by selecting the sub-group in the RSVP-based multi-user cBCI systems.
Collapse
Affiliation(s)
- Yuan Si
- Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Zhenyu Wang
- Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, China
| | - Guiying Xu
- Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Zikai Wang
- Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Tianheng Xu
- Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, China
- Shanghai Frontier Innovation Research Institute, Shanghai, China
| | - Ting Zhou
- Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, China
- Shanghai Frontier Innovation Research Institute, Shanghai, China
- School of Microelectronics, Shanghai University, Shanghai, China
| | - Honglin Hu
- Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, China
- University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
4
|
Zhang B, Xu M, Zhang Y, Ye S, Chen Y. Attention-ProNet: A Prototype Network with Hybrid Attention Mechanisms Applied to Zero Calibration in Rapid Serial Visual Presentation-Based Brain-Computer Interface. Bioengineering (Basel) 2024; 11:347. [PMID: 38671769 PMCID: PMC11048110 DOI: 10.3390/bioengineering11040347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 03/27/2024] [Accepted: 03/30/2024] [Indexed: 04/28/2024] Open
Abstract
The rapid serial visual presentation-based brain-computer interface (RSVP-BCI) system achieves the recognition of target images by extracting event-related potential (ERP) features from electroencephalogram (EEG) signals and then building target classification models. Currently, how to reduce the training and calibration time for classification models across different subjects is a crucial issue in the practical application of RSVP. To address this issue, a zero-calibration (ZC) method termed Attention-ProNet, which involves meta-learning with a prototype network integrating multiple attention mechanisms, was proposed in this study. In particular, multiscale attention mechanisms were used for efficient EEG feature extraction. Furthermore, a hybrid attention mechanism was introduced to enhance model generalization, and attempts were made to incorporate suitable data augmentation and channel selection methods to develop an innovative and high-performance ZC RSVP-BCI decoding model algorithm. The experimental results demonstrated that our method achieved a balance accuracy (BA) of 86.33% in the decoding task for new subjects. Moreover, appropriate channel selection and data augmentation methods further enhanced the performance of the network by affording an additional 2.3% increase in BA. The model generated by the meta-learning prototype network Attention-ProNet, which incorporates multiple attention mechanisms, allows for the efficient and accurate decoding of new subjects without the need for recalibration or retraining.
Collapse
Affiliation(s)
- Baiwen Zhang
- Institute of Information and Artificial Intelligence Technology, Beijing Academy of Science and Technology, Beijing 100089, China;
| | - Meng Xu
- Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China;
| | - Yueqi Zhang
- Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China;
| | - Sicheng Ye
- Intelligent Science and Technology, International College of Beijing University of Posts and Telecommunications, Beijing 100083, China;
| | - Yuanfang Chen
- Beijing Institute of Mechanical Equipment, Beijing 100854, China
| |
Collapse
|
5
|
Wang X, Li B, Lin Y, Gao X. Multi-source domain adaptation based tempo-spatial convolution network for cross-subject EEG classification in RSVP task. J Neural Eng 2024; 21:016025. [PMID: 38324909 DOI: 10.1088/1741-2552/ad2710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 02/07/2024] [Indexed: 02/09/2024]
Abstract
Objective.Many subject-dependent methods were proposed for electroencephalogram (EEG) classification in rapid serial visual presentation (RSVP) task, which required a large amount of data from new subject and were time-consuming to calibrate system. Cross-subject classification can realize calibration reduction or zero calibration. However, cross-subject classification in RSVP task is still a challenge.Approach.This study proposed a multi-source domain adaptation based tempo-spatial convolution (MDA-TSC) network for cross-subject RSVP classification. The proposed network consisted of three modules. First, the common feature extraction with multi-scale tempo-spatial convolution was constructed to extract domain-invariant features across all subjects, which could improve generalization of the network. Second, the multi-branch domain-specific feature extraction and alignment was conducted to extract and align domain-specific feature distributions of source and target domains in pairs, which could consider feature distribution differences among source domains. Third, the domain-specific classifier was exploited to optimize the network through loss functions and obtain prediction for the target domain.Main results.The proposed network was evaluated on the benchmark RSVP dataset, and the cross-subject classification results showed that the proposed MDA-TSC network outperformed the reference methods. Moreover, the effectiveness of the MDA-TSC network was verified through both ablation studies and visualization.Significance.The proposed network could effectively improve cross-subject classification performance in RSVP task, and was helpful to reduce system calibration time.
Collapse
Affiliation(s)
- Xuepu Wang
- School of Integrated Circuits and Electronics, Beijing Institute of Technology, Beijing 100081, People's Republic of China
| | - Bowen Li
- School of Medicine, Tsinghua University, Beijing 100084, People's Republic of China
| | - Yanfei Lin
- School of Integrated Circuits and Electronics, Beijing Institute of Technology, Beijing 100081, People's Republic of China
| | - Xiaorong Gao
- School of Medicine, Tsinghua University, Beijing 100084, People's Republic of China
| |
Collapse
|
6
|
Zhao Z, Lin Y, Wang Y, Gao X. Single-Trial EEG Classification Using Spatio-Temporal Weighting and Correlation Analysis for RSVP-Based Collaborative Brain Computer Interface. IEEE Trans Biomed Eng 2024; 71:553-562. [PMID: 37756179 DOI: 10.1109/tbme.2023.3309255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/29/2023]
Abstract
OBJECTIVE Since single brain computer interface (BCI) is limited in performance, it is necessary to develop collaborative BCI (cBCI) systems which integrate multi-user electroencephalogram (EEG) information to improve system performance. However, there are still some challenges in cBCI systems, including effective discriminant feature extraction of multi-user EEG data, fusion algorithms, time reduction of system calibration, etc. Methods: This study proposed an event-related potential (ERP) feature extraction and classification algorithm of spatio-temporal weighting and correlation analysis (STC) to improve the performance of cBCI systems. The proposed STC algorithm consisted of three modules. First, source extraction and interval modeling were used to overcome the problem of inter-trial variability. Second, spatio-temporal weighting and temporal projection were utilized to extract effective discriminant features for multi-user information fusion and cross-session transfer. Third, correlation analysis was conducted to match target/non-target templates for classification of multi-user and cross-session datasets. RESULTS The collaborative cross-session datasets of rapid serial visual presentation (RSVP) from 14 subjects were used to evaluate the performance of the EEG classification algorithm. For single-user/collaborative EEG classification of within-session and cross-session datasets, STC had significantly higher performance than the existing state-of-the-art machine learning algorithms. CONCLUSION It was demonstrated that STC was effective to improve the classification performance of multi-user collaboration and cross-session transfer for RSVP-based BCI systems, and was helpful to reduce the system calibration time.
Collapse
|
7
|
Du P, Li P, Cheng L, Li X, Su J. Single-trial P300 classification algorithm based on centralized multi-person data fusion CNN. Front Neurosci 2023; 17:1132290. [PMID: 36908799 PMCID: PMC9992797 DOI: 10.3389/fnins.2023.1132290] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Accepted: 02/01/2023] [Indexed: 02/24/2023] Open
Abstract
Introduction Currently, it is still a challenge to detect single-trial P300 from electroencephalography (EEG) signals. In this paper, to address the typical problems faced by existing single-trial P300 classification, such as complex, time-consuming and low accuracy processes, a single-trial P300 classification algorithm based on multiplayer data fusion convolutional neural network (CNN) is proposed to construct a centralized collaborative brain-computer interfaces (cBCI) for fast and highly accurate classification of P300 EEG signals. Methods In this paper, two multi-person data fusion methods (parallel data fusion and serial data fusion) are used in the data pre-processing stage to fuse multi-person EEG information stimulated by the same task instructions, and then the fused data is fed as input to the CNN for classification. In building the CNN network for single-trial P300 classification, the Conv layer was first used to extract the features of single-trial P300, and then the Maxpooling layer was used to connect the Flatten layer for secondary feature extraction and dimensionality reduction, thereby simplifying the computation. Finally batch normalisation is used to train small batches of data in order to better generalize the network and speed up single-trial P300 signal classification. Results In this paper, the above new algorithms were tested on the Kaggle dataset and the Brain-Computer Interface (BCI) Competition III dataset, and by analyzing the P300 waveform features and EEG topography and the four standard evaluation metrics, namely Accuracy, Precision, Recall and F1-score,it was demonstrated that the single-trial P300 classification algorithm after two multi-person data fusion CNNs significantly outperformed other classification algorithms. Discussion The results show that the single-trial P300 classification algorithm after two multi-person data fusion CNNs significantly outperformed the single-person model, and that the single-trial P300 classification algorithm with two multi-person data fusion CNNs involves smaller models, fewer training parameters, higher classification accuracy and improves the overall P300-cBCI classification rate and actual performance more effectively with a small amount of sample information compared to other algorithms.
Collapse
Affiliation(s)
- Pu Du
- School of Integrated Circuit Science and Engineering, Tianjin University of Technology, Tianjin, China
| | - Penghai Li
- School of Integrated Circuit Science and Engineering, Tianjin University of Technology, Tianjin, China
| | - Longlong Cheng
- School of Integrated Circuit Science and Engineering, Tianjin University of Technology, Tianjin, China.,China Electronics Cloud Brain Technology Co., Ltd., Tianjin, China
| | - Xueqing Li
- School of Integrated Circuit Science and Engineering, Tianjin University of Technology, Tianjin, China
| | - Jianxian Su
- School of Integrated Circuit Science and Engineering, Tianjin University of Technology, Tianjin, China
| |
Collapse
|
8
|
Yan W, Wu Y. A time-frequency denoising method for single-channel event-related EEG. Front Neurosci 2022; 16:991136. [PMID: 36507356 PMCID: PMC9732370 DOI: 10.3389/fnins.2022.991136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 11/14/2022] [Indexed: 11/27/2022] Open
Abstract
Introduction Electroencephalogram (EEG) acquisition is easily affected by various noises, including those from electrocardiogram (ECG), electrooculogram (EOG), and electromyogram (EMG). Because noise interference can significantly limit the study and analysis of brain signals, there is a significant need for the development of improved methods to remove this interference for more accurate measurement of EEG signals. Methods Based on the non-linear and non-stationary characteristics of brain signals, a strategy was developed to denoise brain signals using a time-frequency denoising algorithm framework of short-time Fourier transform (STFT), bidimensional empirical mode decomposition (BEMD), and non-local means (NLM). Time-frequency analysis can reveal the signal frequency component and its evolution process, allowing the elimination of noise according to the signal and noise distribution. BEMD can be used to decompose the time-frequency signals into sub-time-frequency signals for noise removal at different scales. NLM relies on structural self-similarity to locally smooth an image to remove noise and restore its main geometric structure, making this method appropriate for time-frequency signal denoising. Results The experimental results show that the proposed method can effectively suppress the high-frequency components of brain signals, resulting in a smoother brain signal waveform after denoising. The correlation coefficient of the reference signal, a superposition average of multiple trial signals, and the original single trial signal was determined, and then correlation coefficients were calculated between the reference signal and single trial signals processed by time-frequency denoising, ensemble empirical mode decomposition (EEMD)-independent component analysis (ICA), EEMD-canonical correlation analysis (CCA), and wavelet threshold denoising methods. The correlation coefficient was highest for the signal processed by the time-frequency denoising method and the reference signal, indicating that the single trial signal after time-frequency denoising was most similar to the waveform of the reference signal and suggesting this is a feasible strategy to effectively reduce noise and more accurately determine signals. Discussion The proposed time-frequency denoising method exhibits excellent performance with promising potential for practical application.
Collapse
|
9
|
Putri F, Susnoschi Luca I, Garcia Pedro JA, Ding H, Vuckovic A. Winners and losers in brain computer interface competitive gaming: Directional connectivity analysis. J Neural Eng 2022; 19. [PMID: 35882224 DOI: 10.1088/1741-2552/ac8451] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 07/26/2022] [Indexed: 11/11/2022]
Abstract
OBJECTIVE to characterize the direction within and between brain connectivity in winning and losing players in a competitive brain-computer interface game. APPROACH ten dyads (26.9 ± 4.7 years old, eight females and 12 males) participated in the study. In a competitive game based on neurofeedback, they used their relative alpha (RA) band power from the electrode location Pz, to control a virtual seesaw. The players in each pair were separated into winners (W) and losers (L) based on their scores. Intrabrain connectivity was analyzed using multivariate Granger Causality (GC) and Directed Transfer Function, while interbrain connectivity was analyzed using bivariate GC. RESULTS linear regression analysis revealed a significant relationship (p<0.05) between RA and individual scores. During the game, W players maintained a higher RA than L players, although it was not higher than their baseline RA. The analysis of intrabrain GC indicated that both groups engaged in general social interactions, but only the W group succeeded in controlling their brain activity at Pz. Group L applied an inappropriate metal strategy, characterized by strong activity in the left frontal cortex, indicative of collaborative gaming. Interbrain GC showed a larger flow of information from the L to the W group, suggesting a higher capability of the W group to monitor the activity of their opponent. SIGNIFICANCE both innate neurological indices and gaming mental strategies contribute to game outcomes. Future studies should investigate whether there is a causal relationship between these two factors.
Collapse
Affiliation(s)
- Finda Putri
- Centre for Rehabilitation Engineering, University of Glasgow, James Watt Building (South), G12 8QQ, Glasgow, Glasgow, G12 8QQ, UNITED KINGDOM OF GREAT BRITAIN AND NORTHERN IRELAND
| | - Ioana Susnoschi Luca
- Centre for Rehabilitation Engineering, University of Glasgow, James Watt Building (South), G12 8QQ, Glasgow, Glasgow, G12 8QQ, UNITED KINGDOM OF GREAT BRITAIN AND NORTHERN IRELAND
| | - Jorge Abdullah Garcia Pedro
- Centre for Rehabilitation Engineering, University of Glasgow, James Watt Building (South), G12 8QQ, Glasgow, Glasgow, G12 8QQ, UNITED KINGDOM OF GREAT BRITAIN AND NORTHERN IRELAND
| | - Hao Ding
- Centre for Rehabilitation Engineering, University of Glasgow, James Watt Building (South), G12 8QQ, Glasgow, Glasgow, G12 8QQ, UNITED KINGDOM OF GREAT BRITAIN AND NORTHERN IRELAND
| | - Aleksandra Vuckovic
- School of Engineering, Biomedical Engineering, University of Glasgow, James Watt building (south), G12 8QQ, Glasgow, Glasgow, G12 8QQ, UNITED KINGDOM OF GREAT BRITAIN AND NORTHERN IRELAND
| |
Collapse
|
10
|
Xie P, Hao S, Zhao J, Liang Z, Li X. A Spatio-Temporal Method for Extracting Gamma-Band Features to Enhance Classification in a Rapid Serial Visual Presentation Task. Int J Neural Syst 2022; 32:2250010. [PMID: 35049411 DOI: 10.1142/s0129065722500101] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Rapid serial visual presentation (RSVP) is a type of electroencephalogram (EEG) pattern commonly used for target recognition. Besides delta- and theta-band responses already used for classification, RSVP task also evokes gamma-band responses having low amplitude and large individual difference. This paper proposes a filter bank spatio-temporal component analysis (FBSCA) method, extracting spatio-temporal features of the gamma-band responses for the first time, to enhance the RSVP classification performance. Considering the individual difference in time latency and responsive frequency, the proposed FBSCA method decomposes the gamma-band EEG data into sub-components in different time-frequency-space domains and seeks the weight coefficients to optimize the combinations of electrodes, common spatial pattern (CSP) components, time windows and frequency bands. Two state-of-the-art methods, i.e. hierarchical discriminant principal component analysis (HDPCA) and discriminative canonical pattern matching (DCPM), were used for comparison. The performance was evaluated in [Formula: see text] cross validations using a public dataset. Study results showed that the FBSCA method outperformed the other methods regardless of number of training trials. These results suggest that the proposed FBSCA method can enhance the RSVP classification.
Collapse
Affiliation(s)
- Ping Xie
- Key Laboratory of Intelligent Rehabilitation, and Neromodulation of Hebei Province, School of Electrical Engineering, Yanshan University, Qinhuangdao 066004, P. R. China
| | - Shencai Hao
- Key Laboratory of Intelligent Rehabilitation, and Neromodulation of Hebei Province, School of Electrical Engineering, Yanshan University, Qinhuangdao 066004, P. R. China
| | - Jing Zhao
- Key Laboratory of Intelligent Rehabilitation, and Neromodulation of Hebei Province, School of Electrical Engineering, Yanshan University, Qinhuangdao 066004, P. R. China
| | - Zhenhu Liang
- Key Laboratory of Intelligent Rehabilitation, and Neromodulation of Hebei Province, School of Electrical Engineering, Yanshan University, Qinhuangdao 066004, P. R. China
| | - Xiaoli Li
- State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing 100875, P. R. China
| |
Collapse
|
11
|
Onishi A. Brain-computer interface with rapid serial multimodal presentation using artificial facial images and voice. Comput Biol Med 2021; 136:104685. [PMID: 34343888 DOI: 10.1016/j.compbiomed.2021.104685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Revised: 07/22/2021] [Accepted: 07/22/2021] [Indexed: 10/20/2022]
Abstract
Electroencephalography (EEG) signals elicited by multimodal stimuli can drive brain-computer interfaces (BCIs), and research has demonstrated that visual and auditory stimuli can be employed simultaneously to improve BCI performance. However, no studies have investigated the effect of multimodal stimuli in rapid serial visual presentation (RSVP) BCIs. The present study proposed a rapid serial multimodal presentation (RSMP) BCI that incorporates artificial facial images and artificial voice stimuli. To clarify the effect of audiovisual stimuli on the RSMP BCI, scrambled images and masked sounds were applied instead of visual and auditory stimuli, respectively. The findings indicated that the audiovisual stimuli improved performance of the RSMP BCI, and that P300 at Pz contributed to classification accuracy. Online accuracy of the BCI reached 85.7 ± 11.5 %. Taken together, these findings may aid in the development of better gaze-independent BCI systems.
Collapse
Affiliation(s)
- A Onishi
- Department of Electronic Systems Engineering, National Institute of Technology, Kagawa College, 551, Kohda, Takuma-cho, Mitoyo-shi, Kagawa, 769-1192, Japan; Center for Frontier Medical Engineering, Chiba University, 1-33 Yayoi-cho, Inage-ku, Chiba, Japan.
| |
Collapse
|