1
|
Chen K, Ruan W, Liu Q, Ai Q, Ma L. A novel deep learning model combining 3DCNN-CapsNet and hierarchical attention mechanism for EEG emotion recognition. Neural Netw 2025; 186:107267. [PMID: 40010290 DOI: 10.1016/j.neunet.2025.107267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2024] [Revised: 12/22/2024] [Accepted: 02/10/2025] [Indexed: 02/28/2025]
Abstract
Emotion recognition plays a key role in the field of human-computer interaction. Classifying and predicting human emotions using electroencephalogram (EEG) signals has consistently been a challenging research area. Recently, with the increasing application of deep learning methods such as convolutional neural network (CNN) and channel attention mechanism (CA). The recognition accuracy of emotion recognition methods has already reached an outstanding level. However, CNN and its derivatives have the defect that the sensory field of view is small and can only extract local features. The traditional channel attention mechanism only focuses on the correlation between different channels and assigns weights to each channel according to its contribution to the emotion recognition task, ignoring the fact that different EEG frequency bands in the same channel signal also have different contributions to the task. To address the above-mentioned problems , this paper propose HA-CapsNet, a novel end-to-end model combining 3DCNN-CapsNet with a Hierarchical Attention mechanism. This model captures both inter-channel correlations and the contribution of each frequency band. Additionally, the capsule network in 3DCNN-CapsNet extracts more spatial feature information compared to conventional CNNs. Our HA-CapsNet achieves recognition accuracies of 97.40%, 97.20%, and 97.60% on the DEAP dataset, and 95.80%, 96.10%, and 96.30% on the DREAMER dataset, outperforming state-of-the-art methods with the smallest variance. Furthermore, experiments removing channels from the DEAP and DREAMER datasets in ascending order of their hierarchical attention weights showed that even with fewer channels, the model maintained strong recognition performance. This demonstrates HA-CapsNet's low dependence on large datasets and its suitability for lightweight EEG devices, promoting advancements in EEG device development.
Collapse
Affiliation(s)
- Kun Chen
- School of Information Engineering, Wuhan University of Technology, Wuhan 430070, Hubei, China
| | - Wenhao Ruan
- School of Information Engineering, Wuhan University of Technology, Wuhan 430070, Hubei, China
| | - Quan Liu
- School of Information Engineering, Wuhan University of Technology, Wuhan 430070, Hubei, China
| | - Qingsong Ai
- School of Information Engineering, Wuhan University of Technology, Wuhan 430070, Hubei, China; School of Artificial Intelligence, Hubei University, Wuhan 430062, Hubei, China
| | - Li Ma
- School of Information Engineering, Wuhan University of Technology, Wuhan 430070, Hubei, China.
| |
Collapse
|
2
|
Aslam MA, Khan K, Khan W, Khan SU, Albanyan A, Algamdi SA. Paraphrase detection for Urdu language text using fine-tune BiLSTM framework. Sci Rep 2025; 15:15383. [PMID: 40316633 PMCID: PMC12048677 DOI: 10.1038/s41598-025-93260-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2024] [Accepted: 03/05/2025] [Indexed: 05/04/2025] Open
Abstract
Automated paraphrase detection is crucial for natural language processing (NL) applications like text summarization, plagiarism detection, and question-answering systems. Detecting paraphrases in Urdu text remains challenging due to the language's complex morphology, distinctive script, and lack of resources such as labelled datasets, pre-trained models, and tailored NLP tools. This research proposes a novel bidirectional long short-term memory (BiLSTM) framework to address Urdu paraphrase detection's intricacies. Our approach employs word embeddings and text preprocessing techniques like tokenization, stop-word removal, and label encoding to effectively handle Urdu's morphological variations. The BiLSTM network sequentially processes the input, leveraging both forward and backward contextual information to encode the complex syntactic and semantic patterns inherent in Urdu text. An essential contribution of this work is the creation of a large-scale Urdu Paraphrased Corpus (UPC) comprising 400,000 potential sentence pair duplicates, with 150,000 pairs manually identified as paraphrases. Our findings reveal a significant improvement in paraphrase detection performance compared to existing methods. We provide insights into the underlying linguistic features and patterns that contribute to the robustness of our framework. This resource facilitates training and evaluating Urdu paraphrase detection models. Experimental evaluations on the custom UPC dataset demonstrate our BiLSTM model's superiority, achieving 94.14% accuracy and outperforming state-of-the-art methods like CNN (83.43%) and LSTM (88.09%). Our model attains an impressive 95.34% accuracy on the benchmark Quora dataset. Furthermore, we incorporate a comprehensive linguistic rule engine to handle exceptional cases during paraphrase analysis, ensuring robust performance across diverse contexts.
Collapse
Affiliation(s)
- Muhammad Ali Aslam
- Department of Computer Science, University of Science and Technology, Bannu, 28100, Pakistan
| | - Khairullah Khan
- Department of Computer Science, University of Science and Technology, Bannu, 28100, Pakistan
| | - Wahab Khan
- Department of Computer Science, University of Science and Technology, Bannu, 28100, Pakistan
| | - Sajid Ullah Khan
- Department of Information Systems, College of Computer Engineering and Sciences, Prince Sattam Bin Abdul Aziz University, Al-Kharj, Kingdom of Saudi Arabia.
| | - Abdullah Albanyan
- Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam Bin Abdul Aziz University, Al-Kharj, Kingdom of Saudi Arabia
| | - Shabbab Ali Algamdi
- Department of Software Engineering, College of Computer Engineering and Sciences, Prince Sattam Bin Abdul Aziz University, Al-Kharj, Kingdom of Saudi Arabia
| |
Collapse
|
3
|
Feng X, Cong P, Dong L, Xin Y, Miao F, Xin R. Channel attention convolutional aggregation network based on video-level features for EEG emotion recognition. Cogn Neurodyn 2024; 18:1689-1707. [PMID: 39104696 PMCID: PMC11297860 DOI: 10.1007/s11571-023-10034-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 06/01/2023] [Accepted: 06/27/2023] [Indexed: 08/07/2024] Open
Abstract
Electroencephalogram (EEG) emotion recognition plays a vital role in affective computing. A limitation of the EEG emotion recognition task is that the features of multiple domains are rarely included in the analysis simultaneously because of the lack of an effective feature organization form. This paper proposes a video-level feature organization method to effectively organize the temporal, frequency and spatial domain features. In addition, a deep neural network, Channel Attention Convolutional Aggregation Network, is designed to explore deeper emotional information from video-level features. The network uses a channel attention mechanism to adaptively captures critical EEG frequency bands. Then the frame-level representation of each time point is obtained by multi-layer convolution. Finally, the frame-level features are aggregated through NeXtVLAD to learn the time-sequence-related features. The method proposed in this paper achieves the best classification performance in SEED and DEAP datasets. The mean accuracy and standard deviation of the SEED dataset are 95.80% and 2.04%. In the DEAP dataset, the average accuracy with the standard deviation of arousal and valence are 98.97% ± 1.13% and 98.98% ± 0.98%, respectively. The experimental results show that our approach based on video-level features is effective for EEG emotion recognition tasks.
Collapse
Affiliation(s)
- Xin Feng
- School of Science, Jilin Institute of Chemical Technology, Jilin, 130000 People’s Republic of China
| | - Ping Cong
- College of Information and Control Engineering, Jilin Institute of Chemical Technology, Jilin, 130000 People’s Republic of China
| | - Lin Dong
- Department of Epidemiology and Biostatistics, School of Public Health, Jilin University, Changchun, 130012 People’s Republic of China
| | - Yongxian Xin
- College of Business and Economics, Australian National University, Act, Canberra, 2601 Australia
| | - Fengbo Miao
- College of Electronics and Information Engineering, Tiangong University, Tianjin, 300387 People’s Republic of China
| | - Ruihao Xin
- College of Information and Control Engineering, Jilin Institute of Chemical Technology, Jilin, 130000 People’s Republic of China
- College of Computer Science and Technology, and Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, 130012 People’s Republic of China
| |
Collapse
|
4
|
Saeedinia SA, Jahed-Motlagh MR, Tafakhori A, Kasabov NK. Diagnostic biomarker discovery from brain EEG data using LSTM, reservoir-SNN, and NeuCube methods in a pilot study comparing epilepsy and migraine. Sci Rep 2024; 14:10667. [PMID: 38724576 PMCID: PMC11082192 DOI: 10.1038/s41598-024-60996-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2023] [Accepted: 04/30/2024] [Indexed: 05/12/2024] Open
Abstract
The study introduces a new online spike encoding algorithm for spiking neural networks (SNN) and suggests new methods for learning and identifying diagnostic biomarkers using three prominent deep learning neural network models: deep BiLSTM, reservoir SNN, and NeuCube. EEG data from datasets related to epilepsy, migraine, and healthy subjects are employed. Results reveal that BiLSTM hidden neurons capture biological significance, while reservoir SNN activities and NeuCube spiking dynamics identify EEG channels as diagnostic biomarkers. BiLSTM and reservoir SNN achieve 90 and 85% classification accuracy, while NeuCube achieves 97%, all methods pinpointing potential biomarkers like T6, F7, C4, and F8. The research bears implications for refining online EEG classification, analysis, and early brain state diagnosis, enhancing AI models with interpretability and discovery. The proposed techniques hold promise for streamlined brain-computer interfaces and clinical applications, representing a significant advancement in pattern discovery across the three most popular neural network methods for addressing a crucial problem. Further research is planned to study how early can these diagnostic biomarkers predict an onset of brain states.
Collapse
Affiliation(s)
| | | | - Abbas Tafakhori
- Department of Neurology, School of Medicine, Iranian Center of Neurological Research, Tehran University of Medical Sciences, Tehran, Iran
| | - Nikola Kirilov Kasabov
- School of Engineering, Computing and Mathematical Sciences, Auckland University of Technology, Auckland, New Zealand.
- Institute for Information and Communication Technology, Bulgarian Academy of Sciences, Sofia, Bulgaria.
- Computer Science and Engineering Department, Dalian University, Dalian, China.
| |
Collapse
|
5
|
Gu J, Jiang J, Ge S, Wang H. Capped L21-norm-based common spatial patterns for EEG signals classification applicable to BCI systems. Med Biol Eng Comput 2023; 61:1083-1092. [PMID: 36658415 DOI: 10.1007/s11517-023-02782-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Accepted: 01/06/2023] [Indexed: 01/21/2023]
Abstract
The common spatial patterns (CSP) technique is an effective strategy for the classification of multichannel electroencephalogram (EEG) signals. However, the objective function expression of the conventional CSP algorithm is based on the L2-norm, which makes the performance of the method easily affected by outliers and noise. In this paper, we consider a new extension to CSP, which is termed capped L21-norm-based common spatial patterns (CCSP-L21), by using the capped L21-norm rather than the L2-norm for robust modeling. L21-norm considers the L1-norm sum which largely alleviates the influence of outliers and noise for the sake of robustness. The capped norm is further used to mitigate the effects of extreme outliers whose signal amplitude is much higher than that of the normal signal. Moreover, a non-greedy iterative procedure is derived to solve the proposed objective function. The experimental results show that the proposed method achieves the highest average recognition rates on the three real data sets of BCI competitions, which are 91.67%, 85.07%, and 82.04%, respectively. Capped L21-norm-based common spatial patterns-a robust model for EEG signals classification.
Collapse
Affiliation(s)
- Jingyu Gu
- Key Laboratory of Child Development and Learning Science of Ministry of Education, School of Biological Science & Medical Engineering, Southeast University, Nanjing, 210096, Jiangsu, People's Republic of China
| | - Jiuchuan Jiang
- School of Information Engineering, Nanjing University of Finance and Economics, Nanjing, 210003, Jiangsu, People's Republic of China
| | - Sheng Ge
- Key Laboratory of Child Development and Learning Science of Ministry of Education, School of Biological Science & Medical Engineering, Southeast University, Nanjing, 210096, Jiangsu, People's Republic of China
| | - Haixian Wang
- Key Laboratory of Child Development and Learning Science of Ministry of Education, School of Biological Science & Medical Engineering, Southeast University, Nanjing, 210096, Jiangsu, People's Republic of China.
| |
Collapse
|
6
|
Nazir S, Khan HU, Shahzad S, García-Magariño I. Editorial on decision support system for development of intelligent applications. Soft comput 2022. [DOI: 10.1007/s00500-022-07390-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|