1
|
Gu C, Jin X, Zhu L, Yi H, Liu H, Yang X, Babiloni F, Kong W. Cross-session SSVEP brainprint recognition using attentive multi-sub-band depth identity embedding learning network. Cogn Neurodyn 2025; 19:15. [PMID: 39801915 PMCID: PMC11717760 DOI: 10.1007/s11571-024-10192-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Revised: 10/23/2024] [Accepted: 11/10/2024] [Indexed: 01/16/2025] Open
Abstract
Brainprint recognition technology, regarded as a promising biometric technology, encounters challenges stemming from the time-varied, low signal-to-noise ratio of brain signals, such as electroencephalogram (EEG). Steady-state visual evoked potentials (SSVEP) exhibit high signal-to-noise ratio and frequency locking, making them a promising paradigm for brainprint recognition. Consequently, the extraction of time-invariant identity information from SSVEP EEG signals is essential. In this paper, we propose an Attentive Multi-sub-band Depth Identity Embedding Learning Network for stable cross-session SSVEP brainprint recognition. To address the issue of low recognition accuracy across sessions, we introduce the Sub-band Attentive Frequency mechanism, which integrates the frequency-domain relevant characteristics of the SSVEP paradigm and focuses on exploring depth-frequency identity embedding information. Also, we employ Attentive Statistic Pooling to enhance the stability of frequency domain feature distributions across sessions. Extensive experimentation and validation were conducted on two multi-session SSVEP benchmark datasets. The experimental results show that our approach outperforms other state-of-art models on 2-second samples across sessions and has the potential to serve as a benchmark in multi-subject biometric recognition systems.
Collapse
Affiliation(s)
- Chengxian Gu
- School of Computer Science, Hangzhou Dianzi University, Hangzhou, 310018 Zhejiang China
- Zhejiang Key Laboratory of Brain-Machine Collaborative Intelligence, Hangzhou, 310018 Zhejiang China
| | - Xuanyu Jin
- School of Computer Science, Hangzhou Dianzi University, Hangzhou, 310018 Zhejiang China
- Zhejiang Key Laboratory of Brain-Machine Collaborative Intelligence, Hangzhou, 310018 Zhejiang China
| | - Li Zhu
- School of Computer Science, Hangzhou Dianzi University, Hangzhou, 310018 Zhejiang China
- Zhejiang Key Laboratory of Brain-Machine Collaborative Intelligence, Hangzhou, 310018 Zhejiang China
| | - Hangjie Yi
- School of Computer Science, Hangzhou Dianzi University, Hangzhou, 310018 Zhejiang China
- Zhejiang Key Laboratory of Brain-Machine Collaborative Intelligence, Hangzhou, 310018 Zhejiang China
| | - Honggang Liu
- School of Computer Science, Hangzhou Dianzi University, Hangzhou, 310018 Zhejiang China
- Zhejiang Key Laboratory of Brain-Machine Collaborative Intelligence, Hangzhou, 310018 Zhejiang China
| | - Xinyu Yang
- School of Computer Science, Hangzhou Dianzi University, Hangzhou, 310018 Zhejiang China
- Zhejiang Key Laboratory of Brain-Machine Collaborative Intelligence, Hangzhou, 310018 Zhejiang China
| | - Fabio Babiloni
- Department of Physiology and Pharmacology, University of Rome “Sapienza”, 00185 Rome, RM Italy
| | - Wanzeng Kong
- School of Computer Science, Hangzhou Dianzi University, Hangzhou, 310018 Zhejiang China
- Zhejiang Key Laboratory of Brain-Machine Collaborative Intelligence, Hangzhou, 310018 Zhejiang China
| |
Collapse
|
2
|
Yadav A, Hussain R, Shukla M, B J, Kalia R, Mary SP, Hsu CY, Mishra MK, Saleem K, El-Meligy M. Enhancing convolutional neural networks in electroencephalogram driver drowsiness detection using human inspired optimizers. Sci Rep 2025; 15:10842. [PMID: 40155444 PMCID: PMC11953301 DOI: 10.1038/s41598-025-93765-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2025] [Accepted: 03/10/2025] [Indexed: 04/01/2025] Open
Abstract
Driver drowsiness is a significant safety concern, contributing to numerous traffic accidents. To address this issue, researchers have explored electroencephalogram (EEG)-based detection systems. Due to the high-dimensional nature of EEG signals and the subtle temporal patterns of drowsiness, there is increasing recognition of the need for deep neural networks (DNNs) to capture the dynamics of drowsy driving better. Meanwhile, optimizing DNNs architectures remains a challenge, as training these models is an NP-hard problem. Meta-heuristic algorithms offer an alternative to traditional gradient-based optimizers for improving DNNs performance. This study investigates the use of two human-inspired algorithms-teaching learning-based optimization (TLBO) and student psychology-based optimization (SPBO)-to optimize convolutional neural networks (CNNs) for EEG-based drowsiness detection. Results demonstrate strong predictive performance for both CNN-TLBO and CNN-SPBO, with area under the curve values of 0.926 and 0.920, respectively. TLBO produced a simpler model with 4,145 parameters, whereas SPBO generated a more complex architecture with 264,065 parameters but completed optimization faster (116 vs. 148 min). Despite minor overfitting, SPBO's efficiency makes it a cost-effective solution. In general, our findings contribute to the advancement of driver monitoring systems and road safety while emphasizing the broader role of meta-heuristic techniques in deep learning optimization.
Collapse
Affiliation(s)
- Anupam Yadav
- Department of Computer Engineering and Application, GLA University, Mathura, Chaumuhan, 281406, India
| | - Rifat Hussain
- College of Administrative Sciences, Applied Science University, Al Eker, Bahrain
| | - Madhu Shukla
- Department of Computer Engineering, Marwadi University Research Center, Faculty of Engineering & Technology Marwadi University, Rajkot, Gujarat, 360003, India
| | - Jayaprakash B
- Department of Computer Science & IT, School of Sciences, JAIN (Deemed to Be University), Bangalore, Karnataka, India
| | - Rishiv Kalia
- Centre for Research Impact & Outcome, Chitkara University Institute of Engineering and Technology, Chitkara University, Rajpura, Punjab, 140401, India
| | - S Prince Mary
- Department of Computer Science and Engineering, Sathyabama Institute of Science and Technology, Chennai, Tamil Nadu, India
| | - Chou-Yi Hsu
- Thunderbird School of Global Management, Arizona State University, Tempe Campus, Phoenix, AZ, 85004, USA
| | | | - Kashif Saleem
- Department of Computer Science, College of Computer & Information Sciences, King Saud University, 11543, Riyadh, Saudi Arabia
| | - Mohammed El-Meligy
- Jadara University Research Center, Jadara University, PO Box 733, Irbid, Jordan
- Applied Science Research Center, Applied Science Private University, Amman, Jordan
| |
Collapse
|
3
|
Zhang G, Luck SJ. Assessing the impact of artifact correction and artifact rejection on the performance of SVM-based decoding of EEG signals. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.02.22.639684. [PMID: 40060477 PMCID: PMC11888300 DOI: 10.1101/2025.02.22.639684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 03/20/2025]
Abstract
Numerous studies have demonstrated that eyeblinks and other large artifacts can decrease the signal-to-noise ratio of EEG data, resulting in decreased statistical power for conventional univariate analyses. However, it is not clear whether eliminating these artifacts during preprocessing enhances the performance of multivariate pattern analysis (MVPA; decoding), especially given that artifact rejection reduces the number of trials available for training the decoder. This study aimed to evaluate the impact of artifact-minimization approaches on the decoding performance of support vector machines. Independent component analysis (ICA) was used to correct ocular artifacts, and artifact rejection was used to discard trials with large voltage deflections from other sources (e.g., muscle artifacts). We assessed decoding performance in relatively simple binary classification tasks using data from seven commonly-used event-related potential paradigms (N170, mismatch negativity, N2pc, P3b, N400, lateralized readiness potential, and error-related negativity), as well as more challenging multi-way decoding tasks, including stimulus location and stimulus orientation. The results indicated that the combination of artifact correction and rejection did not improve decoding performance in the vast majority of cases. However, artifact correction may still be essential to minimize artifact-related confounds that might artificially inflate decoding accuracy. Researchers who are decoding EEG data from paradigms, populations, and recording setups that are similar to those examined here may benefit from our recommendations to optimize decoding performance and avoid incorrect conclusions.
Collapse
Affiliation(s)
- Guanghui Zhang
- Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian, Liaoning, China
- Key Laboratory of Brain and Cognitive Neuroscience, Liaoning Province, Dalian, China
- Center for Mind & Brain, University of California-Davis, Davis, CA, USA
| | - Steven J Luck
- Center for Mind & Brain, University of California-Davis, Davis, CA, USA
| |
Collapse
|
4
|
Li X, Zhang Y, Peng Y, Kong W. Enhanced performance of EEG-based brain-computer interfaces by joint sample and feature importance assessment. Health Inf Sci Syst 2024; 12:9. [PMID: 38375134 PMCID: PMC10874355 DOI: 10.1007/s13755-024-00271-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Accepted: 01/07/2024] [Indexed: 02/21/2024] Open
Abstract
Electroencephalograph (EEG) has been a reliable data source for building brain-computer interface (BCI) systems; however, it is not reasonable to use the feature vector extracted from multiple EEG channels and frequency bands to perform recognition directly due to the two deficiencies. One is that EEG data is weak and non-stationary, which easily causes different EEG samples to have different quality. The other is that different feature dimensions corresponding to different brain regions and frequency bands have different correlations to a certain mental task, which is not sufficiently investigated. To this end, a Joint Sample and Feature importance Assessment (JSFA) model was proposed to simultaneously explore the different impacts of EEG samples and features in mental state recognition, in which the former is based on the self-paced learning technique while the latter is completed by the feature self-weighting technique. The efficacy of JSFA is extensively evaluated on two EEG data sets, i.e., SEED-IV and SEED-VIG. One is a classification task for emotion recognition and the other is a regression task for driving fatigue detection. Experimental results demonstrate that JSFA can effectively identify the importance of different EEG samples and features, leading to enhanced recognition performance of corresponding BCI systems.
Collapse
Affiliation(s)
- Xing Li
- School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, 310018 China
| | - Yikai Zhang
- School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, 310018 China
| | - Yong Peng
- School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, 310018 China
- Zhejiang Key Laboratory of Brain-Machine Collaborative Intelligence, Hangzhou, 310018 China
| | - Wanzeng Kong
- School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, 310018 China
- Zhejiang Key Laboratory of Brain-Machine Collaborative Intelligence, Hangzhou, 310018 China
| |
Collapse
|
5
|
Khosravi M, Parsaei H, Rezaee K, Helfroush MS. Fusing convolutional learning and attention-based Bi-LSTM networks for early Alzheimer's diagnosis from EEG signals towards IoMT. Sci Rep 2024; 14:26002. [PMID: 39472526 PMCID: PMC11522596 DOI: 10.1038/s41598-024-77876-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Accepted: 10/25/2024] [Indexed: 11/02/2024] Open
Abstract
The Internet of Medical Things (IoMT) is poised to play a pivotal role in future medical support systems, enabling pervasive health monitoring in smart cities. Alzheimer's disease (AD) afflicts millions globally, and this paper explores the potential of electroencephalogram (EEG) data in addressing this challenge. We propose the Convolutional Learning Attention-Bidirectional Time-Aware Long-Short-Term Memory (CL-ATBiLSTM) model, a deep learning approach designed to classify different AD phases through EEG data analysis. The model utilizes Discrete Wavelet Transform (DWT) to decompose EEG data into distinct frequency bands, allowing for targeted analysis of AD-related brain activity patterns. Additionally, the data is segmented into smaller windows to handle the dynamic nature of EEG signals, and these segments are transformed into spectrogram images, visually depicting brain activity distribution over time and frequency. The CL-ATBiLSTM model incorporates convolutional layers to capture spatial features, attention mechanisms to emphasize crucial data, and BiLSTM networks to explore temporal relationships within the sequences. To optimize the model's performance, Bayesian optimization is employed to fine-tune the hyperparameters of the ATBiLSTM network, enhancing its ability to generalize and accurately classify AD stages. Incorporating Bayesian learning ensures the most effective model configuration, improving sensitivity and specificity for identifying AD-related patterns. Our model extracts discriminative features from EEG data to differentiate between AD, Mild Cognitive Impairment (MCI), and healthy controls (CO), offering a more comprehensive approach than existing two-class detection algorithms. By including the MCI category, our method facilitates earlier identification and potentially more impactful therapy interventions. Achieving a 96.52% accuracy on Figshare datasets containing AD, MCI, and CO groups, our approach demonstrates strong potential for practical use, accelerating AD identification, enhancing patient care, and contributing to the development of targeted treatments for this debilitating condition.
Collapse
Affiliation(s)
- Mohamadreza Khosravi
- Department of Medical Physics and Engineering, School of Medicine, Shiraz University of Medical Sciences, Shiraz, Iran.
- IT Services, Lidoma Sanat Mehregan Part Ltd., Shiraz 71581, Fars, Iran.
- Shandong Provincial University Laboratory for Protected Horticulture (SPUL4PH), Weifang University of Science and Technology, Weifang 262700, China.
| | - Hossein Parsaei
- Department of Medical Physics and Engineering, School of Medicine, Shiraz University of Medical Sciences, Shiraz, Iran.
- Shiraz Neuroscience Research Center, Shiraz University of Medical Sciences, Shiraz, Iran.
| | - Khosro Rezaee
- Department of Biomedical Engineering, Meybod University, Meybod, Iran
| | | |
Collapse
|
6
|
Wang Z, Hu H, Zhou T, Xu T, Zhao X. Average Time Consumption per Character: A Practical Performance Metric for Generic Synchronous BCI Spellers. IEEE Trans Biomed Eng 2024; 71:2684-2698. [PMID: 38602850 DOI: 10.1109/tbme.2024.3387469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/13/2024]
Abstract
OBJECTIVE The information transfer rate (ITR) is widely accepted as a performance metric for generic brain-computer interface (BCI) spellers, while it is noticeable that the communication speed given by ITR is actually an upper bound which however can never be reached in real systems. A new performance metric is therefore needed. METHODS In this paper, a new metric named average time consumption per character (ATCPC) is proposed. It quantifies how long it takes on average to type one character using a typical synchronous BCI speller. To analytically derive ATCPC, the real typing process is modelled with a random walk on a graph. Misclassification and backspace are carefully characterized. A close-form formula of ATCPC is obtained through computing the hitting time of the random walk. The new metric is validated through simulated typing experiments and compared with ITR. RESULTS Firstly, the formula and simulation show a good consistency. Secondly, ITR always tends to overestimate the communication speed, while ATCPC is more realistic. CONCLUSION The proposed ATCPC metric is valid. SIGNIFICANCE ATCPC is a qualified substitute for ITR. ATCPC also reveals the great potential of keyboard optimization to further enhance the performance of BCI spellers, which was hardly investigated before.
Collapse
|
7
|
Messinis SC, Protonotarios NE, Doulamis N. Differentially Private Client Selection and Resource Allocation in Federated Learning for Medical Applications Using Graph Neural Networks. SENSORS (BASEL, SWITZERLAND) 2024; 24:5142. [PMID: 39204839 PMCID: PMC11360477 DOI: 10.3390/s24165142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/03/2024] [Revised: 07/30/2024] [Accepted: 08/05/2024] [Indexed: 09/04/2024]
Abstract
Federated learning (FL) has emerged as a pivotal paradigm for training machine learning models across decentralized devices while maintaining data privacy. In the healthcare domain, FL enables collaborative training among diverse medical devices and institutions, enhancing model robustness and generalizability without compromising patient privacy. In this paper, we propose DPS-GAT, a novel approach integrating graph attention networks (GATs) with differentially private client selection and resource allocation strategies in FL. Our methodology addresses the challenges of data heterogeneity and limited communication resources inherent in medical applications. By employing graph neural networks (GNNs), we effectively capture the relational structures among clients, optimizing the selection process and ensuring efficient resource distribution. Differential privacy mechanisms are incorporated, to safeguard sensitive information throughout the training process. Our extensive experiments, based on the Regensburg pediatric appendicitis open dataset, demonstrated the superiority of our approach, in terms of model accuracy, privacy preservation, and resource efficiency, compared to traditional FL methods. The ability of DPS-GAT to maintain a high and stable number of client selections across various rounds and differential privacy budgets has significant practical implications, indicating that FL systems can achieve strong privacy guarantees without compromising client engagement and model performance. This balance is essential for real-world applications where both privacy and performance are paramount. This study suggests a promising direction for more secure and efficient FL medical applications, which could improve patient care through enhanced predictive models and collaborative data utilization.
Collapse
Affiliation(s)
- Sotirios C. Messinis
- Institute of Communication and Computer Systems, National Technical University of Athens, 15773 Athens, Greece;
| | | | - Nikolaos Doulamis
- Institute of Communication and Computer Systems, National Technical University of Athens, 15773 Athens, Greece;
| |
Collapse
|
8
|
Zou R, Zhao L, He S, Zhou X, Yin X. Effect of the period of EEG signals on the decoding of motor information. Phys Eng Sci Med 2024; 47:249-260. [PMID: 38150057 DOI: 10.1007/s13246-023-01361-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Accepted: 11/22/2023] [Indexed: 12/28/2023]
Abstract
Decoding movement information from electroencephalogram to construct brain-computer interface has promising applications. The EEG data during the entire motor imagery (MI) period or movement execution (ME) period is generally decoded, and calculation of numerous information and massive dataset is time-consuming. In order to improve decoding efficiency, the joint topographic maps of the brain activation state of 15 subjects were studied during different periods. The results showed that the activation intensity of the preparation period in the motor imagery experiment was higher than during the exercise period, while during the exercise period, the activation intensity was higher than in the preparation period in the movement execution experiment. Hence, the wavelet neural network was used to decode the six-class movements including elbow flexion/extension, forearm pronation/supination and hand open/close in periods of MI/ME. The experimental results show that the accuracy obtained in the preparation period is the highest in the motor imagery experiment, which is 80.77%. On the other hand, the highest accuracy obtained in the exercise period of the movement execution experiment is 79.26%. It further proves that the optimized period is a key decoding factor to reduce the cost of calculation, and this new decoding method is effective to build a more intelligent brain-computer interface system.
Collapse
Affiliation(s)
- Renling Zou
- Department of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, China.
| | - Liang Zhao
- Department of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, China
| | - Shuang He
- Department of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, China
| | - Xiaobo Zhou
- Department of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, China
| | - Xuezhi Yin
- Shanghai Berry Electronic Technology Co., Ltd, Shanghai, China
| |
Collapse
|
9
|
Zhu H, Wu Y, Guo Y, Fu C, Shu F, Yu H, Chen W, Chen C. Towards Real-Time Sleep Stage Prediction and Online Calibration Based on Architecturally Switchable Deep Learning Models. IEEE J Biomed Health Inform 2024; 28:470-481. [PMID: 37878423 DOI: 10.1109/jbhi.2023.3327470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2023]
Abstract
Despite the recent advances in automatic sleep staging, few studies have focused on real-time sleep staging to promote the regulation of sleep or the intervention of sleep disorders. In this paper, a novel network named SwSleepNet, that can handle both precisely offline sleep staging, and online sleep stages prediction and calibration is proposed. For offline analysis, the proposed network coordinates sequence broadening module (SBM), sequential CNN (SCNN), squeeze and excitation (SE) block, and sequence consolidation module (SCM) to balance the operational efficiency of the network and the comprehensive feature extraction. For online analysis, only SCNN and SE are involved in predicting the sleep stage within a short-time segment of the recordings. Once more than two successive segments have disparate predictions, the calibration mechanism will be triggered, and contextual information will be involved. In addition, to investigate the appropriate time of the segment that is suitable to predict a sleep stage, segments with five-second, three-second, and two-second data are analyzed. The performance of SwSleepNet is validated on two publicly available datasets Sleep-EDF Expanded and Montreal Archive of Sleep Studies (MASS), and one clinical dataset Huashan Hospital Fudan University (HSFU), with the offline accuracy of 84.5%, 86.7%, and 81.8%, respectively, which outperforms the state-of-the-art methods. Additionally, for the online sleep staging, the dedicated calibration mechanism allows SwSleepNet to achieve high accuracy over 80% on three datasets with the short-time segments, demonstrating the robustness and stability of SwSleepNet. This study presents a real-time sleep staging architecture, which is expected to pave the way for accurate sleep regulation and intervention.
Collapse
|
10
|
Shang B, Duan F, Fu R, Gao J, Sik H, Meng X, Chang C. EEG-based investigation of effects of mindfulness meditation training on state and trait by deep learning and traditional machine learning. Front Hum Neurosci 2023; 17:1033420. [PMID: 37719770 PMCID: PMC10500069 DOI: 10.3389/fnhum.2023.1033420] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 06/16/2023] [Indexed: 09/19/2023] Open
Abstract
Introduction This study examines the state and trait effects of short-term mindfulness-based stress reduction (MBSR) training using convolutional neural networks (CNN) based deep learning methods and traditional machine learning methods, including shallow and deep ConvNets as well as support vector machine (SVM) with features extracted from common spatial pattern (CSP) and filter bank CSP (FBCSP). Methods We investigated the electroencephalogram (EEG) measurements of 11 novice MBSR practitioners (6 males, 5 females; mean age 35.7 years; 7 Asians and 4 Caucasians) during resting and meditation at early and late training stages. The classifiers are trained and evaluated using inter-subject, mix-subject, intra-subject, and subject-transfer classification strategies, each according to a specific application scenario. Results For MBSR state effect recognition, trait effect recognition using meditation EEG, and trait effect recognition using resting EEG, from shallow ConvNet classifier we get mix-subject/intra-subject classification accuracies superior to related previous studies for both novice and expert meditators with a variety of meditation types including yoga, Tibetan, and mindfulness, whereas from FBSCP + SVM classifier we get inter-subject classification accuracies of 68.50, 85.00, and 78.96%, respectively. Conclusion Deep learning is superior for state effect recognition of novice meditators and slightly inferior but still comparable for both state and trait effects recognition of expert meditators when compared to the literatures. This study supports previous findings that short-term meditation training has EEG-recognizable state and trait effects.
Collapse
Affiliation(s)
- Baoxiang Shang
- School of Biomedical Engineering, Shenzhen University Medical School, Shenzhen University, Shenzhen, China
- Department of Neurosurgery, Shenzhen University General Hospital, Shenzhen University, Shenzhen, China
| | - Feiyan Duan
- School of Biomedical Engineering, Shenzhen University Medical School, Shenzhen University, Shenzhen, China
- Deepbay Innovation Technology Corporation Ltd., Shenzhen, China
| | - Ruiqi Fu
- Shenzhen Key Laboratory of Smart Healthcare Engineering, Southern University of Science and Technology, Shenzhen, China
| | - Junling Gao
- Buddhist Practice and Counselling Science Lab, Centre of Buddhist Studies, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
| | - Hinhung Sik
- Buddhist Practice and Counselling Science Lab, Centre of Buddhist Studies, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
| | - Xianghong Meng
- Department of Neurosurgery, Shenzhen University General Hospital, Shenzhen University, Shenzhen, China
| | - Chunqi Chang
- School of Biomedical Engineering, Shenzhen University Medical School, Shenzhen University, Shenzhen, China
| |
Collapse
|
11
|
Zhang Y, Peng Y, Li J, Kong W. SIFIAE: An adaptive emotion recognition model with EEG feature-label inconsistency consideration. J Neurosci Methods 2023:109909. [PMID: 37399992 DOI: 10.1016/j.jneumeth.2023.109909] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Revised: 06/06/2023] [Accepted: 06/26/2023] [Indexed: 07/05/2023]
Abstract
BACKGROUND A common but easily overlooked affective overlap problem has not been received enough attention in electroencephalogram (EEG)-based emotion recognition research. In real life, affective overlap refers to the current emotional state of human being is sometimes influenced easily by his/her historical mood. In stimulus-evoked EEG collection experiment, due to the short rest interval in consecutive trials, the inner mechanisms of neural responses make subjects cannot switch their emotion state easily and quickly, which might lead to the affective overlap. For example, we might be still in sad state to some extent even if we are watching a comedy because we just saw a tragedy before. In pattern recognition, affective overlap usually means that there exists the feature-label inconsistency in EEG data. NEW METHODS To alleviate the impact of inconsistent EEG data, we introduce a variable to adaptively explore the sample inconsistency in emotion recognition model development. Then, we propose a semi-supervised emotion recognition model for joint sample inconsistency and feature importance exploration (SIFIAE). Accordingly, an efficient optimization method to SIFIAE model is proposed. RESULTS Extensive experiments on the SEED-V dataset demonstrate the effectiveness of SIFIAE. Specifically, SIFIAE achieves 69.10%, 67.01%, 71.5%, 73.26%, 72.07% and 71.35% average accuracies in six cross-session emotion recognition tasks. CONCLUSION The results illustrated that the sample weights have a rising trend in the beginning of most trials, which coincides with the affective overlap hypothesis. The feature importance factor indicated the critical bands and channels are more obvious compared with some models without considering EEG feature-label inconsistency.
Collapse
Affiliation(s)
- Yikai Zhang
- School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, 310018, Zhejiang Province, China
| | - Yong Peng
- School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, 310018, Zhejiang Province, China; Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province, Hangzhou, 310018, Zhejiang Province, China.
| | - Junhua Li
- School of Computer Science and Electronic Engineering, University of Essex, Colchester, CO4 3SQ, UK
| | - Wanzeng Kong
- School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, 310018, Zhejiang Province, China; Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province, Hangzhou, 310018, Zhejiang Province, China
| |
Collapse
|
12
|
Zhu H, Fu C, Shu F, Yu H, Chen C, Chen W. The Effect of Coupled Electroencephalography Signals in Electrooculography Signals on Sleep Staging Based on Deep Learning Methods. Bioengineering (Basel) 2023; 10:573. [PMID: 37237643 PMCID: PMC10215192 DOI: 10.3390/bioengineering10050573] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 04/20/2023] [Accepted: 04/26/2023] [Indexed: 05/28/2023] Open
Abstract
The influence of the coupled electroencephalography (EEG) signal in electrooculography (EOG) on EOG-based automatic sleep staging has been ignored. Since the EOG and prefrontal EEG are collected at close range, it is not clear whether EEG couples in EOG or not, and whether or not the EOG signal can achieve good sleep staging results due to its intrinsic characteristics. In this paper, the effect of a coupled EEG signal in an EOG signal on automatic sleep staging is explored. The blind source separation algorithm was used to extract a clean prefrontal EEG signal. Then the raw EOG signal and clean prefrontal EEG signal were processed to obtain EOG signals coupled with different EEG signal contents. Afterwards, the coupled EOG signals were fed into a hierarchical neural network, including a convolutional neural network and recurrent neural network for automatic sleep staging. Finally, an exploration was performed using two public datasets and one clinical dataset. The results showed that using a coupled EOG signal could achieve an accuracy of 80.4%, 81.1%, and 78.9% for the three datasets, slightly better than the accuracy of sleep staging using the EOG signal without coupled EEG. Thus, an appropriate content of coupled EEG signal in an EOG signal improved the sleep staging results. This paper provides an experimental basis for sleep staging with EOG signals.
Collapse
Affiliation(s)
- Hangyu Zhu
- School of Information Science and Technology, Fudan University, Shanghai 200433, China
| | - Cong Fu
- Huashan Hospital, Shanghai Medical College, Fudan University, Shanghai 200040, China
| | - Feng Shu
- Academy for Engineering and Technology, Fudan University, Shanghai 200433, China
| | - Huan Yu
- Huashan Hospital, Shanghai Medical College, Fudan University, Shanghai 200040, China
| | - Chen Chen
- Human Phenome Institute, Fudan University, Shanghai 201203, China
| | - Wei Chen
- School of Information Science and Technology, Fudan University, Shanghai 200433, China
- Human Phenome Institute, Fudan University, Shanghai 201203, China
| |
Collapse
|
13
|
Oikonomou VP, Georgiadis K, Kalaganis F, Nikolopoulos S, Kompatsiaris I. A Sparse Representation Classification Scheme for the Recognition of Affective and Cognitive Brain Processes in Neuromarketing. SENSORS (BASEL, SWITZERLAND) 2023; 23:2480. [PMID: 36904683 PMCID: PMC10007402 DOI: 10.3390/s23052480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Revised: 02/09/2023] [Accepted: 02/20/2023] [Indexed: 06/18/2023]
Abstract
In this work, we propose a novel framework to recognize the cognitive and affective processes of the brain during neuromarketing-based stimuli using EEG signals. The most crucial component of our approach is the proposed classification algorithm that is based on a sparse representation classification scheme. The basic assumption of our approach is that EEG features from a cognitive or affective process lie on a linear subspace. Hence, a test brain signal can be represented as a linear (or weighted) combination of brain signals from all classes in the training set. The class membership of the brain signals is determined by adopting the Sparse Bayesian Framework with graph-based priors over the weights of linear combination. Furthermore, the classification rule is constructed by using the residuals of linear combination. The experiments on a publicly available neuromarketing EEG dataset demonstrate the usefulness of our approach. For the two classification tasks offered by the employed dataset, namely affective state recognition and cognitive state recognition, the proposed classification scheme manages to achieve a higher classification accuracy compared to the baseline and state-of-the art methods (more than 8% improvement in classification accuracy).
Collapse
Affiliation(s)
- Vangelis P. Oikonomou
- Information Technologies Institute, Centre for Research and Technology Hellas, CERTH-ITI, 6th km Charilaou-Thermi Road, 57001 Thessaloniki, Greece
| | | | | | | | | |
Collapse
|
14
|
A Long Short-Term Memory Network Using Resting-State Electroencephalogram to Predict Outcomes Following Moderate Traumatic Brain Injury. COMPUTERS 2023. [DOI: 10.3390/computers12020045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/22/2023]
Abstract
Although traumatic brain injury (TBI) is a global public health issue, not all injuries necessitate additional hospitalisation. Thinking, memory, attention, personality, and movement can all be negatively impacted by TBI. However, only a small proportion of nonsevere TBIs necessitate prolonged observation. Clinicians would benefit from an electroencephalography (EEG)-based computational intelligence model for outcome prediction by having access to an evidence-based analysis that would allow them to securely discharge patients who are at minimal risk of TBI-related mortality. Despite the increasing popularity of EEG-based deep learning research to create predictive models with breakthrough performance, particularly in epilepsy prediction, its use in clinical decision making for the diagnosis and prognosis of TBI has not been as widely exploited. Therefore, utilising 60s segments of unprocessed resting-state EEG data as input, we suggest a long short-term memory (LSTM) network that can distinguish between improved and unimproved outcomes in moderate TBI patients. Complex feature extraction and selection are avoided in this architecture. The experimental results show that, with a classification accuracy of 87.50 ± 0.05%, the proposed prognostic model outperforms three related works. The results suggest that the proposed methodology is an efficient and reliable strategy to assist clinicians in creating an automated tool for predicting treatment outcomes from EEG signals.
Collapse
|
15
|
Peng Y, Liu H, Li J, Huang J, Lu BL, Kong W. Cross-Session Emotion Recognition by Joint Label-Common and Label-Specific EEG Features Exploration. IEEE Trans Neural Syst Rehabil Eng 2023; 31:759-768. [PMID: 37015629 DOI: 10.1109/tnsre.2022.3233109] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Since Electroencephalogram (EEG) is resistant to camouflage, it has been a reliable data source for objective emotion recognition. EEG is naturally multi-rhythm and multi-channel, based on which we can extract multiple features for further processing. In EEG-based emotion recognition, it is important to investigate whether there exist some common features shared by different emotional states, and the specific features associated with each emotional state. However, such fundamental problem is ignored by most of the existing studies. To this end, we propose a Joint label-Common and label-Specific Features Exploration (JCSFE) model for semi-supervised cross-session EEG emotion recognition in this paper. To be specific, JCSFE imposes the $\ell _{\text {2,1}}$ -norm on the projection matrix to explore the label-common EEG features and simultaneously the $\ell _{{1}}$ -norm is used to explore the label-specific EEG features. Besides, a graph regularization term is introduced to enforce the data local invariance property, i.e., similar EEG samples are encouraged to have the same emotional state. Results obtained from the SEED-IV and SEED-V emotional data sets experimentally demonstrate that JCSFE not only achieves superior emotion recognition performance in comparison with the state-of-the-art models but also provides us with a quantitative method to identify the label-common and label-specific EEG features in emotion recognition.
Collapse
|
16
|
Guo Z, Lin L. Cognitive physiological data analysis based on the XGBoost algorithm to realize positive perceptual sample classification. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2023. [DOI: 10.3233/jifs-222656] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Designers refer to existing product cases and innovate products to develop new products. However, when designers screen product cases, there is no user participation, which leads to the lack of user-side knowledge and emotional drive that is very important for design. Therefore, it is necessary to play the role of user emotional knowledge in promoting the whole design process. This paper proposes the concept of the positive perceptual sample, which applies the knowledge emotion integration of designers and users to the screening sample case stage at the beginning of the design process. This study is based on the lack of user-side knowledge and emotional drive of reference cases and integrates user emotion into the reference case screening process. Then, in the emotion measurement process, users’ cognitive data in the screening process are obtained through the eye-brain fusion cognitive experiment. Finally, the XGBoost algorithm is used to process feature index data to realize the classification and recognition of cognitive data and applied to the positive perceptual classification of products. The results show that the classification accuracy of physiological cognitive data with user emotional representation by the XGBoost algorithm is 90.87% . The results of cognitive data classification are applied to the screening of positive perceptual samples, and the satisfaction rate is 98.35% . The results show that the method proposed in this paper provides a new source of ideas for obtaining positive perceptual samples and can be applied to new product development.
Collapse
Affiliation(s)
- Zhuen Guo
- School of Mechanical Engineering, Guizhou University, Guiyang, Guizhou, China
| | - Li Lin
- School of Mechanical Engineering, Guizhou University, Guiyang, Guizhou, China
| |
Collapse
|
17
|
Yasemin M, Cruz A, Nunes UJ, Pires G. Single trial detection of error-related potentials in brain-machine interfaces: a survey and comparison of methods. J Neural Eng 2023; 20. [PMID: 36595316 DOI: 10.1088/1741-2552/acabe9] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 12/15/2022] [Indexed: 12/23/2022]
Abstract
Objective.Error-related potential (ErrP) is a potential elicited in the brain when humans perceive an error. ErrPs have been researched in a variety of contexts, such as to increase the reliability of brain-computer interfaces (BCIs), increase the naturalness of human-machine interaction systems, teach systems, as well as study clinical conditions. Still, there is a significant challenge in detecting ErrP from a single trial, which may hamper its effective use. The literature presents ErrP detection accuracies quite variable across studies, which raises the question of whether this variability depends more on classification pipelines or on the quality of elicited ErrPs (mostly directly related to the underlying paradigms).Approach.With this purpose, 11 datasets have been used to compare several classification pipelines which were selected according to the studies that reported online performance above 75%. We also analyze the effects of different steps of the pipelines, such as resampling, window selection, augmentation, feature extraction, and classification.Main results.From our analysis, we have found that shrinkage-regularized linear discriminant analysis is the most robust method for classification, and for feature extraction, using Fisher criterion beamformer spatial features and overlapped window averages result in better classification performance. The overall experimental results suggest that classification accuracy is highly dependent on user tasks in BCI experiments and on signal quality (in terms of ErrP morphology, signal-to-noise ratio (SNR), and discrimination).Significance.This study contributes to the BCI research field by responding to the need for a guideline that can direct researchers in designing ErrP-based BCI tasks by accelerating the design steps.
Collapse
Affiliation(s)
- Mine Yasemin
- Institute of Systems and Robotics (ISR-UC), University of Coimbra, Coimbra, Portugal.,Department of Electrical and Computer Engineering, University of Coimbra, Coimbra, Portugal
| | - Aniana Cruz
- Institute of Systems and Robotics (ISR-UC), University of Coimbra, Coimbra, Portugal
| | - Urbano J Nunes
- Institute of Systems and Robotics (ISR-UC), University of Coimbra, Coimbra, Portugal.,Department of Electrical and Computer Engineering, University of Coimbra, Coimbra, Portugal
| | - Gabriel Pires
- Institute of Systems and Robotics (ISR-UC), University of Coimbra, Coimbra, Portugal.,Engineering Department, Polytechnic Institute of Tomar, Tomar, Portugal
| |
Collapse
|
18
|
Walther D, Viehweg J, Haueisen J, Mäder P. A systematic comparison of deep learning methods for EEG time series analysis. Front Neuroinform 2023; 17:1067095. [PMID: 36911074 PMCID: PMC9995756 DOI: 10.3389/fninf.2023.1067095] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 01/30/2023] [Indexed: 02/25/2023] Open
Abstract
Analyzing time series data like EEG or MEG is challenging due to noisy, high-dimensional, and patient-specific signals. Deep learning methods have been demonstrated to be superior in analyzing time series data compared to shallow learning methods which utilize handcrafted and often subjective features. Especially, recurrent deep neural networks (RNN) are considered suitable to analyze such continuous data. However, previous studies show that they are computationally expensive and difficult to train. In contrast, feed-forward networks (FFN) have previously mostly been considered in combination with hand-crafted and problem-specific feature extractions, such as short time Fourier and discrete wavelet transform. A sought-after are easily applicable methods that efficiently analyze raw data to remove the need for problem-specific adaptations. In this work, we systematically compare RNN and FFN topologies as well as advanced architectural concepts on multiple datasets with the same data preprocessing pipeline. We examine the behavior of those approaches to provide an update and guideline for researchers who deal with automated analysis of EEG time series data. To ensure that the results are meaningful, it is important to compare the presented approaches while keeping the same experimental setup, which to our knowledge was never done before. This paper is a first step toward a fairer comparison of different methodologies with EEG time series data. Our results indicate that a recurrent LSTM architecture with attention performs best on less complex tasks, while the temporal convolutional network (TCN) outperforms all the recurrent architectures on the most complex dataset yielding a 8.61% accuracy improvement. In general, we found the attention mechanism to substantially improve classification results of RNNs. Toward a light-weight and online learning-ready approach, we found extreme learning machines (ELM) to yield comparable results for the less complex tasks.
Collapse
Affiliation(s)
- Dominik Walther
- Data-Intensive Systems and Visualization Group (dAI.SY), Technische Universität Ilmenau, Ilmenau, Germany
| | - Johannes Viehweg
- Data-Intensive Systems and Visualization Group (dAI.SY), Technische Universität Ilmenau, Ilmenau, Germany
| | - Jens Haueisen
- Institute of Biomedical Engineering and Informatics, Technische Universität Ilmenau, Ilmenau, Germany
| | - Patrick Mäder
- Data-Intensive Systems and Visualization Group (dAI.SY), Technische Universität Ilmenau, Ilmenau, Germany.,Faculty of Biological Sciences, Friedrich Schiller University, Jena, Germany
| |
Collapse
|
19
|
Jaipriya D, Sriharipriya KC. A comparative analysis of masking empirical mode decomposition and a neural network with feed-forward and back propagation along with masking empirical mode decomposition to improve the classification performance for a reliable brain-computer interface. Front Comput Neurosci 2022; 16:1010770. [PMID: 36405787 PMCID: PMC9672820 DOI: 10.3389/fncom.2022.1010770] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 10/03/2022] [Indexed: 02/25/2024] Open
Abstract
In general, extraction and classification are used in various fields like image processing, pattern recognition, signal processing, and so on. Extracting effective characteristics from raw electroencephalogram (EEG) signals is a crucial role of the brain-computer interface for motor imagery. Recently, there has been a great deal of focus on motor imagery in the EEG signals since they encode a person's intent to do an action. Researchers have been using MI signals to assist paralyzed people and even move them on their own with certain equipment, like wheelchairs. As a result, proper decoding is an important step required for the interconnection of the brain and the computer. EEG decoding is a challenging process because of poor SNR, complexity, and other reasons. However, choosing an appropriate method to extract the features to improve the performance of motor imagery recognition is still a research hotspot. To extract the features of the EEG signal in the classification task, this paper proposes a Masking Empirical Mode Decomposition (MEMD) based Feed Forward Back Propagation Neural Network (MEMD-FFBPNN). The dataset consists of EEG signals which are first normalized using the minimax method and given as input to the MEMD to extract the features and then given to the FFBPNN to classify the tasks. The accuracy of the proposed method MEMD-FFBPNN has been measured using the confusion matrix, mean square error and which has been recorded up to 99.9%. Thus, the proposed method gives better accuracy than the other conventional methods.
Collapse
Affiliation(s)
| | - K. C. Sriharipriya
- School of Electronics Engineering, Vellore Institute of Technology, Vellore, India
| |
Collapse
|
20
|
Alouani AT, Elfouly T. Traumatic Brain Injury (TBI) Detection: Past, Present, and Future. Biomedicines 2022; 10:biomedicines10102472. [PMID: 36289734 PMCID: PMC9598576 DOI: 10.3390/biomedicines10102472] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 09/28/2022] [Accepted: 09/30/2022] [Indexed: 11/16/2022] Open
Abstract
Traumatic brain injury (TBI) can produce temporary biochemical imbalance due to leaks through cell membranes or disruption of the axoplasmic flow due to the misalignment of intracellular neurofilaments. If untreated, TBI can lead to Alzheimer's, Parkinson's, or total disability. Mild TBI (mTBI) accounts for about about 90 percent of all TBI cases. The detection of TBI as soon as it happens is crucial for successful treatment management. Neuroimaging-based tests provide only a structural and functional mapping of the brain with poor temporal resolution. Such tests may not detect mTBI. On the other hand, the electroencephalogram (EEG) provides good spatial resolution and excellent temporal resolution of the brain activities beside its portability and low cost. The objective of this paper is to provide clinicians and scientists with a one-stop source of information to quickly learn about the different technologies used for TBI detection, their advantages and limitations. Our research led us to conclude that even though EEG-based TBI detection is potentially a powerful technology, it is currently not able to detect the presence of a mTBI with high confidence. The focus of the paper is to review existing approaches and provide the reason for the unsuccessful state of EEG-based detection of mTBI.
Collapse
|
21
|
EEG-Based Alzheimer's Disease Recognition Using Robust-PCA and LSTM Recurrent Neural Network. SENSORS 2022; 22:s22103696. [PMID: 35632105 PMCID: PMC9145212 DOI: 10.3390/s22103696] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 05/02/2022] [Accepted: 05/10/2022] [Indexed: 01/27/2023]
Abstract
The use of electroencephalography (EEG) has recently grown as a means to diagnose neurodegenerative pathologies such as Alzheimer's disease (AD). AD recognition can benefit from machine learning methods that, compared with traditional manual diagnosis methods, have higher reliability and improved recognition accuracy, being able to manage large amounts of data. Nevertheless, machine learning methods may exhibit lower accuracies when faced with incomplete, corrupted, or otherwise missing data, so it is important do develop robust pre-processing techniques do deal with incomplete data. The aim of this paper is to develop an automatic classification method that can still work well with EEG data affected by artifacts, as can arise during the collection with, e.g., a wireless system that can lose packets. We show that a recurrent neural network (RNN) can operate successfully even in the case of significantly corrupted data, when it is pre-filtered by the robust principal component analysis (RPCA) algorithm. RPCA was selected because of its stated ability to remove outliers from the signal. To demonstrate this idea, we first develop an RNN which operates on EEG data, properly processed through traditional PCA; then, we use corrupted data as input and process them with RPCA to filter outlier components, showing that even with data corruption causing up to 20% erasures, the RPCA was able to increase the detection accuracy by about 5% with respect to the baseline PCA.
Collapse
|
22
|
Mukhtar H, Qaisar SM, Zaguia A. Deep Convolutional Neural Network Regularization for Alcoholism Detection Using EEG Signals. SENSORS (BASEL, SWITZERLAND) 2021; 21:5456. [PMID: 34450899 PMCID: PMC8402228 DOI: 10.3390/s21165456] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Revised: 08/05/2021] [Accepted: 08/09/2021] [Indexed: 12/31/2022]
Abstract
Alcoholism is attributed to regular or excessive drinking of alcohol and leads to the disturbance of the neuronal system in the human brain. This results in certain malfunctioning of neurons that can be detected by an electroencephalogram (EEG) using several electrodes on a human skull at appropriate positions. It is of great interest to be able to classify an EEG activity as that of a normal person or an alcoholic person using data from the minimum possible electrodes (or channels). Due to the complex nature of EEG signals, accurate classification of alcoholism using only a small dataset is a challenging task. Artificial neural networks, specifically convolutional neural networks (CNNs), provide efficient and accurate results in various pattern-based classification problems. In this work, we apply CNN on raw EEG data and demonstrate how we achieved 98% average accuracy by optimizing a baseline CNN model and outperforming its results in a range of performance evaluation metrics on the University of California at Irvine Machine Learning (UCI-ML) EEG dataset. This article explains the stepwise improvement of the baseline model using the dropout, batch normalization, and kernel regularization techniques and provides a comparison of the two models that can be beneficial for aspiring practitioners who aim to develop similar classification models in CNN. A performance comparison is also provided with other approaches using the same dataset.
Collapse
Affiliation(s)
- Hamid Mukhtar
- Department of Computer Science, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia;
| | - Saeed Mian Qaisar
- Electrical and Computer Engineering Department, College of Engineering, Effat University, Jeddah 22332, Saudi Arabia;
- Communication and Signal Processing Lab, Energy and Technology Research Centre, Effat University, Jeddah 22332, Saudi Arabia
| | - Atef Zaguia
- Department of Computer Science, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia;
| |
Collapse
|