1
|
Gordienko Y, Gordienko N, Taran V, Rojbi A, Telenyk S, Stirenko S. Effect of natural and synthetic noise data augmentation on physical action classification by brain-computer interface and deep learning. Front Neuroinform 2025; 19:1521805. [PMID: 40083893 PMCID: PMC11903462 DOI: 10.3389/fninf.2025.1521805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2024] [Accepted: 02/13/2025] [Indexed: 03/16/2025] Open
Abstract
Analysis of electroencephalography (EEG) signals gathered by brain-computer interface (BCI) recently demonstrated that deep neural networks (DNNs) can be effectively used for investigation of time sequences for physical actions (PA) classification. In this study, the relatively simple DNN with fully connected network (FCN) components and convolutional neural network (CNN) components was considered to classify finger-palm-hand manipulations each from the grasp-and-lift (GAL) dataset. The main aim of this study was to imitate and investigate environmental influence by the proposed noise data augmentation (NDA) of two kinds: (i) natural NDA by inclusion of noise EEG data from neighboring regions by increasing the sampling size N and the different offset values for sample labeling and (ii) synthetic NDA by adding the generated Gaussian noise. The natural NDA by increasing N leads to the higher micro and macro area under the curve (AUC) for receiver operating curve values for the bigger N values than usage of synthetic NDA. The detrended fluctuation analysis (DFA) was applied to investigate the fluctuation properties and calculate the correspondent Hurst exponents H for the quantitative characterization of the fluctuation variability. H values for the low time window scales (< 2 s) are higher in comparison with ones for the bigger time window scales. For example, H more than 2-3 times higher for some PAs, i.e., it means that the shorter EEG fragments (< 2 s) demonstrate the scaling behavior of the higher complexity than the longer fragments. As far as these results were obtained by the relatively small DNN with the low resource requirements, this approach can be promising for porting such models to Edge Computing infrastructures on devices with the very limited computational resources.
Collapse
Affiliation(s)
- Yuri Gordienko
- Computer Engineering Department, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute,”Kyiv, Ukraine
| | - Nikita Gordienko
- Computer Engineering Department, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute,”Kyiv, Ukraine
| | - Vladyslav Taran
- Computer Engineering Department, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute,”Kyiv, Ukraine
| | - Anis Rojbi
- Laboratoire Cognitions Humaine et Artificielle, Université Paris 8, Paris, France
| | - Sergii Telenyk
- Computer Engineering Department, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute,”Kyiv, Ukraine
- Department of Automation and Computer Science, Faculty of Electrical and Computer Engineering, Cracow University of Technology, Cracow, Poland
| | - Sergii Stirenko
- Computer Engineering Department, National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute,”Kyiv, Ukraine
| |
Collapse
|
2
|
Borra D, Magosso E, Ravanelli M. A protocol for trustworthy EEG decoding with neural networks. Neural Netw 2025; 182:106847. [PMID: 39549492 DOI: 10.1016/j.neunet.2024.106847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 10/03/2024] [Accepted: 10/23/2024] [Indexed: 11/18/2024]
Abstract
Deep learning solutions have rapidly emerged for EEG decoding, achieving state-of-the-art performance on a variety of decoding tasks. Despite their high performance, existing solutions do not fully address the challenge posed by the introduction of many hyperparameters, defining data pre-processing, network architecture, network training, and data augmentation. Automatic hyperparameter search is rarely performed and limited to network-related hyperparameters. Moreover, pipelines are highly sensitive to performance fluctuations due to random initialization, hindering their reliability. Here, we design a comprehensive protocol for EEG decoding that explores the hyperparameters characterizing the entire pipeline and that includes multi-seed initialization for providing robust performance estimates. Our protocol is validated on 9 datasets about motor imagery, P300, SSVEP, including 204 participants and 26 recording sessions, and on different deep learning models. We accompany our protocol with extensive experiments on the main aspects influencing it, such as the number of participants used for hyperparameter search, the split into sequential simpler searches (multi-step search), the use of informed vs. non-informed search algorithms, and the number of random seeds for obtaining stable performance. The best protocol included 2-step hyperparameter search via an informed search algorithm, with the final training and evaluation performed using 10 random initializations. The optimal trade-off between performance and computational time was achieved by using a subset of 3-5 participants for hyperparameter search. Our protocol consistently outperformed baseline state-of-the-art pipelines, widely across datasets and models, and could represent a standard approach for neuroscientists for decoding EEG in a trustworthy and reliable way.
Collapse
Affiliation(s)
- Davide Borra
- Department of Electrical, Electronic and Information Engineering "Guglielmo Marconi" (DEI), University of Bologna, Cesena, Forlì-Cesena, Italy.
| | - Elisa Magosso
- Department of Electrical, Electronic and Information Engineering "Guglielmo Marconi" (DEI), University of Bologna, Cesena, Forlì-Cesena, Italy
| | - Mirco Ravanelli
- Department of Computer Science and Software Engineering, Concordia University, Montreal, Quebec, Canada; Mila - Quebec AI Institute, Montreal, Quebec, Canada
| |
Collapse
|
3
|
Borra D, Paissan F, Ravanelli M. SpeechBrain-MOABB: An open-source Python library for benchmarking deep neural networks applied to EEG signals. Comput Biol Med 2024; 182:109097. [PMID: 39265481 DOI: 10.1016/j.compbiomed.2024.109097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2024] [Revised: 08/20/2024] [Accepted: 08/30/2024] [Indexed: 09/14/2024]
Abstract
Deep learning has revolutionized EEG decoding, showcasing its ability to outperform traditional machine learning models. However, unlike other fields, EEG decoding lacks comprehensive open-source libraries dedicated to neural networks. Existing tools (MOABB and braindecode) prevent the creation of robust and complete decoding pipelines, as they lack support for hyperparameter search across the entire pipeline, and are sensitive to fluctuations in results due to network random initialization. Furthermore, the absence of a standardized experimental protocol exacerbates the reproducibility crisis in the field. To address these limitations, we introduce SpeechBrain-MOABB, a novel open-source toolkit carefully designed to facilitate the development of a comprehensive EEG decoding pipeline based on deep learning. SpeechBrain-MOABB incorporates a complete experimental protocol that standardizes critical phases, such as hyperparameter search and model evaluation. It natively supports multi-step hyperparameter search for finding the optimal hyperparameters in a high-dimensional space defined by the entire pipeline, and multi-seed training and evaluation for obtaining performance estimates robust to the variability caused by random initialization. SpeechBrain-MOABB outperforms other libraries, including MOABB and braindecode, with accuracy improvements of 14.9% and 25.2% (on average), respectively. By enabling easy-to-use and easy-to-share decoding pipelines, our toolkit can be exploited by neuroscientists for decoding EEG with neural networks in a replicable and trustworthy way.
Collapse
Affiliation(s)
- Davide Borra
- Department of Electrical, Electronic and Information Engineering "Guglielmo Marconi" (DEI), University of Bologna, Cesena, Forlì-Cesena, Italy.
| | | | - Mirco Ravanelli
- Department of Computer Science and Software Engineering, Concordia University, Montreal, Quebec, Canada; Mila - Quebec AI Institute, Montreal, Quebec, Canada
| |
Collapse
|
4
|
Palanisamy KK, Rengaraj A. Detection of Anxiety-Based Epileptic Seizures in EEG Signals Using Fuzzy Features and Parrot Optimization-Tuned LSTM. Brain Sci 2024; 14:848. [PMID: 39199539 PMCID: PMC11352876 DOI: 10.3390/brainsci14080848] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2024] [Revised: 08/06/2024] [Accepted: 08/17/2024] [Indexed: 09/01/2024] Open
Abstract
In humans, epilepsy is diagnosed through electroencephalography (EEG) signals. Epileptic seizures (ESs) arise due to anxiety. The detection of anxiety-based seizures is challenging for radiologists, and there is a limited availability of anxiety-based EEG signals. Data augmentation methods are required to increase the number of novel samples. An epileptic seizure arises due to anxiety, which manifests as variations in EEG signal patterns consisting of changes in the size and shape of the signal. In this study, anxiety EEG signals were synthesized by applying data augmentation methods such as random data augmentation (RDA) to existing epileptic seizure signals from the Bonn EEG dataset. The data-augmented anxiety seizure signals were processed using three algorithms-(i) fuzzy C-means-particle swarm optimization-long short-term memory (FCM-PS-LSTM), (ii) particle swarm optimization-long short-term memory (PS-LSTM), and (iii) parrot optimization LSTM (PO-LSTM)-for the detection of anxiety ESs via EEG signals. The predicted accuracies of detecting ESs through EEG signals using the proposed algorithms-namely, (i) FCM-PS-LSTM, (ii) PS-LSTM, and (iii) PO-LSTM-were about 98%, 98.5%, and 96%, respectively.
Collapse
Affiliation(s)
| | - Arthi Rengaraj
- Department of ECE, Faculty of Engineering & Technology, SRM Institute of Science and Technology, Ramapuram Campus, Ramapuram, Chennai 600089, India;
| |
Collapse
|
5
|
Nikouei M, Abdali-Mohammadi F. A novel method for modeling effective connections between brain regions based on EEG signals and graph neural networks for motor imagery detection. Comput Methods Biomech Biomed Engin 2024; 27:1430-1447. [PMID: 37548428 DOI: 10.1080/10255842.2023.2244110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Revised: 06/07/2023] [Accepted: 07/28/2023] [Indexed: 08/08/2023]
Abstract
Classified as biomedical signal processing, cerebral signal processing plays a key role in human-computer interaction (HCI) and medical diagnosis. The motor imagery (MI) problem is an important research area in this field. Accurate solutions to this problem will greatly affect real-world applications. Most of the proposed methods are based on raw signal processing techniques. Known as prior knowledge, the structural-functional information and interregional connections can improve signal processing accuracy. It is possible to correctly perceive the generated signals by considering the brain structure (i.e. anatomical units), the source of signals, and the structural-functional dependence of different brain regions (i.e. effective connection) that are the semantic generators of signals. This study employed electroencephalograph (EEG) signals based on the activity of brain regions (cortex) and effective connections between brain regions based on dynamic causal modeling to solve the MI problem. EEG signals, as well as effective connections between brain regions to improve the interpretability of MI action, were fed into the architecture of Graph Convolutional Neural Network (GCN). The proposed model allowed GCN to extract more discriminative features. The results indicated that the proposed method was successful in developing a model with a MI detection accuracy of 93.73%.
Collapse
Affiliation(s)
- Mahya Nikouei
- Department of Computer Engineering and Information Technology, Razi University, Kermanshah, Iran
| | - Fardin Abdali-Mohammadi
- Department of Computer Engineering and Information Technology, Razi University, Kermanshah, Iran
| |
Collapse
|
6
|
Lee MR, Kao MH, Hsieh YC, Sun M, Tang KT, Wang JY, Ho CC, Shih JY, Yu CJ. Cross-site validation of lung cancer diagnosis by electronic nose with deep learning: a multicenter prospective study. Respir Res 2024; 25:203. [PMID: 38730430 PMCID: PMC11084132 DOI: 10.1186/s12931-024-02840-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Accepted: 05/06/2024] [Indexed: 05/12/2024] Open
Abstract
BACKGROUND Although electronic nose (eNose) has been intensively investigated for diagnosing lung cancer, cross-site validation remains a major obstacle to be overcome and no studies have yet been performed. METHODS Patients with lung cancer, as well as healthy control and diseased control groups, were prospectively recruited from two referral centers between 2019 and 2022. Deep learning models for detecting lung cancer with eNose breathprint were developed using training cohort from one site and then tested on cohort from the other site. Semi-Supervised Domain-Generalized (Semi-DG) Augmentation (SDA) and Noise-Shift Augmentation (NSA) methods with or without fine-tuning was applied to improve performance. RESULTS In this study, 231 participants were enrolled, comprising a training/validation cohort of 168 individuals (90 with lung cancer, 16 healthy controls, and 62 diseased controls) and a test cohort of 63 individuals (28 with lung cancer, 10 healthy controls, and 25 diseased controls). The model has satisfactory results in the validation cohort from the same hospital while directly applying the trained model to the test cohort yielded suboptimal results (AUC, 0.61, 95% CI: 0.47─0.76). The performance improved after applying data augmentation methods in the training cohort (SDA, AUC: 0.89 [0.81─0.97]; NSA, AUC:0.90 [0.89─1.00]). Additionally, after applying fine-tuning methods, the performance further improved (SDA plus fine-tuning, AUC:0.95 [0.89─1.00]; NSA plus fine-tuning, AUC:0.95 [0.90─1.00]). CONCLUSION Our study revealed that deep learning models developed for eNose breathprint can achieve cross-site validation with data augmentation and fine-tuning. Accordingly, eNose breathprints emerge as a convenient, non-invasive, and potentially generalizable solution for lung cancer detection. CLINICAL TRIAL REGISTRATION This study is not a clinical trial and was therefore not registered.
Collapse
Affiliation(s)
- Meng-Rui Lee
- Department of Internal Medicine, National Taiwan University Hospital, Taipei, Taiwan
- Department of Internal Medicine, National Taiwan University Hospital Hsin-Chu Branch, Hsin-Chu, Taiwan
| | - Mu-Hsiang Kao
- Department. of Electrical Engineering, National Tsing Hua University, No. 101, Sec. 2, Kuang-Fu Road, Hsinchu, 30013, Taiwan
| | - Ya-Chu Hsieh
- Department. of Electrical Engineering, National Tsing Hua University, No. 101, Sec. 2, Kuang-Fu Road, Hsinchu, 30013, Taiwan
| | - Min Sun
- Department. of Electrical Engineering, National Tsing Hua University, No. 101, Sec. 2, Kuang-Fu Road, Hsinchu, 30013, Taiwan.
| | - Kea-Tiong Tang
- Department. of Electrical Engineering, National Tsing Hua University, No. 101, Sec. 2, Kuang-Fu Road, Hsinchu, 30013, Taiwan.
| | - Jann-Yuan Wang
- Department of Internal Medicine, National Taiwan University Hospital, Taipei, Taiwan
| | - Chao-Chi Ho
- Department of Internal Medicine, National Taiwan University Hospital, Taipei, Taiwan
| | - Jin-Yuan Shih
- Department of Internal Medicine, National Taiwan University Hospital, Taipei, Taiwan
| | - Chong-Jen Yu
- Department of Internal Medicine, National Taiwan University Hospital, Taipei, Taiwan
- Department of Internal Medicine, National Taiwan University Hospital Hsin-Chu Branch, Hsin-Chu, Taiwan
| |
Collapse
|
7
|
Akuthota S, K R, Ravichander J. Artifact removal and motor imagery classification in EEG using advanced algorithms and modified DNN. Heliyon 2024; 10:e27198. [PMID: 38560190 PMCID: PMC10980936 DOI: 10.1016/j.heliyon.2024.e27198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2024] [Revised: 02/21/2024] [Accepted: 02/26/2024] [Indexed: 04/04/2024] Open
Abstract
This paper presents an advanced approach for EEG artifact removal and motor imagery classification using a combination of Four Class Iterative Filtering and Filter Bank Common Spatial Pattern Algorithm with a Modified Deep Neural Network (DNN) classifier. The research aims to enhance the accuracy and reliability of BCI systems by addressing the challenges posed by EEG artifacts and complex motor imagery tasks. The methodology begins by introducing FCIF, a novel technique for ocular artifact removal, utilizing iterative filtering and filter banks. FCIF's mathematical formulation allows for effective artifact mitigation, thereby improving the quality of EEG data. In tandem, the FC-FBCSP algorithm is introduced, extending the Filter Bank Common Spatial Pattern approach to handle four-class motor imagery classification. The Modified DNN classifier enhances the discriminatory power of the FC-FBCSP features, optimizing the classification process. The paper showcases a comprehensive experimental setup, featuring the utilization of BCI Competition IV Dataset 2a & 2b. Detailed preprocessing steps, including filtering and feature extraction, are presented with mathematical rigor. Results demonstrate the remarkable artifact removal capabilities of FCIF and the classification prowess of FC-FBCSP combined with the Modified DNN classifier. Comparative analysis highlights the superiority of the proposed approach over baseline methods and the method achieves the mean accuracy of 98.575%.
Collapse
Affiliation(s)
- Srinath Akuthota
- Department of Electronics & Communication Engineering, SR University, Warangal-506371, Telangana, India
| | - RajKumar K
- Department of Electronics & Communication Engineering, SR University, Warangal-506371, Telangana, India
| | - Janapati Ravichander
- Department of Electronics & Communication Engineering, SR University, Warangal-506371, Telangana, India
| |
Collapse
|
8
|
Wang W, Li B, Wang H, Wang X, Qin Y, Shi X, Liu S. EEG-FMCNN: A fusion multi-branch 1D convolutional neural network for EEG-based motor imagery classification. Med Biol Eng Comput 2024; 62:107-120. [PMID: 37728715 DOI: 10.1007/s11517-023-02931-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Accepted: 09/07/2023] [Indexed: 09/21/2023]
Abstract
Motor imagery (MI) electroencephalogram (EEG) signal is recognized as a promising paradigm for brain-computer interface (BCI) systems and has been extensively employed in various BCI applications, including assisting disabled individuals, controlling devices and environments, and enhancing human capabilities. The high-performance decoding capability of MI-EEG signals is a key issue that impacts the development of the industry. However, decoding MI-EEG signals is challenging due to the low signal-to-noise ratio and inter-subject variability. In response to the aforementioned core problems, this paper proposes a novel end-to-end network, a fusion multi-branch 1D convolutional neural network (EEG-FMCNN), to decode MI-EEG signals without pre-processing. The utilization of multi-branch 1D convolution not only exhibits a certain level of noise tolerance but also addresses the issue of inter-subject variability to some extent. This is attributed to the ability of multi-branch architectures to capture information from different frequency bands, enabling the establishment of optimal convolutional scales and depths. Furthermore, we incorporate 1D squeeze-and-excitation (SE) blocks and shortcut connections at appropriate locations to further enhance the generalization and robustness of the network. In the BCI Competition IV-2a dataset, our proposed model has obtained good experimental results, achieving accuracies of 78.82% and 68.41% for subject-dependent and subject-independent modes, respectively. In addition, extensive ablative experiments and fine-tuning experiments were conducted, resulting in a notable 7% improvement in the average performance of the network, which holds significant implications for the generalization and application of the network.
Collapse
Affiliation(s)
- Wenlong Wang
- The School of Electrical Engineering, Shanghai Dianji University, Shanghai, 201306, China
- Intelligent Decision and Control Technology Institute, Shanghai Dianji University, Shanghai, 201306, China
| | - Baojiang Li
- The School of Electrical Engineering, Shanghai Dianji University, Shanghai, 201306, China.
- Intelligent Decision and Control Technology Institute, Shanghai Dianji University, Shanghai, 201306, China.
| | - Haiyan Wang
- The School of Electrical Engineering, Shanghai Dianji University, Shanghai, 201306, China
- Intelligent Decision and Control Technology Institute, Shanghai Dianji University, Shanghai, 201306, China
| | - Xichao Wang
- The School of Electrical Engineering, Shanghai Dianji University, Shanghai, 201306, China
- Intelligent Decision and Control Technology Institute, Shanghai Dianji University, Shanghai, 201306, China
| | - Yuxin Qin
- The School of Electrical Engineering, Shanghai Dianji University, Shanghai, 201306, China
- Intelligent Decision and Control Technology Institute, Shanghai Dianji University, Shanghai, 201306, China
| | - Xingbin Shi
- The School of Electrical Engineering, Shanghai Dianji University, Shanghai, 201306, China
- Intelligent Decision and Control Technology Institute, Shanghai Dianji University, Shanghai, 201306, China
| | - Shuxin Liu
- The School of Electrical Engineering, Shanghai Dianji University, Shanghai, 201306, China
- The Key Laboratory of Cognitive Computing and Intelligent Information Processing of Fujian Education Institutions (Wuyi University), Fujian, 354300, China
| |
Collapse
|
9
|
Aldawsari H, Al-Ahmadi S, Muhammad F. Optimizing 1D-CNN-Based Emotion Recognition Process through Channel and Feature Selection from EEG Signals. Diagnostics (Basel) 2023; 13:2624. [PMID: 37627883 PMCID: PMC10453543 DOI: 10.3390/diagnostics13162624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 07/28/2023] [Accepted: 07/29/2023] [Indexed: 08/27/2023] Open
Abstract
EEG-based emotion recognition has numerous real-world applications in fields such as affective computing, human-computer interaction, and mental health monitoring. This offers the potential for developing IOT-based, emotion-aware systems and personalized interventions using real-time EEG data. This study focused on unique EEG channel selection and feature selection methods to remove unnecessary data from high-quality features. This helped improve the overall efficiency of a deep learning model in terms of memory, time, and accuracy. Moreover, this work utilized a lightweight deep learning method, specifically one-dimensional convolutional neural networks (1D-CNN), to analyze EEG signals and classify emotional states. By capturing intricate patterns and relationships within the data, the 1D-CNN model accurately distinguished between emotional states (HV/LV and HA/LA). Moreover, an efficient method for data augmentation was used to increase the sample size and observe the performance deep learning model using additional data. The study conducted EEG-based emotion recognition tests on SEED, DEAP, and MAHNOB-HCI datasets. Consequently, this approach achieved mean accuracies of 97.6, 95.3, and 89.0 on MAHNOB-HCI, SEED, and DEAP datasets, respectively. The results have demonstrated significant potential for the implementation of a cost-effective IoT device to collect EEG signals, thereby enhancing the feasibility and applicability of the data.
Collapse
Affiliation(s)
- Haya Aldawsari
- Department of Computer Science, College of Arts and Science, Prince Sattam bin Abdulaziz University, Al-Kharj 16278, Saudi Arabia;
| | - Saad Al-Ahmadi
- Center of Excellence in Information Assurance (CoEIA), King Saud University, Riyadh 11543, Saudi Arabia;
- College of Computer & Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia
| | - Farah Muhammad
- Center of Excellence in Information Assurance (CoEIA), King Saud University, Riyadh 11543, Saudi Arabia;
| |
Collapse
|
10
|
De-La-Cruz C, Trevejo-Pinedo J, Bravo F, Visurraga K, Peña-Echevarría J, Pinedo A, Rojas F, Sun-Kou MR. Application of Machine Learning Algorithms to Classify Peruvian Pisco Varieties Using an Electronic Nose. SENSORS (BASEL, SWITZERLAND) 2023; 23:5864. [PMID: 37447715 DOI: 10.3390/s23135864] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 06/09/2023] [Accepted: 06/19/2023] [Indexed: 07/15/2023]
Abstract
Pisco is an alcoholic beverage obtained from grape juice distillation. Considered the flagship drink of Peru, it is produced following strict and specific quality standards. In this work, sensing results for volatile compounds in pisco, obtained with an electronic nose, were analyzed through the application of machine learning algorithms for the differentiation of pisco varieties. This differentiation aids in verifying beverage quality, considering the parameters established in its Designation of Origin". For signal processing, neural networks, multiclass support vector machines and random forest machine learning algorithms were implemented in MATLAB. In addition, data augmentation was performed using a proposed procedure based on interpolation-extrapolation. All algorithms trained with augmented data showed an increase in performance and more reliable predictions compared to those trained with raw data. From the comparison of these results, it was found that the best performance was achieved with neural networks.
Collapse
Affiliation(s)
- Celso De-La-Cruz
- Department of Engineering, Pontifical Catholic University of Peru, Lima 15088, Peru
| | | | - Fabiola Bravo
- Department of Science, Pontifical Catholic University of Peru, Lima 15088, Peru
| | - Karina Visurraga
- Department of Science, Pontifical Catholic University of Peru, Lima 15088, Peru
| | | | - Angela Pinedo
- Department of Science, Pontifical Catholic University of Peru, Lima 15088, Peru
| | - Freddy Rojas
- Department of Engineering, Pontifical Catholic University of Peru, Lima 15088, Peru
| | - María R Sun-Kou
- Department of Science, Pontifical Catholic University of Peru, Lima 15088, Peru
| |
Collapse
|
11
|
Svantesson M, Olausson H, Eklund A, Thordstein M. Get a New Perspective on EEG: Convolutional Neural Network Encoders for Parametric t-SNE. Brain Sci 2023; 13:453. [PMID: 36979263 PMCID: PMC10046040 DOI: 10.3390/brainsci13030453] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 03/03/2023] [Accepted: 03/04/2023] [Indexed: 03/09/2023] Open
Abstract
t-distributed stochastic neighbor embedding (t-SNE) is a method for reducing high-dimensional data to a low-dimensional representation, and is mostly used for visualizing data. In parametric t-SNE, a neural network learns to reproduce this mapping. When used for EEG analysis, the data are usually first transformed into a set of features, but it is not known which features are optimal. The principle of t-SNE was used to train convolutional neural network (CNN) encoders to learn to produce both a high- and a low-dimensional representation, eliminating the need for feature engineering. To evaluate the method, the Temple University EEG Corpus was used to create three datasets with distinct EEG characters: (1) wakefulness and sleep; (2) interictal epileptiform discharges; and (3) seizure activity. The CNN encoders produced low-dimensional representations of the datasets with a structure that conformed well to the EEG characters and generalized to new data. Compared to parametric t-SNE for either a short-time Fourier transform or wavelet representation of the datasets, the developed CNN encoders performed equally well in separating categories, as assessed by support vector machines. The CNN encoders generally produced a higher degree of clustering, both visually and in the number of clusters detected by k-means clustering. The developed principle is promising and could be further developed to create general tools for exploring relations in EEG data.
Collapse
Affiliation(s)
- Mats Svantesson
- Department of Clinical Neurophysiology, University Hospital of Linköping, 58185 Linköping, Sweden
- Center for Social and Affective Neuroscience, Linköping University, 58183 Linköping, Sweden
- Center for Medical Image Science and Visualization, Linköping University, 58183 Linköping, Sweden
- Department of Biomedical and Clinical Sciences, Linköping University, 58183 Linköping, Sweden
| | - Håkan Olausson
- Department of Clinical Neurophysiology, University Hospital of Linköping, 58185 Linköping, Sweden
- Center for Social and Affective Neuroscience, Linköping University, 58183 Linköping, Sweden
- Department of Biomedical and Clinical Sciences, Linköping University, 58183 Linköping, Sweden
| | - Anders Eklund
- Center for Medical Image Science and Visualization, Linköping University, 58183 Linköping, Sweden
- Department of Biomedical Engineering, Linköping University, 58183 Linköping, Sweden
- Division of Statistics & Machine Learning, Department of Computer and Information Science, Linköping University, 58183 Linköping, Sweden
| | - Magnus Thordstein
- Department of Clinical Neurophysiology, University Hospital of Linköping, 58185 Linköping, Sweden
- Center for Medical Image Science and Visualization, Linköping University, 58183 Linköping, Sweden
- Department of Biomedical and Clinical Sciences, Linköping University, 58183 Linköping, Sweden
| |
Collapse
|
12
|
Collazos-Huertas DF, Álvarez-Meza AM, Cárdenas-Peña DA, Castaño-Duque GA, Castellanos-Domínguez CG. Posthoc Interpretability of Neural Responses by Grouping Subject Motor Imagery Skills Using CNN-Based Connectivity. SENSORS (BASEL, SWITZERLAND) 2023; 23:2750. [PMID: 36904950 PMCID: PMC10007181 DOI: 10.3390/s23052750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 02/19/2023] [Accepted: 02/27/2023] [Indexed: 06/18/2023]
Abstract
Motor Imagery (MI) refers to imagining the mental representation of motor movements without overt motor activity, enhancing physical action execution and neural plasticity with potential applications in medical and professional fields like rehabilitation and education. Currently, the most promising approach for implementing the MI paradigm is the Brain-Computer Interface (BCI), which uses Electroencephalogram (EEG) sensors to detect brain activity. However, MI-BCI control depends on a synergy between user skills and EEG signal analysis. Thus, decoding brain neural responses recorded by scalp electrodes poses still challenging due to substantial limitations, such as non-stationarity and poor spatial resolution. Also, an estimated third of people need more skills to accurately perform MI tasks, leading to underperforming MI-BCI systems. As a strategy to deal with BCI-Inefficiency, this study identifies subjects with poor motor performance at the early stages of BCI training by assessing and interpreting the neural responses elicited by MI across the evaluated subject set. Using connectivity features extracted from class activation maps, we propose a Convolutional Neural Network-based framework for learning relevant information from high-dimensional dynamical data to distinguish between MI tasks while preserving the post-hoc interpretability of neural responses. Two approaches deal with inter/intra-subject variability of MI EEG data: (a) Extracting functional connectivity from spatiotemporal class activation maps through a novel kernel-based cross-spectral distribution estimator, (b) Clustering the subjects according to their achieved classifier accuracy, aiming to find common and discriminative patterns of motor skills. According to the validation results obtained on a bi-class database, an average accuracy enhancement of 10% is achieved compared to the baseline EEGNet approach, reducing the number of "poor skill" subjects from 40% to 20%. Overall, the proposed method can be used to help explain brain neural responses even in subjects with deficient MI skills, who have neural responses with high variability and poor EEG-BCI performance.
Collapse
Affiliation(s)
| | | | | | - Germán Albeiro Castaño-Duque
- Cultura de la Calidad en la Educación Research Group, Universidad Nacional de Colombia, Manizales 170003, Colombia
| | | |
Collapse
|
13
|
A compact multi-branch 1D convolutional neural network for EEG-based motor imagery classification. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
14
|
Martins FM, Suárez VMG, Flecha JRV, López BG. Data Augmentation Effects on Highly Imbalanced EEG Datasets for Automatic Detection of Photoparoxysmal Responses. SENSORS (BASEL, SWITZERLAND) 2023; 23:2312. [PMID: 36850910 PMCID: PMC9963310 DOI: 10.3390/s23042312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 02/02/2023] [Accepted: 02/03/2023] [Indexed: 06/18/2023]
Abstract
Photosensitivity is a neurological disorder in which a person's brain produces epileptic discharges, known as Photoparoxysmal Responses (PPRs), when it receives certain visual stimuli. The current standardized diagnosis process used in hospitals consists of submitting the subject to the Intermittent Photic Stimulation process and attempting to trigger these phenomena. The brain activity is measured by an Electroencephalogram (EEG), and the clinical specialists manually look for the PPRs that were provoked during the session. Due to the nature of this disorder, long EEG recordings may contain very few PPR segments, meaning that a highly imbalanced dataset is available. To tackle this problem, this research focused on applying Data Augmentation (DA) to create synthetic PPR segments from the real ones, improving the balance of the dataset and, thus, the global performance of the Machine Learning techniques applied for automatic PPR detection. K-Nearest Neighbors and a One-Hidden-Dense-Layer Neural Network were employed to evaluate the performance of this DA stage. The results showed that DA is able to improve the models, making them more robust and more able to generalize. A comparison with the results obtained from a previous experiment also showed a performance improvement of around 20% for the Accuracy and Specificity measurements without Sensitivity suffering any losses. This project is currently being carried out with subjects at Burgos University Hospital, Spain.
Collapse
|