1
|
Chen J, Fan X, Ge R, Xiao J, Wang R, Ma W, Li Y. Towards interpretable sleep stage classification with a multi-stream fusion network. BMC Med Inform Decis Mak 2025; 25:164. [PMID: 40229774 PMCID: PMC11998347 DOI: 10.1186/s12911-025-02995-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 04/04/2025] [Indexed: 04/16/2025] Open
Abstract
Sleep stage classification is a significant measure in assessing sleep quality and diagnosing sleep disorders. Many researchers have investigated automatic sleep stage classification methods and achieved promising results. However, these methods ignored the heterogeneous information fusion of the spatial-temporal and spectral-temporal features among multiple-channel sleep monitoring signals. In this study, we propose an interpretable multi-stream fusion network, named MSF-SleepNet, for sleep stage classification. Specifically, we employ Chebyshev graph convolution and temporal convolution to obtain the spatial-temporal features from body-topological information of sleep monitoring signals. Meanwhile, we utilize a short time Fourier transform and gated recurrent unit to learn the spectral-temporal features from sleep monitoring signals. After fusing the spatial-temporal and spectral-temporal features, we use a contrastive learning scheme to enhance the differences in feature patterns of sleep monitoring signals across various sleep stages. Finally, LIME is employed to improve the interpretability of MSF-SleepNet. Experimental results on ISRUC-S1 and ISRUC-S3 datasets show that MSF-SleepNet achieves competitive results and is superior to its state-of-the-art counterparts on most of performance metrics.
Collapse
Affiliation(s)
- Jingrui Chen
- Department of Information Management, Guangdong Justice Police Vocational College, Guangzhou, Guangdong, 510520, China
| | - Xiaomao Fan
- College of Big Data and Internet, Shenzhen Technology University, Shenzhen, Guangdong, 518055, China.
| | - Ruiquan Ge
- School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, China
| | - Jing Xiao
- School of Computer Science, South China Normal University, Guangzhou, Guangdong, 510631, China
| | - Ruxin Wang
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong, 518055, China
| | - Wenjun Ma
- School of Computer Science, South China Normal University, Guangzhou, Guangdong, 510631, China.
| | - Ye Li
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong, 518055, China
| |
Collapse
|
2
|
Wang J, Wang X, Ning X, Lin Y, Phan H, Jia Z. Subject-Adaptation Salient Wave Detection Network for Multimodal Sleep Stage Classification. IEEE J Biomed Health Inform 2025; 29:2172-2184. [PMID: 40030418 DOI: 10.1109/jbhi.2024.3512584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
Abstract
Sleep stage classification is an important step in the diagnosis and treatment of sleep disorders. Despite the high classification performance of previous sleep stage classification work, some challenges remain unresolved: 1) How to effectively capture salient waves in sleep signals to improve sleep stage classification results. 2) How to capture salient waves affected by inter-subject variability. 3) How to adaptively regulate the importance of different modals for different sleep stages. To address these challenges, we propose SleepWaveNet, a multimodal salient wave detection network, which is motivated by the salient object detection task in computer vision. It has a U-Transformer structure to detect salient waves in sleep signals. Meanwhile, the subject-adaptation wave extraction architecture based on transfer learning can adapt to the information of target individuals and extract salient waves with inter-subject variability. In addition, the multimodal attention module can adaptively enhance the importance of specific modal data for sleep stage classification tasks. Experiments on three datasets show that SleepWaveNet has better overall performance than existing baselines. Moreover, visualization experiments show that the model has the ability to capture salient waves with inter-subject variability.
Collapse
|
3
|
Wan C, Nnamdi MC, Shi W, Smith B, Purnell C, Wang MD. Advancing Sleep Disorder Diagnostics: A Transformer-Based EEG Model for Sleep Stage Classification and OSA Prediction. IEEE J Biomed Health Inform 2025; 29:878-886. [PMID: 40030422 DOI: 10.1109/jbhi.2024.3512616] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
Abstract
Sleep disorders, particularly Obstructive Sleep Apnea (OSA), have a considerable effect on an individual's health and quality of life. Accurate sleep stage classification and prediction of OSA are crucial for timely diagnosis and effective management of sleep disorders. In this study, we develop a sequential network that enhances sleep stage classification by incorporating self-attention mechanisms and Conditional Random Fields (CRF) into a deep learning model comprising multi-kernel Convolutional Neural Networks (CNNs) and Transformer-based encoders. The self-attention mechanism enables the model to focus on the most discriminative features extracted from single-channel electroencephalography (EEG) recordings, while the CRF module captures the temporal dependencies between sleep stages, improving the model's ability to learn more plausible sleep stage sequences. Moreover, we explore the relationship between sleep stages and OSA severity by utilizing the predicted sleep stage features to train various regression models for Apnea-Hypopnea Index (AHI) prediction. Our experiments demonstrate an improved sleep stage classification performance of 78.7%, particularly on datasets with diverse AHI values, and highlight the potential of leveraging sleep stage information for monitoring OSA. By employing advanced deep learning techniques, we thoroughly explore the intricate relationship between sleep stages and sleep apnea, laying the foundation for more precise and automated diagnostics of sleep disorders.
Collapse
|
4
|
Pei Y, Xu J, Yu F, Zhang L, Luo W. WaveSleepNet: An Interpretable Network for Expert-Like Sleep Staging. IEEE J Biomed Health Inform 2025; 29:1371-1382. [PMID: 40030379 DOI: 10.1109/jbhi.2024.3498871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/06/2025]
Abstract
Although deep learning algorithms have proven their efficiency in automatic sleep staging, their "black-box" nature has limited their clinical adoption. In this study, we propose WaveSleepNet, an interpretable neural network for sleep staging that reasons in a similar way to sleep clinical experts. In this network, we utilize the latent space representations generated during training to identify characteristic wave prototypes corresponding to different sleep stages. The feature representation of an input signal is segmented into patches within the latent space, each of which is compared against the learned wave prototypes. The proximity between these patches and the wave prototypes is quantified through scores, indicating the prototypes' presence and relative proportion within the signal. The scores serve as the decision-making criteria for final sleep staging. During training, an ensemble of loss functions is employed for the prototypes' diversity and robustness. Furthermore, the learned wave prototypes are visualized by analyzing occlusion sensitivity. The efficacy of WaveSleepNet is validated across three public datasets, achieving sleep staging performance that are on par with those of the state-of-the-art models. A detailed case study examining the decision-making process of WaveSleepNet demonstrates that it aligns closely with American Academy of Sleep Medicine (AASM) manual guidelines. Another case study systematically explained the misidentified reasons behind each sleep stage. WaveSleepNet's transparent process provides specialists with direct access to the physiological significance of the model's criteria, allowing for future validation, adoption and further enrichment by sleep clinical experts.
Collapse
|
5
|
Wang X, Zhu Y. SleepGCN: A transition rule learning model based on Graph Convolutional Network for sleep staging. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 257:108405. [PMID: 39243591 DOI: 10.1016/j.cmpb.2024.108405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Revised: 08/29/2024] [Accepted: 08/30/2024] [Indexed: 09/09/2024]
Abstract
BACKGROUND AND OBJECTIVE Automatic sleep staging is essential for assessing and diagnosing sleep disorders, serving millions of people who suffer from them. Numerous sleep staging models have been proposed recently, but most of them have not fully explored the sleep transition rules that are essential for sleep experts to identify sleep stages. Therefore, one objective of this paper is to develop an automatic sleep staging model to capture the transition rules between sleep stages. METHODS In this paper, we propose a novel sleep staging model named SleepGCN. It utilizes the deep features of electroencephalogram (EEG) and electrooculogram (EOG) signals extracted by the sleep representation learning (SRL) module, in conjunction with the transition rules learned by the sleep transition rule learning (STRL) module to identify sleep stages. Specifically, the SRL module utilizes the residual network (ResNet) and Long Short Term Memory (LSTM) structure to capture the deep time-invariant features and temporal information of each sleep stage from the two-channel EEG-EOG, and then applies a feature enhancement block to obtain the refined features. The STRL module employs a Graph Convolutional Network (GCN) and a transition rule matrix to capture transition rules between sleep stages based on the sequence labels of the input signals. RESULTS We evaluate SleepGCN on five public datasets: SleepEDF-20, SleepEDF-78, SHHS, DOD-H and DOD-O. Overall, SleepGCN achieves an accuracy of 89.70%, 87.70%, 86.16%, 82.07%, and 81.20%, alongside a macro-average F1-score of 85.20%, 82.70%, 77.69%, 72.44%, and 72.93% across these datasets, respectively. CONCLUSIONS The results achieved by our proposed model are much better than those of all other compared models. The ablation study validates the contributions of the SRL and STRL modules proposed in SleepGCN to the sleep staging tasks. Additionally, it shows that the sleep staging model using two-channel EEG-EOG outperforms the model using single-channel EEG or EOG. Overall, SleepGCN is an effective solution for sleep staging using two-channel EEG-EOG.
Collapse
Affiliation(s)
- Xuhui Wang
- School of Computer Science, Wuhan University, Wuhan, 430061, China
| | - Yuanyuan Zhu
- School of Computer Science, Wuhan University, Wuhan, 430061, China.
| |
Collapse
|
6
|
Jirakittayakorn N, Wongsawat Y, Mitrirattanakul S. An enzyme-inspired specificity in deep learning model for sleep stage classification using multi-channel PSG signals input: Separating training approach and its performance on cross-dataset validation for generalizability. Comput Biol Med 2024; 182:109138. [PMID: 39305732 DOI: 10.1016/j.compbiomed.2024.109138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2024] [Revised: 08/15/2024] [Accepted: 09/08/2024] [Indexed: 11/14/2024]
Abstract
Numerous automatic sleep stage classification systems have been developed, but none have become effective assistive tools for sleep technicians due to issues with generalization. Four key factors hinder the generalization of these models are instruments, montage of recording, subject type, and scoring manual factors. This study aimed to develop a deep learning model that addresses generalization problems by integrating enzyme-inspired specificity and employing separating training approaches. Subject type and scoring manual factors were controlled, while the focus was on instruments and montage of recording factors. The proposed model consists of three sets of signal-specific models including EEG-, EOG-, and EMG-specific model. The EEG-specific models further include three sets of channel-specific models. All signal-specific and channel-specific models were established with data manipulation and weighted loss strategies, resulting in three sets of data manipulation models and class-specific models, respectively. These models were CNNs. Additionally, BiLSTM models were applied to EEG- and EOG-specific models to obtain temporal information. Finally, classification task for sleep stage was handled by 'the-last-dense' layer. The optimal sampling frequency for each physiological signal was identified and used during the training process. The proposed model was trained on MGH dataset and evaluated using both within dataset and cross-dataset. For MGH dataset, overall accuracy of 81.05 %, MF1 of 79.05 %, Kappa of 0.7408, and per-class F1-scores: W (84.98 %), N1 (58.06 %), N2 (84.82 %), N3 (79.20 %), and REM (88.17 %) can be achieved. Performances on cross-datasets are as follows: SHHS1 200 records reached 79.54 %, 70.56 %, and 0.7078; SHHS2 200 records achieved 76.77 %, 66.30 %, and 0.6632; Sleep-EDF 153 records gained 78.52 %, 72.13 %, and 0.7031; and BCI-MU (local dataset) 94 records achieved 83.57 %, 82.17 %, and 0.7769 for overall accuracy, MF1, and Kappa respectively. Additionally, the proposed model has approximately 9.3 M trainable parameters and takes around 26 s to process one PSG record. The results indicate that the proposed model demonstrates generalizability in sleep stage classification and shows potential as a feasibility tool for real-world applications. Additionally, enzyme-inspired specificity effectively addresses the challenges posed by varying montage of recording, while the identified optimal frequencies mitigate instrument-related issues.
Collapse
Affiliation(s)
| | - Yodchanan Wongsawat
- Department of Biomedical Engineering, Faculty of Engineering, Mahidol University, Thailand.
| | | |
Collapse
|
7
|
Chen W, Cai Y, Li A, Su Y, Jiang K. Single-Channel Sleep EEG Classification Method Based on LSTM and Hidden Markov Model. Brain Sci 2024; 14:1087. [PMID: 39595850 PMCID: PMC11592309 DOI: 10.3390/brainsci14111087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2024] [Revised: 10/19/2024] [Accepted: 10/28/2024] [Indexed: 11/28/2024] Open
Abstract
BACKGROUND The single-channel sleep EEG has the advantages of convenient collection, high-cost performance, and easy daily use, and it has been widely used in the classification of sleep stages. METHODS This paper proposes a single-channel sleep EEG classification method based on long short-term memory and a hidden Markov model (LSTM-HMM). First, the single-channel EEG is decomposed using wavelet transform (WT), and multi-domain features are extracted from the component signals to characterize the EEG characteristics fully. Considering the temporal nature of sleep stage changes, this paper uses a multi-step time series as the input for the model. After that, the multi-step time series features are input into the LSTM. Finally, the HMM improves the classification results, and the final prediction results are obtained. RESULTS A complete experiment was conducted on the Sleep-EDFx dataset. The results show that the proposed method can extract deep information from EEG and make full use of the sleep stage transition rule. The proposed method shows the best performance in single-channel sleep EEG classification; the accuracy, macro average F1 score, and kappa are 82.71%, 0.75, and 0.76, respectively. CONCLUSIONS The proposed method can realize single-channel sleep EEG classification and provide a reference for other EEG classifications.
Collapse
Affiliation(s)
| | - Yanping Cai
- School of Combat Support, Rocket Force University of Engineering, Xi’an 710025, China; (W.C.); (A.L.); (Y.S.); (K.J.)
| | | | | | | |
Collapse
|
8
|
Beauchemin N, Charland P, Karran A, Boasen J, Tadson B, Sénécal S, Léger PM. Enhancing learning experiences: EEG-based passive BCI system adapts learning speed to cognitive load in real-time, with motivation as catalyst. Front Hum Neurosci 2024; 18:1416683. [PMID: 39435350 PMCID: PMC11491376 DOI: 10.3389/fnhum.2024.1416683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Accepted: 09/26/2024] [Indexed: 10/23/2024] Open
Abstract
Computer-based learning has gained popularity in recent years, providing learners greater flexibility and freedom. However, these learning environments do not consider the learner's mental state in real-time, resulting in less optimized learning experiences. This research aimed to explore the effect on the learning experience of a novel EEG-based Brain-Computer Interface (BCI) that adjusts the speed of information presentation in real-time during a learning task according to the learner's cognitive load. We also explored how motivation moderated these effects. In accordance with three experimental groups (non-adaptive, adaptive, and adaptive with motivation), participants performed a calibration task (n-back), followed by a memory-based learning task concerning astrological constellations. Learning gains were assessed based on performance on the learning task. Self-perceived mental workload, cognitive absorption and satisfaction were assessed using a post-test questionnaire. Between-group analyses using Mann-Whitney tests suggested that combining BCI and motivational factors led to more significant learning gains and an improved learning experience. No significant difference existed between the BCI without motivational factor and regular non-adaptive interface for overall learning gains, self-perceived mental workload, and cognitive absorption. However, participants who undertook the experiment with an imposed learning pace reported higher overall satisfaction with their learning experience and a higher level of temporal stress. Our findings suggest BCI's potential applicability and feasibility in improving memorization-based learning experiences. Further work should seek to optimize the BCI adaptive index and explore generalizability to other learning contexts.
Collapse
Affiliation(s)
- Noémie Beauchemin
- Tech3Lab, HEC Montréal, Information Technology Department, Montreal, QC, Canada
| | - Patrick Charland
- Didactics Department, Université du Québec à Montréal, Montreal, QC, Canada
| | - Alexander Karran
- Tech3Lab, HEC Montréal, Information Technology Department, Montreal, QC, Canada
| | - Jared Boasen
- Tech3Lab, HEC Montréal, Information Technology Department, Montreal, QC, Canada
- Faculty of Health Sciences, Hokkaido University, Sapporo, Japan
| | - Bella Tadson
- Tech3Lab, HEC Montréal, Information Technology Department, Montreal, QC, Canada
| | - Sylvain Sénécal
- Tech3Lab, HEC Montréal, Information Technology Department, Montreal, QC, Canada
| | | |
Collapse
|
9
|
Zhou W, Shen N, Zhou L, Liu M, Zhang Y, Fu C, Yu H, Shu F, Chen W, Chen C. PSEENet: A Pseudo-Siamese Neural Network Incorporating Electroencephalography and Electrooculography Characteristics for Heterogeneous Sleep Staging. IEEE J Biomed Health Inform 2024; 28:5189-5200. [PMID: 38771683 DOI: 10.1109/jbhi.2024.3403878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/23/2024]
Abstract
Sleep staging plays a critical role in evaluating the quality of sleep. Currently, most studies are either suffering from dramatic performance drops when coping with varying input modalities or unable to handle heterogeneous signals. To handle heterogeneous signals and guarantee favorable sleep staging performance when a single modality is available, a pseudo-siamese neural network (PSN) to incorporate electroencephalography (EEG), electrooculography (EOG) characteristics is proposed (PSEENet). PSEENet consists of two parts, spatial mapping modules (SMMs) and a weight-shared classifier. SMMs are used to extract high-dimensional features. Meanwhile, joint linkages among multi-modalities are provided by quantifying the similarity of features. Finally, with the cooperation of heterogeneous characteristics, associations within various sleep stages can be established by the classifier. The evaluation of the model is validated on two public datasets, namely, Montreal Archive of Sleep Studies (MASS) and SleepEDFX, and one clinical dataset from Huashan Hospital of Fudan University (HSFU). Experimental results show that the model can handle heterogeneous signals, provide superior results under multimodal signals and show good performance with single modality. PSEENet obtains accuracy of 79.1%, 82.1% with EEG, EEG and EOG on Sleep-EDFX, and significantly improves the accuracy with EOG from 73.7% to 76% by introducing similarity information.
Collapse
|
10
|
Jiang C, Xie W, Zheng J, Yan B, Luo J, Zhang J. MLS-Net: An Automatic Sleep Stage Classifier Utilizing Multimodal Physiological Signals in Mice. BIOSENSORS 2024; 14:406. [PMID: 39194635 DOI: 10.3390/bios14080406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 08/20/2024] [Accepted: 08/20/2024] [Indexed: 08/29/2024]
Abstract
Over the past decades, feature-based statistical machine learning and deep neural networks have been extensively utilized for automatic sleep stage classification (ASSC). Feature-based approaches offer clear insights into sleep characteristics and require low computational power but often fail to capture the spatial-temporal context of the data. In contrast, deep neural networks can process raw sleep signals directly and deliver superior performance. However, their overfitting, inconsistent accuracy, and computational cost were the primary drawbacks that limited their end-user acceptance. To address these challenges, we developed a novel neural network model, MLS-Net, which integrates the strengths of neural networks and feature extraction for automated sleep staging in mice. MLS-Net leverages temporal and spectral features from multimodal signals, such as EEG, EMG, and eye movements (EMs), as inputs and incorporates a bidirectional Long Short-Term Memory (bi-LSTM) to effectively capture the spatial-temporal nonlinear characteristics inherent in sleep signals. Our studies demonstrate that MLS-Net achieves an overall classification accuracy of 90.4% and REM state precision of 91.1%, sensitivity of 84.7%, and an F1-Score of 87.5% in mice, outperforming other neural network and feature-based algorithms in our multimodal dataset.
Collapse
Affiliation(s)
- Chengyong Jiang
- State Key Laboratory of Medical Neurobiology, MOE Frontiers Center for Brain Science, Institutes of Brain Science, Institute for Medical and Engineering Innovation, Department of Ophthalmology and Vision Science, Eye & ENT Hospital, Fudan University, Shanghai 200032, China
| | - Wenbin Xie
- State Key Laboratory of Medical Neurobiology, MOE Frontiers Center for Brain Science, Institutes of Brain Science, Institute for Medical and Engineering Innovation, Department of Ophthalmology and Vision Science, Eye & ENT Hospital, Fudan University, Shanghai 200032, China
| | - Jiadong Zheng
- State Key Laboratory of Medical Neurobiology, MOE Frontiers Center for Brain Science, Institutes of Brain Science, Institute for Medical and Engineering Innovation, Department of Ophthalmology and Vision Science, Eye & ENT Hospital, Fudan University, Shanghai 200032, China
| | - Biao Yan
- State Key Laboratory of Medical Neurobiology, MOE Frontiers Center for Brain Science, Institutes of Brain Science, Institute for Medical and Engineering Innovation, Department of Ophthalmology and Vision Science, Eye & ENT Hospital, Fudan University, Shanghai 200032, China
| | - Junwen Luo
- State Key Laboratory of Medical Neurobiology, MOE Frontiers Center for Brain Science, Institutes of Brain Science, Institute for Medical and Engineering Innovation, Department of Ophthalmology and Vision Science, Eye & ENT Hospital, Fudan University, Shanghai 200032, China
| | - Jiayi Zhang
- State Key Laboratory of Medical Neurobiology, MOE Frontiers Center for Brain Science, Institutes of Brain Science, Institute for Medical and Engineering Innovation, Department of Ophthalmology and Vision Science, Eye & ENT Hospital, Fudan University, Shanghai 200032, China
| |
Collapse
|
11
|
Pradeepkumar J, Anandakumar M, Kugathasan V, Suntharalingham D, Kappel SL, De Silva AC, Edussooriya CUS. Toward Interpretable Sleep Stage Classification Using Cross-Modal Transformers. IEEE Trans Neural Syst Rehabil Eng 2024; 32:2893-2904. [PMID: 39102323 DOI: 10.1109/tnsre.2024.3438610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/07/2024]
Abstract
Accurate sleep stage classification is significant for sleep health assessment. In recent years, several machine-learning based sleep staging algorithms have been developed, and in particular, deep-learning based algorithms have achieved performance on par with human annotation. Despite improved performance, a limitation of most deep-learning based algorithms is their black-box behavior, which have limited their use in clinical settings. Here, we propose a cross-modal transformer, which is a transformer-based method for sleep stage classification. The proposed cross-modal transformer consists of a cross-modal transformer encoder architecture along with a multi-scale one-dimensional convolutional neural network for automatic representation learning. The performance of our method is on-par with the state-of-the-art methods and eliminates the black-box behavior of deep-learning models by utilizing the interpretability aspect of the attention modules. Furthermore, our method provides considerable reductions in the number of parameters and training time compared to the state-of-the-art methods. Our code is available at https://github.com/Jathurshan0330/Cross-Modal-Transformer. A demo of our work can be found at https://bit.ly/Cross_modal_transformer_demo.
Collapse
|
12
|
Li Y, Xu Z, Chen Z, Zhang Y, Zhang B. Insights from the 2nd China intelligent sleep staging competition. Sleep Breath 2024; 28:1661-1669. [PMID: 38730204 DOI: 10.1007/s11325-024-03055-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2023] [Revised: 04/25/2024] [Accepted: 05/08/2024] [Indexed: 05/12/2024]
Abstract
STUDY OBJECTIVES Artificial intelligence (AI) is quickly advancing in the field of sleep medicine, which bodes well for the potential of actual clinical use. In this study, an analysis of the 2nd China Intelligent Sleep Staging Competition was conducted to gain insights into the general level and constraints of AI-assisted sleep staging in China. METHODS The outcomes of 10 teams from the children's track and 13 teams from the adult track were investigated in this study. The analysis included overall performance, differences between five different sleep stages, variations across subjects, and performance during stage transitions. RESULTS The adult track's accuracy peaked at 80.46%, while the children's track's accuracy peaked at 88.96%. On average, accuracy rates stood at 71.43% for children and 68.40% for adults. All results were produced within a mere 5-min timeframe. The N1 stage was prone to misclassification as W, N2, and R stages. In the adult track, significant differences were apparent among subjects (p < 0.05), whereas in the children's track, such differences were not observed. Nonetheless, both tracks experienced a performance decline during stage transitions. CONCLUSIONS The computational speed of AI is remarkably fast, simultaneously holding the potential to surpass the accuracy of physicians. Improving the machine learning model's classification of the N1 stage and transitional periods between stages, along with bolstering its robustness to individual subject variations, is imperative for maximizing its ability in assisting clinical scoring.
Collapse
Affiliation(s)
- Yamei Li
- College of Electronic and Information Engineering, Southwest University, Chongqing, 400715, China
| | - Zhifei Xu
- Department of Respiratory Medicine, Beijing Children's Hospital, Capital Medical University, Beijing, 100045, China
| | - Zhiqiang Chen
- College of Electronic and Information Engineering, Southwest University, Chongqing, 400715, China
| | - Yuan Zhang
- College of Electronic and Information Engineering, Southwest University, Chongqing, 400715, China.
| | - Bin Zhang
- Department of Psychiatry, Nanfang Hospital, Southern Medical University, Guangzhou, 510515, China
| |
Collapse
|
13
|
Ingle M, Sharma M, Verma S, Sharma N, Bhurane A, Rajendra Acharya U. Automated explainable wavelet-based sleep scoring system for a population suspected with insomnia, apnea and periodic leg movement. Med Eng Phys 2024; 130:104208. [PMID: 39160031 DOI: 10.1016/j.medengphy.2024.104208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 05/31/2024] [Accepted: 07/01/2024] [Indexed: 08/21/2024]
Abstract
Sleep is an integral and vital component of human life, contributing significantly to overall health and well-being, but a considerable number of people worldwide experience sleep disorders. Sleep disorder diagnosis heavily depends on accurately classifying sleep stages. Traditionally, this classification has been performed manually by trained sleep technologists that visually inspect polysomnography records. However, in order to mitigate the labor-intensive nature of this process, automated approaches have been developed. These automated methods aim to streamline and facilitate sleep stage classification. This study aims to classify sleep stages in a dataset comprising subjects with insomnia, PLM, and sleep apnea. The dataset consists of PSG recordings from the multi-ethnic study of atherosclerosis (MESA) cohort of the national sleep research resource (NSRR), including 2056 subjects. Among these subjects, 130 have insomnia, 39 suffer from PLM, 156 have sleep apnea, and the remaining 1731 are classified as good sleepers. This study proposes an automated computerized technique to classify sleep stages, developing a machine-learning model with explainable artificial intelligence (XAI) capabilities using wavelet-based Hjorth parameters. An optimal biorthogonal wavelet filter bank (BOWFB) has been employed to extract subbands (SBs) from 30 seconds of electroencephalogram (EEG) epochs. Three EEG channels, namely: Fz_Cz, Cz_Oz, and C4_M1, are employed to yield an optimum outcome. The Hjorth parameters extracted from SBs were then fed to different machine learning algorithms. To gain an understanding of the model, in this study, we used SHAP (Shapley Additive explanations) method. For subjects suffering from the aforementioned diseases, the model utilized features derived from all channels and employed an ensembled bagged trees (EnBT) classifier. The highest accuracy of 86.8%, 87.3%, 85.0%, 84.5%, and 83.8% is obtained for the insomniac, PLM, apniac, good sleepers and complete datasets, respectively. Using these techniques and datasets, the study aims to enhance sleep stage classification accuracy and improve understanding of sleep disorders such as insomnia, PLM, and sleep apnea.
Collapse
Affiliation(s)
- Manisha Ingle
- Department of Electronics and Communication Engineering, Visvesvaraya National Institute of Technology, Nagpur-440010, Maharashtra, India.
| | - Manish Sharma
- Department of Electrical and Computer Science Engineering, and Centre of Advanced Defence Technology (CADT), Institute of Infrastructure, Technology, Research and Management (IITRAM), Ahmedabad-380026, Gujrat, India.
| | - Shresth Verma
- Department of Electrical and Computer Science Engineering, and Centre of Advanced Defence Technology (CADT), Institute of Infrastructure, Technology, Research and Management (IITRAM), Ahmedabad-380026, Gujrat, India.
| | - Nishant Sharma
- Department of Electrical and Computer Science Engineering, and Centre of Advanced Defence Technology (CADT), Institute of Infrastructure, Technology, Research and Management (IITRAM), Ahmedabad-380026, Gujrat, India.
| | - Ankit Bhurane
- Department of Electronics and Communication Engineering, Visvesvaraya National Institute of Technology, Nagpur-440010, Maharashtra, India.
| | - U Rajendra Acharya
- School of Mathematics, Physics and Computing, University of Southern Queensland, Springfield, Australia.
| |
Collapse
|
14
|
McMahon M, Goldin J, Kealy ES, Wicks DJ, Zilberg E, Freeman W, Aliahmad B. Performance Investigation of Somfit Sleep Staging Algorithm. Nat Sci Sleep 2024; 16:1027-1043. [PMID: 39071546 PMCID: PMC11277903 DOI: 10.2147/nss.s463026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Accepted: 07/01/2024] [Indexed: 07/30/2024] Open
Abstract
Purpose To investigate accuracy of the sleep staging algorithm in a new miniaturized home sleep monitoring device - Compumedics® Somfit. Somfit is attached to patient's forehead and combines channels specified for a pulse arterial tonometry (PAT)-based home sleep apnea testing (HSAT) device with the neurological signals. Somfit sleep staging deep learning algorithm is based on convolutional neural network architecture. Patients and Methods One hundred and ten participants referred for sleep investigation with suspected or preexisting obstructive sleep apnea (OSA) in need of a review were enrolled into the study involving simultaneous recording of full overnight polysomnography (PSG) and Somfit data. The recordings were conducted at three centers in Australia. The reported statistics include standard measures of agreement between Somfit automatic hypnogram and consensus PSG hypnogram. Results Overall percent agreement across five sleep stages (N1, N2, N3, REM, and wake) between Somfit automatic and consensus PSG hypnograms was 76.14 (SE: 0.79). The percent agreements between different pairs of sleep technologists' PSG hypnograms varied from 74.36 (1.93) to 85.50 (0.64), with interscorer agreement being greater for scorers from the same sleep laboratory. The estimate of kappa between Somfit and consensus PSG was 0.672 (0.002). Percent agreement for sleep/wake discrimination was 89.30 (0.37). The accuracy of Somfit sleep staging algorithm varied with increasing OSA severity - percent agreement was 79.67 (1.87) for the normal subjects, 77.38 (1.06) for mild OSA, 74.83 (1.79) for moderate OSA and 72.93 (1.68) for severe OSA. Conclusion Agreement between Somfit and PSG hypnograms was non-inferior to PSG interscorer agreement for a number of scorers, thus confirming acceptability of electrode placement at the center of the forehead. The directions for algorithm improvement include additional arousal detection, integration of motion and oximetry signals and separate inference models for individual sleep stages.
Collapse
Affiliation(s)
- Marcus McMahon
- Department of Respiratory and Sleep Medicine, Epworth Hospital, Richmond, Victoria, Australia and Department of Respiratory and Sleep Medicine, Austin Health, Heidelberg, Victoria, Australia
| | - Jeremy Goldin
- Department of Respiratory and Sleep Medicine, Royal Melbourne Hospital, Parkvile, Victoria, Australia
| | | | | | - Eugene Zilberg
- Medical Innovations, Compumedics Limited, Abbotsford, Victoria, Australia
| | - Warwick Freeman
- Medical Innovations, Compumedics Limited, Abbotsford, Victoria, Australia
| | - Behzad Aliahmad
- Medical Innovations, Compumedics Limited, Abbotsford, Victoria, Australia
| |
Collapse
|
15
|
Shintani D, Nakari I, Washizaki S, Takadama K. REM Estimation Based on Accelerometer by Excluding Other Stages and Two-Scale Smoothing. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2024; 2024:1-4. [PMID: 40039771 DOI: 10.1109/embc53108.2024.10781582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/06/2025]
Abstract
This paper focuses on the REM sleep as one of sleep stages and proposes its estimation method based on the accelerometer attached to a nightwear around waist. In our approach, the REM sleep is estimated by (i) excluding other sleep stages before the REM sleep estimation and (ii) correcting the estimated probability according to the difference of the two scale moving averages. To investigate the effectiveness of the proposed method, this paper conducted the human subject experiment on 35 whole nights, which revealed the following implications: (i) the proposed method can significantly improve recall and F1 compared to Random Forest as one of the major machine learning; (ii) the REM sleep estimation excluding other sleep stages can improve performance compared to REM estimation without.Clinical relevance- A technology for estimating sleep on a daily basis is needed. The results show that the proposed method improves the performance of the sleep stage estimation by correcting inaccurate probabilities of the sleep stage. The proposed method can contribute to improving the performance of sleep stage estimation in daily life.
Collapse
|
16
|
Niknazar H, Mednick SC. A Multi-Level Interpretable Sleep Stage Scoring System by Infusing Experts' Knowledge Into a Deep Network Architecture. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2024; 46:5044-5061. [PMID: 38358869 DOI: 10.1109/tpami.2024.3366170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/17/2024]
Abstract
In recent years, deep learning has shown potential and efficiency in a wide area including computer vision, image and signal processing. Yet, translational challenges remain for user applications due to a lack of interpretability of algorithmic decisions and results. This black box problem is particularly problematic for high-risk applications such as medical-related decision-making. The current study goal was to design an interpretable deep learning system for time series classification of electroencephalogram (EEG) for sleep stage scoring as a step toward designing a transparent system. We have developed an interpretable deep neural network that includes a kernel-based layer guided by a set of principles used for sleep scoring by human experts in the visual analysis of polysomnographic records. A kernel-based convolutional layer was defined and used as the first layer of the system and made available for user interpretation. The trained system and its results were interpreted in four levels from microstructure of EEG signals, such as trained kernels and effect of each kernel on the detected stages, to macrostructures, such as transitions between stages. The proposed system demonstrated greater performance than prior studies and the system learned information consistent with expert knowledge.
Collapse
|
17
|
Li W, Liu T, Xu B, Song A. SleepFC: Feature Pyramid and Cross-Scale Context Learning for Sleep Staging. IEEE Trans Neural Syst Rehabil Eng 2024; 32:2198-2208. [PMID: 38805336 DOI: 10.1109/tnsre.2024.3406383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/30/2024]
Abstract
Automated sleep staging is essential to assess sleep quality and treat sleep disorders, so the issue of electroencephalography (EEG)-based sleep staging has gained extensive research interests. However, the following difficulties exist in this issue: 1) how to effectively learn the intrinsic features of salient waves from single-channel EEG signals; 2) how to learn and capture the useful information of sleep stage transition rules; 3) how to address the class imbalance problem of sleep stages. To handle these problems in sleep staging, we propose a novel method named SleepFC. This method comprises convolutional feature pyramid network (CFPN), cross-scale temporal context learning (CSTCL), and class adaptive fine-tuning loss function (CAFTLF) based classification network. CFPN learns the multi-scale features from salient waves of EEG signals. CSTCL extracts the informative multi-scale transition rules between sleep stages. CAFTLF-based classification network handles the class imbalance problem. Extensive experiments on three public benchmark datasets demonstrate the superiority of SleepFC over the state-of-the-art approaches. Particularly, SleepFC has a significant performance advantage in recognizing the N1 sleep stage, which is challenging to distinguish.
Collapse
|
18
|
Yazdi M, Samaee M, Massicotte D. A Review on Automated Sleep Study. Ann Biomed Eng 2024; 52:1463-1491. [PMID: 38493234 DOI: 10.1007/s10439-024-03486-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Accepted: 02/25/2024] [Indexed: 03/18/2024]
Abstract
In recent years, research on automated sleep analysis has witnessed significant growth, reflecting advancements in understanding sleep patterns and their impact on overall health. This review synthesizes findings from an exhaustive analysis of 87 papers, systematically retrieved from prominent databases such as Google Scholar, PubMed, IEEE Xplore, and ScienceDirect. The selection criteria prioritized studies focusing on methods employed, signal modalities utilized, and machine learning algorithms applied in automated sleep analysis. The overarching goal was to critically evaluate the strengths and weaknesses of the proposed methods, shedding light on the current landscape and future directions in sleep research. An in-depth exploration of the reviewed literature revealed a diverse range of methodologies and machine learning approaches employed in automated sleep studies. Notably, K-Nearest Neighbors (KNN), Ensemble Learning Methods, and Support Vector Machine (SVM) emerged as versatile and potent classifiers, exhibiting high accuracies in various applications. However, challenges such as performance variability and computational demands were observed, necessitating judicious classifier selection based on dataset intricacies. In addition, the integration of traditional feature extraction methods with deep structures and the combination of different deep neural networks were identified as promising strategies to enhance diagnostic accuracy in sleep-related studies. The reviewed literature emphasized the need for adaptive classifiers, cross-modality integration, and collaborative efforts to drive the field toward more accurate, robust, and accessible sleep-related diagnostic solutions. This comprehensive review serves as a solid foundation for researchers and practitioners, providing an organized synthesis of the current state of knowledge in automated sleep analysis. By highlighting the strengths and challenges of various methodologies, this review aims to guide future research toward more effective and nuanced approaches to sleep diagnostics.
Collapse
Affiliation(s)
- Mehran Yazdi
- Laboratory of Signal and System Integration, Department of Electrical and Computer Engineering, Université du Québec à Trois-Rivières, Trois-Rivières, Canada.
- Signal and Image Processing Laboratory, School of Electrical and Computer Engineering, Shiraz University, Shiraz, Iran.
| | - Mahdi Samaee
- Signal and Image Processing Laboratory, School of Electrical and Computer Engineering, Shiraz University, Shiraz, Iran
| | - Daniel Massicotte
- Laboratory of Signal and System Integration, Department of Electrical and Computer Engineering, Université du Québec à Trois-Rivières, Trois-Rivières, Canada
| |
Collapse
|
19
|
Li Y, Chen J, Ma W, Zhao G, Fan X. MVF-SleepNet: Multi-View Fusion Network for Sleep Stage Classification. IEEE J Biomed Health Inform 2024; 28:2485-2495. [PMID: 36129857 DOI: 10.1109/jbhi.2022.3208314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Sleep stage classification is of great importance in human health monitoring and disease diagnosing. Clinically, visual-inspected classifying sleep into different stages is quite time consuming and highly relies on the expertise of sleep specialists. Many automated models for sleep stage classification have been proposed in previous studies but their performances still exist a gap to the real clinical application. In this work, we propose a novel multi-view fusion network named MVF-SleepNet based on multi-modal physiological signals of electroencephalography (EEG), electrocardiography (ECG), electrooculography (EOG), and electromyography (EMG). To capture the relationship representation among multi-modal physiological signals, we construct two views of Time-frequency images (TF images) and Graph-learned graphs (GL graphs). To learn the spectral-temporal representation from sequentially timed TF images, the combination of VGG-16 and GRU networks is utilized. To learn the spatial-temporal representation from sequentially timed GL graphs, the combination of Chebyshev graph convolution and temporal convolution networks is employed. Fusing the spectral-temporal representation and spatial-temporal representation can further boost the performance of sleep stage classification. A large number of experiment results on the publicly available datasets of ISRUC-S1 and ISRUC-S3 show that the MVF-SleepNet achieves overall accuracy of 0.821, F1 score of 0.802 and Kappa of 0.768 on ISRUC-S1 dataset, and accuracy of 0.841, F1 score of 0.828 and Kappa of 0.795 on ISRUC-S3 dataset. The MVF-SleepNet achieves competitive results on both datasets of ISRUC-S1 and ISRUC-S3 for sleep stage classification compared to the state-of-the-art baselines. The source code of MVF-SleepNet is available on Github (https://github.com/YJPai65/MVF-SleepNet).
Collapse
|
20
|
An P, Zhao J, Du B, Zhao W, Zhang T, Yuan Z. Amplitude-Time Dual-View Fused EEG Temporal Feature Learning for Automatic Sleep Staging. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:6492-6506. [PMID: 36215384 DOI: 10.1109/tnnls.2022.3210384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Electroencephalogram (EEG) plays an important role in studying brain function and human cognitive performance, and the recognition of EEG signals is vital to develop an automatic sleep staging system. However, due to the complex nonstationary characteristics and the individual difference between subjects, how to obtain the effective signal features of the EEG for practical application is still a challenging task. In this article, we investigate the EEG feature learning problem and propose a novel temporal feature learning method based on amplitude-time dual-view fusion for automatic sleep staging. First, we explore the feature extraction ability of convolutional neural networks for the EEG signal from the perspective of interpretability and construct two new representation signals for the raw EEG from the views of amplitude and time. Then, we extract the amplitude-time signal features that reflect the transformation between different sleep stages from the obtained representation signals by using conventional 1-D CNNs. Furthermore, a hybrid dilation convolution module is used to learn the long-term temporal dependency features of EEG signals, which can overcome the shortcoming that the small-scale convolution kernel can only learn the local signal variation information. Finally, we conduct attention-based feature fusion for the learned dual-view signal features to further improve sleep staging performance. To evaluate the performance of the proposed method, we test 30-s-epoch EEG signal samples for healthy subjects and subjects with mild sleep disorders. The experimental results from the most commonly used datasets show that the proposed method has better sleep staging performance and has the potential for the development and application of an EEG-based automatic sleep staging system.
Collapse
|
21
|
Jirakittayakorn N, Wongsawat Y, Mitrirattanakul S. ZleepAnlystNet: a novel deep learning model for automatic sleep stage scoring based on single-channel raw EEG data using separating training. Sci Rep 2024; 14:9859. [PMID: 38684765 PMCID: PMC11058251 DOI: 10.1038/s41598-024-60796-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 04/26/2024] [Indexed: 05/02/2024] Open
Abstract
Numerous models for sleep stage scoring utilizing single-channel raw EEG signal have typically employed CNN and BiLSTM architectures. While these models, incorporating temporal information for sequence classification, demonstrate superior overall performance, they often exhibit low per-class performance for N1-stage, necessitating an adjustment of loss function. However, the efficacy of such adjustment is constrained by the training process. In this study, a pioneering training approach called separating training is introduced, alongside a novel model, to enhance performance. The developed model comprises 15 CNN models with varying loss function weights for feature extraction and 1 BiLSTM for sequence classification. Due to its architecture, this model cannot be trained using an end-to-end approach, necessitating separate training for each component using the Sleep-EDF dataset. Achieving an overall accuracy of 87.02%, MF1 of 82.09%, Kappa of 0.8221, and per-class F1-socres (W 90.34%, N1 54.23%, N2 89.53%, N3 88.96%, and REM 87.40%), our model demonstrates promising performance. Comparison with sleep technicians reveals a Kappa of 0.7015, indicating alignment with reference sleep stags. Additionally, cross-dataset validation and adaptation through training with the SHHS dataset yield an overall accuracy of 84.40%, MF1 of 74.96% and Kappa of 0.7785 when tested with the Sleep-EDF-13 dataset. These findings underscore the generalization potential in model architecture design facilitated by our novel training approach.
Collapse
Affiliation(s)
- Nantawachara Jirakittayakorn
- Institute for Innovative Learning, Mahidol University, Nakhon Pathom, Thailand
- Faculty of Dentistry, Mahidol University, Bangkok, Thailand
| | - Yodchanan Wongsawat
- Department of Biomedical Engineering, Faculty of Engineering, Mahidol University, Nakhon Pathom, Thailand
| | - Somsak Mitrirattanakul
- Department of Masticatory Science, Faculty of Dentistry, Mahidol University, Bangkok, Thailand.
| |
Collapse
|
22
|
Liu L, Feng J, Li J, Chen W, Mao Z, Tan X. Multi-layer CNN-LSTM network with self-attention mechanism for robust estimation of nonlinear uncertain systems. Front Neurosci 2024; 18:1379495. [PMID: 38638692 PMCID: PMC11024260 DOI: 10.3389/fnins.2024.1379495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Accepted: 03/19/2024] [Indexed: 04/20/2024] Open
Abstract
Introduction With the help of robot technology, intelligent rehabilitation of patients with lower limb motor dysfunction caused by stroke can be realized. A key factor constraining the clinical application of rehabilitation robots is how to realize pattern recognition of human movement intentions by using the surface electromyography (sEMG) sensors to ensure unhindered human-robot interaction. Methods A multilayer CNN-LSTM prediction network incorporating the self-attention mechanism (SAM) is proposed, in this paper, which can extract and learn the periodic and trend characteristics of the sEMG signals, and realize the accurate autoregressive prediction of the human motion information. Firstly, the multilayer CNN-LSTM network utilizes the CNN layer for initial feature extraction of data, and the LSTM network is used to improve the enhancement of the historical time-series features. Then, the SAM is used to improve the global feature extraction performance and parallel computation speed of the network. Results In comparison with existing test is carried out using actual data from five healthy subjects as well as a clinical hemiplegic patient to verify the superiority and practicality of the proposed algorithm. The results show that most of the model's prediction R > 0.9 for different motion states of healthy subjects; in the experiments oriented to the motion characteristics of patient subjects, the angle prediction results of R > 0.99 for the untrained data on the affected side, which proves that our proposed model also has a better effect on the angle prediction of the affected side. Discussion The main contribution of this paper is to realize continuous motion estimation of ankle joint for healthy and hemiplegic individuals under non-ideal conditions (weak sEMG signals, muscle fatigue, high muscle tension, etc.), which improves the pattern recognition accuracy and robustness of the sEMG sensor-based system.
Collapse
Affiliation(s)
- Lin Liu
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning, China
| | - Jun Feng
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning, China
- State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, Liaoning, China
| | - Jiwei Li
- State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, Liaoning, China
- Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang, Liaoning, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Wanxin Chen
- State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, Liaoning, China
- Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang, Liaoning, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Zhizhong Mao
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning, China
| | - Xiaowei Tan
- State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, Liaoning, China
- Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang, Liaoning, China
| |
Collapse
|
23
|
Yue H, Chen Z, Guo W, Sun L, Dai Y, Wang Y, Ma W, Fan X, Wen W, Lei W. Research and application of deep learning-based sleep staging: Data, modeling, validation, and clinical practice. Sleep Med Rev 2024; 74:101897. [PMID: 38306788 DOI: 10.1016/j.smrv.2024.101897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 12/30/2023] [Accepted: 01/04/2024] [Indexed: 02/04/2024]
Abstract
Over the past few decades, researchers have attempted to simplify and accelerate the process of sleep stage classification through various approaches; however, only a few such approaches have gained widespread acceptance. Artificial intelligence technology, particularly deep learning, is promising for earning the trust of the sleep medicine community in automated sleep-staging systems, thus facilitating its application in clinical practice and integration into daily life. We aimed to comprehensively review the latest methods that are applying deep learning for enhancing sleep staging efficiency and accuracy. Starting from the requisite "data" for constructing deep learning algorithms, we elucidated the current landscape of this domain and summarized the fundamental modeling process, encompassing signal selection, data pre-processing, model architecture, classification tasks, and performance metrics. Furthermore, we reviewed the applications of automated sleep staging in scenarios such as sleep-disorder screening, diagnostic procedures, and health monitoring and management. Finally, we conducted an in-depth analysis and discussion of the challenges and future in intelligent sleep staging, particularly focusing on large-scale sleep datasets, interdisciplinary collaborations, and human-computer interactions.
Collapse
Affiliation(s)
- Huijun Yue
- Otorhinolaryngology Hospital, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, People's Republic of China
| | - Zhuqi Chen
- Otorhinolaryngology Hospital, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, People's Republic of China
| | - Wenbin Guo
- Otorhinolaryngology Hospital, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, People's Republic of China
| | - Lin Sun
- Otorhinolaryngology Hospital, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, People's Republic of China
| | - Yidan Dai
- School of Computer Science, South China Normal University, Guangzhou, People's Republic of China
| | - Yiming Wang
- Otorhinolaryngology Hospital, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, People's Republic of China
| | - Wenjun Ma
- School of Computer Science, South China Normal University, Guangzhou, People's Republic of China
| | - Xiaomao Fan
- College of Big Data and Internet, Shenzhen Technology University, Shenzhen, People's Republic of China
| | - Weiping Wen
- Otorhinolaryngology Hospital, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, People's Republic of China; Department of Otolaryngology, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, People's Republic of China.
| | - Wenbin Lei
- Otorhinolaryngology Hospital, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, People's Republic of China.
| |
Collapse
|
24
|
Zhu H, Xu Y, Wu Y, Shen N, Wang L, Chen C, Chen W. A Sequential End-to-End Neonatal Sleep Staging Model with Squeeze and Excitation Blocks and Sequential Multi-Scale Convolution Neural Networks. Int J Neural Syst 2024; 34:2450013. [PMID: 38369905 DOI: 10.1142/s0129065724500138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
Automatic sleep staging offers a quick and objective assessment for quantitatively interpreting sleep stages in neonates. However, most of the existing studies either do not encompass any temporal information, or simply apply neural networks to exploit temporal information at the expense of high computational overhead and modeling ambiguity. This limits the application of these methods to multiple scenarios. In this paper, a sequential end-to-end sleep staging model, SeqEESleepNet, which is competent for parallelly processing sequential epochs and has a fast training rate to adapt to different scenarios, is proposed. SeqEESleepNet consists of a sequence epoch generation (SEG) module, a sequential multi-scale convolution neural network (SMSCNN) and squeeze and excitation (SE) blocks. The SEG module expands independent epochs into sequential signals, enabling the model to learn the temporal information between sleep stages. SMSCNN is a multi-scale convolution neural network that can extract both multi-scale features and temporal information from the signal. Subsequently, the followed SE block can reassign the weights of features through mapping and pooling. Experimental results exhibit that in a clinical dataset, the proposed method outperforms the state-of-the-art approaches, achieving an overall accuracy, F1-score, and Kappa coefficient of 71.8%, 71.8%, and 0.684 on a three-class classification task with a single channel EEG signal. Based on our overall results, we believe the proposed method could pave the way for convenient multi-scenario neonatal sleep staging methods.
Collapse
Affiliation(s)
- Hangyu Zhu
- Center for Intelligent Medical Electronics, School of Information Science and Technology, Fudan University, Shanghai 200433, P. R. China
| | - Yan Xu
- Department of Neurology, Children's Hospital of Fudan University, National Children's Medical Center, Shanghai, P. R. China
| | - Yonglin Wu
- Center for Intelligent Medical Electronics, School of Information Science and Technology, Fudan University, Shanghai 200433, P. R. China
| | - Ning Shen
- Center for Intelligent Medical Electronics, School of Information Science and Technology, Fudan University, Shanghai 200433, P. R. China
| | - Laishuan Wang
- Department of Neurology, Children's Hospital of Fudan University, National Children's Medical Center, Shanghai, P. R. China
| | - Chen Chen
- Human Phenome Institute, Fudan University, 825 Zhangheng Road, Shanghai 201203, P. R. China
| | - Wei Chen
- Center for Intelligent Medical Electronics, School of Information Science and Technology, Fudan University, Shanghai 200433, P. R. China
| |
Collapse
|
25
|
Duan L, Zhang Y, Huang Z, Ma B, Wang W, Qiao Y. Dual-Teacher Feature Distillation: A Transfer Learning Method for Insomniac PSG Staging. IEEE J Biomed Health Inform 2024; 28:1730-1741. [PMID: 38032775 DOI: 10.1109/jbhi.2023.3337261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2023]
Abstract
Insomnia is the most common sleep disorder linked with adverse long-term medical and psychiatric outcomes. Automatic sleep staging plays a crucial role in aiding doctors to diagnose insomnia disorder. Only a few studies have been conducted to develop automatic sleep staging methods for insomniacs, and most of them have utilized transfer learning methods, which involve pre-training models on healthy individuals and then fine-tuning them on insomniacs. Unfortunately, significant differences in feature distribution between the two subject groups impede the transfer performance, highlighting the need to effectively integrate the features of healthy subjects and insomniacs. In this paper, we propose a dual-teacher cross-domain knowledge transfer method based on the feature-based knowledge distillation to improve the performance of sleep staging for insomniacs. Specifically, the insomnia teacher directly learns from insomniacs and feeds the corresponding domain-specific features into the student network, while the health domain teacher guide the student network to learn domain-generic features. During the training process, we adopt the OFD (Overhaul of Feature Distillation) method to build the health domain teacher. We conducted the experiments to validate the proposed method, using the Sleep-EDF database as the source domain and the CAP-Database as the target domain. The results demonstrate that our method surpasses advanced techniques, achieving an average sleep staging accuracy of 80.56% on the CAP-Database. Furthermore, our method exhibits promising performance on the private dataset.
Collapse
|
26
|
Pei W, Li Y, Wen P, Yang F, Ji X. An automatic method using MFCC features for sleep stage classification. Brain Inform 2024; 11:6. [PMID: 38340211 DOI: 10.1186/s40708-024-00219-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Accepted: 01/19/2024] [Indexed: 02/12/2024] Open
Abstract
Sleep stage classification is a necessary step for diagnosing sleep disorders. Generally, experts use traditional methods based on every 30 seconds (s) of the biological signals, such as electrooculograms (EOGs), electrocardiograms (ECGs), electromyograms (EMGs), and electroencephalograms (EEGs), to classify sleep stages. Recently, various state-of-the-art approaches based on a deep learning model have been demonstrated to have efficient and accurate outcomes in sleep stage classification. In this paper, a novel deep convolutional neural network (CNN) combined with a long short-time memory (LSTM) model is proposed for sleep scoring tasks. A key frequency domain feature named Mel-frequency Cepstral Coefficient (MFCC) is extracted from EEG and EMG signals. The proposed method can learn features from frequency domains on different bio-signal channels. It firstly extracts the MFCC features from multi-channel signals, and then inputs them to several convolutional layers and an LSTM layer. Secondly, the learned representations are fed to a fully connected layer and a softmax classifier for sleep stage classification. The experiments are conducted on two widely used sleep datasets, Sleep Heart Health Study (SHHS) and Vincent's University Hospital/University College Dublin Sleep Apnoea (UCDDB) to test the effectiveness of the method. The results of this study indicate that the model can perform well in the classification of sleep stages using the features of the 2-dimensional (2D) MFCC feature. The advantage of using the feature is that it can be used to input a two-dimensional data stream, which can be used to retain information about each sleep stage. Using 2D data streams can reduce the time it takes to retrieve the data from the one-dimensional stream. Another advantage of this method is that it eliminates the need for deep layers, which can help improve the performance of the model. For instance, by reducing the number of layers, our seven layers of the model structure takes around 400 s to train and test 100 subjects in the SHHS1 dataset. Its best accuracy and Cohen's kappa are 82.35% and 0.75 for the SHHS dataset, and 73.07% and 0.63 for the UCDDB dataset, respectively.
Collapse
Affiliation(s)
- Wei Pei
- School of Mathematics, Physics and Computing, University of Southern Queensland, Toowoomba, QLD, 4350, Australia.
| | - Yan Li
- School of Mathematics, Physics and Computing, University of Southern Queensland, Toowoomba, QLD, 4350, Australia
| | - Peng Wen
- School of Engineering, University of Southern Queensland, Toowoomba, QLD, 4350, Australia
| | - Fuwen Yang
- School of Engineering and Built Environment, Griffith University, Gold Coast, QLD, 4222, Australia
| | - Xiaopeng Ji
- School of Mathematics, Physics and Computing, University of Southern Queensland, Toowoomba, QLD, 4350, Australia
| |
Collapse
|
27
|
Wei Y, Zhu Y, Zhou Y, Yu X, Luo Y. Automatic Sleep Staging Based on Contextual Scalograms and Attention Convolution Neural Network Using Single-Channel EEG. IEEE J Biomed Health Inform 2024; 28:801-811. [PMID: 37955995 DOI: 10.1109/jbhi.2023.3332503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Single-channel EEG based sleep staging is of interest to researchers due to its broad application prospect in daily sleep monitoring recently. We proposed using contextual scalograms as input and developed a convolutional neural network with attention modules named Co-ScaleNet for sleep staging. The contextual scalograms were obtained by combining the same color channels of three original RGB scalograms from consecutive epochs, and a simple and efficient data augmentation was designed according to their various forms. The Co-ScaleNet consists of two main parts. Firstly, three parallel convolutional branches with attention modules correspondingly extract and fuse features from contextual scalograms at the top layers. The remaining part is a stack of lightweight blocks. We achieved an overall accuracy of 87.0% for healthy individuals, 84.7% for depressed patients. And we obtained comparable performance on the public Sleep-EDFx (82.8%), ISRUC (84.6%) and SHHS datasets (87.7%), including a high recall of N1. The contextual scalograms of R channel as input achieved the best performance, which conform to the features of interest in visual scoring. The attention modules improved the recall of N1 and N3. Overall, the contextual scalograms provided a novel scheme for both contextual information extraction and data augmentation. Our study successfully expanded its application to depression datasets, as well as patients with sleep apnea, demonstrating its wide applicability.
Collapse
|
28
|
Jain R, G RA. Modality-Specific Feature Selection, Data Augmentation and Temporal Context for Improved Performance in Sleep Staging. IEEE J Biomed Health Inform 2024; 28:1031-1042. [PMID: 38051608 DOI: 10.1109/jbhi.2023.3339713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/07/2023]
Abstract
This work attempts to design an effective sleep staging system, making the best use of the available signals, strategies, and features in the literature. It must not only perform well on different datasets comprising healthy and clinical populations but also achieve good accuracy in cross-dataset experiments. Toward this end, we propose a model comprising multiple binary classifiers in a hierarchical fashion, where, at each level, one or more of EEG, EOG, and EMG are selected to best differentiate between two sleep stages. The best set of 100 features is chosen out of all the features derived from selected signals. The class imbalance in data is addressed by random undersampling and boosting techniques with decision trees as weak learners. Temporal context and data augmentation are used to improve the performance. We also evaluate the performance of our model by training and testing on different datasets. We compare the results of five approaches: using only EEG, EEG+EOG, EEG+EMG+EOG, EEG+EMG, and selective modality with a specific combination of EEG, EMG, and/or EOG at each level. The best results are obtained by considering features from EEG+EMG+EOG at each hierarchical level. The proposed model achieves average accuracies of 83.1%, 90.0%, 84.4%, 82.1%, 81.5%, 79.9%, and 73.7% on Sleep-EDF, Exp Sleep-EDF, ISRUC-S1, S2 and S3, DRMS-SUB, and DRMS-PAT datasets, respectively. For all the datasets except DRMS-SUB, the proposed method outperforms all the state-of-the-art approaches. Cross-dataset performance exceeds 80% for all datasets except DRMS-PAT; independent of whether the test data is from normal subjects or patients.
Collapse
|
29
|
Korkalainen H, Kainulainen S, Islind AS, Óskarsdóttir M, Strassberger C, Nikkonen S, Töyräs J, Kulkas A, Grote L, Hedner J, Sund R, Hrubos-Strom H, Saavedra JM, Ólafsdóttir KA, Ágústsson JS, Terrill PI, McNicholas WT, Arnardóttir ES, Leppänen T. Review and perspective on sleep-disordered breathing research and translation to clinics. Sleep Med Rev 2024; 73:101874. [PMID: 38091850 DOI: 10.1016/j.smrv.2023.101874] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Revised: 09/18/2023] [Accepted: 11/09/2023] [Indexed: 01/23/2024]
Abstract
Sleep-disordered breathing, ranging from habitual snoring to severe obstructive sleep apnea, is a prevalent public health issue. Despite rising interest in sleep and awareness of sleep disorders, sleep research and diagnostic practices still rely on outdated metrics and laborious methods reducing the diagnostic capacity and preventing timely diagnosis and treatment. Consequently, a significant portion of individuals affected by sleep-disordered breathing remain undiagnosed or are misdiagnosed. Taking advantage of state-of-the-art scientific, technological, and computational advances could be an effective way to optimize the diagnostic and treatment pathways. We discuss state-of-the-art multidisciplinary research, review the shortcomings in the current practices of SDB diagnosis and management in adult populations, and provide possible future directions. We critically review the opportunities for modern data analysis methods and machine learning to combine multimodal information, provide a perspective on the pitfalls of big data analysis, and discuss approaches for developing analysis strategies that overcome current limitations. We argue that large-scale and multidisciplinary collaborative efforts based on clinical, scientific, and technical knowledge and rigorous clinical validation and implementation of the outcomes in practice are needed to move the research of sleep-disordered breathing forward, thus increasing the quality of diagnostics and treatment.
Collapse
Affiliation(s)
- Henri Korkalainen
- Department of Technical Physics, University of Eastern Finland, Kuopio, Finland; Diagnostic Imaging Center, Kuopio University Hospital, Kuopio, Finland.
| | - Samu Kainulainen
- Department of Technical Physics, University of Eastern Finland, Kuopio, Finland; Diagnostic Imaging Center, Kuopio University Hospital, Kuopio, Finland
| | - Anna Sigridur Islind
- Department of Computer Science, Reykjavik University, Reykjavik, Iceland; Reykjavik University Sleep Institute, Reykjavik University, Reykjavik, Iceland
| | - María Óskarsdóttir
- Department of Computer Science, Reykjavik University, Reykjavik, Iceland
| | - Christian Strassberger
- Centre for Sleep and Wake Disorders, Sahlgrenska Academy, Gothenburg University, Gothenburg, Sweden
| | - Sami Nikkonen
- Department of Technical Physics, University of Eastern Finland, Kuopio, Finland; Diagnostic Imaging Center, Kuopio University Hospital, Kuopio, Finland
| | - Juha Töyräs
- Department of Technical Physics, University of Eastern Finland, Kuopio, Finland; School of Electrical Engineering and Computer Science, The University of Queensland, Brisbane, Australia; Science Service Center, Kuopio University Hospital, Kuopio, Finland
| | - Antti Kulkas
- Department of Technical Physics, University of Eastern Finland, Kuopio, Finland; Department of Clinical Neurophysiology, Seinäjoki Central Hospital, Seinäjoki, Finland
| | - Ludger Grote
- Centre for Sleep and Wake Disorders, Sahlgrenska Academy, Gothenburg University, Gothenburg, Sweden; Sleep Disorders Centre, Pulmonary Medicine, Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Jan Hedner
- Centre for Sleep and Wake Disorders, Sahlgrenska Academy, Gothenburg University, Gothenburg, Sweden; Sleep Disorders Centre, Pulmonary Medicine, Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Reijo Sund
- School of Medicine, Institute of Clinical Medicine, University of Eastern Finland, Kuopio, Finland
| | - Harald Hrubos-Strom
- Institute of Clinical Medicine, University of Oslo, Oslo, Norway; Department of Ear, Nose and Throat Surgery, Akershus University Hospital, Lørenskog, Norway
| | - Jose M Saavedra
- Reykjavik University Sleep Institute, Reykjavik University, Reykjavik, Iceland; Physical Activity, Physical Education, Sport and Health (PAPESH) Research Group, Department of Sports Science, Reykjavik University, Reykjavik, Iceland
| | | | | | - Philip I Terrill
- School of Electrical Engineering and Computer Science, The University of Queensland, Brisbane, Australia
| | - Walter T McNicholas
- School of Medicine, University College Dublin, and Department of Respiratory and Sleep Medicine, St Vincent's Hospital Group, Dublin Ireland
| | - Erna Sif Arnardóttir
- Reykjavik University Sleep Institute, Reykjavik University, Reykjavik, Iceland; Landspitali - The National University Hospital of Iceland, Reykjavik, Iceland
| | - Timo Leppänen
- Department of Technical Physics, University of Eastern Finland, Kuopio, Finland; Diagnostic Imaging Center, Kuopio University Hospital, Kuopio, Finland; School of Electrical Engineering and Computer Science, The University of Queensland, Brisbane, Australia
| |
Collapse
|
30
|
Masad IS, Alqudah A, Qazan S. Automatic classification of sleep stages using EEG signals and convolutional neural networks. PLoS One 2024; 19:e0297582. [PMID: 38277364 PMCID: PMC10817107 DOI: 10.1371/journal.pone.0297582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 01/08/2024] [Indexed: 01/28/2024] Open
Abstract
Sleep stages classification is one of the new topics in studying human life quality because it plays a crucial role in getting a healthy lifestyle. Abnormal changes or absence of normal sleep may lead to different diseases such as heart-related diseases, diabetes, and obesity. In general, sleep staging analysis can be performed using electroencephalography (EEG) signals. This study proposes a convolutional neural network (CNN) based methodology for sleep stage classification using EEG signals taken by six channels and transformed into time-frequency analysis images. The proposed methodology consists of three major steps: (i) segment the EEG signal into epochs with 30 seconds in length, (ii) convert epochs into 2D representation using time-frequency analysis, and (iii) feed the 2D time-frequency analysis to the 2D CNN. The results showed that the proposed methodology is robust and achieved a very high accuracy of 99.39% for channel C4-A1. All other channels have accuracy values above 98.5%, which indicates that any channel can be used for sleep stage classification with high accuracy. The proposed methodology outperformed the methods in the literature in terms of overall accuracy or single channel accuracy. It is expected to provide a great benefit for physicians, especially neurologists; by providing them with a new powerful tool to support the clinical diagnosis of sleep-related diseases.
Collapse
Affiliation(s)
- Ihssan S. Masad
- Department of Biomedical Systems and Informatics Engineering, Yarmouk University, Irbid, Jordan
| | - Amin Alqudah
- Department of Computer Engineering, Yarmouk University, Irbid, Jordan
| | - Shoroq Qazan
- Department of Biomedical Systems and Informatics Engineering, Yarmouk University, Irbid, Jordan
| |
Collapse
|
31
|
Liu H, Zhang H, Li B, Yu X, Zhang Y, Penzel T. MSleepNet: A Semi-Supervision-Based Multiview Hybrid Neural Network for Simultaneous Sleep Arousal and Sleep Stage Detection. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT 2024; 73:1-9. [DOI: 10.1109/tim.2023.3348898] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2025]
Affiliation(s)
- Hongmei Liu
- College of Electronic and Information Engineering, Southwest University, Chongqing, China
| | - Haibo Zhang
- School of Computing, University of Otago, Dunedin, New Zealand
| | - Baozhu Li
- Internet of Things Smart City Innovation Platform, Zhuhai Fudan Innovation Institute, Zhuhai, China
| | - Xinge Yu
- Department of Biomedical Engineering, City University of Hong Kong, Kowloon Tong, Hong Kong
| | - Yuan Zhang
- College of Electronic and Information Engineering, Southwest University, Chongqing, China
| | - Thomas Penzel
- Interdisciplinary Sleep Medicine Center, Charité—Universitätsmedizin, Berlin, Germany
| |
Collapse
|
32
|
Brodersen PJN, Alfonsa H, Krone LB, Blanco-Duque C, Fisk AS, Flaherty SJ, Guillaumin MCC, Huang YG, Kahn MC, McKillop LE, Milinski L, Taylor L, Thomas CW, Yamagata T, Foster RG, Vyazovskiy VV, Akerman CJ. Somnotate: A probabilistic sleep stage classifier for studying vigilance state transitions. PLoS Comput Biol 2024; 20:e1011793. [PMID: 38232122 PMCID: PMC10824458 DOI: 10.1371/journal.pcbi.1011793] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 01/29/2024] [Accepted: 01/02/2024] [Indexed: 01/19/2024] Open
Abstract
Electrophysiological recordings from freely behaving animals are a widespread and powerful mode of investigation in sleep research. These recordings generate large amounts of data that require sleep stage annotation (polysomnography), in which the data is parcellated according to three vigilance states: awake, rapid eye movement (REM) sleep, and non-REM (NREM) sleep. Manual and current computational annotation methods ignore intermediate states because the classification features become ambiguous, even though intermediate states contain important information regarding vigilance state dynamics. To address this problem, we have developed "Somnotate"-a probabilistic classifier based on a combination of linear discriminant analysis (LDA) with a hidden Markov model (HMM). First we demonstrate that Somnotate sets new standards in polysomnography, exhibiting annotation accuracies that exceed human experts on mouse electrophysiological data, remarkable robustness to errors in the training data, compatibility with different recording configurations, and an ability to maintain high accuracy during experimental interventions. However, the key feature of Somnotate is that it quantifies and reports the certainty of its annotations. We leverage this feature to reveal that many intermediate vigilance states cluster around state transitions, whereas others correspond to failed attempts to transition. This enables us to show for the first time that the success rates of different types of transition are differentially affected by experimental manipulations and can explain previously observed sleep patterns. Somnotate is open-source and has the potential to both facilitate the study of sleep stage transitions and offer new insights into the mechanisms underlying sleep-wake dynamics.
Collapse
Affiliation(s)
- Paul J. N. Brodersen
- Department of Pharmacology, University of Oxford; Mansfield Road, Oxford, United Kingdom
| | - Hannah Alfonsa
- Department of Pharmacology, University of Oxford; Mansfield Road, Oxford, United Kingdom
| | - Lukas B. Krone
- Department of Physiology, Anatomy and Genetics, University of Oxford; Parks Road, United Kingdom
| | - Cristina Blanco-Duque
- Department of Physiology, Anatomy and Genetics, University of Oxford; Parks Road, United Kingdom
| | - Angus S. Fisk
- Nuffield Department of Clinical Neurosciences, University of Oxford; John Radcliffe Hospital, Oxford, United Kingdom
| | - Sarah J. Flaherty
- Department of Physiology, Anatomy and Genetics, University of Oxford; Parks Road, United Kingdom
| | - Mathilde C. C. Guillaumin
- Nuffield Department of Clinical Neurosciences, University of Oxford; John Radcliffe Hospital, Oxford, United Kingdom
- Sleep and Circadian Neuroscience Institute, University of Oxford; Oxford, United Kingdom
- Institute for Neuroscience, Department of Health Sciences and Technology, ETH Zurich; Schwerzenbach, Switzerland
| | - Yi-Ge Huang
- Department of Physiology, Anatomy and Genetics, University of Oxford; Parks Road, United Kingdom
| | - Martin C. Kahn
- Department of Physiology, Anatomy and Genetics, University of Oxford; Parks Road, United Kingdom
| | - Laura E. McKillop
- Department of Physiology, Anatomy and Genetics, University of Oxford; Parks Road, United Kingdom
| | - Linus Milinski
- Department of Physiology, Anatomy and Genetics, University of Oxford; Parks Road, United Kingdom
| | - Lewis Taylor
- Nuffield Department of Clinical Neurosciences, University of Oxford; John Radcliffe Hospital, Oxford, United Kingdom
| | - Christopher W. Thomas
- Department of Physiology, Anatomy and Genetics, University of Oxford; Parks Road, United Kingdom
| | - Tomoko Yamagata
- Nuffield Department of Clinical Neurosciences, University of Oxford; John Radcliffe Hospital, Oxford, United Kingdom
| | - Russell G. Foster
- Sleep and Circadian Neuroscience Institute, University of Oxford; Oxford, United Kingdom
| | - Vladyslav V. Vyazovskiy
- Department of Physiology, Anatomy and Genetics, University of Oxford; Parks Road, United Kingdom
| | - Colin J. Akerman
- Department of Pharmacology, University of Oxford; Mansfield Road, Oxford, United Kingdom
| |
Collapse
|
33
|
Zhu H, Wu Y, Guo Y, Fu C, Shu F, Yu H, Chen W, Chen C. Towards Real-Time Sleep Stage Prediction and Online Calibration Based on Architecturally Switchable Deep Learning Models. IEEE J Biomed Health Inform 2024; 28:470-481. [PMID: 37878423 DOI: 10.1109/jbhi.2023.3327470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2023]
Abstract
Despite the recent advances in automatic sleep staging, few studies have focused on real-time sleep staging to promote the regulation of sleep or the intervention of sleep disorders. In this paper, a novel network named SwSleepNet, that can handle both precisely offline sleep staging, and online sleep stages prediction and calibration is proposed. For offline analysis, the proposed network coordinates sequence broadening module (SBM), sequential CNN (SCNN), squeeze and excitation (SE) block, and sequence consolidation module (SCM) to balance the operational efficiency of the network and the comprehensive feature extraction. For online analysis, only SCNN and SE are involved in predicting the sleep stage within a short-time segment of the recordings. Once more than two successive segments have disparate predictions, the calibration mechanism will be triggered, and contextual information will be involved. In addition, to investigate the appropriate time of the segment that is suitable to predict a sleep stage, segments with five-second, three-second, and two-second data are analyzed. The performance of SwSleepNet is validated on two publicly available datasets Sleep-EDF Expanded and Montreal Archive of Sleep Studies (MASS), and one clinical dataset Huashan Hospital Fudan University (HSFU), with the offline accuracy of 84.5%, 86.7%, and 81.8%, respectively, which outperforms the state-of-the-art methods. Additionally, for the online sleep staging, the dedicated calibration mechanism allows SwSleepNet to achieve high accuracy over 80% on three datasets with the short-time segments, demonstrating the robustness and stability of SwSleepNet. This study presents a real-time sleep staging architecture, which is expected to pave the way for accurate sleep regulation and intervention.
Collapse
|
34
|
Li J, Wu C, Pan J, Wang F. Few-shot EEG sleep staging based on transductive prototype optimization network. Front Neuroinform 2023; 17:1297874. [PMID: 38125309 PMCID: PMC10730933 DOI: 10.3389/fninf.2023.1297874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 11/13/2023] [Indexed: 12/23/2023] Open
Abstract
Electroencephalography (EEG) is a commonly used technology for monitoring brain activities and diagnosing sleep disorders. Clinically, doctors need to manually stage sleep based on EEG signals, which is a time-consuming and laborious task. In this study, we propose a few-shot EEG sleep staging termed transductive prototype optimization network (TPON) method, which aims to improve the performance of EEG sleep staging. Compared with traditional deep learning methods, TPON uses a meta-learning algorithm, which generalizes the classifier to new classes that are not visible in the training set, and only have a few examples for each new class. We learn the prototypes of existing objects through meta-training, and capture the sleep features of new objects through the "learn to learn" method of meta-learning. The prototype distribution of the class is optimized and captured by using support set and unlabeled high confidence samples to increase the authenticity of the prototype. Compared with traditional prototype networks, TPON can effectively solve too few samples in few-shot learning and improve the matching degree of prototypes in prototype network. The experimental results on the public SleepEDF-2013 dataset show that the proposed algorithm outperform than most advanced algorithms in the overall performance. In addition, we experimentally demonstrate the feasibility of cross-channel recognition, which indicates that there are many similar sleep EEG features between different channels. In future research, we can further explore the common features among different channels and investigate the combination of universal features in sleep EEG. Overall, our method achieves high accuracy in sleep stage classification, demonstrating the effectiveness of this approach and its potential applications in other medical fields.
Collapse
Affiliation(s)
| | | | | | - Fei Wang
- School of Software, South China Normal University, Guangzhou, China
| |
Collapse
|
35
|
Yeckle J, Manian V. Automated Sleep Stage Classification in Home Environments: An Evaluation of Seven Deep Neural Network Architectures. SENSORS (BASEL, SWITZERLAND) 2023; 23:8942. [PMID: 37960641 PMCID: PMC10649735 DOI: 10.3390/s23218942] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 10/25/2023] [Accepted: 10/31/2023] [Indexed: 11/15/2023]
Abstract
Sleep is an essential human physiological need that has garnered increasing scientific attention due to the burgeoning prevalence of sleep-related disorders and their impact on public health. Among contemporary challenges, the demand for authentic sleep monitoring outside the confines of specialized laboratories, ideally within the home environment, has arisen. Addressing this, we explore the development of pragmatic approaches that facilitate implementation within domestic settings. Such approaches necessitate the deployment of streamlined, computationally efficient automated classifiers. In pursuit of a sleep stage classifier tailored for home use, this study rigorously assessed seven conventional neural network architectures prominent in deep learning (LeNet, ResNet, VGG, MLP, LSTM-CNN, LSTM, BLSTM). Leveraging sleep recordings from a cohort of 20 subjects, we elucidate that LeNet, VGG, and ResNet exhibit superior performance compared to recent advancements reported in the literature. Furthermore, a comprehensive architectural analysis was conducted, illuminating the strengths and limitations of each in the context of home-based sleep monitoring. Our findings distinctly identify LeNet as the most-amenable architecture for this purpose, with LSTM and BLSTM demonstrating relatively lesser compatibility. Ultimately, this research substantiates the feasibility of automating sleep stage classification employing lightweight neural networks, thereby accommodating scenarios with constrained computational resources. This advancement aims at revolutionizing the field of sleep monitoring, making it more accessible and reliable for individuals in their homes.
Collapse
Affiliation(s)
- Jaime Yeckle
- Department of Electrical and Computer Engineering, University of Puerto Rico, Mayaguez, PR 00681, USA;
| | | |
Collapse
|
36
|
Liu Z, Qin M, Lu Y, Luo S, Zhang Q. DenSleepNet: DenseNet based model for sleep staging with two-frequency feature fusion and coordinate attention. Biomed Eng Lett 2023; 13:751-761. [PMID: 37872995 PMCID: PMC10590351 DOI: 10.1007/s13534-023-00301-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 06/15/2023] [Accepted: 06/23/2023] [Indexed: 10/25/2023] Open
Abstract
Sleep staging is often applied to assess the quality of sleep and also be used to prevent and monitor psychiatric disorders caused by sleep. However, it remains a challenge to extract the discriminative features of salient waveforms in sleep EEG and enable the network to effectively classify sleep stages by emphasizing these crucial features, thus achieving higher accuracy. In this study, an end-to-end deep learning model based on DenseNet for automatic sleep staging is designed and constructed. In the framework, two convolutional branches are devised to extract the underlying features (Two-Frequency Feature) at various frequencies, which are then fused and input into the DenseNet module to extract salient waveform features. After that, the Coordinate Attention mechanism is employed to enhance the localization of salient waveform features by emphasizing the position of salient waveforms and the spatial relationship across the entire frequency spectrum. Finally, the obtained features are accessed to the fully connected for sleep staging. The model was validated with a 20-fold cross-validation procedure on two public available datasets, and the overall accuracy, kappa coefficient, and MF1 score reached 92.9%, 78.7, 0.86 and 90.0%, 75.8, 0.80 on Sleep-EDF-20 and Sleep-EDFx, respectively. Experimental results show that the proposed model achieves competitive performance for sleep staging compared with the reported approaches under the identical conditions.
Collapse
Affiliation(s)
- Zhi Liu
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
| | - Meiqiao Qin
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
| | - Yunhua Lu
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
| | - Sixin Luo
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
| | - Qinhan Zhang
- School of Artificial Intelligence, Chongqing University of Technology, Chongqing, China
| |
Collapse
|
37
|
Huang X, Schmelter F, Irshad MT, Piet A, Nisar MA, Sina C, Grzegorzek M. Optimizing sleep staging on multimodal time series: Leveraging borderline synthetic minority oversampling technique and supervised convolutional contrastive learning. Comput Biol Med 2023; 166:107501. [PMID: 37742416 DOI: 10.1016/j.compbiomed.2023.107501] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 08/15/2023] [Accepted: 09/15/2023] [Indexed: 09/26/2023]
Abstract
Sleep is an important research area in nutritional medicine that plays a crucial role in human physical and mental health restoration. It can influence diet, metabolism, and hormone regulation, which can affect overall health and well-being. As an essential tool in the sleep study, the sleep stage classification provides a parsing of sleep architecture and a comprehensive understanding of sleep patterns to identify sleep disorders and facilitate the formulation of targeted sleep interventions. However, the class imbalance issue is typically salient in sleep datasets, which severely affects classification performances. To address this issue and to extract optimal multimodal features of EEG, EOG, and EMG that can improve the accuracy of sleep stage classification, a Borderline Synthetic Minority Oversampling Technique (B-SMOTE)-Based Supervised Convolutional Contrastive Learning (BST-SCCL) is proposed, which can avoid the risk of data mismatch between various sleep knowledge domains (varying health conditions and annotation rules) and strengthening learning characteristics of the N1 stage from the pair-wise segments comparison strategy. The lightweight residual network architecture with a novel truncated cross-entropy loss function is designed to accommodate multimodal time series and boost the training speed and performance stability. The proposed model has been validated on four well-known public sleep datasets (Sleep-EDF-20, Sleep-EDF-78, ISRUC-1, and ISRUC-3) and its superior performance (overall accuracy of 91.31-92.34%, MF1 of 88.21-90.08%, and Cohen's Kappa coefficient k of 0.87-0.89) has further demonstrated its effectiveness. It shows the great potential of contrastive learning for cross-domain knowledge interaction in precision medicine.
Collapse
Affiliation(s)
- Xinyu Huang
- Institute of Medical Informatics, University of Lübeck, Germany.
| | - Franziska Schmelter
- Institute of Nutritional Medicine, University of Lübeck and University Medical Center Schleswig-Holstein, Lübeck, Germany.
| | | | - Artur Piet
- Institute of Medical Informatics, University of Lübeck, Germany.
| | | | - Christian Sina
- Institute of Nutritional Medicine, University of Lübeck and University Medical Center Schleswig-Holstein, Lübeck, Germany; Fraunhofer Research Institution for Individualized and Cell-Based Medical Engineering (IMTE), Lübeck, Germany.
| | - Marcin Grzegorzek
- Institute of Medical Informatics, University of Lübeck, Germany; Fraunhofer Research Institution for Individualized and Cell-Based Medical Engineering (IMTE), Lübeck, Germany.
| |
Collapse
|
38
|
Lyu J, Shi W, Zhang C, Yeh CH. A Novel Sleep Staging Method Based on EEG and ECG Multimodal Features Combination. IEEE Trans Neural Syst Rehabil Eng 2023; 31:4073-4084. [PMID: 37819827 DOI: 10.1109/tnsre.2023.3323892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/13/2023]
Abstract
Accurate sleep staging evaluates the quality of sleep, supporting the clinical diagnosis and intervention of sleep disorders and related diseases. Although previous attempts to classify sleep stages have achieved high classification performance, little attention has been paid to integrating the rich information in brain and heart dynamics during sleep for sleep staging. In this study, we propose a generalized EEG and ECG multimodal feature combination to classify sleep stages with high efficiency and accuracy. Briefly, a hybrid features combination in terms of multiscale entropy and intrinsic mode function are used to reflect nonlinear dynamics in multichannel EEGs, along with heart rate variability measures over time/frequency domains, and sample entropy across scales are applied for ECGs. For both the max-relevance and min-redundancy method and principal component analysis were used for dimensionality reduction. The selected features were classified by four traditional machine learning classifiers. Macro-F1 score, macro-geometric mean, and Cohen kappa value are adopted to evaluate the classification performance of each class in an imbalanced dataset. Experimental results show that EEG features contribute more to wake stage classification while ECG features contribute more to deep sleep stages. The proposed combination achieves the highest accuracy of 84.3% and the highest kappa value of 0.794 on the support vector machine in the ISRUC-S3 dataset, suggesting the proposed multimodal features combination is promising in accuracy and efficiency compared to other state-of-the-art methods.
Collapse
|
39
|
Zan H, Yildiz A. Multi-task learning for arousal and sleep stage detection using fully convolutional networks. J Neural Eng 2023; 20:056034. [PMID: 37769664 DOI: 10.1088/1741-2552/acfe3a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Accepted: 09/28/2023] [Indexed: 10/03/2023]
Abstract
Objective.Sleep is a critical physiological process that plays a vital role in maintaining physical and mental health. Accurate detection of arousals and sleep stages is essential for the diagnosis of sleep disorders, as frequent and excessive occurrences of arousals disrupt sleep stage patterns and lead to poor sleep quality, negatively impacting physical and mental health. Polysomnography is a traditional method for arousal and sleep stage detection that is time-consuming and prone to high variability among experts.Approach. In this paper, we propose a novel multi-task learning approach for arousal and sleep stage detection using fully convolutional neural networks. Our model, FullSleepNet, accepts a full-night single-channel EEG signal as input and produces segmentation masks for arousal and sleep stage labels. FullSleepNet comprises four modules: a convolutional module to extract local features, a recurrent module to capture long-range dependencies, an attention mechanism to focus on relevant parts of the input, and a segmentation module to output final predictions.Main results.By unifying the two interrelated tasks as segmentation problems and employing a multi-task learning approach, FullSleepNet achieves state-of-the-art performance for arousal detection with an area under the precision-recall curve of 0.70 on Sleep Heart Health Study and Multi-Ethnic Study of Atherosclerosis datasets. For sleep stage classification, FullSleepNet obtains comparable performance on both datasets, achieving an accuracy of 0.88 and an F1-score of 0.80 on the former and an accuracy of 0.83 and an F1-score of 0.76 on the latter.Significance. Our results demonstrate that FullSleepNet offers improved practicality, efficiency, and accuracy for the detection of arousal and classification of sleep stages using raw EEG signals as input.
Collapse
Affiliation(s)
- Hasan Zan
- Vocational School, Mardin Artuklu University, Mardin, Turkey
| | - Abdulnasır Yildiz
- Department of Electrical and Electronics Engineering, Dicle University, Diyarbakir, Turkey
| |
Collapse
|
40
|
Li T, Gong Y, Lv Y, Wang F, Hu M, Wen Y. GAC-SleepNet: A dual-structured sleep staging method based on graph structure and Euclidean structure. Comput Biol Med 2023; 165:107477. [PMID: 37717528 DOI: 10.1016/j.compbiomed.2023.107477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Revised: 08/16/2023] [Accepted: 09/04/2023] [Indexed: 09/19/2023]
Abstract
Sleep staging is a precondition for the diagnosis and treatment of sleep disorders. However, how to fully exploit the relationship between spatial features of the brain and sleep stages is an important task. Many current classical algorithms only extract the characteristic information of the brain in the Euclidean space without considering other spatial structures. In this study, a sleep staging network named GAC-SleepNet is designed. GAC-SleepNet uses the characteristic information in the dual structure of the graph structure and the Euclidean structure for the classification of sleep stages. In the graph structure, this study uses a graph convolutional neural network to learn the deep features of each sleep stage and converts the features in the topological structure into feature vectors by a multilayer perceptron. In the Euclidean structure, this study uses convolutional neural networks to learn the temporal features of sleep information and combine attention mechanism to portray the connection between different sleep periods and EEG signals, while enhancing the description of global features to avoid local optima. In this study, the performance of the proposed network is evaluated on two public datasets. The experimental results show that the dual spatial structure captures more adequate and comprehensive information about sleep features and shows advancement in terms of different evaluation metrics.
Collapse
Affiliation(s)
- Tianxing Li
- School of Electronic Information Engineering, Changchun University of Science and Technology, Changchun, 130000, China
| | - Yulin Gong
- School of Electronic Information Engineering, Changchun University of Science and Technology, Changchun, 130000, China.
| | - Yudan Lv
- The Department of Neurology, First Hospital of Jilin University, Changchun, 130000, China
| | - Fatong Wang
- School of Electronic Information Engineering, Changchun University of Science and Technology, Changchun, 130000, China
| | - Mingjia Hu
- School of Electronic Information Engineering, Changchun University of Science and Technology, Changchun, 130000, China
| | - Yinke Wen
- School of Electronic Information Engineering, Changchun University of Science and Technology, Changchun, 130000, China
| |
Collapse
|
41
|
Mao S, Sejdic E. A Review of Recurrent Neural Network-Based Methods in Computational Physiology. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:6983-7003. [PMID: 35130174 PMCID: PMC10589904 DOI: 10.1109/tnnls.2022.3145365] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Artificial intelligence and machine learning techniques have progressed dramatically and become powerful tools required to solve complicated tasks, such as computer vision, speech recognition, and natural language processing. Since these techniques have provided promising and evident results in these fields, they emerged as valuable methods for applications in human physiology and healthcare. General physiological recordings are time-related expressions of bodily processes associated with health or morbidity. Sequence classification, anomaly detection, decision making, and future status prediction drive the learning algorithms to focus on the temporal pattern and model the nonstationary dynamics of the human body. These practical requirements give birth to the use of recurrent neural networks (RNNs), which offer a tractable solution in dealing with physiological time series and provide a way to understand complex time variations and dependencies. The primary objective of this article is to provide an overview of current applications of RNNs in the area of human physiology for automated prediction and diagnosis within different fields. Finally, we highlight some pathways of future RNN developments for human physiology.
Collapse
|
42
|
Li W, Gao J. Automatic sleep staging by a hybrid model based on deep 1D-ResNet-SE and LSTM with single-channel raw EEG signals. PeerJ Comput Sci 2023; 9:e1561. [PMID: 37810362 PMCID: PMC10557479 DOI: 10.7717/peerj-cs.1561] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 08/10/2023] [Indexed: 10/10/2023]
Abstract
Sleep staging is crucial for assessing sleep quality and diagnosing sleep disorders. Recent advances in deep learning methods with electroencephalogram (EEG) signals have shown remarkable success in automatic sleep staging. However, the use of deeper neural networks may lead to the issues of gradient disappearance and explosion, while the non-stationary nature and low signal-to-noise ratio of EEG signals can negatively impact feature representation. To overcome these challenges, we proposed a novel lightweight sequence-to-sequence deep learning model, 1D-ResNet-SE-LSTM, to classify sleep stages into five classes using single-channel raw EEG signals. Our proposed model consists of two main components: a one-dimensional residual convolutional neural network with a squeeze-and-excitation module to extract and reweight features from EEG signals, and a long short-term memory network to capture the transition rules among sleep stages. In addition, we applied the weighted cross-entropy loss function to alleviate the class imbalance problem. We evaluated the performance of our model on two publicly available datasets; Sleep-EDF Expanded consists of 153 overnight PSG recordings collected from 78 healthy subjects and ISRUC-Sleep includes 100 PSG recordings collected from 100 subjects diagnosed with various sleep disorders, and obtained an overall accuracy rate of 86.39% and 81.97%, respectively, along with corresponding macro average F1-scores of 81.95% and 79.94%. Our model outperforms existing sleep staging models in terms of overall performance metrics and per-class F1-scores for several sleep stages, particularly for the N1 stage, where it achieves F1-scores of 59.00% and 55.53%. The kappa coefficient is 0.812 and 0.766 for the Sleep-EDF Expanded and ISRUC-Sleep datasets, respectively, indicating strong agreement with certified sleep experts. We also investigated the effect of different weight coefficient combinations and sequence lengths of EEG epochs used as input to the model on its performance. Furthermore, the ablation study was conducted to evaluate the contribution of each component to the model's performance. The results demonstrate the effectiveness and robustness of the proposed model in classifying sleep stages, and highlights its potential to reduce human clinicians' workload, making sleep assessment and diagnosis more effective. However, the proposed model is subject to several limitations. Firstly, the model is a sequence-to-sequence network, which requires input sequences of EEG epochs. Secondly, the weight coefficients in the loss function could be further optimized to balance the classification performance of each sleep stage. Finally, apart from the channel attention mechanism, incorporating more advanced attention mechanisms could enhance the model's effectiveness.
Collapse
Affiliation(s)
- Weiming Li
- Shanghai Nuanhe Brain Technology Co. Ltd., Shanghai, China
| | - Junhui Gao
- Shanghai Nuanhe Brain Technology Co. Ltd., Shanghai, China
| |
Collapse
|
43
|
Dai Y, Li X, Liang S, Wang L, Duan Q, Yang H, Zhang C, Chen X, Li L, Li X, Liao X. MultiChannelSleepNet: A Transformer-Based Model for Automatic Sleep Stage Classification With PSG. IEEE J Biomed Health Inform 2023; 27:4204-4215. [PMID: 37289607 DOI: 10.1109/jbhi.2023.3284160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Automatic sleep stage classification plays an essential role in sleep quality measurement and sleep disorder diagnosis. Although many approaches have been developed, most use only single-channel electroencephalogram signals for classification. Polysomnography (PSG) provides multiple channels of signal recording, enabling the use of the appropriate method to extract and integrate the information from different channels to achieve higher sleep staging performance. We present a transformer encoder-based model, MultiChannelSleepNet, for automatic sleep stage classification with multichannel PSG data, whose architecture is implemented based on the transformer encoder for single-channel feature extraction and multichannel feature fusion. In a single-channel feature extraction block, transformer encoders extract features from time-frequency images of each channel independently. Based on our integration strategy, the feature maps extracted from each channel are fused in the multichannel feature fusion block. Another set of transformer encoders further capture joint features, and a residual connection preserves the original information from each channel in this block. Experimental results on three publicly available datasets demonstrate that our method achieves higher classification performance than state-of-the-art techniques. MultiChannelSleepNet is an efficient method to extract and integrate the information from multichannel PSG data, which facilitates precision sleep staging in clinical applications.
Collapse
|
44
|
Jin Z, Jia K. A temporal multi-scale hybrid attention network for sleep stage classification. Med Biol Eng Comput 2023; 61:2291-2303. [PMID: 36997808 DOI: 10.1007/s11517-023-02808-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Accepted: 02/13/2023] [Indexed: 04/01/2023]
Abstract
Sleep is crucial for human health. Automatic sleep stage classification based on polysomnogram (PSG) is meaningful for the diagnosis of sleep disorders, which has attracted extensive attention in recent years. Most existing methods could not fully consider the different transitions of sleep stages and fit the visual inspection of sleep experts simultaneously. To this end, we propose a temporal multi-scale hybrid attention network, namely TMHAN, to automatically achieve sleep staging. The temporal multi-scale mechanism incorporates short-term abrupt and long-term periodic transitions of the successive PSG epochs. Furthermore, the hybrid attention mechanism includes 1-D local attention, 2-D global attention, and 2-D contextual sparse multi-head self-attention for three kinds of sequence-level representations. The concatenated representation is subsequently fed into a softmax layer to train an end-to-end model. Experimental results on two benchmark sleep datasets show that TMHAN obtains the best performance compared with several baselines, demonstrating the effectiveness of our model. In general, our work not only provides good classification performance, but also fits the actual sleep staging processes, which makes contribution for the combination of deep learning and sleep medicine.
Collapse
Affiliation(s)
- Zheng Jin
- Beijing Key Laboratory of Computational Intelligence and Intelligent System, Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
- Beijing Laboratory of Advanced Information Networks, Beijing, 100124, China
| | - Kebin Jia
- Beijing Key Laboratory of Computational Intelligence and Intelligent System, Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China.
- Beijing Laboratory of Advanced Information Networks, Beijing, 100124, China.
| |
Collapse
|
45
|
Zahid AN, Jennum P, Mignot E, Sorensen HBD. MSED: A Multi-Modal Sleep Event Detection Model for Clinical Sleep Analysis. IEEE Trans Biomed Eng 2023; 70:2508-2518. [PMID: 37028083 DOI: 10.1109/tbme.2023.3252368] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
Abstract
Clinical sleep analysis require manual analysis of sleep patterns for correct diagnosis of sleep disorders. However, several studies have shown significant variability in manual scoring of clinically relevant discrete sleep events, such as arousals, leg movements, and sleep disordered breathing (apneas and hypopneas). We investigated whether an automatic method could be used for event detection and if a model trained on all events (joint model) performed better than corresponding event-specific models (single-event models). We trained a deep neural network event detection model on 1653 individual recordings and tested the optimized model on 1000 separate hold-out recordings. F1 scores for the optimized joint detection model were 0.70, 0.63, and 0.62 for arousals, leg movements, and sleep disordered breathing, respectively, compared to 0.65, 0.61, and 0.60 for the optimized single-event models. Index values computed from detected events correlated positively with manual annotations (r2 = 0.73, r2 = 0.77, r2 = 0.78, respectively). We furthermore quantified model accuracy based on temporal difference metrics, which improved overall by using the joint model compared to single-event models. Our automatic model jointly detects arousals, leg movements and sleep disordered breathing events with high correlation with human annotations. Finally, we benchmark against previous state-of-the-art multi-event detection models and found an overall increase in F1 score with our proposed model despite a 97.5% reduction in model size.
Collapse
|
46
|
Lee M, Kwak HG, Kim HJ, Won DO, Lee SW. SeriesSleepNet: an EEG time series model with partial data augmentation for automatic sleep stage scoring. Front Physiol 2023; 14:1188678. [PMID: 37700762 PMCID: PMC10494443 DOI: 10.3389/fphys.2023.1188678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Accepted: 08/10/2023] [Indexed: 09/14/2023] Open
Abstract
Introduction: We propose an automatic sleep stage scoring model, referred to as SeriesSleepNet, based on convolutional neural network (CNN) and bidirectional long short-term memory (bi-LSTM) with partial data augmentation. We used single-channel raw electroencephalography signals for automatic sleep stage scoring. Methods: Our framework was focused on time series information, so we applied partial data augmentation to learn the connected time information in small series. In specific, the CNN module learns the time information of one epoch (intra-epoch) whereas the bi-LSTM trains the sequential information between the adjacent epochs (inter-epoch). Note that the input of the bi-LSTM is the augmented CNN output. Moreover, the proposed loss function was used to fine-tune the model by providing additional weights. To validate the proposed framework, we conducted two experiments using the Sleep-EDF and SHHS datasets. Results and Discussion: The results achieved an overall accuracy of 0.87 and 0.84 and overall F1-score of 0.80 and 0.78 and kappa value of 0.81 and 0.78 for five-class classification, respectively. We showed that the SeriesSleepNet was superior to the baselines based on each component in the proposed framework. Our architecture also outperformed the state-of-the-art methods with overall F1-score, accuracy, and kappa value. Our framework could provide information on sleep disorders or quality of sleep to automatically classify sleep stages with high performance.
Collapse
Affiliation(s)
- Minji Lee
- Department of Biomedical Software Engineering, The Catholic University of Korea, Bucheon, Republic of Korea
| | - Heon-Gyu Kwak
- Department of Artificial Intelligence, Korea University, Seoul, Republic of Korea
| | - Hyeong-Jin Kim
- Department of Brain and Cognitive Engineering, Korea University, Seoul, Republic of Korea
| | - Dong-Ok Won
- Department of Artificial Intelligence Convergence, Hallym University, Chuncheon, Republic of Korea
| | - Seong-Whan Lee
- Department of Artificial Intelligence, Korea University, Seoul, Republic of Korea
| |
Collapse
|
47
|
Yao H, Liu T, Zou R, Ding S, Xu Y. A Spatial-Temporal Transformer Architecture Using Multi-Channel Signals for Sleep Stage Classification. IEEE Trans Neural Syst Rehabil Eng 2023; 31:3353-3362. [PMID: 37578925 DOI: 10.1109/tnsre.2023.3305201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/16/2023]
Abstract
Sleep stage classification is a fundamental task in diagnosing and monitoring sleep diseases. There are 2 challenges that remain open: (1) Since most methods only rely on input from a single channel, the spatial-temporal relationship of sleep signals has not been fully explored. (2) Lack of sleep data makes models hard to train from scratch. Here, we propose a vision Transformer-based architecture to process multi-channel polysomnogram signals. The method is an end-to-end framework that consists of a spatial encoder, a temporal encoder, and an MLP head classifier. The spatial encoder using a pre-trained Vision Transformer captures spatial information from multiple PSG channels. The temporal encoder utilizing the self-attention mechanism understands transitions between nearby epochs. In addition, we introduce a tailored image generation method to extract features within multi-channel and reshape them for transfer learning. We validate our method on 3 datasets and outperform the state-of-the-art algorithms. Our method fully explores the spatial-temporal relationship among different brain regions and addresses the problem of data insufficiency in clinical environments. Benefiting from reformulating the problem as image classification, the method could be applied to other 1D-signal problems in the future.
Collapse
|
48
|
Gaiduk M, Serrano Alarcón Á, Seepold R, Martínez Madrid N. Current status and prospects of automatic sleep stages scoring: Review. Biomed Eng Lett 2023; 13:247-272. [PMID: 37519865 PMCID: PMC10382458 DOI: 10.1007/s13534-023-00299-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Revised: 06/07/2023] [Accepted: 06/18/2023] [Indexed: 08/01/2023] Open
Abstract
The scoring of sleep stages is one of the essential tasks in sleep analysis. Since a manual procedure requires considerable human and financial resources, and incorporates some subjectivity, an automated approach could result in several advantages. There have been many developments in this area, and in order to provide a comprehensive overview, it is essential to review relevant recent works and summarise the characteristics of the approaches, which is the main aim of this article. To achieve it, we examined articles published between 2018 and 2022 that dealt with the automated scoring of sleep stages. In the final selection for in-depth analysis, 125 articles were included after reviewing a total of 515 publications. The results revealed that automatic scoring demonstrates good quality (with Cohen's kappa up to over 0.80 and accuracy up to over 90%) in analysing EEG/EEG + EOG + EMG signals. At the same time, it should be noted that there has been no breakthrough in the quality of results using these signals in recent years. Systems involving other signals that could potentially be acquired more conveniently for the user (e.g. respiratory, cardiac or movement signals) remain more challenging in the implementation with a high level of reliability but have considerable innovation capability. In general, automatic sleep stage scoring has excellent potential to assist medical professionals while providing an objective assessment.
Collapse
Affiliation(s)
- Maksym Gaiduk
- HTWG Konstanz – University of Applied Sciences, Alfred-Wachtel-Str.8, 78462 Konstanz, Germany
| | | | - Ralf Seepold
- HTWG Konstanz – University of Applied Sciences, Alfred-Wachtel-Str.8, 78462 Konstanz, Germany
| | | |
Collapse
|
49
|
Liu G, Wei G, Sun S, Mao D, Zhang J, Zhao D, Tian X, Wang X, Chen N. Micro SleepNet: efficient deep learning model for mobile terminal real-time sleep staging. Front Neurosci 2023; 17:1218072. [PMID: 37575302 PMCID: PMC10416229 DOI: 10.3389/fnins.2023.1218072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2023] [Accepted: 07/07/2023] [Indexed: 08/15/2023] Open
Abstract
The real-time sleep staging algorithm that can perform inference on mobile devices without burden is a prerequisite for closed-loop sleep modulation. However, current deep learning sleep staging models have poor real-time efficiency and redundant parameters. We propose a lightweight and high-performance sleep staging model named Micro SleepNet, which takes a 30-s electroencephalography (EEG) epoch as input, without relying on contextual signals. The model features a one-dimensional group convolution with a kernel size of 1 × 3 and an Efficient Channel and Spatial Attention (ECSA) module for feature extraction and adaptive recalibration. Moreover, the model efficiently performs feature fusion using dilated convolution module and replaces the conventional fully connected layer with Global Average Pooling (GAP). These design choices significantly reduce the total number of model parameters to 48,226, with only approximately 48.95 Million Floating-point Operations per Second (MFLOPs) computation. The proposed model is conducted subject-independent cross-validation on three publicly available datasets, achieving an overall accuracy of up to 83.3%, and the Cohen Kappa is 0.77. Additionally, we introduce Class Activation Mapping (CAM) to visualize the model's attention to EEG waveforms, which demonstrate the model's ability to accurately capture feature waveforms of EEG at different sleep stages. This provides a strong interpretability foundation for practical applications. Furthermore, the Micro SleepNet model occupies approximately 100 KB of memory on the Android smartphone and takes only 2.8 ms to infer one EEG epoch, meeting the real-time requirements of sleep staging tasks on mobile devices. Consequently, our proposed model has the potential to serve as a foundation for accurate closed-loop sleep modulation.
Collapse
Affiliation(s)
- Guisong Liu
- Department of Biomedical Engineering, Bioengineering College, Chongqing University, Chongqing, China
| | - Guoliang Wei
- Department of Biomedical Engineering, Bioengineering College, Chongqing University, Chongqing, China
| | - Shuqing Sun
- Department of Biomedical Engineering, Bioengineering College, Chongqing University, Chongqing, China
| | - Dandan Mao
- Department of Sleep and Psychology, Institute of Surgery Research, Daping Hospital, Third Military Medical University (Army Medical University), Chongqing, China
| | - Jiansong Zhang
- School of Medicine, Huaqiao University, Quanzhou, Fujian, China
| | - Dechun Zhao
- College of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing, China
| | - Xuelong Tian
- Department of Biomedical Engineering, Bioengineering College, Chongqing University, Chongqing, China
| | - Xing Wang
- Department of Biomedical Engineering, Bioengineering College, Chongqing University, Chongqing, China
| | - Nanxi Chen
- Department of Biomedical Engineering, Bioengineering College, Chongqing University, Chongqing, China
| |
Collapse
|
50
|
章 浩, 许 哲, 苑 成, 季 曹, 刘 颖. [Automatic sleep staging model based on single channel electroencephalogram signal]. SHENG WU YI XUE GONG CHENG XUE ZA ZHI = JOURNAL OF BIOMEDICAL ENGINEERING = SHENGWU YIXUE GONGCHENGXUE ZAZHI 2023; 40:458-464. [PMID: 37380384 PMCID: PMC10307605 DOI: 10.7507/1001-5515.202210072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 04/09/2023] [Indexed: 06/30/2023]
Abstract
Sleep staging is the basis for solving sleep problems. There's an upper limit for the classification accuracy of sleep staging models based on single-channel electroencephalogram (EEG) data and features. To address this problem, this paper proposed an automatic sleep staging model that mixes deep convolutional neural network (DCNN) and bi-directional long short-term memory network (BiLSTM). The model used DCNN to automatically learn the time-frequency domain features of EEG signals, and used BiLSTM to extract the temporal features between the data, fully exploiting the feature information contained in the data to improve the accuracy of automatic sleep staging. At the same time, noise reduction techniques and adaptive synthetic sampling were used to reduce the impact of signal noise and unbalanced data sets on model performance. In this paper, experiments were conducted using the Sleep-European Data Format Database Expanded and the Shanghai Mental Health Center Sleep Database, and achieved an overall accuracy rate of 86.9% and 88.9% respectively. When compared with the basic network model, all the experimental results outperformed the basic network, further demonstrating the validity of this paper's model, which can provide a reference for the construction of a home sleep monitoring system based on single-channel EEG signals.
Collapse
Affiliation(s)
- 浩伟 章
- 上海理工大学 健康科学与工程学院(上海 200093)School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, P. R. China
| | - 哲 许
- 上海理工大学 健康科学与工程学院(上海 200093)School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, P. R. China
| | - 成梅 苑
- 上海理工大学 健康科学与工程学院(上海 200093)School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, P. R. China
| | - 曹珺 季
- 上海理工大学 健康科学与工程学院(上海 200093)School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, P. R. China
| | - 颖 刘
- 上海理工大学 健康科学与工程学院(上海 200093)School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, P. R. China
| |
Collapse
|