51
|
Fiorillo L, Favaro P, Faraci FD. DeepSleepNet-Lite: A Simplified Automatic Sleep Stage Scoring Model With Uncertainty Estimates. IEEE Trans Neural Syst Rehabil Eng 2021; 29:2076-2085. [PMID: 34648450 DOI: 10.1109/tnsre.2021.3117970] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Deep learning is widely used in the most recent automatic sleep scoring algorithms. Its popularity stems from its excellent performance and from its ability to process raw signals and to learn feature directly from the data. Most of the existing scoring algorithms exploit very computationally demanding architectures, due to their high number of training parameters, and process lengthy time sequences in input (up to 12 minutes). Only few of these architectures provide an estimate of the model uncertainty. In this study we propose DeepSleepNet-Lite, a simplified and lightweight scoring architecture, processing only 90-seconds EEG input sequences. We exploit, for the first time in sleep scoring, the Monte Carlo dropout technique to enhance the performance of the architecture and to also detect the uncertain instances. The evaluation is performed on a single-channel EEG Fpz-Cz from the open source Sleep-EDF expanded database. DeepSleepNet-Lite achieves slightly lower performance, if not on par, compared to the existing state-of-the-art architectures, in overall accuracy, macro F1-score and Cohen's kappa (on Sleep-EDF v1-2013 ±30mins: 84.0%, 78.0%, 0.78; on Sleep-EDF v2-2018 ±30mins: 80.3%, 75.2%, 0.73). Monte Carlo dropout enables the estimate of the uncertain predictions. By rejecting the uncertain instances, the model achieves higher performance on both versions of the database (on Sleep-EDF v1-2013 ±30mins: 86.1.0%, 79.6%, 0.81; on Sleep-EDF v2-2018 ±30mins: 82.3%, 76.7%, 0.76). Our lighter sleep scoring approach paves the way to the application of scoring algorithms for sleep analysis in real-time.
Collapse
|
52
|
Mikkelsen KB, Phan H, Rank ML, Hemmsen MC, de Vos M, Kidmose P. Sleep monitoring using ear-centered setups: Investigating the influence from electrode configurations. IEEE Trans Biomed Eng 2021; 69:1564-1572. [PMID: 34587000 DOI: 10.1109/tbme.2021.3116274] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Modern sleep monitoring development is shifting towards the use of unobtrusive sensors combined with algorithms for automatic sleep scoring. Many different combinations of wet and dry electrodes, ear-centered, forehead-mounted or headband-inspired designs have been proposed, alongside an ever growing variety of machine learning algorithms for automatic sleep scoring. OBJECTIVE Among candidate positions, those in the facial area and around the ears have the benefit of being relatively hairless, and in our view deserve extra attention. In this paper, we seek to determine the limits to sleep monitoring quality within this spatial constraint. METHODS We compare 13 different, realistic sensor setups derived from the same data set and analysed with the same pipeline. RESULTS All setups which include both a lateral and an EOG derivation show similar, state-of-the-art performance, with average Cohen's kappa values of at least 0.80. CONCLUSION If large electrode distances are used, positioning is not critical for achieving large sleep-related signal-to-noise-ratio, and hence accurate sleep scoring. SIGNIFICANCE We argue that with the current competitive performance of automated staging approaches, there is a need for establishing an improved benchmark beyond current single human rater scoring.
Collapse
|
53
|
Yoo C, Lee HW, Kang JW. Transferring Structured Knowledge in Unsupervised Domain Adaptation of a Sleep Staging Network. IEEE J Biomed Health Inform 2021; 26:1273-1284. [PMID: 34388101 DOI: 10.1109/jbhi.2021.3103614] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Automatic sleep staging based on deep learning (DL) has been attracting attention for analyzing sleep quality and determining treatment effects. It is challenging to acquire long-term sleep data from numerous subjects and manually labeling them even though most DL-based models are trained using large-scale sleep data to provide state-of-the-art performance. One way to overcome this data shortage is to create a pre-trained network with an existing large-scale dataset (source domain) that is applicable to small cohorts of datasets (target domain); however, discrepancies in data distribution between the domains prevent successful refinement of this approach. In this paper, we propose an unsupervised domain adaptation method for sleep staging networks to reduce discrepancies by realigning the domains in the same space and producing domain-invariant features. Specifically, in addition to a classical domain discriminator, we introduce local dis-criminators-subject and stage-to maintain the intrinsic structure of sleep data to decrease local misalignments while using adversarial learning to play a minimax game between the feature extractor and discriminators. Moreover, we present several optimization schemes during training because the conventional adversarial learning is not effective to our training scheme. We evaluate the performance of the proposed method by examining the staging performances of a baseline network compared with direct transfer (DT) learning in various conditions. The experimental results demonstrate that the proposed domain adaptation significantly improves the performance though it needs no labeled sleep data in target domain.
Collapse
|
54
|
Creagh AP, Lipsmeier F, Lindemann M, Vos MD. Interpretable deep learning for the remote characterisation of ambulation in multiple sclerosis using smartphones. Sci Rep 2021; 11:14301. [PMID: 34253769 PMCID: PMC8275610 DOI: 10.1038/s41598-021-92776-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2021] [Accepted: 06/14/2021] [Indexed: 12/04/2022] Open
Abstract
The emergence of digital technologies such as smartphones in healthcare applications have demonstrated the possibility of developing rich, continuous, and objective measures of multiple sclerosis (MS) disability that can be administered remotely and out-of-clinic. Deep Convolutional Neural Networks (DCNN) may capture a richer representation of healthy and MS-related ambulatory characteristics from the raw smartphone-based inertial sensor data than standard feature-based methodologies. To overcome the typical limitations associated with remotely generated health data, such as low subject numbers, sparsity, and heterogeneous data, a transfer learning (TL) model from similar large open-source datasets was proposed. Our TL framework leveraged the ambulatory information learned on human activity recognition (HAR) tasks collected from wearable smartphone sensor data. It was demonstrated that fine-tuning TL DCNN HAR models towards MS disease recognition tasks outperformed previous Support Vector Machine (SVM) feature-based methods, as well as DCNN models trained end-to-end, by upwards of 8-15%. A lack of transparency of "black-box" deep networks remains one of the largest stumbling blocks to the wider acceptance of deep learning for clinical applications. Ensuing work therefore aimed to visualise DCNN decisions attributed by relevance heatmaps using Layer-Wise Relevance Propagation (LRP). Through the LRP framework, the patterns captured from smartphone-based inertial sensor data that were reflective of those who are healthy versus people with MS (PwMS) could begin to be established and understood. Interpretations suggested that cadence-based measures, gait speed, and ambulation-related signal perturbations were distinct characteristics that distinguished MS disability from healthy participants. Robust and interpretable outcomes, generated from high-frequency out-of-clinic assessments, could greatly augment the current in-clinic assessment picture for PwMS, to inform better disease management techniques, and enable the development of better therapeutic interventions.
Collapse
Affiliation(s)
- Andrew P Creagh
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK.
| | | | | | - Maarten De Vos
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
- Department of Electrical Engineering, KU Leuven, Leuven, Belgium
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
| |
Collapse
|
55
|
Vandecasteele K, De Cooman T, Chatzichristos C, Cleeren E, Swinnen L, Macea Ortiz J, Van Huffel S, Dümpelmann M, Schulze-Bonhage A, De Vos M, Van Paesschen W, Hunyadi B. The power of ECG in multimodal patient-specific seizure monitoring: Added value to an EEG-based detector using limited channels. Epilepsia 2021; 62:2333-2343. [PMID: 34240748 PMCID: PMC8518059 DOI: 10.1111/epi.16990] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2021] [Revised: 06/17/2021] [Accepted: 06/17/2021] [Indexed: 11/27/2022]
Abstract
Objective Wearable seizure detection devices could provide more reliable seizure documentation outside the hospital compared to seizure self‐reporting by patients, which is the current standard. Previously, during the SeizeIT1 project, we studied seizure detection based on behind‐the‐ear electroencephalography (EEG). However, the obtained sensitivities were too low for practical use, because not all seizures are associated with typical ictal EEG patterns. Therefore, in this paper, we aim to develop a multimodal automated seizure detection algorithm integrating behind‐the‐ear EEG and electrocardiography (ECG) for detecting focal seizures. In this framework, we quantified the added value of ECG to behind‐the‐ear EEG. Methods This study analyzed three multicenter databases consisting of 135 patients having focal epilepsy and a total of 896 seizures. A patient‐specific multimodal automated seizure detection algorithm was developed using behind‐the‐ear/temporal EEG and single‐lead ECG. The EEG and ECG data were processed separately using machine learning methods. A late integration approach was applied for fusing those predictions. Results The multimodal algorithm outperformed the EEG‐based algorithm in two of three databases, with an increase of 11% and 8% in sensitivity for the same false alarm rate. Significance ECG can be of added value to an EEG‐based seizure detection algorithm using only behind‐the‐ear/temporal lobe electrodes for patients with focal epilepsy.
Collapse
Affiliation(s)
- Kaat Vandecasteele
- Department of Electrical Engineering, STADIUS Center for Dynamic Systems, Signal Processing and Data Analytics, KU Leuven, Leuven, Belgium
| | - Thomas De Cooman
- Department of Electrical Engineering, STADIUS Center for Dynamic Systems, Signal Processing and Data Analytics, KU Leuven, Leuven, Belgium
| | - Christos Chatzichristos
- Department of Electrical Engineering, STADIUS Center for Dynamic Systems, Signal Processing and Data Analytics, KU Leuven, Leuven, Belgium
| | - Evy Cleeren
- Department of Neurology, University Hospitals Leuven, Leuven, Belgium
| | - Lauren Swinnen
- Laboratory for Epilepsy Research, Department of Neurology, University Hospital, KU Leuven, Leuven, Belgium
| | - Jaiver Macea Ortiz
- Laboratory for Epilepsy Research, Department of Neurology, University Hospital, KU Leuven, Leuven, Belgium
| | - Sabine Van Huffel
- Department of Electrical Engineering, STADIUS Center for Dynamic Systems, Signal Processing and Data Analytics, KU Leuven, Leuven, Belgium
| | - Matthias Dümpelmann
- Faculty of Medicine, Department of Neurosurgery, Epilepsy Center, University of Freiburg, Freiburg, Germany
| | - Andreas Schulze-Bonhage
- Faculty of Medicine, Department of Neurosurgery, Epilepsy Center, University of Freiburg, Freiburg, Germany
| | - Maarten De Vos
- Department of Electrical Engineering, STADIUS Center for Dynamic Systems, Signal Processing and Data Analytics, KU Leuven, Leuven, Belgium.,Department of Development and Regeneration, KU Leuven, Leuven, Belgium
| | - Wim Van Paesschen
- Laboratory for Epilepsy Research, Department of Neurology, University Hospital, KU Leuven, Leuven, Belgium
| | - Borbála Hunyadi
- Department of Microelectronics, TU Delft, Delft, Netherlands
| |
Collapse
|
56
|
Cesari M, Stefani A, Penzel T, Ibrahim A, Hackner H, Heidbreder A, Szentkirályi A, Stubbe B, Völzke H, Berger K, Högl B. Interrater sleep stage scoring reliability between manual scoring from two European sleep centers and automatic scoring performed by the artificial intelligence-based Stanford-STAGES algorithm. J Clin Sleep Med 2021; 17:1237-1247. [PMID: 33599203 DOI: 10.5664/jcsm.9174] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
STUDY OBJECTIVES The objective of this study was to evaluate interrater reliability between manual sleep stage scoring performed in 2 European sleep centers and automatic sleep stage scoring performed by the previously validated artificial intelligence-based Stanford-STAGES algorithm. METHODS Full night polysomnographies of 1,066 participants were included. Sleep stages were manually scored in Berlin and Innsbruck sleep centers and automatically scored with the Stanford-STAGES algorithm. For each participant, we compared (1) Innsbruck to Berlin scorings (INN vs BER); (2) Innsbruck to automatic scorings (INN vs AUTO); (3) Berlin to automatic scorings (BER vs AUTO); (4) epochs where scorers from Innsbruck and Berlin had consensus to automatic scoring (CONS vs AUTO); and (5) both Innsbruck and Berlin manual scorings (MAN) to the automatic ones (MAN vs AUTO). Interrater reliability was evaluated with several measures, including overall and sleep stage-specific Cohen's κ. RESULTS Overall agreement across participants was substantial for INN vs BER (κ = 0.66 ± 0.13), INN vs AUTO (κ = 0.68 ± 0.14), CONS vs AUTO (κ = 0.73 ± 0.14), and MAN vs AUTO (κ = 0.61 ± 0.14), and moderate for BER vs AUTO (κ = 0.55 ± 0.15). Human scorers had the highest disagreement for N1 sleep (κN1 = 0.40 ± 0.16 for INN vs BER). Automatic scoring had lowest agreement with manual scorings for N1 and N3 sleep (κN1 = 0.25 ± 0.14 and κN3 = 0.42 ± 0.32 for MAN vs AUTO). CONCLUSIONS Interrater reliability for sleep stage scoring between human scorers was in line with previous findings, and the algorithm achieved an overall substantial agreement with manual scoring. In this cohort, the Stanford-STAGES algorithm showed similar performances to the ones achieved in the original study, suggesting that it is generalizable to new cohorts. Before its integration in clinical practice, future independent studies should further evaluate it in other cohorts.
Collapse
Affiliation(s)
- Matteo Cesari
- Department of Neurology, Medical University of Innsbruck, Innsbruck, Austria
| | - Ambra Stefani
- Department of Neurology, Medical University of Innsbruck, Innsbruck, Austria
| | - Thomas Penzel
- Interdisciplinary Sleep Medicine Center, Charité-Universitätsmedizin Berlin, Berlin, Germany.,Saratov State University, Saratov, Russian Federation
| | - Abubaker Ibrahim
- Department of Neurology, Medical University of Innsbruck, Innsbruck, Austria
| | - Heinz Hackner
- Department of Neurology, Medical University of Innsbruck, Innsbruck, Austria
| | - Anna Heidbreder
- Department of Neurology, Medical University of Innsbruck, Innsbruck, Austria
| | - András Szentkirályi
- Institute of Epidemiology and Social Medicine, University of Münster, Münster, Germany
| | - Beate Stubbe
- Department of Internal Medicine B, University Medicine Greifswald, Greifswald, Germany
| | - Henry Völzke
- Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
| | - Klaus Berger
- Institute of Epidemiology and Social Medicine, University of Münster, Münster, Germany
| | - Birgit Högl
- Department of Neurology, Medical University of Innsbruck, Innsbruck, Austria
| |
Collapse
|
57
|
Banluesombatkul N, Ouppaphan P, Leelaarporn P, Lakhan P, Chaitusaney B, Jaimchariyatam N, Chuangsuwanich E, Chen W, Phan H, Dilokthanakul N, Wilaiprasitporn T. MetaSleepLearner: A Pilot Study on Fast Adaptation of Bio-Signals-Based Sleep Stage Classifier to New Individual Subject Using Meta-Learning. IEEE J Biomed Health Inform 2021; 25:1949-1963. [PMID: 33180737 DOI: 10.1109/jbhi.2020.3037693] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Identifying bio-signals based-sleep stages requires time-consuming and tedious labor of skilled clinicians. Deep learning approaches have been introduced in order to challenge the automatic sleep stage classification conundrum. However, the difficulties can be posed in replacing the clinicians with the automatic system due to the differences in many aspects found in individual bio-signals, causing the inconsistency in the performance of the model on every incoming individual. Thus, we aim to explore the feasibility of using a novel approach, capable of assisting the clinicians and lessening the workload. We propose the transfer learning framework, entitled MetaSleepLearner, based on Model Agnostic Meta-Learning (MAML), in order to transfer the acquired sleep staging knowledge from a large dataset to new individual subjects (source code is available at https://github.com/IoBT-VISTEC/MetaSleepLearner). The framework was demonstrated to require the labelling of only a few sleep epochs by the clinicians and allow the remainder to be handled by the system. Layer-wise Relevance Propagation (LRP) was also applied to understand the learning course of our approach. In all acquired datasets, in comparison to the conventional approach, MetaSleepLearner achieved a range of 5.4% to 17.7% improvement with statistical difference in the mean of both approaches. The illustration of the model interpretation after the adaptation to each subject also confirmed that the performance was directed towards reasonable learning. MetaSleepLearner outperformed the conventional approaches as a result from the fine-tuning using the recordings of both healthy subjects and patients. This is the first work that investigated a non-conventional pre-training method, MAML, resulting in a possibility for human-machine collaboration in sleep stage classification and easing the burden of the clinicians in labelling the sleep stages through only several epochs rather than an entire recording.
Collapse
|
58
|
Ko W, Jeon E, Jeong S, Phyo J, Suk HI. A Survey on Deep Learning-Based Short/Zero-Calibration Approaches for EEG-Based Brain-Computer Interfaces. Front Hum Neurosci 2021; 15:643386. [PMID: 34140883 PMCID: PMC8204721 DOI: 10.3389/fnhum.2021.643386] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Accepted: 04/27/2021] [Indexed: 11/28/2022] Open
Abstract
Brain-computer interfaces (BCIs) utilizing machine learning techniques are an emerging technology that enables a communication pathway between a user and an external system, such as a computer. Owing to its practicality, electroencephalography (EEG) is one of the most widely used measurements for BCI. However, EEG has complex patterns and EEG-based BCIs mostly involve a cost/time-consuming calibration phase; thus, acquiring sufficient EEG data is rarely possible. Recently, deep learning (DL) has had a theoretical/practical impact on BCI research because of its use in learning representations of complex patterns inherent in EEG. Moreover, algorithmic advances in DL facilitate short/zero-calibration in BCI, thereby suppressing the data acquisition phase. Those advancements include data augmentation (DA), increasing the number of training samples without acquiring additional data, and transfer learning (TL), taking advantage of representative knowledge obtained from one dataset to address the so-called data insufficiency problem in other datasets. In this study, we review DL-based short/zero-calibration methods for BCI. Further, we elaborate methodological/algorithmic trends, highlight intriguing approaches in the literature, and discuss directions for further research. In particular, we search for generative model-based and geometric manipulation-based DA methods. Additionally, we categorize TL techniques in DL-based BCIs into explicit and implicit methods. Our systematization reveals advances in the DA and TL methods. Among the studies reviewed herein, ~45% of DA studies used generative model-based techniques, whereas ~45% of TL studies used explicit knowledge transferring strategy. Moreover, based on our literature review, we recommend an appropriate DA strategy for DL-based BCIs and discuss trends of TLs used in DL-based BCIs.
Collapse
Affiliation(s)
- Wonjun Ko
- Department of Brain and Cognitive Engineering, Korea University, Seoul, South Korea
| | - Eunjin Jeon
- Department of Brain and Cognitive Engineering, Korea University, Seoul, South Korea
| | - Seungwoo Jeong
- Department of Artificial Intelligence, Korea University, Seoul, South Korea
| | - Jaeun Phyo
- Department of Brain and Cognitive Engineering, Korea University, Seoul, South Korea
| | - Heung-Il Suk
- Department of Brain and Cognitive Engineering, Korea University, Seoul, South Korea
- Department of Artificial Intelligence, Korea University, Seoul, South Korea
| |
Collapse
|
59
|
Perslev M, Darkner S, Kempfner L, Nikolic M, Jennum PJ, Igel C. U-Sleep: resilient high-frequency sleep staging. NPJ Digit Med 2021; 4:72. [PMID: 33859353 PMCID: PMC8050216 DOI: 10.1038/s41746-021-00440-5] [Citation(s) in RCA: 122] [Impact Index Per Article: 30.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Accepted: 03/10/2021] [Indexed: 02/02/2023] Open
Abstract
Sleep disorders affect a large portion of the global population and are strong predictors of morbidity and all-cause mortality. Sleep staging segments a period of sleep into a sequence of phases providing the basis for most clinical decisions in sleep medicine. Manual sleep staging is difficult and time-consuming as experts must evaluate hours of polysomnography (PSG) recordings with electroencephalography (EEG) and electrooculography (EOG) data for each patient. Here, we present U-Sleep, a publicly available, ready-to-use deep-learning-based system for automated sleep staging ( sleep.ai.ku.dk ). U-Sleep is a fully convolutional neural network, which was trained and evaluated on PSG recordings from 15,660 participants of 16 clinical studies. It provides accurate segmentations across a wide range of patient cohorts and PSG protocols not considered when building the system. U-Sleep works for arbitrary combinations of typical EEG and EOG channels, and its special deep learning architecture can label sleep stages at shorter intervals than the typical 30 s periods used during training. We show that these labels can provide additional diagnostic information and lead to new ways of analyzing sleep. U-Sleep performs on par with state-of-the-art automatic sleep staging systems on multiple clinical datasets, even if the other systems were built specifically for the particular data. A comparison with consensus-scores from a previously unseen clinic shows that U-Sleep performs as accurately as the best of the human experts. U-Sleep can support the sleep staging workflow of medical experts, which decreases healthcare costs, and can provide highly accurate segmentations when human expertize is lacking.
Collapse
Affiliation(s)
- Mathias Perslev
- Department of Computer Science, University of Copenhagen, Copenhagen, Denmark
| | - Sune Darkner
- Department of Computer Science, University of Copenhagen, Copenhagen, Denmark
| | - Lykke Kempfner
- Danish Center for Sleep Medicine, Rigshospitalet, Copenhagen, Denmark
| | - Miki Nikolic
- Danish Center for Sleep Medicine, Rigshospitalet, Copenhagen, Denmark
| | | | - Christian Igel
- Department of Computer Science, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
60
|
Raurale SA, Boylan GB, Mathieson S, Marnane WP, Lightbody G, O'Toole JM. Grading hypoxic-ischemic encephalopathy in neonatal EEG with convolutional neural networks and quadratic time-frequency distributions. J Neural Eng 2021; 18. [PMID: 33618337 PMCID: PMC8208632 DOI: 10.1088/1741-2552/abe8ae] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2020] [Accepted: 02/22/2021] [Indexed: 12/12/2022]
Abstract
OBJECTIVE To develop an automated system to classify the severity of hypoxic-ischaemic encephalopathy injury (HIE) in neonates from the background electroencephalogram (EEG). METHOD By combining a quadratic time{frequency distribution (TFD) with a convolutional neural network, we develop a system that classifies 4 EEG grades of HIE. The network learns directly from the two- dimensional TFD through 3 independent layers with convolution in the time, frequency, and time{frequency directions. Computationally efficient algorithms make it feasible to transform each 5 minute epoch to the time-frequency domain by controlling for oversampling to reduce both computation and computer memory. The system is developed on EEG recordings from 54 neonates. Then the system is validated on a large unseen dataset of 338 hours of EEG recordings from 91 neonates obtained across multiple international centres. RESULTS The proposed EEG HIE-grading system achieves a leave-one-subject-out testing accuracy of 88.9% and kappa of 0.84 on the development dataset. Accuracy for the large unseen test dataset is 69.5% (95% confidence interval, CI: 65.3 to 73.6%) and kappa of 0.54, which is a significant (P < 0.001) improvement over a state-of-the-art feature-based method with an accuracy of 56.8% (95% CI: 51.4 to 61.7%) and kappa of 0.39. Performance of the proposed system was unaffected when the number of channels in testing was reduced from 8 to 2|accuracy for large validation dataset remained at 69.5% (95% CI: 65.5 to 74.0%). SIGNIFICANCE The proposed system outperforms the state-of-the-art machine learning algorithms for EEG grade classification on a large multi-centre unseen dataset, indicating the potential to assist clinical decision making for neonates with HIE.
Collapse
Affiliation(s)
- Sumit Arun Raurale
- Pediatrics and child health, INFANT Centre, University College Cork, Cork, Cork, T12 DC4A, IRELAND
| | - Geraldine B Boylan
- Department of Paediatrics and Child Health, University College Cork, University College Cork,, Cork, IRELAND
| | - Sean Mathieson
- Podiatric and Child Health, INFANT Centre, University College Cork, Wilton, Cork, T12 DC4A, IRELAND
| | - W P Marnane
- Department of Electrical Engineering and Microelectronics, University College Cork, College Road, Cork, T12 DC4A, IRELAND
| | - Gordon Lightbody
- Department of Electrical Engineering and Microelectronics, University College Cork, College Road, Cork, T12 DC4A, IRELAND
| | - John M O'Toole
- Irish Centre for Fetal and Neonatal Translational Research, Dept. ofPaediatrics and Child Health, University College Cork National University of Ireland, Western Gateway Building, Western Road, Cork, IRELAND
| |
Collapse
|