1
|
Wang K, Wei W, Yi W, Qiu S, He H, Xu M, Ming D. Contrastive fine-grained domain adaptation network for EEG-based vigilance estimation. Neural Netw 2024; 179:106617. [PMID: 39180976 DOI: 10.1016/j.neunet.2024.106617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2023] [Revised: 06/17/2024] [Accepted: 08/06/2024] [Indexed: 08/27/2024]
Abstract
Vigilance state is crucial for the effective performance of users in brain-computer interface (BCI) systems. Most vigilance estimation methods rely on a large amount of labeled data to train a satisfactory model for the specific subject, which limits the practical application of the methods. This study aimed to build a reliable vigilance estimation method using a small amount of unlabeled calibration data. We conducted a vigilance experiment in the designed BCI-based cursor-control task. Electroencephalogram (EEG) signals of eighteen participants were recorded in two sessions on two different days. And, we proposed a contrastive fine-grained domain adaptation network (CFGDAN) for vigilance estimation. Here, an adaptive graph convolution network (GCN) was built to project the EEG data of different domains into a common space. The fine-grained feature alignment mechanism was designed to weight and align the feature distributions across domains at the EEG channel level, and the contrastive information preservation module was developed to preserve the useful target-specific information during the feature alignment. The experimental results show that the proposed CFGDAN outperforms the compared methods in our BCI vigilance dataset and SEED-VIG dataset. Moreover, the visualization results demonstrate the efficacy of the designed feature alignment mechanisms. These results indicate the effectiveness of our method for vigilance estimation. Our study is helpful for reducing calibration efforts and promoting the practical application potential of vigilance estimation methods.
Collapse
Affiliation(s)
- Kangning Wang
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin 300072, China; Laboratory of Brain Atlas and Brain-Inspired Intelligence, Key Laboratory of Brain Cognition and Brain-inspired Intelligence Technology, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
| | - Wei Wei
- Laboratory of Brain Atlas and Brain-Inspired Intelligence, Key Laboratory of Brain Cognition and Brain-inspired Intelligence Technology, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
| | - Weibo Yi
- Beijing Machine and Equipment Institute, Beijing 100854, China
| | - Shuang Qiu
- Laboratory of Brain Atlas and Brain-Inspired Intelligence, Key Laboratory of Brain Cognition and Brain-inspired Intelligence Technology, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China.
| | - Huiguang He
- Laboratory of Brain Atlas and Brain-Inspired Intelligence, Key Laboratory of Brain Cognition and Brain-inspired Intelligence Technology, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Minpeng Xu
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin 300072, China; College of Precision Instruments and Optoelectronics Engineering, Tianjin University, Tianjin 300072, China
| | - Dong Ming
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin 300072, China; College of Precision Instruments and Optoelectronics Engineering, Tianjin University, Tianjin 300072, China.
| |
Collapse
|
2
|
Yang H, Huang J, Yu Y, Sun Z, Zhang S, Liu Y, Liu H, Xia L. An improved CapsNet based on data augmentation for driver vigilance estimation with forehead single-channel EEG. Cogn Neurodyn 2024; 18:2535-2550. [PMID: 39678725 PMCID: PMC11639747 DOI: 10.1007/s11571-024-10105-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 02/21/2024] [Accepted: 03/12/2024] [Indexed: 12/17/2024] Open
Abstract
Various studies have shown that it is necessary to estimate the drivers' vigilance to reduce the occurrence of traffic accidents. Most existing EEG-based vigilance estimation studies have been performed on intra-subject and multi-channel signals, and these methods are too costly and complicated to implement in practice. Hence, aiming at the problem of cross-subject vigilance estimation of single-channel EEG signals, an estimation algorithm based on capsule network (CapsNet) is proposed. Firstly, we propose a new construction method of the input feature maps to fit the characteristics of CapsNet to improve the algorithm accuracy. Meanwhile, the self-attention mechanism is incorporated in the algorithm to focus on the key information in feature maps. Secondly, we propose substituting the traditional multi-channel signals with the single-channel signals to improve the utility of algorithm. Thirdly, since the single-channel signals carry fewer dimensions of the information compared to the multi-channel signals, we use the conditional generative adversarial network to improve the accuracy of single-channel signals by increasing the amount of data. The proposed algorithm is verified on the SEED-VIG, and Root-mean-square-error (RMSE) and Pearson Correlation Coefficient (PCC) are used as the evaluation metrics. The results show that the proposed algorithm improves the computing speed while the RMSE is reduced by 3%, and the PCC is improved by 12% compared to the mainstream algorithm. Experiment results prove the feasibility of using forehead single-channel EEG signals for cross-subject vigilance estimation and offering the possibility of lightweight EEG vigilance estimation devices for practical applications.
Collapse
Affiliation(s)
- Huizhou Yang
- College of Information Science and Technology, Nanjing Forestry University, Nanjing, 210037 China
| | - Jingwen Huang
- College of Information Science and Technology, Nanjing Forestry University, Nanjing, 210037 China
| | - Yifei Yu
- CR/RIX1-AP, Bosch (China) Investment Ltd., Shanghai, 200335 China
| | - Zhigang Sun
- CR/RIX1-AP, Bosch (China) Investment Ltd., Shanghai, 200335 China
| | - Shouyi Zhang
- CR/RIX1-AP, Bosch (China) Investment Ltd., Shanghai, 200335 China
| | - Yunfei Liu
- College of Information Science and Technology, Nanjing Forestry University, Nanjing, 210037 China
| | - Han Liu
- College of Letters and Science, University of Wisconsin-Madison, Madison, WI 53706 USA
| | - Lijuan Xia
- CR/RIX1-AP, Bosch (China) Investment Ltd., Shanghai, 200335 China
| |
Collapse
|
3
|
Zhang M, Luo Z, Xie L, Liu T, Yan Y, Yao D, Zhao S, Yin E. Multimodal Vigilance Estimation With Modality-Pairwise Contrastive Loss. IEEE Trans Biomed Eng 2024; 71:1139-1150. [PMID: 37906494 DOI: 10.1109/tbme.2023.3328942] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
Nowadays, how to estimate vigilance with higher accuracy has become a hot field of research direction. Although the increasing available modalities opens the door for amazing new possibilities to achieve good performance, the uncertain cross-modal interaction still poses a real challenge to the multimodal fusion. In this paper, a cross-modality alignment method has been proposed based on the contrastive learning for extracting shared but not the same information among modalities. The contrastive learning is adopted to minimize the intermodal differences by maximizing the similarity of semantic representation of modalities. Applying our proposed modeling framework, we evaluated our approach on SEED-VIG dataset consisting of EEG and EOG signals. Experiments showed that our study achieved state-of-the-art multimodal vigilance estimation performance both in intra-subject and inter-subject situations, the average of RMSE/CORR were improved to 0.092/0.893 and 0.144/0.887, respectively. In addition, analysis on the frequency bands showed that theta and alpha activities contain valuable information for vigilance estimation, and the correlation between them and PERCLOS can be significantly improved by contrastive learning. We argue that the proposed method in the inter-subject case could offer the possibility of reducing the high-cost of data annotation, and further analysis may provide an idea for the application of multimodal vigilance regression.
Collapse
|
4
|
Xiao Z, Yang C, Li Y, Xing Y, Ma C, Zhang Y, Long X, Li J, Liu C. Human Eye Activity Monitoring Using Continuous Wave Doppler Radar: A Feasibility Study. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS 2024; 18:322-333. [PMID: 37851555 DOI: 10.1109/tbcas.2023.3325547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/20/2023]
Abstract
Human eye activity has been widely studied in many fields such as psychology, neuroscience, medicine, and human-computer interaction engineering. In previous studies, monitoring of human eye activity mainly depends on electrooculogram (EOG) that requires a contact sensor. This article proposes a novel eye movement monitoring method called continuous wave doppler oculogram (cDOG). Unlike the conventional EOG-based eye movement monitoring methods, cDOG based on continuous wave doppler radar sensor (cDRS) can remotely measure human eye activity without placing electrodes on the head. To verify the feasibility of using cDOG for eye movement monitoring, we first theoretically analyzed the association between the radar signal and the corresponding eye movements measured with EOG. Afterward, we conducted an experiment to compare EOG and cDOG measurements under the conditions of eyes closure and opening. In addition, different eye movement states were considered, including right-left saccade, up-down saccade, eye-blink, and fixation. Several representative time domain and frequency domain features obtained from cDOG and from EOG were compared in these states, allowing us to demonstrate the feasibility of using cDOG for monitoring eye movements. The experimental results show that there is a correlation between cDOG and EOG in the time and frequency domain features, the average time error of single eye movement is less than 280.5 ms, and the accuracy of cDOG in eye movement detection is higher than 92.35%, when the distance between the cDRS and the face is 10 cm and eyes is facing the radar directly.
Collapse
|
5
|
Zhang X, Wang D, Wu H, Chao J, Zhong J, Peng H, Hu B. Vigilance estimation using truncated l1 distance kernel-based sparse representation regression with physiological signals. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023; 242:107773. [PMID: 37734218 DOI: 10.1016/j.cmpb.2023.107773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 01/22/2023] [Accepted: 08/20/2023] [Indexed: 09/23/2023]
Abstract
BACKGROUND With a large number of accidents caused by the decline in the vigilance of operators, finding effective automatic vigilance monitoring methods is a work of great significance in recent years. Based on physiological signals and machine learning algorithms, researchers have opened up a path for objective vigilance estimation. METHODS Sparse representation (SR)-based recognition algorithms with excellent performance and simple models are very promising approaches in this field. This paper aims to study the adaptability and performance improvement of truncated l1 distance (TL1) kernel on SR-based algorithm in the context of physiological signal vigilance estimation. Compared with the traditional radial basis function (RBF), the TL1 kernel has good adaptiveness to nonlinearity and is suitable for the discrimination of complex physiological signals. A recognition framework based on TL1 and SR theory is proposed. Firstly, the inseparable physiological features are mapped to the reproducing kernel Kreĭn space through the infinite-dimensional projection of the TL1 kernel. Then the obtained kernel matrix is converted into the symmetric positive definite matrix according to the eigenspectrum approaches. Finally, the final prediction result is obtained through the sparse representation regression process. RESULTS We verified the performance of the proposed framework on the popular SEED-VIG dataset containing physiological signals (electroencephalogram and electrooculogram) associated with vigilance. In the experimental results, the TL1 kernel is superior to the RBF kernel in both performance and kernel parameter stability. CONCLUSIONS This demonstrates the effectiveness of the TL1 kernel in distinguishing physiological signals and the excellent vigilance estimation capability of the proposed framework. Moreover, the contribution of our research motivates the development of physiological signal recognition based on kernel methods.
Collapse
Affiliation(s)
- Xuan Zhang
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
| | - Dixin Wang
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
| | - Hongtong Wu
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
| | - Jinlong Chao
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
| | - Jitao Zhong
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
| | - Hong Peng
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
| | - Bin Hu
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China.
| |
Collapse
|
6
|
Ding N, Zhang C, Eskandarian A. EEG-fest: few-shot based attention network for driver's drowsiness estimation with EEG signals. Biomed Phys Eng Express 2023; 10:015008. [PMID: 37995351 DOI: 10.1088/2057-1976/ad0f3f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Accepted: 11/23/2023] [Indexed: 11/25/2023]
Abstract
The leading factor behind most vehicular accidents is the driver's inattentiveness. To accurately determine driver's drowsiness, Electroencephalography (EEG) has been proven to be a reliable and effective method. Even though previous studies have developed accurate driver's drowsiness detection algorithms, certain challenges still persist, such as (a) limited training sample sizes, (b) detecting anomalous signals, and (c) achieving subject-independent classification. In this paper we propose a novel solution, names as EEG-Fest, which is a generalized few-shot model aimed at addressing the aforementioned limitations. The EEG-Fest has the ability to (a) classify a query sample's level of drowsiness with only a few support sample inputs (b) identify whether a query sample is anomalous signals or not, and (c) perform subject-independent classification. During the evaluation, our proposed EEG-Fest algorithm demonstrates better performance compared to other two conventional EEG algorithms in cross-subject validation.
Collapse
Affiliation(s)
- Ning Ding
- Mechanical Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States of America
| | - Ce Zhang
- Mechanical Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States of America
| | - Azim Eskandarian
- Mechanical Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States of America
| |
Collapse
|
7
|
Zhang W, Wu QMJ, Yang Y. Semisupervised Manifold Regularization via a Subnetwork-Based Representation Learning Model. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:6923-6936. [PMID: 35687637 DOI: 10.1109/tcyb.2022.3177573] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Semisupervised classification with a few labeled training samples is a challenging task in the area of data mining. Moore-Penrose inverse (MPI)-based manifold regularization (MR) is a widely used technique in tackling semisupervised classification. However, most of the existing MPI-based MR algorithms can only generate loosely connected feature encoding, which is generally less effective in data representation and feature learning. To alleviate this deficiency, we introduce a new semisupervised multilayer subnet neural network called SS-MSNN. The key contributions of this article are as follows: 1) a novel MPI-based MR model using the subnetwork structure is introduced. The subnet model is utilized to enrich the latent space representations iteratively; 2) a one-step training process to learn the discriminative encoding is proposed. The proposed SS-MSNN learns parameters by directly optimizing the entire network, accepting input from one end, and producing output at the other end; and 3) a new semisupervised dataset called HFSWR-RDE is built for this research. Experimental results on multiple domains show that the SS-MSNN achieves promising performance over the other semisupervised learning algorithms, demonstrating fast inference speed and better generalization ability.
Collapse
|
8
|
Shi J, Wang K. Fatigue driving detection method based on Time-Space-Frequency features of multimodal signals. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
|
9
|
Wang S, Ji B, Shao D, Chen W, Gao K. A Methodology for Enhancing SSVEP Features Using Adaptive Filtering Based on the Spatial Distribution of EEG Signals. MICROMACHINES 2023; 14:mi14050976. [PMID: 37241600 DOI: 10.3390/mi14050976] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 04/22/2023] [Accepted: 04/24/2023] [Indexed: 05/28/2023]
Abstract
In this paper, we propose a classification algorithm of EEG signal based on canonical correlation analysis (CCA) and integrated with adaptive filtering. It can enhance the detection of steady-state visual evoked potentials (SSVEPs) in a brain-computer interface (BCI) speller. An adaptive filter is employed in front of the CCA algorithm to improve the signal-to-noise ratio (SNR) of SSVEP signals by removing background electroencephalographic (EEG) activities. The ensemble method is developed to integrate recursive least squares (RLS) adaptive filter corresponding to multiple stimulation frequencies. The method is tested by the SSVEP signal recorded from six targets by actual experiment and the EEG in a public SSVEP dataset of 40 targets from Tsinghua University. The accuracy rates of the CCA method and the CCA-based integrated RLS filter algorithm (RLS-CCA method) are compared. Experiment results show that the proposed RLS-CCA-based method significantly improves the classification accuracy compared with the pure CCA method. Especially when the number of EEG leads is low (three occipital electrodes and five non occipital electrodes), its advantage is more significant, and accuracy reaches 91.23%, which is more suitable for wearable environments where high-density EEG is not easy to collect.
Collapse
Affiliation(s)
- Shengyu Wang
- School of Information Science and Technology, Donghua University, Shanghai 201620, China
| | - Bowen Ji
- Unmanned System Research Institute, Northwestern Polytechnical University, Xi'an 710072, China
- Innovation Center NPU Chongqing, Northwestern Polytechnical University, Chongqing 401135, China
| | - Dian Shao
- Unmanned System Research Institute, Northwestern Polytechnical University, Xi'an 710072, China
| | - Wanru Chen
- School of Information Science and Technology, Donghua University, Shanghai 201620, China
| | - Kunpeng Gao
- School of Information Science and Technology, Donghua University, Shanghai 201620, China
| |
Collapse
|
10
|
Murugan S, Sivakumar PK, Kavitha C, Harichandran A, Lai WC. An Electro-Oculogram (EOG) Sensor's Ability to Detect Driver Hypovigilance Using Machine Learning. SENSORS (BASEL, SWITZERLAND) 2023; 23:2944. [PMID: 36991654 PMCID: PMC10058593 DOI: 10.3390/s23062944] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 02/23/2023] [Accepted: 03/02/2023] [Indexed: 06/19/2023]
Abstract
Driving safely is crucial to avoid death, injuries, or financial losses that can be sustained in an accident. Thus, a driver's physical state should be monitored to prevent accidents, rather than vehicle-based or behavioral measurements, and provide reliable information in this regard. Electrocardiography (ECG), electroencephalography (EEG), electrooculography (EOG), and surface electromyography (sEMG) signals are used to monitor a driver's physical state during a drive. The purpose of this study was to detect driver hypovigilance (drowsiness, fatigue, as well as visual and cognitive inattention) using signals collected from 10 drivers while they were driving. EOG signals from the driver were preprocessed to remove noise, and 17 features were extracted. ANOVA (analysis of variance) was used to select statistically significant features that were then loaded into a machine learning algorithm. We then reduced the features by using principal component analysis (PCA) and trained three classifiers: support vector machine (SVM), k-nearest neighbor (KNN), and ensemble. A maximum accuracy of 98.7% was obtained for the classification of normal and cognitive classes under the category of two-class detection. Upon considering hypovigilance states as five-class, a maximum accuracy of 90.9% was achieved. In this case, the number of detection classes increased, resulting in a reduction in the accuracy of detecting more driver states. However, with the possibility of incorrect identification and the presence of issues, the ensemble classifier's performance produced an enhanced accuracy when compared to others.
Collapse
Affiliation(s)
- Suganiya Murugan
- Department of Computing Technologies, SRM Institute of Science and Technology—KTR, Chennai 603203, India
| | - Pradeep Kumar Sivakumar
- Department of Electrical and Electronics Engineering, Vels Institute of Science Technology and Advanced Studies, Chennai 600117, India
| | - C. Kavitha
- Department of Computer Science and Engineering, Sathyabama Institute of Science and Technology, Chennai 600119, India
| | - Anandhi Harichandran
- Department of Biomedical Engineering, Agni College of Technology, Chennai 600130, India
| | - Wen-Cheng Lai
- Bachelor Program in Industrial Projects, National Yunlin University of Science and Technology, Douliu 640301, Taiwan
- Department of Electronic Engineering, National Yunlin University of Science and Technology, Douliu 640301, Taiwan
| |
Collapse
|
11
|
Zhang T, Zhang X, Lu Z, Zhang Y, Jiang Z, Zhang Y. Feasibility study of personalized speed adaptation method based on mental state for teleoperated robots. Front Neurosci 2022; 16:976437. [PMID: 36117631 PMCID: PMC9479697 DOI: 10.3389/fnins.2022.976437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Accepted: 07/29/2022] [Indexed: 11/13/2022] Open
Abstract
The teleoperated robotic system can support humans to complete tasks in high-risk, high-precision and difficult special environments. Because this kind of special working environment is easy to cause stress, high mental workload, fatigue and other mental states of the operator, which will reduce the quality of operation and even cause safety accidents, so the mental state of the people in this system has received extensive attention. However, the existence of individual differences and mental state diversity is often ignored, so that most of the existing adjustment strategy is out of a match between mental state and adaptive decision, which cannot effectively improve operational quality and safety. Therefore, a personalized speed adaptation (PSA) method based on policy gradient reinforcement learning was proposed in this paper. It can use electroencephalogram and electro-oculogram to accurately perceive the operator’s mental state, and adjust the speed of the robot individually according to the mental state of different operators, in order to perform teleoperation tasks efficiently and safely. The experimental results showed that the PSA method learns the mapping between the mental state and the robot’s speed regulation action by means of rewards and punishments, and can adjust the speed of the robot individually according to the mental state of different operators, thereby improving the operating quality of the system. And the feasibility and superiority of this method were proved. It is worth noting that the PSA method was validated on 6 real subjects rather than a simulation model. To the best of our knowledge, the PSA method is the first implementation of online reinforcement learning control of teleoperated robots involving human subjects.
Collapse
Affiliation(s)
- Teng Zhang
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, China
| | - Xiaodong Zhang
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, China
- Shaanxi Key Laboratory of Intelligent Robot, Xi’an Jiaotong University, Xi’an, China
- *Correspondence: Xiaodong Zhang,
| | - Zhufeng Lu
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, China
| | - Yi Zhang
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, China
| | - Zhiming Jiang
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, China
| | - Yingjie Zhang
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, China
| |
Collapse
|
12
|
Fatigue Driving Detection Method Based on Combination of BP Neural Network and Time Cumulative Effect. SENSORS 2022; 22:s22134717. [PMID: 35808213 PMCID: PMC9269348 DOI: 10.3390/s22134717] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/28/2022] [Revised: 06/18/2022] [Accepted: 06/20/2022] [Indexed: 02/06/2023]
Abstract
Fatigue driving has always received a lot of attention, but few studies have focused on the fact that human fatigue is a cumulative process over time, and there are no models available to reflect this phenomenon. Furthermore, the problem of incorrect detection due to facial expression is still not well addressed. In this article, a model based on BP neural network and time cumulative effect was proposed to solve these problems. Experimental data were used to carry out this work and validate the proposed method. Firstly, the Adaboost algorithm was applied to detect faces, and the Kalman filter algorithm was used to trace the face movement. Then, a cascade regression tree-based method was used to detect the 68 facial landmarks and an improved method combining key points and image processing was adopted to calculate the eye aspect ratio (EAR). After that, a BP neural network model was developed and trained by selecting three characteristics: the longest period of continuous eye closure, number of yawns, and percentage of eye closure time (PERCLOS), and then the detection results without and with facial expressions were discussed and analyzed. Finally, by introducing the Sigmoid function, a fatigue detection model considering the time accumulation effect was established, and the drivers’ fatigue state was identified segment by segment through the recorded video. Compared with the traditional BP neural network model, the detection accuracies of the proposed model without and with facial expressions increased by 3.3% and 8.4%, respectively. The number of incorrect detections in the awake state also decreased obviously. The experimental results show that the proposed model can effectively filter out incorrect detections caused by facial expressions and truly reflect that driver fatigue is a time accumulating process.
Collapse
|
13
|
Zhang W, Wu QMJ, Yang Y, Akilan T. Multimodel Feature Reinforcement Framework Using Moore-Penrose Inverse for Big Data Analysis. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:5008-5021. [PMID: 33021948 DOI: 10.1109/tnnls.2020.3026621] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Fully connected representation learning (FCRL) is one of the widely used network structures in multimodel image classification frameworks. However, most FCRL-based structures, for instance, stacked autoencoder encode features and find the final cognition with separate building blocks, resulting in loosely connected feature representation. This article achieves a robust representation by considering a low-dimensional feature and the classifier model simultaneously. Thus, a new hierarchical subnetwork-based neural network (HSNN) is proposed in this article. The novelties of this framework are as follows: 1) it is an iterative learning process, instead of stacking separate blocks to obtain the discriminative encoding and the final classification results. In this sense, the optimal global features are generated; 2) it applies Moore-Penrose (MP) inverse-based batch-by-batch learning strategy to handle large-scale data sets, so that large data set, such as Place365 containing 1.8 million images, can be processed effectively. The experimental results on multiple domains with a varying number of training samples from ∼ 1 K to ∼ 2 M show that the proposed feature reinforcement framework achieves better generalization performance compared with most state-of-the-art FCRL methods.
Collapse
|
14
|
Deep Coupling Recurrent Auto-Encoder with Multi-Modal EEG and EOG for Vigilance Estimation. ENTROPY 2021; 23:e23101316. [PMID: 34682040 PMCID: PMC8534880 DOI: 10.3390/e23101316] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/11/2021] [Revised: 09/29/2021] [Accepted: 10/07/2021] [Indexed: 11/25/2022]
Abstract
Vigilance estimation of drivers is a hot research field of current traffic safety. Wearable devices can monitor information regarding the driver’s state in real time, which is then analyzed by a data analysis model to provide an estimation of vigilance. The accuracy of the data analysis model directly affects the effect of vigilance estimation. In this paper, we propose a deep coupling recurrent auto-encoder (DCRA) that combines electroencephalography (EEG) and electrooculography (EOG). This model uses a coupling layer to connect two single-modal auto-encoders to construct a joint objective loss function optimization model, which consists of single-modal loss and multi-modal loss. The single-modal loss is measured by Euclidean distance, and the multi-modal loss is measured by a Mahalanobis distance of metric learning, which can effectively reflect the distance between different modal data so that the distance between different modes can be described more accurately in the new feature space based on the metric matrix. In order to ensure gradient stability in the long sequence learning process, a multi-layer gated recurrent unit (GRU) auto-encoder model was adopted. The DCRA integrates data feature extraction and feature fusion. Relevant comparative experiments show that the DCRA is better than the single-modal method and the latest multi-modal fusion. The DCRA has a lower root mean square error (RMSE) and a higher Pearson correlation coefficient (PCC).
Collapse
|
15
|
Wu W, Sun W, Wu QMJ, Zhang C, Yang Y, Yu H, Lu BL. Faster Single Model Vigilance Detection Based on Deep Learning. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2019.2963073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
16
|
Zhang G, Etemad A. Capsule Attention for Multimodal EEG-EOG Representation Learning With Application to Driver Vigilance Estimation. IEEE Trans Neural Syst Rehabil Eng 2021; 29:1138-1149. [PMID: 34129500 DOI: 10.1109/tnsre.2021.3089594] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Driver vigilance estimation is an important task for transportation safety. Wearable and portable brain-computer interface devices provide a powerful means for real-time monitoring of the vigilance level of drivers to help with avoiding distracted or impaired driving. In this paper, we propose a novel multimodal architecture for in-vehicle vigilance estimation from Electroencephalogram and Electrooculogram. To enable the system to focus on the most salient parts of the learned multimodal representations, we propose an architecture composed of a capsule attention mechanism following a deep Long Short-Term Memory (LSTM) network. Our model learns hierarchical dependencies in the data through the LSTM and capsule feature representation layers. To better explore the discriminative ability of the learned representations, we study the effect of the proposed capsule attention mechanism including the number of dynamic routing iterations as well as other parameters. Experiments show the robustness of our method by outperforming other solutions and baseline techniques, setting a new state-of-the-art. We then provide an analysis on different frequency bands and brain regions to evaluate their suitability for driver vigilance estimation. Lastly, an analysis on the role of capsule attention, multimodality, and robustness to noise is performed, highlighting the advantages of our approach.
Collapse
|
17
|
Karnati M, Seal A, Yazidi A, Krejcar O. LieNet: A Deep Convolution Neural Networks Framework for Detecting Deception. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2021.3086011] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
18
|
Cao J, Zhu J, Hu W, Kummert A. Epileptic Signal Classification With Deep EEG Features by Stacked CNNs. IEEE Trans Cogn Dev Syst 2020. [DOI: 10.1109/tcds.2019.2936441] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
19
|
Zhang W, Wu J, Yang Y. Wi-HSNN: A subnetwork-based encoding structure for dimension reduction and food classification via harnessing multi-CNN model high-level features. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2020.07.018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
|
20
|
Peng Y, Wang Z, Wong CM, Nan W, Rosa A, Xu P, Wan F, Hu Y. Changes of EEG phase synchronization and EOG signals along the use of steady state visually evoked potential-based brain computer interface. J Neural Eng 2020; 17:045006. [PMID: 32408272 DOI: 10.1088/1741-2552/ab933e] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|