1
|
Liu Y, Li X, Wang M, Bi J, Lin S, Wang Q, Yu Y, Ye J, Zheng Y. Multimodal depression recognition and analysis: Facial expression and body posture changes via emotional stimuli. J Affect Disord 2025; 381:44-54. [PMID: 40187420 DOI: 10.1016/j.jad.2025.03.155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/30/2024] [Revised: 02/22/2025] [Accepted: 03/25/2025] [Indexed: 04/07/2025]
Abstract
BACKGROUND Clinical studies have shown that facial expressions and body posture in depressed patients differ significantly from those of healthy individuals. Combining relevant behavioral features with artificial intelligence technology can effectively improve the efficiency of depression detection, thereby assisting doctors in early identification of patients. This study aims to develop an end-to-end multimodal recognition model combining facial expressions and body posture via deep learning techniques, enabling rapid preliminary screening of depression. METHODS We invited 146 subjects (73 in the patient group and 73 in the control group) to participate in an emotion-stimulus experiment for depression recognition. We focused on differentiating depression patients from the control group by analyzing changes in body posture and facial expressions under emotional stimuli. We first extracted images of body position and facial emotions from the video, then used a pre-trained ResNet-50 network to extract features. Additionally, we analyzed facial expression features using OpenFace for sequence analysis. Subsequently, various deep learning frameworks were combined to assess the severity of depression. RESULTS We found that under different stimuli, facial expression units AU04, AU07, AU10, AU12, AU17, and AU26 had significant effects in the emotion-stimulus experiment, with these features generally being negative. The decision-level fusion model based on facial expressions and body posture achieved excellent results, with the highest accuracy of 0.904 and an F1 score of 0.901. CONCLUSIONS The experimental results suggest that depression patients exhibit predominantly negative facial expressions. This study validates the emotion-stimulus experiment, demonstrating that combining facial expressions and body posture enables accurate preliminary depression screening.
Collapse
Affiliation(s)
- Yang Liu
- Endocrinology, The Fifth Clinical College of Guangzhou University of Chinese Medicine, Guangzhou 510095, China; Endocrinology, Guangdong Provincial Second Hospital of Traditional Chinese Medicine, Guangzhou 510095, China
| | - Xingyun Li
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
| | - Mengqi Wang
- Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Qilu University of Technology (Shandong Academy of Science), Jinan 250353, China
| | - Jianlu Bi
- Endocrinology, The Fifth Clinical College of Guangzhou University of Chinese Medicine, Guangzhou 510095, China; Endocrinology, Guangdong Provincial Second Hospital of Traditional Chinese Medicine, Guangzhou 510095, China
| | - Shaoqin Lin
- Endocrinology, The Fifth Clinical College of Guangzhou University of Chinese Medicine, Guangzhou 510095, China; Endocrinology, Guangdong Provincial Second Hospital of Traditional Chinese Medicine, Guangzhou 510095, China
| | - Qingxiang Wang
- Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Qilu University of Technology (Shandong Academy of Science), Jinan 250353, China.
| | - Yanhong Yu
- College of Traditional Chinese Medicine, Shandong University of Traditional Chinese Medicine, Jinan 250355, China.
| | - Jiayu Ye
- Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Qilu University of Technology (Shandong Academy of Science), Jinan 250353, China; School of Computer Science, Guangdong University of Technology, Guangzhou 510006, China.
| | - Yunshao Zheng
- Shandong Mental Health Center, Shandong University, Jinan 250014, China
| |
Collapse
|
2
|
Li L, Li J, Wu H, Zhao Y, Liu Q, Zhang H, Xu W. Optimal channel and feature selection for automatic prediction of functional brain age of preterm infant based on EEG. Front Neurosci 2025; 19:1517141. [PMID: 39935839 PMCID: PMC11811077 DOI: 10.3389/fnins.2025.1517141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2024] [Accepted: 01/10/2025] [Indexed: 02/13/2025] Open
Abstract
Introduction Approximately 15 million premature infants are born each year, many of whom face risks of neurological impairments. Accurate assessment of brain maturity is crucial for timely intervention and treatment planning. Electroencephalography (EEG) is a noninvasive method commonly used for this purpose. However, using all channels and features for brain maturity assessment can lead to high computational burden and overfitting, which can decrease the performance of the prediction system. Methods In this study, we propose an automatic prediction framework based on EEG to predict functional brain age (FBA) for assessing brain maturity in preterm infants. To optimize channel selection, we combine Binary Particle Swarm Optimization (BPSO) with Forward Addition (FA) and Backward Elimination (BE) methods. For feature selection, we combine the Pearson Correlation Coefficient (PCC), Recursive Feature Elimination (RFE), and Support Vector Regression (SVR) model. Results The proposed framework achieved a prediction accuracy of 76.71% within ±1 week and 94.52% within ±2 weeks. Effective channel and feature selection significantly improved model performance while reducing computational costs. Discussion These results demonstrate that optimizing channel and feature selection can enhance the performance of FBA prediction in preterm infants, offering a more efficient and accurate tool for brain maturity assessment.
Collapse
Affiliation(s)
- Ling Li
- College of Communication Engineering, Jilin University, Changchun, Jilin, China
| | - Jiahui Li
- College of Communication Engineering, Jilin University, Changchun, Jilin, China
| | - Hui Wu
- Department of Neonatology, The First Hospital of Jilin University, Changchun, Jilin, China
| | - Yanping Zhao
- College of Communication Engineering, Jilin University, Changchun, Jilin, China
| | - Qinmei Liu
- Department of Neonatology, The First Hospital of Jilin University, Changchun, Jilin, China
| | - Hairong Zhang
- College of Communication Engineering, Jilin University, Changchun, Jilin, China
| | - Wei Xu
- Department of Neonatology, The First Hospital of Jilin University, Changchun, Jilin, China
| |
Collapse
|
3
|
Lian Z, Xu T, Yuan Z, Li J, Thakor N, Wang H. Driving Fatigue Detection Based on Hybrid Electroencephalography and Eye Tracking. IEEE J Biomed Health Inform 2024; 28:6568-6580. [PMID: 39167519 DOI: 10.1109/jbhi.2024.3446952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/23/2024]
Abstract
EEG-based unimodal method has demonstrated significant success in the detection of driving fatigue. Nonetheless, data from a single modality might be not sufficient to optimize fatigue detection due to incomplete information. To address this limitation and enhance the performance of driving fatigue detection, a novel multimodal architecture combining hybrid electroencephalograph (EEG) and eye tracking data was proposed in this work. Specifically, the EEG and eye tracking data were separately input into encoders, generating two one-dimensional (1D) features. Subsequently, these 1D features were fed into a cross-modal predictive alignment module to improve fusion efficiency and two 1D attention modules to enhance feature representation. Furthermore, the fused features were recognized by a linear classifier. To evaluate the effectiveness of the proposed multimodal method, comprehensive validation tasks were conducted, including intra-session, cross-session, and cross-subject evaluations. In the intra-session task, the proposed architecture achieves an exceptional average accuracy of 99.93%. Moreover, in the cross-session task, our method demonstrates an average accuracy of 88.67%, surpassing the performance of EEG-only approach by 8.52%, eye tracking-only method by 5.92%, multimodal deep canonical correlation analysis (DCCA) technique by 0.42%, and multimodal deep generalized canonical correlation analysis (DGCCA) approach by 0.84%. Similarly, in the cross-subject task, the proposed approach achieves an average accuracy of 78.19%, outperforming EEG-only method by 5.87%, eye tracking-only approach by 4.21%, DCCA method by 0.55%, and DGCCA approach by 0.44%. The experimental results conclusively illustrate the superior effectiveness of the proposed method compared to both single modality approaches and canonical correlation analysis-based multimodal methods.
Collapse
|
4
|
Liu W, Zhou B, Li G, Luo X. Enhanced diagnostics for generalized anxiety disorder: leveraging differential channel and functional connectivity features based on frontal EEG signals. Sci Rep 2024; 14:22789. [PMID: 39354007 PMCID: PMC11445517 DOI: 10.1038/s41598-024-73615-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2024] [Accepted: 09/19/2024] [Indexed: 10/03/2024] Open
Abstract
Generalized Anxiety Disorder (GAD) is a chronic anxiety condition characterized by persistent excessive worry, anxiety, and fear. Current diagnostic practices primarily rely on clinicians' subjective assessments and experience, highlighting a need for more objective and reliable methods. This study collected 10-minute resting-state electroencephalogram (EEG) from 45 GAD patients and 36 healthy controls (HC), focusing on six frontal EEG channels for preprocessing, data segmentation, and frequency band division. Innovatively, this study introduced the "Differential Channel" method, which enhances classification performance by enhancing the information related to anxiety from the data, thereby highlighting signal differences. Utilizing the preprocessed EEG signals, undirected functional connectivity features (Phase Lag Index, Pearson Correlation Coefficient, and Mutual Information) and directed functional connectivity features (Partial Directed Coherence) were extracted. Multiple machine learning models were applied to distinguish between GAD patients and HC. The results show that the Deep Forest classifier achieves excellent performance with a 12-second time window of DiffFeature. In particular, the classification of GAD and HC was successfully obtained by combining OriFeature and DiffFeature on Mutual Information with a maximum accuracy of 98.08%. Furthermore, it was observed that undirected functional connectivity features significantly outperformed directed functional connectivity when fewer frontal channels were used. Overall, the methodologies developed in this study offer accurate and practical identification strategies for the early screening and clinical diagnosis of GAD, offering the necessary theoretical and technical support for further enhancing the portability of EEG devices.
Collapse
Affiliation(s)
- Wei Liu
- College of Computer Science and Technology, Zhejiang Normal University, Jinhua, 321004, China
| | - Bin Zhou
- College of Mathematical Medicine, Zhejiang Normal University, Jinhua, 321004, China
| | - Gang Li
- College of Mathematical Medicine, Zhejiang Normal University, Jinhua, 321004, China.
| | - Xiaodong Luo
- The Second Hospital of Jinhua, Jinhua, 321016, China.
| |
Collapse
|
5
|
Wang R, He Q, Shi L, Che Y, Xu H, Song C. Automatic detection of Alzheimer's disease from EEG signals using an improved AFS-GA hybrid algorithm. Cogn Neurodyn 2024; 18:2993-3013. [PMID: 39555281 PMCID: PMC11564554 DOI: 10.1007/s11571-024-10130-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Revised: 05/06/2024] [Accepted: 05/13/2024] [Indexed: 11/19/2024] Open
Abstract
Alzheimer's Disease (AD) is a neurodegenerative disorder characterized by energy diffusion and partial disconnection in the brain, with its main feature being an insidious onset and subtle clinical symptoms. Electroencephalogram (EEG) as a primary tool for assessing and aiding in the diagnosis of brain diseases has been widely used in AD detection. Accurate diagnosis is crucial for preventing the transition from early cognitive impairment to AD and providing early treatment for AD patients. This study aims to establish a hybrid model based on the Improved Artificial Fish Swarm Algorithm (IAFS) and Genetic Algorithm (GA)-IAFS-GA, to determine the optimal channel combination for AD detection under multiple EEG signals. Geometric features and complexity features of AD EEG signals were extracted using Second Order Difference Plot (SODP) and entropy analysis across the full frequency band. Subsequently, Pearson correlation was used for feature ranking, selecting the six least correlated features for each channel. The Relief algorithm was then used to fuse these selected features, with one fused feature representing one channel. Based on this, a feature selection optimization algorithm, IAFS-GA, combining the improved artificial fish swarm algorithm and genetic algorithm, was proposed. Finally, the feature combination was input into a Naive Bayes classifier for the identification of AD patients and normal controls. The feature combination was input into a Naive Bayes classifier for the identification of AD patients and normal controls. Using a five-fold cross-validation strategy across the entire frequency band, the classification accuracy reached 93.53%, with a sensitivity of 98.74%, specificity of 98.25%, and an AUC area of 97.82%. This framework can quickly select appropriate brain channels to enhance the efficiency of detecting AD and other neurological diseases. Moreover, it is the first time that an improved artificial fish swarm genetic combination algorithm and SODP features has been used for channel selection in EEG, proving to be an effective method for AD detection. It is based on SODP analysis, entropy analysis, and intelligent algorithms, which can assist clinicians in rapidly diagnosing AD, reducing the misdiagnosis rate of false positives, and expanding our understanding of brain function in patients with neurological diseases.
Collapse
Affiliation(s)
- Ruofan Wang
- School of Information Technology Engineering, Tianjin University of Technology and Education, Tianjin, 300350 China
| | - Qiguang He
- School of Information Technology Engineering, Tianjin University of Technology and Education, Tianjin, 300350 China
| | - Lianshuan Shi
- School of Information Technology Engineering, Tianjin University of Technology and Education, Tianjin, 300350 China
| | - Yanqiu Che
- School of Automation and Electrical Engineering, Tianjin University of Technology and Education, Tianjin, 300222 China
- Tianjin Key Laboratory of Information Sensing and Intelligent Control, Tianjin University of Technology and Education, Tianjin, 300222 China
| | - Haojie Xu
- School of Information Technology Engineering, Tianjin University of Technology and Education, Tianjin, 300350 China
| | - Changzhi Song
- School of Information Technology Engineering, Tianjin University of Technology and Education, Tianjin, 300350 China
| |
Collapse
|
6
|
Chen W, Cai Y, Li A, Jiang K, Su Y. MDD brain network analysis based on EEG functional connectivity and graph theory. Heliyon 2024; 10:e36991. [PMID: 39281492 PMCID: PMC11402240 DOI: 10.1016/j.heliyon.2024.e36991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 08/26/2024] [Accepted: 08/26/2024] [Indexed: 09/18/2024] Open
Abstract
Background Existing studies have shown that the brain network of major depression disorder (MDD) has abnormal topologies. However, constructing reliable MDD brain networks is still an open problem. New method This paper proposed a reliable MDD brain network construction method. First, seven connectivity methods are used to calculate the correlation between channels and obtain the functional connectivity matrix. Then, the matrix is binarized using four binarization methods to obtain the EEG brain network. Besides, we proposed an improved binarization method based on the criterion of maximizing differences between groups: the adaptive threshold (AT) method. The AT can automatically set the optimal binarization threshold and overcome the artificial influence of traditional methods. After that, several network metrics are extracted from the brain network to analyze inter-group differences. Finally, we used statistical analysis and Fscore values to compare the performance of different methods and establish the most reliable method for brain network construction. Results In theta, alpha, and total frequency bands, the clustering coefficient, global efficiency, local efficiency, and degree of the MDD brain network decrease, and the path length of the MDD brain network increases. Comparison with existing methods The results show that AT outperforms the existing binarization methods. Compared with other methods, the brain network construction method based on phase-locked value (PLV) and AT has better reliability. Conclusions MDD has brain dysfunction, particularly in the frontal and temporal lobes.
Collapse
Affiliation(s)
- Wan Chen
- Rocket Force University of Engineering, Xi'an, 710025, China
| | - Yanping Cai
- Rocket Force University of Engineering, Xi'an, 710025, China
| | - Aihua Li
- Rocket Force University of Engineering, Xi'an, 710025, China
| | - Ke Jiang
- Rocket Force University of Engineering, Xi'an, 710025, China
| | - Yanzhao Su
- Rocket Force University of Engineering, Xi'an, 710025, China
| |
Collapse
|
7
|
Shen J, Li K, Liang H, Zhao Z, Ma Y, Wu J, Zhang J, Zhang Y, Hu B. HEMAsNet: A Hemisphere Asymmetry Network Inspired by the Brain for Depression Recognition From Electroencephalogram Signals. IEEE J Biomed Health Inform 2024; 28:5247-5259. [PMID: 38781058 DOI: 10.1109/jbhi.2024.3404664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2024]
Abstract
Depression is a prevalent mental disorder that affects a significant portion of the global population. Despite recent advancements in EEG-based depression recognition models rooted in machine learning and deep learning approaches, many lack comprehensive consideration of depression's pathogenesis, leading to limited neuroscientific interpretability. To address these issues, we propose a hemisphere asymmetry network (HEMAsNet) inspired by the brain for depression recognition from EEG signals. HEMAsNet employs a combination of multi-scale Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) blocks to extract temporal features from both hemispheres of the brain. Moreover, the model introduces a unique 'Callosum-like' block, inspired by the corpus callosum's pivotal role in facilitating inter-hemispheric information transfer within the brain. This block enhances information exchange between hemispheres, potentially improving depression recognition accuracy. To validate the performance of HEMAsNet, we first confirmed the asymmetric features of frontal lobe EEG in the MODMA dataset. Subsequently, our method achieved a depression recognition accuracy of 0.8067, indicating its effectiveness in increasing classification performance. Furthermore, we conducted a comprehensive investigation from spatial and frequency perspectives, demonstrating HEMAsNet's innovation in explaining model decisions. The advantages of HEMAsNet lie in its ability to achieve more accurate and interpretable recognition of depression through the simulation of physiological processes, integration of spatial information, and incorporation of the Callosum-like block.
Collapse
|
8
|
Shao K, Liu Y, Mo Y, Yang Q, Hao Y, Chen M. fNIRS-Driven Depression Recognition Based on Cross-Modal Data Augmentation. IEEE Trans Neural Syst Rehabil Eng 2024; 32:2688-2698. [PMID: 39012734 DOI: 10.1109/tnsre.2024.3429337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/18/2024]
Abstract
Early diagnosis and intervention of depression promote complete recovery, with its traditional clinical assessments depending on the diagnostic scales, clinical experience of doctors and patient cooperation. Recent researches indicate that functional near-infrared spectroscopy (fNIRS) based on deep learning provides a promising approach to depression diagnosis. However, collecting large fNIRS datasets within a standard experimental paradigm remains challenging, limiting the applications of deep networks that require more data. To address these challenges, in this paper, we propose an fNIRS-driven depression recognition architecture based on cross-modal data augmentation (fCMDA), which converts fNIRS data into pseudo-sequence activation images. The approach incorporates a time-domain augmentation mechanism, including time warping and time masking, to generate diverse data. Additionally, we design a stimulation task-driven data pseudo-sequence method to map fNIRS data into pseudo-sequence activation images, facilitating the extraction of spatial-temporal, contextual and dynamic characteristics. Ultimately, we construct a depression recognition model based on deep classification networks using the imbalance loss function. Extensive experiments are performed on the two-class depression diagnosis and five-class depression severity recognition, which reveal impressive results with accuracy of 0.905 and 0.889, respectively. The fCMDA architecture provides a novel solution for effective depression recognition with limited data.
Collapse
|
9
|
Risqiwati D, Wibawa AD, Pane ES, Yuniarno EM, Islamiyah WR, Purnomo MH. Effective relax acquisition: a novel approach to classify relaxed state in alpha band EEG-based transformation. Brain Inform 2024; 11:12. [PMID: 38740660 DOI: 10.1186/s40708-024-00225-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 04/17/2024] [Indexed: 05/16/2024] Open
Abstract
A relaxed state is essential for effective hypnotherapy, a crucial component of mental health treatments. During hypnotherapy sessions, neurologists rely on the patient's relaxed state to introduce positive suggestions. While EEG is a widely recognized method for detecting human emotions, analyzing EEG data presents challenges due to its multi-channel, multi-band nature, leading to high-dimensional data. Furthermore, determining the onset of relaxation remains challenging for neurologists. This paper presents the Effective Relax Acquisition (ERA) method designed to identify the beginning of a relaxed state. ERA employs sub-band sampling within the Alpha band for the frequency domain and segments the data into four-period groups for the time domain analysis. Data enhancement strategies include using Window Length (WL) and Overlapping Shifting Windows (OSW) scenarios. Dimensionality reduction is achieved through Principal Component Analysis (PCA) by prioritizing the most significant eigenvector values. Our experimental results indicate that the relaxed state is predominantly observable in the high Alpha sub-band, particularly within the fourth period group. The ERA demonstrates high accuracy with a WL of 3 s and OSW of 0.25 s using the KNN classifier (90.63%). These findings validate the effectiveness of ERA in accurately identifying relaxed states while managing the complexity of EEG data.
Collapse
Affiliation(s)
- Diah Risqiwati
- Departement of Electrical Engineering, Institut Teknologi Sepuluh Nopember, Keputih, Surabaya, 60111, Indonesia
- Departement of Informatics, Universitas Muhammadiyah Malang, Tlogomas, Malang, 65144, Indonesia
| | - Adhi Dharma Wibawa
- Departement of Electrical Engineering, Institut Teknologi Sepuluh Nopember, Keputih, Surabaya, 60111, Indonesia
- Medical Technology, Institut Teknologi Sepuluh Nopember, Keputih, Surabaya, 60111, Indonesia
| | - Evi Septiana Pane
- Industrial Training and Education of Surabaya, Ministry of Industry RI, Gayungan, Surabaya, 60235, Indonesia
| | - Eko Mulyanto Yuniarno
- Departement of Electrical Engineering, Institut Teknologi Sepuluh Nopember, Keputih, Surabaya, 60111, Indonesia
- Departement of Computer Engineering, Institut Teknologi Sepuluh Nopember, Keputih, Surabaya, 60111, Indonesia
| | - Wardah Rahmatul Islamiyah
- Neurology Department, Faculty of Medicine, Universitas Airlangga, Gubeng, Surabaya, 60131, Indonesia
| | - Mauridhi Hery Purnomo
- Departement of Electrical Engineering, Institut Teknologi Sepuluh Nopember, Keputih, Surabaya, 60111, Indonesia.
- Departement of Computer Engineering, Institut Teknologi Sepuluh Nopember, Keputih, Surabaya, 60111, Indonesia.
| |
Collapse
|
10
|
Liu W, Jia K, Wang Z. Graph-based EEG approach for depression prediction: integrating time-frequency complexity and spatial topology. Front Neurosci 2024; 18:1367212. [PMID: 38633266 PMCID: PMC11022962 DOI: 10.3389/fnins.2024.1367212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 03/11/2024] [Indexed: 04/19/2024] Open
Abstract
Depression has become the prevailing global mental health concern. The accuracy of traditional depression diagnosis methods faces challenges due to diverse factors, making primary identification a complex task. Thus, the imperative lies in developing a method that fulfills objectivity and effectiveness criteria for depression identification. Current research underscores notable disparities in brain activity between individuals with depression and those without. The Electroencephalogram (EEG), as a biologically reflective and easily accessible signal, is widely used to diagnose depression. This article introduces an innovative depression prediction strategy that merges time-frequency complexity and electrode spatial topology to aid in depression diagnosis. Initially, time-frequency complexity and temporal features of the EEG signal are extracted to generate node features for a graph convolutional network. Subsequently, leveraging channel correlation, the brain network adjacency matrix is employed and calculated. The final depression classification is achieved by training and validating a graph convolutional network with graph node features and a brain network adjacency matrix based on channel correlation. The proposed strategy has been validated using two publicly available EEG datasets, MODMA and PRED+CT, achieving notable accuracy rates of 98.30 and 96.51%, respectively. These outcomes affirm the reliability and utility of our proposed strategy in predicting depression using EEG signals. Additionally, the findings substantiate the effectiveness of EEG time-frequency complexity characteristics as valuable biomarkers for depression prediction.
Collapse
Affiliation(s)
- Wei Liu
- Faculty of Information Technology, Beijing University of Technology, Beijing, China
- Beijing Laboratory of Advanced Information Networks, Beijing, China
- Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, China
| | - Kebin Jia
- Faculty of Information Technology, Beijing University of Technology, Beijing, China
- Beijing Laboratory of Advanced Information Networks, Beijing, China
- Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, China
| | - Zhuozheng Wang
- Faculty of Information Technology, Beijing University of Technology, Beijing, China
| |
Collapse
|
11
|
Li X, Yi X, Lu L, Wang H, Zheng Y, Han M, Wang Q. TSFFM: Depression detection based on latent association of facial and body expressions. Comput Biol Med 2024; 168:107805. [PMID: 38064845 DOI: 10.1016/j.compbiomed.2023.107805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 11/23/2023] [Accepted: 11/29/2023] [Indexed: 01/10/2024]
Abstract
Depression is a prevalent mental disorder worldwide. Early screening and treatment are crucial in preventing the progression of the illness. Existing emotion-based depression recognition methods primarily rely on facial expressions, while body expressions as a means of emotional expression have been overlooked. To aid in the identification of depression, we recruited 156 participants for an emotional stimulation experiment, gathering data on facial and body expressions. Our analysis revealed notable distinctions in facial and body expressions between the case group and the control group and a synergistic relationship between these variables. Hence, we propose a two-stream feature fusion model (TSFFM) that integrates facial and body features. The central component of TSFFM is the Fusion and Extraction (FE) module. In contrast to conventional methods such as feature concatenation and decision fusion, our approach, FE, places a greater emphasis on in-depth analysis during the feature extraction and fusion processes. Firstly, within FE, we carry out local enhancement of facial and body features, employing an embedded attention mechanism, eliminating the need for original image segmentation and the use of multiple feature extractors. Secondly, FE conducts the extraction of temporal features to better capture the dynamic aspects of expression patterns. Finally, we retain and fuse informative data from different temporal and spatial features to support the ultimate decision. TSFFM achieves an Accuracy and F1-score of 0.896 and 0.896 on the depression emotional stimulus dataset, respectively. On the AVEC2014 dataset, TSFFM achieves MAE and RMSE values of 5.749 and 7.909, respectively. Furthermore, TSFFM has undergone testing on additional public datasets to showcase the effectiveness of the FE module.
Collapse
Affiliation(s)
- Xingyun Li
- Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan, China; Shandong Engineering Research Center of Big Data Applied Technology, Faculty of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China; Shandong Provincial Key Laboratory of Computer Networks, Shandong Fundamental Research Center for Computer Science, Jinan, China
| | - Xinyu Yi
- Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan, China; Shandong Engineering Research Center of Big Data Applied Technology, Faculty of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China; Shandong Provincial Key Laboratory of Computer Networks, Shandong Fundamental Research Center for Computer Science, Jinan, China
| | - Lin Lu
- Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan, China; Shandong Engineering Research Center of Big Data Applied Technology, Faculty of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China; Shandong Provincial Key Laboratory of Computer Networks, Shandong Fundamental Research Center for Computer Science, Jinan, China
| | - Hao Wang
- Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan, China; Shandong Engineering Research Center of Big Data Applied Technology, Faculty of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China; Shandong Provincial Key Laboratory of Computer Networks, Shandong Fundamental Research Center for Computer Science, Jinan, China
| | - Yunshao Zheng
- Shandong Mental Health Center, Shandong University, Jinan, China
| | - Mengmeng Han
- Advanced Technology Research Institute, Beijing Institute of Technology, Jinan, China
| | - Qingxiang Wang
- Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan, China; Shandong Engineering Research Center of Big Data Applied Technology, Faculty of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China; Shandong Mental Health Center, Shandong University, Jinan, China; Shandong Provincial Key Laboratory of Computer Networks, Shandong Fundamental Research Center for Computer Science, Jinan, China.
| |
Collapse
|
12
|
Zhang S, Wang H, Zheng Z, Liu T, Li W, Zhang Z, Sun Y. Multi-View Graph Contrastive Learning via Adaptive Channel Optimization for Depression Detection in EEG Signals. Int J Neural Syst 2023; 33:2350055. [PMID: 37899654 DOI: 10.1142/s0129065723500557] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/31/2023]
Abstract
Automated detection of depression using Electroencephalogram (EEG) signals has become a promising application in advanced bioinformatics technology. Although current methods have achieved high detection performance, several challenges still need to be addressed: (1) Previous studies do not consider data redundancy when modeling multi-channel EEG signals, resulting in some unrecognized noise channels remaining. (2) Most works focus on the functional connection of EEG signals, ignoring their spatial proximity. The spatial topological structure of EEG signals has not been fully utilized to capture more fine-grained features. (3) Prior depression detection models fail to provide interpretability. To address these challenges, this paper proposes a new model, Multi-view Graph Contrastive Learning via Adaptive Channel Optimization (MGCL-ACO) for depression detection in EEG signals. Specifically, the proposed model first selects the critical channels by maximizing the mutual information between tracks and labels of EEG signals to eliminate data redundancy. Then, the MGCL-ACO model builds two similarity metric views based on functional connectivity and spatial proximity. MGCL-ACO constructs the feature extraction module by graph convolutions and contrastive learning to capture more fine-grained features of different perspectives. Finally, our model provides interpretability by visualizing a brain map related to the significance scores of the selected channels. Extensive experiments have been performed on public datasets, and the results show that our proposed model outperforms the most advanced baselines. Our proposed model not only provides a promising approach for automated depression detection using optimal EEG signals but also has the potential to improve the accuracy and interpretability of depression diagnosis in clinical practice.
Collapse
Affiliation(s)
- Shuangyong Zhang
- School of Information Science and Engineering Shandong Normal University, Jinan 250014, P. R. China
| | - Hong Wang
- School of Information Science and Engineering Shandong Normal University, Jinan 250014, P. R. China
| | - Zixi Zheng
- School of Information Science and Engineering Shandong Normal University, Jinan 250014, P. R. China
| | - Tianyu Liu
- School of Information Science and Engineering Shandong Normal University, Jinan 250014, P. R. China
| | - Weixin Li
- School of Information Science and Engineering Shandong Normal University, Jinan 250014, P. R. China
| | - Zishan Zhang
- School of Information Science and Engineering Shandong Normal University, Jinan 250014, P. R. China
| | - Yanshen Sun
- Department of Computer Science, Virginia Tech, Blacksburg 24061, USA
| |
Collapse
|
13
|
Hag A, Al-Shargie F, Handayani D, Asadi H. Mental Stress Classification Based on Selected Electroencephalography Channels Using Correlation Coefficient of Hjorth Parameters. Brain Sci 2023; 13:1340. [PMID: 37759941 PMCID: PMC10527440 DOI: 10.3390/brainsci13091340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 09/11/2023] [Accepted: 09/13/2023] [Indexed: 09/29/2023] Open
Abstract
Electroencephalography (EEG) signals offer invaluable insights into diverse activities of the human brain, including the intricate physiological and psychological responses associated with mental stress. A major challenge, however, is accurately identifying mental stress while mitigating the limitations associated with a large number of EEG channels. Such limitations encompass computational complexity, potential overfitting, and the prolonged setup time for electrode placement, all of which can hinder practical applications. To address these challenges, this study presents the novel CCHP method, aimed at identifying and ranking commonly optimal EEG channels based on their sensitivity to the mental stress state. This method's uniqueness lies in its ability not only to find common channels, but also to prioritize them according to their responsiveness to stress, ensuring consistency across subjects and making it potentially transformative for real-world applications. From our rigorous examinations, eight channels emerged as universally optimal in detecting stress variances across participants. Leveraging features from the time, frequency, and time-frequency domains of these channels, and employing machine learning algorithms, notably RLDA, SVM, and KNN, our approach achieved a remarkable accuracy of 81.56% with the SVM algorithm outperforming existing methodologies. The implications of this research are profound, offering a stepping stone toward the development of real-time stress detection devices, and consequently, enabling clinicians to make more informed therapeutic decisions based on comprehensive brain activity monitoring.
Collapse
Affiliation(s)
- Ala Hag
- School of Computer Science & Engineering, Taylor’s University, Jalan Taylors, Subang Jaya 47500, Selangor, Malaysia;
| | - Fares Al-Shargie
- Institute for Intelligent Systems Research and Innovation, Deakin University, Geelong, VIC 3216, Australia
| | - Dini Handayani
- Department of Electrical Engineering, Abu Dhabi University, Abu Dhabi P.O. Box 59911, United Arab Emirates;
| | - Houshyar Asadi
- Computer Science Department, KICT, International Islamic University Malaysia, Kuala Lumpur 53100, Selangor, Malaysia
| |
Collapse
|
14
|
Ma Y, Shen J, Zhao Z, Liang H, Tan Y, Liu Z, Qian K, Yang M, Hu B. What Can Facial Movements Reveal? Depression Recognition and Analysis Based on Optical Flow Using Bayesian Networks. IEEE Trans Neural Syst Rehabil Eng 2023; 31:3459-3468. [PMID: 37581961 DOI: 10.1109/tnsre.2023.3305351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/17/2023]
Abstract
Recent evidence have demonstrated that facial expressions could be a valid and important aspect for depression recognition. Although various works have been achieved in automatic depression recognition, it is a challenge to explore the inherent nuances of facial expressions that might reveal the underlying differences between depressed patients and healthy subjects under different stimuli. There is a lack of an undisturbed system that monitors depressive patients' mental states in various free-living scenarios, so this paper steps towards building a classification model where data collection, feature extraction, depression recognition and facial actions analysis are conducted to infer the differences of facial movements between depressive patients and healthy subjects. In this study, we firstly present a plan of dividing facial regions of interest to extract optical flow features of facial expressions for depression recognition. We then propose facial movements coefficients utilising discrete wavelet transformation. Specifically, Bayesian Networks equipped with construction of Pearson Correlation Coefficients based on discrete wavelet transformation is learnt, which allows for analysing movements of different facial regions. We evaluate our method on a clinically validated dataset of 30 depressed patients and 30 healthy control subjects, and experiments results obtained the accuracy and recall of 81.7%, 96.7%, respectively, outperforming other features for comparison. Most importantly, the Bayesian Networks we built on the coefficients under different stimuli may reveal some facial action patterns of depressed subjects, which have a potential to assist the automatic diagnosis of depression.
Collapse
|
15
|
Shen J, Zhang Y, Liang H, Zhao Z, Zhu K, Qian K, Dong Q, Zhang X, Hu B. Depression Recognition From EEG Signals Using an Adaptive Channel Fusion Method via Improved Focal Loss. IEEE J Biomed Health Inform 2023; 27:3234-3245. [PMID: 37037251 DOI: 10.1109/jbhi.2023.3265805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/12/2023]
Abstract
Depression is a serious and common psychiatric disease characterized by emotional and cognitive dysfunction. In addition, the rates of clinical diagnosis and treatment for depression are low. Therefore, the accurate recognition of depression is important for its effective treatment. Electroencephalogram (EEG) signals, which can objectively reflect the inner states of human brains, are regarded as promising physiological tools that can enable effective and efficient clinical depression diagnosis and recognition. However, one of the challenges regarding EEG-based depression recognition involves sufficiently optimizing the spatial information derived from the multichannel space of EEG signals. Consequently, we propose an adaptive channel fusion method via improved focal loss (FL) functions for depression recognition based on EEG signals to effectively address this challenge. In this method, we propose two improved FL functions that can enhance the separability of hard examples by upweighting their losses as optimization objectives and can optimize the channel weights by a proposed adaptive channel fusion framework. The experimental results obtained on two EEG datasets show that the developed channel fusion method can achieve improved classification performance. The learned channel weights include the individual characteristics of each EEG epoch, which can effectively optimize the spatial information of each EEG epoch via the channel fusion method. In addition, the proposed method performs better than the state-of-the-art channel fusion methods.
Collapse
|
16
|
Yang L, Wei X, Liu F, Zhu X, Zhou F. Automatic feature learning model combining functional connectivity network and graph regularization for depression detection. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104520] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
|
17
|
Yang L, Wang Y, Zhu X, Yang X, Zheng C. A gated temporal-separable attention network for EEG-based depression recognition. Comput Biol Med 2023; 157:106782. [PMID: 36931203 DOI: 10.1016/j.compbiomed.2023.106782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 02/14/2023] [Accepted: 03/09/2023] [Indexed: 03/13/2023]
Abstract
Depression, a common mental illness worldwide, needs to be diagnosed and cured at an early stage. To assist clinical diagnosis, an EEG-based deep learning frame, which is named the gated temporal-separable attention network (GTSAN), is proposed in this paper for depression recognition. GTSAN model extracts discriminative information from EEG recordings in two ways. On the one hand, the gated recurrent unit (GRU) is used in the GTSAN model to capture the EEG historical information to form the features. On the other hand, the model digs the multilevel information by using an improved version of temporal convolutional network (TCN), called temporal-separable convolution network (TSCN), which applies causal convolution and dilated convolution to extract features from fine to coarse scales. The TSCN and GRU features can be produced in parallel. Finally, the new model introduces the attention mechanism to give different weights to these features, allowing them to be used to identify depression more effectively. Experiments on two depression datasets have demonstrated that the proposed model can mine potential depression patterns in data and obtain high recognition accuracies. The proposed model provides the possibility of using an EEG-based system to assist for diagnosing depression.
Collapse
Affiliation(s)
- Lijun Yang
- School of Mathematics and Statistics, Henan Engineering Research Center for Artificial Intelligence Theory and Algorithms, Henan University, Kaifeng 475004, China; Center for Applied Mathematics of Henan Province, Henan University, Zhengzhou, 450046, China.
| | - Yixin Wang
- School of Mathematics and Statistics, Henan Engineering Research Center for Artificial Intelligence Theory and Algorithms, Henan University, Kaifeng 475004, China.
| | - Xiangru Zhu
- Institute of Cognition, Brain, and Health, Henan University, Kaifeng 475004, China.
| | - Xiaohui Yang
- School of Mathematics and Statistics, Henan Engineering Research Center for Artificial Intelligence Theory and Algorithms, Henan University, Kaifeng 475004, China; Center for Applied Mathematics of Henan Province, Henan University, Zhengzhou, 450046, China.
| | - Chen Zheng
- School of Mathematics and Statistics, Henan Engineering Research Center for Artificial Intelligence Theory and Algorithms, Henan University, Kaifeng 475004, China; Center for Applied Mathematics of Henan Province, Henan University, Zhengzhou, 450046, China.
| |
Collapse
|
18
|
Li Y, Cao Q, Hou Z, Tang B, Shen Y. Transcranial Sonography as a Diagnostic Tool for Depressive Disorders. JOURNAL OF ULTRASOUND IN MEDICINE : OFFICIAL JOURNAL OF THE AMERICAN INSTITUTE OF ULTRASOUND IN MEDICINE 2023; 42:687-699. [PMID: 36047031 DOI: 10.1002/jum.16081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2022] [Revised: 07/19/2022] [Accepted: 07/25/2022] [Indexed: 06/15/2023]
Abstract
BACKGROUND Transcranial sonography (TCS) is an available and noninvasive neuroimaging method that has been found to reduce the echogenicity of the brainstem raphe (BR) in patients with depression. Applying the criteria of the Diagnostic and Statistical Manual of Mental Disorders-IV (DSM-IV), we performed a meta-analysis of the diagnostic accuracy of TCS. METHODS A systematic search was conducted in PubMed, EMBASE, The Cochrane Library, and Web of Science. The databases were searched from inception to December 2021. The quality of the included literature was assessed using the QUADAS-2. Heterogeneity analysis was performed. A summary receiver operating characteristic (SROC) curve was generated to evaluate the diagnostic accuracy of TCS. RESULTS We included 12 studies with 809 patients. The pooled sensitivity was 0.66 (95% confidence interval [CI]: 0.61-0.71), and the specificity was 0.84 (95% CI: 0.80-0.87). The combined positive likelihood ratio (LR) was 3.84 (95% CI: 2.68-5.51), the negative LR was 0.41 (95% CI: 0.29-0.57), and the diagnostic odds ratio (DOR) was 11.45 (95% CI: 5.57-23.02). The area under the curve (AUC) of the plotted SROC curve was 0.86 (95% CI: 0.83-0.89). The meta-regression and subgroup analyses found no source of heterogeneity. CONCLUSION TCS has high potential and efficacy in diagnosing depression and may be a reasonable test to perform clinically for the assessment of depression.
Collapse
Affiliation(s)
- Yanping Li
- Department of neuroelectrophysiology, The Second Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| | - Qian Cao
- Department of Neurology, The Second Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| | - Zhuo Hou
- Department of Neurology, The Second Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| | - Boji Tang
- Department of Neurology, The Second Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| | - Yu Shen
- Department of Neurology, The Second Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
- Department of Neurology, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| |
Collapse
|
19
|
Maniruzzaman M, Hasan MAM, Asai N, Shin J. Optimal Channels and Features Selection Based ADHD Detection From EEG Signal Using Statistical and Machine Learning Techniques. IEEE ACCESS 2023; 11:33570-33583. [DOI: 10.1109/access.2023.3264266] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2025]
Affiliation(s)
- Md. Maniruzzaman
- School of Computer Science and Engineering, The University of Aizu, Aizuwakamatsu, Japan
| | - Md. Al Mehedi Hasan
- Department of Computer Science and Engineering, Rajshahi University of Engineering and Technology, Rajshahi, Bangladesh
| | - Nobuyoshi Asai
- School of Computer Science and Engineering, The University of Aizu, Aizuwakamatsu, Japan
| | - Jungpil Shin
- School of Computer Science and Engineering, The University of Aizu, Aizuwakamatsu, Japan
| |
Collapse
|
20
|
Garg N, Garg R, Anand A, Baths V. Decoding the neural signatures of valence and arousal from portable EEG headset. Front Hum Neurosci 2022; 16:1051463. [PMID: 36561835 PMCID: PMC9764010 DOI: 10.3389/fnhum.2022.1051463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Accepted: 11/08/2022] [Indexed: 12/12/2022] Open
Abstract
Emotion classification using electroencephalography (EEG) data and machine learning techniques have been on the rise in the recent past. However, past studies use data from medical-grade EEG setups with long set-up times and environment constraints. This paper focuses on classifying emotions on the valence-arousal plane using various feature extraction, feature selection, and machine learning techniques. We evaluate different feature extraction and selection techniques and propose the optimal set of features and electrodes for emotion recognition. The images from the OASIS image dataset were used to elicit valence and arousal emotions, and the EEG data was recorded using the Emotiv Epoc X mobile EEG headset. The analysis is carried out on publicly available datasets: DEAP and DREAMER for benchmarking. We propose a novel feature ranking technique and incremental learning approach to analyze performance dependence on the number of participants. Leave-one-subject-out cross-validation was carried out to identify subject bias in emotion elicitation patterns. The importance of different electrode locations was calculated, which could be used for designing a headset for emotion recognition. The collected dataset and pipeline are also published. Our study achieved a root mean square score (RMSE) of 0.905 on DREAMER, 1.902 on DEAP, and 2.728 on our dataset for valence label and a score of 0.749 on DREAMER, 1.769 on DEAP, and 2.3 on our proposed dataset for arousal label.
Collapse
Affiliation(s)
- Nikhil Garg
- Institut Interdisciplinaire d'Innovation Technologique (3IT), Université de Sherbrooke, Sherbrooke, QC, Canada,Laboratoire Nanotechnologies Nanosystèmes (LN2)—CNRS UMI-3463, Université de Sherbrooke, Sherbrooke, QC, Canada,Institute of Electronics, Microelectronics and Nanotechnology (IEMN), Université de Lille, Lille, France
| | - Rohit Garg
- Department of Computer Science and Information Systems, BITS Pilani, K K Birla Goa Campus, Goa, India,*Correspondence: Rohit Garg
| | - Apoorv Anand
- Department of Biological Sciences, BITS Pilani, K K Birla Goa Campus, Goa, India
| | - Veeky Baths
- Department of Biological Sciences, BITS Pilani, K K Birla Goa Campus, Goa, India,Cognitive Neuroscience Lab, BITS Pilani, K K Birla Goa Campus, Goa, India,Veeky Baths
| |
Collapse
|
21
|
Zhang B, Wei D, Yan G, Lei T, Cai H, Yang Z. Feature-level fusion based on spatial-temporal of pervasive EEG for depression recognition. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022; 226:107113. [PMID: 36103735 DOI: 10.1016/j.cmpb.2022.107113] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 08/23/2022] [Accepted: 09/04/2022] [Indexed: 06/15/2023]
Abstract
BACKGROUND AND OBJECTIVE In view of the depression characteristics such as high prevalence, high disability rate, high fatality rate, and high recurrence rate, early identification and early intervention are the most effective methods to prevent irreversible damage of brain function over time. The traditional method of depression recognition based on questionnaires and interviews is time-consuming and labor-intensive, and heavily depends on the doctor's subjective experience. Therefore, accurate, convenient and effective recognition of depression has important social value and scientific significance. METHODS This paper proposes a depression recognition framework based on feature-level fusion of spatial-temporal pervasive electroencephalography (EEG). Time series EEG data were collected by portable three-electrode EEG acquisition instrument, and mapped to a spatial complex network called visibility graph (VG). Then temporal EEG features and spatial VG metric features were extracted and selected. Based on the correlation between features and categories, the differences in contribution of individual feature are explored, and different contribution coefficients are assigned to different features as the data basis of feature-level fusion to ensure the diversity of data. A cascade forest model based on three different decision forests is designed to realize the efficient depression recognition using spatial-temporal feature-level fusion data. RESULTS Experimental data were obtained from 26 depressed patients and 29 healthy controls (HC). The results of multiple control experiments show that compared with single type feature, feature-level fusion without contribution coefficient, and independent classifiers, the feature-level method with contribution coefficient of spatial-temporal has a stronger recognition ability of depression, and the highest accuracy is 92.48%. CONCLUSION Feature-level fusion method provides an effective computer-aided tool for rapid clinical diagnosis of depression.
Collapse
Affiliation(s)
- Bingtao Zhang
- School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China; School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China.
| | - Dan Wei
- School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
| | - Guanghui Yan
- School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
| | - Tao Lei
- School of Electronic Information and Artificial Intelligence, Shaanxi University of Science and Technology, Xi'an 710021, China
| | - Haishu Cai
- School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
| | - Zhifei Yang
- School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
| |
Collapse
|
22
|
Yang J, Lu H, Li C, Hu X, Hu B. Data augmentation for depression detection using skeleton-based gait information. Med Biol Eng Comput 2022; 60:2665-2679. [PMID: 35829811 DOI: 10.1007/s11517-022-02595-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 04/22/2022] [Indexed: 11/27/2022]
Abstract
In recent years, the incidence of depression is rising rapidly worldwide, but large-scale depression screening is still challenging. Gait analysis provides a non-contact, low-cost, and efficient early screening method for depression. However, the early screening of depression based on gait analysis lacks sufficient effective sample data. In this paper, we propose a skeleton data augmentation method for assessing the risk of depression. First, we propose five techniques to augment skeleton data and apply them to depression and emotion datasets. Then, we divide augmentation methods into two types (non-noise augmentation and noise augmentation) based on the mutual information and the classification accuracy. Finally, we explore which augmentation strategies can capture the characteristics of human skeleton data more effectively. Experimental results show that the augmented training dataset that retains more of the raw skeleton data properties determines the performance of the detection model. Specifically, rotation augmentation and channel mask augmentation make the depression detection accuracy reach 92.15% and 91.34%, respectively.
Collapse
Affiliation(s)
- Jingjing Yang
- School of information Science and Engineering, Lanzhou University, Lanzhou, China
| | - Haifeng Lu
- School of information Science and Engineering, Lanzhou University, Lanzhou, China
| | - Chengming Li
- School of Intelligent Systems Engineering, Sun Yat-sen University, Shenzhen, China.
| | - Xiping Hu
- School of information Science and Engineering, Lanzhou University, Lanzhou, China. .,School of Intelligent Systems Engineering, Sun Yat-sen University, Shenzhen, China.
| | - Bin Hu
- School of information Science and Engineering, Lanzhou University, Lanzhou, China. .,Institute of Engineering Medicine, Beijing Institute of Technology, Beijing, China.
| |
Collapse
|
23
|
Abstract
BACKGROUND In this modern era, depression is one of the most prevalent mental disorders from which millions of individuals are affected today. The symptoms of depression are heterogeneous and often coincide with other disorders such as bipolar disorder, Parkinson's, schizophrenia, etc. It is a serious mental illness that may lead to other health problems if left untreated. Currently, identifying individuals with depression is totally based on the expertise of the clinician's experience. In order to assist clinicians in identifying the characteristics and classifying depressed people, different types of data modalities and machine learning techniques have been incorporated by researchers in this field. This study aims to find the answers to some important questions related to the trend of publications, data modality, machine learning models, dataset usage, pre-processing techniques and feature extraction and selection techniques that are prevalent and guide the direction of future research on depression diagnosis. METHODS This systematic review was conducted using a broad range of articles from two major databases: IEEE Xplore and PubMed. Studies ranging from the years 2011 to April 2021 were retrieved from the databases resulting in a total of 590 articles (53 articles from the IEEE Xplore database and 537 articles from the PubMed database). Out of those, the articles which satisfied the defined inclusion criteria were investigated for further analysis. RESULTS A total of 135 articles were identified and analysed for this review. High growth in the number of publications has been observed in recent years. Furthermore, significant diversity in the use of data modalities and machine learning classifiers has also been noted in this study. fMRI data with an SVM classifier was found to be the most popular choice among researchers. In most of the studies, data scarcity and small sample size, particularly for neuroimaging data are major concerns. The use of identical data pre-processing tools for similar data modalities can be seen. This study also provides statistical analysis of the current framework with respect to the modality, machine learning classifier, sample size and accuracy by applying one-way ANOVA and the Tukey - Kramer test. CONCLUSION The results indicate that an effective fusion of machine learning techniques with a potential data modality has a promising future for assisting clinicians in automatic depression diagnosis.
Collapse
Affiliation(s)
- Sweta Bhadra
- Department of CS & IT, Cotton University, Guwahati, India
| | | |
Collapse
|
24
|
Kim H, Kang SH, Kim SH, Kim SH, Hwang J, Kim JG, Han K, Kim JB. Drinking coffee enhances neurocognitive function by reorganizing brain functional connectivity. Sci Rep 2021; 11:14381. [PMID: 34257387 PMCID: PMC8277884 DOI: 10.1038/s41598-021-93849-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Accepted: 07/01/2021] [Indexed: 11/08/2022] Open
Abstract
The purpose of this study was to identify the mechanisms underlying effects of coffee on cognition in the context of brain networks. Here we investigated functional connectivity before and after drinking coffee using graph-theoretic analysis of electroencephalography (EEG). Twenty-one healthy adults voluntarily participated in this study. The resting-state EEG data and results of neuropsychological tests were consecutively acquired before and 30 min after coffee consumption. Graph analyses were performed and compared before and after coffee consumption. Correlation analyses were conducted to assess the relationship between changes in graph measures and those in cognitive function tests. Functional connectivity (FC) was reorganized toward more efficient network properties after coffee consumption. Performance in Digit Span tests and Trail Making Test Part B improved after coffee consumption, and the improved performance in executive function was correlated with changes in graph measures, reflecting a shift toward efficient network properties. The beneficial effects of coffee on cognitive function might be attributed to the reorganization of FC toward more efficient network properties. Based on our findings, the patterns of network reorganization could be used as quantitative markers to elucidate the mechanisms underlying the beneficial effects of coffee on cognition, especially executive function.
Collapse
Affiliation(s)
- Hayom Kim
- Department of Neurology, Korea University Anam Hospital, Korea University College of Medicine, Seoul, Republic of Korea
| | - Sung Hoon Kang
- Department of Neurology, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea
- Department of Neurology, Korea University Guro Hospital, Korea University College of Medicine, Seoul, Republic of Korea
| | - Soon Ho Kim
- Laboratory of Computational Neurophysics, Brain Science Institute, Korea Institute of Science and Technology, Seoul, Republic of Korea
| | - Seong Hwan Kim
- Department of Neurology, Korea University Anam Hospital, Korea University College of Medicine, Seoul, Republic of Korea
| | - Jihyeon Hwang
- Department of Neurology, Korea University Anam Hospital, Korea University College of Medicine, Seoul, Republic of Korea
| | - Jae-Gyum Kim
- Department of Neurology, Korea University Anam Hospital, Korea University College of Medicine, Seoul, Republic of Korea
| | - Kyungreem Han
- Laboratory of Computational Neurophysics, Brain Science Institute, Korea Institute of Science and Technology, Seoul, Republic of Korea.
| | - Jung Bin Kim
- Department of Neurology, Korea University Anam Hospital, Korea University College of Medicine, Seoul, Republic of Korea.
| |
Collapse
|
25
|
Xie W, Liang L, Lu Y, Wang C, Shen J, Luo H, Liu X. Interpreting Depression From Question-wise Long-term Video Recording of SDS Evaluation. IEEE J Biomed Health Inform 2021; 26:865-875. [PMID: 34170837 DOI: 10.1109/jbhi.2021.3092628] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Self-Rating Depression Scale (SDS) questionnaire has frequently been used for efficient depression preliminary screening. However, the uncontrollable self-administered measure can be easily affected by insouciantly or deceptively answering, and producing the different results with the clinician-administered Hamilton Depression Rating Scale (HDRS) and the final diagnosis. Clinically, facial expression (FE) and actions play a vital role in clinician-administered evaluation, while FE and action are underexplored for self-administered evaluations. In this work, we collect a novel dataset of 200 subjects to evidence the validity of self-rating questionnaires with their corresponding question-wise video recording. To automatically interpret depression from the SDS evaluation and the paired video, we propose an end-to-end hierarchical framework for the long-term variable-length video, which is also conditioned on the questionnaire results and the answering time. Specifically, we resort to a hierarchical model which utilizes a 3D CNN for local temporal pattern exploration and a redundancy-aware self-attention (RAS) scheme for question-wise global feature aggregation. Targeting for the redundant long-term FE video processing, our RAS is able to effectively exploit the correlations of each video clip within a question set to emphasize the discriminative information and eliminate the redundancy based on feature pair-wise affinity. Then, the question-wise video feature is concatenated with the questionnaire scores for final depression detection. Our thorough evaluations also show the validity of fusing SDS evaluation and its video recording, and the superiority of our framework to the conventional state-of-the-art temporal modeling methods.
Collapse
|