1
|
Ding Z, Chen J, Zhong BL, Liu CL, Liu ZT. Emotional stimulated speech-based assisted early diagnosis of depressive disorders using personality-enhanced deep learning. J Affect Disord 2025; 376:177-188. [PMID: 39914753 DOI: 10.1016/j.jad.2025.01.136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/10/2024] [Revised: 01/15/2025] [Accepted: 01/26/2025] [Indexed: 02/11/2025]
Abstract
BACKGROUND Early diagnosis of depression is crucial, and speech-based early diagnosis of depression is promising, but insufficient data and lack of theoretical support make it difficult to be applied. Therefore, it is valuable to combine psychiatric theories, collect speech recognition data for depression, and develop a practicable recognition method for depression. METHODS In this study, 24 patients with major depressive disorders (MDDs) and 36 healthy controls (HCs) were recruited to participate in a multi-task speech experiment. Descriptive statistics and tests of variance were used to analyze subjects' personality and speech changes. Subsequently, the speech task with the most depressive cues was explored using the Bidirectional Long - Short Term Memory (Bi-LSTM) algorithm, on which a personality-assisted multitasking deep model, i.e., multi-task attentional temporal convolutional network model (TCN-MTA). RESULTS Statistical analyses of speech duration showed that the fable reading, neutral stimulus, and negative stimulus tasks had significant differences on subjects' speech duration, and the negative stimulus task had significant differences between the depressed and control groups (p < 0.001, 0.03, 0.04). Notably, the Big Five personality emotional stability scores were significantly different between the depressed and control groups (0.03). Depression was best identified using Bi-LSTM in negative (Youden index = 0.44) and positive stimulus speech (Youden index = 0.42). Further, the specificity of 0.72 and sensitivity of 0.87 for recognizing depression in negative stimulus speech using our proposed TCN-MTA outperforms existing methods. LIMITATIONS The sample size enrolled in this study is higher than the minimum sample size calculated through G-Power 3.1, but the sample size in this study is still small. CONCLUSION The proposed deep learning-based personality-assisted multitasking method could accurately recognize major depression, which demonstrated the potential of the method based on the fusion of specialized theories and artificial intelligence.
Collapse
Affiliation(s)
- Zhong Ding
- School of Education, China University of Geosciences, Lumo Road, Wuhan 430074, Hubei, China; Psychological Science and Health Research Center, China University of Geosciences, Lumo Road, Wuhan 430074, Hubei, China
| | - Jing Chen
- Wuhan Mental Health Center, Jianshe Avenue, Wuhan 430032, Hubei, China; Wuhan Hospital for Psychotherapy, Jianshe Avenue, Wuhan 430032, Hubei, China
| | - Bao-Liang Zhong
- Psychological Science and Health Research Center, China University of Geosciences, Lumo Road, Wuhan 430074, Hubei, China; Wuhan Mental Health Center, Jianshe Avenue, Wuhan 430032, Hubei, China; Wuhan Hospital for Psychotherapy, Jianshe Avenue, Wuhan 430032, Hubei, China.
| | - Chen-Ling Liu
- School of Education, China University of Geosciences, Lumo Road, Wuhan 430074, Hubei, China; Psychological Science and Health Research Center, China University of Geosciences, Lumo Road, Wuhan 430074, Hubei, China.
| | - Zhen-Tao Liu
- Psychological Science and Health Research Center, China University of Geosciences, Lumo Road, Wuhan 430074, Hubei, China; School of Automation, China University of Geosciences, Lumo Road, Wuhan 430074, Hubei, China.
| |
Collapse
|
2
|
Zhu J, Li Y, Yang C, Cai H, Li X, Hu B. Transformer-based fusion model for mild depression recognition with EEG and pupil area signals. Med Biol Eng Comput 2025:10.1007/s11517-024-03269-8. [PMID: 39909988 DOI: 10.1007/s11517-024-03269-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Accepted: 12/09/2024] [Indexed: 02/07/2025]
Abstract
Early detection and treatment are crucial for the prevention and treatment of depression; compared with major depression, current researches pay less attention to mild depression. Meanwhile, analysis of multimodal biosignals such as EEG, eye movement data, and magnetic resonance imaging provides reliable technical means for the quantitative analysis of depression. However, how to effectively capture relevant and complementary information between multimodal data so as to achieve efficient and accurate depression recognition remains a challenge. This paper proposes a novel Transformer-based fusion model using EEG and pupil area signals for mild depression recognition. We first introduce CSP into the Transformer to construct single-modal models of EEG and pupil data and then utilize attention bottleneck to construct a mid-fusion model to facilitate information exchange between the two modalities; this strategy enables the model to learn the most relevant and complementary information for each modality and only share the necessary information, which improves the model accuracy while reducing the computational cost. Experimental results show that the accuracy of the EEG and pupil area signals of single-modal models we constructed is 89.75% and 84.17%, the precision is 92.04% and 95.21%, the recall is 89.5% and 71%, the specificity is 90% and 97.33%, the F1 score is 89.41% and 78.44%, respectively, and the accuracy of mid-fusion model can reach 93.25%. Our study demonstrates that the Transformer model can learn the long-term time-dependent relationship between EEG and pupil area signals, providing an idea for designing a reliable multimodal fusion model for mild depression recognition based on EEG and pupil area signals.
Collapse
Affiliation(s)
- Jing Zhu
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, 73000, China
| | - Yuanlong Li
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, 73000, China
| | - Changlin Yang
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, 73000, China
| | - Hanshu Cai
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, 73000, China
| | - Xiaowei Li
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, 73000, China
| | - Bin Hu
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, 73000, China.
- School of Medical Technology, Beijing Institute of Technology, Beijing, China.
- CAS Center for Excellence in Brain Science and Intelligence Technology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Lanzhou, 73000, China.
- Joint Research Center for Cognitive Neurosensor Technology of Lanzhou University and Institute of Semiconductors, Chinese Academy of Sciences, Lanzhou, China.
| |
Collapse
|
3
|
Ntalampiras S, Qi W. Siamese Neural Network for Speech-Based Depression Classification and Severity Assessment. JOURNAL OF HEALTHCARE INFORMATICS RESEARCH 2024; 8:577-593. [PMID: 39463856 PMCID: PMC11499503 DOI: 10.1007/s41666-024-00175-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2024] [Revised: 09/23/2024] [Accepted: 09/23/2024] [Indexed: 10/29/2024]
Abstract
The evaluation of an individual's mental health and behavioral functioning, known as psychological assessment, is generally conducted by a mental health professional. This process aids in diagnosing mental health conditions, identifying suitable treatment options, and assessing progress during treatment. Currently, national health systems are unable to cope with the constantly growing demand for such services. To address and expedite the diagnosis process, this study suggests an AI-powered tool capable of delivering understandable predictions through the automated processing of the captured speech signals. To this end, we employed a Siamese neural network (SNN) elaborating on standardized speech representations free of domain expert knowledge. Such an SNN-based framework is able to address multiple downstream tasks using the same latent representation. Interestingly, it has been applied both for classifying speech depression as well as assessing its severity. After extensive experiments on a publicly available dataset following a standardized protocol, it is shown to significantly outperform the state of the art with respect to both tasks. Last but not least, the present solution offers interpretable predictions, while being able to meaningfully interact with the medical experts.
Collapse
Affiliation(s)
- Stavros Ntalampiras
- Department of Computer Science, University of Milan, 20135 via Celoria 18, Milan, Italy
| | - Wen Qi
- School of Future Technology, South China University of Technology, 510641 Wushan Road 381, Guangzhou, China
| |
Collapse
|
4
|
Chung KH, Chang YS, Yen WT, Lin L, Abimannan S. Depression assessment using integrated multi-featured EEG bands deep neural network models: Leveraging ensemble learning techniques. Comput Struct Biotechnol J 2024; 23:1450-1468. [PMID: 38623563 PMCID: PMC11016871 DOI: 10.1016/j.csbj.2024.03.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 03/24/2024] [Accepted: 03/25/2024] [Indexed: 04/17/2024] Open
Abstract
Mental Status Assessment (MSA) holds significant importance in psychiatry. In recent years, several studies have leveraged Electroencephalogram (EEG) technology to gauge an individual's mental state or level of depression. This study introduces a novel multi-tier ensemble learning approach to integrate multiple EEG bands for conducting mental state or depression assessments. Initially, the EEG signal is divided into eight sub-bands, and then a Long Short-Term Memory (LSTM)-based Deep Neural Network (DNN) model is trained for each band. Subsequently, the integration of multi-band EEG frequency models and the evaluation of mental state or depression level are facilitated through a two-tier ensemble learning approach based on Multiple Linear Regression (MLR). The authors conducted numerous experiments to validate the performance of the proposed method under different evaluation metrics. For clarity and conciseness, the research employs the simplest commercialized one-channel EEG sensor, positioned at FP1, to collect data from 57 subjects (49 depressed and 18 healthy subjects). The obtained results, including an accuracy of 0.897, F1-score of 0.921, precision of 0.935, negative predictive value of 0.829, recall of 0.908, specificity of 0.875, and AUC of 0.8917, provide evidence of the superior performance of the proposed method compared to other ensemble learning techniques. This method not only proves effective but also holds the potential to significantly enhance the accuracy of depression assessment.
Collapse
Affiliation(s)
- Kuo-Hsuan Chung
- Department of Psychiatry and Psychiatric Research Center, Taipei Medical University Hospital, Taipei, Taiwan
- Department of Psychiatry, School of Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan
| | - Yue-Shan Chang
- National Taipei University, Sanxia District, New Taipei City 237, Taiwan
| | - Wei-Ting Yen
- National Taipei University, Sanxia District, New Taipei City 237, Taiwan
| | - Linen Lin
- Department of Psychiatry, En Chu Kong Hospital, Taiwan
| | - Satheesh Abimannan
- Amity School of Engineering and Technology, Amity University Maharashtra, Mumbai, India
| |
Collapse
|
5
|
Gupta AK, Dhamaniya A, Gupta P. RADIANCE: Reliable and interpretable depression detection from speech using transformer. Comput Biol Med 2024; 183:109325. [PMID: 39489109 DOI: 10.1016/j.compbiomed.2024.109325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Revised: 05/18/2024] [Accepted: 09/25/2024] [Indexed: 11/05/2024]
Abstract
Depression is a common but severe mental disorder that adversely impacts the ability of an individual to function normally in their day-to-day life. A majority of depressed individuals remain undiagnosed due to factors such as social stigma and a shortage of healthcare professionals. Consequently, several Machine Learning and Deep Learning (DL) models based on speech have been proposed for automatic depression detection, with the latter generally outperforming the former. However, DL models are blackbox and offer no transparency. In contrast, healthcare professionals prefer models that provide interpretability besides being accurate. In this direction, we propose a method RADIANCE (Reliable AnD InterpretAble depressioN deteCtion transformErs). RADIANCE incorporates a novel FilterBank VIsion Transformer (FBViT) network, which provides the symptoms of depression as interpretable features. Additionally, we employ a novel loss function that handles the class imbalance issue in the datasets. It also incorporates a penalty term that addresses the hierarchy of misclassification errors. We also propose a reliability predictor based on low-level descriptors that provides a reliability score to indicate the trustworthiness of the prediction by FBViT. Furthermore, in contrast to the conventional averaging and majority pooling, RADIANCE consolidates predictions from multiple clips of the input audio by intricately weighing each prediction based on its reliability score, ensuring a more accurate overall prediction. RADIANCE outperforms the state-of-the-art depression detection methods, achieving an accuracy of 89.36%, 80.36%, and 94.44% over the DAIC-WOZ, E-DAIC, and CMDC datasets, respectively. Further, RADIANCE achieves MAE scores of 3.27 and 5.04 on the DAIC-WOZ and E-DAIC datasets, respectively.
Collapse
Affiliation(s)
- Anup Kumar Gupta
- Department of Computer Science and Engineering, Indian Institute of Technology Indore, Indore, 452020, Madhya Pradesh, India.
| | - Ashutosh Dhamaniya
- Department of Computer Science and Engineering, Indian Institute of Technology Indore, Indore, 452020, Madhya Pradesh, India.
| | - Puneet Gupta
- Department of Computer Science and Engineering, Indian Institute of Technology Indore, Indore, 452020, Madhya Pradesh, India.
| |
Collapse
|
6
|
Wang Y, Peng Y, Han M, Liu X, Niu H, Cheng J, Chang S, Liu T. GCTNet: a graph convolutional transformer network for major depressive disorder detection based on EEG signals. J Neural Eng 2024; 21:036042. [PMID: 38788706 DOI: 10.1088/1741-2552/ad5048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Accepted: 05/24/2024] [Indexed: 05/26/2024]
Abstract
Objective.Identifying major depressive disorder (MDD) using objective physiological signals has become a pressing challenge.Approach.Hence, this paper proposes a graph convolutional transformer network (GCTNet) for accurate and reliable MDD detection using electroencephalogram (EEG) signals. The developed framework integrates a residual graph convolutional network block to capture spatial information and a Transformer block to extract global temporal dynamics. Additionally, we introduce the contrastive cross-entropy (CCE) loss that combines contrastive learning to enhance the stability and discriminability of the extracted features, thereby improving classification performance.Main results. The effectiveness of the GCTNet model and CCE loss was assessed using EEG data from 41 MDD patients and 44 normal controls, in addition to a publicly available dataset. Utilizing a subject-independent data partitioning method and 10-fold cross-validation, the proposed method demonstrated significant performance, achieving an average Area Under the Curve of 0.7693 and 0.9755 across both datasets, respectively. Comparative analyses demonstrated the superiority of the GCTNet framework with CCE loss over state-of-the-art algorithms in MDD detection tasks.Significance. The proposed method offers an objective and effective approach to MDD detection, providing valuable support for clinical-assisted diagnosis.
Collapse
Affiliation(s)
- Yuwen Wang
- Beijing Advanced Innovation Center for Biomedical Engineering, School of Biological Science and Medical Engineering, Beihang University, Beijing, People's Republic of China
| | - Yudan Peng
- Beijing Advanced Innovation Center for Biomedical Engineering, School of Biological Science and Medical Engineering, Beihang University, Beijing, People's Republic of China
| | - Mingxiu Han
- Beijing Advanced Innovation Center for Biomedical Engineering, School of Biological Science and Medical Engineering, Beihang University, Beijing, People's Republic of China
| | - Xinyi Liu
- Beijing Advanced Innovation Center for Biomedical Engineering, School of Biological Science and Medical Engineering, Beihang University, Beijing, People's Republic of China
| | - Haijun Niu
- Beijing Advanced Innovation Center for Biomedical Engineering, School of Biological Science and Medical Engineering, Beihang University, Beijing, People's Republic of China
| | - Jian Cheng
- School of Computer Science and Engineering, Beihang University, Beijing, People's Republic of China
| | - Suhua Chang
- Peking University Sixth Hospital, Peking University Institute of Mental Health, NHC Key Laboratory of Mental Health (Peking University), National Clinical Research Center for Mental Disorders (Peking University Sixth Hospital), Beijing, People's Republic of China
| | - Tao Liu
- Beijing Advanced Innovation Center for Biomedical Engineering, School of Biological Science and Medical Engineering, Beihang University, Beijing, People's Republic of China
| |
Collapse
|
7
|
Chen H, Lei Y, Li R, Xia X, Cui N, Chen X, Liu J, Tang H, Zhou J, Huang Y, Tian Y, Wang X, Zhou J. Resting-state EEG dynamic functional connectivity distinguishes non-psychotic major depression, psychotic major depression and schizophrenia. Mol Psychiatry 2024; 29:1088-1098. [PMID: 38267620 DOI: 10.1038/s41380-023-02395-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Revised: 12/17/2023] [Accepted: 12/21/2023] [Indexed: 01/26/2024]
Abstract
This study aims to identify dynamic patterns within the spatiotemporal feature space that are specific to nonpsychotic major depression (NPMD), psychotic major depression (PMD), and schizophrenia (SCZ). The study also evaluates the effectiveness of machine learning algorithms based on these network manifestations in differentiating individuals with NPMD, PMD, and SCZ. A total of 579 participants were recruited, including 152 patients with NPMD, 45 patients with PMD, 185 patients with SCZ, and 197 healthy controls (HCs). A dynamic functional connectivity (DFC) approach was employed to estimate the principal FC states within each diagnostic group. Incremental proportions of data (ranging from 10% to 100%) within each diagnostic group were used for variability testing. DFC metrics, such as proportion, mean duration, and transition number, were examined among the four diagnostic groups to identify disease-related neural activity patterns. These patterns were then used to train a two-layer classifier for the four groups (HC, NPMD, PMD, and SCZ). The four principal brain states (i.e., states 1,2,3, and 4) identified by the DFC approach were highly representative within and across diagnostic groups. Between-group comparisons revealed significant differences in network metrics of state 2 and state 3, within delta, theta, and gamma frequency bands, between healthy individuals and patients in each diagnostic group (p < 0.01, FDR corrected). Moreover, the identified key dynamic network metrics achieved an accuracy of 73.1 ± 2.8% in the four-way classification of HC, NPMD, PMD, and SCZ, outperforming the static functional connectivity (SFC) approach (p < 0.001). These findings suggest that the proposed DFC approach can identify dynamic network biomarkers at the single-subject level. These biomarkers have the potential to accurately differentiate individual subjects among various diagnostic groups of psychiatric disorders or healthy controls. This work may contribute to the development of a valuable EEG-based diagnostic tool with enhanced accuracy and assistive capabilities.
Collapse
Affiliation(s)
- Hui Chen
- Department of Psychiatry, National Clinical Research Center for Mental Disorders, and National Center for Mental Disorders, The Second Xiangya Hospital of Central South University, Changsha, 410011, Hunan, China
| | - Yanqin Lei
- TeleBrain Medical Technology Co., Beijing, 100000, China
| | - Rihui Li
- Center for Cognitive and Brain Sciences, Institute of Collaborative Innovation, University of Macau, Macau S.A.R., 999078, China
- Department of Electrical and Computer Engineering, Faculty of Science and Technology, University of Macau, Macau S.A.R., 999078, China
| | - Xinxin Xia
- TeleBrain Medical Technology Co., Beijing, 100000, China
| | - Nanyi Cui
- TeleBrain Medical Technology Co., Beijing, 100000, China
| | - Xianliang Chen
- Department of Psychiatry, National Clinical Research Center for Mental Disorders, and National Center for Mental Disorders, The Second Xiangya Hospital of Central South University, Changsha, 410011, Hunan, China
| | - Jiali Liu
- Department of Psychiatry, National Clinical Research Center for Mental Disorders, and National Center for Mental Disorders, The Second Xiangya Hospital of Central South University, Changsha, 410011, Hunan, China
| | - Huajia Tang
- Department of Psychiatry, National Clinical Research Center for Mental Disorders, and National Center for Mental Disorders, The Second Xiangya Hospital of Central South University, Changsha, 410011, Hunan, China
| | - Jiawei Zhou
- Department of Psychiatry, National Clinical Research Center for Mental Disorders, and National Center for Mental Disorders, The Second Xiangya Hospital of Central South University, Changsha, 410011, Hunan, China
| | - Ying Huang
- Department of Psychiatry, National Clinical Research Center for Mental Disorders, and National Center for Mental Disorders, The Second Xiangya Hospital of Central South University, Changsha, 410011, Hunan, China
| | - Yusheng Tian
- Department of Psychiatry, National Clinical Research Center for Mental Disorders, and National Center for Mental Disorders, The Second Xiangya Hospital of Central South University, Changsha, 410011, Hunan, China
| | - Xiaoping Wang
- Department of Psychiatry, National Clinical Research Center for Mental Disorders, and National Center for Mental Disorders, The Second Xiangya Hospital of Central South University, Changsha, 410011, Hunan, China.
| | - Jiansong Zhou
- Department of Psychiatry, National Clinical Research Center for Mental Disorders, and National Center for Mental Disorders, The Second Xiangya Hospital of Central South University, Changsha, 410011, Hunan, China.
| |
Collapse
|
8
|
Jiang Z, Seyedi S, Griner E, Abbasi A, Rad AB, Kwon H, Cotes RO, Clifford GD. Multimodal Mental Health Digital Biomarker Analysis From Remote Interviews Using Facial, Vocal, Linguistic, and Cardiovascular Patterns. IEEE J Biomed Health Inform 2024; 28:1680-1691. [PMID: 38198249 PMCID: PMC10986761 DOI: 10.1109/jbhi.2024.3352075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2024]
Abstract
OBJECTIVE Psychiatric evaluation suffers from subjectivity and bias, and is hard to scale due to intensive professional training requirements. In this work, we investigated whether behavioral and physiological signals, extracted from tele-video interviews, differ in individuals with psychiatric disorders. METHODS Temporal variations in facial expression, vocal expression, linguistic expression, and cardiovascular modulation were extracted from simultaneously recorded audio and video of remote interviews. Averages, standard deviations, and Markovian process-derived statistics of these features were computed from 73 subjects. Four binary classification tasks were defined: detecting 1) any clinically-diagnosed psychiatric disorder, 2) major depressive disorder, 3) self-rated depression, and 4) self-rated anxiety. Each modality was evaluated individually and in combination. RESULTS Statistically significant feature differences were found between psychiatric and control subjects. Correlations were found between features and self-rated depression and anxiety scores. Heart rate dynamics provided the best unimodal performance with areas under the receiver-operator curve (AUROCs) of 0.68-0.75 (depending on the classification task). Combining multiple modalities provided AUROCs of 0.72-0.82. CONCLUSION Multimodal features extracted from remote interviews revealed informative characteristics of clinically diagnosed and self-rated mental health status. SIGNIFICANCE The proposed multimodal approach has the potential to facilitate scalable, remote, and low-cost assessment for low-burden automated mental health services.
Collapse
|
9
|
Hu B, Tao Y, Yang M. Detecting depression based on facial cues elicited by emotional stimuli in video. Comput Biol Med 2023; 165:107457. [PMID: 37708718 DOI: 10.1016/j.compbiomed.2023.107457] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 08/11/2023] [Accepted: 09/04/2023] [Indexed: 09/16/2023]
Abstract
Recently, depression research has received considerable attention and there is an urgent need for objective and validated methods to detect depression. Depression detection based on facial expressions may be a promising adjunct to depression detection due to its non-contact nature. Stimulated facial expressions may contain more information that is useful in detecting depression than natural facial expressions. To explore facial cues in healthy controls and depressed patients in response to different emotional stimuli, facial expressions of 62 subjects were collected while watching video stimuli, and a local face reorganization method for depression detection is proposed. The method extracts the local phase pattern features, facial action unit (AU) features and head motion features of a local face reconstructed according to facial proportions, and then fed into the classifier for classification. The classification accuracy was 76.25%, with a recall of 80.44% and a specificity of 83.21%. The results demonstrated that the negative video stimuli in the single-attribute stimulus analysis were more effective in eliciting changes in facial expressions in both healthy controls and depressed patients. Fusion of facial features under both neutral and negative stimuli was found to be useful in discriminating between healthy controls and depressed individuals. The Pearson correlation coefficient (PCC) showed that changes in the emotional stimulus paradigm were more strongly correlated with changes in subjects' facial AU when exposed to negative stimuli compared to stimuli of other attributes. These results demonstrate the feasibility of our proposed method and provide a framework for future work in assisting diagnosis.
Collapse
Affiliation(s)
- Bin Hu
- Gansu Provincial Key Laboratory of Wearable Computin, Lanzhou University, Lanzhou, 730000, Gansu, China.
| | - Yongfeng Tao
- Gansu Provincial Key Laboratory of Wearable Computin, Lanzhou University, Lanzhou, 730000, Gansu, China.
| | - Minqiang Yang
- Gansu Provincial Key Laboratory of Wearable Computin, Lanzhou University, Lanzhou, 730000, Gansu, China.
| |
Collapse
|
10
|
Jiang Z, Seyedi S, Griner E, Abbasi A, Bahrami Rad A, Kwon H, Cotes RO, Clifford GD. Multimodal mental health assessment with remote interviews using facial, vocal, linguistic, and cardiovascular patterns. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.09.11.23295212. [PMID: 37745610 PMCID: PMC10516063 DOI: 10.1101/2023.09.11.23295212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Objective The current clinical practice of psychiatric evaluation suffers from subjectivity and bias, and requires highly skilled professionals that are often unavailable or unaffordable. Objective digital biomarkers have shown the potential to address these issues. In this work, we investigated whether behavioral and physiological signals, extracted from remote interviews, provided complimentary information for assessing psychiatric disorders. Methods Time series of multimodal features were derived from four conceptual modes: facial expression, vocal expression, linguistic expression, and cardiovascular modulation. The features were extracted from simultaneously recorded audio and video of remote interviews using task-specific and foundation models. Averages, standard deviations, and hidden Markov model-derived statistics of these features were computed from 73 subjects. Four binary classification tasks were defined: detecting 1) any clinically-diagnosed psychiatric disorder, 2) major depressive disorder, 3) self-rated depression, and 4) self-rated anxiety. Each modality was evaluated individually and in combination. Results Statistically significant feature differences were found between controls and subjects with mental health conditions. Correlations were found between features and self-rated depression and anxiety scores. Visual heart rate dynamics achieved the best unimodal performance with areas under the receiver-operator curve (AUROCs) of 0.68-0.75 (depending on the classification task). Combining multiple modalities achieved AUROCs of 0.72-0.82. Features from task-specific models outperformed features from foundation models. Conclusion Multimodal features extracted from remote interviews revealed informative characteristics of clinically diagnosed and self-rated mental health status. Significance The proposed multimodal approach has the potential to facilitate objective, remote, and low-cost assessment for low-burden automated mental health services.
Collapse
|
11
|
Ma Y, Shen J, Zhao Z, Liang H, Tan Y, Liu Z, Qian K, Yang M, Hu B. What Can Facial Movements Reveal? Depression Recognition and Analysis Based on Optical Flow Using Bayesian Networks. IEEE Trans Neural Syst Rehabil Eng 2023; 31:3459-3468. [PMID: 37581961 DOI: 10.1109/tnsre.2023.3305351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/17/2023]
Abstract
Recent evidence have demonstrated that facial expressions could be a valid and important aspect for depression recognition. Although various works have been achieved in automatic depression recognition, it is a challenge to explore the inherent nuances of facial expressions that might reveal the underlying differences between depressed patients and healthy subjects under different stimuli. There is a lack of an undisturbed system that monitors depressive patients' mental states in various free-living scenarios, so this paper steps towards building a classification model where data collection, feature extraction, depression recognition and facial actions analysis are conducted to infer the differences of facial movements between depressive patients and healthy subjects. In this study, we firstly present a plan of dividing facial regions of interest to extract optical flow features of facial expressions for depression recognition. We then propose facial movements coefficients utilising discrete wavelet transformation. Specifically, Bayesian Networks equipped with construction of Pearson Correlation Coefficients based on discrete wavelet transformation is learnt, which allows for analysing movements of different facial regions. We evaluate our method on a clinically validated dataset of 30 depressed patients and 30 healthy control subjects, and experiments results obtained the accuracy and recall of 81.7%, 96.7%, respectively, outperforming other features for comparison. Most importantly, the Bayesian Networks we built on the coefficients under different stimuli may reveal some facial action patterns of depressed subjects, which have a potential to assist the automatic diagnosis of depression.
Collapse
|
12
|
Shen J, Zhang Y, Liang H, Zhao Z, Zhu K, Qian K, Dong Q, Zhang X, Hu B. Depression Recognition From EEG Signals Using an Adaptive Channel Fusion Method via Improved Focal Loss. IEEE J Biomed Health Inform 2023; 27:3234-3245. [PMID: 37037251 DOI: 10.1109/jbhi.2023.3265805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/12/2023]
Abstract
Depression is a serious and common psychiatric disease characterized by emotional and cognitive dysfunction. In addition, the rates of clinical diagnosis and treatment for depression are low. Therefore, the accurate recognition of depression is important for its effective treatment. Electroencephalogram (EEG) signals, which can objectively reflect the inner states of human brains, are regarded as promising physiological tools that can enable effective and efficient clinical depression diagnosis and recognition. However, one of the challenges regarding EEG-based depression recognition involves sufficiently optimizing the spatial information derived from the multichannel space of EEG signals. Consequently, we propose an adaptive channel fusion method via improved focal loss (FL) functions for depression recognition based on EEG signals to effectively address this challenge. In this method, we propose two improved FL functions that can enhance the separability of hard examples by upweighting their losses as optimization objectives and can optimize the channel weights by a proposed adaptive channel fusion framework. The experimental results obtained on two EEG datasets show that the developed channel fusion method can achieve improved classification performance. The learned channel weights include the individual characteristics of each EEG epoch, which can effectively optimize the spatial information of each EEG epoch via the channel fusion method. In addition, the proposed method performs better than the state-of-the-art channel fusion methods.
Collapse
|
13
|
Yang J, Zhang Z, Fu Z, Li B, Xiong P, Liu X. Cross-subject classification of depression by using multiparadigm EEG feature fusion. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023; 233:107360. [PMID: 36944276 DOI: 10.1016/j.cmpb.2023.107360] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Revised: 12/22/2022] [Accepted: 01/17/2023] [Indexed: 06/18/2023]
Abstract
BACKGROUND AND OBJECTIVE The aim of this study is to solve the non-stationarity and complexity characteristics and huge individual differences in the electroencephalogram (EEG) signals for depression classification. METHODS To address those problems, the Lempel-Ziv complexity feature matrices were extracted from the EEG signals under the two paradigms of eyes open and eyes closed in the resting state. Topographical map of brain and statistical analysis were introduced to investigate the significance of eyes open and eyes closed EEG for depression classification. To promote the classification accuracy, feature matrices from the two paradigms were fused. And linear combination and concatenation fusion methods were proposed to further reveal the underlying mechanism of improving classification accuracy. Support vector machine (SVM), K-nearest neighbor, and decision tree classifiers were employed and compared to classify depression under the eyes open, eyes closed and fused paradigm. RESULTS The classification results of 10-fold cross-validation showed that the highest average accuracy (86.58%) under a single paradigm was achieved in the eyes-open state. The multiparadigm fusion method of concatenation was better than the linear combination. The best classification result was obtained using multiparadigm feature concatenation under the SVM classifier, yielding an accuracy of 94.03%. CONCLUSION The multiparadigm feature fusion method proposed in this paper can effectively improve the accuracy of depression classification. It was proved that eyes open and eyes closed EEG have complementary information, which was benefit for the cross-subject classification of depression. It provides new ideas for depression classification in clinics.
Collapse
Affiliation(s)
- Jianli Yang
- College of Electronic Information and Engineering, Hebei University, Baoding 071002, China; Key Laboratory of Digital Medical Engineering of Heibei Province, Baoding 071002, China
| | - Zhen Zhang
- College of Electronic Information and Engineering, Hebei University, Baoding 071002, China
| | - Zhiyu Fu
- College of Electronic Information and Engineering, Hebei University, Baoding 071002, China
| | - Bing Li
- Hebei Mental Health Center, Baoding 071000, China; Hebei Key Laboratory of Mental and Behavioral Disorders Research, Baoding 071000, China
| | - Peng Xiong
- College of Electronic Information and Engineering, Hebei University, Baoding 071002, China; Key Laboratory of Digital Medical Engineering of Heibei Province, Baoding 071002, China.
| | - Xiuling Liu
- College of Electronic Information and Engineering, Hebei University, Baoding 071002, China; Key Laboratory of Digital Medical Engineering of Heibei Province, Baoding 071002, China.
| |
Collapse
|
14
|
Zhang R, Chen Y, Xu Z, Zhang L, Hu Y, Chen M. Recognition of single upper limb motor imagery tasks from EEG using multi-branch fusion convolutional neural network. Front Neurosci 2023; 17:1129049. [PMID: 36908782 PMCID: PMC9992961 DOI: 10.3389/fnins.2023.1129049] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 02/03/2023] [Indexed: 02/24/2023] Open
Abstract
Motor imagery-based brain-computer interfaces (MI-BCI) have important application values in the field of neurorehabilitation and robot control. At present, MI-BCI mostly use bilateral upper limb motor tasks, but there are relatively few studies on single upper limb MI tasks. In this work, we conducted studies on the recognition of motor imagery EEG signals of the right upper limb and proposed a multi-branch fusion convolutional neural network (MF-CNN) for learning the features of the raw EEG signals as well as the two-dimensional time-frequency maps at the same time. The dataset used in this study contained three types of motor imagery tasks: extending the arm, rotating the wrist, and grasping the object, 25 subjects were included. In the binary classification experiment between the grasping object and the arm-extending tasks, MF-CNN achieved an average classification accuracy of 78.52% and kappa value of 0.57. When all three tasks were used for classification, the accuracy and kappa value were 57.06% and 0.36, respectively. The comparison results showed that the classification performance of MF-CNN is higher than that of single CNN branch algorithms in both binary-class and three-class classification. In conclusion, MF-CNN makes full use of the time-domain and frequency-domain features of EEG, can improve the decoding accuracy of single limb motor imagery tasks, and it contributes to the application of MI-BCI in motor function rehabilitation training after stroke.
Collapse
Affiliation(s)
- Rui Zhang
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou, China
| | - Yadi Chen
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou, China
| | - Zongxin Xu
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou, China
| | - Lipeng Zhang
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou, China
| | - Yuxia Hu
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou, China
| | - Mingming Chen
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou, China
| |
Collapse
|
15
|
Zhang B, Wei D, Yan G, Lei T, Cai H, Yang Z. Feature-level fusion based on spatial-temporal of pervasive EEG for depression recognition. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022; 226:107113. [PMID: 36103735 DOI: 10.1016/j.cmpb.2022.107113] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 08/23/2022] [Accepted: 09/04/2022] [Indexed: 06/15/2023]
Abstract
BACKGROUND AND OBJECTIVE In view of the depression characteristics such as high prevalence, high disability rate, high fatality rate, and high recurrence rate, early identification and early intervention are the most effective methods to prevent irreversible damage of brain function over time. The traditional method of depression recognition based on questionnaires and interviews is time-consuming and labor-intensive, and heavily depends on the doctor's subjective experience. Therefore, accurate, convenient and effective recognition of depression has important social value and scientific significance. METHODS This paper proposes a depression recognition framework based on feature-level fusion of spatial-temporal pervasive electroencephalography (EEG). Time series EEG data were collected by portable three-electrode EEG acquisition instrument, and mapped to a spatial complex network called visibility graph (VG). Then temporal EEG features and spatial VG metric features were extracted and selected. Based on the correlation between features and categories, the differences in contribution of individual feature are explored, and different contribution coefficients are assigned to different features as the data basis of feature-level fusion to ensure the diversity of data. A cascade forest model based on three different decision forests is designed to realize the efficient depression recognition using spatial-temporal feature-level fusion data. RESULTS Experimental data were obtained from 26 depressed patients and 29 healthy controls (HC). The results of multiple control experiments show that compared with single type feature, feature-level fusion without contribution coefficient, and independent classifiers, the feature-level method with contribution coefficient of spatial-temporal has a stronger recognition ability of depression, and the highest accuracy is 92.48%. CONCLUSION Feature-level fusion method provides an effective computer-aided tool for rapid clinical diagnosis of depression.
Collapse
Affiliation(s)
- Bingtao Zhang
- School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China; School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China.
| | - Dan Wei
- School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
| | - Guanghui Yan
- School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
| | - Tao Lei
- School of Electronic Information and Artificial Intelligence, Shaanxi University of Science and Technology, Xi'an 710021, China
| | - Haishu Cai
- School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
| | - Zhifei Yang
- School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
| |
Collapse
|
16
|
Nassibi A, Papavassiliou C, Atashzar SF. Depression diagnosis using machine intelligence based on spatiospectrotemporal analysis of multi-channel EEG. Med Biol Eng Comput 2022; 60:3187-3202. [DOI: 10.1007/s11517-022-02647-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2021] [Accepted: 07/28/2022] [Indexed: 11/30/2022]
|
17
|
Zhao F, Pan H, Li N, Chen X, Zhang H, Mao N, Ren Y. High-order brain functional network for electroencephalography-based diagnosis of major depressive disorder. Front Neurosci 2022; 16:976229. [PMID: 36017184 PMCID: PMC9396245 DOI: 10.3389/fnins.2022.976229] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Accepted: 07/15/2022] [Indexed: 12/04/2022] Open
Abstract
Brain functional network (BFN) based on electroencephalography (EEG) has been widely used to diagnose brain diseases, such as major depressive disorder (MDD). However, most existing BFNs only consider the correlation between two channels, ignoring the high-level interaction among multiple channels that contain more rich information for diagnosing brain diseases. In such a sense, the BFN is called low-order BFN (LO-BFN). In order to fully explore the high-level interactive information among multiple channels of the EEG signals, a scheme for constructing a high-order BFN (HO-BFN) based on the “correlation’s correlation” strategy is proposed in this paper. Specifically, the entire EEG time series is firstly divided into multiple epochs by sliding window. For each epoch, the short-term correlation between channels is calculated to construct a LO-BFN. The correlation time series of all channel pairs are formulated by these LO-BFNs obtained from all epochs to describe the dynamic change of short-term correlation along the time. To construct HO-BFN, we cluster all correlation time series to avoid the problems caused by high dimensionality, and the correlation of the average correlation time series from different clusters is calculated to reflect the high-order correlation among multiple channels. Experimental results demonstrate the efficiency of the proposed HO-BFN in MDD identification, and its integration with the LO-BFN can further improve the recognition rate.
Collapse
Affiliation(s)
- Feng Zhao
- School of Computer Science and Technology, Shandong Technology and Business University, Yantai, China
| | - Hongxin Pan
- School of Computer Science and Technology, Shandong Technology and Business University, Yantai, China
| | - Na Li
- School of Computer Science and Technology, Shandong Technology and Business University, Yantai, China
| | - Xiaobo Chen
- School of Computer Science and Technology, Shandong Technology and Business University, Yantai, China
| | - Haicheng Zhang
- Department of Radiology, Yantai Yuhuangding Hospital, Yantai, China
| | - Ning Mao
- Department of Radiology, Yantai Yuhuangding Hospital, Yantai, China
| | - Yande Ren
- Department of Radiology, The Affiliated Hospital of Qingdao University, Qingdao, China
- *Correspondence: Yande Ren,
| |
Collapse
|
18
|
Liu B, Chang H, Peng K, Wang X. An End-to-End Depression Recognition Method Based on EEGNet. Front Psychiatry 2022; 13:864393. [PMID: 35360138 PMCID: PMC8963113 DOI: 10.3389/fpsyt.2022.864393] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 02/14/2022] [Indexed: 11/18/2022] Open
Abstract
Major depressive disorder (MDD) is a common and highly debilitating condition that threatens the health of millions of people. However, current diagnosis of depression relies on questionnaires that are highly correlated with physician experience and hence not completely objective. Electroencephalography (EEG) signals combined with deep learning techniques may be an objective approach to effective diagnosis of MDD. This study proposes an end-to-end deep learning framework for MDD diagnosis based on EEG signals. We used EEG signals from 29 healthy subjects and 24 patients with severe depression to calculate Accuracy, Precision, Recall, F1-Score, and Kappa coefficient, which were 90.98%, 91.27%, 90.59%, and 81.68%, respectively. In addition, we found that these values were highest when happy-neutral face pairs were used as stimuli for detecting depression. Compared with exiting methods for EEG-based MDD classification, ours can maintain stable model performance without re-calibration. The present results suggest that the method is highly accurate for diagnosis of MDD and can be used to develop an automatic plug-and-play EEG-based system for diagnosing depression.
Collapse
Affiliation(s)
- Bo Liu
- Department of Emergency, The Second Hospital of Shandong University, Jinan, China
| | - Hongli Chang
- School of Information Science and Engineering, Southeast University, Nanjing, China
| | - Kang Peng
- Department of Rehabilitation Medicine, Guangzhou First People's Hospital, School of Medicine, South China University of Technology, Guangzhou, China
| | - Xuenan Wang
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Mental Health Center, Shanghai Jiaotong University School of Medicine, Shanghai, China
| |
Collapse
|
19
|
Li R, Ren C, Zhang X, Hu B. A novel ensemble learning method using multiple objective particle swarm optimization for subject-independent EEG-based emotion recognition. Comput Biol Med 2022; 140:105080. [PMID: 34902609 DOI: 10.1016/j.compbiomed.2021.105080] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Revised: 11/24/2021] [Accepted: 11/24/2021] [Indexed: 12/19/2022]
Abstract
Emotion recognition is a vital but challenging step in creating passive brain-computer interface applications. In recent years, many studies on electroencephalogram (EEG)-based emotion recognition have been conducted. Ensemble learning has been widely used in emotion recognition because of its superior accuracy and generalization. In this study, we proposed a novel ensemble learning method based on multiple objective particle swarm optimization for subject-independent EEG-based emotion recognition. First, we used a 4 s sliding time window with a 2 s overlap to extract 13 different features from EEG signals and construct a feature vector. Then, we employed L1 regularization to select effective features. Second, a model selection method was applied to choose the optimal basic analysis submodels. Afterward, we proposed an ensemble operator that converts the classification results of a single model from discrete values to continuous values to better characterize the classification results. Subsequently, multiple objective particle swarm optimization was adopted to confirm the optimal parameters of the ensemble learning model. Finally, we conducted extensive experiments on two public datasets: DEAP and SEED. Considering the generalization of the model, we applied leave-one-subject-out cross-validation to evaluate the performance of the model. The experimental results demonstrate that the proposed method achieves a better recognition performance than single methods, commonly used ensemble learning methods, and state-of-the-art methods. The average accuracies for arousal and valence are 65.70% and 64.22%, respectively, on the DEAP database, and the average accuracy on the SEED database is 84.44%.
Collapse
Affiliation(s)
- Rui Li
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China.
| | - Chao Ren
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China.
| | - Xiaowei Zhang
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China.
| | - Bin Hu
- Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China.
| |
Collapse
|
20
|
Shen J, Zhang X, Huang X, Wu M, Gao J, Lu D, Ding Z, Hu B. An Optimal Channel Selection for EEG-Based Depression Detection via Kernel-Target Alignment. IEEE J Biomed Health Inform 2021; 25:2545-2556. [PMID: 33338023 DOI: 10.1109/jbhi.2020.3045718] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Depression is a mental disorder with emotional and cognitive dysfunction. The main clinical characteristic of depression is significant and persistent low mood. As reported, depression is a leading cause of disability worldwide. Moreover, the rate of recognition and treatment for depression is low. Therefore, the detection and treatment of depression are urgent. Multichannel electroencephalogram (EEG) signals, which reflect the working status of the human brain, can be used to develop an objective and promising tool for augmenting the clinical effects in the diagnosis and detection of depression. However, when a large number of EEG channels are acquired, the information redundancy and computational complexity of the EEG signals increase; thus, effective channel selection algorithms are required not only for machine learning feasibility, but also for practicality in clinical depression detection. Consequently, we propose an optimal channel selection method for EEG-based depression detection via kernel-target alignment (KTA) to effectively resolve the abovementioned issues. In this method, we consider a modified version KTA that can measure the similarity between the kernel matrix for channel selection and the target matrix as an objective function and optimize the objective function by a proposed optimal channel selection strategy. Experimental results on two EEG datasets show that channel selection can effectively increase the classification performance and that even if we rely only on a small subset of channels, the results are still acceptable. The selected channels are in line with the expected latent cortical activity patterns in depression detection. Moreover, the experimental results demonstrate that our method outperforms the state-of-the-art channel selection approaches.
Collapse
|
21
|
Xie W, Liang L, Lu Y, Wang C, Shen J, Luo H, Liu X. Interpreting Depression From Question-wise Long-term Video Recording of SDS Evaluation. IEEE J Biomed Health Inform 2021; 26:865-875. [PMID: 34170837 DOI: 10.1109/jbhi.2021.3092628] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Self-Rating Depression Scale (SDS) questionnaire has frequently been used for efficient depression preliminary screening. However, the uncontrollable self-administered measure can be easily affected by insouciantly or deceptively answering, and producing the different results with the clinician-administered Hamilton Depression Rating Scale (HDRS) and the final diagnosis. Clinically, facial expression (FE) and actions play a vital role in clinician-administered evaluation, while FE and action are underexplored for self-administered evaluations. In this work, we collect a novel dataset of 200 subjects to evidence the validity of self-rating questionnaires with their corresponding question-wise video recording. To automatically interpret depression from the SDS evaluation and the paired video, we propose an end-to-end hierarchical framework for the long-term variable-length video, which is also conditioned on the questionnaire results and the answering time. Specifically, we resort to a hierarchical model which utilizes a 3D CNN for local temporal pattern exploration and a redundancy-aware self-attention (RAS) scheme for question-wise global feature aggregation. Targeting for the redundant long-term FE video processing, our RAS is able to effectively exploit the correlations of each video clip within a question set to emphasize the discriminative information and eliminate the redundancy based on feature pair-wise affinity. Then, the question-wise video feature is concatenated with the questionnaire scores for final depression detection. Our thorough evaluations also show the validity of fusing SDS evaluation and its video recording, and the superiority of our framework to the conventional state-of-the-art temporal modeling methods.
Collapse
|
22
|
Zhang B, Yan G, Yang Z, Su Y, Wang J, Lei T. Brain Functional Networks Based on Resting-State EEG Data for Major Depressive Disorder Analysis and Classification. IEEE Trans Neural Syst Rehabil Eng 2020; 29:215-229. [PMID: 33296307 DOI: 10.1109/tnsre.2020.3043426] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
If the brain is regarded as a system, it will be one of the most complex systems in the universe. Traditional analysis and classification methods of major depressive disorder (MDD) based on electroencephalography (EEG) feature-levels often regard electrode as isolated node and ignore the correlation between them, so it's difficult to find alters of abnormal topological architecture in brain. To solve this problem, we propose a brain functional network framework for MDD of analysis and classification based on resting state EEG. The phase lag index (PLI) was calculated based on the 64-channel resting state EEG to construct the function connection matrix to reduce and avoid the volume conductor effect. Then binarization of brain function network based on small world index was realized. Statistical analyses were performed on different EEG frequency band and different brain regions. The results showed that significant alterations of brain synchronization occurred in frontal, temporal, parietal-occipital regions of left brain and temporal region of right brain. And average shortest path length and clustering coefficient in left central region of theta band and node betweenness centrality in right parietal-occipital region were significantly correlated with PHQ-9 score of MDD, which indicates these three network metrics may be served as potential biomarkers to effectively distinguish MDD from controls and the highest classification accuracy can reach 93.31%. Our findings also point out that the brain function network of MDD patients shows a random trend, and small world characteristics appears to weaken.
Collapse
|