1
|
O’Leary A, Lahey T, Lovato J, Loftness B, Douglas A, Skelton J, Cohen JG, Copeland WE, McGinnis RS, McGinnis EW. Using Wearable Digital Devices to Screen Children for Mental Health Conditions: Ethical Promises and Challenges. SENSORS (BASEL, SWITZERLAND) 2024; 24:3214. [PMID: 38794067 PMCID: PMC11125700 DOI: 10.3390/s24103214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 05/13/2024] [Accepted: 05/16/2024] [Indexed: 05/26/2024]
Abstract
In response to a burgeoning pediatric mental health epidemic, recent guidelines have instructed pediatricians to regularly screen their patients for mental health disorders with consistency and standardization. Yet, gold-standard screening surveys to evaluate mental health problems in children typically rely solely on reports given by caregivers, who tend to unintentionally under-report, and in some cases over-report, child symptomology. Digital phenotype screening tools (DPSTs), currently being developed in research settings, may help overcome reporting bias by providing objective measures of physiology and behavior to supplement child mental health screening. Prior to their implementation in pediatric practice, however, the ethical dimensions of DPSTs should be explored. Herein, we consider some promises and challenges of DPSTs under three broad categories: accuracy and bias, privacy, and accessibility and implementation. We find that DPSTs have demonstrated accuracy, may eliminate concerns regarding under- and over-reporting, and may be more accessible than gold-standard surveys. However, we also find that if DPSTs are not responsibly developed and deployed, they may be biased, raise privacy concerns, and be cost-prohibitive. To counteract these potential shortcomings, we identify ways to support the responsible and ethical development of DPSTs for clinical practice to improve mental health screening in children.
Collapse
Affiliation(s)
- Aisling O’Leary
- Department of Philosophy, Virginia Polytechnic Institute and State University, Blacksburg, VA 24060, USA;
| | - Timothy Lahey
- University of Vermont Medical Center, Burlington, VT 05401, USA; (T.L.); (A.D.)
| | - Juniper Lovato
- Complex Systems Center, University of Vermont, Burlington VT 05405, USA; (J.L.); (B.L.)
| | - Bryn Loftness
- Complex Systems Center, University of Vermont, Burlington VT 05405, USA; (J.L.); (B.L.)
| | - Antranig Douglas
- University of Vermont Medical Center, Burlington, VT 05401, USA; (T.L.); (A.D.)
| | - Joseph Skelton
- Department of Pediatrics, Wake Forest University School of Medicine, Winston-Salem 27101, NC, USA;
- Division of Public Health Sciences, Wake Forest University School of Medicine, Winston-Salem 27101, NC, USA
| | - Jenna G. Cohen
- Department of Electrical and Biomedical Engineering, University of Vermont, Burlington VT 05405, USA;
| | | | - Ryan S. McGinnis
- Department of Biomedical Engineering, Wake Forest University School of Medicine, Winston-Salem 27101, NC, USA
| | - Ellen W. McGinnis
- Department of Pediatrics, Wake Forest University School of Medicine, Winston-Salem 27101, NC, USA;
- Division of Public Health Sciences, Wake Forest University School of Medicine, Winston-Salem 27101, NC, USA
| |
Collapse
|
2
|
Saghiri MA, Vakhnovetsky J, Amanabi M, Karamifar K, Farhadi M, Amini SB, Conte M. Exploring the impact of type II diabetes mellitus on voice quality. Eur Arch Otorhinolaryngol 2024; 281:2707-2716. [PMID: 38319369 DOI: 10.1007/s00405-024-08485-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Accepted: 01/15/2024] [Indexed: 02/07/2024]
Abstract
PURPOSE This cross-sectional study aimed to investigate the potential of voice analysis as a prescreening tool for type II diabetes mellitus (T2DM) by examining the differences in voice recordings between non-diabetic and T2DM participants. METHODS 60 participants diagnosed as non-diabetic (n = 30) or T2DM (n = 30) were recruited on the basis of specific inclusion and exclusion criteria in Iran between February 2020 and September 2023. Participants were matched according to their year of birth and then placed into six age categories. Using the WhatsApp application, participants recorded the translated versions of speech elicitation tasks. Seven acoustic features [fundamental frequency, jitter, shimmer, harmonic-to-noise ratio (HNR), cepstral peak prominence (CPP), voice onset time (VOT), and formant (F1-F2)] were extracted from each recording and analyzed using Praat software. Data was analyzed with Kolmogorov-Smirnov, two-way ANOVA, post hoc Tukey, binary logistic regression, and student t tests. RESULTS The comparison between groups showed significant differences in fundamental frequency, jitter, shimmer, CPP, and HNR (p < 0.05), while there were no significant differences in formant and VOT (p > 0.05). Binary logistic regression showed that shimmer was the most significant predictor of the disease group. There was also a significant difference between diabetes status and age, in the case of CPP. CONCLUSIONS Participants with type II diabetes exhibited significant vocal variations compared to non-diabetic controls.
Collapse
Affiliation(s)
- M A Saghiri
- Biomaterial and Prosthodontics Laboratory, Department of Restorative Dentistry, Rutgers School of Dental Medicine, Rutgers Biomedical and Health Sciences, MSB C639A, 185 South Orange Avenue, Newark, NJ, 07103, USA.
- Department of Endodontics, University of the Pacific, Arthur A. Dugoni School of Dentistry, San Francisco, CA, USA.
| | - Julia Vakhnovetsky
- Sector of Innovation in Dentistry, Dr. Hajar Afsar Lajevardi Research Cluster (DHAL), Hackensack, NJ, USA
- Rutgers School of Dental Medicine, Newark, NJ, USA
- University of Michigan School of Dentistry, Ann Arbor, MI, USA
| | | | - Kasra Karamifar
- Sector of Innovation in Medicine and Dentistry, Dr. Hajar Afsar Lajevardi Research Cluster (DHAL), Hackensack, NJ, USA
| | - Maziar Farhadi
- Sector of Innovation in Medicine and Dentistry, Dr. Hajar Afsar Lajevardi Research Cluster (DHAL), Hackensack, NJ, USA
| | - Saeid B Amini
- Dr. Hajar Afsar Lajevardi Research Cluster (DHAL), Hackensack, NJ, USA
| | - Michael Conte
- Office for Clinical Affairs, Rutgers School of Dental Medicine, Newark, NJ, USA
| |
Collapse
|
3
|
Choo M, Park D, Cho M, Bae S, Kim J, Han DH. Exploring a multimodal approach for utilizing digital biomarkers for childhood mental health screening. Front Psychiatry 2024; 15:1348319. [PMID: 38666089 PMCID: PMC11043569 DOI: 10.3389/fpsyt.2024.1348319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Accepted: 03/25/2024] [Indexed: 04/28/2024] Open
Abstract
Background Depression and anxiety are prevalent mental health concerns among children and adolescents. The application of conventional assessment methods, such as survey questionnaires to children, may lead to self-reporting issues. Digital biomarkers provide extensive data, reducing bias in mental health self-reporting, and significantly influence patient screening. Our primary objectives were to accurately assess children's mental health and to investigate the feasibility of using various digital biomarkers. Methods This study included a total of 54 boys and girls aged between 7 to 11 years. Each participant's mental state was assessed using the Depression, Anxiety, and Stress Scale. Subsequently, the subjects participated in digital biomarker collection tasks. Heart rate variability (HRV) data were collected using a camera sensor. Eye-tracking data were collected through tasks displaying emotion-face stimuli. Voice data were obtained by recording the participants' voices while they engaged in free speech and description tasks. Results Depressive symptoms were positively correlated with low frequency (LF, 0.04-0.15 Hz of HRV) in HRV and negatively associated with eye-tracking variables. Anxiety symptoms had a negative correlation with high frequency (HF, 0.15-0.40 Hz of HRV) in HRV and a positive association with LF/HF. Regarding stress, eye-tracking variables indicated a positive correlation, while pNN50, which represents the proportion of NN50 (the number of pairs of successive R-R intervals differing by more than 50 milliseconds) divided by the total number of NN (R-R) intervals, exhibited a negative association. Variables identified for childhood depression included LF and the total time spent looking at a sad face. Those variables recognized for anxiety were LF/HF, heart rate (HR), and pNN50. For childhood stress, HF, LF, and Jitter showed different correlation patterns between the two grade groups. Discussion We examined the potential of multimodal biomarkers in children, identifying features linked to childhood depression, particularly LF and the Sad.TF:time. Anxiety was most effectively explained by HRV features. To explore reasons for non-replication of previous studies, we categorized participants by elementary school grades into lower grades (1st, 2nd, 3rd) and upper grades (4th, 5th, 6th). Conclusion This study confirmed the potential use of multimodal digital biomarkers for children's mental health screening, serving as foundational research.
Collapse
Affiliation(s)
| | - Doeun Park
- HCI Lab, Yonsei University, Seoul, Republic of Korea
| | - Minseo Cho
- HCI Lab, Yonsei University, Seoul, Republic of Korea
| | - Sujin Bae
- Department of Psychiatry, College of Medicine, Chung-Ang University, Seoul, Republic of Korea
| | - Jinwoo Kim
- HCI Lab, Yonsei University, Seoul, Republic of Korea
| | - Doug Hyun Han
- Department of Psychiatry, College of Medicine, Chung-Ang University, Seoul, Republic of Korea
| |
Collapse
|
4
|
Loftness BC, Halvorson-Phelan J, OLeary A, Bradshaw C, Prytherch S, Berman I, Torous J, Copeland WL, Cheney N, McGinnis RS, McGinnis EW. The ChAMP App: A Scalable mHealth Technology for Detecting Digital Phenotypes of Early Childhood Mental Health. IEEE J Biomed Health Inform 2023; PP:10.1109/JBHI.2023.3337649. [PMID: 38019617 PMCID: PMC11133764 DOI: 10.1109/jbhi.2023.3337649] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2023]
Abstract
Childhood mental health problems are common, impairing, and can become chronic if left untreated. Children are not reliable reporters of their emotional and behavioral health, and caregivers often unintentionally under- or over-report child symptoms, making assessment challenging. Objective physiological and behavioral measures of emotional and behavioral health are emerging. However, these methods typically require specialized equipment and expertise in data and sensor engineering to administer and analyze. To address this challenge, we have developed the ChAMP (Childhood Assessment and Management of digital Phenotypes) System, which includes a mobile application for collecting movement and audio data during a battery of mood induction tasks and an open-source platform for extracting digital biomarkers. As proof of principle, we present ChAMP System data from 101 children 4-8 years old, with and without diagnosed mental health disorders. Machine learning models trained on these data detect the presence of specific disorders with 70-73% balanced accuracy, with similar results to clinical thresholds on established parent-report measures (63-82% balanced accuracy). Features favored in model architectures are described using Shapley Additive Explanations (SHAP). Canonical Correlation Analysis reveals moderate to strong associations between predictors of each disorder and associated symptom severity (r = .51-.83). The open-source ChAMP System provides clinically-relevant digital biomarkers that may later complement parent-report measures of emotional and behavioral health for detecting kids with underlying mental health conditions and lowers the barrier to entry for researchers interested in exploring digital phenotyping of childhood mental health.
Collapse
|
5
|
Loftness BC, Halvorson-Phelan J, O'Leary A, Bradshaw C, Prytherch S, Berman I, Torous J, Copeland WL, Cheney N, McGinnis RS, McGinnis EW. The ChAMP App: A Scalable mHealth Technology for Detecting Digital Phenotypes of Early Childhood Mental Health. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.01.19.23284753. [PMID: 38076802 PMCID: PMC10705626 DOI: 10.1101/2023.01.19.23284753] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Childhood mental health problems are common, impairing, and can become chronic if left untreated. Children are not reliable reporters of their emotional and behavioral health, and caregivers often unintentionally under- or over-report child symptoms, making assessment challenging. Objective physiological and behavioral measures of emotional and behavioral health are emerging. However, these methods typically require specialized equipment and expertise in data and sensor engineering to administer and analyze. To address this challenge, we have developed the ChAMP (Childhood Assessment and Management of digital Phenotypes) System, which includes a mobile application for collecting movement and audio data during a battery of mood induction tasks and an open-source platform for extracting digital biomarkers. As proof of principle, we present ChAMP System data from 101 children 4-8 years old, with and without diagnosed mental health disorders. Machine learning models trained on these data detect the presence of specific disorders with 70-73% balanced accuracy, with similar results to clinical thresholds on established parent-report measures (63-82% balanced accuracy). Features favored in model architectures are described using Shapley Additive Explanations (SHAP). Canonical Correlation Analysis reveals moderate to strong associations between predictors of each disorder and associated symptom severity (r = .51-.83). The open-source ChAMP System provides clinically-relevant digital biomarkers that may later complement parent-report measures of emotional and behavioral health for detecting kids with underlying mental health conditions and lowers the barrier to entry for researchers interested in exploring digital phenotyping of childhood mental health.
Collapse
Affiliation(s)
- Bryn C Loftness
- University of Vermont's Complex Systems Center and M-Sense Research Group
| | | | | | - Carter Bradshaw
- University of Vermont Medical Center Department of Psychiatry
| | | | - Isabel Berman
- University of Vermont Medical Center Department of Psychiatry
| | - John Torous
- Digital Psychiatry Division for Beth Israel Deaconess Medical Center at Harvard Medical School
| | | | - Nick Cheney
- University of Vermont Complex Systems Center
| | | | | |
Collapse
|
6
|
Mao K, Wu Y, Chen J. A systematic review on automated clinical depression diagnosis. NPJ MENTAL HEALTH RESEARCH 2023; 2:20. [PMID: 38609509 PMCID: PMC10955993 DOI: 10.1038/s44184-023-00040-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 09/27/2023] [Indexed: 04/14/2024]
Abstract
Assessing mental health disorders and determining treatment can be difficult for a number of reasons, including access to healthcare providers. Assessments and treatments may not be continuous and can be limited by the unpredictable nature of psychiatric symptoms. Machine-learning models using data collected in a clinical setting can improve diagnosis and treatment. Studies have used speech, text, and facial expression analysis to identify depression. Still, more research is needed to address challenges such as the need for multimodality machine-learning models for clinical use. We conducted a review of studies from the past decade that utilized speech, text, and facial expression analysis to detect depression, as defined by the Diagnostic and Statistical Manual of Mental Disorders (DSM-5), using the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guideline. We provide information on the number of participants, techniques used to assess clinical outcomes, speech-eliciting tasks, machine-learning algorithms, metrics, and other important discoveries for each study. A total of 544 studies were examined, 264 of which satisfied the inclusion criteria. A database has been created containing the query results and a summary of how different features are used to detect depression. While machine learning shows its potential to enhance mental health disorder evaluations, some obstacles must be overcome, especially the requirement for more transparent machine-learning models for clinical purposes. Considering the variety of datasets, feature extraction techniques, and metrics used in this field, guidelines have been provided to collect data and train machine-learning models to guarantee reproducibility and generalizability across different contexts.
Collapse
Affiliation(s)
- Kaining Mao
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, T6G 2R3, Canada
| | - Yuqi Wu
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, T6G 2R3, Canada
| | - Jie Chen
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, T6G 2R3, Canada.
| |
Collapse
|
7
|
Diaz-Ramos RE, Noriega I, Trejo LA, Stroulia E, Cao B. Using Wearable Devices and Speech Data for Personalized Machine Learning in Early Detection of Mental Disorders: Protocol for a Participatory Research Study. JMIR Res Protoc 2023; 12:e48210. [PMID: 37955959 PMCID: PMC10682927 DOI: 10.2196/48210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2023] [Revised: 09/22/2023] [Accepted: 09/25/2023] [Indexed: 11/14/2023] Open
Abstract
BACKGROUND Early identification of mental disorder symptoms is crucial for timely treatment and reduction of recurring symptoms and disabilities. A tool to help individuals recognize warning signs is important. We posit that such a tool would have to rely on longitudinal analysis of patterns and trends in the individual's daily activities and mood, which can now be captured through data from wearable activity trackers, speech recordings from mobile devices, and the individual's own description of their mental state. In this paper, we describe such a tool developed by our team to detect early signs of depression, anxiety, and stress. OBJECTIVE This study aims to examine three questions about the effectiveness of machine learning models constructed based on multimodal data from wearables, speech, and self-reports: (1) How does speech about issues of personal context differ from speech while reading a neutral text, what type of speech data are more helpful in detecting mental health indicators, and how is the quality of the machine learning models influenced by multilanguage data? (2) Does accuracy improve with longitudinal data collection and how, and what are the most important features? and (3) How do personalized machine learning models compare against population-level models? METHODS We collect longitudinal data to aid machine learning in accurately identifying patterns of mental disorder symptoms. We developed an app that collects voice, physiological, and activity data. Physiological and activity data are provided by a variety of off-the-shelf fitness trackers, that record steps, active minutes, duration of sleeping stages (rapid eye movement, deep, and light sleep), calories consumed, distance walked, heart rate, and speed. We also collect voice recordings of users reading specific texts and answering open-ended questions chosen randomly from a set of questions without repetition. Finally, the app collects users' answers to the Depression, Anxiety, and Stress Scale. The collected data from wearable devices and voice recordings will be used to train machine learning models to predict the levels of anxiety, stress, and depression in participants. RESULTS The study is ongoing, and data collection will be completed by November 2023. We expect to recruit at least 50 participants attending 2 major universities (in Canada and Mexico) fluent in English or Spanish. The study will include participants aged between 18 and 35 years, with no communication disorders, acute neurological diseases, or history of brain damage. Data collection complied with ethical and privacy requirements. CONCLUSIONS The study aims to advance personalized machine learning for mental health; generate a data set to predict Depression, Anxiety, and Stress Scale results; and deploy a framework for early detection of depression, anxiety, and stress. Our long-term goal is to develop a noninvasive and objective method for collecting mental health data and promptly detecting mental disorder symptoms. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID) DERR1-10.2196/48210.
Collapse
Affiliation(s)
- Ramon E Diaz-Ramos
- Department of Computing Science, University of Alberta, Edmonton, AB, Canada
| | - Isabella Noriega
- School of Engineering and Sciences, Tecnologico de Monterrey, Monterrey, Mexico
| | - Luis A Trejo
- School of Engineering and Sciences, Tecnologico de Monterrey, Atizapan, Mexico
| | - Eleni Stroulia
- Department of Computing Science, University of Alberta, Edmonton, AB, Canada
| | - Bo Cao
- Department of Psychiatry, University of Alberta, Edmonton, AB, Canada
| |
Collapse
|
8
|
Lønfeldt NN, Clemmensen LKH, Pagsberg AK. A Wearable Artificial Intelligence Feedback Tool (Wrist Angel) for Treatment and Research of Obsessive Compulsive Disorder: Protocol for a Nonrandomized Pilot Study. JMIR Res Protoc 2023; 12:e45123. [PMID: 37486738 PMCID: PMC10407771 DOI: 10.2196/45123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 03/30/2023] [Accepted: 04/30/2023] [Indexed: 07/25/2023] Open
Abstract
BACKGROUND Obsessive compulsive disorder (OCD) in youth is characterized by behaviors, emotions, physiological reactions, and family interaction patterns. An essential component of therapy involves increasing awareness of the links among thoughts, emotions, behaviors, bodily sensations, and family interactions. An automatic assessment tool using physiological signals from a wearable biosensor may enable continuous symptom monitoring inside and outside of the clinic and support cognitive behavioral therapy for OCD. OBJECTIVE The primary aim of this study is to evaluate the feasibility and acceptability of using a wearable biosensor to monitor OCD symptoms. The secondary aim is to explore the feasibility of developing clinical and research tools that can detect and predict OCD-relevant internal states and interpersonal processes with the use of speech and behavioral signals. METHODS Eligibility criteria for the study include children and adolescents between 8 and 17 years of age diagnosed with OCD, controls with no psychiatric diagnoses, and one parent of the participating youths. Youths and parents wear biosensors on their wrists that measure pulse, electrodermal activity, skin temperature, and acceleration. Patients and their parents mark OCD episodes, while control youths and their parents mark youth fear episodes. Continuous, in-the-wild data collection will last for 8 weeks. Controlled experiments designed to link physiological, speech, behavioral, and biochemical signals to mental states are performed at baseline and after 8 weeks. Interpersonal interactions in the experiments are filmed and coded for behavior. The films are also processed with computer vision and for speech signals. Participants complete clinical interviews and questionnaires at baseline, and at weeks 4, 7, and 8. Feasibility criteria were set for recruitment, retention, biosensor functionality and acceptability, adherence to wearing the biosensor, and safety related to the biosensor. As a first step in learning the associations between signals and OCD-related parameters, we will use paired t tests and mixed effects models with repeated measures to assess associations between oxytocin, individual biosignal features, and outcomes such as stress-rest and case-control comparisons. RESULTS The first participant was enrolled on December 3, 2021, and recruitment closed on December 31, 2022. Nine patient dyads and nine control dyads were recruited. Sixteen participating dyads completed follow-up assessments. CONCLUSIONS The results of this study will provide preliminary evidence for the extent to which a wearable biosensor that collects physiological signals can be used to monitor OCD severity and events in youths. If we find the study to be feasible, further studies will be conducted to integrate biosensor signals output into machine learning algorithms that can provide patients, parents, and therapists with actionable insights into OCD symptoms and treatment progress. Future definitive studies will be tasked with testing the accuracy of machine learning models to detect and predict OCD episodes and classify clinical severity. TRIAL REGISTRATION ClinicalTrials.gov NCT05064527; https://clinicaltrials.gov/ct2/show/NCT05064527. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID) DERR1-10.2196/45123.
Collapse
Affiliation(s)
- Nicole Nadine Lønfeldt
- Child and Adolescent Mental Health Center, Mental Health Services CPH, Copenhagen University Hospital, Hellerup, Denmark
| | | | - Anne Katrine Pagsberg
- Child and Adolescent Mental Health Center, Mental Health Services CPH, Copenhagen University Hospital, Hellerup, Denmark
- Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
9
|
Muzik M, Menke RA, Issa M, Fisk C, Charles J, Jester JM. Evaluation of the Michigan Clinical Consultation and Care Program: An Evidence-Based Approach to Perinatal Mental Healthcare. J Clin Med 2023; 12:4836. [PMID: 37510951 PMCID: PMC10381794 DOI: 10.3390/jcm12144836] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 07/07/2023] [Accepted: 07/20/2023] [Indexed: 07/30/2023] Open
Abstract
Mood and anxiety disorders affect pregnant individuals and their families at increased rates throughout the perinatal period. Geographic, financial, and social barriers often preclude adequate diagnosis and treatment. The aim of this manuscript is to describe the consultation and care arms of the Michigan Clinical Consultation and Care (MC3) program, a statewide program designed to facilitate access to perinatal mental healthcare for OB/Gyn patients, and to describe the participants engaged in the program, examine the predictors of participant retention, and provide preliminary data regarding participants' mental health outcomes. We enrolled 209 participants to the clinical care arm, of which 48 were lost to follow-up, while 107 remained enrolled at the time of data analysis. A total of 54 participants met their treatment goals. A total of 97% of participants asserted they were satisfied with the services they received. Black race and public insurance predicted faster attrition from the care arm treatment; risks for interpersonal violence exposure and substance use were unrelated to attrition. Preliminary mental health outcomes showed significant decreases in anxiety and depression, with the most dramatic decreases in the first month of treatment. Overall, the MC3 clinical care arm shows promising rates of adherence, excellent program satisfaction, and a positive impact on perinatal mental health, supporting continued program implementation and ongoing evaluation.
Collapse
Affiliation(s)
- Maria Muzik
- Department of Psychiatry, University of Michigan-Michigan Medicine, Ann Arbor, MI 48109, USA
- Department of Obstetrics & Gynecology, University of Michigan-Michigan Medicine, Ann Arbor, MI 48109, USA
| | - Rena A Menke
- Department of Psychiatry, University of Michigan-Michigan Medicine, Ann Arbor, MI 48109, USA
| | - Meriam Issa
- Department of Psychiatry, University of Michigan-Michigan Medicine, Ann Arbor, MI 48109, USA
| | - Chelsea Fisk
- Department of Psychiatry, University of Michigan-Michigan Medicine, Ann Arbor, MI 48109, USA
| | - Jordan Charles
- Department of Psychiatry, University of Michigan-Michigan Medicine, Ann Arbor, MI 48109, USA
| | - Jennifer M Jester
- Department of Psychiatry, University of Michigan-Michigan Medicine, Ann Arbor, MI 48109, USA
| |
Collapse
|
10
|
Loftness BC, Rizzo DM, Halvorson-Phelan J, O'Leary A, Prytherch S, Bradshaw C, Brown AJ, Cheney N, McGinnis EW, McGinnis RS. Toward Digital Phenotypes of Early Childhood Mental Health via Unsupervised and Supervised Machine Learning. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2023; 2023:1-4. [PMID: 38082795 DOI: 10.1109/embc40787.2023.10340806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2023]
Abstract
Childhood mental health disorders such as anxiety, depression, and ADHD are commonly-occurring and often go undetected into adolescence or adulthood. This can lead to detrimental impacts on long-term wellbeing and quality of life. Current parent-report assessments for pre-school aged children are often biased, and thus increase the need for objective mental health screening tools. Leveraging digital tools to identify the behavioral signature of childhood mental disorders may enable increased intervention at the time with the highest chance of long-term impact. We present data from 84 participants (4-8 years old, 50% diagnosed with anxiety, depression, and/or ADHD) collected during a battery of mood induction tasks using the ChAMP System. Unsupervised Kohonen Self-Organizing Maps (SOM) constructed from movement and audio features indicate that age did not tend to explain clusters as consistently as gender within task-specific and cross-task SOMs. Symptom prevalence and diagnostic status also showed some evidence of clustering. Case studies suggest that high impairment (>80th percentile symptom counts) and diagnostic subtypes (ADHD-Combined) may account for most behaviorally distinct children. Based on this same dataset, we also present results from supervised modeling for the binary classification of diagnoses. Our top performing models yield moderate but promising results (ROC AUC .6-.82, TPR .36-.71, Accuracy .62-.86) on par with our previous efforts for isolated behavioral tasks. Enhancing features, tuning model parameters, and incorporating additional wearable sensor data will continue to enable the rapid progression towards the discovery of digital phenotypes of childhood mental health.Clinical Relevance- This work advances the use of wearables for detecting childhood mental health disorders.
Collapse
|
11
|
Wang Y, Liang L, Zhang Z, Xu X, Liu R, Fang H, Zhang R, Wei Y, Liu Z, Zhu R, Zhang X, Wang F. Fast and accurate assessment of depression based on voice acoustic features: a cross-sectional and longitudinal study. Front Psychiatry 2023; 14:1195276. [PMID: 37415683 PMCID: PMC10320390 DOI: 10.3389/fpsyt.2023.1195276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Accepted: 06/02/2023] [Indexed: 07/08/2023] Open
Abstract
Background Depression is a widespread mental disorder that affects a significant portion of the population. However, the assessment of depression is often subjective, relying on standard questions or interviews. Acoustic features have been suggested as a reliable and objective alternative for depression assessment. Therefore, in this study, we aim to identify and explore voice acoustic features that can effectively and rapidly predict the severity of depression, as well as investigate the potential correlation between specific treatment options and voice acoustic features. Methods We utilized voice acoustic features correlated with depression scores to train a prediction model based on artificial neural network. Leave-one-out cross-validation was performed to evaluate the performance of the model. We also conducted a longitudinal study to analyze the correlation between the improvement of depression and changes in voice acoustic features after an Internet-based cognitive-behavioral therapy (ICBT) program consisting of 12 sessions. Results Our study showed that the neural network model trained based on the 30 voice acoustic features significantly correlated with HAMD scores can accurately predict the severity of depression with an absolute mean error of 3.137 and a correlation coefficient of 0.684. Furthermore, four out of the 30 features significantly decreased after ICBT, indicating their potential correlation with specific treatment options and significant improvement in depression (p < 0.05). Conclusion Voice acoustic features can effectively and rapidly predict the severity of depression, providing a low-cost and efficient method for screening patients with depression on a large scale. Our study also identified potential acoustic features that may be significantly related to specific treatment options for depression.
Collapse
Affiliation(s)
- Yang Wang
- Psychology Institute, Inner Mongolia Normal University, Hohhot, Inner Mongolia, China
- Early Intervention Unit, Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
- Functional Brain Imaging Institute, Nanjing Medical University, Nanjing, China
| | - Lijuan Liang
- Early Intervention Unit, Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
- Functional Brain Imaging Institute, Nanjing Medical University, Nanjing, China
- Laboratory of Psychology, The First Affiliated Hospital of Hainan Medical University, Haikou, Hainan, China
| | - Zhongguo Zhang
- Early Intervention Unit, Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
- Functional Brain Imaging Institute, Nanjing Medical University, Nanjing, China
- The Fourth People’s Hospital of Yancheng, Yancheng, Jiangsu, China
| | - Xiao Xu
- School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, China
| | - Rongxun Liu
- Early Intervention Unit, Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
- Functional Brain Imaging Institute, Nanjing Medical University, Nanjing, China
- College of Medical Engineering, Xinxiang Medical University, Xinxiang, Henan, China
| | - Hanzheng Fang
- School of Computer Science and Engineering, Northeastern University, Shenyang, Liaoning, China
| | - Ran Zhang
- Early Intervention Unit, Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
- Functional Brain Imaging Institute, Nanjing Medical University, Nanjing, China
| | - Yange Wei
- Early Intervention Unit, Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
- Functional Brain Imaging Institute, Nanjing Medical University, Nanjing, China
| | - Zhongchun Liu
- Department of Psychiatry, Renmin Hospital of Wuhan University, Wuhan, Hubei, China
| | - Rongxin Zhu
- Early Intervention Unit, Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
- Functional Brain Imaging Institute, Nanjing Medical University, Nanjing, China
| | - Xizhe Zhang
- School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, China
| | - Fei Wang
- Early Intervention Unit, Department of Psychiatry, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
- Functional Brain Imaging Institute, Nanjing Medical University, Nanjing, China
| |
Collapse
|
12
|
Squires M, Tao X, Elangovan S, Gururajan R, Zhou X, Acharya UR, Li Y. Deep learning and machine learning in psychiatry: a survey of current progress in depression detection, diagnosis and treatment. Brain Inform 2023; 10:10. [PMID: 37093301 PMCID: PMC10123592 DOI: 10.1186/s40708-023-00188-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Accepted: 03/08/2023] [Indexed: 04/25/2023] Open
Abstract
Informatics paradigms for brain and mental health research have seen significant advances in recent years. These developments can largely be attributed to the emergence of new technologies such as machine learning, deep learning, and artificial intelligence. Data-driven methods have the potential to support mental health care by providing more precise and personalised approaches to detection, diagnosis, and treatment of depression. In particular, precision psychiatry is an emerging field that utilises advanced computational techniques to achieve a more individualised approach to mental health care. This survey provides an overview of the ways in which artificial intelligence is currently being used to support precision psychiatry. Advanced algorithms are being used to support all phases of the treatment cycle. These systems have the potential to identify individuals suffering from mental health conditions, allowing them to receive the care they need and tailor treatments to individual patients who are mostly to benefit. Additionally, unsupervised learning techniques are breaking down existing discrete diagnostic categories and highlighting the vast disease heterogeneity observed within depression diagnoses. Artificial intelligence also provides the opportunity to shift towards evidence-based treatment prescription, moving away from existing methods based on group averages. However, our analysis suggests there are several limitations currently inhibiting the progress of data-driven paradigms in care. Significantly, none of the surveyed articles demonstrate empirically improved patient outcomes over existing methods. Furthermore, greater consideration needs to be given to uncertainty quantification, model validation, constructing interdisciplinary teams of researchers, improved access to diverse data and standardised definitions within the field. Empirical validation of computer algorithms via randomised control trials which demonstrate measurable improvement to patient outcomes are the next step in progressing models to clinical implementation.
Collapse
Affiliation(s)
- Matthew Squires
- School of Mathematics, Physics and Computing, University of Southern Queensland, Toowoomba, QLD, Australia.
| | - Xiaohui Tao
- School of Mathematics, Physics and Computing, University of Southern Queensland, Toowoomba, QLD, Australia
| | | | - Raj Gururajan
- School of Business, University of Southern Queensland, Springfield, QLD, Australia
| | - Xujuan Zhou
- School of Business, University of Southern Queensland, Springfield, QLD, Australia
| | - U Rajendra Acharya
- School of Mathematics, Physics and Computing, University of Southern Queensland, Toowoomba, QLD, Australia
| | - Yuefeng Li
- School of Computer Science, Queensland University of Technology, Brisbane, QLD, Australia
| |
Collapse
|
13
|
Huang P, Chan SY, Ngoh ZM, Nadarajan R, Chong YS, Gluckman PD, Chen H, Fortier MV, Tan AP, Meaney MJ. Functional connectivity analysis of childhood depressive symptoms. Neuroimage Clin 2023; 38:103395. [PMID: 37031637 PMCID: PMC10120398 DOI: 10.1016/j.nicl.2023.103395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2022] [Revised: 03/26/2023] [Accepted: 03/27/2023] [Indexed: 04/09/2023]
Abstract
BACKGROUND Childhood depression is a highly distinct and prevalent condition with an unknown neurobiological basis. We wish to explore the resting state fMRI data in children for potential associations between neural connectivity and childhood depressive symptoms. METHODS A longitudinal birth cohort study with neuroimaging data obtained at 4.5, 6.0 and 7.5 years of age and the Children Depression Inventory 2 (CDI) administered between 8.5 and 10.5 years was used. The CDI score was used as the dependent variable and tested for correlation, both simple Pearson and network based statistic, with the functional connectivity values obtained from the resting state fMRI. Cross-validated permutation testing with a general linear model was used to validate that the identified functional connections were indeed implicated in childhood depression. RESULTS Ten functional connections and four brain regions (Somatomotor Area B, Temporoparietal Junction, Orbitofrontal Cortex and Insula) were identified as significantly associated with childhood depressive symptoms for girls at 6.0 and 7.5 years. No significant functional connections were found in girls at 4.5 years or for boys at any timepoint. Network based statistic and permutation testing confirmed these findings. CONCLUSIONS This study revealed significant sex-dependent associations of neural connectivity and childhood depressive symptoms. The regions identified are implicated in speech/language, social cognition and information integration and suggest unique pathways to childhood depressive symptoms.
Collapse
Affiliation(s)
- Pei Huang
- Singapore Institute for Clinical Sciences, Agency for Science and Technology, Singapore.
| | - Shi Yu Chan
- Singapore Institute for Clinical Sciences, Agency for Science and Technology, Singapore
| | - Zhen Ming Ngoh
- Singapore Institute for Clinical Sciences, Agency for Science and Technology, Singapore; Yong Loo Lin School of Medicine, National University of Singapore, Singapore
| | - Ranjani Nadarajan
- Department of Psychiatry, Douglas Mental Health University Institute, McGill University, Montreal, Quebec, Canada
| | - Yap Seng Chong
- Singapore Institute for Clinical Sciences, Agency for Science and Technology, Singapore; Department of Obstetrics & Gynaecology, National University Hospital Singapore, Singapore
| | - Peter D Gluckman
- Singapore Institute for Clinical Sciences, Agency for Science and Technology, Singapore; Centre for Human Evolution, Adaptation and Disease, Liggins Institute, University of Auckland, Auckland, New Zealand
| | - Helen Chen
- Department of Psychological Medicine, KK Women's and Children's Hospital, Singapore; Duke-National University of Singapore, Singapore
| | - Marielle V Fortier
- Singapore Institute for Clinical Sciences, Agency for Science and Technology, Singapore; Department of Diagnostic and Interventional Radiology, KK Women's and Children's Hospital, Singapore
| | - Ai Peng Tan
- Singapore Institute for Clinical Sciences, Agency for Science and Technology, Singapore; Department of Diagnostic Imaging, National University Hospital Singapore, Singapore
| | - Michael J Meaney
- Singapore Institute for Clinical Sciences, Agency for Science and Technology, Singapore; Yong Loo Lin School of Medicine, National University of Singapore, Singapore; Department of Psychiatry, Douglas Mental Health University Institute, McGill University, Montreal, Quebec, Canada; Brain - Body Initiative, Agency for Science and Technology, Singapore
| |
Collapse
|
14
|
Smart voice recognition based on deep learning for depression diagnosis. ARTIFICIAL LIFE AND ROBOTICS 2023. [DOI: 10.1007/s10015-023-00852-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
|
15
|
Chen Y, Ma S, Yang X, Liu D, Yang J. Screening Children's Intellectual Disabilities with Phonetic Features, Facial Phenotype and Craniofacial Variability Index. Brain Sci 2023; 13:brainsci13010155. [PMID: 36672135 PMCID: PMC9857173 DOI: 10.3390/brainsci13010155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Revised: 12/31/2022] [Accepted: 01/09/2023] [Indexed: 01/18/2023] Open
Abstract
BACKGROUND Intellectual Disability (ID) is a kind of developmental deficiency syndrome caused by congenital diseases or postnatal events. This syndrome could be intervened as soon as possible if its early screening was efficient, which may improve the condition of patients and enhance their self-care ability. The early screening of ID is always achieved by clinical interview, which needs in-depth participation of medical professionals and related medical resources. METHODS A new method for screening ID has been proposed by analyzing the facial phenotype and phonetic characteristic of young subjects. First, the geometric features of subjects' faces and phonetic features of subjects' voice are extracted from interview videos, then craniofacial variability index (CVI) is calculated with the geometric features and the risk of ID is given with the measure of CVI. Furthermore, machine learning algorithms are utilized to establish a method for further screening ID based on facial features and phonetic features. RESULTS The proposed method using three feature sets, including geometric features, CVI features and phonetic features was evaluated. The best performance of accuracy was closer to 80%. CONCLUSIONS The results using the three feature sets revealed that the proposed method may be applied in a clinical setting in the future after continuous improvement.
Collapse
Affiliation(s)
- Yuhe Chen
- School of Foreign Languages, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Simeng Ma
- Department of Psychiatry, Renmin Hospital of Wuhan University, Wuhan 430060, China
| | - Xiaoyu Yang
- Department of Pharmacy, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
- Hubei Province Clinical Research Center for Precision Medicine for Critical Illness, Wuhan 430030, China
| | - Dujuan Liu
- Department of Psychiatry, Renmin Hospital of Wuhan University, Wuhan 430060, China
- Correspondence: (D.L.); (J.Y.)
| | - Jun Yang
- School of Computer Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China
- School of Information Engineering, Wuhan University of Technology, Wuhan 430070, China
- Correspondence: (D.L.); (J.Y.)
| |
Collapse
|
16
|
Applications of Speech Analysis in Psychiatry. Harv Rev Psychiatry 2023; 31:1-13. [PMID: 36608078 DOI: 10.1097/hrp.0000000000000356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
ABSTRACT The need for objective measurement in psychiatry has stimulated interest in alternative indicators of the presence and severity of illness. Speech may offer a source of information that bridges the subjective and objective in the assessment of mental disorders. We systematically reviewed the literature for articles exploring speech analysis for psychiatric applications. The utility of speech analysis depends on how accurately speech features represent clinical symptoms within and across disorders. We identified four domains of the application of speech analysis in the literature: diagnostic classification, assessment of illness severity, prediction of onset of illness, and prognosis and treatment outcomes. We discuss the findings in each of these domains, with a focus on how types of speech features characterize different aspects of psychopathology. Models that bring together multiple speech features can distinguish speakers with psychiatric disorders from healthy controls with high accuracy. Differentiating between types of mental disorders and symptom dimensions are more complex problems that expose the transdiagnostic nature of speech features. Convergent progress in speech research and computer sciences opens avenues for implementing speech analysis to enhance objectivity of assessment in clinical practice. Application of speech analysis will need to address issues of ethics and equity, including the potential to perpetuate discriminatory bias through models that learn from clinical assessment data. Methods that mitigate bias are available and should play a key role in the implementation of speech analysis.
Collapse
|
17
|
Koops S, Brederoo SG, de Boer JN, Nadema FG, Voppel AE, Sommer IE. Speech as a Biomarker for Depression. CNS & NEUROLOGICAL DISORDERS DRUG TARGETS 2023; 22:152-160. [PMID: 34961469 DOI: 10.2174/1871527320666211213125847] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Revised: 10/10/2021] [Accepted: 10/10/2021] [Indexed: 01/01/2023]
Abstract
BACKGROUND Depression is a debilitating disorder that at present lacks a reliable biomarker to aid in diagnosis and early detection. Recent advances in computational analytic approaches have opened up new avenues in developing such a biomarker by taking advantage of the wealth of information that can be extracted from a person's speech. OBJECTIVE The current review provides an overview of the latest findings in the rapidly evolving field of computational language analysis for the detection of depression. We cover a wide range of both acoustic and content-related linguistic features, data types (i.e., spoken and written language), and data sources (i.e., lab settings, social media, and smartphone-based). We put special focus on the current methodological advances with regard to feature extraction and computational modeling techniques. Furthermore, we pay attention to potential hurdles in the implementation of automatic speech analysis. CONCLUSION Depressive speech is characterized by several anomalies, such as lower speech rate, less pitch variability and more self-referential speech. With current computational modeling techniques, such features can be used to detect depression with an accuracy of up to 91%. The performance of the models is optimized when machine learning techniques are implemented that suit the type and amount of data. Recent studies now work towards further optimization and generalizability of the computational language models to detect depression. Finally, privacy and ethical issues are of paramount importance to be addressed when automatic speech analysis techniques are further implemented in, for example, smartphones. Altogether, computational speech analysis is well underway towards becoming an effective diagnostic aid for depression.
Collapse
Affiliation(s)
- Sanne Koops
- Department of Biomedical Sciences of Cells & Systems, Cognitive Neurosciences, University of Groningen, University Medical Center Groningen (UMCG), Groningen, The Netherlands
| | - Sanne G Brederoo
- Department of Biomedical Sciences of Cells & Systems, Cognitive Neurosciences, University of Groningen, University Medical Center Groningen (UMCG), Groningen, The Netherlands
- University Center for Psychiatry, University Medical Center Groningen, Groningen, The Netherlands
| | - Janna N de Boer
- Department of Psychiatry, University Medical Center Utrecht, Utrecht University & Brain Center Rudolf Magnus, Utrecht, The Netherlands
| | - Femke G Nadema
- Department of Biomedical Sciences of Cells & Systems, Cognitive Neurosciences, University of Groningen, University Medical Center Groningen (UMCG), Groningen, The Netherlands
| | - Alban E Voppel
- Department of Biomedical Sciences of Cells & Systems, Cognitive Neurosciences, University of Groningen, University Medical Center Groningen (UMCG), Groningen, The Netherlands
| | - Iris E Sommer
- Department of Biomedical Sciences of Cells & Systems, Cognitive Neurosciences, University of Groningen, University Medical Center Groningen (UMCG), Groningen, The Netherlands
| |
Collapse
|
18
|
Soroski T, da Cunha Vasco T, Newton-Mason S, Granby S, Lewis C, Harisinghani A, Rizzo M, Conati C, Murray G, Carenini G, Field TS, Jang H. Evaluating Web-Based Automatic Transcription for Alzheimer Speech Data: Transcript Comparison and Machine Learning Analysis. JMIR Aging 2022; 5:e33460. [PMID: 36129754 PMCID: PMC9536526 DOI: 10.2196/33460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Revised: 07/11/2022] [Accepted: 07/23/2022] [Indexed: 11/16/2022] Open
Abstract
Background Speech data for medical research can be collected noninvasively and in large volumes. Speech analysis has shown promise in diagnosing neurodegenerative disease. To effectively leverage speech data, transcription is important, as there is valuable information contained in lexical content. Manual transcription, while highly accurate, limits the potential scalability and cost savings associated with language-based screening. Objective To better understand the use of automatic transcription for classification of neurodegenerative disease, namely, Alzheimer disease (AD), mild cognitive impairment (MCI), or subjective memory complaints (SMC) versus healthy controls, we compared automatically generated transcripts against transcripts that went through manual correction. Methods We recruited individuals from a memory clinic (“patients”) with a diagnosis of mild-to-moderate AD, (n=44, 30%), MCI (n=20, 13%), SMC (n=8, 5%), as well as healthy controls (n=77, 52%) living in the community. Participants were asked to describe a standardized picture, read a paragraph, and recall a pleasant life experience. We compared transcripts generated using Google speech-to-text software to manually verified transcripts by examining transcription confidence scores, transcription error rates, and machine learning classification accuracy. For the classification tasks, logistic regression, Gaussian naive Bayes, and random forests were used. Results The transcription software showed higher confidence scores (P<.001) and lower error rates (P>.05) for speech from healthy controls compared with patients. Classification models using human-verified transcripts significantly (P<.001) outperformed automatically generated transcript models for both spontaneous speech tasks. This comparison showed no difference in the reading task. Manually adding pauses to transcripts had no impact on classification performance. However, manually correcting both spontaneous speech tasks led to significantly higher performances in the machine learning models. Conclusions We found that automatically transcribed speech data could be used to distinguish patients with a diagnosis of AD, MCI, or SMC from controls. We recommend a human verification step to improve the performance of automatic transcripts, especially for spontaneous tasks. Moreover, human verification can focus on correcting errors and adding punctuation to transcripts. However, manual addition of pauses is not needed, which can simplify the human verification step to more efficiently process large volumes of speech data.
Collapse
Affiliation(s)
- Thomas Soroski
- Vancouver Stroke Program and Division of Neurology, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Thiago da Cunha Vasco
- Department of Computer Science, Faculty of Science, University of British Columbia, Vancouver, BC, Canada
| | - Sally Newton-Mason
- Vancouver Stroke Program and Division of Neurology, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Saffrin Granby
- Department of Computer Science, Faculty of Science, University of British Columbia, Vancouver, BC, Canada
| | - Caitlin Lewis
- Vancouver Stroke Program and Division of Neurology, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Anuj Harisinghani
- Department of Computer Science, Faculty of Science, University of British Columbia, Vancouver, BC, Canada
| | - Matteo Rizzo
- Department of Computer Science, Faculty of Science, University of British Columbia, Vancouver, BC, Canada
| | - Cristina Conati
- Department of Computer Science, Faculty of Science, University of British Columbia, Vancouver, BC, Canada
| | - Gabriel Murray
- School of Computing, University of the Fraser Valley, Abbotsford, BC, Canada
| | - Giuseppe Carenini
- Department of Computer Science, Faculty of Science, University of British Columbia, Vancouver, BC, Canada
| | - Thalia S Field
- Vancouver Stroke Program and Division of Neurology, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Hyeju Jang
- Department of Computer Science, Faculty of Science, University of British Columbia, Vancouver, BC, Canada
| |
Collapse
|
19
|
Teferra BG, Borwein S, DeSouza DD, Simpson W, Rheault L, Rose J. Acoustic and Linguistic Features of Impromptu Speech and Their Association With Anxiety: Validation Study. JMIR Ment Health 2022; 9:e36828. [PMID: 35802401 PMCID: PMC9308078 DOI: 10.2196/36828] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 04/27/2022] [Accepted: 05/23/2022] [Indexed: 01/26/2023] Open
Abstract
BACKGROUND The measurement and monitoring of generalized anxiety disorder requires frequent interaction with psychiatrists or psychologists. Access to mental health professionals is often difficult because of high costs or insufficient availability. The ability to assess generalized anxiety disorder passively and at frequent intervals could be a useful complement to conventional treatment and help with relapse monitoring. Prior work suggests that higher anxiety levels are associated with features of human speech. As such, monitoring speech using personal smartphones or other wearable devices may be a means to achieve passive anxiety monitoring. OBJECTIVE This study aims to validate the association of previously suggested acoustic and linguistic features of speech with anxiety severity. METHODS A large number of participants (n=2000) were recruited and participated in a single web-based study session. Participants completed the Generalized Anxiety Disorder 7-item scale assessment and provided an impromptu speech sample in response to a modified version of the Trier Social Stress Test. Acoustic and linguistic speech features were a priori selected based on the existing speech and anxiety literature, along with related features. Associations between speech features and anxiety levels were assessed using age and personal income as covariates. RESULTS Word count and speaking duration were negatively correlated with anxiety scores (r=-0.12; P<.001), indicating that participants with higher anxiety scores spoke less. Several acoustic features were also significantly (P<.05) associated with anxiety, including the mel-frequency cepstral coefficients, linear prediction cepstral coefficients, shimmer, fundamental frequency, and first formant. In contrast to previous literature, second and third formant, jitter, and zero crossing rate for the z score of the power spectral density acoustic features were not significantly associated with anxiety. Linguistic features, including negative-emotion words, were also associated with anxiety (r=0.10; P<.001). In addition, some linguistic relationships were sex dependent. For example, the count of words related to power was positively associated with anxiety in women (r=0.07; P=.03), whereas it was negatively associated with anxiety in men (r=-0.09; P=.01). CONCLUSIONS Both acoustic and linguistic speech measures are associated with anxiety scores. The amount of speech, acoustic quality of speech, and gender-specific linguistic characteristics of speech may be useful as part of a system to screen for anxiety, detect relapse, or monitor treatment.
Collapse
Affiliation(s)
- Bazen Gashaw Teferra
- The Edward S Rogers Sr Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON, Canada
| | - Sophie Borwein
- School of Public Policy, Simon Fraser University, Vancouver, BC, Canada
| | | | - William Simpson
- Winterlight Labs, Toronto, ON, Canada.,Department of Psychiatry and Behavioural Neurosciences, McMaster University, Hamilton, ON, Canada
| | - Ludovic Rheault
- Department of Political Science, Munk School of Global Affairs and Public Policy, University of Toronto, Toronto, ON, Canada
| | - Jonathan Rose
- The Edward S Rogers Sr Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
20
|
Matsushita FY, Krebs VLJ, Carvalho WBD. Artificial intelligence and machine learning in pediatrics and neonatology healthcare. Rev Assoc Med Bras (1992) 2022; 68:745-750. [PMID: 35766685 PMCID: PMC9575899 DOI: 10.1590/1806-9282.20220177] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 02/09/2022] [Indexed: 11/23/2022] Open
Affiliation(s)
- Felipe Yu Matsushita
- Universidade de São Paulo, Faculty of Medicine, Department of Pediatrics - São Paulo (SP), Brazil
| | - Vera Lucia Jornada Krebs
- Universidade de São Paulo, Faculty of Medicine, Department of Pediatrics - São Paulo (SP), Brazil
| | | |
Collapse
|
21
|
Loftness BC, Halvorson-Phelan J, O'Leary A, Cheney N, McGinnis EW, McGinnis RS. UVM KID Study: Identifying Multimodal Features and Optimizing Wearable Instrumentation to Detect Child Anxiety. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2022; 2022:1141-1144. [PMID: 36085630 DOI: 10.1109/embc48229.2022.9871090] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Anxiety and depression, collectively known as internalizing disorders, begin as early as the preschool years and impact nearly 1 out of every 5 children. Left undiagnosed and untreated, childhood internalizing disorders predict later health problems including substance abuse, development of comorbid psychopathology, increased risk for suicide, and substantial functional impairment. Current diagnostic procedures require access to clinical experts, take considerable time to complete, and inherently assume that child symptoms are observable by caregivers. Multi-modal wearable sensors may enable development of rapid point-of-care diagnostics that address these challenges. Building on our prior work, here we present an assessment battery for the development of a digital phenotype for internalizing disorders in young children and an early feasibility case study of multi-modal wearable sensor data from two participants, one of whom has been clinically diagnosed with an internalizing disorder. Results lend support that sacral movement responses and R-R interval during a short stress-induction task may facilitate child diagnosis. Multi-modal sensors measuring movement and surface biopotentials of the chest and trapezius are also shown to have significant redundancy, introducing the potential for sensor optimization moving forward. Future work aims to further optimize sensor placement, signals, features, and assessments to enable deployment in clinical practice. Clinical Relevance- This work considers the development and optimization of technologies for improving the identification of children with internalizing disorders.
Collapse
|
22
|
Advancing Digital Medicine with Wearables in the Wild. SENSORS 2022; 22:s22124576. [PMID: 35746358 PMCID: PMC9227612 DOI: 10.3390/s22124576] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 06/15/2022] [Indexed: 02/04/2023]
|
23
|
Thakre TP, Kulkarni H, Adams KS, Mischel R, Hayes R, Pandurangi A. Polysomnographic identification of anxiety and depression using deep learning. J Psychiatr Res 2022; 150:54-63. [PMID: 35358832 DOI: 10.1016/j.jpsychires.2022.03.027] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 03/10/2022] [Accepted: 03/21/2022] [Indexed: 10/18/2022]
Abstract
Anxiety and depression are common psychiatric conditions associated with significant morbidity and healthcare costs. Sleep is an evolutionarily conserved health state. Anxiety and depression have a bidirectional relationship with sleep. This study reports on the use of analysis of polysomnographic data using deep learning methods to detect the presence of anxiety and depression. Polysomnography data on 940 patients performed at an academic sleep center during the 3-year period from 01/01/2016 to 12/31/2018 were identified for analysis. The data were divided into 3 subgroups: 205 patients with Anxiety/Depression, 349 patients with no Anxiety/Depression, and 386 patients with likely Anxiety/Depression. The first two subgroups were used for training and testing of the deep learning algorithm, and the third subgroup was used for external validation of the resulting model. Hypnograms were constructed via automatic sleep staging, with the 12-channel PSG data being transformed into three-channel RGB (red, green, blue channels) images for analysis. Composite patient images were generated and utilized for training the Xception model, which provided a validation set accuracy of 0.9782 on the ninth training epoch. In the independent test set, the model achieved a high accuracy (0.9688), precision (0.9533), recall (0.9630), and F1-score (0.9581). Classification performance of most other mainstream deep learning models was comparable. These findings suggest that machine learning techniques have the potential to accurately detect the presence of anxiety and depression from analysis of sleep study data. Further studies are needed to explore the utility of these techniques in the field of psychiatry.
Collapse
Affiliation(s)
- Tushar P Thakre
- Department of Psychiatry, Virginia Commonwealth University School of Medicine, Richmond, VA, USA; Center for Sleep Medicine, Virginia Commonwealth University Health, Richmond, VA, USA
| | | | - Katie S Adams
- Department of Psychiatry, Virginia Commonwealth University School of Medicine, Richmond, VA, USA; Department of Pharmacy Services, Virginia Commonwealth University Health, Richmond, VA, USA
| | - Ryan Mischel
- Department of Psychiatry, Washington University at St. Louis School of Medicine, St. Louis, MO, USA
| | - Ronnie Hayes
- Center for Sleep Medicine, Virginia Commonwealth University Health, Richmond, VA, USA
| | - Ananda Pandurangi
- Department of Psychiatry, Virginia Commonwealth University School of Medicine, Richmond, VA, USA
| |
Collapse
|
24
|
Teferra BG, Borwein S, DeSouza DD, Rose J. Screening for Generalized Anxiety Disorder from Acoustic and Linguistic Features of Impromptu Speech: Prediction Model Evaluation Study (Preprint). JMIR Form Res 2022; 6:e39998. [PMID: 36306165 PMCID: PMC9652731 DOI: 10.2196/39998] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Revised: 09/29/2022] [Accepted: 09/30/2022] [Indexed: 11/23/2022] Open
Abstract
Background Frequent interaction with mental health professionals is required to screen, diagnose, and track mental health disorders. However, high costs and insufficient access can make frequent interactions difficult. The ability to assess a mental health disorder passively and at frequent intervals could be a useful complement to the conventional treatment. It may be possible to passively assess clinical symptoms with high frequency by characterizing speech alterations collected using personal smartphones or other wearable devices. The association between speech features and mental health disorders can be leveraged as an objective screening tool. Objective This study aimed to evaluate the performance of a model that predicts the presence of generalized anxiety disorder (GAD) from acoustic and linguistic features of impromptu speech on a larger and more generalizable scale than prior studies did. Methods A total of 2000 participants were recruited, and they participated in a single web-based session. They completed the Generalized Anxiety Disorder-7 item scale assessment and provided an impromptu speech sample in response to a modified version of the Trier Social Stress Test. We used the linguistic and acoustic features that were found to be associated with anxiety disorders in previous studies along with demographic information to predict whether participants fell above or below the screening threshold for GAD based on the Generalized Anxiety Disorder-7 item scale threshold of 10. Separate models for each sex were also evaluated. We reported the mean area under the receiver operating characteristic (AUROC) from a repeated 5-fold cross-validation to evaluate the performance of the models. Results A logistic regression model using only acoustic and linguistic speech features achieved a significantly greater prediction accuracy than a random model did (mean AUROC 0.57, SD 0.03; P<.001). When separately assessing samples from female participants, we observed a mean AUROC of 0.55 (SD 0.05; P=.01). The model constructed from the samples from male participants achieved a mean AUROC of 0.57 (SD 0.07; P=.002). The mean AUROC increased to 0.62 (SD 0.03; P<.001) on the all-sample data set when demographic information (age, sex, and income) was included, indicating the importance of demographics when screening for anxiety disorders. The performance also increased for the female sample to a mean of 0.62 (SD 0.04; P<.001) when using demographic information (age and income). An increase in performance was not observed when demographic information was added to the model constructed from the male samples. Conclusions A logistic regression model using acoustic and linguistic speech features, which have been suggested to be associated with anxiety disorders in prior studies, can achieve above-random accuracy for predicting GAD. Importantly, the addition of basic demographic variables further improves model performance, suggesting a role for speech and demographic information to be used as automated, objective screeners of GAD.
Collapse
Affiliation(s)
- Bazen Gashaw Teferra
- The Edward S Rogers Sr Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON, Canada
| | - Sophie Borwein
- School of Public Policy, Simon Fraser University, Vancouver, BC, Canada
| | - Danielle D DeSouza
- Winterlight Labs, Toronto, ON, Canada
- Department of Neurology and Neurological Sciences, Stanford University, Palo Alto, CA, United States
| | - Jonathan Rose
- The Edward S Rogers Sr Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON, Canada
- The Centre for Addiction and Mental Health, Toronto, ON, Canada
| |
Collapse
|
25
|
Clemmensen LKH, Lønfeldt N, Das S, Lund NL, Uhre V, Mora-Jensen AC, Pretzman L, Uhre CF, Ritter M, Korsbjerg NLJ, Hagstrøm J, Thoustrup CL, Clemmensen I, Plessen KJ, Pagsberg A. Associations Between OCD Severity and Focal Features in Children and Adolescents: A Statistical and Machine Learning Analysis Plan (Preprint). JMIR Res Protoc 2022; 11:e39613. [DOI: 10.2196/39613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 07/27/2022] [Accepted: 07/30/2022] [Indexed: 11/13/2022] Open
|
26
|
Abstract
BACKGROUND In this modern era, depression is one of the most prevalent mental disorders from which millions of individuals are affected today. The symptoms of depression are heterogeneous and often coincide with other disorders such as bipolar disorder, Parkinson's, schizophrenia, etc. It is a serious mental illness that may lead to other health problems if left untreated. Currently, identifying individuals with depression is totally based on the expertise of the clinician's experience. In order to assist clinicians in identifying the characteristics and classifying depressed people, different types of data modalities and machine learning techniques have been incorporated by researchers in this field. This study aims to find the answers to some important questions related to the trend of publications, data modality, machine learning models, dataset usage, pre-processing techniques and feature extraction and selection techniques that are prevalent and guide the direction of future research on depression diagnosis. METHODS This systematic review was conducted using a broad range of articles from two major databases: IEEE Xplore and PubMed. Studies ranging from the years 2011 to April 2021 were retrieved from the databases resulting in a total of 590 articles (53 articles from the IEEE Xplore database and 537 articles from the PubMed database). Out of those, the articles which satisfied the defined inclusion criteria were investigated for further analysis. RESULTS A total of 135 articles were identified and analysed for this review. High growth in the number of publications has been observed in recent years. Furthermore, significant diversity in the use of data modalities and machine learning classifiers has also been noted in this study. fMRI data with an SVM classifier was found to be the most popular choice among researchers. In most of the studies, data scarcity and small sample size, particularly for neuroimaging data are major concerns. The use of identical data pre-processing tools for similar data modalities can be seen. This study also provides statistical analysis of the current framework with respect to the modality, machine learning classifier, sample size and accuracy by applying one-way ANOVA and the Tukey - Kramer test. CONCLUSION The results indicate that an effective fusion of machine learning techniques with a potential data modality has a promising future for assisting clinicians in automatic depression diagnosis.
Collapse
Affiliation(s)
- Sweta Bhadra
- Department of CS & IT, Cotton University, Guwahati, India
| | | |
Collapse
|
27
|
Do papers (really) match journals’ “aims and scope”? A computational assessment of innovation studies. Scientometrics 2022. [DOI: 10.1007/s11192-022-04327-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
28
|
Moragrega I, Bridler R, Mohr C, Possenti M, Rochat D, Parramon JS, Stassen HH. Monitoring the effects of therapeutic interventions in depression through self-assessments. RESEARCH IN PSYCHOTHERAPY (MILANO) 2021; 24:548. [PMID: 35047425 PMCID: PMC8715262 DOI: 10.4081/ripppo.2021.548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Accepted: 09/07/2021] [Indexed: 11/30/2022]
Abstract
The treatment of major psychiatric disorders is an arduous and thorny path for the patients concerned, characterized by polypharmacy, massive adverse side effects, modest prospects of success, and constantly declining response rates. The more important is the early detection of psychiatric disorders prior to the development of clinically relevant symptoms, so that people can benefit from early interventions. A well-proven approach to monitoring mental health relies on voice analysis. This method has been successfully used with psychiatric patients to 'objectively' document the progress of improvement or the onset of relapse. The studies with psychiatric patients over 2-4 weeks demonstrated that daily voice assessments have a notable therapeutic effect in themselves. Therefore, daily voice assessments appear to be a lowthreshold form of therapeutic means that may be realized through self-assessments. To evaluate performance and reliability of this approach, we have carried out a longitudinal study on 82 university students in 3 different countries with daily assessments over 2 weeks. The sample included 41 males (mean age 24.2±3.83 years) and 41 females (mean age 21.6±2.05 years). Unlike other research in the field, this study was not concerned with the classification of individuals in terms of diagnostic categories. The focus lay on the monitoring aspect and the extent to which the effects of therapeutic interventions or of behavioural changes are visible in the results of self-assessment voice analyses. The test persons showed an over-proportionally good adherence to the daily voice analysis scheme. The accumulated data were of generally high quality: sufficiently high signal levels, a very limited number of movement artifacts, and little to no interfering background noise. The method was sufficiently sensitive to detect: i) habituation effects when test persons became used to the daily procedure; and ii) short-term fluctuations that exceeded prespecified thresholds and reached significance. Results are directly interpretable and provide information about what is going well, what is going less well, and where there is a need for action. The proposed self-assessment approach was found to be well-suited to serve as a health-monitoring tool for subjects with an elevated vulnerability to psychiatric disorders or to stress-induced mental health problems. Daily voice assessments are in fact a low-threshold form of therapeutic means that can be realized through selfassessments, that requires only little effort, can be carried out in the test person's own home, and has the potential to strengthen resilience and to induce positive behavioural changes.
Collapse
Affiliation(s)
- Ines Moragrega
- Department of Psychobiology, University of Valencia, Valencia, Spain
| | | | - Christine Mohr
- Department of Psychology, University of Lausanne, Lausanne, Switzerland
| | - Michela Possenti
- Department of Psychology, University of Milano Bicocca, Milano, Italy
| | - Deborah Rochat
- Department of Psychology, University of Lausanne, Lausanne, Switzerland
| | | | - Hans H. Stassen
- Institute for Response-Genetics, Department of Psychiatry, Psychotherapy and Psychosomatics, Psychiatric University Hospital, Zurich, Switzerland
| |
Collapse
|
29
|
Klangpornkun N, Ruangritchai M, Munthuli A, Onsuwan C, Jaisin K, Pattanaseri K, Lortrakul J, Thanakulakkarachai P, Anansiripinyo T, Amornlaksananon A, Laohawee S, Tantibundhit C. Classification of Depression and Other Psychiatric Conditions Using Speech Features Extracted from a Thai Psychiatric and Verbal Screening Test. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2021; 2021:651-656. [PMID: 34891377 DOI: 10.1109/embc46164.2021.9629571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Depression is a common and serious mental illness which negatively affects daily functioning. To prevent the progression of the illness into severe or long-term consequences, early diagnosis is crucial. We developed an automated speech feature analysis application for depression and other psychiatric disorders derived from a developed Thai psychiatric and verbal screening test. The screening test includes Thai's version of Patient Health Questionnaire-9 (PHQ-9) and Hamilton Depression Rating Scale (HAM-D), and 32 additional emotion-induced questions. Case-control study was conducted on speech features from 66 participants. Twenty seven of those had depression (DP), 12 had other psychiatric disorders (OP), and 27 were normal controls (NC). The five-fold cross-validation from 6 settings of 5 classifiers with the combination of PHQ-9 and HAM-D scores, and speech features were examined. Results showed highest performance from the multilayer perceptron (MLP) classifier which yielded 83.33% sensitivity, 91.67% specificity, and 83.33% accuracy, where negative-emotional questions were most effective in classification. The automated speech feature analysis showed promising results for screening patients with depression or other psychiatric disorders. The current application is accessible through smartphone, making it a feasible and intuitive setup for low-resource countries such as Thailand.
Collapse
|
30
|
Richter T, Fishbain B, Richter-Levin G, Okon-Singer H. Machine Learning-Based Behavioral Diagnostic Tools for Depression: Advances, Challenges, and Future Directions. J Pers Med 2021; 11:jpm11100957. [PMID: 34683098 PMCID: PMC8537335 DOI: 10.3390/jpm11100957] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 09/12/2021] [Accepted: 09/21/2021] [Indexed: 01/05/2023] Open
Abstract
The psychiatric diagnostic procedure is currently based on self-reports that are subject to personal biases. Therefore, the diagnostic process would benefit greatly from data-driven tools that can enhance accuracy and specificity. In recent years, many studies have achieved promising results in detecting and diagnosing depression based on machine learning (ML) analysis. Despite these favorable results in depression diagnosis, which are primarily based on ML analysis of neuroimaging data, most patients do not have access to neuroimaging tools. Hence, objective assessment tools are needed that can be easily integrated into the routine psychiatric diagnostic process. One solution is to use behavioral data, which can be easily collected while still maintaining objectivity. The current paper summarizes the main ML-based approaches that use behavioral data in diagnosing depression and other psychiatric disorders. We classified these studies into two main categories: (a) laboratory-based assessments and (b) data mining, the latter of which we further divided into two sub-groups: (i) social media usage and movement sensors data and (ii) demographic and clinical information. The paper discusses the advantages and challenges in this field and suggests future research directions and implementations. The paper's overarching aim is to serve as a first step in synthetizing existing knowledge about ML-based behavioral diagnosis studies in order to develop interventions and individually tailored treatments in the future.
Collapse
Affiliation(s)
- Thalia Richter
- Department of Psychology, School of Psychological Sciences, University of Haifa, Haifa 3498838, Israel; (G.R.-L.); (H.O.-S.)
- Correspondence:
| | - Barak Fishbain
- Faculty of Civil and Environmental Engineering, Technion—Israel Institute of Technology, Haifa 3200003, Israel;
| | - Gal Richter-Levin
- Department of Psychology, School of Psychological Sciences, University of Haifa, Haifa 3498838, Israel; (G.R.-L.); (H.O.-S.)
| | - Hadas Okon-Singer
- Department of Psychology, School of Psychological Sciences, University of Haifa, Haifa 3498838, Israel; (G.R.-L.); (H.O.-S.)
| |
Collapse
|
31
|
McGinnis EW, Scism J, Hruschak J, Muzik M, Rosenblum KL, Fitzgerald K, Copeland W, McGinnis RS. Digital Phenotype for Childhood Internalizing Disorders: Less Positive Play and Promise for a Brief Assessment Battery. IEEE J Biomed Health Inform 2021; 25:3176-3184. [PMID: 33481724 PMCID: PMC8384142 DOI: 10.1109/jbhi.2021.3053846] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Childhood internalizing disorders, like anxiety and depression, are common, impairing, and difficult to detect. Universal childhood mental health screening has been recommended, but new technologies are needed to provide objective detection. Instrumented mood induction tasks, designed to press children for specific behavioral responses, have emerged as means for detecting childhood internalizing psychopathology. In our previous work, we leveraged machine learning to identify digital phenotypes of childhood internalizing psychopathology from movement and voice data collected during negative valence tasks (pressing for anxiety and fear). In this work, we develop a digital phenotype for childhood internalizing disorders based on wearable inertial sensor data recorded from a Positive Valence task during which a child plays with bubbles. We find that a phenotype derived from features that capture reward responsiveness is able to accurately detect children with underlying internalizing psychopathology (AUC = 0.81). In so doing, we explore the impact of a variety of feature sets computed from wearable sensors deployed to two body locations on phenotype performance across two phases of the task. We further consider this novel digital phenotype in the context of our previous Negative Valence digital phenotypes and find that each task brings unique information to the problem of detecting childhood internalizing psychopathology, capturing different problems and disorder subtypes. Collectively, these results provide preliminary evidence for a mood induction task battery to develop a novel diagnostic for childhood internalizing disorders.
Collapse
|
32
|
Chen X, Pan Z. A Convenient and Low-Cost Model of Depression Screening and Early Warning Based on Voice Data Using for Public Mental Health. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:6441. [PMID: 34198659 PMCID: PMC8296267 DOI: 10.3390/ijerph18126441] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Revised: 06/10/2021] [Accepted: 06/10/2021] [Indexed: 12/12/2022]
Abstract
Depression is a common mental health disease, which has great harm to public health. At present, the diagnosis of depression mainly depends on the interviews between doctors and patients, which is subjective, slow and expensive. Voice data are a kind of data that are easy to obtain and have the advantage of low cost. It has been proved that it can be used in the diagnosis of depression. The voice data used for modeling in this study adopted the authoritative public data set, which had passed the ethical review. The features of voice data were extracted by Python programming, and the voice features were stored in the format of CSV files. Through data processing, a big database, containing 1479 voice feature samples, was generated for modeling. Then, the decision tree screening model of depression was established by 10-fold cross validation and algorithm selection. The experiment achieved 83.4% prediction accuracy on voice data set. According to the prediction results of the model, the patients can be given early warning and intervention in time, so as to realize the health management of personal depression.
Collapse
Affiliation(s)
- Xin Chen
- School of Medicine, Hangzhou Normal University, Hangzhou 311121, China;
- Engineering Research Center of Mobile Health Management System, Ministry of Education, Hangzhou Normal University, Hangzhou 311121, China
- Institute of VR and Intelligent System, Hangzhou Normal University, Hangzhou 311121, China
| | - Zhigeng Pan
- School of Medicine, Hangzhou Normal University, Hangzhou 311121, China;
- Engineering Research Center of Mobile Health Management System, Ministry of Education, Hangzhou Normal University, Hangzhou 311121, China
- Institute of VR and Intelligent System, Hangzhou Normal University, Hangzhou 311121, China
| |
Collapse
|
33
|
Two-layer fuzzy multiple random forest for speech emotion recognition in human-robot interaction. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2019.09.005] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|