1
|
Linardon J, Chen K, Gajjar S, Eadara A, Wang S, Flathers M, Burns J, Torous J. Smartphone digital phenotyping in mental health disorders: A review of raw sensors utilized, machine learning processing pipelines, and derived behavioral features. Psychiatry Res 2025; 348:116483. [PMID: 40187059 DOI: 10.1016/j.psychres.2025.116483] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/10/2024] [Revised: 03/18/2025] [Accepted: 04/01/2025] [Indexed: 04/07/2025]
Abstract
With increased access to digital technology, there has been a surge in the use of and interest in digital phenotyping as a tool to calculate various features from raw smart device data. However, the increased usage of digital phenotyping has created confusion. The vast number of sensors that can be utilized to collect passive data, and diverse methods utilized to convert that sensor data into features has introduced conflicting results and conclusions into the literature. Consequently, there is an identified need for standardizing how digital phenotyping data is measured and collected. This review evaluates the different sensors and methods utilized in digital phenotyping research across 112 papers, with the goal of finding the most common platforms, sensors, and methods for each behavioral measure. This should help guide future digital phenotyping research, and resolve some existing confusion in the field. Information on each study's data sensor variables were tracked and consolidated into a double-coded Codebook. Variables assessed included but were not limited to data sensors, features extracted from data sensors, statistical methods used, phone type, patient access to phones, and characteristics of patient population. This review found that most studies used Android devices (n = 67) or both Android and iPhone (n = 38) with an average duration of 14.3 weeks. The GPS sensor was also found to be the most frequently used sensor. This review underscores the need for standardization in methodological reporting, sensor utilization, and feature extraction across mental health studies.
Collapse
Affiliation(s)
- Jake Linardon
- SEED Lifespan Strategic Research Centre, School of Psychology, Faculty of Health, Deakin University, Geelong, Australia
| | - Kelly Chen
- Beth Israel Deaconess Medical Center, Harvard Medical School, 330 Brookline Ave, Boston, MA, 02115, Boston, MA, USA
| | - Shruti Gajjar
- Beth Israel Deaconess Medical Center, Harvard Medical School, 330 Brookline Ave, Boston, MA, 02115, Boston, MA, USA
| | - Amrik Eadara
- Beth Israel Deaconess Medical Center, Harvard Medical School, 330 Brookline Ave, Boston, MA, 02115, Boston, MA, USA
| | - Shiwei Wang
- Beth Israel Deaconess Medical Center, Harvard Medical School, 330 Brookline Ave, Boston, MA, 02115, Boston, MA, USA
| | - Matthew Flathers
- Beth Israel Deaconess Medical Center, Harvard Medical School, 330 Brookline Ave, Boston, MA, 02115, Boston, MA, USA
| | - James Burns
- Beth Israel Deaconess Medical Center, Harvard Medical School, 330 Brookline Ave, Boston, MA, 02115, Boston, MA, USA
| | - John Torous
- Beth Israel Deaconess Medical Center, Harvard Medical School, 330 Brookline Ave, Boston, MA, 02115, Boston, MA, USA.
| |
Collapse
|
2
|
Ringwald WR, Feltman S, Schwartz HA, Samaras D, Khudari C, Luft BJ, Kotov R. Day-to-day dynamics of facial emotion expressions in posttraumatic stress disorder. J Affect Disord 2025; 380:331-339. [PMID: 40122249 DOI: 10.1016/j.jad.2025.03.109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/26/2024] [Revised: 03/13/2025] [Accepted: 03/19/2025] [Indexed: 03/25/2025]
Abstract
Facial expressions are an essential component of emotions that may reveal mechanisms maintaining posttraumatic stress disorder (PTSD). However, most research on emotions in PTSD has relied on self-reports, which only capture subjective affect. The few studies on outward emotion expressions have been hampered by methodological limitations, including low ecological validity and failure to capture the dynamic nature of emotions and symptoms. Our study addresses these limitations with an approach that has not been applied to psychopathology: person-specific models of day-to-day facial emotion expression and PTSD symptom dynamics. We studied a sample of World Trade Center responders (N = 112) with elevated PTSD pathology who recorded a daily video diary and self-reported symptoms for 90 days (8953 videos altogether). Facial expressions were detected from video recordings with a facial emotion recognition model. In data-driven, idiographic network models, most participants (80 %) had at least one, reliable expression-symptom link. Six expression-symptom dynamics were significant for >10 % of the sample. Each of these dynamics had statistically meaningful heterogeneity, with some people's symptoms related to over-expressivity and others to under-expressivity. Our results provide the foundation for a more complete understanding of emotions in PTSD that not only includes subjective feelings but also outward emotion expressions.
Collapse
Affiliation(s)
- Whitney R Ringwald
- Department of Psychology, University of Minnesota, United States of America.
| | - Scott Feltman
- Department of Applied Mathematics, Stony Brook University, United States of America
| | - H Andrew Schwartz
- Department of Computer Science, Stony Brook University, United States of America
| | - Dimitris Samaras
- Department of Computer Science, Stony Brook University, United States of America
| | - Christopher Khudari
- Department of Psychology, Rosalind Franklin University, United States of America
| | - Benjamin J Luft
- World Trade Center Health Program, Stony Brook University, United States of America; Department of Medicine, Stony Brook University, United States of America
| | - Roman Kotov
- Department of Psychiatry, Stony Brook University, United States of America
| |
Collapse
|
3
|
Mucci A, Leucht S, Giordano GM, Giuliani L, Wehr S, Weigel L, Galderisi S. Assessment of Negative Symptoms in Schizophrenia: From the Consensus Conference-Derived Scales to Remote Digital Phenotyping. Brain Sci 2025; 15:83. [PMID: 39851450 PMCID: PMC11764445 DOI: 10.3390/brainsci15010083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2024] [Revised: 01/13/2025] [Accepted: 01/15/2025] [Indexed: 01/26/2025] Open
Abstract
The assessment of negative symptoms in schizophrenia has advanced since the 2006 NIMH-MATRICS Consensus Statement, leading to the development of second-generation rating scales like the Brief Negative Symptom Scale and the Clinical Assessment Interview for Negative Symptoms. These scales address the limitations of first-generation tools, such as the inclusion of aspects that are not negative symptoms and the lack of assessment of the subject's internal experience. However, psychometric validation of these scales is still in progress, and they are not yet recommended by regulatory agencies, thus limiting their use in clinical trials and settings. Complementing these traditional methods, remote digital phenotyping offers a novel approach by leveraging smartphones and wearable technology to capture real-time, high-resolution clinical data. Despite the potential to overcome traditional assessment barriers, challenges remain in aligning these digital measures with clinical ratings and ensuring data security. Equally important is patient acceptance, as the success of remote digital phenotyping relies on the willingness of patients to use these technologies. This review provides a critical overview of both second-generation scales and remote digital phenotyping for assessing negative symptoms, highlighting future research needs.
Collapse
Affiliation(s)
- Armida Mucci
- Department of Mental and Physical Health and Preventive Medicine, School of Medicine, University of Campania Luigi Vanvitelli, Largo Madonna delle Grazie 1, 80135 Naples, Italy
| | - Stefan Leucht
- Department of Psychiatry and Psychotherapy, School of Medicine, Technical University of Munich, Klinikum Rechts der Isar, Ismaningerstrasse 22, 81675 Munich, Germany
| | - Giulia M. Giordano
- Department of Mental and Physical Health and Preventive Medicine, School of Medicine, University of Campania Luigi Vanvitelli, Largo Madonna delle Grazie 1, 80135 Naples, Italy
| | - Luigi Giuliani
- Department of Mental and Physical Health and Preventive Medicine, School of Medicine, University of Campania Luigi Vanvitelli, Largo Madonna delle Grazie 1, 80135 Naples, Italy
| | - Sophia Wehr
- Department of Psychiatry and Psychotherapy, School of Medicine, Technical University of Munich, Klinikum Rechts der Isar, Ismaningerstrasse 22, 81675 Munich, Germany
| | - Lucia Weigel
- Department of Psychiatry and Psychotherapy, School of Medicine, Technical University of Munich, Klinikum Rechts der Isar, Ismaningerstrasse 22, 81675 Munich, Germany
| | - Silvana Galderisi
- Department of Mental and Physical Health and Preventive Medicine, School of Medicine, University of Campania Luigi Vanvitelli, Largo Madonna delle Grazie 1, 80135 Naples, Italy
| |
Collapse
|
4
|
Ozturk S, Feltman S, Klein DN, Kotov R, Mohanty A. Digital assessment of nonverbal behaviors forecasts first onset of depression. Psychol Med 2024; 54:1-12. [PMID: 39363541 PMCID: PMC11496224 DOI: 10.1017/s0033291724002010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 07/29/2024] [Accepted: 08/12/2024] [Indexed: 10/05/2024]
Abstract
BACKGROUND Adolescence is marked by a sharp increase in the incidence of depression, especially in females. Identification of risk for depressive disorders (DD) in this key developmental stage can help prevention efforts, mitigating the clinical and public burden of DD. While frequently used in diagnosis, nonverbal behaviors are relatively understudied as risk markers for DD. Digital technology, such as facial recognition, may provide objective, fast, efficient, and cost-effective means of measuring nonverbal behavior. METHOD Here, we analyzed video-recorded clinical interviews of 359 never-depressed adolescents females via commercially available facial emotion recognition software. RESULTS We found that average head and facial movements forecast future first onset of depression (AUC = 0.70) beyond the effects of other established self-report and physiological markers of DD risk. CONCLUSIONS Overall, these findings suggest that digital assessment of nonverbal behaviors may provide a promising risk marker for DD, which could aid in early identification and intervention efforts.
Collapse
Affiliation(s)
- Sekine Ozturk
- Department of Psychology, Stony Brook University, Stony Brook, NY, USA
| | - Scott Feltman
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, USA
| | - Daniel N. Klein
- Department of Psychology, Stony Brook University, Stony Brook, NY, USA
| | - Roman Kotov
- Department of Psychiatry and Behavioral Science, Stony Brook University, Stony Brook, NY, USA
| | - Aprajita Mohanty
- Department of Psychology, Stony Brook University, Stony Brook, NY, USA
| |
Collapse
|
5
|
Martin EA, Lian W, Oltmanns JR, Jonas KG, Samaras D, Hallquist MN, Ruggero CJ, Clouston SAP, Kotov R. Behavioral meaures of psychotic disorders: Using automatic facial coding to detect nonverbal expressions in video. J Psychiatr Res 2024; 176:9-17. [PMID: 38830297 DOI: 10.1016/j.jpsychires.2024.05.056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Revised: 04/11/2024] [Accepted: 05/29/2024] [Indexed: 06/05/2024]
Abstract
Emotional deficits in psychosis are prevalent and difficult to treat. In particular, much remains unknown about facial expression abnormalities, and a key reason is that expressions are very labor-intensive to code. Automatic facial coding (AFC) can remove this barrier. The current study sought to both provide evidence for the utility of AFC in psychosis for research purposes and to provide evidence that AFC are valid measures of clinical constructs. Changes of facial expressions and head position of participants-39 with schizophrenia/schizoaffective disorder (SZ), 46 with other psychotic disorders (OP), and 108 never psychotic individuals (NP)-were assessed via FaceReader, a commercially available automated facial expression analysis software, using video recorded during a clinical interview. We first examined the behavioral measures of the psychotic disorder groups and tested if they can discriminate between the groups. Next, we evaluated links of behavioral measures with clinical symptoms, controlling for group membership. We found the SZ group was characterized by significantly less variation in neutral expressions, happy expressions, arousal, and head movements compared to NP. These measures discriminated SZ from NP well (AUC = 0.79, sensitivity = 0.79, specificity = 0.67) but discriminated SZ from OP less well (AUC = 0.66, sensitivity = 0.77, specificity = 0.46). We also found significant correlations between clinician-rated symptoms and most behavioral measures (particularly happy expressions, arousal, and head movements). Taken together, these results suggest that AFC can provide useful behavioral measures of psychosis, which could improve research on non-verbal expressions in psychosis and, ultimately, enhance treatment.
Collapse
Affiliation(s)
- Elizabeth A Martin
- Department of Psychological Science, University of California, Irvine, CA, USA.
| | - Wenxuan Lian
- Department of Materials Science and Engineering and Department of Applied Math and Statistics, Stony Brook University, Stony Brook, NY, USA
| | - Joshua R Oltmanns
- Department of Psychiatry, Stony Brook University, Stony Brook, NY, USA
| | - Katherine G Jonas
- Department of Psychiatry, Stony Brook University, Stony Brook, NY, USA
| | - Dimitris Samaras
- Department of Computer Science, Stony Brook University, Stony Brook, NY, USA
| | - Michael N Hallquist
- Department of Psychology and Neuroscience, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Camilo J Ruggero
- Department of Psychology, University of Texas at Dallas, Richardson, TX, USA
| | - Sean A P Clouston
- Program in Public Health and Department of Family, Population, and Preventive Medicine, Renaissance School of Medicine, Stony Brook University, Stony Brook, NY, USA
| | - Roman Kotov
- Department of Psychiatry, Stony Brook University, Stony Brook, NY, USA.
| |
Collapse
|
6
|
Jiang Z, Seyedi S, Griner E, Abbasi A, Rad AB, Kwon H, Cotes RO, Clifford GD. Evaluating and mitigating unfairness in multimodal remote mental health assessments. PLOS DIGITAL HEALTH 2024; 3:e0000413. [PMID: 39046989 PMCID: PMC11268595 DOI: 10.1371/journal.pdig.0000413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Accepted: 06/13/2024] [Indexed: 07/27/2024]
Abstract
Research on automated mental health assessment tools has been growing in recent years, often aiming to address the subjectivity and bias that existed in the current clinical practice of the psychiatric evaluation process. Despite the substantial health and economic ramifications, the potential unfairness of those automated tools was understudied and required more attention. In this work, we systematically evaluated the fairness level in a multimodal remote mental health dataset and an assessment system, where we compared the fairness level in race, gender, education level, and age. Demographic parity ratio (DPR) and equalized odds ratio (EOR) of classifiers using different modalities were compared, along with the F1 scores in different demographic groups. Post-training classifier threshold optimization was employed to mitigate the unfairness. No statistically significant unfairness was found in the composition of the dataset. Varying degrees of unfairness were identified among modalities, with no single modality consistently demonstrating better fairness across all demographic variables. Post-training mitigation effectively improved both DPR and EOR metrics at the expense of a decrease in F1 scores. Addressing and mitigating unfairness in these automated tools are essential steps in fostering trust among clinicians, gaining deeper insights into their use cases, and facilitating their appropriate utilization.
Collapse
Affiliation(s)
- Zifan Jiang
- Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, Georgia, United States of America
- Department of Biomedical Engineering, Emory University and Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Salman Seyedi
- Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, Georgia, United States of America
| | - Emily Griner
- Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta, Georgia, United States of America
| | - Ahmed Abbasi
- Department of IT, Analytics, and Operations, University of Notre Dame, Notre Dame, Indiana, United States of America
| | - Ali Bahrami Rad
- Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, Georgia, United States of America
| | - Hyeokhyen Kwon
- Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, Georgia, United States of America
| | - Robert O. Cotes
- Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta, Georgia, United States of America
| | - Gari D. Clifford
- Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, Georgia, United States of America
- Department of Biomedical Engineering, Emory University and Georgia Institute of Technology, Atlanta, Georgia, United States of America
| |
Collapse
|
7
|
Jiang Z, Seyedi S, Griner E, Abbasi A, Rad AB, Kwon H, Cotes RO, Clifford GD. Multimodal Mental Health Digital Biomarker Analysis From Remote Interviews Using Facial, Vocal, Linguistic, and Cardiovascular Patterns. IEEE J Biomed Health Inform 2024; 28:1680-1691. [PMID: 38198249 PMCID: PMC10986761 DOI: 10.1109/jbhi.2024.3352075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2024]
Abstract
OBJECTIVE Psychiatric evaluation suffers from subjectivity and bias, and is hard to scale due to intensive professional training requirements. In this work, we investigated whether behavioral and physiological signals, extracted from tele-video interviews, differ in individuals with psychiatric disorders. METHODS Temporal variations in facial expression, vocal expression, linguistic expression, and cardiovascular modulation were extracted from simultaneously recorded audio and video of remote interviews. Averages, standard deviations, and Markovian process-derived statistics of these features were computed from 73 subjects. Four binary classification tasks were defined: detecting 1) any clinically-diagnosed psychiatric disorder, 2) major depressive disorder, 3) self-rated depression, and 4) self-rated anxiety. Each modality was evaluated individually and in combination. RESULTS Statistically significant feature differences were found between psychiatric and control subjects. Correlations were found between features and self-rated depression and anxiety scores. Heart rate dynamics provided the best unimodal performance with areas under the receiver-operator curve (AUROCs) of 0.68-0.75 (depending on the classification task). Combining multiple modalities provided AUROCs of 0.72-0.82. CONCLUSION Multimodal features extracted from remote interviews revealed informative characteristics of clinically diagnosed and self-rated mental health status. SIGNIFICANCE The proposed multimodal approach has the potential to facilitate scalable, remote, and low-cost assessment for low-burden automated mental health services.
Collapse
|
8
|
Bufano P, Laurino M, Said S, Tognetti A, Menicucci D. Digital Phenotyping for Monitoring Mental Disorders: Systematic Review. J Med Internet Res 2023; 25:e46778. [PMID: 38090800 PMCID: PMC10753422 DOI: 10.2196/46778] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Revised: 05/29/2023] [Accepted: 07/31/2023] [Indexed: 12/18/2023] Open
Abstract
BACKGROUND The COVID-19 pandemic has increased the impact and spread of mental illness and made health services difficult to access; therefore, there is a need for remote, pervasive forms of mental health monitoring. Digital phenotyping is a new approach that uses measures extracted from spontaneous interactions with smartphones (eg, screen touches or movements) or other digital devices as markers of mental status. OBJECTIVE This review aimed to evaluate the feasibility of using digital phenotyping for predicting relapse or exacerbation of symptoms in patients with mental disorders through a systematic review of the scientific literature. METHODS Our research was carried out using 2 bibliographic databases (PubMed and Scopus) by searching articles published up to January 2023. By following the PRISMA (Preferred Reporting Items for Systematic Review and Meta-Analysis) guidelines, we started from an initial pool of 1150 scientific papers and screened and extracted a final sample of 29 papers, including studies concerning clinical populations in the field of mental health, which were aimed at predicting relapse or exacerbation of symptoms. The systematic review has been registered on the web registry Open Science Framework. RESULTS We divided the results into 4 groups according to mental disorder: schizophrenia (9/29, 31%), mood disorders (15/29, 52%), anxiety disorders (4/29, 14%), and substance use disorder (1/29, 3%). The results for the first 3 groups showed that several features (ie, mobility, location, phone use, call log, heart rate, sleep, head movements, facial and vocal characteristics, sociability, social rhythms, conversations, number of steps, screen on or screen off status, SMS text message logs, peripheral skin temperature, electrodermal activity, light exposure, and physical activity), extracted from data collected via the smartphone and wearable wristbands, can be used to create digital phenotypes that could support gold-standard assessment and could be used to predict relapse or symptom exacerbations. CONCLUSIONS Thus, as the data were consistent for almost all the mental disorders considered (mood disorders, anxiety disorders, and schizophrenia), the feasibility of this approach was confirmed. In the future, a new model of health care management using digital devices should be integrated with the digital phenotyping approach and tailored mobile interventions (managing crises during relapse or exacerbation).
Collapse
Affiliation(s)
- Pasquale Bufano
- Institute of Clinical Physiology, National Research Council, Pisa, Italy
| | - Marco Laurino
- Institute of Clinical Physiology, National Research Council, Pisa, Italy
| | - Sara Said
- Department of Surgical, Medical and Molecular Pathology and Critical Care Medicine, University of Pisa, Pisa, Italy
| | | | - Danilo Menicucci
- Department of Surgical, Medical and Molecular Pathology and Critical Care Medicine, University of Pisa, Pisa, Italy
| |
Collapse
|
9
|
Jiang Z, Seyedi S, Griner E, Abbasi A, Bahrami Rad A, Kwon H, Cotes RO, Clifford GD. Multimodal mental health assessment with remote interviews using facial, vocal, linguistic, and cardiovascular patterns. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.09.11.23295212. [PMID: 37745610 PMCID: PMC10516063 DOI: 10.1101/2023.09.11.23295212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Objective The current clinical practice of psychiatric evaluation suffers from subjectivity and bias, and requires highly skilled professionals that are often unavailable or unaffordable. Objective digital biomarkers have shown the potential to address these issues. In this work, we investigated whether behavioral and physiological signals, extracted from remote interviews, provided complimentary information for assessing psychiatric disorders. Methods Time series of multimodal features were derived from four conceptual modes: facial expression, vocal expression, linguistic expression, and cardiovascular modulation. The features were extracted from simultaneously recorded audio and video of remote interviews using task-specific and foundation models. Averages, standard deviations, and hidden Markov model-derived statistics of these features were computed from 73 subjects. Four binary classification tasks were defined: detecting 1) any clinically-diagnosed psychiatric disorder, 2) major depressive disorder, 3) self-rated depression, and 4) self-rated anxiety. Each modality was evaluated individually and in combination. Results Statistically significant feature differences were found between controls and subjects with mental health conditions. Correlations were found between features and self-rated depression and anxiety scores. Visual heart rate dynamics achieved the best unimodal performance with areas under the receiver-operator curve (AUROCs) of 0.68-0.75 (depending on the classification task). Combining multiple modalities achieved AUROCs of 0.72-0.82. Features from task-specific models outperformed features from foundation models. Conclusion Multimodal features extracted from remote interviews revealed informative characteristics of clinically diagnosed and self-rated mental health status. Significance The proposed multimodal approach has the potential to facilitate objective, remote, and low-cost assessment for low-burden automated mental health services.
Collapse
|
10
|
Galatzer-Levy IR, Onnela JP. Machine Learning and the Digital Measurement of Psychological Health. Annu Rev Clin Psychol 2023; 19:133-154. [PMID: 37159287 DOI: 10.1146/annurev-clinpsy-080921-073212] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Since its inception, the discipline of psychology has utilized empirical epistemology and mathematical methodologies to infer psychological functioning from direct observation. As new challenges and technological opportunities emerge, scientists are once again challenged to define measurement paradigms for psychological health and illness that solve novel problems and capitalize on new technological opportunities. In this review, we discuss the theoretical foundations of and scientific advances in remote sensor technology and machine learning models as they are applied to quantify psychological functioning, draw clinical inferences, and chart new directions in treatment.
Collapse
Affiliation(s)
- Isaac R Galatzer-Levy
- Department of Psychiatry, New York University Grossman School of Medicine, New York, NY, USA;
- Current affiliation: Google LLC, Mountain View, California, USA
| | - Jukka-Pekka Onnela
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA
| |
Collapse
|
11
|
Lam WWT, Tang YM, Fong KNK. A systematic review of the applications of markerless motion capture (MMC) technology for clinical measurement in rehabilitation. J Neuroeng Rehabil 2023; 20:57. [PMID: 37131238 PMCID: PMC10155325 DOI: 10.1186/s12984-023-01186-9] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 04/26/2023] [Indexed: 05/04/2023] Open
Abstract
BACKGROUND Markerless motion capture (MMC) technology has been developed to avoid the need for body marker placement during motion tracking and analysis of human movement. Although researchers have long proposed the use of MMC technology in clinical measurement-identification and measurement of movement kinematics in a clinical population, its actual application is still in its preliminary stages. The benefits of MMC technology are also inconclusive with regard to its use in assessing patients' conditions. In this review we put a minor focus on the method's engineering components and sought primarily to determine the current application of MMC as a clinical measurement tool in rehabilitation. METHODS A systematic computerized literature search was conducted in PubMed, Medline, CINAHL, CENTRAL, EMBASE, and IEEE. The search keywords used in each database were "Markerless Motion Capture OR Motion Capture OR Motion Capture Technology OR Markerless Motion Capture Technology OR Computer Vision OR Video-based OR Pose Estimation AND Assessment OR Clinical Assessment OR Clinical Measurement OR Assess." Only peer-reviewed articles that applied MMC technology for clinical measurement were included. The last search took place on March 6, 2023. Details regarding the application of MMC technology for different types of patients and body parts, as well as the assessment results, were summarized. RESULTS A total of 65 studies were included. The MMC systems used for measurement were most frequently used to identify symptoms or to detect differences in movement patterns between disease populations and their healthy counterparts. Patients with Parkinson's disease (PD) who demonstrated obvious and well-defined physical signs were the largest patient group to which MMC assessment had been applied. Microsoft Kinect was the most frequently used MMC system, although there was a recent trend of motion analysis using video captured with a smartphone camera. CONCLUSIONS This review explored the current uses of MMC technology for clinical measurement. MMC technology has the potential to be used as an assessment tool as well as to assist in the detection and identification of symptoms, which might further contribute to the use of an artificial intelligence method for early screening for diseases. Further studies are warranted to develop and integrate MMC system in a platform that can be user-friendly and accurately analyzed by clinicians to extend the use of MMC technology in the disease populations.
Collapse
Affiliation(s)
- Winnie W. T. Lam
- Department of Rehabilitation Sciences, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR China
| | - Yuk Ming Tang
- Department of Industrial and Systems Engineering, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR China
| | - Kenneth N. K. Fong
- Department of Rehabilitation Sciences, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR China
| |
Collapse
|
12
|
Chen ZS, Kulkarni P(P, Galatzer-Levy IR, Bigio B, Nasca C, Zhang Y. Modern views of machine learning for precision psychiatry. PATTERNS (NEW YORK, N.Y.) 2022; 3:100602. [PMID: 36419447 PMCID: PMC9676543 DOI: 10.1016/j.patter.2022.100602] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
In light of the National Institute of Mental Health (NIMH)'s Research Domain Criteria (RDoC), the advent of functional neuroimaging, novel technologies and methods provide new opportunities to develop precise and personalized prognosis and diagnosis of mental disorders. Machine learning (ML) and artificial intelligence (AI) technologies are playing an increasingly critical role in the new era of precision psychiatry. Combining ML/AI with neuromodulation technologies can potentially provide explainable solutions in clinical practice and effective therapeutic treatment. Advanced wearable and mobile technologies also call for the new role of ML/AI for digital phenotyping in mobile mental health. In this review, we provide a comprehensive review of ML methodologies and applications by combining neuroimaging, neuromodulation, and advanced mobile technologies in psychiatry practice. We further review the role of ML in molecular phenotyping and cross-species biomarker identification in precision psychiatry. We also discuss explainable AI (XAI) and neuromodulation in a closed human-in-the-loop manner and highlight the ML potential in multi-media information extraction and multi-modal data fusion. Finally, we discuss conceptual and practical challenges in precision psychiatry and highlight ML opportunities in future research.
Collapse
Affiliation(s)
- Zhe Sage Chen
- Department of Psychiatry, New York University Grossman School of Medicine, New York, NY 10016, USA
- Department of Neuroscience and Physiology, New York University Grossman School of Medicine, New York, NY 10016, USA
- The Neuroscience Institute, New York University Grossman School of Medicine, New York, NY 10016, USA
- Department of Biomedical Engineering, New York University Tandon School of Engineering, Brooklyn, NY 11201, USA
| | | | - Isaac R. Galatzer-Levy
- Department of Psychiatry, New York University Grossman School of Medicine, New York, NY 10016, USA
- Meta Reality Lab, New York, NY, USA
| | - Benedetta Bigio
- Department of Psychiatry, New York University Grossman School of Medicine, New York, NY 10016, USA
| | - Carla Nasca
- Department of Psychiatry, New York University Grossman School of Medicine, New York, NY 10016, USA
- The Neuroscience Institute, New York University Grossman School of Medicine, New York, NY 10016, USA
| | - Yu Zhang
- Department of Bioengineering, Lehigh University, Bethlehem, PA 18015, USA
- Department of Electrical and Computer Engineering, Lehigh University, Bethlehem, PA 18015, USA
| |
Collapse
|
13
|
Safety monitoring method of moving target in underground coal mine based on computer vision processing. Sci Rep 2022; 12:17899. [PMID: 36284147 PMCID: PMC9596409 DOI: 10.1038/s41598-022-22564-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2022] [Accepted: 10/17/2022] [Indexed: 11/09/2022] Open
Abstract
Coal is one of the main energy sources in China. The country attaches great importance to the development of coal mining industry, and coal production is on the rise. At the same time, mine safety accidents are becoming more and more frequent, and the country is paying more and more attention to mine safety accidents. The underground environment of coal mine is complex, noisy and uneven, and there will be problems such as occlusion and high false detection rate during video monitoring. In order to ensure the safety of underground personnel, moving target detection and tracking based on video monitoring information is of great significance for coal mine safety production. The purpose of this paper is to study how to analyze and study the monitoring of moving targets in coal mines based on computer vision processing, and describe the image processing methods. This paper puts forward the problem of target monitoring, which is based on image processing, and then elaborates on the concept of image enhancement and related algorithms. From the average gradient, the algorithm in this paper is 56.60% higher than the histogram equalization algorithm, and 68.26% higher than the dark primary color prior dehazing algorithm. and designs and analyzes cases of image enhancement in coal mines. The experimental results show that the information entropy of the algorithm in this paper is 31.10% higher than that of the dark primary color prior dehazing algorithm, and 18.72% higher than that of the histogram equalization algorithm. It can be seen that the algorithm in this paper can achieve better enhancement effect.
Collapse
|
14
|
Zhang J, Yang H, Li W, Li Y, Qin J, He L. Automatic Schizophrenia Detection Using Multimodality Media via a Text Reading Task. Front Neurosci 2022; 16:933049. [PMID: 35911987 PMCID: PMC9331283 DOI: 10.3389/fnins.2022.933049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Accepted: 06/13/2022] [Indexed: 11/13/2022] Open
Abstract
Schizophrenia is a crippling chronic mental disease that affects people worldwide. In this work, an automatic schizophrenia detection algorithm is proposed based on the reading deficit of schizophrenic patients. From speech and video modalities, the automatic schizophrenia detection algorithm illustrates abnormal speech, head movement, and reading fluency during the reading task. In the speech modality, an acoustic model of speech emotional flatness in schizophrenia is established to reflect the emotional expression flatness of schizophrenic speech from the perspective of speech production and perception. In the video modality, the head-movement-related features are proposed to illustrate the spontaneous head movement caused by repeated reading and unconscious movement, and the reading-fluency-related features are proposed to convey the damaged degree of schizophrenic patients' reading fluency. The experimental data of this work are 160 segments of speech and video data recorded by 40 participants (20 schizophrenic patients and 20 normal controls). Combined with support vector machines and random forest, the accuracy of the proposed acoustic model, the head-movement-related features, and the reading-fluency-related features range from 94.38 to 96.50%, 73.38 to 83.38%, and 79.50 to 83.63%, respectively. The average accuracy of the proposed automatic schizophrenia detection algorithm reaches 97.50%. The experimental results indicate the effectiveness of the proposed automatic detection algorithm as an auxiliary diagnostic method for schizophrenia.
Collapse
Affiliation(s)
- Jing Zhang
- College of Biomedical Engineering, Sichuan University, Chengdu, China
| | - Hui Yang
- College of Biomedical Engineering, Sichuan University, Chengdu, China
| | - Wen Li
- College of Biomedical Engineering, Sichuan University, Chengdu, China
| | - Yuanyuan Li
- Mental Health Center, West China Hospital of Sichuan University, Chengdu, China
| | - Jing Qin
- Centre for Smart Health, School of Nursing, The Hong Kong Polytechnic University, Hong Kong, Hong Kong SAR, China
| | - Ling He
- College of Biomedical Engineering, Sichuan University, Chengdu, China
- *Correspondence: Ling He
| |
Collapse
|
15
|
Jiang Z, Luskus M, Seyedi S, Griner EL, Rad AB, Clifford GD, Boazak M, Cotes RO. Utilizing computer vision for facial behavior analysis in schizophrenia studies: A systematic review. PLoS One 2022; 17:e0266828. [PMID: 35395049 PMCID: PMC8992987 DOI: 10.1371/journal.pone.0266828] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 03/28/2022] [Indexed: 12/22/2022] Open
Abstract
Background Schizophrenia is a severe psychiatric disorder that causes significant social and functional impairment. Currently, the diagnosis of schizophrenia is based on information gleaned from the patient’s self-report, what the clinician observes directly, and what the clinician gathers from collateral informants, but these elements are prone to subjectivity. Utilizing computer vision to measure facial expressions is a promising approach to adding more objectivity in the evaluation and diagnosis of schizophrenia. Method We conducted a systematic review using PubMed and Google Scholar. Relevant publications published before (including) December 2021 were identified and evaluated for inclusion. The objective was to conduct a systematic review of computer vision for facial behavior analysis in schizophrenia studies, the clinical findings, and the corresponding data processing and machine learning methods. Results Seventeen studies published between 2007 to 2021 were included, with an increasing trend in the number of publications over time. Only 14 articles used interviews to collect data, of which different combinations of passive to evoked, unstructured to structured interviews were used. Various types of hardware were adopted and different types of visual data were collected. Commercial, open-access, and in-house developed models were used to recognize facial behaviors, where frame-level and subject-level features were extracted. Statistical tests and evaluation metrics varied across studies. The number of subjects ranged from 2-120, with an average of 38. Overall, facial behaviors appear to have a role in estimating diagnosis of schizophrenia and psychotic symptoms. When studies were evaluated with a quality assessment checklist, most had a low reporting quality. Conclusion Despite the rapid development of computer vision techniques, there are relatively few studies that have applied this technology to schizophrenia research. There was considerable variation in the clinical paradigm and analytic techniques used. Further research is needed to identify and develop standardized practices, which will help to promote further advances in the field.
Collapse
Affiliation(s)
- Zifan Jiang
- Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA, United States of America
- Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA, United States of America
- * E-mail:
| | - Mark Luskus
- Department of Psychiatry and Behavioral Sciences, University of California, San Francisco, San Francisco, CA, United States of America
| | - Salman Seyedi
- Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA, United States of America
| | - Emily L. Griner
- Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta, GA, United States of America
| | - Ali Bahrami Rad
- Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA, United States of America
| | - Gari D. Clifford
- Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA, United States of America
- Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA, United States of America
| | - Mina Boazak
- Animo Sano Psychiatry, PLLC, Durham, NC, United States of America
| | - Robert O. Cotes
- Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta, GA, United States of America
| |
Collapse
|
16
|
Qin W, Qi F, Li J, Li P, Zang YS. Prognostic Biomarkers on a Competitive Endogenous RNA Network Reveals Overall Survival in Triple-Negative Breast Cancer. Front Oncol 2021; 11:681946. [PMID: 34178671 PMCID: PMC8232227 DOI: 10.3389/fonc.2021.681946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Accepted: 05/20/2021] [Indexed: 11/13/2022] Open
Abstract
The objective of this study was to construct a competitive endogenous RNA (ceRNA) regulatory network using differentially expressed long noncoding RNAs (lncRNAs), microRNAs (miRNAs), and mRNAs in patients with triple-negative breast cancer (TNBC) and to construct a prognostic model for predicting overall survival (OS) in patients with TNBC. Differentially expressed lncRNAs, miRNAs, and mRNAs in TNBC patients from the TCGA and Metabric databases were examined. A prognostic model based on prognostic scores (PSs) was established for predicting OS in TNBC patients, and the performance of the model was assessed by a recipient that operated on a distinctive curve. A total of 874 differentially expressed RNAs (DERs) were screened, among which 6 lncRNAs, 295 miRNAs and 573 mRNAs were utilized to construct targeted and coexpression ceRNA regulatory networks. Eight differentially expressed genes (DEGs) associated with survival prognosis, DBX2, MYH7, TARDBP, POU4F1, ABCB11, LHFPL5, TRHDE and TIMP4, were identified by multivariate Cox regression and then used to establish a prognostic model. Our study shows that the ceRNA network has a critical role in maintaining the aggressiveness of TNBC and provides comprehensive molecular-level insight for predicting individual mortality hazards for TNBC patients. Our data suggest that these prognostic mRNAs from the ceRNA network are promising therapeutic targets for clinical intervention.
Collapse
Affiliation(s)
- Wenxing Qin
- Department of Oncology, Changzheng Hospital, Naval Medical University, Shanghai, China
| | - Feng Qi
- Department of Oncology, Changzheng Hospital, Naval Medical University, Shanghai, China.,Liver Cancer Institute, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Jia Li
- Liver Cancer Institute, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Ping Li
- Department of Gastroenterology, Changhai Hospital, Naval Medical University, Shanghai, China
| | - Yuan-Sheng Zang
- Department of Oncology, Changzheng Hospital, Naval Medical University, Shanghai, China
| |
Collapse
|
17
|
Galatzer-Levy I, Abbas A, Ries A, Homan S, Sels L, Koesmahargyo V, Yadav V, Colla M, Scheerer H, Vetter S, Seifritz E, Scholz U, Kleim B. Validation of Visual and Auditory Digital Markers of Suicidality in Acutely Suicidal Psychiatric Inpatients: Proof-of-Concept Study. J Med Internet Res 2021; 23:e25199. [PMID: 34081022 PMCID: PMC8212625 DOI: 10.2196/25199] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 12/15/2020] [Accepted: 03/16/2021] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Multiple symptoms of suicide risk have been assessed based on visual and auditory information, including flattened affect, reduced movement, and slowed speech. Objective quantification of such symptomatology from novel data sources can increase the sensitivity, scalability, and timeliness of suicide risk assessment. OBJECTIVE We aimed to examine measurements extracted from video interviews using open-source deep learning algorithms to quantify facial, vocal, and movement behaviors in relation to suicide risk severity in recently admitted patients following a suicide attempt. METHODS We utilized video to quantify facial, vocal, and movement markers associated with mood, emotion, and motor functioning from a structured clinical conversation in 20 patients admitted to a psychiatric hospital following a suicide risk attempt. Measures were calculated using open-source deep learning algorithms for processing facial expressivity, head movement, and vocal characteristics. Derived digital measures of flattened affect, reduced movement, and slowed speech were compared to suicide risk with the Beck Scale for Suicide Ideation controlling for age and sex, using multiple linear regression. RESULTS Suicide severity was associated with multiple visual and auditory markers, including speech prevalence (β=-0.68, P=.02, r2=0.40), overall expressivity (β=-0.46, P=.10, r2=0.27), and head movement measured as head pitch variability (β=-1.24, P=.006, r2=0.48) and head yaw variability (β=-0.54, P=.06, r2=0.32). CONCLUSIONS Digital measurements of facial affect, movement, and speech prevalence demonstrated strong effect sizes and linear associations with the severity of suicidal ideation.
Collapse
Affiliation(s)
- Isaac Galatzer-Levy
- Research and Development, AiCure, New York, NY, United States
- Psychiatry, New York University School of Medicine, New York, NY, United States
| | - Anzar Abbas
- Research and Development, AiCure, New York, NY, United States
| | - Anja Ries
- Department of Psychology, University of Zurich, Zurich, Switzerland
- Department of Psychiatry, Psychotherapy and Psychosomatics, University of Zurich, Zurich, Switzerland
- Neuroscience Centre Zurich, University of Zurich, Zurich, Switzerland
| | - Stephanie Homan
- Department of Psychology, University of Zurich, Zurich, Switzerland
- Department of Psychiatry, Psychotherapy and Psychosomatics, University of Zurich, Zurich, Switzerland
- Neuroscience Centre Zurich, University of Zurich, Zurich, Switzerland
| | - Laura Sels
- Department of Psychology, University of Zurich, Zurich, Switzerland
- Department of Psychiatry, Psychotherapy and Psychosomatics, University of Zurich, Zurich, Switzerland
| | | | - Vijay Yadav
- Research and Development, AiCure, New York, NY, United States
| | - Michael Colla
- Department of Psychiatry, Psychotherapy and Psychosomatics, University of Zurich, Zurich, Switzerland
| | - Hanne Scheerer
- Department of Psychiatry, Psychotherapy and Psychosomatics, University of Zurich, Zurich, Switzerland
| | - Stefan Vetter
- Department of Psychiatry, Psychotherapy and Psychosomatics, University of Zurich, Zurich, Switzerland
| | - Erich Seifritz
- Department of Psychiatry, Psychotherapy and Psychosomatics, University of Zurich, Zurich, Switzerland
- Neuroscience Centre Zurich, University of Zurich, Zurich, Switzerland
| | - Urte Scholz
- Department of Psychiatry, Psychotherapy and Psychosomatics, University of Zurich, Zurich, Switzerland
| | - Birgit Kleim
- Department of Psychology, University of Zurich, Zurich, Switzerland
- Department of Psychiatry, Psychotherapy and Psychosomatics, University of Zurich, Zurich, Switzerland
- Neuroscience Centre Zurich, University of Zurich, Zurich, Switzerland
| |
Collapse
|
18
|
Abbas A, Sauder C, Yadav V, Koesmahargyo V, Aghjayan A, Marecki S, Evans M, Galatzer-Levy IR. Remote Digital Measurement of Facial and Vocal Markers of Major Depressive Disorder Severity and Treatment Response: A Pilot Study. Front Digit Health 2021; 3:610006. [PMID: 34713091 PMCID: PMC8521884 DOI: 10.3389/fdgth.2021.610006] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Accepted: 02/19/2021] [Indexed: 12/21/2022] Open
Abstract
Objectives: Multiple machine learning-based visual and auditory digital markers have demonstrated associations between major depressive disorder (MDD) status and severity. The current study examines if such measurements can quantify response to antidepressant treatment (ADT) with selective serotonin reuptake inhibitors (SSRIs) and serotonin-norepinephrine uptake inhibitors (SNRIs). Methods: Visual and auditory markers were acquired through an automated smartphone task that measures facial, vocal, and head movement characteristics across 4 weeks of treatment (with time points at baseline, 2 weeks, and 4 weeks) on ADT (n = 18). MDD diagnosis was confirmed using the Mini-International Neuropsychiatric Interview (MINI), and the Montgomery-Åsberg Depression Rating Scale (MADRS) was collected concordantly to assess changes in MDD severity. Results: Patient responses to ADT demonstrated clinically and statistically significant changes in the MADRS [F (2, 34) = 51.62, p < 0.0001]. Additionally, patients demonstrated significant increases in multiple digital markers including facial expressivity, head movement, and amount of speech. Finally, patients demonstrated significantly decreased frequency of fear and anger facial expressions. Conclusion: Digital markers associated with MDD demonstrate validity as measures of treatment response.
Collapse
Affiliation(s)
| | - Colin Sauder
- Adams Clinical, Watertown, MA, United States
- Karuna Therapeutics, Boston, MA, United States
| | | | | | | | | | | | - Isaac R. Galatzer-Levy
- AiCure, New York, NY, United States
- Psychiatry, New York University School of Medicine, New York, NY, United States
| |
Collapse
|