Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Leis A, Ronzano F, Mayer MA, Furlong LI, Sanz F. Detecting Signs of Depression in Tweets in Spanish: Behavioral and Linguistic Analysis. J Med Internet Res 2019;21:e14199. [PMID: 31250832 PMCID: PMC6620890 DOI: 10.2196/14199] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 05/24/2019] [Accepted: 05/24/2019] [Indexed: 01/24/2023] Open

For:	Leis A, Ronzano F, Mayer MA, Furlong LI, Sanz F. Detecting Signs of Depression in Tweets in Spanish: Behavioral and Linguistic Analysis. J Med Internet Res 2019;21:e14199. [PMID: 31250832 PMCID: PMC6620890 DOI: 10.2196/14199] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 05/24/2019] [Accepted: 05/24/2019] [Indexed: 01/24/2023] Open

Number

Cited by Other Article(s)

Xu X, An F, Wu S, Wang H, Kang Q, Wang Y, Zhu T, Zhang B, Huang W, Liu X, Wang X. Affective norms for 501 Chinese words from three emotional dimensions rated by depressive disorder patients. Front Psychiatry 2024;15:1309501. [PMID: 38469031 PMCID: PMC10925686 DOI: 10.3389/fpsyt.2024.1309501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/08/2023] [Accepted: 02/07/2024] [Indexed: 03/13/2024] Open

Xu C, Wongpakaran N, Wongpakaran T, Siriwittayakorn T, Wedding D, Varnado P. Syntactic Errors in Older Adults with Depression. MEDICINA (KAUNAS, LITHUANIA) 2023;59:2133. [PMID: 38138236 PMCID: PMC10744892 DOI: 10.3390/medicina59122133] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 12/03/2023] [Accepted: 12/05/2023] [Indexed: 12/24/2023]

Shi J, Khoo Z. Words for the hearts: a corpus study of metaphors in online depression communities. Front Psychol 2023;14:1227123. [PMID: 37829080 PMCID: PMC10566633 DOI: 10.3389/fpsyg.2023.1227123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 08/18/2023] [Indexed: 10/14/2023] Open

Abstract

Purpose/significance

Humans understand, think, and express themselves through metaphors. The current paper emphasizes the importance of identifying the metaphorical language used in online health communities (OHC) to understand how users frame and make sense of their experiences, which can boost the effectiveness of counseling and interventions for this population.

Methods/process

We used a web crawler to obtain a corpus of an online depression community. We introduced a three-stage procedure for metaphor identification in a Chinese Corpus: (1) combine MIPVU to identify metaphorical expressions (ME) bottom-up and formulate preliminary working hypotheses; (2) collect more ME top-down in the corpus by performing semantic domain analysis on identified ME; and (3) analyze ME and categorize conceptual metaphors using a reference list. In this way, we have gained a greater understanding of how depression sufferers conceptualize their experience metaphorically in an under-represented language in the literature (Chinese) of a new genre (online health community).

Results/conclusion

Main conceptual metaphors for depression are classified into PERSONAL LIFE, INTERPERSONAL RELATIONSHIP, TIME, and CYBERCULTURE metaphors. Identifying depression metaphors in the Chinese corpus pinpoints the sociocultural environment people with depression are experiencing: lack of offline support, social stigmatization, and substitutability of offline support with online support. We confirm a number of depression metaphors found in other languages, providing a theoretical basis for researching, identifying, and treating depression in multilingual settings. Our study also identifies new metaphors with source-target connections based on embodied, sociocultural, and idiosyncratic levels. From these three levels, we analyze metaphor research's theoretical and practical implications, finding ways to emphasize its inherent cross-disciplinarity meaningfully.

Collapse

Marszałek M, Miązek A, Roczniewska M. Promotion and prevention regulatory focus LIWC dictionary. Polish adaptation and validation. PLoS One 2023;18:e0288726. [PMID: 37471322 PMCID: PMC10358899 DOI: 10.1371/journal.pone.0288726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 07/04/2023] [Indexed: 07/22/2023] Open

Abstract

This article describes the adaptation and validation of a Polish version of the regulatory focus (RF) Linguistic Inquiry and Word Count (LIWC) dictionary. RF theory proposes that there are two types of self-regulation: promotion (focus on gains, growth, and ideals) and prevention (focus on losses, security, and oughts). Apart from self-report questionnaires, one method to measure RF includes a linguistic analysis. LIWC counts the frequency of words from relevant categories and presents the output as a percentage of all words used in a writing sample. RF LIWC contains two categories: promotion (e.g., achieve, ideal) and prevention (e.g., afraid, fail). To test the psychometric properties of our Polish adaptation of the RF LIWC instrument, we performed three studies. In Study 1 (N = 10), experts in RF theory rated the extent to which each dictionary entry was related to promotion and prevention foci. Results showed that words from the promotion category were rated as more promotion than prevention-related, and the pattern was reversed for words from the prevention category. In Study 2 (N = 130) we examined the divergent validity of the instrument by experimentally manipulating RF and testing the writing patterns. When a promotion focus was activated, individuals wrote more words from the promotion than prevention category, and the pattern was reversed in the prevention group. Study 3 (N = 414) investigated whether the promotion and prevention scores obtained through RF LIWC are linked with results obtained using a self-report questionnaire that measures chronic RF. Promotion scores from RF LIWC correlated positively with chronic promotion RF and prevention scores from RF LIWC correlated positively with chronic prevention RF. These preliminary findings provide initial support for the validity of the Polish adaptation of the RF LIWC.

Collapse

Ryu J, Heisig S, McLaughlin C, Katz M, Mayberg HS, Gu X. A natural language processing approach reveals first-person pronoun usage and non-fluency as markers of therapeutic alliance in psychotherapy. iScience 2023;26:106860. [PMID: 37255661 PMCID: PMC10225921 DOI: 10.1016/j.isci.2023.106860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 04/18/2023] [Accepted: 05/08/2023] [Indexed: 06/01/2023] Open

Pool-Cen J, Carlos-Martínez H, Hernández-Chan G, Sánchez-Siordia O. Detection of Depression-Related Tweets in Mexico Using Crosslingual Schemes and Knowledge Distillation. Healthcare (Basel) 2023;11:healthcare11071057. [PMID: 37046984 PMCID: PMC10094126 DOI: 10.3390/healthcare11071057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 03/18/2023] [Accepted: 03/20/2023] [Indexed: 04/08/2023] Open

Shi J, Khoo Z. Online health community for change: Analysis of self-disclosure and social networks of users with depression. Front Psychol 2023;14:1092884. [PMID: 37057164 PMCID: PMC10088863 DOI: 10.3389/fpsyg.2023.1092884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 02/23/2023] [Indexed: 03/30/2023] Open

Abstract BackgroundA key research question with theoretical and practical implications is to investigate the various conditions by which social network sites (SNS) may either enhance or interfere with mental well-being, given the omnipresence of SNS and their dual effects on well-being.Method/processWe study SNS’ effects on well-being by accounting for users’ personal (i.e., self-disclosure) and situational (i.e., social networks) attributes, using a mixed design of content analysis and social network analysis.Result/conclusionWe compare users’ within-person changes in self-disclosure and social networks in two phases (over half a year), drawing on Weibo Depression SuperTalk, an online community for depression, and find: ① Several network attributes strengthen social support, including network connectivity, global efficiency, degree centralization, hubs of communities, and reciprocal interactions. ② Users’ self-disclosure attributes reflect positive changes in mental well-being and increased attachment to the community. ③ Correlations exist between users’ topological and self-disclosure attributes. ④ A Poisson regression model extracts self-disclosure attributes that may affect users’ received social support, including the writing length, number of active days, informal words, adverbs, negative emotion words, biological process words, and first-person singular forms.InnovationWe combine social network analysis with content analysis, highlighting the need to understand SNS’ effects on well-being by accounting for users’ self-disclosure (content) and communication partners (social networks).Implication/contributionAuthentic user data helps to avoid recall bias commonly found in self-reported data. A longitudinal within-person analysis of SNS’ effects on well-being is helpful for policymakers in public health intervention, community managers for group organizations, and users in online community engagement. Collapse

Surveillance of communicable diseases using social media: A systematic review. PLoS One 2023;18:e0282101. [PMID: 36827297 PMCID: PMC9956027 DOI: 10.1371/journal.pone.0282101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Accepted: 02/07/2023] [Indexed: 02/25/2023] Open

Abstract

BACKGROUND

Communicable diseases pose a severe threat to public health and economic growth. The traditional methods that are used for public health surveillance, however, involve many drawbacks, such as being labor intensive to operate and resulting in a lag between data collection and reporting. To effectively address the limitations of these traditional methods and to mitigate the adverse effects of these diseases, a proactive and real-time public health surveillance system is needed. Previous studies have indicated the usefulness of performing text mining on social media.

OBJECTIVE

To conduct a systematic review of the literature that used textual content published to social media for the purpose of the surveillance and prediction of communicable diseases.

METHODOLOGY

Broad search queries were formulated and performed in four databases. Both journal articles and conference materials were included. The quality of the studies, operationalized as reliability and validity, was assessed. This qualitative systematic review was guided by the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines.

RESULTS

Twenty-three publications were included in this systematic review. All studies reported positive results for using textual social media content to surveille communicable diseases. Most studies used Twitter as a source for these data. Influenza was studied most frequently, while other communicable diseases received far less attention. Journal articles had a higher quality (reliability and validity) than conference papers. However, studies often failed to provide important information about procedures and implementation.

CONCLUSION

Text mining of health-related content published on social media can serve as a novel and powerful tool for the automated, real-time, and remote monitoring of public health and for the surveillance and prediction of communicable diseases in particular. This tool can address limitations related to traditional surveillance methods, and it has the potential to supplement traditional methods for public health surveillance.

Collapse

Santos WRD, de Oliveira RL, Paraboni I. SetembroBR: a social media corpus for depression and anxiety disorder prediction. LANG RESOUR EVAL 2023. [DOI: 10.1007/s10579-022-09633-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]

Lyu S, Ren X, Du Y, Zhao N. Detecting depression of Chinese microblog users via text analysis: Combining Linguistic Inquiry Word Count (LIWC) with culture and suicide related lexicons. Front Psychiatry 2023;14:1121583. [PMID: 36846219 PMCID: PMC9947407 DOI: 10.3389/fpsyt.2023.1121583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Accepted: 01/26/2023] [Indexed: 02/11/2023] Open

Abstract

INTRODUCTION

In recent years, research has used psycholinguistic features in public discourse, networking behaviors on social media and profile information to train models for depression detection. However, the most widely adopted approach for the extraction of psycholinguistic features is to use the Linguistic Inquiry Word Count (LIWC) dictionary and various affective lexicons. Other features related to cultural factors and suicide risk have not been explored. Moreover, the use of social networking behavioral features and profile features would limit the generalizability of the model. Therefore, our study aimed at building a prediction model of depression for text-only social media data through a wider range of possible linguistic features related to depression, and illuminate the relationship between linguistic expression and depression.

METHODS

We collected 789 users' depression scores as well as their past posts on Weibo, and extracted a total of 117 lexical features via Simplified Chinese Linguistic Inquiry Word Count, Chinese Suicide Dictionary, Chinese Version of Moral Foundations Dictionary, Chinese Version of Moral Motivation Dictionary, and Chinese Individualism/Collectivism Dictionary.

RESULTS

Results showed that all the dictionaries contributed to the prediction. The best performing model occurred with linear regression, with the Pearson correlation coefficient between predicted values and self-reported values was 0.33, the R-squared was 0.10, and the split-half reliability was 0.75.

DISCUSSION

This study did not only develop a predictive model applicable to text-only social media data, but also demonstrated the importance taking cultural psychological factors and suicide related expressions into consideration in the calculation of word frequency. Our research provided a more comprehensive understanding of how lexicons related to cultural psychology and suicide risk were associated with depression, and could contribute to the recognition of depression.

Collapse

Koops S, Brederoo SG, de Boer JN, Nadema FG, Voppel AE, Sommer IE. Speech as a Biomarker for Depression. CNS & NEUROLOGICAL DISORDERS DRUG TARGETS 2023;22:152-160. [PMID: 34961469 DOI: 10.2174/1871527320666211213125847] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Revised: 10/10/2021] [Accepted: 10/10/2021] [Indexed: 01/01/2023]

Tejaswini V, Babu KS, Sahoo B. Depression Detection from Social Media Text Analysis using Natural Language Processing Techniques and Hybrid Deep Learning Model. ACM T ASIAN LOW-RESO 2022. [DOI: 10.1145/3569580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Pan W, Han Y, Li J, Zhang E, He B. The positive energy of netizens: development and application of fine-grained sentiment lexicon and emotional intensity model. CURRENT PSYCHOLOGY 2022;42:1-18. [PMID: 36345548 PMCID: PMC9630060 DOI: 10.1007/s12144-022-03876-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/10/2022] [Indexed: 11/06/2022]

Abstract

The outbreak of COVID-19 has led to a global health crisis and caused huge emotional swings. However, the positive emotional expressions, like self-confidence, optimism, and praise, that appear in Chinese social networks are rarely explored by researchers. This study aims to analyze the characteristics of netizens' positive energy expressions and the impact of node events on public emotional expression during the COVID-19 pandemic. First, a total of 6,525,249 Chinese texts posted by Sina Weibo users were randomly selected through textual data cleaning and word segmentation for corpus construction. A fine-grained sentiment lexicon that contained POSITIVE ENERGY was built using Word2Vec technology; this lexicon was later used to conduct sentiment category analysis on original posts. Next, through manual labeling and multi-classification machine learning model construction, four mainstream machine learning algorithms were selected to train the emotional intensity model. Finally, the lexicon and optimized emotional intensity model were used to analyze the emotional expressions of Chinese netizens. The results show that POSITIVE ENERGY expression accounted for 40.97% during the COVID-19 pandemic. Over the course of time, POSITIVE ENERGY emotions were displayed at the highest levels and SURPRISES the lowest. The analysis results of the node events showed after the outbreak was confirmed officially, the expressions of POSITIVE ENERGY and FEAR increased simultaneously. After the initial victory in pandemic prevention and control, the expression of POSITIVE ENERGY and SAD reached a peak, while the increase of SAD was the most prominent. The fine-grained sentiment lexicon, which includes a POSITIVE ENERGY category, demonstrated reliable algorithm performance and can be used for sentiment classification of Chinese Internet context. We also found many POSITIVE ENERGY expressions in Chinese online social platforms which are proven to be significantly affected by nod events of different nature.

Collapse

Abu-Taieh EM, AlHadid I, Masa’deh R, Alkhawaldeh RS, Khwaldeh S, Alrowwad A. Factors Affecting the Use of Social Networks and Its Effect on Anxiety and Depression among Parents and Their Children: Predictors Using ML, SEM and Extended TAM. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022;19:ijerph192113764. [PMID: 36360644 PMCID: PMC9656283 DOI: 10.3390/ijerph192113764] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 10/14/2022] [Accepted: 10/17/2022] [Indexed: 05/12/2023]

Chen L, Jeong J, Simpkins B, Ferrara E. Exploring ADHD Users’ Behavior on Twitter: A Comparative Analysis of Tweet Content and User Interactions (Preprint). J Med Internet Res 2022;25:e43439. [PMID: 37195757 DOI: 10.2196/43439] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 04/04/2023] [Accepted: 04/05/2023] [Indexed: 04/08/2023] Open

Abstract

BACKGROUND

With the widespread use of social media, people share their real-time thoughts and feelings via interactions on these platforms, including those revolving around mental health problems. This can provide a new opportunity for researchers to collect health-related data to study and analyze mental disorders. However, as one of the most common mental disorders, there are few studies regarding the manifestations of attention-deficit/hyperactivity disorder (ADHD) on social media.

OBJECTIVE

This study aims to examine and identify the different behavioral patterns and interactions of users with ADHD on Twitter through the text content and metadata of their posted tweets.

METHODS

First, we built 2 data sets: an ADHD user data set containing 3135 users who explicitly reported having ADHD on Twitter and a control data set made up of 3223 randomly selected Twitter users without ADHD. All historical tweets of users in both data sets were collected. We applied mixed methods in this study. We performed Top2Vec topic modeling to extract topics frequently mentioned by users with ADHD and those without ADHD and used thematic analysis to further compare the differences in contents that were discussed by the 2 groups under these topics. We used a distillBERT sentiment analysis model to calculate the sentiment scores for the emotion categories and compared the sentiment intensity and frequency. Finally, we extracted users' posting time, tweet categories, and the number of followers and followings from the metadata of tweets and compared the statistical distribution of these features between ADHD and non-ADHD groups.

RESULTS

In contrast to the control group of the non-ADHD data set, users with ADHD tweeted about the inability to concentrate and manage time, sleep disturbance, and drug abuse. Users with ADHD felt confusion and annoyance more frequently, while they felt less excitement, caring, and curiosity (all P<.001). Users with ADHD were more sensitive to emotions and felt more intense feelings of nervousness, sadness, confusion, anger, and amusement (all P<.001). As for the posting characteristics, compared with controls, users with ADHD were more active in posting tweets (P=.04), especially at night between midnight and 6 AM (P<.001); posting more tweets with original content (P<.001); and following fewer people on Twitter (P<.001).

CONCLUSIONS

This study revealed how users with ADHD behave and interact differently on Twitter compared with those without ADHD. On the basis of these differences, researchers, psychiatrists, and clinicians can use Twitter as a potentially powerful platform to monitor and study people with ADHD, provide additional health care support to them, improve the diagnostic criteria of ADHD, and design complementary tools for automatic ADHD detection.

Collapse

Sheoran H, Srivastava P. Self-Reported Depression Is Associated With Aberration in Emotional Reactivity and Emotional Concept Coding. Front Psychol 2022;13:814234. [PMID: 35814123 PMCID: PMC9267768 DOI: 10.3389/fpsyg.2022.814234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 04/06/2022] [Indexed: 12/02/2022] Open

Abstract

Cognitive impairment, alterations in mood, emotion dysregulation are just a few of the consequences of depression. Despite depression being reported as the most common mental disorder worldwide, examining depression or risks of depression is still challenging. Emotional reactivity has been observed to predict the risk of depression, but the results have been mixed for negative emotional reactivity (NER). To better understand the emotional response conflict, we asked our participants to describe their feeling in meaningful sentences alongside reporting their reactions to the emotionally evocative words. We presented a word on the screen and asked participants to perform two tasks, rate their feeling after reading the word using the self-assessment manikin (SAM) scale, and describe their feeling using the property generation task. The emotional content was analyzed using a novel machine-learning algorithm approach. We performed these two tasks in blocks and randomized their order across participants. Beck Depression Inventory (BDI) was used to categorize participants into self-reported non-depressed (ND) and depressed (D) groups. Compared to the ND, the D group reported reduced positive emotional reactivity when presented with extremely pleasant words regardless of their arousal levels. However, no significant difference was observed between the D and ND groups for negative emotional reactivity. In contrast, we observed increased sadness and inclination toward low negative context from descriptive content by the D compared to the ND group. The positive content analyses showed mixed results. The contrasting results between the emotional reactivity and emotional content analyses demand further examination between cohorts of self-reported depressive symptoms, no-symptoms, and MDD patients to better examine the risks of depression and help design early interventions.

Collapse

Zarate D, Stavropoulos V, Ball M, de Sena Collier G, Jacobson NC. Exploring the digital footprint of depression: a PRISMA systematic literature review of the empirical evidence. BMC Psychiatry 2022;22:421. [PMID: 35733121 PMCID: PMC9214685 DOI: 10.1186/s12888-022-04013-y] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Accepted: 05/17/2022] [Indexed: 12/14/2022] Open

Kelley SW, Mhaonaigh CN, Burke L, Whelan R, Gillan CM. Machine learning of language use on Twitter reveals weak and non-specific predictions. NPJ Digit Med 2022;5:35. [PMID: 35338248 PMCID: PMC8956571 DOI: 10.1038/s41746-022-00576-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Accepted: 02/11/2022] [Indexed: 11/30/2022] Open

Using language in social media posts to study the network dynamics of depression longitudinally. Nat Commun 2022;13:870. [PMID: 35169166 PMCID: PMC8847554 DOI: 10.1038/s41467-022-28513-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Accepted: 01/21/2022] [Indexed: 12/13/2022] Open

Salas-Zárate R, Alor-Hernández G, Salas-Zárate MDP, Paredes-Valverde MA, Bustos-López M, Sánchez-Cervantes JL. Detecting Depression Signs on Social Media: A Systematic Literature Review. Healthcare (Basel) 2022;10:healthcare10020291. [PMID: 35206905 PMCID: PMC8871802 DOI: 10.3390/healthcare10020291] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2021] [Revised: 01/21/2022] [Accepted: 01/29/2022] [Indexed: 01/14/2023] Open

Using Machine Learning for Pharmacovigilance: A Systematic Review. Pharmaceutics 2022;14:pharmaceutics14020266. [PMID: 35213998 PMCID: PMC8924891 DOI: 10.3390/pharmaceutics14020266] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Revised: 01/13/2022] [Accepted: 01/21/2022] [Indexed: 02/04/2023] Open

Liu J, Shi M. A Hybrid Feature Selection and Ensemble Approach to Identify Depressed Users in Online Social Media. Front Psychol 2022;12:802821. [PMID: 35115990 PMCID: PMC8803736 DOI: 10.3389/fpsyg.2021.802821] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Accepted: 12/23/2021] [Indexed: 11/13/2022] Open

Razia Sulthana A., Jaithunbi A. K., Harikrishnan H, Varadarajan V. Sentiment Analysis on Movie Reviews Dataset Using Support Vector Machines and Ensemble Learning. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY AND WEB ENGINEERING 2022. [DOI: 10.4018/ijitwe.311428] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Han J, Feng Y, Li N, Feng L, Xiao L, Zhu X, Wang G. Correlation Between Word Frequency and 17 Items of Hamilton Scale in Major Depressive Disorder. Front Psychiatry 2022;13:902873. [PMID: 35592381 PMCID: PMC9110653 DOI: 10.3389/fpsyt.2022.902873] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 04/14/2022] [Indexed: 12/03/2022] Open

Abstract

OBJECTIVE

To explore the correlation between word frequency and 17 items of the Hamilton Depression Scale (HAMD-17) in assessing the severity of depression in clinical interviews.

METHODS

This study included 70 patients with major depressive disorder (MDD) who were hospitalized in the Beijing Anding Hospital. Clinicians interviewed eligible patients, collected general information, disease symptoms, duration, and scored patients with HAMD-17. The words used by the patients during the interview were classified and extracted according to the HowNet sentiment dictionary, including positive evaluation words, positive emotional words, negative evaluation words, negative emotional words. Symptom severity was grouped according to the HAMD-17 score: mild depressive symptoms is 8-17 points, moderate depressive symptoms is 18-24 points and severe depressive symptoms is >24 points. Analysis of Variance (ANOVA) was used to analyze the four categories of words among the groups, and partial correlation analysis was used to analyze the correlation between the four categories of word frequencies based on HowNet sentiment dictionary and the HAMD-17 scale to evaluate the total score. Receiver operating characteristic (ROC) curves were used to determine meaningful cut-off values.

RESULTS

There was a significant difference in negative evaluation words between groups (p = 0.032). After controlling for gender, age and years of education, the HAMD-17 total score was correlated with negative evaluation words (p = 0.009, r = 0.319) and negative emotional words (p = 0.027, r = 0.272), as the severity of depressive symptoms increased, the number of negative evaluation and negative emotional words in clinical interviews increased. Negative evaluation words distinguished patients with mild and moderate-severe depression. The area under the curve is 0.693 (p = 0.006) when the cut-off value is 8.48, the Youden index was 0.41, the sensitivity was 55.2%, and the specificity was 85.4%.

CONCLUSION

In the clinical interview with MDD patients, the number of word frequencies based on HowNet sentiment dictionary may be beneficial in evaluating the severity of depressive symptoms.

Collapse

Cuerda C, Zornoza A, Gallud JA, Tesoriero R, Ayuso DR. Deep learning assisted cognitive diagnosis for the D-Riska application. Soft comput 2021. [DOI: 10.1007/s00500-021-06510-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

Wongkoblap A, Vadillo MA, Curcin V. Deep Learning With Anaphora Resolution for the Detection of Tweeters With Depression: Algorithm Development and Validation Study. JMIR Ment Health 2021;8:e19824. [PMID: 34383688 PMCID: PMC8380581 DOI: 10.2196/19824] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/03/2020] [Revised: 09/02/2020] [Accepted: 03/31/2021] [Indexed: 11/28/2022] Open

Abstract

BACKGROUND

Mental health problems are widely recognized as a major public health challenge worldwide. This concern highlights the need to develop effective tools for detecting mental health disorders in the population. Social networks are a promising source of data wherein patients publish rich personal information that can be mined to extract valuable psychological cues; however, these data come with their own set of challenges, such as the need to disambiguate between statements about oneself and third parties. Traditionally, natural language processing techniques for social media have looked at text classifiers and user classification models separately, hence presenting a challenge for researchers who want to combine text sentiment and user sentiment analysis.

OBJECTIVE

The objective of this study is to develop a predictive model that can detect users with depression from Twitter posts and instantly identify textual content associated with mental health topics. The model can also address the problem of anaphoric resolution and highlight anaphoric interpretations.

METHODS

We retrieved the data set from Twitter by using a regular expression or stream of real-time tweets comprising 3682 users, of which 1983 self-declared their depression and 1699 declared no depression. Two multiple instance learning models were developed-one with and one without an anaphoric resolution encoder-to identify users with depression and highlight posts related to the mental health of the author. Several previously published models were applied to our data set, and their performance was compared with that of our models.

RESULTS

The maximum accuracy, F1 score, and area under the curve of our anaphoric resolution model were 92%, 92%, and 90%, respectively. The model outperformed alternative predictive models, which ranged from classical machine learning models to deep learning models.

CONCLUSIONS

Our model with anaphoric resolution shows promising results when compared with other predictive models and provides valuable insights into textual content that is relevant to the mental health of the tweeter.

Collapse

Zhou J, Zogan H, Yang S, Jameel S, Xu G, Chen F. Detecting Community Depression Dynamics Due to COVID-19 Pandemic in Australia. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS 2021;8:982-991. [PMID: 37982038 PMCID: PMC8545002 DOI: 10.1109/tcss.2020.3047604] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/03/2020] [Revised: 12/07/2020] [Accepted: 12/20/2020] [Indexed: 11/21/2023]

Ramírez-Cifuentes D, Freire A, Baeza-Yates R, Sanz Lamora N, Álvarez A, González-Rodríguez A, Lozano Rochel M, Llobet Vives R, Velazquez DA, Gonfaus JM, Gonzàlez J. Characterization of Anorexia Nervosa on Social Media: Textual, Visual, Relational, Behavioral, and Demographical Analysis. J Med Internet Res 2021;23:e25925. [PMID: 34283033 PMCID: PMC8335610 DOI: 10.2196/25925] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2020] [Revised: 03/10/2021] [Accepted: 05/04/2021] [Indexed: 01/29/2023] Open

Abstract

BACKGROUND

Eating disorders are psychological conditions characterized by unhealthy eating habits. Anorexia nervosa (AN) is defined as the belief of being overweight despite being dangerously underweight. The psychological signs involve emotional and behavioral issues. There is evidence that signs and symptoms can manifest on social media, wherein both harmful and beneficial content is shared daily.

OBJECTIVE

This study aims to characterize Spanish-speaking users showing anorexia signs on Twitter through the extraction and inference of behavioral, demographical, relational, and multimodal data. By using the transtheoretical model of health behavior change, we focus on characterizing and comparing users at the different stages of the model for overcoming AN, including treatment and full recovery periods.

METHODS

We analyzed the writings, posting patterns, social relationships, and images shared by Twitter users who underwent different stages of anorexia nervosa and compared the differences among users going through each stage of the illness and users in the control group (ie, users without AN). We also analyzed the topics of interest of their followees (ie, users followed by study participants). We used a clustering approach to distinguish users at an early phase of the illness (precontemplation) from those that recognize that their behavior is problematic (contemplation) and generated models for the detection of tweets and images related to AN. We considered two types of control users-focused control users, which are those that use terms related to anorexia, and random control users.

RESULTS

We found significant differences between users at each stage of the recovery process (P<.001) and control groups. Users with AN tweeted more frequently at night, with a median sleep time tweets ratio (STTR) of 0.05, than random control users (STTR=0.04) and focused control users (STTR=0.03). Pictures were relevant for the characterization of users. Focused and random control users were characterized by the use of text in their profile pictures. We also found a strong polarization between focused control users and users in the first stages of the disorder. There was a strong correlation among the shared interests between users with AN and their followees (ρ=0.96). In addition, the interests of recovered users and users in treatment were more highly correlated to those corresponding to the focused control group (ρ=0.87 for both) than those of AN users (ρ=0.67), suggesting a shift in users' interest during the recovery process.

CONCLUSIONS

We mapped the signs of AN to social media context. These results support the findings of previous studies that focused on other languages and involved a deep analysis of the topics of interest of users at each phase of the disorder. The features and patterns identified provide a basis for the development of detection tools and recommender systems.

Collapse

Cohrdes C, Yenikent S, Wu J, Ghanem B, Franco-Salvador M, Vogelgesang F. Indications of Depressive Symptoms During the COVID-19 Pandemic in Germany: Comparison of National Survey and Twitter Data. JMIR Ment Health 2021;8:e27140. [PMID: 34142973 PMCID: PMC8216331 DOI: 10.2196/27140] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Revised: 04/25/2021] [Accepted: 04/29/2021] [Indexed: 02/02/2023] Open

Abstract

BACKGROUND

The current COVID-19 pandemic is associated with extensive individual and societal challenges, including challenges to both physical and mental health. To date, the development of mental health problems such as depressive symptoms accompanying population-based federal distancing measures is largely unknown, and opportunities for rapid, effective, and valid monitoring are currently a relevant matter of investigation.

OBJECTIVE

In this study, we aim to investigate, first, the temporal progression of depressive symptoms during the COVID-19 pandemic and, second, the consistency of the results from tweets and survey-based self-reports of depressive symptoms within the same time period.

METHODS

Based on a cross-sectional population survey of 9011 German adolescents and adults (n=4659, 51.7% female; age groups from 15 to 50 years and older) and a sample of 88,900 tweets (n=74,587, 83.9% female; age groups from 10 to 50 years and older), we investigated five depressive symptoms (eg, depressed mood and energy loss) using items from the Patient Health Questionnaire (PHQ-8) before, during, and after relaxation of the first German social contact ban from January to July 2020.

RESULTS

On average, feelings of worthlessness were the least frequently reported symptom (survey: n=1011, 13.9%; Twitter: n=5103, 5.7%) and fatigue or loss of energy was the most frequently reported depressive symptom (survey: n=4472, 51.6%; Twitter: n=31,005, 34.9%) among both the survey and Twitter respondents. Young adult women and people living in federal districts with high COVID-19 infection rates were at an increased risk for depressive symptoms. The comparison of the survey and Twitter data before and after the first contact ban showed that German adolescents and adults had a significant decrease in feelings of fatigue and energy loss over time. The temporal progression of depressive symptoms showed high correspondence between both data sources (ρ=0.76-0.93; P<.001), except for diminished interest and depressed mood, which showed a steady increase even after the relaxation of the contact ban among the Twitter respondents but not among the survey respondents.

CONCLUSIONS

Overall, the results indicate relatively small differences in depressive symptoms associated with social distancing measures during the COVID-19 pandemic and highlight the need to differentiate between positive (eg, energy level) and negative (eg, depressed mood) associations and variations over time. The results also underscore previous suggestions of Twitter data's potential to help identify hot spots of declining and improving public mental health and thereby help provide early intervention measures, especially for young and middle-aged adults. Further efforts are needed to investigate the long-term consequences of recurring lockdown phases and to address the limitations of social media data such as Twitter data to establish real-time public mental surveillance approaches.

Collapse

Population attitudes toward contraceptive methods over time on a social media platform. Am J Obstet Gynecol 2021;224:597.e1-597.e14. [PMID: 33309562 DOI: 10.1016/j.ajog.2020.11.042] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Revised: 11/03/2020] [Accepted: 11/26/2020] [Indexed: 02/03/2023]

Abstract

BACKGROUND

Contraceptive method choice is often strongly influenced by the experiences and opinions of one's social network. Although social media, including Twitter, increasingly influences reproductive-age individuals, discussion of contraception in this setting has yet to be characterized. Natural language processing, a type of machine learning in which computers analyze natural language data, enables this analysis.

OBJECTIVE

This study aimed to illuminate temporal trends in attitudes toward long- and short-acting reversible contraceptive methods in tweets between 2006 and 2019 and establish social media platforms as alternate data sources for large-scale sentiment analysis on contraception.

STUDY DESIGN

We studied English-language tweets mentioning reversible prescription contraceptive methods between March 2006 (founding of Twitter) and December 2019. Tweets mentioning contraception were extracted using search terms, including generic or brand names, colloquial names, and abbreviations. We characterized and performed sentiment analysis on tweets. We used Mann-Kendall nonparametric tests to assess temporal trends in the overall number and the number of positive, negative, and neutral tweets referring to each method. The code to reproduce this analysis is available at https://github.com/hms-dbmi/contraceptionOnTwitter.

RESULTS

We extracted 838,739 tweets mentioning at least 1 contraceptive method. The annual number of contraception-related tweets increased considerably over the study period. The intrauterine device was the most commonly referenced method (45.9%). Long-acting methods were mentioned more often than short-acting ones (58% vs 42%), and the annual proportion of long-acting reversible contraception-related tweets increased over time. In sentiment analysis of tweets mentioning a single contraceptive method (n=665,064), the greatest proportion of all tweets was negative (65,339 of 160,713 tweets with at least 95% confident sentiment, or 40.66%). Tweets mentioning long-acting methods were nearly twice as likely to be positive compared with tweets mentioning short-acting methods (19.65% vs 10.21%; P<.002).

CONCLUSION

Recognizing the influence of social networks on contraceptive decision making, social media platforms may be useful in the collection and dissemination of information about contraception.

Collapse

Kelly DL, Spaderna M, Hodzic V, Coppersmith G, Chen S, Resnik P. Can language use in social media help in the treatment of severe mental illness? CURRENT RESEARCH IN PSYCHIATRY 2021;1:1-4. [PMID: 34532718 PMCID: PMC8442995 DOI: 10.46439/psychiatry.1.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]

Ávila-Tomás JF, Mayer-Pujadas MA, Quesada-Varela VJ. [Artificial intelligence and its applications in medicine II: Current importance and practical applications]. Aten Primaria 2021;53:81-88. [PMID: 32571595 PMCID: PMC7752970 DOI: 10.1016/j.aprim.2020.04.014] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Accepted: 04/22/2020] [Indexed: 12/16/2022] Open

Leis A, Ronzano F, Mayer MA, Furlong LI, Sanz F. Evaluating Behavioral and Linguistic Changes During Drug Treatment for Depression Using Tweets in Spanish: Pairwise Comparison Study. J Med Internet Res 2020;22:e20920. [PMID: 33337338 PMCID: PMC7775819 DOI: 10.2196/20920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2020] [Revised: 09/01/2020] [Accepted: 11/12/2020] [Indexed: 11/13/2022] Open

Abstract

Background

Depressive disorders are the most common mental illnesses, and they constitute the leading cause of disability worldwide. Selective serotonin reuptake inhibitors (SSRIs) are the most commonly prescribed drugs for the treatment of depressive disorders. Some people share information about their experiences with antidepressants on social media platforms such as Twitter. Analysis of the messages posted by Twitter users under SSRI treatment can yield useful information on how these antidepressants affect users’ behavior.

Objective

This study aims to compare the behavioral and linguistic characteristics of the tweets posted while users were likely to be under SSRI treatment, in comparison to the tweets posted by the same users when they were less likely to be taking this medication.

Methods

In the first step, the timelines of Twitter users mentioning SSRI antidepressants in their tweets were selected using a list of 128 generic and brand names of SSRIs. In the second step, two datasets of tweets were created, the in-treatment dataset (made up of the tweets posted throughout the 30 days after mentioning an SSRI) and the unknown-treatment dataset (made up of tweets posted more than 90 days before or more than 90 days after any tweet mentioning an SSRI). For each user, the changes in behavioral and linguistic features between the tweets classified in these two datasets were analyzed. 186 users and their timelines with 668,842 tweets were finally included in the study.

Results

The number of tweets generated per day by the users when they were in treatment was higher than it was when they were in the unknown-treatment period (P=.001). When the users were in treatment, the mean percentage of tweets posted during the daytime (from 8 AM to midnight) increased in comparison to the unknown-treatment period (P=.002). The number of characters and words per tweet was higher when the users were in treatment (P=.03 and P=.02, respectively). Regarding linguistic features, the percentage of pronouns that were first-person singular was higher when users were in treatment (P=.008).

Conclusions

Behavioral and linguistic changes have been detected when users with depression are taking antidepressant medication. These features can provide interesting insights for monitoring the evolution of this disease, as well as offering additional information related to treatment adherence. This information may be especially useful in patients who are receiving long-term treatments such as people suffering from depression.

Collapse

Kelly DL, Spaderna M, Hodzic V, Nair S, Kitchen C, Werkheiser AE, Powell MM, Liu F, Coppersmith G, Chen S, Resnik P. Blinded Clinical Ratings of Social Media Data are Correlated with In-Person Clinical Ratings in Participants Diagnosed with Either Depression, Schizophrenia, or Healthy Controls. Psychiatry Res 2020;294:113496. [PMID: 33065372 DOI: 10.1016/j.psychres.2020.113496] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Accepted: 10/01/2020] [Indexed: 12/16/2022]

Garcia-Rudolph A, Saurí J, Cegarra B, Bernabeu Guitart M. Discovering the Context of People With Disabilities: Semantic Categorization Test and Environmental Factors Mapping of Word Embeddings from Reddit. JMIR Med Inform 2020;8:e17903. [PMID: 33216006 PMCID: PMC7718084 DOI: 10.2196/17903] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Revised: 04/17/2020] [Accepted: 04/19/2020] [Indexed: 11/13/2022] Open

Abstract

BACKGROUND

The World Health Organization's International Classification of Functioning Disability and Health (ICF) conceptualizes disability not solely as a problem that resides in the individual, but as a health experience that occurs in a context. Word embeddings build on the idea that words that occur in similar contexts tend to have similar meanings. In spite of both sharing "context" as a key component, word embeddings have been scarcely applied in disability. In this work, we propose social media (particularly, Reddit) to link them.

OBJECTIVE

The objective of our study is to train a model for generating word associations using a small dataset (a subreddit on disability) able to retrieve meaningful content. This content will be formally validated and applied to the discovery of related terms in the corpus of the disability subreddit that represent the physical, social, and attitudinal environment (as defined by a formal framework like the ICF) of people with disabilities.

METHODS

Reddit data were collected from pushshift.io with the pushshiftr R package as a wrapper. A word2vec model was trained with the wordVectors R package using the disability subreddit comments, and a preliminary validation was performed using a subset of Mikolov analogies. We used Van Overschelde's updated and expanded version of the Battig and Montague norms to perform a semantic categories test. Silhouette coefficients were calculated using cosine distance from the wordVectors R package. For each of the 5 ICF environmental factors (EF), we selected representative subcategories addressing different aspects of daily living (ADLs); then, for each subcategory, we identified specific terms extracted from their formal ICF definition and ran the word2vec model to generate their nearest semantic terms, validating the obtained nearest semantic terms using public evidence. Finally, we applied the model to a specific subcategory of an EF involved in a relevant use case in the field of rehabilitation.

RESULTS

We analyzed 96,314 comments posted between February 2009 and December 2019, by 10,411 Redditors. We trained word2vec and identified more than 30 analogies (eg, breakfast - 8 am + 8 pm = dinner). The semantic categorization test showed promising results over 60 categories; for example, s(A relative)=0.562, s(A sport)=0.475 provided remarkable explanations for low s values. We mapped the representative subcategories of all EF chapters and obtained the closest terms for each, which we confirmed with publications. This allowed immediate access (≤ 2 seconds) to the terms related to ADLs, ranging from apps "to know accessibility before you go" to adapted sports (boccia). For example, for the support and relationships EF subcategory, the closest term discovered by our model was "resilience," recently regarded as a key feature of rehabilitation, not yet having one unified definition. Our model discovered 10 closest terms, which we validated with publications, contributing to the "resilience" definition.

CONCLUSIONS

This study opens up interesting opportunities for the exploration and discovery of the use of a word2vec model that has been trained with a small disability dataset, leading to immediate, accurate, and often unknown (for authors, in many cases) terms related to ADLs within the ICF framework.

Collapse

Ramírez-Cifuentes D, Freire A, Baeza-Yates R, Puntí J, Medina-Bravo P, Velazquez DA, Gonfaus JM, Gonzàlez J. Detection of Suicidal Ideation on Social Media: Multimodal, Relational, and Behavioral Analysis. J Med Internet Res 2020;22:e17758. [PMID: 32673256 PMCID: PMC7381053 DOI: 10.2196/17758] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2020] [Revised: 03/28/2020] [Accepted: 03/28/2020] [Indexed: 12/13/2022] Open

Abstract

Background

Suicide risk assessment usually involves an interaction between doctors and patients. However, a significant number of people with mental disorders receive no treatment for their condition due to the limited access to mental health care facilities; the reduced availability of clinicians; the lack of awareness; and stigma, neglect, and discrimination surrounding mental disorders. In contrast, internet access and social media usage have increased significantly, providing experts and patients with a means of communication that may contribute to the development of methods to detect mental health issues among social media users.

Objective

This paper aimed to describe an approach for the suicide risk assessment of Spanish-speaking users on social media. We aimed to explore behavioral, relational, and multimodal data extracted from multiple social platforms and develop machine learning models to detect users at risk.

Methods

We characterized users based on their writings, posting patterns, relations with other users, and images posted. We also evaluated statistical and deep learning approaches to handle multimodal data for the detection of users with signs of suicidal ideation (suicidal ideation risk group). Our methods were evaluated over a dataset of 252 users annotated by clinicians. To evaluate the performance of our models, we distinguished 2 control groups: users who make use of suicide-related vocabulary (focused control group) and generic random users (generic control group).

Results

We identified significant statistical differences between the textual and behavioral attributes of each of the control groups compared with the suicidal ideation risk group. At a 95% CI, when comparing the suicidal ideation risk group and the focused control group, the number of friends (P=.04) and median tweet length (P=.04) were significantly different. The median number of friends for a focused control user (median 578.5) was higher than that for a user at risk (median 372.0). Similarly, the median tweet length was higher for focused control users, with 16 words against 13 words of suicidal ideation risk users. Our findings also show that the combination of textual, visual, relational, and behavioral data outperforms the accuracy of using each modality separately. We defined text-based baseline models based on bag of words and word embeddings, which were outperformed by our models, obtaining an increase in accuracy of up to 8% when distinguishing users at risk from both types of control users.

Conclusions

The types of attributes analyzed are significant for detecting users at risk, and their combination outperforms the results provided by generic, exclusively text-based baseline models. After evaluating the contribution of image-based predictive models, we believe that our results can be improved by enhancing the models based on textual and relational features. These methods can be extended and applied to different use cases related to other mental disorders.

Collapse

Patra BG, Kar R, Roberts K, Wu H. Mental Health Severity Detection from Psychological Forum Data using Domain-Specific Unlabelled Data. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE PROCEEDINGS. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE 2020;2020:487-496. [PMID: 32477670 PMCID: PMC7233051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Wang J, Deng H, Liu B, Hu A, Liang J, Fan L, Zheng X, Wang T, Lei J. Systematic Evaluation of Research Progress on Natural Language Processing in Medicine Over the Past 20 Years: Bibliometric Study on PubMed. J Med Internet Res 2020;22:e16816. [PMID: 32012074 PMCID: PMC7005695 DOI: 10.2196/16816] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2019] [Revised: 12/05/2019] [Accepted: 12/15/2019] [Indexed: 12/15/2022] Open

Abstract

BACKGROUND

Natural language processing (NLP) is an important traditional field in computer science, but its application in medical research has faced many challenges. With the extensive digitalization of medical information globally and increasing importance of understanding and mining big data in the medical field, NLP is becoming more crucial.

OBJECTIVE

The goal of the research was to perform a systematic review on the use of NLP in medical research with the aim of understanding the global progress on NLP research outcomes, content, methods, and study groups involved.

METHODS

A systematic review was conducted using the PubMed database as a search platform. All published studies on the application of NLP in medicine (except biomedicine) during the 20 years between 1999 and 2018 were retrieved. The data obtained from these published studies were cleaned and structured. Excel (Microsoft Corp) and VOSviewer (Nees Jan van Eck and Ludo Waltman) were used to perform bibliometric analysis of publication trends, author orders, countries, institutions, collaboration relationships, research hot spots, diseases studied, and research methods.

RESULTS

A total of 3498 articles were obtained during initial screening, and 2336 articles were found to meet the study criteria after manual screening. The number of publications increased every year, with a significant growth after 2012 (number of publications ranged from 148 to a maximum of 302 annually). The United States has occupied the leading position since the inception of the field, with the largest number of articles published. The United States contributed to 63.01% (1472/2336) of all publications, followed by France (5.44%, 127/2336) and the United Kingdom (3.51%, 82/2336). The author with the largest number of articles published was Hongfang Liu (70), while Stéphane Meystre (17) and Hua Xu (33) published the largest number of articles as the first and corresponding authors. Among the first author's affiliation institution, Columbia University published the largest number of articles, accounting for 4.54% (106/2336) of the total. Specifically, approximately one-fifth (17.68%, 413/2336) of the articles involved research on specific diseases, and the subject areas primarily focused on mental illness (16.46%, 68/413), breast cancer (5.81%, 24/413), and pneumonia (4.12%, 17/413).

CONCLUSIONS

NLP is in a period of robust development in the medical field, with an average of approximately 100 publications annually. Electronic medical records were the most used research materials, but social media such as Twitter have become important research materials since 2015. Cancer (24.94%, 103/413) was the most common subject area in NLP-assisted medical research on diseases, with breast cancers (23.30%, 24/103) and lung cancers (14.56%, 15/103) accounting for the highest proportions of studies. Columbia University and the talents trained therein were the most active and prolific research forces on NLP in the medical field.

Collapse