Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zhang S, Grave E, Sklar E, Elhadad N. Longitudinal analysis of discussion topics in an online breast cancer community using convolutional neural networks. J Biomed Inform 2017;69:1-9. [PMID: 28323113 DOI: 10.1016/j.jbi.2017.03.012] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2016] [Revised: 03/14/2017] [Accepted: 03/16/2017] [Indexed: 11/30/2022]

For:	Zhang S, Grave E, Sklar E, Elhadad N. Longitudinal analysis of discussion topics in an online breast cancer community using convolutional neural networks. J Biomed Inform 2017;69:1-9. [PMID: 28323113 DOI: 10.1016/j.jbi.2017.03.012] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2016] [Revised: 03/14/2017] [Accepted: 03/16/2017] [Indexed: 11/30/2022]

Number

Cited by Other Article(s)

Da C, Duan Y, Ji Z, Chen J, Xia H, Weng Y, Zhou T, Yuan C, Cai T. Assessing the needs of patients with breast cancer and their families across various treatment phases using a Latent Dirichlet Allocation model: a text-mining approach to online health communities. Support Care Cancer 2024;32:314. [PMID: 38683417 DOI: 10.1007/s00520-024-08513-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 04/17/2024] [Indexed: 05/01/2024]

Abstract

PURPOSE

This study aimed to assess the different needs of patients with breast cancer and their families in online health communities at different treatment phases using a Latent Dirichlet Allocation (LDA) model.

METHODS

Using Python, breast cancer-related posts were collected from two online health communities: patient-to-patient and patient-to-doctor. After data cleaning, eligible posts were categorized based on the treatment phase. Subsequently, an LDA model identifying the distinct need-related topics for each phase of treatment, including data preprocessing and LDA topic modeling, was established. Additionally, the demographic and interactive features of the posts were manually analyzed.

RESULTS

We collected 84,043 posts, of which 9504 posts were included after data cleaning. Early diagnosis and rehabilitation treatment phases had the highest and lowest number of posts, respectively. LDA identified 11 topics: three in the initial diagnosis phase and two in each of the remaining treatment phases. The topics included disease outcomes, diagnosis analysis, treatment information, and emotional support in the initial diagnosis phase; surgical options and outcomes, postoperative care, and treatment planning in the perioperative treatment phase; treatment options and costs, side effects management, and disease prognosis assessment in the non-operative treatment phase; diagnosis and treatment options, disease prognosis, and emotional support in the relapse and metastasis treatment phase; and follow-up and recurrence concerns, physical symptoms, and lifestyle adjustments in the rehabilitation treatment phase.

CONCLUSION

The needs of patients with breast cancer and their families differ across various phases of cancer therapy. Therefore, specific information or emotional assistance should be tailored to each phase of treatment based on the unique needs of patients and their families.

Collapse

Shah AM, Lee KY, Hidayat A, Falchook A, Muhammad W. A text analytics approach for mining public discussions in online cancer forum: Analysis of multi-intent lung cancer treatment dataset. Int J Med Inform 2024;184:105375. [PMID: 38367390 DOI: 10.1016/j.ijmedinf.2024.105375] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 01/25/2024] [Accepted: 02/07/2024] [Indexed: 02/19/2024]

Abstract

BACKGROUND

Online cancer forums (OCF) are increasingly popular platforms for patients and caregivers to discuss, seek information on, and share opinions about diseases and treatments. This interaction generates a substantial amount of unstructured text data, necessitating deeper exploration. Using time series data, our study exploits topic modeling in the novel domain of online cancer forums (OCFs) to identify meaningful topics and changing dynamics of online discussion across different lung cancer treatment intent groups.

METHODS

For this purpose, a dataset comprising 27,998 forum posts about lung cancer was collected from three OCFs: lungcancer.net, lungevity.org, and reddit.com, spanning the years 2016 to 2018.

RESULTS

The analysis reflects the public discussion on multi-intent lung cancer treatment over time, taking into account seasonal variations. Discussions on cancer symptoms and prevention garnered the most attention, dominating both curative and palliative care discussions. There were distinct seasonal peaks: curative care topics surged from winter to late spring, while palliative care topics peaked from late summer to mid-autumn. Keyword analysis highlighted that lung cancer diagnosis and treatment were primary topics, whereas cancer prevention and treatment outcomes were predominant across multi-care contexts. For the study period, curative care discussions predominantly revolved around informational support and disease syndromes. In contrast, social support and cancer prevention prevailed in the palliative care context. Notably, topics such as cancer screening and cancer treatment exhibit pronounced seasonal variations in curative care, peaking in frequency during the summers (May to August) of the study period. Meanwhile, the topic of tumor control within palliative care showed significant seasonal influence during the winters and summers of 2017 and 2018.

CONCLUSION

Our text analysis approach using OCF data shows potential for computational methods in this novel domain to gain insights into trends in public cancer communication and seasonal variations for a better understanding of improving personalized care, decision support, treatment outcomes, and quality of life.

Collapse

Xiang M, Zhong D, Han M, Lv K. A Study on Online Health Community Users' Information Demands Based on the BERT-LDA Model. Healthcare (Basel) 2023;11:2142. [PMID: 37570382 PMCID: PMC10419037 DOI: 10.3390/healthcare11152142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2023] [Revised: 07/17/2023] [Accepted: 07/24/2023] [Indexed: 08/13/2023] Open

Xu Q, Zhou Y, Liao B, Xin Z, Xie W, Hu C, Luo A. Named Entity Recognition of Diabetes Online Health Community Data Using Multiple Machine Learning Models. Bioengineering (Basel) 2023;10:659. [PMID: 37370590 DOI: 10.3390/bioengineering10060659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 05/19/2023] [Accepted: 05/25/2023] [Indexed: 06/29/2023] Open

Affiliation(s)

Qian Xu Second Xiangya Hospital, Central South University, Changsha 410011, China School of Life Sciences, Central South University, Changsha 410013, China College of Computer Science and Engineering, Jishou University, Jishou 416000, China Key Laboratory of Medical Information Research, Central South University, College of Hunan Province, Changsha 410013, China Clinical Research Center for Cardiovascular Intelligent Healthcare in Hunan Province, Changsha 410011, China
Yue Zhou Second Xiangya Hospital, Central South University, Changsha 410011, China School of Life Sciences, Central South University, Changsha 410013, China Key Laboratory of Medical Information Research, Central South University, College of Hunan Province, Changsha 410013, China Clinical Research Center for Cardiovascular Intelligent Healthcare in Hunan Province, Changsha 410011, China
Bolin Liao College of Computer Science and Engineering, Jishou University, Jishou 416000, China
Zirui Xin Second Xiangya Hospital, Central South University, Changsha 410011, China Key Laboratory of Medical Information Research, Central South University, College of Hunan Province, Changsha 410013, China Clinical Research Center for Cardiovascular Intelligent Healthcare in Hunan Province, Changsha 410011, China
Wenzhao Xie Key Laboratory of Medical Information Research, Central South University, College of Hunan Province, Changsha 410013, China Clinical Research Center for Cardiovascular Intelligent Healthcare in Hunan Province, Changsha 410011, China
Chao Hu Big Data Institute, Central South University, Changsha 410011, China
Aijing Luo Second Xiangya Hospital, Central South University, Changsha 410011, China Key Laboratory of Medical Information Research, Central South University, College of Hunan Province, Changsha 410013, China Clinical Research Center for Cardiovascular Intelligent Healthcare in Hunan Province, Changsha 410011, China

Collapse

Singh T, Roberts K, Cohen T, Cobb N, Franklin A, Myneni S. Discerning conversational context in online health communities for personalized digital behavior change solutions using Pragmatics to Reveal Intent in Social Media (PRISM) framework. J Biomed Inform 2023;140:104324. [PMID: 36842490 PMCID: PMC10206862 DOI: 10.1016/j.jbi.2023.104324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2022] [Revised: 02/18/2023] [Accepted: 02/21/2023] [Indexed: 02/28/2023]

Abstract

BACKGROUND

Online health communities (OHCs) have emerged as prominent platforms for behavior modification, and the digitization of online peer interactions has afforded researchers with unique opportunities to model multilevel mechanisms that drive behavior change. Existing studies, however, have been limited by a lack of methods that allow the capture of conversational context and socio-behavioral dynamics at scale, as manifested in these digital platforms.

OBJECTIVE

We develop, evaluate, and apply a novel methodological framework, Pragmatics to Reveal Intent in Social Media (PRISM), to facilitate granular characterization of peer interactions by combining multidimensional facets of human communication.

METHODS

We developed and applied PRISM to analyze peer interactions (N = 2.23 million) in QuitNet, an OHC for tobacco cessation. First, we generated a labeled set of peer interactions (n = 2,005) through manual annotation along three dimensions: communication themes (CTs), behavior change techniques (BCTs), and speech acts (SAs). Second, we used deep learning models to apply our qualitative codes at scale. Third, we applied our validated model to perform a retrospective analysis. Finally, using social network analysis (SNA), we portrayed large-scale patterns and relationships among the aforementioned communication dimensions embedded in peer interactions in QuitNet.

RESULTS

Qualitative analysis showed that the themes of social support and behavioral progress were common. The most used BCTs were feedback and monitoring and comparison of behavior, and users most commonly expressed their intentions using SAs-expressive and emotion. With additional in-domain pre-training, bidirectional encoder representations from Transformers (BERT) outperformed other deep learning models on the classification tasks. Content-specific SNA revealed that users' engagement or abstinence status is associated with the prevalence of various categories of BCTs and SAs, which also was evident from the visualization of network structures.

CONCLUSIONS

Our study describes the interplay of multilevel characteristics of online communication and their association with individual health behaviors.

Collapse

Medical QA Oriented Multi-Task Learning Model for Question Intent Classification and Named Entity Recognition. INFORMATION 2022. [DOI: 10.3390/info13120581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open

Wang S, Song F, Qiao Q, Liu Y, Chen J, Ma J. A Comparative Study of Natural Language Processing Algorithms Based on Cities Changing Diabetes Vulnerability Data. Healthcare (Basel) 2022;10:healthcare10061119. [PMID: 35742169 PMCID: PMC9223144 DOI: 10.3390/healthcare10061119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Revised: 06/08/2022] [Accepted: 06/13/2022] [Indexed: 11/16/2022] Open

Dehdarirad T, Freer J. Is there alignment amongst scientific literature, news media and patient forums regarding topics?: A study of breast and lung cancer. ONLINE INFORMATION REVIEW 2021. [DOI: 10.1108/oir-06-2020-0228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Abstract PurposeDuring recent years, web technologies and mass media have become prevalent in the context of medicine and health. Two examples of important web technologies used in health are news media and patient forums. Both have a significant role in shaping patients' perspective and behaviour in relation to health and illness, as well as the way that they might choose or change their treatment. In this paper, the authors investigated the application of web technologies using the data analysis approach. The authors did this analysis from the point of view of topics being discussed and disseminated via patients and journalists in breast and lung cancer. The study also investigated the (dis)alignment amongst these two groups and scientists in terms of topics.Design/methodology/approachThree data sets comprised documents published between 2014 and 2018 obtained from ProQuest and Web of Science Medline databases, alongside data from three major patient forums on breast and lung cancer. The analysis and visualisation in this paper have been done using the udpipe, igraph R packages and VOSviewer.FindingsThe study’s findings showed that in general scientists focussed more on prognosis and treatment of cancer, whereas patients and journalists focussed more on detection, prevention and role of social and emotional support. The only exception was for news coverage of lung cancer where the largest cluster was related to treatment, research in cancer treatment and therapies. However, when comparing coverage by scientists and journalists in terms of treatment, the focus of news articles in both cancer types was mainly on chemotherapy and complimentary therapies. Finally, topics such as lifestyle or pain management were only discussed by breast cancer patients.Originality/valueThe results obtained from this study may provide valuable insights into topics of interest for each group of scientists, journalist and patients as well as (dis)alignment among them in terms of topics. These findings are important as scientific research is heavily dependent on communication, and research does not exist in a bubble. Scientists and journalists can gain insights from patients' experiences and needs, which in turn may help them to have a more holistic and realistic view.Peer reviewThe peer review history for this article is available at: https://publons.com/publon/10.1108/OIR-06-2020-0228 Collapse

Sarker A, DeRoos A, Perrone J. Mining social media for prescription medication abuse monitoring: a review and proposal for a data-centric framework. J Am Med Inform Assoc 2021;27:315-329. [PMID: 31584645 PMCID: PMC7025330 DOI: 10.1093/jamia/ocz162] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2019] [Revised: 08/14/2019] [Indexed: 01/02/2023] Open

Wang X, High A, Wang X, Zhao K. Predicting users' continued engagement in online health communities from the quantity and quality of received support. J Assoc Inf Sci Technol 2020. [DOI: 10.1002/asi.24436] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

HCI for biomedical decision-making: From diagnosis to therapy. J Biomed Inform 2020;111:103593. [PMID: 33069887 DOI: 10.1016/j.jbi.2020.103593] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Accepted: 10/06/2020] [Indexed: 01/08/2023]

Wang X, Zhao K, Zhou X, Street N. Predicting User Posting Activities in Online Health Communities with Deep Learning. ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS 2020. [DOI: 10.1145/3383780] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

An attention-based multi-task model for named entity recognition and intent analysis of Chinese online medical questions. J Biomed Inform 2020;108:103511. [DOI: 10.1016/j.jbi.2020.103511] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Revised: 07/03/2020] [Accepted: 07/07/2020] [Indexed: 01/22/2023]

Griffin AC, Topaloglu U, Davis S, Chung AE. From Patient Engagement to Precision Oncology: Leveraging Informatics to Advance Cancer Care. Yearb Med Inform 2020;29:235-242. [PMID: 32823322 PMCID: PMC7442514 DOI: 10.1055/s-0040-1701983] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open

Solikhah S, Matahari R, Utami FP, Handayani L, Marwati TA. Breast cancer stigma among Indonesian women: a case study of breast cancer patients. BMC WOMENS HEALTH 2020;20:116. [PMID: 32493375 PMCID: PMC7268729 DOI: 10.1186/s12905-020-00983-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Accepted: 05/26/2020] [Indexed: 02/27/2023]

Yin Z, Harrell M, Warner JL, Chen Q, Fabbri D, Malin BA. The therapy is making me sick: how online portal communications between breast cancer patients and physicians indicate medication discontinuation. J Am Med Inform Assoc 2019;25:1444-1451. [PMID: 30380083 DOI: 10.1093/jamia/ocy118] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2018] [Accepted: 08/10/2018] [Indexed: 12/13/2022] Open

Abstract

Objective

Online platforms have created a variety of opportunities for breast patients to discuss their hormonal therapy, a long-term adjuvant treatment to reduce the chance of breast cancer occurrence and mortality. The goal of this investigation is to ascertain the extent to which the messages breast cancer patients communicated through an online portal can indicate their potential for discontinuing hormonal therapy.

Materials and Methods

We studied the de-identified electronic medical records of 1106 breast cancer patients who were prescribed hormonal therapy at Vanderbilt University Medical Center over a 12-year period. We designed a data-driven approach to investigate patients' patterns of messaging with healthcare providers, the topics they communicated, and the extent to which these messaging behaviors associate with the likelihood that a patient will discontinue a prescribed 5-year regimen of therapy.

Results

The results indicates that messaging rate over time [hazard ratio (HR) = 1.373, P = 0.002], mentions of side effects (HR = 1.214, P = 0.006), and surgery-related topics (HR = 1.170, P = 0.034) were associated with increased risk of early medication discontinuation. In contrast, seeking professional suggestions (HR = 0.766, P = 0.002), expressing gratitude to healthcare providers (HR = 0.872, P = 0.044), and mentions of drugs used to treat side effects (HR = 0.807, P = 0.013) were associated with decreased risk of medication discontinuation.

Discussion and Conclusion

This investigation suggests that patient-generated content can inform the study of health-related behaviors. Given that approximately 50% of breast cancer patients do not complete a course of hormonal therapy as described, the identification of factors associated with medication discontinuation can facilitate real-time interventions to prevent early discontinuation.

Collapse

Manas S, Young LE, Fujimoto K, Franklin A, Myneni S. Exploring the Social Structure of a Health-Related Online Community for Tobacco Cessation: A Two-Mode Network Approach. Stud Health Technol Inform 2019;264:1268-1272. [PMID: 31438129 PMCID: PMC7656969 DOI: 10.3233/shti190430] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Conway M, Hu M, Chapman WW. Recent Advances in Using Natural Language Processing to Address Public Health Research Questions Using Social Media and ConsumerGenerated Data. Yearb Med Inform 2019;28:208-217. [PMID: 31419834 PMCID: PMC6697505 DOI: 10.1055/s-0039-1677918] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open

Yin Z, Sulieman LM, Malin BA. A systematic literature review of machine learning in online personal health data. J Am Med Inform Assoc 2019;26:561-576. [PMID: 30908576 PMCID: PMC7647332 DOI: 10.1093/jamia/ocz009] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2018] [Revised: 01/06/2019] [Accepted: 01/11/2019] [Indexed: 02/07/2023] Open

Zhang Z, Hu Z, Yang H, Zhu R, Zuo D. Factorization machines and deep views-based co-training for improving answer quality prediction in online health expert question-answering services. J Biomed Inform 2018;87:21-36. [PMID: 30240803 DOI: 10.1016/j.jbi.2018.09.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2018] [Revised: 08/27/2018] [Accepted: 09/17/2018] [Indexed: 11/26/2022]

Abstract

In online health expert question-answering (HQA) services, it is significant to automatically determine the quality of the answers. There are two prominent challenges in this task. First, the answers are usually written in short text, which makes it difficult to absorb the text semantic information. Second, it usually lacks sufficient labeled data but contains a huge amount of unlabeled data. To tackle these challenges, we propose a novel deep co-training framework based on factorization machines (FM) and deep textual views to intelligently and automatically identify the quality of HQA systems. More specifically, we exploit additional domain-specific semantic information from domain-specific word embeddings to expand the semantic space of short text and apply FM to excavate the non-independent interaction relationships among diverse features within individual views for improving the performance of the base classifier via co-training. Our learned deep textual views, the convolutional neural networks (CNN) view which focuses on extracting local features using convolution filters to locally model short text and the dependency-sensitive convolutional neural networks (DSCNN) view which focuses on capturing long-distance dependency information within the text to globally model short text, can then overcome the challenge of feature sparseness in the short text answers from the doctors. The developed co-training framework can effectively mine the highly non-linear semantic information embedded in the unlabeled data and expose the highly non-linear relationships between different views, which minimizes the labeling effort. Finally, we conduct extensive empirical evaluations and demonstrate that our proposed method can significantly improve the predictive performance of the answer quality in the context of HQA services.

Collapse

Myneni S, Sridharan V, Cobb N, Cohen T. Content-Sensitive Characterization of Peer Interactions of Highly Engaged Users in an Online Community for Smoking Cessation: Mixed-Methods Approach for Modeling User Engagement in Health Promotion Interventions. J Particip Med 2018;10:e9. [PMID: 33052116 PMCID: PMC7434072 DOI: 10.2196/jopm.9745] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2017] [Revised: 05/16/2018] [Accepted: 06/22/2018] [Indexed: 11/13/2022] Open

Abstract

Background

Online communities provide affordable venues for behavior change. However, active user engagement holds the key to the success of these platforms. In order to enhance user engagement and in turn, health outcomes, it is essential to offer targeted interventional and informational support.

Objective

In this paper, we describe a content plus frequency framework to enable the characterization of highly engaged users in online communities and study theoretical techniques employed by these users through analysis of exchanged communication.

Methods

We applied the proposed methodology for analysis of peer interactions within QuitNet, an online community for smoking cessation. Firstly, we identified 144 highly engaged users based on communication frequency within QuitNet over a period of 16 years. Secondly, we used the taxonomy of behavior change techniques, text analysis methods from distributional semantics, machine learning, and sentiment analysis to assign theory-driven labels to content. Finally, we extracted content-specific insights from peer interactions (n=159,483 messages) among highly engaged QuitNet users.

Results

Studying user engagement using our proposed framework led to the definition of 3 user categories—conversation initiators, conversation attractors, and frequent posters. Specific behavior change techniques employed by top tier users (threshold set at top 3) within these 3 user groups were found to be goal setting, social support, rewards and threat, and comparison of outcomes. Engagement-specific trends within sentiment manifestations were also identified.

Conclusions

Use of content-inclusive analytics has offered deep insight into specific behavior change techniques employed by highly engaged users within QuitNet. Implications for personalization and active user engagement are discussed.

Collapse

Tapi Nzali MD, Aze J, Bringay S, Lavergne C, Mollevi C, Optiz T. Reconciliation of patient/doctor vocabulary in a structured resource. Health Informatics J 2018;25:1219-1231. [PMID: 29332530 DOI: 10.1177/1460458217751014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Rios A, Kavuluru R. Ordinal convolutional neural networks for predicting RDoC positive valence psychiatric symptom severity scores. J Biomed Inform 2017;75S:S85-S93. [PMID: 28506904 PMCID: PMC5682241 DOI: 10.1016/j.jbi.2017.05.008] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2017] [Revised: 04/04/2017] [Accepted: 05/10/2017] [Indexed: 10/19/2022]

Abstract

BACKGROUND

The CEGS N-GRID 2016 Shared Task in Clinical Natural Language Processing (NLP) provided a set of 1000 neuropsychiatric notes to participants as part of a competition to predict psychiatric symptom severity scores. This paper summarizes our methods, results, and experiences based on our participation in the second track of the shared task.

OBJECTIVE

Classical methods of text classification usually fall into one of three problem types: binary, multi-class, and multi-label classification. In this effort, we study ordinal regression problems with text data where misclassifications are penalized differently based on how far apart the ground truth and model predictions are on the ordinal scale. Specifically, we present our entries (methods and results) in the N-GRID shared task in predicting research domain criteria (RDoC) positive valence ordinal symptom severity scores (absent, mild, moderate, and severe) from psychiatric notes.

METHODS

We propose a novel convolutional neural network (CNN) model designed to handle ordinal regression tasks on psychiatric notes. Broadly speaking, our model combines an ordinal loss function, a CNN, and conventional feature engineering (wide features) into a single model which is learned end-to-end. Given interpretability is an important concern with nonlinear models, we apply a recent approach called locally interpretable model-agnostic explanation (LIME) to identify important words that lead to instance specific predictions.

RESULTS

Our best model entered into the shared task placed third among 24 teams and scored a macro mean absolute error (MMAE) based normalized score (100·(1-MMAE)) of 83.86. Since the competition, we improved our score (using basic ensembling) to 85.55, comparable with the winning shared task entry. Applying LIME to model predictions, we demonstrate the feasibility of instance specific prediction interpretation by identifying words that led to a particular decision.

CONCLUSION

In this paper, we present a method that successfully uses wide features and an ordinal loss function applied to convolutional neural networks for ordinal text classification specifically in predicting psychiatric symptom severity scores. Our approach leads to excellent performance on the N-GRID shared task and is also amenable to interpretability using existing model-agnostic approaches.

Collapse

Amith M, Cunningham R, Savas LS, Boom J, Schvaneveldt R, Tao C, Cohen T. Using Pathfinder networks to discover alignment between expert and consumer conceptual knowledge from online vaccine content. J Biomed Inform 2017;74:33-45. [PMID: 28823922 DOI: 10.1016/j.jbi.2017.08.007] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2017] [Revised: 05/28/2017] [Accepted: 08/14/2017] [Indexed: 10/19/2022]

Abstract

This study demonstrates the use of distributed vector representations and Pathfinder Network Scaling (PFNETS) to represent online vaccine content created by health experts and by laypeople. By analyzing a target audience's conceptualization of a topic, domain experts can develop targeted interventions to improve the basic health knowledge of consumers. The underlying assumption is that the content created by different groups reflects the mental organization of their knowledge. Applying automated text analysis to this content may elucidate differences between the knowledge structures of laypeople (heath consumers) and professionals (health experts). This paper utilizes vaccine information generated by laypeople and health experts to investigate the utility of this approach. We used an established technique from cognitive psychology, Pathfinder Network Scaling to infer the structure of the associational networks between concepts learned from online content using methods of distributional semantics. In doing so, we extend the original application of PFNETS to infer knowledge structures from individual participants, to infer the prevailing knowledge structures within communities of content authors. The resulting graphs reveal opportunities for public health and vaccination education experts to improve communication and intervention efforts directed towards health consumers. Our efforts demonstrate the feasibility of using an automated procedure to examine the manifestation of conceptual models within large bodies of free text, revealing evidence of conflicting understanding of vaccine concepts among health consumers as compared with health experts. Additionally, this study provides insight into the differences between consumer and expert abstraction of domain knowledge, revealing vaccine-related knowledge gaps that suggest opportunities to improve provider-patient communication.

Collapse

Tapi Nzali MD, Bringay S, Lavergne C, Mollevi C, Opitz T. What Patients Can Tell Us: Topic Analysis for Social Media on Breast Cancer. JMIR Med Inform 2017;5:e23. [PMID: 28760725 PMCID: PMC5556259 DOI: 10.2196/medinform.7779] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2017] [Revised: 06/16/2017] [Accepted: 06/17/2017] [Indexed: 11/13/2022] Open

Abstract

Background

Social media dedicated to health are increasingly used by patients and health professionals. They are rich textual resources with content generated through free exchange between patients. We are proposing a method to tackle the problem of retrieving clinically relevant information from such social media in order to analyze the quality of life of patients with breast cancer.

Objective

Our aim was to detect the different topics discussed by patients on social media and to relate them to functional and symptomatic dimensions assessed in the internationally standardized self-administered questionnaires used in cancer clinical trials (European Organization for Research and Treatment of Cancer [EORTC] Quality of Life Questionnaire Core 30 [QLQ-C30] and breast cancer module [QLQ-BR23]).

Methods

First, we applied a classic text mining technique, latent Dirichlet allocation (LDA), to detect the different topics discussed on social media dealing with breast cancer. We applied the LDA model to 2 datasets composed of messages extracted from public Facebook groups and from a public health forum (cancerdusein.org, a French breast cancer forum) with relevant preprocessing. Second, we applied a customized Jaccard coefficient to automatically compute similarity distance between the topics detected with LDA and the questions in the self-administered questionnaires used to study quality of life.

Results

Among the 23 topics present in the self-administered questionnaires, 22 matched with the topics discussed by patients on social media. Interestingly, these topics corresponded to 95% (22/23) of the forum and 86% (20/23) of the Facebook group topics. These figures underline that topics related to quality of life are an important concern for patients. However, 5 social media topics had no corresponding topic in the questionnaires, which do not cover all of the patients’ concerns. Of these 5 topics, 2 could potentially be used in the questionnaires, and these 2 topics corresponded to a total of 3.10% (523/16,868) of topics in the cancerdusein.org corpus and 4.30% (3014/70,092) of the Facebook corpus.

Conclusions

We found a good correspondence between detected topics on social media and topics covered by the self-administered questionnaires, which substantiates the sound construction of such questionnaires. We detected new emerging topics from social media that can be used to complete current self-administered questionnaires. Moreover, we confirmed that social media mining is an important source of information for complementary analysis of quality of life.

Collapse

Zhang S, Kang T, Qiu L, Zhang W, Yu Y, Elhadad N. Cataloguing Treatments Discussed and Used in Online Autism Communities. PROCEEDINGS OF THE ... INTERNATIONAL WORLD-WIDE WEB CONFERENCE. INTERNATIONAL WWW CONFERENCE 2017;2017:123-131. [PMID: 28736777 PMCID: PMC5516208 DOI: 10.1145/3038912.3052661] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Zhang S, Elhadad N. Factors Contributing to Dropping-out in an Online Health Community: Static and Longitudinal Analyses. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2017;2016:2090-2099. [PMID: 28269969 PMCID: PMC5333218] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Sridharan V, Cohen T, Cobb N, Myneni S. Characterization of Temporal Semantic Shifts of Peer-to-Peer Communication in a Health-Related Online Community: Implications for Data-driven Health Promotion. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2017;2016:1977-1986. [PMID: 28269957 PMCID: PMC5333293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Torii M, Tilak SS, Doan S, Zisook DS, Fan JW. Mining Health-Related Issues in Consumer Product Reviews by Using Scalable Text Analytics. BIOMEDICAL INFORMATICS INSIGHTS 2016;8:1-11. [PMID: 27375358 PMCID: PMC4915789 DOI: 10.4137/bii.s37791] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/15/2016] [Revised: 05/01/2016] [Accepted: 05/17/2016] [Indexed: 11/25/2022]