1
|
Holmes A, Sachar AS, Chang YP. Perceived Impact of COVID-19 in an Underserved Community: A Natural Language Processing Approach. J Adv Nurs 2025; 81:3201-3212. [PMID: 39373025 DOI: 10.1111/jan.16522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2024] [Revised: 09/10/2024] [Accepted: 09/23/2024] [Indexed: 10/08/2024]
Abstract
AIM To utilise natural language processing (NLP) to analyse interviews about the impact of COVID-19 in underserved communities and to compare it to traditional thematic analysis in a small subset of interviews. DESIGN NLP and thematic analysis were used together to comprehensively examine the interview data. METHODS Fifty transcribed interviews with purposively sampled adults living in underserved communities in the United States, conducted from June 2021 to May 2022, were analysed to explore the impact of the COVID-19 pandemic on social activities, mental and emotional stress and physical and spiritual well-being. NLP includes several stages: data extraction, preprocessing, processing using word embeddings and topic modelling and visualisation. This was compared to thematic analysis in a random sample of 10 interviews. RESULTS Six themes emerged from thematic analysis: The New Normal, Juxtaposition of Emotions, Ripple Effects on Health, Brutal yet Elusive Reality, Evolving Connections and Journey of Spirituality and Self-Realisation. With NLP, four clusters of similar context words for each approach were analysed visually and numerically. The frequency-based word embedding approach was most interpretable and well aligned with the thematic analysis. CONCLUSION The NLP results complemented the thematic analysis and offered new insights regarding the passage of time, the interconnectedness of impacts and the semantic connections among words. This research highlights the interdependence of pandemic impacts, simultaneously positive and negative effects and deeply individual COVID-19 experiences in underserved communities. IMPLICATIONS The iterative integration of NLP and thematic analysis was efficient and effective, facilitating the analysis of many transcripts and expanding nursing research methodology. IMPACT While thematic analysis provided richer, more detailed themes, NLP captured new elements and combinations of words, making it a promising tool in qualitative analysis. REPORTING METHOD Not applicable. PATIENT OR PUBLIC CONTRIBUTION No patient or public contribution.
Collapse
Affiliation(s)
- Ashleigh Holmes
- School of Nursing, The State University of New York, University at Buffalo, Buffalo, New York, USA
| | - Amanjot Singh Sachar
- School of Engineering and Applied Sciences, The State University of New York, University at Buffalo, Buffalo, New York, USA
| | - Yu-Ping Chang
- School of Nursing, The State University of New York, University at Buffalo, Buffalo, New York, USA
| |
Collapse
|
2
|
Whitfield C, Liu Y, Anwar M. Impact of COVID-19 Pandemic on Social Determinants of Health Issues of Marginalized Black and Asian Communities: A Social Media Analysis Empowered by Natural Language Processing. J Racial Ethn Health Disparities 2025; 12:1641-1656. [PMID: 38625665 PMCID: PMC12069143 DOI: 10.1007/s40615-024-01996-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2023] [Revised: 04/02/2024] [Accepted: 04/07/2024] [Indexed: 04/17/2024]
Abstract
PURPOSE This study aims to understand the impact of the COVID-19 pandemic on social determinants of health (SDOH) of marginalized racial/ethnic US population groups, specifically African Americans and Asians, by leveraging natural language processing (NLP) and machine learning (ML) techniques on race-related spatiotemporal social media text data. Specifically, this study establishes the extent to which Latent Dirichlet Allocation (LDA) and Gibbs Sampling Dirichlet Multinomial Mixture (GSDMM)-based topic modeling determines social determinants of health (SDOH) categories, and how adequately custom named-entity recognition (NER) detects key SDOH factors from a race/ethnicity-related Reddit data corpus. METHODS In this study, we collected race/ethnicity-specific data from 5 location subreddits including New York City, NY; Los Angeles, CA; Chicago, IL; Philadelphia, PA; and Houston, TX from March to December 2019 (before COVID-19 pandemic) and from March to December 2020 (during COVID-19 pandemic). Next, we applied methods from natural language processing and machine learning to analyze SDOH issues from extracted Reddit comments and conversation threads using feature engineering, topic modeling, and custom named-entity recognition (NER). RESULTS Topic modeling identified 35 SDOH-related topics. The SDOH-based custom NER analyses revealed that the COVID-19 pandemic significantly impacted SDOH issues of marginalized Black and Asian communities. On average, the Social and Community Context (SCC) category of SDOH had the highest percent increase (366%) from the pre-pandemic period to the pandemic period across all locations and population groups. Some of the detected SCC issues were racism, protests, arrests, immigration, police brutality, hate crime, white supremacy, and discrimination. CONCLUSION Reddit social media platform can be an alternative source to assess the SDOH issues of marginalized Black and Asian communities during the COVID-19 pandemic. By employing NLP/ML techniques such as LDA/GSDMM-based topic modeling and custom NER on a race/ethnicity-specific Reddit corpus, we uncovered various SDOH issues affecting marginalized Black and Asian communities that were significantly worsened during the COVID-19 pandemic. As a result of conducting this research, we recommend that researchers, healthcare providers, and governments utilize social media and collaboratively formulate responses and policies that will address SDOH issues during public health crises.
Collapse
Affiliation(s)
| | - Yang Liu
- North Carolina A&T State University, Greensboro, NC, 27411, USA
| | - Mohd Anwar
- North Carolina A&T State University, Greensboro, NC, 27411, USA.
| |
Collapse
|
3
|
Ahmadi S, Irandoost SF, SoleimanvandiAzar N, Nojomi M, Yoosefi Lebni J, Tehrani-Banihashemi A. Identification of emerging harms due to COVID-19 outbreak: a qualitative study in Iran. BMC Public Health 2025; 25:361. [PMID: 39881309 PMCID: PMC11776294 DOI: 10.1186/s12889-025-21513-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2024] [Accepted: 01/17/2025] [Indexed: 01/31/2025] Open
Abstract
INTRODUCTION Although COVID-19 has altered various harms and exacerbated the prevalence of some of them, this period has also set the stage for the emergence of new harms. The present study aims to identify the emerging harms resulting from the COVID-19 outbreak in Iran. METHODS The study was conducted using a qualitative content analysis approach through semi-structured interviews with 21 experts and professors knowledgeable about social harms and COVID-19 consequences who were selected through purposive and theoretical sampling. Data analysis was carried out using the Graneheim and Lundman's method in MAXQDA-2018 software. Guba and Lincoln's criteria were used to trustworthiness of results. RESULTS The results showed that the COVID-19 pandemic led to a range of issues and problems at various levels of society that were not considered social harms before the pandemic, given their prevalence and impact. After analyzing the data, four main categories and fourteen subcategories were identified. The main categories were social fatigue, ineffective education system, formation of a digital lifestyle, and formation of a new understanding and meaning of death and life. CONCLUSION The COVID-19 crisis has intensified existing social harms and introduced new ones, rendering previous mitigation strategies ineffective. Designing novel policies and guidelines is crucial to address these evolving challenges and reduce the adverse societal impacts of the pandemic.
Collapse
Affiliation(s)
- Sina Ahmadi
- Social Development and Health Promotion Research Center, Health Institute, Kermanshah University of Medical Sciences, Kermanshah, Iran
| | - Seyed Fahim Irandoost
- Department of Community Medicine, School of Medicine, Urmia University of Medical Sciences, Urmia, Iran.
| | - Neda SoleimanvandiAzar
- Preventive Medicine and Public Health Research Center, Psychosocial Health Research Institute, Department of Community and Family Medicine, School of Medicine, Iran University of Medical Sciences, Tehran, Iran., Shahid Hemmat Highway, Tehran, P.O Box: 14665-354, 1449614535, Iran.
| | - Marzieh Nojomi
- Preventive Medicine and Public Health Research Center, Psychosocial Health Research Institute, Department of Community and Family Medicine, School of Medicine, Iran University of Medical Sciences, Tehran, Iran., Shahid Hemmat Highway, Tehran, P.O Box: 14665-354, 1449614535, Iran
- Department of Sociology and Anthropology, Nipissing University, North Bay, ON, Canada
| | - Javad Yoosefi Lebni
- Social Determinants of Health Research Center, Lorestan University of Medical Sciences, Khorramabad, Iran
| | - Arash Tehrani-Banihashemi
- Preventive Medicine and Public Health Research Center, Psychosocial Health Research Institute, Department of Community and Family Medicine, School of Medicine, Iran University of Medical Sciences, Tehran, Iran., Shahid Hemmat Highway, Tehran, P.O Box: 14665-354, 1449614535, Iran
| |
Collapse
|
4
|
Wu D, Ng M, Gupta SS, Raynor P, Tao Y, Ren Y, Hung P, Qiao S, Zhang J, Fillo J, Li X, Guille C, Eichelberger K, Olatosi B. Disclosure Patterns of Opioid Use Disorders in Perinatal Care During the Opioid Epidemic on X From 2019 to 2021: Thematic Analysis. JMIR Pediatr Parent 2024; 7:e52735. [PMID: 39374068 PMCID: PMC11494255 DOI: 10.2196/52735] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 03/31/2024] [Accepted: 05/03/2024] [Indexed: 10/08/2024] Open
Abstract
BACKGROUND In 2021, the United States experienced a 14% rise in fatal drug overdoses totaling 106,699 deaths, driven by harmful opioid use, particularly among individuals in the perinatal period who face increased risks associated with opioid use disorders (OUDs). Increased concerns about the impacts of escalating harmful opioid use among pregnant and postpartum persons are rising. Most of the current limited perinatal OUD studies were conducted using traditional methods, such as interviews and randomized controlled trials to understand OUD treatment, risk factors, and associated adverse effects. However, little is known about how social media data, such as X, formerly known as Twitter, can be leveraged to explore and identify broad perinatal OUD trends, disclosure and communication patterns, and public health surveillance about OUD in the perinatal period. OBJECTIVE The objective is 3-fold: first, we aim to identify key themes and trends in perinatal OUD discussions on platform X. Second, we explore user engagement patterns, including replying and retweeting behaviors. Third, we investigate computational methods that could potentially streamline and scale the labor-intensive manual annotation effort. METHODS We extracted 6 million raw perinatal-themed tweets posted by global X users during the opioid epidemic from May 2019 to October 2021. After data cleaning and sampling, we used 500 tweets related to OUD in the perinatal period by US X users for a thematic analysis using NVivo (Lumivero) software. RESULTS Seven major themes emerged from our thematic analysis: (1) political views related to harmful opioid and other substance use, (2) perceptions of others' substance use, (3) lived experiences of opioid and other substance use, (4) news reports or papers related to opioid and other substance use, (5) health care initiatives, (6) adverse effects on children's health due to parental substance use, and (7) topics related to nonopioid substance use. Among these 7 themes, our user engagement analysis revealed that themes 4 and 5 received the highest average retweet counts, and theme 3 received the highest average tweet reply count. We further found that different computational methods excel in analyzing different themes. CONCLUSIONS Social media platforms such as X can serve as a valuable tool for analyzing real-time discourse and exploring public perceptions, opinions, and behaviors related to maternal substance use, particularly, harmful opioid use in the perinatal period. More health promotion strategies can be carried out on social media platforms to provide educational support for the OUD perinatal population.
Collapse
Affiliation(s)
- Dezhi Wu
- Department of Integrated Information Technology, University of South Carolina, Columbia, SC, United States
| | - Minnie Ng
- Department of Integrated Information Technology, University of South Carolina, Columbia, SC, United States
| | - Saborny Sen Gupta
- Department of Integrated Information Technology, University of South Carolina, Columbia, SC, United States
| | - Phyllis Raynor
- Department of Biobehavioral Health & Nursing Science, College of Nursing, University of South Carolina, Columbia, SC, United States
| | - Youyou Tao
- Department of Information Systems and Business Analytics, Loyola Marymount University, Los Angeles, CA, United States
| | - Yang Ren
- Department of Computer Science and Engineering, University of South Carolina, Columbia, SC, United States
| | - Peiyin Hung
- Department of Health Services Policy and Management, Arnold School of Public Health, University of South Carolina, Columbia, SC, United States
| | - Shan Qiao
- Department of Health Promotion, Education, and Behavior, Arnold School of Public Health, University of South Carolina, Columbia, SC, United States
| | - Jiajia Zhang
- Department of Epidemiology and Biostatistics, Arnold School of Public Health, University of South Carolina, Columbia, SC, United States
| | - Jennifer Fillo
- Department of Health Promotion, Education, and Behavior, Arnold School of Public Health, University of South Carolina, Columbia, SC, United States
| | - Xiaoming Li
- Department of Health Promotion, Education, and Behavior, Arnold School of Public Health, University of South Carolina, Columbia, SC, United States
| | - Constance Guille
- Department of Psychiatry and Behavioral Sciences, College of Medicine, Medical University of South Carolina, Charleston, SC, United States
| | - Kacey Eichelberger
- Department of Obstetrics and Gynecology, University of South Carolina School of Medicine Greenville, Prisma Health, Greenville, SC, United States
| | - Bankole Olatosi
- Department of Health Services Policy and Management, Arnold School of Public Health, University of South Carolina, Columbia, SC, United States
| |
Collapse
|
5
|
Barrera R, Poindexter K, Tucker C, Winkler MF, Dashti HS. Amplifying the lived experiences of parenteral nutrition consumers through the thematic analysis of social media posts. Nutr Clin Pract 2024; 39:850-858. [PMID: 38063263 PMCID: PMC11161556 DOI: 10.1002/ncp.11097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 10/29/2023] [Accepted: 11/04/2023] [Indexed: 06/09/2024] Open
Abstract
BACKGROUND Consumers of parenteral nutrition (PN) and their caregivers use social media to seek advice and support from their peers and to share experiences. We aimed to leverage posts from a social media patient community to identify common lived experiences of consumers of PN to prioritize opportunities for support through advocacy, education, and research. METHODS Anonymous posts with high engagement were collected over 4 months from a PN-focused social media support group platform. No personal information was collected or analyzed. Post content was reviewed for demographic characteristics. Thematic analysis involved inductive coding to identify content-based keywords. Keywords were then used to form major themes and subthemes that were then quantified by post counts. RESULTS A total of 306 social media posts were analyzed. Most were from adult PN consumers (80.4%) and pertained to home-based PN (82%). Equivalent number of posts (5%) were from new consumers and those who had not yet started or restarting PN. The analysis revealed 12 major themes with 2-11 subthemes each, spanning medical, nutrition, emotional, and social aspects. The most prevalent theme was "Best practices, care, and safety of PN use" (36.9%), covering posts seeking guidance on line care, personal hygiene, equipment use, and vascular access devices. Others included "Symptoms" (23.9%) and "Patient safety concerns of PN handling by healthcare providers" (16.0%). CONCLUSIONS The identified themes provide a broader understanding of contemporary shared lived experiences and concerns relevant to PN consumers and their caregivers. Given the evolving nature of daily stressors, periodic reanalysis may be necessary.
Collapse
Affiliation(s)
- Regina Barrera
- Department of Anesthesia, Critical Care and Pain Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | | | | | - Marion F. Winkler
- Department of Surgery, Rhode Island Hospital, Alpert Medical School of Brown University, Providence, RI, USA
| | - Hassan S. Dashti
- Department of Anesthesia, Critical Care and Pain Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| |
Collapse
|
6
|
Omisore OM, Odenigbo I, Orji J, Beltran AIH, Meier S, Baghaei N, Orji R. Extended Reality for Mental Health Evaluation: Scoping Review. JMIR Serious Games 2024; 12:e38413. [PMID: 39047289 PMCID: PMC11306946 DOI: 10.2196/38413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 09/05/2022] [Accepted: 03/24/2024] [Indexed: 07/27/2024] Open
Abstract
BACKGROUND Mental health disorders are the leading cause of health-related problems worldwide. It is projected that mental health disorders will be the leading cause of morbidity among adults as the incidence rates of anxiety and depression grow worldwide. Recently, "extended reality" (XR), a general term covering virtual reality (VR), augmented reality (AR), and mixed reality (MR), is paving the way for the delivery of mental health care. OBJECTIVE We aimed to investigate the adoption and implementation of XR technology used in interventions for mental disorders and to provide statistical analyses of the design, usage, and effectiveness of XR technology for mental health interventions with a worldwide demographic focus. METHODS In this paper, we conducted a scoping review of the development and application of XR in the area of mental disorders. We performed a database search to identify relevant studies indexed in Google Scholar, PubMed, and the ACM Digital Library. A search period between August 2016 and December 2023 was defined to select papers related to the usage of VR, AR, and MR in a mental health context. The database search was performed with predefined queries, and a total of 831 papers were identified. Ten papers were identified through professional recommendation. Inclusion and exclusion criteria were designed and applied to ensure that only relevant studies were included in the literature review. RESULTS We identified a total of 85 studies from 27 countries worldwide that used different types of VR, AR, and MR techniques for managing 14 types of mental disorders. By performing data analysis, we found that most of the studies focused on high-income countries, such as the United States (n=14, 16.47%) and Germany (n=12, 14.12%). None of the studies were for African countries. The majority of papers reported that XR techniques lead to a significant reduction in symptoms of anxiety or depression. The majority of studies were published in 2021 (n=26, 30.59%). This could indicate that mental disorder intervention received higher attention when COVID-19 emerged. Most studies (n=65, 76.47%) focused on a population in the age range of 18-65 years, while few studies (n=2, 3.35%) focused on teenagers (ie, subjects in the age range of 10-19 years). In addition, more studies were conducted experimentally (n=67, 78.82%) rather than by using analytical and modeling approaches (n=8, 9.41%). This shows that there is a rapid development of XR technology for mental health care. Furthermore, these studies showed that XR technology can effectively be used for evaluating mental disorders in a similar or better way that conventional approaches. CONCLUSIONS In this scoping review, we studied the adoption and implementation of XR technology for mental disorder care. Our review shows that XR treatment yields high patient satisfaction, and follow-up assessments show significant improvement with large effect sizes. Moreover, the studies adopted unique designs that were set up to record and analyze the symptoms reported by their participants. This review may aid future research and development of various XR mechanisms for differentiated mental disorder procedures.
Collapse
Affiliation(s)
- Olatunji Mumini Omisore
- Research Centre for Medical Robotics and Minimally Invasive Surgical Devices, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Ifeanyi Odenigbo
- Faculty of Computer Science, Dalhousie University, Halifax, NS, Canada
| | - Joseph Orji
- Faculty of Computer Science, Dalhousie University, Halifax, NS, Canada
| | | | - Sandra Meier
- Department of Psychiatry, Dalhousie University, Halifax, NS, Canada
| | - Nilufar Baghaei
- School of Electrical Engineering and Computer Science, University of Queensland, St Lucia, Australia
| | - Rita Orji
- Faculty of Computer Science, Dalhousie University, Halifax, NS, Canada
| |
Collapse
|
7
|
Kumar VV, Sahoo A, Kumar R, Loyd N. Public Healthcare Informatics for COVID-19 from Social Media Data. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2024; 2024:1-4. [PMID: 40039566 DOI: 10.1109/embc53108.2024.10782707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/06/2025]
Abstract
The COVID-19 pandemic has profoundly affected various facets of society. This research explores the discourse on social media, specifically "X" (Twitter), to understand public sentiment and needs regarding healthcare supply chain challenges, such as the availability of testing kits, oxygen supplies, and hospital beds, during the pandemic. By identifying pertinent opinionated keyphrases and employing advanced natural language processing (NLP) techniques, we sifted through over 3.9 million tweets to collect health informatics. The content of these tweets is analyzed to investigate the word trends and sentiments. The extracted sentiments were then organized into ten distinct categories using the K-means clustering method that pointed out top-ten healthcare supply needs. The findings underscore the value of social media as a tool for gathering timely and actionable insights on healthcare supply chain logistics, offering critical data for effective response strategies during health crises.
Collapse
|
8
|
Jordan A, Park A. Understanding the Long Haulers of COVID-19: Mixed Methods Analysis of YouTube Content. JMIR AI 2024; 3:e54501. [PMID: 38875666 PMCID: PMC11184269 DOI: 10.2196/54501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 04/02/2024] [Accepted: 04/06/2024] [Indexed: 06/16/2024]
Abstract
BACKGROUND The COVID-19 pandemic had a devastating global impact. In the United States, there were >98 million COVID-19 cases and >1 million resulting deaths. One consequence of COVID-19 infection has been post-COVID-19 condition (PCC). People with this syndrome, colloquially called long haulers, experience symptoms that impact their quality of life. The root cause of PCC and effective treatments remains unknown. Many long haulers have turned to social media for support and guidance. OBJECTIVE In this study, we sought to gain a better understanding of the long hauler experience by investigating what has been discussed and how information about long haulers is perceived on social media. We specifically investigated the following: (1) the range of symptoms that are discussed, (2) the ways in which information about long haulers is perceived, (3) informational and emotional support that is available to long haulers, and (4) discourse between viewers and creators. We selected YouTube as our data source due to its popularity and wide range of audience. METHODS We systematically gathered data from 3 different types of content creators: medical sources, news sources, and long haulers. To computationally understand the video content and viewers' reactions, we used Biterm, a topic modeling algorithm created specifically for short texts, to analyze snippets of video transcripts and all top-level comments from the comment section. To triangulate our findings about viewers' reactions, we used the Valence Aware Dictionary and Sentiment Reasoner to conduct sentiment analysis on comments from each type of content creator. We grouped the comments into positive and negative categories and generated topics for these groups using Biterm. We then manually grouped resulting topics into broader themes for the purpose of analysis. RESULTS We organized the resulting topics into 28 themes across all sources. Examples of medical source transcript themes were Explanations in layman's terms and Biological explanations. Examples of news source transcript themes were Negative experiences and handling the long haul. The 2 long hauler transcript themes were Taking treatments into own hands and Changes to daily life. News sources received a greater share of negative comments. A few themes of these negative comments included Misinformation and disinformation and Issues with the health care system. Similarly, negative long hauler comments were organized into several themes, including Disillusionment with the health care system and Requiring more visibility. In contrast, positive medical source comments captured themes such as Appreciation of helpful content and Exchange of helpful information. In addition to this theme, one positive theme found in long hauler comments was Community building. CONCLUSIONS The results of this study could help public health agencies, policy makers, organizations, and health researchers understand symptomatology and experiences related to PCC. They could also help these agencies develop their communication strategy concerning PCC.
Collapse
Affiliation(s)
- Alexis Jordan
- Department of Software and Information Systems, UNC Charlotte, Charlotte, NC, United States
| | - Albert Park
- Department of Software and Information Systems, UNC Charlotte, Charlotte, NC, United States
| |
Collapse
|
9
|
Han L, Gladkoff S, Erofeev G, Sorokina I, Galiano B, Nenadic G. Neural machine translation of clinical text: an empirical investigation into multilingual pre-trained language models and transfer-learning. Front Digit Health 2024; 6:1211564. [PMID: 38468693 PMCID: PMC10926203 DOI: 10.3389/fdgth.2024.1211564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Accepted: 01/12/2024] [Indexed: 03/13/2024] Open
Abstract
Clinical text and documents contain very rich information and knowledge in healthcare, and their processing using state-of-the-art language technology becomes very important for building intelligent systems for supporting healthcare and social good. This processing includes creating language understanding models and translating resources into other natural languages to share domain-specific cross-lingual knowledge. In this work, we conduct investigations on clinical text machine translation by examining multilingual neural network models using deep learning such as Transformer based structures. Furthermore, to address the language resource imbalance issue, we also carry out experiments using a transfer learning methodology based on massive multilingual pre-trained language models (MMPLMs). The experimental results on three sub-tasks including (1) clinical case (CC), (2) clinical terminology (CT), and (3) ontological concept (OC) show that our models achieved top-level performances in the ClinSpEn-2022 shared task on English-Spanish clinical domain data. Furthermore, our expert-based human evaluations demonstrate that the small-sized pre-trained language model (PLM) outperformed the other two extra-large language models by a large margin in the clinical domain fine-tuning, which finding was never reported in the field. Finally, the transfer learning method works well in our experimental setting using the WMT21fb model to accommodate a new language space Spanish that was not seen at the pre-training stage within WMT21fb itself, which deserves more exploitation for clinical knowledge transformation, e.g. to investigate into more languages. These research findings can shed some light on domain-specific machine translation development, especially in clinical and healthcare fields. Further research projects can be carried out based on our work to improve healthcare text analytics and knowledge transformation. Our data is openly available for research purposes at: https://github.com/HECTA-UoM/ClinicalNMT.
Collapse
Affiliation(s)
- Lifeng Han
- Department of Computer Science, The University of Manchester, Manchester, United Kingom
| | - Serge Gladkoff
- AI Lab, Logrus Global, Translation & Localization, Philadelphia, PA, United States
| | - Gleb Erofeev
- AI Lab, Logrus Global, Translation & Localization, Philadelphia, PA, United States
| | - Irina Sorokina
- AI Lab, Logrus Global, Translation & Localization, Philadelphia, PA, United States
| | - Betty Galiano
- Management Department, Ocean Translations, Rosario, Argentina
| | - Goran Nenadic
- Department of Computer Science, The University of Manchester, Manchester, United Kingom
| |
Collapse
|
10
|
Bhuptani PH, Hunter J, Goodwin C, Millman C, Orchowski LM. Characterizing Intimate Partner Violence in the United States During the COVID-19 Pandemic: A Systematic Review. TRAUMA, VIOLENCE & ABUSE 2023; 24:3220-3235. [PMID: 36321779 DOI: 10.1177/15248380221126187] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Intimate partner violence (IPV) is a significant global health concern. Numerous research studies document increases in IPV since the onset of the COVID-19 pandemic in March 2020. Despite this widespread recognition, research around the nature of this violence is still growing. This systematic review summarizes the existing literature documenting the prevalence and characteristics of IPV during the COVID-19 pandemic. Inclusion criteria are as follows: reported original data empirical study, assessed for IPV among adult population in the United States, and was published in English between December 2019 and March 2022. A total of 53 articles were then independently reviewed and sorted into four thematic subcategories: victimization, perpetration, articles addressing victimization and perpetration, and provider perspectives. Studies document consistent increases in the prevalence of IPV victimization and perpetration. Providers within agencies providing support to individuals impacted by IPV also documented increased strain on the agencies.
Collapse
Affiliation(s)
- Prachi H Bhuptani
- Rhode Island Hospital, Providence, USA
- Brown University, Providence, RI, USA
| | | | | | | | | |
Collapse
|
11
|
Dainty KN, Seaton MB, O'Neill B, Mohindra R. Going home positive: a qualitative study of the experiences of care for patients with COVID-19 who are not hospitalized. CMAJ Open 2023; 11:E1041-E1047. [PMID: 37935488 PMCID: PMC10635702 DOI: 10.9778/cmajo.20220085] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/09/2023] Open
Abstract
BACKGROUND Most Canadians diagnosed with COVID-19 have had mild symptoms not requiring hospitalization. We sought to understand the patient experience of care while being isolated at home after testing positive for SARS-CoV-2 infection. METHODS We conducted a phenomenologically informed qualitative descriptive study using in-depth semistructured interviews to identify common themes of experience for patients sent home from hospital with a positive COVID-19 diagnosis. Between July and December 2020, we conducted interviews with patients who were followed by the North York General Hospital COVID Follow-Up Clinic. Patients with mild to moderate symptoms were interviewed 4 weeks after their COVID-19 diagnosis. We conducted the interviews and performed a thematic analysis of the data concurrently, in keeping with the iterative process of qualitative methodology. RESULTS We conducted interviews with 26 patients. From our analysis, 3 themes were developed regarding participants' overall experience: lack of adequate communication, inconsistency of information from various sources, and the social implications of a COVID-19 diagnosis. The implications of a positive test for SARS-CoV-2 infection are substantial, even when symptoms are mild and patients self-isolate as recommended. Participants noted communication challenges and inconsistent information, leading to exacerbated stress. INTERPRETATION Participants shared their experiences of the stigma of testing positive and the frustration of poor communication structures and inconsistent information. Experiencing care during self-isolation at home is an area of increasing importance, and these findings can inform improved support, ensuring access to equitable and safe COVID-19 care for these patients.
Collapse
Affiliation(s)
- Katie N Dainty
- North York General Hospital (Dainty, Seaton, Mohindra); Institute for Health Policy, Management and Evaluation (Dainty), University of Toronto; Unity Health Toronto (O'Neill); Schwartz/Reisman Emergency Medicine Institute (Mohindra), Toronto, Ont.
| | - M Bianca Seaton
- North York General Hospital (Dainty, Seaton, Mohindra); Institute for Health Policy, Management and Evaluation (Dainty), University of Toronto; Unity Health Toronto (O'Neill); Schwartz/Reisman Emergency Medicine Institute (Mohindra), Toronto, Ont
| | - Braden O'Neill
- North York General Hospital (Dainty, Seaton, Mohindra); Institute for Health Policy, Management and Evaluation (Dainty), University of Toronto; Unity Health Toronto (O'Neill); Schwartz/Reisman Emergency Medicine Institute (Mohindra), Toronto, Ont
| | - Rohit Mohindra
- North York General Hospital (Dainty, Seaton, Mohindra); Institute for Health Policy, Management and Evaluation (Dainty), University of Toronto; Unity Health Toronto (O'Neill); Schwartz/Reisman Emergency Medicine Institute (Mohindra), Toronto, Ont
| |
Collapse
|
12
|
Michalski AA, Lis K, Stankiewicz J, Kloska SM, Sycz A, Dudziński M, Muras-Szwedziak K, Nowicki M, Bazan-Socha S, Dabrowski MJ, Basak GW. Supporting the Diagnosis of Fabry Disease Using a Natural Language Processing-Based Approach. J Clin Med 2023; 12:jcm12103599. [PMID: 37240705 DOI: 10.3390/jcm12103599] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 05/01/2023] [Accepted: 05/15/2023] [Indexed: 05/28/2023] Open
Abstract
In clinical practice, the consideration of non-specific symptoms of rare diseases in order to make a correct and timely diagnosis is often challenging. To support physicians, we developed a decision-support scoring system on the basis of retrospective research. Based on the literature and expert knowledge, we identified clinical features typical for Fabry disease (FD). Natural language processing (NLP) was used to evaluate patients' electronic health records (EHRs) to obtain detailed information about FD-specific patient characteristics. The NLP-determined elements, laboratory test results, and ICD-10 codes were transformed and grouped into pre-defined FD-specific clinical features that were scored in the context of their significance in the FD signs. The sum of clinical feature scores constituted the FD risk score. Then, medical records of patients with the highest FD risk score were reviewed by physicians who decided whether to refer a patient for additional tests or not. One patient who obtained a high-FD risk score was referred for DBS assay and confirmed to have FD. The presented NLP-based, decision-support scoring system achieved AUC of 0.998, which demonstrates that the applied approach enables for accurate identification of FD-suspected patients, with a high discrimination power.
Collapse
Affiliation(s)
- Adrian A Michalski
- Saventic Health, Polna 66/12 Street, 87-100 Torun, Poland
- Department of Analytical Chemistry, Nicolaus Copernicus University Ludwik Rydygier Collegium Medicum, 85-089 Bydgoszcz, Poland
| | - Karol Lis
- Saventic Health, Polna 66/12 Street, 87-100 Torun, Poland
- Department of Hematology, Transplantation and Internal Medicine, Medical University of Warsaw, 02-097 Warsaw, Poland
| | - Joanna Stankiewicz
- Saventic Health, Polna 66/12 Street, 87-100 Torun, Poland
- Department of Pediatrics, Hematology and Oncology, Nicolaus Copernicus University Ludwik Rydygier Collegium Medicum, 85-094 Bydgoszcz, Poland
| | - Sylwester M Kloska
- Saventic Health, Polna 66/12 Street, 87-100 Torun, Poland
- Department of Forensic Medicine, Nicolaus Copernicus University Ludwik Rydygier Collegium Medicum, 85-067 Bydgoszcz, Poland
| | - Arkadiusz Sycz
- Saventic Health, Polna 66/12 Street, 87-100 Torun, Poland
- Faculty of Mathematics and Information Science, Warsaw University of Technology, 00-662 Warsaw, Poland
| | - Marek Dudziński
- Saventic Health, Polna 66/12 Street, 87-100 Torun, Poland
- Department of Hematology, Institute of Medical Sciences, College of Medical Sciences, University of Rzeszow, 35-959 Rzeszow, Poland
| | - Katarzyna Muras-Szwedziak
- Saventic Foundation, Polna 66/12 Street, 87-100 Torun, Poland
- Department of Nephrology, Hypertension and Kidney Transplantation, Medical University of Lodz, 90-419 Lodz, Poland
| | - Michał Nowicki
- Saventic Foundation, Polna 66/12 Street, 87-100 Torun, Poland
- Department of Nephrology, Hypertension and Kidney Transplantation, Medical University of Lodz, 90-419 Lodz, Poland
| | - Stanisława Bazan-Socha
- Saventic Foundation, Polna 66/12 Street, 87-100 Torun, Poland
- Department of Internal Medicine, Faculty of Medicine, Jagiellonian University Medical College, 31-008 Krakow, Poland
| | - Michal J Dabrowski
- Saventic Health, Polna 66/12 Street, 87-100 Torun, Poland
- Computational Biology Group, Institute of Computer Science of the Polish Academy of Sciences, 01-248 Warsaw, Poland
| | - Grzegorz W Basak
- Saventic Health, Polna 66/12 Street, 87-100 Torun, Poland
- Department of Hematology, Transplantation and Internal Medicine, Medical University of Warsaw, 02-097 Warsaw, Poland
| |
Collapse
|
13
|
Weger R, Lossio-Ventura JA, Rose-McCandlish M, Shaw JS, Sinclair S, Pereira F, Chung JY, Atlas LY. Trends in Language Use During the COVID-19 Pandemic and Relationship Between Language Use and Mental Health: Text Analysis Based on Free Responses From a Longitudinal Study. JMIR Ment Health 2023; 10:e40899. [PMID: 36525362 PMCID: PMC9994427 DOI: 10.2196/40899] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 11/29/2022] [Accepted: 12/15/2022] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND The COVID-19 pandemic and its associated restrictions have been a major stressor that has exacerbated mental health worldwide. Qualitative data play a unique role in documenting mental states through both language features and content. Text analysis methods can provide insights into the associations between language use and mental health and reveal relevant themes that emerge organically in open-ended responses. OBJECTIVE The aim of this web-based longitudinal study on mental health during the early COVID-19 pandemic was to use text analysis methods to analyze free responses to the question, "Is there anything else you would like to tell us that might be important that we did not ask about?" Our goals were to determine whether individuals who responded to the item differed from nonresponders, to determine whether there were associations between language use and psychological status, and to characterize the content of responses and how responses changed over time. METHODS A total of 3655 individuals enrolled in the study were asked to complete self-reported measures of mental health and COVID-19 pandemic-related questions every 2 weeks for 6 months. Of these 3655 participants, 2497 (68.32%) provided at least 1 free response (9741 total responses). We used various text analysis methods to measure the links between language use and mental health and to characterize response themes over the first year of the pandemic. RESULTS Response likelihood was influenced by demographic factors and health status: those who were male, Asian, Black, or Hispanic were less likely to respond, and the odds of responding increased with age and education as well as with a history of physical health conditions. Although mental health treatment history did not influence the overall likelihood of responding, it was associated with more negative sentiment, negative word use, and higher use of first-person singular pronouns. Responses were dynamically influenced by psychological status such that distress and loneliness were positively associated with an individual's likelihood to respond at a given time point and were associated with more negativity. Finally, the responses were negative in valence overall and exhibited fluctuations linked with external events. The responses covered a variety of topics, with the most common being mental health and emotion, social or physical distancing, and policy and government. CONCLUSIONS Our results identify trends in language use during the first year of the pandemic and suggest that both the content of responses and overall sentiments are linked to mental health.
Collapse
Affiliation(s)
- Rachel Weger
- National Center for Complementary and Integrative Health, National Institutes of Health, Bethesda, MD, United States
| | | | - Margaret Rose-McCandlish
- National Center for Complementary and Integrative Health, National Institutes of Health, Bethesda, MD, United States
| | - Jacob S Shaw
- National Institute of Mental Health, National Institutes of Health, Bethesda, MD, United States
| | - Stephen Sinclair
- National Institute of Mental Health, National Institutes of Health, Bethesda, MD, United States
| | - Francisco Pereira
- National Institute of Mental Health, National Institutes of Health, Bethesda, MD, United States
| | - Joyce Y Chung
- National Institute of Mental Health, National Institutes of Health, Bethesda, MD, United States
| | - Lauren Yvette Atlas
- National Center for Complementary and Integrative Health, National Institutes of Health, Bethesda, MD, United States.,National Institute of Mental Health, National Institutes of Health, Bethesda, MD, United States.,National Institute on Drug Abuse, National Institutes of Health, Baltimore, MD, United States
| |
Collapse
|
14
|
Modeling and Moderation of COVID-19 Social Network Chat. INFORMATION 2023. [DOI: 10.3390/info14020124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023] Open
Abstract
Negative social media usage during the COVID-19 pandemic has highlighted the importance of understanding the spread of misinformation and toxicity in public online discussions. In this paper, we propose a novel unsupervised method to discover the structure of online COVID-19-related conversations. Our method trains a nine-state Hidden Markov Model (HMM) initialized from a biclustering of 23 features extracted from online messages. We apply our method to 16,000 conversations (1.5 million messages) that took place on the Facebook pages of 15 Canadian newspapers following COVID-19 news items, and show that it can effectively extract the conversation structure and discover the main themes of the messages. Furthermore, we demonstrate how the PageRank algorithm and the conversation graph discovered can be used to simulate the impact of five different moderation strategies, which makes it possible to easily develop and test new strategies to limit the spread of harmful messages. Although our work in this paper focuses on the COVID-19 pandemic, the methodology is general enough to be applied to handle communications during future pandemics and other crises, or to develop better practices for online community moderation in general.
Collapse
|
15
|
Qorib M, Oladunni T, Denis M, Ososanya E, Cotae P. Covid-19 vaccine hesitancy: Text mining, sentiment analysis and machine learning on COVID-19 vaccination Twitter dataset. EXPERT SYSTEMS WITH APPLICATIONS 2023; 212:118715. [PMID: 36092862 PMCID: PMC9443617 DOI: 10.1016/j.eswa.2022.118715] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/02/2022] [Revised: 07/14/2022] [Accepted: 08/26/2022] [Indexed: 05/20/2023]
Abstract
In 2019 there was an outbreak of coronavirus pandemic also known as COVID-19. Many scientists believe that the pandemic originated from Wuhan, China, before spreading to other parts of the globe. To reduce the spread of the disease, decision makers encouraged measures such as hand washing, face masking, and social distancing. In early 2021, some countries including the United States began administering COVID-19 vaccines. Vaccination brought a relief to the public; it also generated a lot of debates from anti-vaccine and pro-vaccine groups. The controversy and debate surrounding COVID-19 vaccine influenced the decision of several people in either to accept or reject vaccination. Because of data limitations, social media data, collected through live streaming public tweets using an Application Programming Interface (API) search, is considered a viable and reliable resource to study the opinion of the public on Covid-19 vaccine hesitancy. Thus, this study examines 3 sentiment computation methods (Azure Machine Learning, VADER, and TextBlob) to analyze COVID-19 vaccine hesitancy. Five learning algorithms (Random Forest, Logistics Regression, Decision Tree, LinearSVC, and Naïve Bayes) with different combination of three vectorization methods (Doc2Vec, CountVectorizer, and TF-IDF) were deployed. Vocabulary normalization was threefold; potter stemming, lemmatization, and potter stemming with lemmatization. For each vocabulary normalization strategy, we designed, developed, and evaluated 42 models. The study shows that Covid-19 vaccine hesitancy slowly decreases over time; suggesting that the public gradually feels warm and optimistic about COVID-19 vaccination. Moreover, combining potter stemming and lemmatization increased model performances. Finally, the result of our experiment shows that TextBlob + TF-IDF + LinearSVC has the best performance in classifying public sentiment into positive, neutral, or negative with an accuracy, precision, recall and F1 score of 0.96752, 0.96921, 0.92807 and 0.94702 respectively. It means that the best performance was achieved when using TextBlob sentiment score, with TF-IDF vectorization and LinearSVC classification model. We also found out that combining two vectorizations (CountVectorizer and TF-IDF) decreases model accuracy.
Collapse
Affiliation(s)
- Miftahul Qorib
- Department of Computer Science and Information Technology, University of the District of Columbia, Washington, DC, United States
| | - Timothy Oladunni
- Department of Computer Science, Morgan State University, Baltimore, MD, United States
| | - Max Denis
- Department of Mechanical and Biomedical Engineering, University of the District of Columbia, Washington, DC, United States
| | - Esther Ososanya
- Department of Electrical and Computer Engineering, University of the District of Columbia, Washington, DC, United States
| | - Paul Cotae
- Department of Electrical and Computer Engineering, University of the District of Columbia, Washington, DC, United States
| |
Collapse
|
16
|
Liu Y, Shi J, Zhao C, Zhang C. Generalizing factors of COVID-19 vaccine attitudes in different regions: A summary generation and topic modeling approach. Digit Health 2023; 9:20552076231188852. [PMID: 37485330 PMCID: PMC10359653 DOI: 10.1177/20552076231188852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Accepted: 06/26/2023] [Indexed: 07/25/2023] Open
Abstract
Objective The goal of this study is to use summary generation and topic modeling to identify factors contributing to vaccine attitudes for three different vaccine brands, with the aim of generalizing these factors across different regions. Methods A total of 5562 tweets about three vaccine brands (Sinovac, AstraZeneca, and Pfizer) were collected from 14 December 2020 to 30 December 2021. BERTopic clustering is used to group the tweets into topics, and then contrastive learning (CL) is adopted to generate summaries of each topic. The main content of each topic is generalized into three factors that contribute to vaccine attitudes: vaccine-related factors, health system-related factors, and individual social attributes. Results BERTopic clustering outperforms Latent Dirichlet Allocation clustering in our analysis. It can also be found that using CL for summary generation helped to better model the topics, particularly at the center-point of the clustering. Our model identifies three main factors contributing to vaccine attitudes that are consistent across different regions. Conclusions Our study demonstrates the effectiveness of deep learning methods for identifying factors contributing to vaccine attitudes in different regions. By determining these factors, policymakers and medical institutions can develop more effective strategies for addressing concerns related to the vaccination process.
Collapse
Affiliation(s)
- Yang Liu
- School of Information Management, Wuhan University, Wuhan, China
| | - Jiale Shi
- School of Computer Science, Wuhan University, Wuhan, China
| | - Chenxu Zhao
- School of Computer Science, Wuhan University, Wuhan, China
| | - Chengzhi Zhang
- Department of Information Management, Nanjing University of Science & Technology, Nanjing, China
| |
Collapse
|
17
|
Danesh F, Dastani M. Text classification technique for discovering country-based publications from international COVID-19 publications. Digit Health 2023; 9:20552076231185674. [PMID: 37426592 PMCID: PMC10328158 DOI: 10.1177/20552076231185674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 06/15/2023] [Indexed: 07/11/2023] Open
Abstract
Objective The significant increase in the number of COVID-19 publications, on the one hand, and the strategic importance of this subject area for research and treatment systems in the health field, on the other hand, reveals the need for text-mining research more than ever. The main objective of the present paper is to discover country-based publications from international COVID-19 publications with text classification techniques. Methods The present paper is applied research that has been performed using text-mining techniques such as clustering and text classification. The statistical population is all COVID-19 publications from PubMed Central® (PMC), extracted from November 2019 to June 2021. Latent Dirichlet allocation (LDA) was used for clustering, and support vector machine (SVM), scikit-learn library, and Python programming language were used for text classification. Text classification was applied to discover the consistency of Iranian and international topics. Results The findings showed that seven topics were extracted using the LDA algorithm for international and Iranian publications on COVID-19. Moreover, the COVID-19 publications show the largest share in the subject area of "Social and Technology in COVID-19" at the international (April 2021) and national (February 2021) levels with 50.61% and 39.44%, respectively. The highest rate of publications at international and national levels was in April 2021 and February 2021, respectively. Conclusion One of the most important results of this study was discovering a common trend and consistency of Iranian and international publications on COVID-19. Accordingly, in the topic category "Covid-19 Proteins: Vaccine and Antibody Response," Iranian publications have a common publishing and research trend with international ones.
Collapse
Affiliation(s)
| | - Meisam Dastani
- Statistics and Information Technology Department, Gonabad University of Medical Science, Gonabad, Iran
| |
Collapse
|
18
|
Oyebode O, Orji R. Identifying adverse drug reactions from patient reviews on social media using natural language processing. Health Informatics J 2023; 29:14604582221136712. [PMID: 36857033 DOI: 10.1177/14604582221136712] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/02/2023]
Abstract
Drugs have the potential of causing adverse reactions or side effects and prior knowledge of these reactions can help prevent hospitalizations and premature deaths. Public databases of common adverse drug reactions (ADRs) depend on individual reports from drug manufacturers and health professionals. However, this passive approach to ADR surveillance has been shown to suffer from severe under-reporting. Social media, such as online health forums where patients across the globe willingly share their drug intake experience, is a viable and rich source for detecting unreported ADRs. In this paper, we design an ADR Detection Framework (ADF) using Natural Language Processing techniques to identify ADRs in drug reviews mined from social media. We demonstrate the applicability of ADF in the domain of Diabetes by identifying ADRs associated with diabetes drugs using data extracted from three online patient-based health forums: askapatient.com, webmd.com, and iodine.com. Next, we analyze and visualize the ADRs identified and present valuable insights including prevalent and less prevalent ADRs, age and gender differences in ADRs detected, as well as the previously unknown ADRs detected by our framework. Our work could promote active (real-time) ADR surveillance and also advance pharmacovigilance research.
Collapse
Affiliation(s)
- Oladapo Oyebode
- Faculty of Computer Science, 3688Dalhousie University, Halifax, NS, Canada
| | - Rita Orji
- Faculty of Computer Science, 3688Dalhousie University, Halifax, NS, Canada
| |
Collapse
|
19
|
Al-Garadi MA, Yang YC, Sarker A. The Role of Natural Language Processing during the COVID-19 Pandemic: Health Applications, Opportunities, and Challenges. Healthcare (Basel) 2022; 10:2270. [PMID: 36421593 PMCID: PMC9690240 DOI: 10.3390/healthcare10112270] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Revised: 11/03/2022] [Accepted: 11/06/2022] [Indexed: 07/30/2023] Open
Abstract
The COVID-19 pandemic is the most devastating public health crisis in at least a century and has affected the lives of billions of people worldwide in unprecedented ways. Compared to pandemics of this scale in the past, societies are now equipped with advanced technologies that can mitigate the impacts of pandemics if utilized appropriately. However, opportunities are currently not fully utilized, particularly at the intersection of data science and health. Health-related big data and technological advances have the potential to significantly aid the fight against such pandemics, including the current pandemic's ongoing and long-term impacts. Specifically, the field of natural language processing (NLP) has enormous potential at a time when vast amounts of text-based data are continuously generated from a multitude of sources, such as health/hospital systems, published medical literature, and social media. Effectively mitigating the impacts of the pandemic requires tackling challenges associated with the application and deployment of NLP systems. In this paper, we review the applications of NLP to address diverse aspects of the COVID-19 pandemic. We outline key NLP-related advances on a chosen set of topics reported in the literature and discuss the opportunities and challenges associated with applying NLP during the current pandemic and future ones. These opportunities and challenges can guide future research aimed at improving the current health and social response systems and pandemic preparedness.
Collapse
Affiliation(s)
- Mohammed Ali Al-Garadi
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37240, USA
| | - Yuan-Chi Yang
- Department of Biomedical Informatics, School of Medicine, Emory University, Atlanta, GA 30322, USA
| | - Abeed Sarker
- Department of Biomedical Informatics, School of Medicine, Emory University, Atlanta, GA 30322, USA
| |
Collapse
|
20
|
Nia ZM, Ahmadi A, Bragazzi NL, Woldegerima WA, Mellado B, Wu J, Orbinski J, Asgary A, Kong JD. A cross-country analysis of macroeconomic responses to COVID-19 pandemic using Twitter sentiments. PLoS One 2022; 17:e0272208. [PMID: 36001531 PMCID: PMC9401163 DOI: 10.1371/journal.pone.0272208] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Accepted: 07/14/2022] [Indexed: 11/19/2022] Open
Abstract
The COVID-19 pandemic has had a devastating impact on the global economy. In this paper, we use the Phillips curve to compare and analyze the macroeconomics of three different countries with distinct income levels, namely, lower-middle (Nigeria), upper-middle (South Africa), and high (Canada) income. We aim to (1) find macroeconomic changes in the three countries during the pandemic compared to pre-pandemic time, (2) compare the countries in terms of response to the COVID-19 economic crisis, and (3) compare their expected economic reaction to the COVID-19 pandemic in the near future. An advantage to our work is that we analyze macroeconomics on a monthly basis to capture the shocks and rapid changes caused by on and off rounds of lockdowns. We use the volume and social sentiments of the Twitter data to approximate the macroeconomic statistics. We apply four different machine learning algorithms to estimate the unemployment rate of South Africa and Nigeria on monthly basis. The results show that at the beginning of the pandemic the unemployment rate increased for all the three countries. However, Canada was able to control and reduce the unemployment rate during the COVID-19 pandemic. Nonetheless, in line with the Phillips curve short-run, the inflation rate of Canada increased to a level that has never occurred in more than fifteen years. Nigeria and South Africa have not been able to control the unemployment rate and did not return to the pre-COVID-19 level. Yet, the inflation rate has increased in both countries. The inflation rate is still comparable to the pre-COVID-19 level in South Africa, but based on the Phillips curve short-run, it will increase further, if the unemployment rate decreases. Unfortunately, Nigeria is experiencing a horrible stagflation and a wild increase in both unemployment and inflation rates. This shows how vulnerable lower-middle-income countries could be to lockdowns and economic restrictions. In the near future, the main concern for all the countries is the high inflation rate. This work can potentially lead to more targeted and publicly acceptable policies based on social media content.
Collapse
Affiliation(s)
- Zahra Movahedi Nia
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), York University, Toronto, Canada
- Laboratory for Industrial and Applied Mathematics, York University, Toronto, Canada
| | - Ali Ahmadi
- Faculty of Computer Engineering, K.N. Toosi University, Tehran, Iran
- Advanced Disaster, Emergency and Rapid-response Simulation (ADERSIM), York University, Toronto, Ontario, Canada
| | - Nicola L. Bragazzi
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), York University, Toronto, Canada
- Laboratory for Industrial and Applied Mathematics, York University, Toronto, Canada
| | - Woldegebriel Assefa Woldegerima
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), York University, Toronto, Canada
- Laboratory for Industrial and Applied Mathematics, York University, Toronto, Canada
| | - Bruce Mellado
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), York University, Toronto, Canada
- School of Physics, Institute for Collider Particle Physics, University of the Witwatersrand, Johannesburg, South Africa
| | - Jianhong Wu
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), York University, Toronto, Canada
- Laboratory for Industrial and Applied Mathematics, York University, Toronto, Canada
| | - James Orbinski
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), York University, Toronto, Canada
- Dahdaleh Institute for Global Health Research, York University, Toronto, Canada
| | - Ali Asgary
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), York University, Toronto, Canada
- Advanced Disaster, Emergency and Rapid-response Simulation (ADERSIM), York University, Toronto, Ontario, Canada
| | - Jude Dzevela Kong
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), York University, Toronto, Canada
- Laboratory for Industrial and Applied Mathematics, York University, Toronto, Canada
| |
Collapse
|
21
|
Bakuri AZ, Antwi-Berko D. " What Other Information Is There?": Identifying Information Gaps, Perceptions and Misconceptions on COVID-19 Among Minority Ethnic Groups in the Netherlands. FRONTIERS IN HEALTH SERVICES 2022; 2:824591. [PMID: 36925797 PMCID: PMC10012720 DOI: 10.3389/frhs.2022.824591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Accepted: 06/21/2022] [Indexed: 11/13/2022]
Abstract
Background Multiple media platforms and various resources are available for information on the novel coronavirus disease (COVID-19). Identifying people's preferences is key to building public confidence and planning for successful national or regional health intervention strategies. Methods Using exploratory mixed-methods including a short survey, interviews and participant observation, this cross-sectional study of 160 respondents from the Ghanaian-Dutch, Afro and Hindustani Surinamese-Dutch communities in Amsterdam, the Netherlands was conducted. Data collected between February to April 2021, included demographics characteristics, knowledge, opinions, preferred source of information, behavioral factors, and information gaps on COVID-19 prevention measures, responses and decision-making of respondents. Descriptive statistics and follow-up in-depth interviews were conducted to determine the relationship between respondents' demographics, information sources, and attitudes/behaviors toward COVID-19. Results The findings of this study indicated that although many of the respondents from these communities had good knowledge on COVID-19, its modes of transmission and prevention measures, their willingness to take up initiatives and prioritize self responsibility toward their health are tied to their communal life. The respondents in this study demonstrated high value for social lives and relied on their connections with friends and families in shaping, obtaining, processing and utilizing COVID-19 information to build a sense of responsibility toward the uptake of COVID-19 prevention measures despite recent decline in number of cases. Conclusion This sense of responsibility means their active participation and ownership of interventions to address the specific personal concerns and that of their community. However, different factors play influential roles toward the behavior choices of our respondents regarding the COVID-19 prevention.
Collapse
Affiliation(s)
- Amisah Zenabu Bakuri
- Amsterdam Institute of Social Science Research, University of Amsterdam, Amsterdam, Netherlands
- Center for Conflict Studies-History of International Relations, Utrecht Unviversity, Utrecht, Netherlands
| | - Daniel Antwi-Berko
- Department of Basic and Applied Biology, University of Energy and Natural Resources, Sunyani, Ghana
| |
Collapse
|
22
|
Marcec R, Likic R. Using Twitter for sentiment analysis towards AstraZeneca/Oxford, Pfizer/BioNTech and Moderna COVID-19 vaccines. Postgrad Med J 2022; 98:544-550. [PMID: 34373343 PMCID: PMC8354810 DOI: 10.1136/postgradmedj-2021-140685] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Accepted: 07/30/2021] [Indexed: 02/04/2023]
Abstract
INTRODUCTION A worldwide vaccination campaign is underway to bring an end to the SARS-CoV-2 pandemic; however, its success relies heavily on the actual willingness of individuals to get vaccinated. Social media platforms such as Twitter may prove to be a valuable source of information on the attitudes and sentiment towards SARS-CoV-2 vaccination that can be tracked almost instantaneously. MATERIALS AND METHODS The Twitter academic Application Programming Interface was used to retrieve all English-language tweets mentioning AstraZeneca/Oxford, Pfizer/BioNTech and Moderna vaccines in 4 months from 1 December 2020 to 31 March 2021. Sentiment analysis was performed using the AFINN lexicon to calculate the daily average sentiment of tweets which was evaluated longitudinally and comparatively for each vaccine throughout the 4 months. RESULTS A total of 701 891 tweets have been retrieved and included in the daily sentiment analysis. The sentiment regarding Pfizer and Moderna vaccines appeared positive and stable throughout the 4 months, with no significant differences in sentiment between the months. In contrast, the sentiment regarding the AstraZeneca/Oxford vaccine seems to be decreasing over time, with a significant decrease when comparing December with March (p<0.0000000001, mean difference=-0.746, 95% CI=-0.915 to -0.577). CONCLUSION Lexicon-based Twitter sentiment analysis is a valuable and easily implemented tool to track the sentiment regarding SARS-CoV-2 vaccines. It is worrisome that the sentiment regarding the AstraZeneca/Oxford vaccine appears to be turning negative over time, as this may boost hesitancy rates towards this specific SARS-CoV-2 vaccine.
Collapse
Affiliation(s)
- Robert Marcec
- University of Zagreb School of Medicine, Zagreb, Croatia
| | - Robert Likic
- Department of Internal Medicine, Division of Clinical Pharmacology and Therapeutics, Clinical Hospital Centre Zagreb and University of Zagreb Medical School, Zagreb, Croatia
| |
Collapse
|
23
|
Oyebode O, Ndulue C, Mulchandani D, Suruliraj B, Adib A, Orji FA, Milios E, Matwin S, Orji R. COVID-19 Pandemic: Identifying Key Issues Using Social Media and Natural Language Processing. JOURNAL OF HEALTHCARE INFORMATICS RESEARCH 2022; 6:174-207. [PMID: 35194569 PMCID: PMC8853170 DOI: 10.1007/s41666-021-00111-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2021] [Revised: 11/03/2021] [Accepted: 12/01/2021] [Indexed: 11/10/2022]
Abstract
The COVID-19 pandemic has affected people's lives in many ways. Social media data can reveal public perceptions and experience with respect to the pandemic, and also reveal factors that hamper or support efforts to curb global spread of the disease. In this paper, we analyzed COVID-19-related comments collected from six social media platforms using natural language processing (NLP) techniques. We identified relevant opinionated keyphrases and their respective sentiment polarity (negative or positive) from over 1 million randomly selected comments, and then categorized them into broader themes using thematic analysis. Our results uncover 34 negative themes out of which 17 are economic, socio-political, educational, and political issues. Twenty (20) positive themes were also identified. We discuss the negative issues and suggest interventions to tackle them based on the positive themes and research evidence.
Collapse
Affiliation(s)
- Oladapo Oyebode
- Faculty of Computer Science, Dalhousie University, Halifax, NS B3H 4R2 Canada
| | - Chinenye Ndulue
- Faculty of Computer Science, Dalhousie University, Halifax, NS B3H 4R2 Canada
| | - Dinesh Mulchandani
- Faculty of Computer Science, Dalhousie University, Halifax, NS B3H 4R2 Canada
| | | | - Ashfaq Adib
- Faculty of Computer Science, Dalhousie University, Halifax, NS B3H 4R2 Canada
| | - Fidelia Anulika Orji
- Department of Computer Science, University of Saskatchewan, Saskatoon, SK S7N 5C9 Canada
| | - Evangelos Milios
- Faculty of Computer Science, Dalhousie University, Halifax, NS B3H 4R2 Canada
| | - Stan Matwin
- Faculty of Computer Science, Dalhousie University, Halifax, NS B3H 4R2 Canada
- Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland
| | - Rita Orji
- Faculty of Computer Science, Dalhousie University, Halifax, NS B3H 4R2 Canada
| |
Collapse
|
24
|
Teague SJ, Shatte ABR, Weller E, Fuller-Tyszkiewicz M, Hutchinson DM. Methods and Applications of Social Media Monitoring of Mental Health During Disasters: Scoping Review. JMIR Ment Health 2022; 9:e33058. [PMID: 35225815 PMCID: PMC8922153 DOI: 10.2196/33058] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Revised: 11/01/2021] [Accepted: 11/26/2021] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND With the increasing frequency and magnitude of disasters internationally, there is growing research and clinical interest in the application of social media sites for disaster mental health surveillance. However, important questions remain regarding the extent to which unstructured social media data can be harnessed for clinically meaningful decision-making. OBJECTIVE This comprehensive scoping review synthesizes interdisciplinary literature with a particular focus on research methods and applications. METHODS A total of 6 health and computer science databases were searched for studies published before April 20, 2021, resulting in the identification of 47 studies. Included studies were published in peer-reviewed outlets and examined mental health during disasters or crises by using social media data. RESULTS Applications across 31 mental health issues were identified, which were grouped into the following three broader themes: estimating mental health burden, planning or evaluating interventions and policies, and knowledge discovery. Mental health assessments were completed by primarily using lexical dictionaries and human annotations. The analyses included a range of supervised and unsupervised machine learning, statistical modeling, and qualitative techniques. The overall reporting quality was poor, with key details such as the total number of users and data features often not being reported. Further, biases in sample selection and related limitations in generalizability were often overlooked. CONCLUSIONS The application of social media monitoring has considerable potential for measuring mental health impacts on populations during disasters. Studies have primarily conceptualized mental health in broad terms, such as distress or negative affect, but greater focus is required on validating mental health assessments. There was little evidence for the clinical integration of social media-based disaster mental health monitoring, such as combining surveillance with social media-based interventions or developing and testing real-world disaster management tools. To address issues with study quality, a structured set of reporting guidelines is recommended to improve the methodological quality, replicability, and clinical relevance of future research on the social media monitoring of mental health during disasters.
Collapse
Affiliation(s)
- Samantha J Teague
- Centre for Social and Early Emotional Development, School of Psychology, Deakin University, Geelong, Australia
- Division of Tropical Health and Medicine, Department of Psychology, College of Healthcare Sciences, James Cook University, Townsville, Australia
| | - Adrian B R Shatte
- School of Engineering, Information Technology & Physical Sciences, Federation University, Melbourne, Australia
| | - Emmelyn Weller
- Centre for Social and Early Emotional Development, School of Psychology, Deakin University, Geelong, Australia
| | - Matthew Fuller-Tyszkiewicz
- Centre for Social and Early Emotional Development, School of Psychology, Deakin University, Geelong, Australia
| | - Delyse M Hutchinson
- Centre for Social and Early Emotional Development, School of Psychology, Deakin University, Geelong, Australia
- Murdoch Children's Research Institute, Melbourne, Australia
- Centre for Adolescent Health, Royal Children's Hospital, Melbourne, Australia
- Department of Paediatrics, University of Melbourne, Melbourne, Australia
- National Drug and Alcohol Research Centre, University of New South Wales, Sydney, Australia
| |
Collapse
|
25
|
Oduntan A, Oyebode O, Beltran AH, Fowles J, Steeves D, Orji R. I Let Depression and Anxiety Drown Me: Identifying Factors Associated with Resilience Based on Journaling using Machine Learning and Thematic Analysis. IEEE J Biomed Health Inform 2022; 26:3397-3408. [PMID: 35139031 DOI: 10.1109/jbhi.2022.3149862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Over the years, there has been a global increase in the use of technology to deliver interventions for health and wellness, such as improving peoples mental health and resilience. An example of such technology is the Q-Life app which aims to improve peoples resilience to stress and adverse life events through various coping mechanisms, including journaling. Using a combination of sentiment and thematic analysis, this paper presents the results of analyzing 6023 journal entries from 755 users. We uncover both positive and negative factors that are associated with resilience. First, we apply two lexicon-based and eight machine learning (ML) techniques to classify journal entries into positive or negative sentiment polarity, and then compare the performance of these classifiers to determine the best performing classifier overall. Our results show that Support Vector Machine (SVM) is the best classifier overall, outperforming other ML classifiers and lexicon-based classifiers with a high F1-score of 89.7%. Second, we conduct thematic analysis of negative and positive journal entries to identify themes representing factors associated with resilience either negatively or positively, and to determine various coping mechanisms. Our findings reveal 14 negative themes such as stress, worry, loneliness, lack of motivation, sickness, relationship issues, as well as depression and anxiety. Also, 13 positive themes emerged including self-efficacy, gratitude, socialization, progression, relaxation, and physical activity. Seven (7) coping mechanisms are also identified including time management, quality sleep, and mindfulness. Finally, we reflect on our findings and suggest technological interventions that address the negative factors to promote resilience.
Collapse
|
26
|
Gómez-Salgado J, Palomino-Baldeón JC, Ortega-Moreno M, Fagundo-Rivera J, Allande-Cussó R, Ruiz-Frutos C. COVID-19 information received by the Peruvian population, during the first phase of the pandemic, and its association with developing psychological distress: Information about COVID-19 and distress in Peru. Medicine (Baltimore) 2022; 101:e28625. [PMID: 35119007 PMCID: PMC8812631 DOI: 10.1097/md.0000000000028625] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Revised: 11/30/2021] [Accepted: 12/31/2021] [Indexed: 01/04/2023] Open
Abstract
ABSTRACT It is suspected that the information the population has about coronavirus disease 2019 (COVID-19) determines both its preventive measures and its effects on mental health. The internet and social media are the sources that have largely replaced the official and traditional channels of information. The objective of this study is to analyse the influence of the sources used by the population in Peru to obtain information on COVID-19 and its association with developing psychological distress (PD) and preventive measures against contagion.1699 questionnaires were analysed. A previously validated instrument adapted to Peru was used. Participants were questioned about the information received regarding COVID-19, its sources, time of exposition, assessment, or beliefs about it. Mental health was measured with the Goldberg General Health Questionnaire. Descriptive and bivariate analysis were performed, developing a classification and regression tree for PD based on beliefs and information about the pandemic.The most used source of information on COVID-19 in Peru was social media and this is associated with developing PD, both in the general population and among health professionals. The quality of the information about treatments for COVID-19 is associated with PD in the general population, whereas prognosis generates more distress among healthcare professionals. The biggest concern is transmitting the virus to family members, close persons, or patients, with more confidence in health professionals than in the health system.The health authorities should use the social media to transmit quality information about COVID-19 and, at the same time, to gather in real time the opinions on the implemented preventive measures. For all, this it is necessary to have higher credibility in the population to increase the confidence in the health system, looking at basic aspects for compliance with prevention measures and improvement of mental health.
Collapse
Affiliation(s)
- Juan Gómez-Salgado
- Department of Sociology, Social Work and Public Health, Faculty of Labour Sciences, University of Huelva, Huelva, Spain
- Safety and Health Postgraduate Programme, Universidad Espíritu Santo, Guayaquil, Ecuador
| | | | | | | | - Regina Allande-Cussó
- Department of Nursing, Faculty of Nursing, Physiotherapy and Podiatry, University of Seville, Seville, Spain
| | - Carlos Ruiz-Frutos
- Department of Sociology, Social Work and Public Health, Faculty of Labour Sciences, University of Huelva, Huelva, Spain
- Safety and Health Postgraduate Programme, Universidad Espíritu Santo, Guayaquil, Ecuador
| |
Collapse
|
27
|
Gulf Countries’ Citizens’ Acceptance of COVID-19 Vaccines—A Machine Learning Approach. MATHEMATICS 2022. [DOI: 10.3390/math10030467] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
The COVID-19 pandemic created a global emergency in many sectors. The spread of the disease can be subdued through timely vaccination. The COVID-19 vaccination process in various countries is ongoing and is slowing down due to multiple factors. Many studies on European countries and the USA have been conducted and have highlighted the public’s concern that over-vaccination results in slowing the vaccination rate. Similarly, we analyzed a collection of data from the gulf countries’ citizens’ COVID-19 vaccine-related discourse shared on social media websites, mainly via Twitter. The people’s feedback regarding different types of vaccines needs to be considered to increase the vaccination process. In this paper, the concerns of Gulf countries’ people are highlighted to lessen the vaccine hesitancy. The proposed approach emphasizes the Gulf region-specific concerns related to COVID-19 vaccination accurately using machine learning (ML)-based methods. The collected data were filtered and tokenized to analyze the sentiments extracted using three different methods: Ratio, TextBlob, and VADER methods. The sentiment-scored data were classified into positive and negative tweeted data using a proposed LSTM method. Subsequently, to obtain more confidence in classification, the in-depth features from the proposed LSTM were extracted and given to four different ML classifiers. The ratio, TextBlob, and VADER sentiment scores were separately provided to LSTM and four machine learning classifiers. The VADER sentiment scores had the best classification results using fine-KNN and Ensemble boost with 94.01% classification accuracy. Given the improved accuracy, the proposed scheme is robust and confident in classifying and determining sentiments in Twitter discourse.
Collapse
|
28
|
Huangfu L, Mo Y, Zhang P, Zeng D, He S. Analyzing COVID-19 vaccine tweets following vaccine rollout: A sentiment-based topic modeling approach. J Med Internet Res 2021; 24:e31726. [PMID: 34783665 PMCID: PMC8827037 DOI: 10.2196/31726] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Revised: 11/12/2021] [Accepted: 11/13/2021] [Indexed: 02/07/2023] Open
Abstract
Background COVID-19 vaccines are one of the most effective preventive strategies for containing the pandemic. Having a better understanding of the public’s conceptions of COVID-19 vaccines may aid in the effort to promptly and thoroughly vaccinate the community. However, because no empirical research has yet fully explored the public’s vaccine awareness through sentiment–based topic modeling, little is known about the evolution of public attitude since the rollout of COVID-19 vaccines. Objective In this study, we specifically focused on tweets about COVID-19 vaccines (Pfizer, Moderna, AstraZeneca, and Johnson & Johnson) after vaccines became publicly available. We aimed to explore the overall sentiments and topics of tweets about COVID-19 vaccines, as well as how such sentiments and main concerns evolved. Methods We collected 1,122,139 tweets related to COVID-19 vaccines from December 14, 2020, to April 30, 2021, using Twitter’s application programming interface. We removed retweets and duplicate tweets to avoid data redundancy, which resulted in 857,128 tweets. We then applied sentiment–based topic modeling by using the compound score to determine sentiment polarity and the coherence score to determine the optimal topic number for different sentiment polarity categories. Finally, we calculated the topic distribution to illustrate the topic evolution of main concerns. Results Overall, 398,661 (46.51%) were positive, 204,084 (23.81%) were negative, 245,976 (28.70%) were neutral, 6899 (0.80%) were highly positive, and 1508 (0.18%) were highly negative sentiments. The main topics of positive and highly positive tweets were planning for getting vaccination (251,979/405,560, 62.13%), getting vaccination (76,029/405,560, 18.75%), and vaccine information and knowledge (21,127/405,560, 5.21%). The main concerns in negative and highly negative tweets were vaccine hesitancy (115,206/205,592, 56.04%), extreme side effects of the vaccines (19,690/205,592, 9.58%), and vaccine supply and rollout (17,154/205,592, 8.34%). During the study period, negative sentiment trends were stable, while positive sentiments could be easily influenced. Topic heatmap visualization demonstrated how main concerns changed during the current widespread vaccination campaign. Conclusions To the best of our knowledge, this is the first study to evaluate public COVID-19 vaccine awareness and awareness trends on social media with automated sentiment–based topic modeling after vaccine rollout. Our results can help policymakers and research communities track public attitudes toward COVID-19 vaccines and help them make decisions to promote the vaccination campaign.
Collapse
Affiliation(s)
- Luwen Huangfu
- San Diego State University, 5500 Campanile Dr, San Diego, US.,Center for Human Dynamics in the Mobile Age, San Diego, US
| | - Yiwen Mo
- San Diego State University, 5500 Campanile Dr, San Diego, US
| | - Peijie Zhang
- The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, CN
| | - Daniel Zeng
- University of Chinese Academy of Sciences, No. 95 Zhongguancun East Road, Haidian District, Beijing, CN.,The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, CN
| | - Saike He
- University of Chinese Academy of Sciences, No. 95 Zhongguancun East Road, Haidian District, Beijing, CN.,The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, CN
| |
Collapse
|
29
|
Castilla-Puentes R, Pesa J, Brethenoux C, Furey P, Gil Valletta L, Falcone T. Applying the Health Belief Model to Characterize Racial/Ethnic Differences in Digital Conversations Related to Depression Pre– and Mid–COVID-19 (Preprint). JMIR Form Res 2021; 6:e33637. [PMID: 35275834 PMCID: PMC9217151 DOI: 10.2196/33637] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Revised: 01/25/2022] [Accepted: 02/22/2022] [Indexed: 11/13/2022] Open
Abstract
Background Objective Methods Results Conclusions
Collapse
Affiliation(s)
- Ruby Castilla-Puentes
- Janssen Research & Development, LLC, Titusville, NJ, United States
- Center for Public Health Practice, Drexel University, Philadelphia, PA, United States
- Hispanic Organization for Leadership and Advancement, Johnson & Johnson, Employee Resource Group, New Brunswick, NJ, United States
| | - Jacqueline Pesa
- Janssen Scientific Affairs, LLC, Titusville, NJ, United States
| | | | | | | | - Tatiana Falcone
- Department of Psychiatry and Psychology, Cleveland Clinic Lerner College of Medicine, Cleveland, OH, United States
| |
Collapse
|
30
|
Cardinale M. Preparing athletes and staff for the first "pandemic" Olympic Games. J Sports Med Phys Fitness 2021; 61:1052-1060. [PMID: 34256538 DOI: 10.23736/s0022-4707.21.12745-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Severe acute respiratory syndrome Coronavirus 2 (SARS-CoV-2) is the virus causing Coronavirus disease 2019 (COVID-19). Since the World Health Organization declared the outbreak of pandemic on March 11, 2020, cases have been increasing around the World with more than 3-million deaths recorded and a daily number of COVID-19 cases 20 times higher than when the Olympics were postponed, at the time of writing. Governments adopted various lockdown measures forcing isolation for many weeks/months depending on the evolution of the disease in each country. The rapid transmission of the disease forced the Tokyo 2020 Olympics to be postponed for one year. Travel restrictions, quarantine requirements and isolation have been the norm for many athletes in preparation for the Olympic Games. Also, due to the measures put in place to reduce the spread of the disease, sporting facilities have been closed and competitions cancelled forcing athletes and their staff to find alternative solutions to maintain performance and continue preparing for the Olympics. This unique challenge is affecting the whole World, and while vaccination programs start to be deployed, in a few months the world will see the first Olympic Games' edition during a pandemic. The aim of this special paper was to consider the various challenges posed by the COVID pandemic and to provide information for coaching support staff to improve the preparation for Tokyo Olympics as well as consider the possible performance implications of this unique Olympic edition.
Collapse
Affiliation(s)
- Marco Cardinale
- Department of Research and Scientific Support, Aspetar Orthopedic and Sports Medicine Hospital, Doha, Qatar - .,Department of Surgery and Interventional Medicine, Institute of Sport Exercise and Health, University College London, London, UK -
| |
Collapse
|