1
|
Wessel D, Pogrebnyakov N. Using Social Media as a Source of Real-World Data for Pharmaceutical Drug Development and Regulatory Decision Making. Drug Saf 2024; 47:495-511. [PMID: 38446405 PMCID: PMC11018692 DOI: 10.1007/s40264-024-01409-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/07/2024] [Indexed: 03/07/2024]
Abstract
INTRODUCTION While pharmaceutical companies aim to leverage real-world data (RWD) to bridge the gap between clinical drug development and real-world patient outcomes, extant research has mainly focused on the use of social media in a post-approval safety-surveillance setting. Recent regulatory and technological developments indicate that social media may serve as a rich source to expand the evidence base to pre-approval and drug development activities. However, use cases related to drug development have been largely omitted, thereby missing some of the benefits of RWD. In addition, an applied end-to-end understanding of RWD rooted in both industry and regulations is lacking. OBJECTIVE We aimed to investigate how social media can be used as a source of RWD to support regulatory decision making and drug development in the pharmaceutical industry. We aimed to specifically explore the data pipeline and examine how social-media derived RWD can align with regulatory guidance from the US Food and Drug Administration and industry needs. METHODS A machine learning pipeline was developed to extract patient insights related to anticoagulants from X (Twitter) data. These findings were then analysed from an industry perspective, and complemented by interviews with professionals from a pharmaceutical company. RESULTS The analysis reveals several use cases where RWD derived from social media can be beneficial, particularly in generating hypotheses around patient and therapeutic area needs. We also note certain limitations of social media data, particularly around inferring causality. CONCLUSIONS Social media display considerable potential as a source of RWD for guiding efforts in pharmaceutical drug development and pre-approval settings. Although further regulatory guidance on the use of social media for RWD is needed to encourage its use, regulatory and technological developments are suggested to warrant at least exploratory uses for drug development.
Collapse
Affiliation(s)
- Didrik Wessel
- Copenhagen Business School, Frederiksberg, Denmark.
- , Nørrebrogade 18A 3TH, 2200, Copenhagen N, Denmark.
| | | |
Collapse
|
2
|
Dong F, Guo W, Liu J, Patterson TA, Hong H. BERT-based language model for accurate drug adverse event extraction from social media: implementation, evaluation, and contributions to pharmacovigilance practices. Front Public Health 2024; 12:1392180. [PMID: 38716250 PMCID: PMC11074401 DOI: 10.3389/fpubh.2024.1392180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Accepted: 04/11/2024] [Indexed: 05/18/2024] Open
Abstract
Introduction Social media platforms serve as a valuable resource for users to share health-related information, aiding in the monitoring of adverse events linked to medications and treatments in drug safety surveillance. However, extracting drug-related adverse events accurately and efficiently from social media poses challenges in both natural language processing research and the pharmacovigilance domain. Method Recognizing the lack of detailed implementation and evaluation of Bidirectional Encoder Representations from Transformers (BERT)-based models for drug adverse event extraction on social media, we developed a BERT-based language model tailored to identifying drug adverse events in this context. Our model utilized publicly available labeled adverse event data from the ADE-Corpus-V2. Constructing the BERT-based model involved optimizing key hyperparameters, such as the number of training epochs, batch size, and learning rate. Through ten hold-out evaluations on ADE-Corpus-V2 data and external social media datasets, our model consistently demonstrated high accuracy in drug adverse event detection. Result The hold-out evaluations resulted in average F1 scores of 0.8575, 0.9049, and 0.9813 for detecting words of adverse events, words in adverse events, and words not in adverse events, respectively. External validation using human-labeled adverse event tweets data from SMM4H further substantiated the effectiveness of our model, yielding F1 scores 0.8127, 0.8068, and 0.9790 for detecting words of adverse events, words in adverse events, and words not in adverse events, respectively. Discussion This study not only showcases the effectiveness of BERT-based language models in accurately identifying drug-related adverse events in the dynamic landscape of social media data, but also addresses the need for the implementation of a comprehensive study design and evaluation. By doing so, we contribute to the advancement of pharmacovigilance practices and methodologies in the context of emerging information sources like social media.
Collapse
Affiliation(s)
| | | | | | | | - Huixiao Hong
- National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, United States
| |
Collapse
|
3
|
Watson T, Robertson C. Silencing the FDA's Voice - Drug Information on Trial. N Engl J Med 2023; 389:2312-2314. [PMID: 38108407 DOI: 10.1056/nejmp2311006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Affiliation(s)
- Tina Watson
- From Boston University School of Law, Boston
| | | |
Collapse
|
4
|
Salholz-Hillel M, Pugh-Jones M, Hildebrand N, Schult TA, Schwietering J, Grabitz P, Carlisle BG, Goldacre B, Strech D, DeVito NJ. Dissemination of Registered COVID-19 Clinical Trials (DIRECCT): a cross-sectional study. BMC Med 2023; 21:475. [PMID: 38031096 PMCID: PMC10687901 DOI: 10.1186/s12916-023-03161-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Accepted: 11/07/2023] [Indexed: 12/01/2023] Open
Abstract
BACKGROUND The results of clinical trials should be completely and rapidly reported during public health emergencies such as COVID-19. This study aimed to examine when, and where, the results of COVID-19 clinical trials were disseminated throughout the first 18 months of the pandemic. METHODS Clinical trials for COVID-19 treatment or prevention were identified from the WHO ICTRP database. All interventional trials with a registered completion date ≤ 30 June 2021 were included. Trial results, published as preprints, journal articles, or registry results, were located using automated and manual techniques across PubMed, Google Scholar, Google, EuropePMC, CORD-19, the Cochrane COVID-19 Study Register, and clinical trial registries. Our main analysis reports the rate of dissemination overall and per route, and the time from registered completion to results using Kaplan-Meier methods, with additional subgroup and sensitivity analyses reported. RESULTS Overall, 1643 trials with completion dates ranging from 46 to 561 days prior to the start of results searches were included. The cumulative probability of reporting was 12.5% at 3 months from completion, 21.6% at 6 months, and 32.8% at 12 months. Trial results were most commonly disseminated in journals (n = 278 trials, 69.2%); preprints were available for 194 trials (48.3%), 86 (44.3%) of which converted to a full journal article. Trials completed earlier in the pandemic were reported more rapidly than those later in the pandemic, and those involving ivermectin were more rapidly reported than other common interventions. Results were robust to various sensitivity analyses except when considering only trials in a "completed" status on the registry, which substantially increased reporting rates. Poor trial registry data on completion status and dates limits the precision of estimates. CONCLUSIONS COVID-19 trials saw marginal increases in reporting rates compared to standard practice; most registered trials failed to meet even the 12-month non-pandemic standard. Preprints were common, complementing journal publication; however, registries were underutilized for rapid reporting. Maintaining registry data enables accurate representation of clinical research; failing to do so undermines these registries' use for public accountability and analysis. Addressing rapid reporting and registry data quality must be emphasized at global, national, and institutional levels.
Collapse
Affiliation(s)
- Maia Salholz-Hillel
- QUEST Center for Responsible Research, Berlin Institute of Health (BIH), Charite Universitätsmedizin Berlin, Berlin, Germany
| | - Molly Pugh-Jones
- QUEST Center for Responsible Research, Berlin Institute of Health (BIH), Charite Universitätsmedizin Berlin, Berlin, Germany
| | - Nicole Hildebrand
- QUEST Center for Responsible Research, Berlin Institute of Health (BIH), Charite Universitätsmedizin Berlin, Berlin, Germany
| | - Tjada A Schult
- QUEST Center for Responsible Research, Berlin Institute of Health (BIH), Charite Universitätsmedizin Berlin, Berlin, Germany
| | - Johannes Schwietering
- QUEST Center for Responsible Research, Berlin Institute of Health (BIH), Charite Universitätsmedizin Berlin, Berlin, Germany
| | - Peter Grabitz
- QUEST Center for Responsible Research, Berlin Institute of Health (BIH), Charite Universitätsmedizin Berlin, Berlin, Germany
| | - Benjamin Gregory Carlisle
- QUEST Center for Responsible Research, Berlin Institute of Health (BIH), Charite Universitätsmedizin Berlin, Berlin, Germany
| | - Ben Goldacre
- Bennett Institute for Applied Data Science, Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, UK
| | - Daniel Strech
- QUEST Center for Responsible Research, Berlin Institute of Health (BIH), Charite Universitätsmedizin Berlin, Berlin, Germany
| | - Nicholas J DeVito
- Bennett Institute for Applied Data Science, Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, UK.
| |
Collapse
|
5
|
Litvinova O, Matin FB, Matin M, Zima-Kulisiewicz B, Tomasik C, Siddiquea BN, Stoyanov J, Atanasov AG, Willschke H. Patient safety discourse in a pandemic: a Twitter hashtag analysis study on #PatientSafety. Front Public Health 2023; 11:1268730. [PMID: 38035302 PMCID: PMC10687459 DOI: 10.3389/fpubh.2023.1268730] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Accepted: 10/27/2023] [Indexed: 12/02/2023] Open
Abstract
Background The digitalization of medicine is becoming a transformative force in modern healthcare systems. This study aims to investigate discussions regarding patient safety, as well as summarize perceived approaches to mitigating risks of adverse events expressed through the #PatientSafety Twitter hashtag during the COVID-19 pandemic. Methods This research is grounded in the analysis of data extracted from Twitter under the hashtag #PatientSafety between December 1, 2019 and February 1, 2023. Symplur Signals, which represents a tool offering a method to monitor tweets containing hashtags registered with the Symplur Healthcare Hashtag Project, was used for analyzing the tweets shared in the study period. For text analytics of the relevant data, we further used the word cloud generator MonkeyLearn, and VOSviewer. Results The analysis encompasses 358'809 tweets that were shared by 90'079 Twitter users, generating a total of 1'183'384'757 impressions. Physicians contributed to 18.65% of all tweets, followed by other healthcare professionals (14.31%), and health-focused individuals (10.91%). Geographically, more than a third of tweets (60.90%) were published in the United States. Canada and India followed in second and third positions, respectively. Blocks of trending terms of greater interest to the global Twitter community within the hashtag #PatientSafety were determined to be: "Patient," "Practical doctors," and "Health Care Safety Management." The findings demonstrate the engagement of the Twitter community with COVID-19 and problems related to the training, experience of doctors and patients during a pandemic, communication, the vaccine safety and effectiveness, and potential use of off-label drugs. Noteworthy, in the field of pharmacovigilance, Twitter has the possibility of identifying adverse reactions associated with the use of drugs, including vaccines. The issue of medical errors has been also discussed by Twitter users using the hashtag #PatientSafety. Conclusion It is clear that various stakeholders, including students, medical practitioners, health organizations, pharmaceutical companies, and regulatory bodies, leverage Twitter to rapidly exchange medical information, data on the disease symptoms, and the drug effects. Consequently, there is a need to further integrate Twitter-derived data into the operational routines of healthcare organizations.
Collapse
Affiliation(s)
- Olena Litvinova
- Department of Management and Quality Assurance in Pharmacy, National University of Pharmacy of the Ministry of Health of Ukraine, Kharkiv, Ukraine
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Medical University of Vienna, Vienna, Austria
| | - Farhan Bin Matin
- Department of Pharmacy, East West University, Aftabnagar, Dhaka, Bangladesh
| | - Maima Matin
- Institute of Genetics and Animal Biotechnology of the Polish Academy of Sciences, Jastrzebiec, Poland
| | - Bogumila Zima-Kulisiewicz
- Institute of Genetics and Animal Biotechnology of the Polish Academy of Sciences, Jastrzebiec, Poland
| | - Cyprian Tomasik
- Institute of Genetics and Animal Biotechnology of the Polish Academy of Sciences, Jastrzebiec, Poland
| | - Bodrun Naher Siddiquea
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| | | | - Atanas G. Atanasov
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Medical University of Vienna, Vienna, Austria
- Institute of Genetics and Animal Biotechnology of the Polish Academy of Sciences, Jastrzebiec, Poland
| | - Harald Willschke
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Medical University of Vienna, Vienna, Austria
- Department of Anaesthesia, Intensive Care Medicine and Pain Medicine, Medical University Vienna, Vienna, Austria
| |
Collapse
|
6
|
Wu J, Wang L, Hua Y, Li M, Zhou L, Bates DW, Yang J. Trend and Co-occurrence Network of COVID-19 Symptoms From Large-Scale Social Media Data: Infoveillance Study. J Med Internet Res 2023; 25:e45419. [PMID: 36812402 PMCID: PMC10131634 DOI: 10.2196/45419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 02/04/2023] [Accepted: 02/19/2023] [Indexed: 02/24/2023] Open
Abstract
BACKGROUND For an emergent pandemic, such as COVID-19, the statistics of symptoms based on hospital data may be biased or delayed due to the high proportion of asymptomatic or mild-symptom infections that are not recorded in hospitals. Meanwhile, the difficulty in accessing large-scale clinical data also limits many researchers from conducting timely research. OBJECTIVE Given the wide coverage and promptness of social media, this study aimed to present an efficient workflow to track and visualize the dynamic characteristics and co-occurrence of symptoms for the COVID-19 pandemic from large-scale and long-term social media data. METHODS This retrospective study included 471,553,966 COVID-19-related tweets from February 1, 2020, to April 30, 2022. We curated a hierarchical symptom lexicon for social media containing 10 affected organs/systems, 257 symptoms, and 1808 synonyms. The dynamic characteristics of COVID-19 symptoms over time were analyzed from the perspectives of weekly new cases, overall distribution, and temporal prevalence of reported symptoms. The symptom evolutions between virus strains (Delta and Omicron) were investigated by comparing the symptom prevalence during their dominant periods. A co-occurrence symptom network was developed and visualized to investigate inner relationships among symptoms and affected body systems. RESULTS This study identified 201 COVID-19 symptoms and grouped them into 10 affected body systems. There was a significant correlation between the weekly quantity of self-reported symptoms and new COVID-19 infections (Pearson correlation coefficient=0.8528; P<.001). We also observed a 1-week leading trend (Pearson correlation coefficient=0.8802; P<.001) between them. The frequency of symptoms showed dynamic changes as the pandemic progressed, from typical respiratory symptoms in the early stage to more musculoskeletal and nervous symptoms in the later stages. We identified the difference in symptoms between the Delta and Omicron periods. There were fewer severe symptoms (coma and dyspnea), more flu-like symptoms (throat pain and nasal congestion), and fewer typical COVID symptoms (anosmia and taste altered) in the Omicron period than in the Delta period (all P<.001). Network analysis revealed co-occurrences among symptoms and systems corresponding to specific disease progressions, including palpitations (cardiovascular) and dyspnea (respiratory), and alopecia (musculoskeletal) and impotence (reproductive). CONCLUSIONS This study identified more and milder COVID-19 symptoms than clinical research and characterized the dynamic symptom evolution based on 400 million tweets over 27 months. The symptom network revealed potential comorbidity risk and prognostic disease progression. These findings demonstrate that the cooperation of social media and a well-designed workflow can depict a holistic picture of pandemic symptoms to complement clinical studies.
Collapse
Affiliation(s)
- Jiageng Wu
- School of Public Health and the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China.,The Key Laboratory of Intelligent Preventive Medicine of Zhejiang Province, Hangzhou, China
| | - Lumin Wang
- School of Public Health and the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China.,The Key Laboratory of Intelligent Preventive Medicine of Zhejiang Province, Hangzhou, China
| | - Yining Hua
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States.,Division of General Internal Medicine and Primary Care, Brigham and Women's Hospital, Boston, MA, United States
| | - Minghui Li
- School of Public Health and the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China.,The Key Laboratory of Intelligent Preventive Medicine of Zhejiang Province, Hangzhou, China
| | - Li Zhou
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States.,Division of General Internal Medicine and Primary Care, Brigham and Women's Hospital, Boston, MA, United States
| | - David W Bates
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States.,Division of General Internal Medicine and Primary Care, Brigham and Women's Hospital, Boston, MA, United States
| | - Jie Yang
- School of Public Health and the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China.,The Key Laboratory of Intelligent Preventive Medicine of Zhejiang Province, Hangzhou, China
| |
Collapse
|
7
|
Shafiee A, Teymouri Athar MM, Mozhgani SH. A twisting tale of misinformation: should ivermectin be approved as a treatment for COVID-19 disease? Future Virol 2023:10.2217/fvl-2023-0006. [PMID: 36915278 PMCID: PMC10005062 DOI: 10.2217/fvl-2023-0006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Accepted: 02/17/2023] [Indexed: 03/12/2023]
Abstract
This editorial examines what has caused the evidence around ivermectin to be so controversial, provides a brief analysis of recently published evidence, and highlights why it is important to learn lessons from ivermectin for future re-purposed drugs.
Collapse
Affiliation(s)
- Arman Shafiee
- Clinical Research Development Unit, Alborz University of Medical Sciences, Karaj, Iran.,Student Research Committee, School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
| | | | - Sayed-Hamidreza Mozhgani
- Department of Microbiology, School of Medicine, Alborz University of Medical Sciences, Karaj, Iran.,Non-communicable Disease Research Center, Alborz University of Medical Sciences, Karaj, Iran
| |
Collapse
|
8
|
Thakur N. MonkeyPox2022Tweets: A Large-Scale Twitter Dataset on the 2022 Monkeypox Outbreak, Findings from Analysis of Tweets, and Open Research Questions. Infect Dis Rep 2022; 14:855-883. [PMID: 36412745 PMCID: PMC9680479 DOI: 10.3390/idr14060087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Revised: 10/13/2022] [Accepted: 11/08/2022] [Indexed: 11/16/2022] Open
Abstract
The mining of Tweets to develop datasets on recent issues, global challenges, pandemics, virus outbreaks, emerging technologies, and trending matters has been of significant interest to the scientific community in the recent past, as such datasets serve as a rich data resource for the investigation of different research questions. Furthermore, the virus outbreaks of the past, such as COVID-19, Ebola, Zika virus, and flu, just to name a few, were associated with various works related to the analysis of the multimodal components of Tweets to infer the different characteristics of conversations on Twitter related to these respective outbreaks. The ongoing outbreak of the monkeypox virus, declared a Global Public Health Emergency (GPHE) by the World Health Organization (WHO), has resulted in a surge of conversations about this outbreak on Twitter, which is resulting in the generation of tremendous amounts of Big Data. There has been no prior work in this field thus far that has focused on mining such conversations to develop a Twitter dataset. Furthermore, no prior work has focused on performing a comprehensive analysis of Tweets about this ongoing outbreak. To address these challenges, this work makes three scientific contributions to this field. First, it presents an open-access dataset of 556,427 Tweets about monkeypox that have been posted on Twitter since the first detected case of this outbreak. A comparative study is also presented that compares this dataset with 36 prior works in this field that focused on the development of Twitter datasets to further uphold the novelty, relevance, and usefulness of this dataset. Second, the paper reports the results of a comprehensive analysis of the Tweets of this dataset. This analysis presents several novel findings; for instance, out of all the 34 languages supported by Twitter, English has been the most used language to post Tweets about monkeypox, about 40,000 Tweets related to monkeypox were posted on the day WHO declared monkeypox as a GPHE, a total of 5470 distinct hashtags have been used on Twitter about this outbreak out of which #monkeypox is the most used hashtag, and Twitter for iPhone has been the leading source of Tweets about the outbreak. The sentiment analysis of the Tweets was also performed, and the results show that despite a lot of discussions, debate, opinions, information, and misinformation, on Twitter on various topics in this regard, such as monkeypox and the LGBTQI+ community, monkeypox and COVID-19, vaccines for monkeypox, etc., "neutral" sentiment was present in most of the Tweets. It was followed by "negative" and "positive" sentiments, respectively. Finally, to support research and development in this field, the paper presents a list of 50 open research questions related to the outbreak in the areas of Big Data, Data Mining, Natural Language Processing, and Machine Learning that may be investigated based on this dataset.
Collapse
Affiliation(s)
- Nirmalya Thakur
- Department of Computer Science, Emory University, Atlanta, GA 30322, USA
| |
Collapse
|
9
|
Li M, Hua Y, Liao Y, Zhou L, Li X, Wang L, Yang J. Tracking the Impact of COVID-19 and Lockdown Policies on Public Mental Health Using Social Media: Infoveillance Study. J Med Internet Res 2022; 24:e39676. [PMID: 36191167 PMCID: PMC9566822 DOI: 10.2196/39676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Revised: 07/21/2022] [Accepted: 09/30/2022] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND The COVID-19 pandemic and its corresponding preventive and control measures have increased the mental burden on the public. Understanding and tracking changes in public mental status can facilitate optimizing public mental health intervention and control strategies. OBJECTIVE This study aimed to build a social media-based pipeline that tracks public mental changes and use it to understand public mental health status regarding the pandemic. METHODS This study used COVID-19-related tweets posted from February 2020 to April 2022. The tweets were downloaded using unique identifiers through the Twitter application programming interface. We created a lexicon of 4 mental health problems (depression, anxiety, insomnia, and addiction) to identify mental health-related tweets and developed a dictionary for identifying health care workers. We analyzed temporal and geographic distributions of public mental health status during the pandemic and further compared distributions among health care workers versus the general public, supplemented by topic modeling on their underlying foci. Finally, we used interrupted time series analysis to examine the statewide impact of a lockdown policy on public mental health in 12 states. RESULTS We extracted 4,213,005 tweets related to mental health and COVID-19 from 2,316,817 users. Of these tweets, 2,161,357 (51.3%) were related to "depression," whereas 1,923,635 (45.66%), 225,205 (5.35%), and 150,006 (3.56%) were related to "anxiety," "insomnia," and "addiction," respectively. Compared to the general public, health care workers had higher risks of all 4 types of problems (all P<.001), and they were more concerned about clinical topics than everyday issues (eg, "students' pressure," "panic buying," and "fuel problems") than the general public. Finally, the lockdown policy had significant associations with public mental health in 4 out of the 12 states we studied, among which Pennsylvania showed a positive association, whereas Michigan, North Carolina, and Ohio showed the opposite (all P<.05). CONCLUSIONS The impact of COVID-19 and the corresponding control measures on the public's mental status is dynamic and shows variability among different cohorts regarding disease types, occupations, and regional groups. Health agencies and policy makers should primarily focus on depression (reported by 51.3% of the tweets) and insomnia (which has had an ever-increasing trend since the beginning of the pandemic), especially among health care workers. Our pipeline timely tracks and analyzes public mental health changes, especially when primary studies and large-scale surveys are difficult to conduct.
Collapse
Affiliation(s)
- Minghui Li
- Department of Big Data in Health Science School of Public Health, Center of Clinical Big Data and Analytics of The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
- The Key Laboratory of Intelligent Preventive Medicine of Zhejiang Province, Hangzhou, China
| | - Yining Hua
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States
- Division of General Internal Medicine and Primary Care, Department of Medicine, Brigham and Women's Hospital, Boston, MA, United States
| | - Yanhui Liao
- Department of Psychiatry, Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Hangzhou, China
| | - Li Zhou
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States
- Division of General Internal Medicine and Primary Care, Department of Medicine, Brigham and Women's Hospital, Boston, MA, United States
| | - Xue Li
- Department of Big Data in Health Science School of Public Health, Center of Clinical Big Data and Analytics of The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
- The Key Laboratory of Intelligent Preventive Medicine of Zhejiang Province, Hangzhou, China
| | - Ling Wang
- Florence Nightingale Faculty of Nursing, Midwifery & Palliative Care, King's College London, London, United Kingdom
| | - Jie Yang
- Department of Big Data in Health Science School of Public Health, Center of Clinical Big Data and Analytics of The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
- The Key Laboratory of Intelligent Preventive Medicine of Zhejiang Province, Hangzhou, China
| |
Collapse
|