1
|
Wang C, Bai YX, Li XW, Lin LT. Effects of extreme temperatures on public sentiment in 49 Chinese cities. Sci Rep 2024; 14:9954. [PMID: 38688992 PMCID: PMC11061318 DOI: 10.1038/s41598-024-60804-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 04/26/2024] [Indexed: 05/02/2024] Open
Abstract
The rising sentiment challenges of the metropolitan residents may be attributed to the extreme temperatures. However, nationwide real-time empirical studies that examine this claim are rare. In this research, we construct a daily extreme temperature index and sentiment metric using geotagged posts on one of China's largest social media sites, Weibo, to verify this hypothesis. We find that extreme temperatures causally decrease individuals' sentiment, and extremely low temperature may decrease more than extremely high temperature. Heterogeneity analyses reveal that individuals living in high levels of PM2.5, existing new COVID-19 diagnoses and low-disposable income cities on workdays are more vulnerable to the impact of extreme temperatures on sentiment. More importantly, the results also demonstrate that the adverse effects of extremely low temperatures on sentiment are more minor for people living in northern cities with breezes. Finally, we estimate that with a one-standard increase of extremely high (low) temperature, the sentiment decreases by approximately 0.161 (0.272) units. Employing social media to monitor public sentiment can assist policymakers in developing data-driven and evidence-based policies to alleviate the adverse impacts of extreme temperatures.
Collapse
Affiliation(s)
- Chan Wang
- School of Economics, Guangdong University of Finance and Economics, Guangzhou, 510320, People's Republic of China
| | - Yi-Xiang Bai
- School of Economics, Guangdong University of Finance and Economics, Guangzhou, 510320, People's Republic of China.
| | - Xin-Wu Li
- School of Economics, Nankai University, Tianjin, 300071, People's Republic of China
| | - Lu-Tong Lin
- School of Economics, Guangdong University of Finance and Economics, Guangzhou, 510320, People's Republic of China
- School of Economics, Nankai University, Tianjin, 300071, People's Republic of China
| |
Collapse
|
2
|
McNeill E, Lindenfeld Z, Mostafa L, Zein D, Silver D, Pagán J, Weeks WB, Aerts A, Des Rosiers S, Boch J, Chang JE. Uses of Social Determinants of Health Data to Address Cardiovascular Disease and Health Equity: A Scoping Review. J Am Heart Assoc 2023; 12:e030571. [PMID: 37929716 PMCID: PMC10727404 DOI: 10.1161/jaha.123.030571] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 09/06/2023] [Indexed: 11/07/2023]
Abstract
Background Cardiovascular disease is the leading cause of morbidity and mortality worldwide. Prior research suggests that social determinants of health have a compounding effect on health and are associated with cardiovascular disease. This scoping review explores what and how social determinants of health data are being used to address cardiovascular disease and improve health equity. Methods and Results After removing duplicate citations, the initial search yielded 4110 articles for screening, and 50 studies were identified for data extraction. Most studies relied on similar data sources for social determinants of health, including geocoded electronic health record data, national survey responses, and census data, and largely focused on health care access and quality, and the neighborhood and built environment. Most focused on developing interventions to improve health care access and quality or characterizing neighborhood risk and individual risk. Conclusions Given that few interventions addressed economic stability, education access and quality, or community context and social risk, the potential for harnessing social determinants of health data to reduce the burden of cardiovascular disease remains unrealized.
Collapse
Affiliation(s)
- Elizabeth McNeill
- Department of Public Health Policy and ManagementNew York University School of Global Public HealthNew YorkNYUSA
| | - Zoe Lindenfeld
- Department of Public Health Policy and ManagementNew York University School of Global Public HealthNew YorkNYUSA
| | - Logina Mostafa
- Department of Public Health Policy and ManagementNew York University School of Global Public HealthNew YorkNYUSA
| | - Dina Zein
- Department of Public Health Policy and ManagementNew York University School of Global Public HealthNew YorkNYUSA
| | - Diana Silver
- Department of Public Health Policy and ManagementNew York University School of Global Public HealthNew YorkNYUSA
| | - José Pagán
- Department of Public Health Policy and ManagementNew York University School of Global Public HealthNew YorkNYUSA
| | - William B. Weeks
- Microsoft Corporation, Precision Population Health, Microsoft ResearchRedmondWAUSA
| | - Ann Aerts
- The Novartis FoundationBaselSwitzerland
| | | | | | - Ji Eun Chang
- Department of Public Health Policy and ManagementNew York University School of Global Public HealthNew YorkNYUSA
| |
Collapse
|
3
|
Giorgi S, Eichstaedt JC, Preoţiuc-Pietro D, Gardner JR, Schwartz HA, Ungar LH. Filling in the white space: Spatial interpolation with Gaussian processes and social media data. CURRENT RESEARCH IN ECOLOGICAL AND SOCIAL PSYCHOLOGY 2023; 5:100159. [PMID: 38125747 PMCID: PMC10732585 DOI: 10.1016/j.cresp.2023.100159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
Full national coverage below the state level is difficult to attain through survey-based data collection. Even the largest survey-based data collections, such as the CDC's Behavioral Risk Factor Surveillance System or the Gallup-Healthways Well-being Index (both with more than 300,000 responses p.a.) only allow for the estimation of annual averages for about 260 out of roughly U.S. 3,000 counties when a threshold of 300 responses per county is used. Using a relatively high threshold of 300 responses gives substantially higher convergent validity-higher correlations with health variables-than lower thresholds but covers a reduced and biased sample of the population. We present principled methods to interpolate spatial estimates and show that including large-scale geotagged social media data can increase interpolation accuracy. In this work, we focus on Gallup-reported life satisfaction, a widely-used measure of subjective well-being. We use Gaussian Processes (GP), a formal Bayesian model, to interpolate life satisfaction, which we optimally combine with estimates from low-count data. We interpolate over several spaces (geographic and socioeconomic) and extend these evaluations to the space created by variables encoding language frequencies of approximately 6 million geotagged Twitter users. We find that Twitter language use can serve as a rough aggregate measure of socioeconomic and cultural similarity, and improves upon estimates derived from a wide variety of socioeconomic, demographic, and geographic similarity measures. We show that applying Gaussian Processes to the limited Gallup data allows us to generate estimates for a much larger number of counties while maintaining the same level of convergent validity with external criteria (i.e., N = 1,133 vs. 2,954 counties). This work suggests that spatial coverage of psychological variables can be reliably extended through Bayesian techniques while maintaining out-of-sample prediction accuracy and that Twitter language adds important information about cultural similarity over and above traditional socio-demographic and geographic similarity measures. Finally, to facilitate the adoption of these methods, we have also open-sourced an online tool that researchers can freely use to interpolate their data across geographies.
Collapse
Affiliation(s)
- Salvatore Giorgi
- Department of Computer and Information Science, University of Pennsylvania, United States of America
| | - Johannes C. Eichstaedt
- Department of Psychology & Institute for Human-Centered AI, Stanford University, United States of America
| | | | - Jacob R. Gardner
- Department of Computer and Information Science, University of Pennsylvania, United States of America
| | - H. Andrew Schwartz
- Department of Computer Science, Stony Brook University, United States of America
| | - Lyle H. Ungar
- Department of Computer and Information Science, University of Pennsylvania, United States of America
| |
Collapse
|
4
|
Giorgi S, Yaden DB, Eichstaedt JC, Ungar LH, Schwartz HA, Kwarteng A, Curtis B. Predicting U.S. county opioid poisoning mortality from multi-modal social media and psychological self-report data. Sci Rep 2023; 13:9027. [PMID: 37270657 DOI: 10.1038/s41598-023-34468-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 04/30/2023] [Indexed: 06/05/2023] Open
Abstract
Opioid poisoning mortality is a substantial public health crisis in the United States, with opioids involved in approximately 75% of the nearly 1 million drug related deaths since 1999. Research suggests that the epidemic is driven by both over-prescribing and social and psychological determinants such as economic stability, hopelessness, and isolation. Hindering this research is a lack of measurements of these social and psychological constructs at fine-grained spatial and temporal resolutions. To address this issue, we use a multi-modal data set consisting of natural language from Twitter, psychometric self-reports of depression and well-being, and traditional area-based measures of socio-demographics and health-related risk factors. Unlike previous work using social media data, we do not rely on opioid or substance related keywords to track community poisonings. Instead, we leverage a large, open vocabulary of thousands of words in order to fully characterize communities suffering from opioid poisoning, using a sample of 1.5 billion tweets from 6 million U.S. county mapped Twitter users. Results show that Twitter language predicted opioid poisoning mortality better than factors relating to socio-demographics, access to healthcare, physical pain, and psychological well-being. Additionally, risk factors revealed by the Twitter language analysis included negative emotions, discussions of long work hours, and boredom, whereas protective factors included resilience, travel/leisure, and positive emotions, dovetailing with results from the psychometric self-report data. The results show that natural language from public social media can be used as a surveillance tool for both predicting community opioid poisonings and understanding the dynamic social and psychological nature of the epidemic.
Collapse
Affiliation(s)
- Salvatore Giorgi
- National Institute on Drug Abuse, Intramural Research Program, Baltimore, MD, USA
- Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA, USA
| | - David B Yaden
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Johannes C Eichstaedt
- Department of Psychology, Stanford University, Stanford, CA, USA
- Institute for Human-Centered AI, Stanford University, Stanford, CA, USA
| | - Lyle H Ungar
- Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA, USA
| | - H Andrew Schwartz
- Department of Computer Science, Stony Brook University, Stony Brook, NY, USA
| | - Amy Kwarteng
- National Institute on Drug Abuse, Intramural Research Program, Baltimore, MD, USA
| | - Brenda Curtis
- National Institute on Drug Abuse, Intramural Research Program, Baltimore, MD, USA.
| |
Collapse
|
5
|
Jabalameli S, Xu Y, Shetty S. Spatial and sentiment analysis of public opinion toward COVID-19 pandemic using twitter data: At the early stage of vaccination. INTERNATIONAL JOURNAL OF DISASTER RISK REDUCTION : IJDRR 2022; 80:103204. [PMID: 35935613 PMCID: PMC9341165 DOI: 10.1016/j.ijdrr.2022.103204] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Revised: 06/26/2022] [Accepted: 07/19/2022] [Indexed: 05/29/2023]
Abstract
During the crisis of Coronavirus pandemic, social media, like Twitter, have been the platforms on which people have been able to share their opinions and obtain information. The present study provides a detailed spatial-temporal analysis of the Twitter online discourse (approximately 280 thousand tweets) in Ohio and Michigan at the early stage of vaccination rollout (January 2021, till March 2021). This work aims to explore how people were feeling about the pandemic, the most frequent topics people were talking about, and how the topics spatially were distributed. Moreover, state government responses and important news were gathered to analyze their impacts on public opinion based on the temporal analysis of the tweets. In this project, Natural Language Processing using the LDA method was employed to identify 11 topics and 8 sub-topics in the Twitter data. The temporal analysis of topics shows the sensitivity of the online discourse to the significant state news and the local government's reactions to the pandemic. Moreover, the spatial distribution of Coronavirus-related tweets and sentiments demonstrates concentrations in the more populated urban areas with a high rate of COVID-19 cases in Ohio and Michigan. The government's economic and financial policies taken during this time, the vaccination timeline phases specified by each state, and the pandemic-related information can contribute to public opinion and sentiment trends. The findings of this study can help explore public demands, and reactions, follow the impacts of the local authorities' policies at the county level and manage their future responses to such a pandemic.
Collapse
Affiliation(s)
- Shaghayegh Jabalameli
- Department of Geography and Planning, The University of Toledo, Toledo, OH, 43606, USA
| | - Yanqing Xu
- School of Remote Sensing and Information Engineering, Wuhan University, Wuhan, Hubei, 430079, China
| | - Sujata Shetty
- Department of Geography and Planning, The University of Toledo, Toledo, OH, 43606, USA
| |
Collapse
|
6
|
Hoque Tania M, Hossain MR, Jahanara N, Andreev I, Clifton DA. Thinking Aloud or Screaming Inside: Exploratory Study of Sentiment Around Work. JMIR Form Res 2022; 6:e30113. [PMID: 36178712 PMCID: PMC9568814 DOI: 10.2196/30113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Revised: 07/03/2022] [Accepted: 08/10/2022] [Indexed: 11/30/2022] Open
Abstract
Background Millions of workers experience work-related ill health every year. The loss of working days often accounts for poor well-being because of discomfort and stress caused by the workplace. The ongoing pandemic and postpandemic shift in socioeconomic and work culture can continue to contribute to adverse work-related sentiments. Critically investigating state-of-the-art technologies, this study identifies the research gaps in recognizing workers’ need for well-being support, and we aspire to understand how such evidence can be collected to transform the workforce and workplace. Objective Building on recent advances in sentiment analysis, this study aims to closely examine the potential of social media as a tool to assess workers’ emotions toward the workplace. Methods This study collected a large Twitter data set comprising both pandemic and prepandemic tweets facilitated through a human-in-the-loop approach in combination with unsupervised learning and meta-heuristic optimization algorithms. The raw data preprocessed through natural language processing techniques were assessed using a generative statistical model and a lexicon-assisted rule-based model, mapping lexical features to emotion intensities. This study also assigned human annotations and performed work-related sentiment analysis. Results A mixed methods approach, including topic modeling using latent Dirichlet allocation, identified the top topics from the corpus to understand how Twitter users engage with discussions on work-related sentiments. The sorted aspects were portrayed through overlapped clusters and low intertopic distances. However, further analysis comprising the Valence Aware Dictionary for Sentiment Reasoner suggested a smaller number of negative polarities among diverse subjects. By contrast, the human-annotated data set created for this study contained more negative sentiments. In this study, sentimental juxtaposition revealed through the labeled data set was supported by the n-gram analysis as well. Conclusions The developed data set demonstrates that work-related sentiments are projected onto social media, which offers an opportunity to better support workers. The infrastructure of the workplace, the nature of the work, the culture within the industry and the particular organization, employers, colleagues, person-specific habits, and upbringing all play a part in the health and well-being of any working adult who contributes to the productivity of the organization. Therefore, understanding the origin and influence of the complex underlying factors both qualitatively and quantitatively can inform the next generation of workplaces to drive positive change by relying on empirically grounded evidence. Therefore, this study outlines a comprehensive approach to capture deeper insights into work-related health.
Collapse
Affiliation(s)
- Marzia Hoque Tania
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, United Kingdom
| | - Md Razon Hossain
- School of Information System, Queensland University of Technology, Brisbane, Australia
| | - Nuzhat Jahanara
- Department of Psychology, University of Dhaka, Dhaka, Bangladesh
| | - Ilya Andreev
- School of Engineering and the Built Environment, Anglia Ruskin University, Cambridge, United Kingdom
| | - David A Clifton
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, United Kingdom
- Oxford Centre for Advanced Research (OSCAR), University of Oxford, Suzhou, China
| |
Collapse
|
7
|
Golder S, Stevens R, O'Connor K, James R, Gonzalez-Hernandez G. Methods to Establish Race or Ethnicity of Twitter Users: Scoping Review. J Med Internet Res 2022; 24:e35788. [PMID: 35486433 PMCID: PMC9107046 DOI: 10.2196/35788] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Revised: 03/08/2022] [Accepted: 03/23/2022] [Indexed: 11/13/2022] Open
Abstract
Background A growing amount of health research uses social media data. Those critical of social media research often cite that it may be unrepresentative of the population; however, the suitability of social media data in digital epidemiology is more nuanced. Identifying the demographics of social media users can help establish representativeness. Objective This study aims to identify the different approaches or combination of approaches to extract race or ethnicity from social media and report on the challenges of using these methods. Methods We present a scoping review to identify methods used to extract the race or ethnicity of Twitter users from Twitter data sets. We searched 17 electronic databases from the date of inception to May 15, 2021, and carried out reference checking and hand searching to identify relevant studies. Sifting of each record was performed independently by at least two researchers, with any disagreement discussed. Studies were required to extract the race or ethnicity of Twitter users using either manual or computational methods or a combination of both. Results Of the 1249 records sifted, we identified 67 (5.36%) that met our inclusion criteria. Most studies (51/67, 76%) have focused on US-based users and English language tweets (52/67, 78%). A range of data was used, including Twitter profile metadata, such as names, pictures, information from bios (including self-declarations), or location or content of the tweets. A range of methodologies was used, including manual inference, linkage to census data, commercial software, language or dialect recognition, or machine learning or natural language processing. However, not all studies have evaluated these methods. Those that evaluated these methods found accuracy to vary from 45% to 93% with significantly lower accuracy in identifying categories of people of color. The inference of race or ethnicity raises important ethical questions, which can be exacerbated by the data and methods used. The comparative accuracies of the different methods are also largely unknown. Conclusions There is no standard accepted approach or current guidelines for extracting or inferring the race or ethnicity of Twitter users. Social media researchers must carefully interpret race or ethnicity and not overpromise what can be achieved, as even manual screening is a subjective, imperfect method. Future research should establish the accuracy of methods to inform evidence-based best practice guidelines for social media researchers and be guided by concerns of equity and social justice.
Collapse
Affiliation(s)
- Su Golder
- Department of Health Sciences, University of York, York, United Kingdom
| | - Robin Stevens
- School of Communication and Journalism, University of Southern California, Los Angeles, CA, United States
| | - Karen O'Connor
- Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Richard James
- School of Nursing Liaison and Clinical Outreach Coordinator, University of Pennsylvania, Philadelphia, PA, United States
| | - Graciela Gonzalez-Hernandez
- Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| |
Collapse
|
8
|
Schwartz AJ, Dodds PS, O’Neil-Dunne JPM, Ricketts TH, Danforth CM. Gauging the happiness benefit of US urban parks through Twitter. PLoS One 2022; 17:e0261056. [PMID: 35353831 PMCID: PMC8967001 DOI: 10.1371/journal.pone.0261056] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Accepted: 11/23/2021] [Indexed: 11/18/2022] Open
Abstract
The relationship between nature contact and mental well-being has received increasing attention in recent years. While a body of evidence has accumulated demonstrating a positive relationship between time in nature and mental well-being, there have been few studies comparing this relationship in different locations over long periods of time. In this study, we analyze over 1.5 million tweets to estimate a happiness benefit, the difference in expressed happiness between in- and out-of-park tweets, for the 25 largest cities in the US by population. People write happier words during park visits when compared with non-park user tweets collected around the same time. While the words people write are happier in parks on average and in most cities, we find considerable variation across cities. Tweets are happier in parks at all times of the day, week, and year, not just during the weekend or summer vacation. Across all cities, we find that the happiness benefit is highest in parks larger than 100 acres. Overall, our study suggests the happiness benefit associated with park visitation is on par with US holidays such as Thanksgiving and New Year’s Day.
Collapse
Affiliation(s)
- Aaron J. Schwartz
- Ecology & Evolutionary Biology, University of Colorado, Boulder, Colorado, United States of America
- Gund Institute for Environment, University of Vermont, Burlington, Vermont, United States of America
- Vermont Complex Systems Center, University of Vermont, Burlington, Vermont, United States of America
- Computational Story Lab, MassMutual Center of Excellence for Complex Systems and Data Science, Vermont Advanced Computing Core, University of Vermont, Burlington, Vermont, United States of America
- Rubenstein School of Environment and Natural Resources, University of Vermont, Burlington, Vermont, United States of America
- * E-mail:
| | - Peter Sheridan Dodds
- Vermont Complex Systems Center, University of Vermont, Burlington, Vermont, United States of America
- Computational Story Lab, MassMutual Center of Excellence for Complex Systems and Data Science, Vermont Advanced Computing Core, University of Vermont, Burlington, Vermont, United States of America
- Department of Computer Science, University of Vermont, Burlington, Vermont, United States of America
| | - Jarlath P. M. O’Neil-Dunne
- Gund Institute for Environment, University of Vermont, Burlington, Vermont, United States of America
- Vermont Complex Systems Center, University of Vermont, Burlington, Vermont, United States of America
- Rubenstein School of Environment and Natural Resources, University of Vermont, Burlington, Vermont, United States of America
| | - Taylor H. Ricketts
- Gund Institute for Environment, University of Vermont, Burlington, Vermont, United States of America
- Rubenstein School of Environment and Natural Resources, University of Vermont, Burlington, Vermont, United States of America
| | - Christopher M. Danforth
- Gund Institute for Environment, University of Vermont, Burlington, Vermont, United States of America
- Vermont Complex Systems Center, University of Vermont, Burlington, Vermont, United States of America
- Computational Story Lab, MassMutual Center of Excellence for Complex Systems and Data Science, Vermont Advanced Computing Core, University of Vermont, Burlington, Vermont, United States of America
- Department of Mathematics & Statistics, University of Vermont, Burlington, Vermont, United States of America
| |
Collapse
|
9
|
Bour C, Ahne A, Schmitz S, Perchoux C, Dessenne C, Fagherazzi G. The Use of Social Media for Health Research Purposes: Scoping Review. J Med Internet Res 2021; 23:e25736. [PMID: 34042593 PMCID: PMC8193478 DOI: 10.2196/25736] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2020] [Revised: 01/15/2021] [Accepted: 03/18/2021] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND As social media are increasingly used worldwide, more and more scientists are relying on them for their health-related projects. However, social media features, methodologies, and ethical issues are unclear so far because, to our knowledge, there has been no overview of this relatively young field of research. OBJECTIVE This scoping review aimed to provide an evidence map of the different uses of social media for health research purposes, their fields of application, and their analysis methods. METHODS We followed the scoping review methodologies developed by Arksey and O'Malley and the Joanna Briggs Institute. After developing search strategies based on keywords (eg, social media, health research), comprehensive searches were conducted in the PubMed/MEDLINE and Web of Science databases. We limited the search strategies to documents written in English and published between January 1, 2005, and April 9, 2020. After removing duplicates, articles were screened at the title and abstract level and at the full text level by two independent reviewers. One reviewer extracted data, which were descriptively analyzed to map the available evidence. RESULTS After screening 1237 titles and abstracts and 407 full texts, 268 unique papers were included, dating from 2009 to 2020 with an average annual growth rate of 32.71% for the 2009-2019 period. Studies mainly came from the Americas (173/268, 64.6%, including 151 from the United States). Articles used machine learning or data mining techniques (60/268) to analyze the data, discussed opportunities and limitations of the use of social media for research (59/268), assessed the feasibility of recruitment strategies (45/268), or discussed ethical issues (16/268). Communicable (eg, influenza, 40/268) and then chronic (eg, cancer, 24/268) diseases were the two main areas of interest. CONCLUSIONS Since their early days, social media have been recognized as resources with high potential for health research purposes, yet the field is still suffering from strong heterogeneity in the methodologies used, which prevents the research from being compared and generalized. For the field to be fully recognized as a valid, complementary approach to more traditional health research study designs, there is now a need for more guidance by types of applications of social media for health research, both from a methodological and an ethical perspective. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID) RR2-10.1136/bmjopen-2020-040671.
Collapse
Affiliation(s)
- Charline Bour
- Department of Population Health, Luxembourg Institute of Health, Strassen, Luxembourg
| | - Adrian Ahne
- Inserm U1018, Center for Research in Epidemiology and Population Health (CESP), Paris Saclay University, Villejuif, France.,Epiconcept, Paris, France
| | - Susanne Schmitz
- Competence Centre for Methodology and Statistics, Luxembourg Institute of Health, Strassen, Luxembourg
| | - Camille Perchoux
- Luxembourg Institute of Socio-Economic Research, Esch/Alzette, Luxembourg
| | - Coralie Dessenne
- Department of Population Health, Luxembourg Institute of Health, Strassen, Luxembourg
| | - Guy Fagherazzi
- Department of Population Health, Luxembourg Institute of Health, Strassen, Luxembourg
| |
Collapse
|
10
|
Zhou J, Yang S, Xiao C, Chen F. Examination of Community Sentiment Dynamics due to COVID-19 Pandemic: A Case Study from a State in Australia. ACTA ACUST UNITED AC 2021; 2:201. [PMID: 33851137 PMCID: PMC8034046 DOI: 10.1007/s42979-021-00596-7] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Accepted: 03/17/2021] [Indexed: 11/24/2022]
Abstract
The outbreak of the novel Coronavirus Disease 2019 (COVID-19) has caused unprecedented impacts to people's daily life around the world. Various measures and policies such as lockdown and social-distancing are implemented by governments to combat the disease during the pandemic period. These measures and policies as well as virus itself may cause different mental health issues to people such as depression, anxiety, sadness, etc. In this paper, we exploit the massive text data posted by Twitter users to analyse the sentiment dynamics of people living in the state of New South Wales (NSW) in Australia during the pandemic period. Different from the existing work that mostly focuses on the country-level and static sentiment analysis, we analyse the sentiment dynamics at the fine-grained local government areas (LGAs). Based on the analysis of around 94 million tweets that posted by around 183 thousand users located at different LGAs in NSW in 5 months, we found that people in NSW showed an overall positive sentimental polarity and the COVID-19 pandemic decreased the overall positive sentimental polarity during the pandemic period. The fine-grained analysis of sentiment in LGAs found that despite the dominant positive sentiment most of days during the study period, some LGAs experienced significant sentiment changes from positive to negative. This study also analysed the sentimental dynamics delivered by the hot topics in Twitter such as government policies (e.g. the Australia's JobKeeper program, lockdown, social-distancing) as well as the focused social events (e.g. the Ruby Princess Cruise). The results showed that the policies and events did affect people's overall sentiment, and they affected people's overall sentiment differently at different stages.
Collapse
Affiliation(s)
- Jianlong Zhou
- Data Science Institute, University of Technology Sydney, Sydney, Australia
| | - Shuiqiao Yang
- Data Science Institute, University of Technology Sydney, Sydney, Australia
| | - Chun Xiao
- Faculty of Transdisciplinary Innovation, University of Technology Sydney, Sydney, Australia
| | - Fang Chen
- Data Science Institute, University of Technology Sydney, Sydney, Australia
| |
Collapse
|
11
|
Rahman MM, Ali GN, Li XJ, Samuel J, Paul KC, Chong PH, Yakubov M. Socioeconomic factors analysis for COVID-19 US reopening sentiment with Twitter and census data. Heliyon 2021; 7:e06200. [PMID: 33585707 PMCID: PMC7867397 DOI: 10.1016/j.heliyon.2021.e06200] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2020] [Revised: 09/19/2020] [Accepted: 02/01/2021] [Indexed: 02/06/2023] Open
Abstract
Investigating and classifying sentiments of social media users (e.g., positive, negative) towards an item, situation, and system are very popular among researchers. However, they rarely discuss the underlying socioeconomic factor associations for such sentiments. This study attempts to explore the factors associated with positive and negative sentiments of the people about reopening the economy, in the United States (US) amidst the COVID-19 global crisis. It takes into consideration the situational uncertainties (i.e., changes in work and travel patterns due to lockdown policies), economic downturn and associated trauma, and emotional factors such as depression. To understand the sentiment of the people about the reopening economy, Twitter data was collected, representing the 50 States of the US and Washington D.C, the capital city of the US. State-wide socioeconomic characteristics of the people (e.g., education, income, family size, and employment status), built environment data (e.g., population density), and the number of COVID-19 related cases were collected and integrated with Twitter data to perform the analysis. A binary logit model was used to identify the factors that influence people toward a positive or negative sentiment. The results from the logit model demonstrate that family households, people with low education levels, people in the labor force, low-income people, and people with higher house rent are more interested in reopening the economy. In contrast, households with a high number of family members and high income are less interested in reopening the economy. The accuracy of the model is reasonable (i.e., the model can correctly classify 56.18% of the sentiments). The Pearson chi-squared test indicates that this model has high goodness-of-fit. This study provides clear insights for public and corporate policymakers on potential areas to allocate resources, and directional guidance on potential policy options they can undertake to improve socioeconomic conditions, to mitigate the impact of pandemic in the current situation, and in the future as well.
Collapse
Affiliation(s)
- Md. Mokhlesur Rahman
- University of North Carolina at Charlotte, NC 28223, USA
- Khulna University of Engineering & Technology (KUET), Khulna 9203, Bangladesh
| | | | - Xue Jun Li
- Auckland University of Technology, Auckland 1010, New Zealand
| | | | | | | | | |
Collapse
|
12
|
Camacho-Rivera M, Vo H, Huang X, Lau J, Lawal A, Kawaguchi A. Evaluating Asthma Mobile Apps to Improve Asthma Self-Management: User Ratings and Sentiment Analysis of Publicly Available Apps. JMIR Mhealth Uhealth 2020; 8:e15076. [PMID: 33118944 PMCID: PMC7661227 DOI: 10.2196/15076] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Revised: 07/10/2020] [Accepted: 08/03/2020] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND The development and use of mobile health (mHealth) apps for asthma management have risen dramatically over the past two decades. Asthma apps vary widely in their content and features; however, prior research has rarely examined preferences of users of publicly available apps. OBJECTIVE The goals of this study were to provide a descriptive overview of asthma mobile apps that are publicly available and to assess the usability of asthma apps currently available on the market to identify content and features of apps associated with positive and negative user ratings. METHODS Reviews were collected on June 23, 2020, and included publicly posted reviews until June 21, 2020. To characterize features associated with high or low app ratings, we first dichotomized the average user rating of the asthma app into 2 categories: a high average rating and a low average rating. Asthma apps with average ratings of 4 and above were categorized as having a high average rating. Asthma apps with average ratings of less than 4 were categorized as having a low average rating. For the sentiment analysis, we modeled both 2-word (bi-gram) and 3-word (tri-gram) phrases which commonly appeared across highly rated and lowly rated apps. RESULTS Of the 10 apps that met the inclusion criteria, a total of 373 reviews were examined across all apps. Among apps reviewed, 53.4% (199/373) received high ratings (average ratings of 4 or 5) and 47.2% (176/373) received low ratings (average ratings of 3 or less). The number of ratings across all apps ranged from 188 (AsthmaMD) to 10 (My Asthma App); 30% (3/10) of apps were available on both Android and iOS. From the sentiment analysis, key features of asthma management that were common among highly rated apps included the tracking of peak flow readings (n=48), asthma symptom monitoring (n=11), and action plans (n=10). Key features related to functionality that were common among highly rated apps included ease of use (n=5). Users most commonly reported loss of data (n=14) and crashing of app (n=12) as functionality issues among poorly rated asthma apps. CONCLUSIONS Our study results demonstrate that asthma app quality, maintenance, and updates vary widely across apps and platforms. These findings may call into question the long-term engagement with asthma apps, a crucial factor for determining their potential to improve asthma self-management and asthma clinical outcomes.
Collapse
Affiliation(s)
- Marlene Camacho-Rivera
- Department of Community Health Sciences, SUNY Downstate Health Sciences University, Brooklyn, NY, United States
| | - Huy Vo
- Department of Computer Science, Grove School of Engineering, City College of New York, New York, NY, United States
| | - Xueqi Huang
- Department of Computer Science, Grove School of Engineering, City College of New York, New York, NY, United States
| | - Julia Lau
- Department of Computer Science, Grove School of Engineering, City College of New York, New York, NY, United States
| | - Adeola Lawal
- Department of Community Health and Social Medicine, CUNY School of Medicine, New York, NY, United States
| | - Akira Kawaguchi
- Department of Computer Science, Grove School of Engineering, City College of New York, New York, NY, United States
| |
Collapse
|
13
|
Nguyen H, Nguyen T, Nguyen DT. A graph-based approach for population health analysis using Geo-tagged tweets. MULTIMEDIA TOOLS AND APPLICATIONS 2020; 80:7187-7204. [PMID: 33132740 PMCID: PMC7585996 DOI: 10.1007/s11042-020-10034-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Revised: 08/13/2020] [Accepted: 10/06/2020] [Indexed: 06/11/2023]
Abstract
We propose in this work a graph-based approach for automatic public health analysis using social media. In our approach, graphs are created to model the interactions between features and between tweets in social media. We investigated different graph properties and methods in constructing graph-based representations for population health analysis. The proposed approach is applied in two case studies: (1) estimating health indices, and (2) classifying health situation of counties in the US. We evaluate our approach on a dataset including more than one billion tweets collected in three years 2014, 2015, and 2016, and the health surveys from the Behavioral Risk Factor Surveillance System. We conducted realistic and large-scale experiments on various textual features and graph-based representations. Experimental results verified the robustness of the proposed approach and its superiority over existing ones in both case studies, confirming the potential of graph-based approach for modeling interactions in social networks for population health analysis.
Collapse
Affiliation(s)
- Hung Nguyen
- Faculty of IT, Nha Trang University, Nha Trang, Vietnam
| | - Thin Nguyen
- Applied Artificial Intelligence Institute, Deakin University, Geelong, VIC 3220 Australia
| | - Duc Thanh Nguyen
- School of Information Technology, Deakin University, Geelong, VIC 3220 Australia
| |
Collapse
|
14
|
Understanding Perceived Site Qualities and Experiences of Urban Public Spaces: A Case Study of Social Media Reviews in Bryant Park, New York City. SUSTAINABILITY 2020. [DOI: 10.3390/su12198036] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Urban public spaces are a key component to the well-being and prosperity of modern society. It has been increasingly important to improve the qualities and maximize the usages of urban public spaces. There is a lack of studies that investigate how people use and perceive urban parks using quantitative analysis of location-based social media reviews. This study tackles this gap by introducing a case study that uses social media reviews (Tripadivisor.com) to understand the perceived site quality and experiences of Bryant Park in New York City. A large dataset including 11,419 Tripadvisor reviews from 10,615 users was collected. LDA (Latent Dirichlet Allocation), a natural language processing and machine learning technique, was used to perform topic modeling analysis that could reveal hidden themes in large amounts of text. The results include five semantic topics and their associated topic terms. A comprehensive overview of the user experiences in Bryant Park were provided along with their weekly and monthly dynamics. The findings provide insights for future public space designers and managers by revealing how users describe the designs and operations of Bryant Park.
Collapse
|
15
|
Abstract
Objective: To assess public response to cancellations of elective surgeries following the American College of Surgeons’ (ACS) recommendation on March 13. Methods: We queried text comments from Reddit, a social media platform and the fifth most popular website in the United States. Comments were manually reviewed to assess for relevance to elective surgery in the United States during the global coronavirus outbreak, whether the text was written by a healthcare worker (HCW), whether the user was based in the United States, and whether the text documented cancellations of surgery, expected cancellations of surgery, or surgery ongoing after the ACS announcement. Analysis of overall sentiment and negativity in comment text was performed using the Valence Aware Dictionary for sEntiment Reasoning (VADER), a validated natural language processing tool previously used in studies of health behaviors using social media. Non-parametric tests were used for subgroup comparisons based on posting date and characteristics identified during manual review. Results: Following manual review, 1272 comments were included for analysis. Overall sentiment among non-HCWs became significantly more negative following the ACS announcement (P = 0.037). Overall sentiment did not significantly differ between HCWs and non-HCWs prior to the ACS announcement (P = 0.98), but non-HCW sentiment became significantly more negative than HCW sentiment after the announcement (P = 0.027). Negativity scores in posts describing cancellations were significantly higher among posts written by non-HCWs than HCWs (P = 0.028). Conclusions: Cancellation of elective surgeries had an adverse emotional impact on non-HCWs. This finding highlights the importance of access to elective surgery to patients’ emotional well-being.
Collapse
|
16
|
Jaidka K, Giorgi S, Schwartz HA, Kern ML, Ungar LH, Eichstaedt JC. Estimating geographic subjective well-being from Twitter: A comparison of dictionary and data-driven language methods. Proc Natl Acad Sci U S A 2020; 117:10165-10171. [PMID: 32341156 PMCID: PMC7229753 DOI: 10.1073/pnas.1906364117] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Researchers and policy makers worldwide are interested in measuring the subjective well-being of populations. When users post on social media, they leave behind digital traces that reflect their thoughts and feelings. Aggregation of such digital traces may make it possible to monitor well-being at large scale. However, social media-based methods need to be robust to regional effects if they are to produce reliable estimates. Using a sample of 1.53 billion geotagged English tweets, we provide a systematic evaluation of word-level and data-driven methods for text analysis for generating well-being estimates for 1,208 US counties. We compared Twitter-based county-level estimates with well-being measurements provided by the Gallup-Sharecare Well-Being Index survey through 1.73 million phone surveys. We find that word-level methods (e.g., Linguistic Inquiry and Word Count [LIWC] 2015 and Language Assessment by Mechanical Turk [LabMT]) yielded inconsistent county-level well-being measurements due to regional, cultural, and socioeconomic differences in language use. However, removing as few as three of the most frequent words led to notable improvements in well-being prediction. Data-driven methods provided robust estimates, approximating the Gallup data at up to r = 0.64. We show that the findings generalized to county socioeconomic and health outcomes and were robust when poststratifying the samples to be more representative of the general US population. Regional well-being estimation from social media data seems to be robust when supervised data-driven methods are used.
Collapse
Affiliation(s)
- Kokil Jaidka
- Department of Communications and New Media, National University of Singapore, Singapore 117416;
- Centre for Trusted Internet and Community, National University of Singapore, Singapore 117416
| | - Salvatore Giorgi
- Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA 19104
| | - H Andrew Schwartz
- Department of Computer Science, Stony Brook University, Stony Brook, NY 11794
| | - Margaret L Kern
- Melbourne Graduate School of Education, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Lyle H Ungar
- Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA 19104
| | - Johannes C Eichstaedt
- Department of Psychology, Stanford University, Stanford, CA 94305;
- Institute for Human-Centered Artificial Intelligence, Stanford University, Stanford, CA 94305
| |
Collapse
|