Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: McIver DJ, Hawkins JB, Chunara R, Chatterjee AK, Bhandari A, Fitzgerald TP, Jain SH, Brownstein JS. Characterizing Sleep Issues Using Twitter. J Med Internet Res 2015;17:e140. [PMID: 26054530 PMCID: PMC4526927 DOI: 10.2196/jmir.4476] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2015] [Revised: 04/29/2015] [Accepted: 05/24/2015] [Indexed: 11/13/2022] Open

For:	McIver DJ, Hawkins JB, Chunara R, Chatterjee AK, Bhandari A, Fitzgerald TP, Jain SH, Brownstein JS. Characterizing Sleep Issues Using Twitter. J Med Internet Res 2015;17:e140. [PMID: 26054530 PMCID: PMC4526927 DOI: 10.2196/jmir.4476] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2015] [Revised: 04/29/2015] [Accepted: 05/24/2015] [Indexed: 11/13/2022] Open

Number

Cited by Other Article(s)

Ladis I, Valladares TL, Coppersmith DDL, Glenn JJ, Nobles AL, Barnes LE, Teachman BA. Inferring sleep disturbance from text messages of suicide attempt survivors: A pilot study. Suicide Life Threat Behav 2023;53:39-53. [PMID: 36083138 PMCID: PMC9908817 DOI: 10.1111/sltb.12920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 08/13/2022] [Accepted: 08/26/2022] [Indexed: 11/30/2022]

Meyerson WU, Fineberg SK, Song YK, Faber A, Ash G, Andrade FC, Corlett P, Gerstein MB, Hoyle RH. Estimation of Bedtimes of Reddit Users: Integrated Analysis of Time Stamps and Surveys. JMIR Form Res 2023;7:e38112. [PMID: 36649054 PMCID: PMC9890352 DOI: 10.2196/38112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2022] [Revised: 11/22/2022] [Accepted: 12/06/2022] [Indexed: 01/18/2023] Open

Abstract

BACKGROUND

Individuals with later bedtimes have an increased risk of difficulties with mood and substances. To investigate the causes and consequences of late bedtimes and other sleep patterns, researchers are exploring social media as a data source. Pioneering studies inferred sleep patterns directly from social media data. While innovative, these efforts are variously unscalable, context dependent, confined to specific sleep parameters, or rest on untested assumptions, and none of the reviewed studies apply to the popular Reddit platform or release software to the research community.

OBJECTIVE

This study builds on this prior work. We estimate the bedtimes of Reddit users from the times tamps of their posts, test inference validity against survey data, and release our model as an R package (The R Foundation).

METHODS

We included 159 sufficiently active Reddit users with known time zones and known, nonanomalous bedtimes, together with the time stamps of their 2.1 million posts. The model's form was chosen by visualizing the aggregate distribution of the timing of users' posts relative to their reported bedtimes. The chosen model represents a user's frequency of Reddit posting by time of day, with a flat portion before bedtime and a quadratic depletion that begins near the user's bedtime, with parameters fitted to the data. This model estimates the bedtimes of individual Reddit users from the time stamps of their posts. Model performance is assessed through k-fold cross-validation. We then apply the model to estimate the bedtimes of 51,372 sufficiently active, nonbot Reddit users with known time zones from the time stamps of their 140 million posts.

RESULTS

The Pearson correlation between expected and observed Reddit posting frequencies in our model was 0.997 on aggregate data. On average, posting starts declining 45 minutes before bedtime, reaches a nadir 4.75 hours after bedtime that is 87% lower than the daytime rate, and returns to baseline 10.25 hours after bedtime. The Pearson correlation between inferred and reported bedtimes for individual users was 0.61 (P<.001). In 90 of 159 cases (56.6%), our estimate was within 1 hour of the reported bedtime; 128 cases (80.5%) were within 2 hours. There was equivalent accuracy in hold-out sets versus training sets of k-fold cross-validation, arguing against overfitting. The model was more accurate than a random forest approach.

CONCLUSIONS

We uncovered a simple, reproducible relationship between Reddit users' reported bedtimes and the time of day when high daytime posting rates transition to low nighttime posting rates. We captured this relationship in a model that estimates users' bedtimes from the time stamps of their posts. Limitations include applicability only to users who post frequently, the requirement for time zone data, and limits on generalizability. Nonetheless, it is a step forward for inferring the sleep parameters of social media users passively at scale. Our model and precomputed estimated bedtimes of 50,000 Reddit users are freely available.

Collapse

Cohen Zion M, Gescheit I, Levy N, Yom-Tov E. Identifying Sleep Disorders From Search Engine Activity: Combining User-Generated Data With a Clinically Validated Questionnaire. J Med Internet Res 2022;24:e41288. [DOI: 10.2196/41288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Revised: 10/25/2022] [Accepted: 11/15/2022] [Indexed: 11/24/2022] Open

Abstract Background Sleep disorders are experienced by up to 40% of the population but their diagnosis is often delayed by the availability of specialists. Objective We propose the use of search engine activity in conjunction with a validated web-based sleep questionnaire to facilitate wide-scale screening of prevalent sleep disorders. Methods Search advertisements offering a web-based sleep disorder screening questionnaire were shown on the Bing search engine to individuals who indicated an interest in sleep disorders. People who clicked on the advertisements and completed the sleep questionnaire were identified as being at risk for 1 of 4 common sleep disorders. A machine learning algorithm was applied to previous search engine queries to predict their suspected sleep disorder, as identified by the questionnaire. Results A total of 397 users consented to participate in the study and completed the questionnaire. Of them, 132 had sufficient past query data for analysis. Our findings show that diurnal patterns of people with sleep disorders were shifted by 2-3 hours compared to those of the controls. Past query activity was predictive of sleep disorders, approaching an area under the receiver operating characteristic curve of 0.62-0.69, depending on the sleep disorder. Conclusions Targeted advertisements can be used as an initial screening tool for people with sleep disorders. However, search engine data are seemingly insufficient as a sole method for screening. Nevertheless, we believe that evaluable web-based information, easily collected and processed with little effort on part of the physician and with low burden on the individual, can assist in the diagnostic process and possibly drive people to seek sleep assessment and diagnosis earlier than they currently do. Collapse

Lee ITL, Juang SE, Chen ST, Ko C, Ma KSK. Sentiment analysis of tweets on alopecia areata, hidradenitis suppurativa, and psoriasis: Revealing the patient experience. Front Med (Lausanne) 2022;9:996378. [PMID: 36388938 PMCID: PMC9660311 DOI: 10.3389/fmed.2022.996378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 09/26/2022] [Indexed: 11/26/2022] Open

Abstract

Background

Chronic dermatologic disorders can cause significant emotional distress. Sentiment analysis of disease-related tweets helps identify patients' experiences of skin disease.

Objective

To analyze the expressed sentiments in tweets related to alopecia areata (AA), hidradenitis suppurativa (HS), and psoriasis (PsO) in comparison to fibromyalgia (FM).

Methods

This is a cross-sectional analysis of Twitter users' expressed sentiment on AA, HS, PsO, and FM. Tweets related to the diseases of interest were identified with keywords and hashtags for one month (April, 2022) using the Twitter standard application programming interface (API). Text, account types, and numbers of retweets and likes were collected. The sentiment analysis was performed by the R "tidytext" package using the AFINN lexicon.

Results

A total of 1,505 tweets were randomly extracted, of which 243 (16.15%) referred to AA, 186 (12.36%) to HS, 510 (33.89%) to PsO, and 566 (37.61%) to FM. The mean sentiment score was -0.239 ± 2.90. AA, HS, and PsO had similar sentiment scores (p = 0.482). Although all skin conditions were associated with a negative polarity, their average was significantly less negative than FM (p < 0.0001). Tweets from private accounts were more negative, especially for AA (p = 0.0082). Words reflecting patients' psychological states varied in different diseases. "Anxiety" was observed in posts on AA and FM but not posts on HS and PsO, while "crying" was frequently used in posts on HS. There was no definite correlation between the sentiment score and the number of retweets or likes, although negative AA tweets from public accounts received more retweets (p = 0.03511) and likes (p = 0.0228).

Conclusion

The use of Twitter sentiment analysis is a promising method to document patients' experience of skin diseases, which may improve patient care through bridging misconceptions and knowledge gaps between patients and healthcare professionals.

Collapse

Shakeri Hossein Abad Z, Butler GP, Thompson W, Lee J. Crowdsourcing for Machine Learning in Public Health Surveillance: Lessons Learned From Amazon Mechanical Turk. J Med Internet Res 2022;24:e28749. [PMID: 35040794 PMCID: PMC8808350 DOI: 10.2196/28749] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2021] [Revised: 07/05/2021] [Accepted: 11/15/2021] [Indexed: 12/30/2022] Open

Abstract

Background

Crowdsourcing services, such as Amazon Mechanical Turk (AMT), allow researchers to use the collective intelligence of a wide range of web users for labor-intensive tasks. As the manual verification of the quality of the collected results is difficult because of the large volume of data and the quick turnaround time of the process, many questions remain to be explored regarding the reliability of these resources for developing digital public health systems.

Objective

This study aims to explore and evaluate the application of crowdsourcing, generally, and AMT, specifically, for developing digital public health surveillance systems.

Methods

We collected 296,166 crowd-generated labels for 98,722 tweets, labeled by 610 AMT workers, to develop machine learning (ML) models for detecting behaviors related to physical activity, sedentary behavior, and sleep quality among Twitter users. To infer the ground truth labels and explore the quality of these labels, we studied 4 statistical consensus methods that are agnostic of task features and only focus on worker labeling behavior. Moreover, to model the meta-information associated with each labeling task and leverage the potential of context-sensitive data in the truth inference process, we developed 7 ML models, including traditional classifiers (offline and active), a deep learning–based classification model, and a hybrid convolutional neural network model.

Results

Although most crowdsourcing-based studies in public health have often equated majority vote with quality, the results of our study using a truth set of 9000 manually labeled tweets showed that consensus-based inference models mask underlying uncertainty in data and overlook the importance of task meta-information. Our evaluations across 3 physical activity, sedentary behavior, and sleep quality data sets showed that truth inference is a context-sensitive process, and none of the methods studied in this paper were consistently superior to others in predicting the truth label. We also found that the performance of the ML models trained on crowd-labeled data was sensitive to the quality of these labels, and poor-quality labels led to incorrect assessment of these models. Finally, we have provided a set of practical recommendations to improve the quality and reliability of crowdsourced data.

Conclusions

Our findings indicate the importance of the quality of crowd-generated labels in developing ML models designed for decision-making purposes, such as public health surveillance decisions. A combination of inference models outlined and analyzed in this study could be used to quantitatively measure and improve the quality of crowd-generated labels for training ML models.

Collapse

Sachini E, Sioumalas-⁠ Christodoulou K, Bouras N, Karampekios N. Lessons for science and technology policy? Probing the Linkedin network of an RDI organisation. SN SOCIAL SCIENCES 2022;2:271. [PMCID: PMC9734916 DOI: 10.1007/s43545-022-00586-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 11/30/2022] [Indexed: 12/14/2022]

Sakib AS, Mukta MSH, Huda FR, Islam AKMN, Islam T, Ali ME. Identifying Insomnia From Social Media Posts: Psycholinguistic Analyses of User Tweets. J Med Internet Res 2021;23:e27613. [PMID: 34889758 PMCID: PMC8704110 DOI: 10.2196/27613] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2021] [Revised: 05/12/2021] [Accepted: 10/05/2021] [Indexed: 11/21/2022] Open

Abstract

Background

Many people suffer from insomnia, a sleep disorder characterized by difficulty falling and staying asleep during the night. As social media have become a ubiquitous platform to share users’ thoughts, opinions, activities, and preferences with their friends and acquaintances, the shared content across these platforms can be used to diagnose different health problems, including insomnia. Only a few recent studies have examined the prediction of insomnia from Twitter data, and we found research gaps in predicting insomnia from word usage patterns and correlations between users’ insomnia and their Big 5 personality traits as derived from social media interactions.

Objective

The purpose of this study is to build an insomnia prediction model from users’ psycholinguistic patterns, including the elements of word usage, semantics, and their Big 5 personality traits as derived from tweets.

Methods

In this paper, we exploited both psycholinguistic and personality traits derived from tweets to identify insomnia patients. First, we built psycholinguistic profiles of the users from their word choices and the semantic relationships between the words of their tweets. We then determined the relationship between a users’ personality traits and insomnia. Finally, we built a double-weighted ensemble classification model to predict insomnia from both psycholinguistic and personality traits as derived from user tweets.

Results

Our classification model showed strong prediction potential (78.8%) to predict insomnia from tweets. As insomniacs are generally ill-tempered and feel more stress and mental exhaustion, we observed significant correlations of certain word usage patterns among them. They tend to use negative words (eg, “no,” “not,” “never”). Some people frequently use swear words (eg, “damn,” “piss,” “fuck”) with strong temperament. They also use anxious (eg, “worried,” “fearful,” “nervous”) and sad (eg, “crying,” “grief,” “sad”) words in their tweets. We also found that the users with high neuroticism and conscientiousness scores for the Big 5 personality traits likely have strong correlations with insomnia. Additionally, we observed that users with high conscientiousness scores have strong correlations with insomnia patterns, while negative correlation between extraversion and insomnia was also found.

Conclusions

Our model can help predict insomnia from users’ social media interactions. Thus, incorporating our model into a software system can help family members detect insomnia problems in individuals before they become worse. The software system can also help doctors to diagnose possible insomnia in patients.

Collapse

Vargas AN, Maier A, Vallim MBR, Banda JM, Preciado VM. Negative Perception of the COVID-19 Pandemic Is Dropping: Evidence From Twitter Posts. Front Psychol 2021;12:737882. [PMID: 34650494 PMCID: PMC8505703 DOI: 10.3389/fpsyg.2021.737882] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Accepted: 08/23/2021] [Indexed: 12/24/2022] Open

Thorpe Huerta D, Hawkins JB, Brownstein JS, Hswen Y. Exploring discussions of health and risk and public sentiment in Massachusetts during COVID-19 pandemic mandate implementation: A Twitter analysis. SSM Popul Health 2021;15:100851. [PMID: 34355055 PMCID: PMC8325089 DOI: 10.1016/j.ssmph.2021.100851] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Revised: 06/17/2021] [Accepted: 06/17/2021] [Indexed: 11/04/2022] Open

Abstract

As policies are adjusted throughout the COVID-19 pandemic according to public health best practices, there is a need to balance the importance of social distancing in preventing viral spread with the strain that these governmental public safety mandates put on public mental health. Thus, there is need for continuous observation of public sentiment and deliberation to inform further adaptation of mandated interventions. In this study, we explore how public response may be reflected in Massachusetts (MA) via social media by specifically exploring temporal patterns in Twitter posts (tweets) regarding sentiment and discussion of topics. We employ interrupted time series centered on (1) Massachusetts State of Emergency declaration (March 10), (2) US State of Emergency declaration (March 13) and (3) Massachusetts public school closure (March 17) to explore changes in tweet sentiment polarity (net negative/positive), expressed anxiety and discussion on risk and health topics on a random subset of all tweets coded within Massachusetts and published from January 1 to May 15, 2020 (n = 2.8 million). We find significant differences between tweets published before and after mandate enforcement for Massachusetts State of Emergency (increased discussion of risk and health, decreased polarity and increased anxiety expression), US State of Emergency (increased discussion of risk and health, and increased anxiety expression) and Massachusetts public school closure (increased discussion of risk and decreased polarity). Our work further validates that Twitter data is a reasonable way to monitor public sentiment and discourse within a crisis, especially in conjunction with other observation data.

•

Twitter can be used to track the emotions of the public during times of crises.

•

During COVID-19 shelter-in-place an increase in discussions about risk and health, and anxiety levels was seen on Twitter.

•

Real-time information from Twitter may be used to make quick evidence-based decisions based on public reactions.

Collapse

Grande D, Luna Marti X, Merchant RM, Asch DA, Dolan A, Sharma M, Cannuscio CC. Consumer Views on Health Applications of Consumer Digital Data and Health Privacy Among US Adults: Qualitative Interview Study. J Med Internet Res 2021;23:e29395. [PMID: 34106074 PMCID: PMC8262668 DOI: 10.2196/29395] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Revised: 05/10/2021] [Accepted: 05/16/2021] [Indexed: 11/15/2022] Open

Abstract

BACKGROUND

In 2020, the number of internet users surpassed 4.6 billion. Individuals who create and share digital data can leave a trail of information about their habits and preferences that collectively generate a digital footprint. Studies have shown that digital footprints can reveal important information regarding an individual's health status, ranging from diet and exercise to depression. Uses of digital applications have accelerated during the COVID-19 pandemic where public health organizations have utilized technology to reduce the burden of transmission, ultimately leading to policy discussions about digital health privacy. Though US consumers report feeling concerned about the way their personal data is used, they continue to use digital technologies.

OBJECTIVE

This study aimed to understand the extent to which consumers recognize possible health applications of their digital data and identify their most salient concerns around digital health privacy.

METHODS

We conducted semistructured interviews with a diverse national sample of US adults from November 2018 to January 2019. Participants were recruited from the Ipsos KnowledgePanel, a nationally representative panel. Participants were asked to reflect on their own use of digital technology, rate various sources of digital information, and consider several hypothetical scenarios with varying sources and health-related applications of personal digital information.

RESULTS

The final cohort included a diverse national sample of 45 US consumers. Participants were generally unaware what consumer digital data might reveal about their health. They also revealed limited knowledge of current data collection and aggregation practices. When responding to specific scenarios with health-related applications of data, they had difficulty weighing the benefits and harms but expressed a desire for privacy protection. They saw benefits in using digital data to improve health, but wanted limits to health programs' use of consumer digital data.

CONCLUSIONS

Current privacy restrictions on health-related data are premised on the notion that these data are derived only from medical encounters. Given that an increasing amount of health-related data is derived from digital footprints in consumer settings, our findings suggest the need for greater transparency of data collection and uses, and broader health privacy protections.

Collapse

Kim MT, Radhakrishnan K, Heitkemper EM, Choi E, Burgermaster M. Psychosocial phenotyping as a personalization strategy for chronic disease self-management interventions. Am J Transl Res 2021;13:1617-1635. [PMID: 33841684 PMCID: PMC8014371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 01/09/2021] [Indexed: 06/12/2023]

Abstract

BACKGROUND

As the U.S. population grows older and more diverse, self-management needs are increasingly complicated. In order to deliver effective personalized interventions to those suffer from chronic conditions social determinants of health must be considered. Therefore, psychosocial phenotyping holds strong promise as a tool for tailoring interventions based on precision health principles.

PURPOSE

To define psychosocial phenotyping and develop a research agenda that promotes its integration into chronic disease management as a tool for precision self-management interventions.

METHODS

Since psychosocial phenotyping is not yet used in interventions for self-management support, we conducted a literature review to identify potential phenotypes for chronic disease self-management. We also reviewed policy intervention case reports from the Centers for Medicare and Medicaid Services to examine factors related to social determinants of health in people with chronic illnesses. Finally, we reviewed methodological approaches for identifying patient profiles or phenotypes.

RESULTS

The literature review revealed areas within which to collect data for psychosocial phenotyping that can inform personalized interventions. The findings of our exemplar cases revealed that several environmental or key SDOH such as factors realted with economic stability and neighborhood environment have been closely linked with the success of chronic disease management interventions. We elucidated theory, definitions, and pragmatic conceptual boundaries related to psychosocial phenotyping for precision health.

CONCLUSIONS

Our literature review with case example analysis demonstrates the potential usefulness of psychosocial phenotyping as a tool to enhance personalized self-management interventions for people with chronic diseases, with implications for future research.

Collapse

Huang D, Huang Y, Khanna S, Dwivedi P, Slopen N, Green KM, He X, Puett R, Nguyen Q. Twitter-Derived Social Neighborhood Characteristics and Individual-Level Cardiometabolic Outcomes: Cross-Sectional Study in a Nationally Representative Sample. JMIR Public Health Surveill 2020;6:e17969. [PMID: 32808935 PMCID: PMC7485998 DOI: 10.2196/17969] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2020] [Revised: 04/26/2020] [Accepted: 05/27/2020] [Indexed: 11/13/2022] Open

Abstract

BACKGROUND

Social media platforms such as Twitter can serve as a potential data source for public health research to characterize the social neighborhood environment. Few studies have linked Twitter-derived characteristics to individual-level health outcomes.

OBJECTIVE

This study aims to assess the association between Twitter-derived social neighborhood characteristics, including happiness, food, and physical activity mentions, with individual cardiometabolic outcomes using a nationally representative sample.

METHODS

We collected a random 1% of the geotagged tweets from April 2015 to March 2016 using Twitter's Streaming Application Interface (API). Twitter-derived zip code characteristics on happiness, food, and physical activity were merged to individual outcomes from restricted-use National Health and Nutrition Examination Survey (NHANES) with residential zip codes. Separate regression analyses were performed for each of the neighborhood characteristics using NHANES 2011-2016 and 2007-2016.

RESULTS

Individuals living in the zip codes with the two highest tertiles of happy tweets reported BMI of 0.65 (95% CI -1.10 to -0.20) and 0.85 kg/m2 (95% CI -1.48 to -0.21) lower than those living in zip codes with the lowest frequency of happy tweets. Happy tweets were also associated with a 6%-8% lower prevalence of hypertension. A higher prevalence of healthy food tweets was linked with an 11% (95% CI 2% to 21%) lower prevalence of obesity. Those living in areas with the highest and medium tertiles of physical activity tweets were associated with a lower prevalence of hypertension by 10% (95% CI 4% to 15%) and 8% (95% CI 2% to 14%), respectively.

CONCLUSIONS

Twitter-derived social neighborhood characteristics were associated with individual-level obesity and hypertension in a nationally representative sample of US adults. Twitter data could be used for capturing neighborhood sociocultural influences on chronic conditions and may be used as a platform for chronic outcomes prevention.

Collapse

Anwar M, Khoury D, Aldridge AP, Parker SJ, Conway KP. Using Twitter to Surveil the Opioid Epidemic in North Carolina: An Exploratory Study. JMIR Public Health Surveill 2020;6:e17574. [PMID: 32469322 PMCID: PMC7380977 DOI: 10.2196/17574] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2019] [Revised: 04/27/2020] [Accepted: 05/15/2020] [Indexed: 02/04/2023] Open

Abstract

BACKGROUND

Over the last two decades, deaths associated with opioids have escalated in number and geographic spread, impacting more and more individuals, families, and communities. Reflecting on the shifting nature of the opioid overdose crisis, Dasgupta, Beletsky, and Ciccarone offer a triphasic framework to explain that opioid overdose deaths (OODs) shifted from prescription opioids for pain (beginning in 2000), to heroin (2010 to 2015), and then to synthetic opioids (beginning in 2013). Given the rapidly shifting nature of OODs, timelier surveillance data are critical to inform strategies that combat the opioid crisis. Using easily accessible and near real-time social media data to improve public health surveillance efforts related to the opioid crisis is a promising area of research.

OBJECTIVE

This study explored the potential of using Twitter data to monitor the opioid epidemic. Specifically, this study investigated the extent to which the content of opioid-related tweets corresponds with the triphasic nature of the opioid crisis and correlates with OODs in North Carolina between 2009 and 2017.

METHODS

Opioid-related Twitter posts were obtained using Crimson Hexagon, and were classified as relating to prescription opioids, heroin, and synthetic opioids using natural language processing. This process resulted in a corpus of 100,777 posts consisting of tweets, retweets, mentions, and replies. Using a random sample of 10,000 posts from the corpus, we identified opioid-related terms by analyzing word frequency for each year. OODs were obtained from the Multiple Cause of Death database from the Centers for Disease Control and Prevention Wide-ranging Online Data for Epidemiologic Research (CDC WONDER). Least squares regression and Granger tests compared patterns of opioid-related posts with OODs.

RESULTS

The pattern of tweets related to prescription opioids, heroin, and synthetic opioids resembled the triphasic nature of OODs. For prescription opioids, tweet counts and OODs were statistically unrelated. Tweets mentioning heroin and synthetic opioids were significantly associated with heroin OODs and synthetic OODs in the same year (P=.01 and P<.001, respectively), as well as in the following year (P=.03 and P=.01, respectively). Moreover, heroin tweets in a given year predicted heroin deaths better than lagged heroin OODs alone (P=.03).

CONCLUSIONS

Findings support using Twitter data as a timely indicator of opioid overdose mortality, especially for heroin.

Collapse

Jaidka K, Giorgi S, Schwartz HA, Kern ML, Ungar LH, Eichstaedt JC. Estimating geographic subjective well-being from Twitter: A comparison of dictionary and data-driven language methods. Proc Natl Acad Sci U S A 2020;117:10165-10171. [PMID: 32341156 PMCID: PMC7229753 DOI: 10.1073/pnas.1906364117] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Cole DA, Nick EA, Varga G, Smith D, Zelkowitz RL, Ford MA, Lédeczi Á. Are Aspects of Twitter Use Associated with Reduced Depressive Symptoms? The Moderating Role of In-Person Social Support. CYBERPSYCHOLOGY BEHAVIOR AND SOCIAL NETWORKING 2020;22:692-699. [PMID: 31697601 DOI: 10.1089/cyber.2019.0035] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Gibbons J, Malouf R, Spitzberg B, Martinez L, Appleyard B, Thompson C, Nara A, Tsou MH. Twitter-based measures of neighborhood sentiment as predictors of residential population health. PLoS One 2019;14:e0219550. [PMID: 31295294 PMCID: PMC6622529 DOI: 10.1371/journal.pone.0219550] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2018] [Accepted: 06/26/2019] [Indexed: 12/03/2022] Open

Yin Z, Sulieman LM, Malin BA. A systematic literature review of machine learning in online personal health data. J Am Med Inform Assoc 2019;26:561-576. [PMID: 30908576 PMCID: PMC7647332 DOI: 10.1093/jamia/ocz009] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2018] [Revised: 01/06/2019] [Accepted: 01/11/2019] [Indexed: 02/07/2023] Open

Doan S, Yang EW, Tilak SS, Li PW, Zisook DS, Torii M. Extracting health-related causality from twitter messages using natural language processing. BMC Med Inform Decis Mak 2019;19:79. [PMID: 30943954 PMCID: PMC6448183 DOI: 10.1186/s12911-019-0785-0] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

Hswen Y, Qin Q, Brownstein JS, Hawkins JB. Feasibility of using social media to monitor outdoor air pollution in London, England. Prev Med 2019;121:86-93. [PMID: 30742873 PMCID: PMC7316422 DOI: 10.1016/j.ypmed.2019.02.005] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/26/2018] [Revised: 12/24/2018] [Accepted: 02/07/2019] [Indexed: 11/28/2022]

Hswen Y, Gopaluni A, Brownstein JS, Hawkins JB. Using Twitter to Detect Psychological Characteristics of Self-Identified Persons With Autism Spectrum Disorder: A Feasibility Study. JMIR Mhealth Uhealth 2019;7:e12264. [PMID: 30747718 PMCID: PMC6390184 DOI: 10.2196/12264] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2018] [Revised: 11/16/2018] [Accepted: 11/18/2018] [Indexed: 12/31/2022] Open

Abstract

Background

More than 3.5 million Americans live with autism spectrum disorder (ASD). Major challenges persist in diagnosing ASD as no medical test exists to diagnose this disorder. Digital phenotyping holds promise to guide in the clinical diagnoses and screening of ASD.

Objective

This study aims to explore the feasibility of using the Web-based social media platform Twitter to detect psychological and behavioral characteristics of self-identified persons with ASD.

Methods

Data from Twitter were retrieved from 152 self-identified users with ASD and 182 randomly selected control users from March 22, 2012 to July 20, 2017. We conducted a between-group comparative textual analysis of tweets about repetitive and obsessive-compulsive behavioral characteristics typically associated with ASD. In addition, common emotional characteristics of persons with ASD, such as fear, paranoia, and anxiety, were examined between groups through textual analysis. Furthermore, we compared the timing of tweets between users with ASD and control users to identify patterns in communication.

Results

Users with ASD posted a significantly higher frequency of tweets related to the specific repetitive behavior of counting compared with control users (P<.001). The textual analysis of obsessive-compulsive behavioral characteristics, such as fixate, excessive, and concern, were significantly higher among users with ASD compared with the control group (P<.001). In addition, emotional terms related to fear, paranoia, and anxiety were tweeted at a significantly higher rate among users with ASD compared with control users (P<.001). Users with ASD posted a smaller proportion of tweets during time intervals of 00:00-05:59 (P<.001), 06:00-11:59 (P<.001), and 18:00-23.59 (P<.001), as well as a greater proportion of tweets from 12:00 to 17:59 (P<.001) compared with control users.

Conclusions

Social media may be a valuable resource for observing unique psychological characteristics of self-identified persons with ASD. Collecting and analyzing data from these digital platforms may afford opportunities to identify the characteristics of ASD and assist in the diagnosis or verification of ASD. This study highlights the feasibility of leveraging digital data for gaining new insights into various health conditions.

Collapse

Anýž J, Bakštein E, Dudysová D, Veldová K, Kliková M, Fárková E, Kopřivová J, Španiel F. No wink of sleep: Population sleep characteristics in response to the brexit poll and the 2016 U.S. presidential election. Soc Sci Med 2019;222:112-121. [DOI: 10.1016/j.socscimed.2018.12.024] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2018] [Revised: 11/16/2018] [Accepted: 12/17/2018] [Indexed: 01/23/2023]

Hswen Y, Naslund JA, Brownstein JS, Hawkins JB. Monitoring Online Discussions About Suicide Among Twitter Users With Schizophrenia: Exploratory Study. JMIR Ment Health 2018;5:e11483. [PMID: 30545811 PMCID: PMC6315229 DOI: 10.2196/11483] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/04/2018] [Revised: 09/13/2018] [Accepted: 09/14/2018] [Indexed: 12/12/2022] Open

Abstract

BACKGROUND

People with schizophrenia experience elevated risk of suicide. Mental health symptoms, including depression and anxiety, contribute to increased risk of suicide. Digital technology could support efforts to detect suicide risk and inform suicide prevention efforts.

OBJECTIVE

This exploratory study examined the feasibility of monitoring online discussions about suicide among Twitter users who self-identify as having schizophrenia.

METHODS

Posts containing the terms suicide or suicidal were collected from a sample of Twitter users who self-identify as having schizophrenia (N=203) and a random sample of control users (N=173) over a 200-day period. Frequency and timing of posts about suicide were compared between groups. The associations between posting about suicide and common mental health symptoms were examined.

RESULTS

Twitter users who self-identify as having schizophrenia posted more tweets about suicide (mean 7.10, SD 15.98) compared to control users (mean 1.89, SD 4.79; t₃₇₄=-4.13, P<.001). Twitter users who self-identify as having schizophrenia showed greater odds of tweeting about suicide compared to control users (odds ratio 2.15, 95% CI 1.42-3.28). Among all users, tweets about suicide were associated with tweets about depression (r=0.62, P<.001) and anxiety (r=0.45, P<.001).

CONCLUSIONS

Twitter users who self-identify as having schizophrenia appear to commonly discuss suicide on social media, which is associated with greater discussion about other mental health symptoms. These findings should be interpreted cautiously, as it is not possible to determine whether online discussions about suicide correlate with suicide risk. However, these patterns of online discussion may be indicative of elevated risk of suicide observed in this patient group. There may be opportunities to leverage social media for supporting suicide prevention among individuals with schizophrenia.

Collapse

Nguyen TT, Meng HW, Sandeep S, McCullough M, Yu W, Lau Y, Huang D, Nguyen QC. Twitter-derived measures of sentiment towards minorities (2015-2016) and associations with low birth weight and preterm birth in the United States. COMPUTERS IN HUMAN BEHAVIOR 2018;89:308-315. [PMID: 30923420 PMCID: PMC6432619 DOI: 10.1016/j.chb.2018.08.010] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]

Hswen Y, Naslund JA, Brownstein JS, Hawkins JB. Online Communication about Depression and Anxiety among Twitter Users with Schizophrenia: Preliminary Findings to Inform a Digital Phenotype Using Social Media. Psychiatr Q 2018;89:569-580. [PMID: 29327218 PMCID: PMC6043409 DOI: 10.1007/s11126-017-9559-y] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Characterizing Depression Issues on Sina Weibo. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2018;15:ijerph15040764. [PMID: 29659489 PMCID: PMC5923806 DOI: 10.3390/ijerph15040764] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/20/2018] [Revised: 04/08/2018] [Accepted: 04/12/2018] [Indexed: 11/17/2022]

Twitter-derived neighborhood characteristics associated with obesity and diabetes. Sci Rep 2017;7:16425. [PMID: 29180792 PMCID: PMC5703998 DOI: 10.1038/s41598-017-16573-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Accepted: 11/14/2017] [Indexed: 11/08/2022] Open

Exploring online communication about cigarette smoking among Twitter users who self-identify as having schizophrenia. Psychiatry Res 2017;257:479-484. [PMID: 28841509 PMCID: PMC5877400 DOI: 10.1016/j.psychres.2017.08.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/13/2017] [Revised: 06/13/2017] [Accepted: 08/01/2017] [Indexed: 12/19/2022]

Huang T, Elghafari A, Relia K, Chunara R. High-resolution Temporal Representations of Alcohol and Tobacco Behaviors from Social Media Data. PROCEEDINGS OF THE ACM ON HUMAN-COMPUTER INTERACTION 2017;1:54. [PMID: 29264592 PMCID: PMC5734092 DOI: 10.1145/3134689] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Liu J, Weitzman ER, Chunara R. Assessing Behavioral Stages From Social Media Data. CSCW : PROCEEDINGS OF THE CONFERENCE ON COMPUTER-SUPPORTED COOPERATIVE WORK. CONFERENCE ON COMPUTER-SUPPORTED COOPERATIVE WORK 2017;2017:1320-1333. [PMID: 29034371 DOI: 10.1145/2998181.2998336] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]

Paul MJ, Dredze M. Social Monitoring for Public Health. ACTA ACUST UNITED AC 2017. [DOI: 10.2200/s00791ed1v01y201707icr060] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]

Hausmann JS, Touloumtzis C, White MT, Colbert JA, Gooding H. Adolescent and Young Adult Use of Social Media for Health and Its Implications. J Adolesc Health 2017;60:714-719. [PMID: 28259620 PMCID: PMC5441939 DOI: 10.1016/j.jadohealth.2016.12.025] [Citation(s) in RCA: 67] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/24/2016] [Revised: 12/26/2016] [Accepted: 12/27/2016] [Indexed: 10/20/2022]

Hawkins AN, Filtness AJ. Driver sleepiness on YouTube: A content analysis. ACCIDENT; ANALYSIS AND PREVENTION 2017;99:459-464. [PMID: 26653707 DOI: 10.1016/j.aap.2015.11.025] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2015] [Revised: 10/22/2015] [Accepted: 11/20/2015] [Indexed: 06/05/2023]

Burke-Garcia A, Stanton CA. A tale of two tools: Reliability and feasibility of social media measurement tools examining e-cigarette twitter mentions. INFORMATICS IN MEDICINE UNLOCKED 2017. [DOI: 10.1016/j.imu.2017.04.001] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open

Powell GE, Seifert HA, Reblin T, Burstein PJ, Blowers J, Menius JA, Painter JL, Thomas M, Pierce CE, Rodriguez HW, Brownstein JS, Freifeld CC, Bell HG, Dasgupta N. Social Media Listening for Routine Post-Marketing Safety Surveillance. Drug Saf 2016;39:443-54. [PMID: 26798054 DOI: 10.1007/s40264-015-0385-6] [Citation(s) in RCA: 64] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Abstract

INTRODUCTION

Post-marketing safety surveillance primarily relies on data from spontaneous adverse event reports, medical literature, and observational databases. Limitations of these data sources include potential under-reporting, lack of geographic diversity, and time lag between event occurrence and discovery. There is growing interest in exploring the use of social media ('social listening') to supplement established approaches for pharmacovigilance. Although social listening is commonly used for commercial purposes, there are only anecdotal reports of its use in pharmacovigilance. Health information posted online by patients is often publicly available, representing an untapped source of post-marketing safety data that could supplement data from existing sources.

OBJECTIVES

The objective of this paper is to describe one methodology that could help unlock the potential of social media for safety surveillance.

METHODS

A third-party vendor acquired 24 months of publicly available Facebook and Twitter data, then processed the data by standardizing drug names and vernacular symptoms, removing duplicates and noise, masking personally identifiable information, and adding supplemental data to facilitate the review process. The resulting dataset was analyzed for safety and benefit information.

RESULTS

In Twitter, a total of 6,441,679 Medical Dictionary for Regulatory Activities (MedDRA(®)) Preferred Terms (PTs) representing 702 individual PTs were discussed in the same post as a drug compared with 15,650,108 total PTs representing 946 individual PTs in Facebook. Further analysis revealed that 26 % of posts also contained benefit information.

CONCLUSION

Social media listening is an important tool to augment post-marketing safety surveillance. Much work remains to determine best practices for using this rapidly evolving data source.

Collapse

Sathyanarayana A, Joty S, Fernandez-Luque L, Ofli F, Srivastava J, Elmagarmid A, Arora T, Taheri S. Sleep Quality Prediction From Wearable Data Using Deep Learning. JMIR Mhealth Uhealth 2016;4:e125. [PMID: 27815231 PMCID: PMC5116102 DOI: 10.2196/mhealth.6562] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2016] [Revised: 10/04/2016] [Accepted: 10/22/2016] [Indexed: 12/23/2022] Open

Abstract

Background

The importance of sleep is paramount to health. Insufficient sleep can reduce physical, emotional, and mental well-being and can lead to a multitude of health complications among people with chronic conditions. Physical activity and sleep are highly interrelated health behaviors. Our physical activity during the day (ie, awake time) influences our quality of sleep, and vice versa. The current popularity of wearables for tracking physical activity and sleep, including actigraphy devices, can foster the development of new advanced data analytics. This can help to develop new electronic health (eHealth) applications and provide more insights into sleep science.

Objective

The objective of this study was to evaluate the feasibility of predicting sleep quality (ie, poor or adequate sleep efficiency) given the physical activity wearable data during awake time. In this study, we focused on predicting good or poor sleep efficiency as an indicator of sleep quality.

Methods

Actigraphy sensors are wearable medical devices used to study sleep and physical activity patterns. The dataset used in our experiments contained the complete actigraphy data from a subset of 92 adolescents over 1 full week. Physical activity data during awake time was used to create predictive models for sleep quality, in particular, poor or good sleep efficiency. The physical activity data from sleep time was used for the evaluation. We compared the predictive performance of traditional logistic regression with more advanced deep learning methods: multilayer perceptron (MLP), convolutional neural network (CNN), simple Elman-type recurrent neural network (RNN), long short-term memory (LSTM-RNN), and a time-batched version of LSTM-RNN (TB-LSTM).

Results

Deep learning models were able to predict the quality of sleep (ie, poor or good sleep efficiency) based on wearable data from awake periods. More specifically, the deep learning methods performed better than traditional linear regression. CNN had the highest specificity and sensitivity, and an overall area under the receiver operating characteristic (ROC) curve (AUC) of 0.9449, which was 46% better as compared with traditional linear regression (0.6463).

Conclusions

Deep learning methods can predict the quality of sleep based on actigraphy data from awake periods. These predictive models can be an important tool for sleep research and to improve eHealth solutions for sleep.

Collapse

Yoon S. What Can We Learn About Mental Health Needs From Tweets Mentioning Dementia on World Alzheimer's Day? J Am Psychiatr Nurses Assoc 2016;22:498-503. [PMID: 27803262 PMCID: PMC5337405 DOI: 10.1177/1078390316663690] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Nguyen QC, Li D, Meng HW, Kath S, Nsoesie E, Li F, Wen M. Building a National Neighborhood Dataset From Geotagged Twitter Data for Indicators of Happiness, Diet, and Physical Activity. JMIR Public Health Surveill 2016;2:e158. [PMID: 27751984 PMCID: PMC5088343 DOI: 10.2196/publichealth.5869] [Citation(s) in RCA: 49] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2016] [Revised: 08/29/2016] [Accepted: 09/15/2016] [Indexed: 01/09/2023] Open

Abstract

Background

Studies suggest that where people live, play, and work can influence health and well-being. However, the dearth of neighborhood data, especially data that is timely and consistent across geographies, hinders understanding of the effects of neighborhoods on health. Social media data represents a possible new data resource for neighborhood research.

Objective

The aim of this study was to build, from geotagged Twitter data, a national neighborhood database with area-level indicators of well-being and health behaviors.

Methods

We utilized Twitter’s streaming application programming interface to continuously collect a random 1% subset of publicly available geolocated tweets for 1 year (April 2015 to March 2016). We collected 80 million geotagged tweets from 603,363 unique Twitter users across the contiguous United States. We validated our machine learning algorithms for constructing indicators of happiness, food, and physical activity by comparing predicted values to those generated by human labelers. Geotagged tweets were spatially mapped to the 2010 census tract and zip code areas they fall within, which enabled further assessment of the associations between Twitter-derived neighborhood variables and neighborhood demographic, economic, business, and health characteristics.

Results

Machine labeled and manually labeled tweets had a high level of accuracy: 78% for happiness, 83% for food, and 85% for physical activity for dichotomized labels with the F scores 0.54, 0.86, and 0.90, respectively. About 20% of tweets were classified as happy. Relatively few terms (less than 25) were necessary to characterize the majority of tweets on food and physical activity. Data from over 70,000 census tracts from the United States suggest that census tract factors like percentage African American and economic disadvantage were associated with lower census tract happiness. Urbanicity was related to higher frequency of fast food tweets. Greater numbers of fast food restaurants predicted higher frequency of fast food mentions. Surprisingly, fitness centers and nature parks were only modestly associated with higher frequency of physical activity tweets. Greater state-level happiness, positivity toward physical activity, and positivity toward healthy foods, assessed via tweets, were associated with lower all-cause mortality and prevalence of chronic conditions such as obesity and diabetes and lower physical inactivity and smoking, controlling for state median income, median age, and percentage white non-Hispanic.

Conclusions

Machine learning algorithms can be built with relatively high accuracy to characterize sentiment, food, and physical activity mentions on social media. Such data can be utilized to construct neighborhood indicators consistently and cost effectively. Access to neighborhood data, in turn, can be leveraged to better understand neighborhood effects and address social determinants of health. We found that neighborhoods with social and economic disadvantage, high urbanicity, and more fast food restaurants may exhibit lower happiness and fewer healthy behaviors.

Collapse

Kunkle S, Christie G, Yach D, El-Sayed AM. The Importance of Computer Science for Public Health Training: An Opportunity and Call to Action. JMIR Public Health Surveill 2016;2:e10. [PMID: 27227145 PMCID: PMC4869246 DOI: 10.2196/publichealth.5018] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2015] [Revised: 02/01/2016] [Accepted: 02/04/2016] [Indexed: 11/13/2022] Open

Katsuki T, Mackey TK, Cuomo R. Establishing a Link Between Prescription Drug Abuse and Illicit Online Pharmacies: Analysis of Twitter Data. J Med Internet Res 2015;17:e280. [PMID: 26677966 PMCID: PMC4704982 DOI: 10.2196/jmir.5144] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2015] [Revised: 10/28/2015] [Accepted: 11/11/2015] [Indexed: 11/13/2022] Open

Abstract

Background

Youth and adolescent non-medical use of prescription medications (NUPM) has become a national epidemic. However, little is known about the association between promotion of NUPM behavior and access via the popular social media microblogging site, Twitter, which is currently used by a third of all teens.

Objective

In order to better assess NUPM behavior online, this study conducts surveillance and analysis of Twitter data to characterize the frequency of NUPM-related tweets and also identifies illegal access to drugs of abuse via online pharmacies.

Methods

Tweets were collected over a 2-week period from April 1-14, 2015, by applying NUPM keyword filters for both generic/chemical and street names associated with drugs of abuse using the Twitter public streaming application programming interface. Tweets were then analyzed for relevance to NUPM and whether they promoted illegal online access to prescription drugs using a protocol of content coding and supervised machine learning.

Results

A total of 2,417,662 tweets were collected and analyzed for this study. Tweets filtered for generic drugs names comprised 232,108 tweets, including 22,174 unique associated uniform resource locators (URLs), and 2,185,554 tweets (376,304 unique URLs) filtered for street names. Applying an iterative process of manual content coding and supervised machine learning, 81.72% of the generic and 12.28% of the street NUPM datasets were predicted as having content relevant to NUPM respectively. By examining hyperlinks associated with NUPM relevant content for the generic Twitter dataset, we discovered that 75.72% of the tweets with URLs included a hyperlink to an online marketing affiliate that directly linked to an illicit online pharmacy advertising the sale of Valium without a prescription.

Conclusions

This study examined the association between Twitter content, NUPM behavior promotion, and online access to drugs using a broad set of prescription drug keywords. Initial results are concerning, as our study found over 45,000 tweets that directly promoted NUPM by providing a URL that actively marketed the illegal online sale of prescription drugs of abuse. Additional research is needed to further establish the link between Twitter content and NUPM, as well as to help inform future technology-based tools, online health promotion activities, and public policy to combat NUPM online.

Collapse

Albalawi Y, Sixsmith J. Agenda Setting for Health Promotion: Exploring an Adapted Model for the Social Media Era. JMIR Public Health Surveill 2015;1:e21. [PMID: 27227139 PMCID: PMC4869225 DOI: 10.2196/publichealth.5014] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2015] [Revised: 09/20/2015] [Accepted: 10/17/2015] [Indexed: 11/16/2022] Open

Abstract

Background

The foundation of best practice in health promotion is a robust theoretical base that informs design, implementation, and evaluation of interventions that promote the public’s health. This study provides a novel contribution to health promotion through the adaptation of the agenda-setting approach in response to the contribution of social media. This exploration and proposed adaptation is derived from a study that examined the effectiveness of Twitter in influencing agenda setting among users in relation to road traffic accidents in Saudi Arabia.

Objective

The proposed adaptations to the agenda-setting model to be explored reflect two levels of engagement: agenda setting within the social media sphere and the position of social media within classic agenda setting. This exploratory research aims to assess the veracity of the proposed adaptations on the basis of the hypotheses developed to test these two levels of engagement.

Methods

To validate the hypotheses, we collected and analyzed data from two primary sources: Twitter activities and Saudi national newspapers. Keyword mentions served as indicators of agenda promotion; for Twitter, interactions were used to measure the process of agenda setting within the platform. The Twitter final dataset comprised 59,046 tweets and 38,066 users who contributed by tweeting, replying, or retweeting. Variables were collected for each tweet and user. In addition, 518 keyword mentions were recorded from six popular Saudi national newspapers.

Results

The results showed significant ratification of the study hypotheses at both levels of engagement that framed the proposed adaptions. The results indicate that social media facilitates the contribution of individuals in influencing agendas (individual users accounted for 76.29%, 67.79%, and 96.16% of retweet impressions, total impressions, and amplification multipliers, respectively), a component missing from traditional constructions of agenda-setting models. The influence of organizations on agenda setting is also highlighted (in the data of user interactions, organizational accounts registered 17% and 14.74% as source and target of interactions, respectively). In addition, 13 striking similarities showed the relationship between newspapers and Twitter on the mentions trends line.

Conclusions

The effective use of social media platforms in health promotion intervention programs requires new strategies that consider the limitations of traditional communication channels. Conducting research is vital to establishing a strong basis for modifying, designing, and developing new health promotion strategies and approaches.

Collapse