1
|
Luca M, Campedelli GM, Centellegher S, Tizzoni M, Lepri B. Crime, inequality and public health: a survey of emerging trends in urban data science. Front Big Data 2023; 6:1124526. [PMID: 37303974 PMCID: PMC10248183 DOI: 10.3389/fdata.2023.1124526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 05/10/2023] [Indexed: 06/13/2023] Open
Abstract
Urban agglomerations are constantly and rapidly evolving ecosystems, with globalization and increasing urbanization posing new challenges in sustainable urban development well summarized in the United Nations' Sustainable Development Goals (SDGs). The advent of the digital age generated by modern alternative data sources provides new tools to tackle these challenges with spatio-temporal scales that were previously unavailable with census statistics. In this review, we present how new digital data sources are employed to provide data-driven insights to study and track (i) urban crime and public safety; (ii) socioeconomic inequalities and segregation; and (iii) public health, with a particular focus on the city scale.
Collapse
Affiliation(s)
- Massimiliano Luca
- Mobile and Social Computing Lab, Bruno Kessler Foundation, Trento, Italy
- Faculty of Computer Science, Free University of Bolzano, Bolzano, Italy
| | | | | | - Michele Tizzoni
- Department of Sociology and Social Research, University of Trento, Trento, Italy
| | - Bruno Lepri
- Mobile and Social Computing Lab, Bruno Kessler Foundation, Trento, Italy
| |
Collapse
|
2
|
Sartirano D, Kalimeri K, Cattuto C, Delamónica E, Garcia-Herranz M, Mockler A, Paolotti D, Schifanella R. Strengths and limitations of relative wealth indices derived from big data in Indonesia. Front Big Data 2023; 6:1054156. [PMID: 36896443 PMCID: PMC9990410 DOI: 10.3389/fdata.2023.1054156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 01/27/2023] [Indexed: 02/23/2023] Open
Abstract
Accurate relative wealth estimates in Low and Middle-Income Countries (LMICS) are crucial to help policymakers address socio-demographic inequalities under the guidance of the Sustainable Development Goals set by the United Nations. Survey-based approaches have traditionally been employed to collect highly granular data about income, consumption, or household material goods to create index-based poverty estimates. However, these methods are only capture persons in households (i.e., in the household sample framework) and they do not include migrant populations or unhoused citizens. Novel approaches combining frontier data, computer vision, and machine learning have been proposed to complement these existing approaches. However, the strengths and limitations of these big-data-derived indices have yet to be sufficiently studied. In this paper, we focus on the case of Indonesia and examine one frontier-data derived Relative Wealth Index (RWI), created by the Facebook Data for Good initiative, that utilizes connectivity data from the Facebook Platform and satellite imagery data to produce a high-resolution estimate of relative wealth for 135 countries. We examine it concerning asset-based relative wealth indices estimated from existing high-quality national-level traditional survey instruments, the USAID-developed Demographic Health Survey (DHS), and the Indonesian National Socio-economic survey (SUSENAS). In this work, we aim to understand how the frontier-data derived index can be used to inform anti-poverty programs in Indonesia and the Asia Pacific region. First, we unveil key features that affect the comparison between the traditional and non-traditional sources, such as the publishing time and authority and the granularity of the spatial aggregation of the data. Second, to provide operational input, we hypothesize how a re-distribution of resources based on the RWI map would impact a current social program, the Social Protection Card (KPS) of Indonesia and assess impact. In this hypothetical scenario, we estimate the percentage of Indonesians eligible for the program, which would have been incorrectly excluded from a social protection payment had the RWI been used in place of the survey-based wealth index. The exclusion error in that case would be 32.82%. Within the context of the KPS program targeting, we noted significant differences between the RWI map's predictions and the SUSENAS ground truth index estimates.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Rossano Schifanella
- ISI Foundation, Turin, Italy.,Department of Computer Science, University of Turin, Turin, Italy
| |
Collapse
|
3
|
Kruzan KP, Fitzsimmons-Craft EE, Dobias M, Schleider JL, Pratap A. Developing, Deploying, and Evaluating Digital Mental Health Interventions in Spaces of Online Help- and Information-Seeking. PROCEDIA COMPUTER SCIENCE 2022; 206:6-22. [PMID: 37063642 PMCID: PMC10104522 DOI: 10.1016/j.procs.2022.09.081] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The internet is frequently the first point of contact for people seeking support for their mental health symptoms. Digital interventions designed to be deployed through the internet have significant promise to reach diverse populations who may not have access to, or are not yet engaged in, treatment and deliver evidence-based resources to address symptoms. The liminal nature of online interactions requires designing to prioritize needs detection, intervention potency, and efficiency. Real-world implementation, data privacy and safety are equally important and can involve transparent partnerships with stakeholders in industry and non-profit organizations. This commentary highlights challenges and opportunities for research in this space, grounded in learnings from multiple research projects and teams aligned with this effort.
Collapse
Affiliation(s)
- Kaylee P. Kruzan
- Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA
| | | | - Mallory Dobias
- Department of Psychology, Stony Brook University, Stony Brook, NY 11794, USA
| | | | - Abhishek Pratap
- Center for Addiction and Mental Health, Toronto, ON, M5T 1R8 Canada
- Department of Psychiatry, University of Toronto, Toronto, ON, M5T 1R8, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, M5T 1R8, Canada
- Kings College London, London, WC2R 2LS, UK
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
4
|
Eysenbach G, Hosokawa R, Itatani T, Fujita S. Predicting the Number of Suicides in Japan Using Internet Search Queries: Vector Autoregression Time Series Model. JMIR Public Health Surveill 2021; 7:e34016. [PMID: 34823225 PMCID: PMC8647973 DOI: 10.2196/34016] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 11/08/2021] [Accepted: 11/21/2021] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND The number of suicides in Japan increased during the COVID-19 pandemic. Predicting the number of suicides is important to take timely preventive measures. OBJECTIVE This study aims to clarify whether the number of suicides can be predicted by suicide-related search queries used before searching for the keyword "suicide." METHODS This study uses the infoveillance approach for suicide in Japan by search trends in search engines. The monthly number of suicides by gender, collected and published by the National Police Agency, was used as an outcome variable. The number of searches by gender with queries associated with "suicide" on "Yahoo! JAPAN Search" from January 2016 to December 2020 was used as a predictive variable. The following five phrases highly relevant to suicide were used as search terms before searching for the keyword "suicide" and extracted and used for analyses: "abuse"; "work, don't want to go"; "company, want to quit"; "divorce"; and "no money." The augmented Dickey-Fuller and Johansen tests were performed for the original series and to verify the existence of unit roots and cointegration for each variable, respectively. The vector autoregression model was applied to predict the number of suicides. The Breusch-Godfrey Lagrangian multiplier (BG-LM) test, autoregressive conditional heteroskedasticity Lagrangian multiplier (ARCH-LM) test, and Jarque-Bera (JB) test were used to confirm model convergence. In addition, a Granger causality test was performed for each predictive variable. RESULTS In the original series, unit roots were found in the trend model, whereas in the first-order difference series, both men (minimum tau 3: -9.24; max tau 3: -5.38) and women (minimum tau 3: -9.24; max tau 3: -5.38) had no unit roots for all variables. In the Johansen test, a cointegration relationship was observed among several variables. The queries used in the converged models were "divorce" for men (BG-LM test: P=.55; ARCH-LM test: P=.63; JB test: P=.66) and "no money" for women (BG-LM test: P=.17; ARCH-LM test: P=.15; JB test: P=.10). In the Granger causality test for each variable, "divorce" was significant for both men (F104=3.29; P=.04) and women (F104=3.23; P=.04). CONCLUSIONS The number of suicides can be predicted by search queries related to the keyword "suicide." Previous studies have reported that financial poverty and divorce are associated with suicide. The results of this study, in which search queries on "no money" and "divorce" predicted suicide, support the findings of previous studies. Further research on the economic poverty of women and those with complex problems is necessary.
Collapse
Affiliation(s)
| | - Rikuya Hosokawa
- Department of Human Health Sciences, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Tomoya Itatani
- Division of Nursing, Faculty of Health Science Institute of Medical, Pharmaceutical and Health Science, Kanazawa University, Kanazawa, Japan
| | | |
Collapse
|
5
|
Hardinghaus M, Nieland S. Assessing cyclists' routing preferences by analyzing extensive user setting data from a bike-routing engine. EUROPEAN TRANSPORT RESEARCH REVIEW 2021; 13:41. [PMID: 38624843 PMCID: PMC8314267 DOI: 10.1186/s12544-021-00499-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Accepted: 06/25/2021] [Indexed: 04/17/2024]
Abstract
Introduction Many municipalities aim to support the uptake of cycling as an environmentally friendly and healthy mode of transport. It is therefore crucial to meet the demand of cyclists when adapting road infrastructure. Previous studies researching cyclists' route choice behavior deliver valuable insights but are constrained by laboratory conditions, limitations in the number of observations, or the observation period or relay on specific use cases. Methods The present study analyzes a dataset of over 450,000 observations of cyclists' routing settings for the navigation of individual trips in Berlin, Germany. It therefore analyzes query data recorded in the bike-routing engine BBBike and clusters the many different user settings with regard to preferred route characteristics. Results and Conclusion Results condense the large number of routing settings into characteristic preference clusters. Compared with earlier findings, the big data approach highlights the significance of short routes, side streets and the importance of high-quality surfaces for routing choices, while cycling on dedicated facilities seems a little less important.Consequentially, providing separated cycle facilities along main roads - often the main focal point of cycle plans - should be put into the context of an integrated strategy which fulfills distinct preferences to achieve greater success. It is therefore particularly important to provide a cycle network in calm residential streets as well as catering for short, direct cycle routes.
Collapse
Affiliation(s)
- Michael Hardinghaus
- German Aerospace Center DLR, Institute of Transport Research, Rudower Chaussee 7, 12489 Berlin, Germany
- Department of Geography, Humboldt University of Berlin, Unter den Linden 6, 10099 Berlin, Germany
| | - Simon Nieland
- German Aerospace Center DLR, Institute of Transport Research, Rudower Chaussee 7, 12489 Berlin, Germany
| |
Collapse
|
6
|
Areán PA, Pratap A, Hsin H, Huppert TK, Hendricks KE, Heagerty PJ, Cohen T, Bagge C, Comtois KA. Perceived Utility and Characterization of Personal Google Search Histories to Detect Data Patterns Proximal to a Suicide Attempt in Individuals Who Previously Attempted Suicide: Pilot Cohort Study. J Med Internet Res 2021; 23:e27918. [PMID: 33955838 PMCID: PMC8138707 DOI: 10.2196/27918] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Revised: 03/26/2021] [Accepted: 04/02/2021] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND Despite decades of research to better understand suicide risk and to develop detection and prevention methods, suicide is still one of the leading causes of death globally. While large-scale studies using real-world evidence from electronic health records can identify who is at risk, they have not been successful at pinpointing when someone is at risk. Personalized social media and online search history data, by contrast, could provide an ongoing real-world datastream revealing internal thoughts and personal states of mind. OBJECTIVE We conducted this study to determine the feasibility and acceptability of using personalized online information-seeking behavior in the identification of risk for suicide attempts. METHODS This was a cohort survey study to assess attitudes of participants with a prior suicide attempt about using web search data for suicide prevention purposes, dates of lifetime suicide attempts, and an optional one-time download of their past web searches on Google. The study was conducted at the University of Washington School of Medicine Psychiatry Research Offices. The main outcomes were participants' opinions on internet search data for suicide prediction and intervention and any potential change in online information-seeking behavior proximal to a suicide attempt. Individualized nonparametric association analysis was used to assess the magnitude of difference in web search data features derived from time periods proximal (7, 15, 30, and 60 days) to the suicide attempts versus the typical (baseline) search behavior of participants. RESULTS A total of 62 participants who had attempted suicide in the past agreed to participate in the study. Internet search activity varied from person to person (median 2-24 searches per day). Changes in online search behavior proximal to suicide attempts were evident up to 60 days before attempt. For a subset of attempts (7/30, 23%) search features showed associations from 2 months to a week before the attempt. The top 3 search constructs associated with attempts were online searching patterns (9/30 attempts, 30%), semantic relatedness of search queries to suicide methods (7/30 attempts, 23%), and anger (7/30 attempts, 23%). Participants (40/59, 68%) indicated that use of this personalized web search data for prevention purposes was acceptable with noninvasive potential interventions such as connection to a real person (eg, friend, family member, or counselor); however, concerns were raised about detection accuracy, privacy, and the potential for overly invasive intervention. CONCLUSIONS Changes in online search behavior may be a useful and acceptable means of detecting suicide risk. Personalized analysis of online information-seeking behavior showed notable changes in search behavior and search terms that are tied to early warning signs of suicide and are evident 2 months to 7 days before a suicide attempt.
Collapse
Affiliation(s)
- Patricia A Areán
- Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, United States.,ALACRITY Center, University of Washington, Seattle, WA, United States
| | - Abhishek Pratap
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, United States.,Sage Bionetworks, Seattle, WA, United States
| | - Honor Hsin
- Kaiser Permanente, Northern California, CA, United States
| | - Tierney K Huppert
- Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, United States.,Center for Suicide Prevention and Research, University of Washington, Seattle, WA, United States
| | - Karin E Hendricks
- Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, United States.,Center for Suicide Prevention and Research, University of Washington, Seattle, WA, United States.,University of South Alabama, Mobile, AL, United States
| | - Patrick J Heagerty
- School of Public Health, University of Washington, Seattle, WA, United States
| | - Trevor Cohen
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, United States
| | - Courtney Bagge
- Department of Psychiatry, University of Michigan Medical Center, Ann Arbor, MI, United States.,VA Center for Clinical Management Research, VA Ann Arbor Healthcare System, Ann Arbor, MI, United States
| | - Katherine Anne Comtois
- Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, United States.,Center for Suicide Prevention and Research, University of Washington, Seattle, WA, United States
| |
Collapse
|
7
|
Measuring objective and subjective well-being: dimensions and data sources. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS 2020. [DOI: 10.1007/s41060-020-00224-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
AbstractWell-being is an important value for people’s lives, and it could be considered as an index of societal progress. Researchers have suggested two main approaches for the overall measurement of well-being, the objective and the subjective well-being. Both approaches, as well as their relevant dimensions, have been traditionally captured with surveys. During the last decades, new data sources have been suggested as an alternative or complement to traditional data. This paper aims to present the theoretical background of well-being, by distinguishing between objective and subjective approaches, their relevant dimensions, the new data sources used for their measurement and relevant studies. We also intend to shed light on still barely unexplored dimensions and data sources that could potentially contribute as a key for public policing and social development.
Collapse
|
8
|
Jimenez A, Santed-Germán MA, Ramos V. Google Searches and Suicide Rates in Spain, 2004-2013: Correlation Study. JMIR Public Health Surveill 2020; 6:e10919. [PMID: 32281540 PMCID: PMC7186868 DOI: 10.2196/10919] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2018] [Revised: 04/21/2019] [Accepted: 11/15/2019] [Indexed: 01/19/2023] Open
Abstract
Background Different studies have suggested that web search data are useful in forecasting several phenomena from the field of economics to epidemiology or health issues. Objective This study aimed to (1) evaluate the correlation between suicide rates released by the Spanish National Statistics Institute (INE) and internet search trends in Spain reported by Google Trends (GT) for 57 suicide-related terms representing major known risks of suicide and an analysis of these results using a linear regression model and (2) study the differential association between male and female suicide rates published by the INE and internet searches of these 57 terms. Methods The study period was from 2004 to 2013. In this study, suicide data were collected from (1) Spain’s INE and (2) local internet search data from GT, both from January 2004 to December 2013. We investigated and validated 57 suicide-related terms already tested in scientific studies before 2015 that would be the best predictors of new suicide cases. We then evaluated the nowcasting effects of a GT search through a cross-correlation analysis and by linear regression of the suicide incidence data with the GT data. Results Suicide rates in Spain in the study period were positively associated (r<-0.2) for the general population with the search volume for 7 terms and negatively for 1 from the 57 terms used in previous studies. Suicide rates for men were found to be significantly different than those of women. The search term, “allergy,” demonstrated a lead effect for new suicide cases (r=0.513; P=.001). The next significant correlating terms for those 57 studied were “antidepressant,” “alcohol abstinence,” “relationship breakup” (r=0.295, P=.001; r=0.295, P=.001; and r=0.268, P=.002, respectively). Significantly different results were obtained for men and women. Search terms that correlate with suicide rates of women are consistent with previous studies, showing that the incidence of depression is higher in women than in men, and showing different gender searching patterns. Conclusions A better understanding of internet search behavior of both men and women in relation to suicide and related topics may help design effective suicide prevention programs based on information provided by search robots and other big data sources.
Collapse
Affiliation(s)
- Alberto Jimenez
- Instituto de Salud Carlos III, ISCIII, Information and Communication Technologies Unit, Madrid, Spain
| | | | - Victoria Ramos
- Instituto de Salud Carlos III, ISCIII, Telemedicine and Health Research Unit, Madrid, Spain
| |
Collapse
|
9
|
Chen Y, He G, Chen B, Wang S, Ju G, Ge T. The association between PM2.5 exposure and suicidal ideation: a prefectural panel study. BMC Public Health 2020; 20:293. [PMID: 32138702 PMCID: PMC7059660 DOI: 10.1186/s12889-020-8409-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2019] [Accepted: 02/25/2020] [Indexed: 11/15/2022] Open
Abstract
Background Suicidal ideation is subject to serious underestimation among existing public health studies. While numerous factors have been recognized in affecting suicidal thoughts and behaviors (STB), the associated environmental risks have been poorly understood. Foremost among the various environment risks were air pollution, in particular, the PM2.5. The present study attempted to examine the relationship between PM2.5 level and local weekly index of suicidal ideation (ISI). Methods Using Internet search query volumes in Baidu (2017), the largest internet search engine in China, we constructed a prefectural panel data (278 prefectures, 52 weeks) and employed dynamic panel GMM system estimation to analyze the relationship between weekly concentration of PM2.5 (Mean = 87 μg·m− 3) and the index of suicidal ideation (Mean = 49.9). Results The results indicate that in the spring and winter, a 10 μg·m− 3 increase in the prior week’s PM2.5 in a Chinese city is significantly associated with 0.020 increase in ISI in spring and a 0.007 increase in ISI in winter, after taking account other co-pollutants and meteorological conditions. Conclusion We innovatively proposed the measure of suicidal ideation and provided suggestive evidence of a positive association between suicidal ideation and PM2.5 level.
Collapse
Affiliation(s)
- Yunsong Chen
- Johns Hopkins University-Nanjing University Center for Chinese and American Studies, Gulou District, Nanjing, 210093, China.
| | - Guangye He
- School of Social and Behavioral Sciences, Nanjing University, 163 Xianlin Road, Qixia District, Nanjing, 210023, China.
| | - Buwei Chen
- The First Affiliated Hospital with Nanjing Medical University, Nanjing, 210029, China.
| | - Senhu Wang
- The University of Cambridge, 16 Mill Lane, Cambridge, CB2 1SB, UK
| | - Guodong Ju
- School of Social and Behavioral Sciences, Nanjing University, 163 Xianlin Road, Qixia District, Nanjing, 210023, China
| | - Ting Ge
- School of Social and Behavioral Sciences, Nanjing University, 163 Xianlin Road, Qixia District, Nanjing, 210023, China
| |
Collapse
|
10
|
Traditional versus Facebook-based surveys: Evaluation of biases in self-reported demographic and psychometric information. DEMOGRAPHIC RESEARCH 2020. [DOI: 10.4054/demres.2020.42.5] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
|