1
|
Pokhriyal N, Letouzé E, Vosoughi S. Accurate intercensal estimates of energy access to track Sustainable Development Goal 7. EPJ Data Sci 2022; 11:60. [PMID: 36530792 PMCID: PMC9734985 DOI: 10.1140/epjds/s13688-022-00371-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/16/2022] [Accepted: 11/23/2022] [Indexed: 06/17/2023]
Abstract
UNLABELLED Intercensal estimates of access to electricity and clean cooking fuels at policy planning microregions in a country are essential for understanding their evolution and tracking progress towards Sustainable Development Goals (SDG) 7. Surveys are prohibitively expensive to get such intercensal microestimates. Existing works, mainly, focus on electrification rates, make predictions at the coarse spatial granularity, and generalize poorly to intercensal periods. Limited works focus on estimating clean cooking fuel access, which is one of the crucial indicators for measuring progress towards SDG 7. We propose a novel spatio-temporal multi-target Bayesian regression model that provides accurate intercensal microestimates for household electrification and clean cooking fuel access by combining multiple types of earth-observation data, census, and surveys. Our model's estimates are produced for Senegal for 2020 at policy planning microregions, and they explain 77% and 86% of variation in regional aggregates for electrification and clean fuels, respectively, when validated against the most recent survey. The diagnostic nature of our microestimates reveals a slow evolution and significant lack of clean cooking fuel access in both urban and rural areas in Senegal. It underscores the challenge of expanding energy access even in urban areas owing to their rapid population growth. Owing to the timeliness and accuracy of our microestimates, they can help plan interventions by local governments or track the attainment of SDGs when no ground-truth data are available. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1140/epjds/s13688-022-00371-5.
Collapse
Affiliation(s)
| | - Emmanuel Letouzé
- Data-Pop Alliance, New York, USA
- University Pompeu Fabra, Barcelona, Spain
| | | |
Collapse
|
2
|
Fatehkia M, del Villar Z, Koebe T, Letouzé E, Lozano A, Al Feel R, Mrad F, Weber I. Using Facebook advertising data to describe the socio-economic situation of Syrian refugees in Lebanon. Front Big Data 2022; 5:1033530. [DOI: 10.3389/fdata.2022.1033530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 11/11/2022] [Indexed: 12/05/2022] Open
Abstract
While the fighting in the Syrian civil war has mostly stopped, an estimated 5.6 million Syrians remain living in neighboring countries1. Of these, an estimated 1.5 million are sheltering in Lebanon. Ongoing efforts by organizations such as UNHCR to support the refugee population are often ineffective in reaching those most in need. According to UNHCR's 2019 Vulnerability Assessment of Syrian Refugees Report (VASyR), only 44% of the Syrian refugee families eligible for multipurpose cash assistance were provided with help, as the others were not captured in the data. In this project, we are investigating the use of non-traditional data, derived from Facebook advertising data, for population level vulnerability assessment. In a nutshell, Facebook provides advertisers with an estimate of how many of its users match certain targeting criteria, e.g., how many Facebook users currently living in Beirut are “living abroad,” aged 18–34, speak Arabic, and primarily use an iOS device. We evaluate the use of such audience estimates to describe the spatial variation in the socioeconomic situation of Syrian refugees across Lebanon. Using data from VASyR as ground truth, we find that iOS device usage explains 90% of the out-of-sample variance in poverty across the Lebanese governorates. However, evaluating predictions at a smaller spatial resolution also indicate limits related to sparsity, as Facebook, for privacy reasons, does not provide audience estimates for fewer than 1,000 users. Furthermore, comparing the population distribution by age and gender of Facebook users with that of the Syrian refugees from VASyR suggests an under-representation of Syrian women on the social media platform. This work adds to growing body of literature demonstrating the value of anonymous and aggregate Facebook advertising data for analysing large-scale humanitarian crises and migration events.
Collapse
|
3
|
Abstract
Georeferenced digital trace data offer unprecedented flexibility in migration estimation. Because of their high temporal granularity, many migration estimates can be generated from the same data set by changing the definition parameters. Yet despite the growing application of digital trace data to migration research, strategies for taking advantage of their temporal granularity remain largely underdeveloped. In this paper, we provide a general framework for converting digital trace data into estimates of migration transitions and for systematically analyzing their variation along a quasi-continuous time scale, analogous to a survival function. From migration theory, we develop two simple hypotheses regarding how we expect our estimated migration transition functions to behave. We then test our hypotheses on simulated data and empirical data from three platforms in two internal migration contexts: geotagged Tweets and Gowalla check-ins in the United States, and cell-phone call detail records in Senegal. Our results demonstrate the need for evaluating the internal consistency of migration estimates derived from digital trace data before using them in substantive research. At the same time, however, common patterns across our three empirical data sets point to an emergent research agenda using digital trace data to study the specific functional relationship between estimates of migration and time and how this relationship varies by geography and population characteristics.
Collapse
Affiliation(s)
- Lee Fiorio
- Department of Geography, University of Washington, Seattle, WA, USA
| | - Emilio Zagheni
- Max Planck Institute for Demographic Research, Rostock, Germany
| | - Guy Abel
- Asian Demographic Research Institute, Shanghai University, Shanghai, China.,Wittgenstein Centre (IIASA, VID/ÖAW, WU), International Institute for Applied Systems Analysis, Vienna, Austria
| | - Johnathan Hill
- Department of Geography, University of Washington, Seattle, WA, USA
| | | | - Emmanuel Letouzé
- Data-Pop Alliance, New York, NY, USA.,Massachusetts Institute of Technology Media Lab, Cambridge, MA, USA
| | - Jixuan Cai
- Department of Geography and Resource Management, The Chinese University of Hong Kong, Vienna, Austria.,Wittgenstein Centre (IIASA, VID/ÖAW, WU), International Institute for Applied Systems Analysis, Vienna, Austria
| |
Collapse
|
4
|
Oliver N, Lepri B, Sterly H, Lambiotte R, Deletaille S, De Nadai M, Letouzé E, Salah AA, Benjamins R, Cattuto C, Colizza V, de Cordes N, Fraiberger SP, Koebe T, Lehmann S, Murillo J, Pentland A, Pham PN, Pivetta F, Saramäki J, Scarpino SV, Tizzoni M, Verhulst S, Vinck P. Mobile phone data for informing public health actions across the COVID-19 pandemic life cycle. Sci Adv 2020; 6:eabc0764. [PMID: 32548274 PMCID: PMC7274807 DOI: 10.1126/sciadv.abc0764] [Citation(s) in RCA: 268] [Impact Index Per Article: 67.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Accepted: 04/23/2020] [Indexed: 05/19/2023]
Affiliation(s)
- Nuria Oliver
- ELLIS, the European Laboratory for Learning and Intelligent Systems, Alicante, Spain
- DataPop Alliance, New York, NY, USA
| | - Bruno Lepri
- DataPop Alliance, New York, NY, USA
- Fondazione Bruno Kessler, Trento, Italy
| | | | - Renaud Lambiotte
- University of Oxford, Oxford, UK
- The Alan Turing Institute, London, UK
| | | | | | - Emmanuel Letouzé
- DataPop Alliance, New York, NY, USA
- Open Algorithms (OPAL) collaborative project, New York, NY, USA
| | - Albert Ali Salah
- DataPop Alliance, New York, NY, USA
- Utrecht University, Utrecht, Netherlands
| | | | - Ciro Cattuto
- University of Turin, Turin, Italy
- Orange Group, Paris, France
| | - Vittoria Colizza
- INSERM, Sorbonne Université, Pierre Louis Institute of Epidemiology and Public Health, Paris, France
| | | | | | - Till Koebe
- DataPop Alliance, New York, NY, USA
- Freie University, Berlin, Germany
| | - Sune Lehmann
- Technical University of Denmark, Copenhagen, Denmark
| | | | - Alex Pentland
- Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Phuong N Pham
- DataPop Alliance, New York, NY, USA
- Harvard University, Cambridge, MA, USA
| | | | | | | | | | | | - Patrick Vinck
- DataPop Alliance, New York, NY, USA
- Harvard University, Cambridge, MA, USA
- Corresponding author.
| |
Collapse
|
5
|
|
6
|
Lepri B, Staiano J, Sangokoya D, Letouzé E, Oliver N. The Tyranny of Data? The Bright and Dark Sides of Data-Driven Decision-Making for Social Good. Studies in Big Data 2017. [DOI: 10.1007/978-3-319-54024-5_1] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|
7
|
Bogomolov A, Lepri B, Staiano J, Letouzé E, Oliver N, Pianesi F, Pentland A. Moves on the Street: Classifying Crime Hotspots Using Aggregated Anonymized Data on People Dynamics. Big Data 2015; 3:148-158. [PMID: 27442957 DOI: 10.1089/big.2014.0054] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
The wealth of information provided by real-time streams of data has paved the way for life-changing technological advancements, improving the quality of life of people in many ways, from facilitating knowledge exchange to self-understanding and self-monitoring. Moreover, the analysis of anonymized and aggregated large-scale human behavioral data offers new possibilities to understand global patterns of human behavior and helps decision makers tackle problems of societal importance. In this article, we highlight the potential societal benefits derived from big data applications with a focus on citizen safety and crime prevention. First, we introduce the emergent new research area of big data for social good. Next, we detail a case study tackling the problem of crime hotspot classification, that is, the classification of which areas in a city are more likely to witness crimes based on past data. In the proposed approach we use demographic information along with human mobility characteristics as derived from anonymized and aggregated mobile network data. The hypothesis that aggregated human behavioral data captured from the mobile network infrastructure, in combination with basic demographic information, can be used to predict crime is supported by our findings. Our models, built on and evaluated against real crime data from London, obtain accuracy of almost 70% when classifying whether a specific area in the city will be a crime hotspot or not in the following month.
Collapse
Affiliation(s)
| | | | | | - Emmanuel Letouzé
- 4 University of California-Berkeley , Berkeley, California
- 5 Data-Pop Alliance , New York, New York
| | | | | | | |
Collapse
|
8
|
Bertoin F, Letouzé E, Grignani P, Patey M, Rossignol S, Libé R, Pasqual C, Lardière-Deguelte S, Hoeffel-Fornes C, Gaillard D, Previderè C, Delemer B, Lalli E. Genome-wide paternal uniparental disomy as a cause of Beckwith-Wiedemann syndrome associated with recurrent virilizing adrenocortical tumors. Horm Metab Res 2015; 47:497-503. [PMID: 25365508 DOI: 10.1055/s-0034-1394371] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Beckwith-Wiedemann syndrome (BWS) is an overgrowth syndrome characterized by fetal macrosomia, macroglossia, and abdominal wall defects. BWS patients are at risk to develop Wilms tumor, neuroblastoma, hepatoblastoma, and adrenal tumors. A young woman with BWS features, but with inconclusive genetic evidence for the disease, came to clinical observation for signs of virilization at the age of 16 years. An adrenocortical tumor was diagnosed and surgically resected. The tumor underwent 2 local relapses that were also surgically treated. The patient was also operated to remove a breast fibroadenoma. SNP arrays were used to analyze chromosome abnormalities in normal and tumor samples from the patient and her parents. The patient presented genome-wide mosaic paternal uniparental disomy (patUPD) both in the adrenocortical and the breast tumors, with different degrees of loss of heterozygosity (LOH). The more recent relapses of the adrenocortical tumor showed a loss of part of chromosome 17p that was absent in the first tumor. Analysis of a skin biopsy sample also showed mosaic patUPD with partial LOH, while no LOH was detected in leukocyte DNA. This case shows that virilizing adrenocortical tumors may be a clinical feature of patients with BWS. The SNP array technology is useful to diagnose genome-wide patUPD mosaicism in BWS patients with an inconclusive molecular diagnosis and underlines the tumorigenic potential of the absence of the maternal genome combined with an excess of the paternal genome.
Collapse
Affiliation(s)
- F Bertoin
- Department of Endocrinology, CHU Reims, Hôpital Robert Debré, Reims, France
| | - E Letouzé
- Programme Cartes d'Identité des Tumeurs, Ligue Nationale contre le Cancer, Paris, France
| | - P Grignani
- Department of Public Health, Experimental and Forensic Medicine, University of Pavia, Pavia, Italy
| | - M Patey
- Department of Pathology, CHU Reims, Hôpital Robert Debré, Reims, France
| | - S Rossignol
- Laboratory of Endocrine Functional Explorations, APHP, Hôpital Armand Trousseau, INSERM UMRS 938 Team 4, Université Pierre et Marie Curie-Paris, Faculté de Médecine, Paris, France
| | - R Libé
- Department of Endocrinology-Metabolism-Cancer, APHP, Institut Cochin, Université Paris V-René Descartes, Paris, France
| | - C Pasqual
- Service of Diabetology, Hospital of Troyes, Troyes, France
| | - S Lardière-Deguelte
- Department of Endocrine Surgery, CHU Reims, Hôpital Robert Debré, Reims, France
| | - C Hoeffel-Fornes
- Department of Radiology, CHU Reims, Hôpital Robert Debré, Reims, France
| | - D Gaillard
- Department of Genetics, CHU Reims, Hôpital Robert Debré, Reims, France
| | - C Previderè
- Department of Public Health, Experimental and Forensic Medicine, University of Pavia, Pavia, Italy
| | - B Delemer
- Department of Endocrinology, CHU Reims, Hôpital Robert Debré, Reims, France
| | - E Lalli
- Institut de Pharmacologie Moléculaire et Cellulaire, CNRS UMR7275, Sophia Antipolis-Valbonne, France
| |
Collapse
|