1
|
Dovers E, Brooks W, Popovic GC, Warton DI. Fast, approximate maximum likelihood estimation of log-Gaussian Cox processes. J Comput Graph Stat 2023. [DOI: 10.1080/10618600.2023.2182311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2023]
Affiliation(s)
- Elliot Dovers
- School of Mathematics and Statistics and Evolution & Ecology Research Centre, University of New South Wales
| | - Wesley Brooks
- School of Mathematics and Statistics and Evolution & Ecology Research Centre, University of New South Wales
| | - Gordana C. Popovic
- School of Mathematics and Statistics and Evolution & Ecology Research Centre, University of New South Wales
| | - David I. Warton
- School of Mathematics and Statistics and Evolution & Ecology Research Centre, University of New South Wales
| |
Collapse
|
2
|
Semenova E, Xu Y, Howes A, Rashid T, Bhatt S, Mishra S, Flaxman S. PriorVAE: encoding spatial priors with variational autoencoders for small-area estimation. J R Soc Interface 2022; 19:20220094. [PMID: 35673858 PMCID: PMC9174721 DOI: 10.1098/rsif.2022.0094] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Accepted: 05/12/2022] [Indexed: 01/31/2023] Open
Abstract
Gaussian processes (GPs), implemented through multivariate Gaussian distributions for a finite collection of data, are the most popular approach in small-area spatial statistical modelling. In this context, they are used to encode correlation structures over space and can generalize well in interpolation tasks. Despite their flexibility, off-the-shelf GPs present serious computational challenges which limit their scalability and practical usefulness in applied settings. Here, we propose a novel, deep generative modelling approach to tackle this challenge, termed PriorVAE: for a particular spatial setting, we approximate a class of GP priors through prior sampling and subsequent fitting of a variational autoencoder (VAE). Given a trained VAE, the resultant decoder allows spatial inference to become incredibly efficient due to the low dimensional, independently distributed latent Gaussian space representation of the VAE. Once trained, inference using the VAE decoder replaces the GP within a Bayesian sampling framework. This approach provides tractable and easy-to-implement means of approximately encoding spatial priors and facilitates efficient statistical inference. We demonstrate the utility of our VAE two-stage approach on Bayesian, small-area estimation tasks.
Collapse
Affiliation(s)
| | - Yidan Xu
- University of Michigan, Ann Arbor, MI, USA
| | | | | | - Samir Bhatt
- Imperial College London, London, UK
- University of Copenhagen, Kobenhavn, Denmark
| | - Swapnil Mishra
- Imperial College London, London, UK
- University of Copenhagen, Kobenhavn, Denmark
| | | |
Collapse
|
3
|
Nightingale G, Laxton M, Illian JB. How does the community COVID-19 level of risk impact on that of a care home? PLoS One 2022; 16:e0260051. [PMID: 34972103 PMCID: PMC8719703 DOI: 10.1371/journal.pone.0260051] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Accepted: 10/29/2021] [Indexed: 11/21/2022] Open
Abstract
Objectives To model the risk of COVID-19 mortality in British care homes conditional on the community level risk. Methods A two stage modeling process (“doubly latent”) which includes a Besag York Mollie model (BYM) and a Log Gaussian Cox Process. The BYM is adopted so as to estimate the community level risks. These are incorporated in the Log Gaussian Cox Process to estimate the impact of these risks on that in care homes. Results For an increase in the risk at the community level, the number of COVID-19 related deaths in the associated care home would be increased by exp (0.833), 2. This is based on a simulated dataset. In the context of COVID-19 related deaths, this study has illustrated the estimation of the risk to care homes in the presence of background community risk. This approach will be useful in facilitating the identification of the most vulnerable care homes and in predicting risk to new care homes. Conclusions The modeling of two latent processes have been shown to be successfully facilitated by the use of the BYM and Log Gaussian Cox Process Models. Community COVID-19 risks impact on that of the care homes embedded in these communities.
Collapse
Affiliation(s)
- Glenna Nightingale
- School of Health in Social Sciences, University of Edinburgh, Edinburgh, United Kingdom
- * E-mail:
| | - Megan Laxton
- School of Mathematics and Statistics, University of Glasgow, Glasgow, United Kingdom
| | - Janine B. Illian
- School of Mathematics and Statistics, University of Glasgow, Glasgow, United Kingdom
| |
Collapse
|
4
|
Lucas TCD, Nandi AK, Chestnutt EG, Twohig KA, Keddie SH, Collins EL, Howes RE, Nguyen M, Rumisha SF, Python A, Arambepola R, Bertozzi‐Villa A, Hancock P, Amratia P, Battle KE, Cameron E, Gething PW, Weiss DJ. Mapping malaria by sharing spatial information between incidence and prevalence data sets. J R Stat Soc Ser C Appl Stat 2021. [DOI: 10.1111/rssc.12484] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
| | | | | | | | | | | | | | | | | | - Andre Python
- Big Data Institute University of Oxford Oxford UK
| | | | - Amelia Bertozzi‐Villa
- Big Data Institute University of Oxford Oxford UK
- Institute for Disease Modeling Bellevue Washington USA
| | | | | | | | - Ewan Cameron
- Big Data Institute University of Oxford Oxford UK
| | - Peter W. Gething
- Big Data Institute University of Oxford Oxford UK
- Telethon Kids Institute Perth Children's Hospital Perth Australia
- Curtin University Perth Australia
| | | |
Collapse
|
5
|
Johnson O, Gatheral T, Knight J, Giorgi E. A modelling framework for developing early warning systems of COPD emergency admissions. Spat Spatiotemporal Epidemiol 2021; 36:100392. [PMID: 33509425 DOI: 10.1016/j.sste.2020.100392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Revised: 10/22/2020] [Accepted: 11/06/2020] [Indexed: 11/26/2022]
Abstract
Chronic Obstructive Pulmonary Disease (COPD) is one of the leading causes of mortality worldwide and is a major contributor to the number of emergency admissions in the UK. We introduce a modelling framework for the development of early warning systems for COPD emergency admissions. We analyse the number of COPD emergency admissions using a Poisson generalised linear mixed model. We group risk factors into three main groups, namely pollution, weather and deprivation. We then carry out variable selection within each of the three domains of COPD risk. Based on a threshold of incidence rate, we then identify the model giving the highest sensitivity and specificity through the use of exceedance probabilities. The developed modelling framework provides a principled likelihood-based approach for detecting the exceedance of thresholds in COPD emergency admissions. Our results indicate that socio-economic risk factors are key to enhance the predictive power of the model.
Collapse
Affiliation(s)
- Olatunji Johnson
- CHICAS Research Group, Lancaster Medical School, Lancaster University, Bailrigg, Lancaster, UK.
| | - Tim Gatheral
- Respiratory Medicine, Royal Lancaster Infirmary, Lancaster, UK
| | - Jo Knight
- CHICAS Research Group, Lancaster Medical School, Lancaster University, Bailrigg, Lancaster, UK
| | - Emanuele Giorgi
- CHICAS Research Group, Lancaster Medical School, Lancaster University, Bailrigg, Lancaster, UK
| |
Collapse
|
6
|
Lucas TC, Nandi AK, Keddie SH, Chestnutt EG, Howes RE, Rumisha SF, Arambepola R, Bertozzi-Villa A, Python A, Symons TL, Millar JJ, Amratia P, Hancock P, Battle KE, Cameron E, Gething PW, Weiss DJ. Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence. Spat Spatiotemporal Epidemiol 2020; 41:100357. [PMID: 35691633 PMCID: PMC9205339 DOI: 10.1016/j.sste.2020.100357] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Revised: 04/13/2020] [Accepted: 06/18/2020] [Indexed: 10/24/2022]
|
7
|
Johnson O, Diggle P, Giorgi E. Dealing with spatial misalignment to model the relationship between deprivation and life expectancy: a model-based geostatistical approach. Int J Health Geogr 2020; 19:6. [PMID: 32131836 PMCID: PMC7057663 DOI: 10.1186/s12942-020-00200-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2019] [Accepted: 02/21/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND : Life expectancy at birth (LEB), one of the main indicators of human longevity, has often been used to characterise the health status of a population. Understanding its relationships with the deprivation is key to develop policies and evaluate interventions that are aimed at reducing health inequalities. However, methodological challenges in the analysis of LEB data arise from the fact that different Government agencies often provide spatially aggregated information on LEB and the index of multiple deprivation (IMD) at different spatial scales. Our objective is to develop a geostatistical framework that, unlike existing methods of inference, allows to carry out spatially continuous prediction while dealing with spatial misalignment of the areal-level data. METHODS : We developed a model-based geostatistical approach for the joint analysis of LEB and IMD, when these are available over different partitions of the study region. We model the spatial correlation in LEB and IMD across the areal units using inter-point distances based on a regular grid covering the whole of the study area. The advantages and strengths of the new methodology are illustrated through an analysis of LEB and IMD data from the Liverpool district council. RESULTS : We found that the effect of IMD on LEB is stronger in males than in females, explaining about 63.35% of the spatial variation in LEB in the former group and 38.92% in the latter. We also estimate that LEB is about 8.5 years lower between the most and least deprived area of Liverpool for men, and 7.1 years for women. Finally, we find that LEB, both in males and females, is at least 80% likely to be above the England wide average only in some areas falling in the electoral wards of Childwall, Woolton and Church. CONCLUSION : The novel model-based geostatistical framework provides a feasible solution to the spatial misalignment problem. More importantly, the proposed methodology has the following advantages over existing methods used model LEB: (1) it can deliver spatially continuous inferences using spatially aggregated data; (2) it can be applied to any form of misalignment with information provided at a range of spatial scales, from areal-level to pixel-level.
Collapse
Affiliation(s)
- Olatunji Johnson
- CHICAS Research Group, Lancaster Medical School, Lancaster University, Bailrigg, Lancaster, UK
| | - Peter Diggle
- CHICAS Research Group, Lancaster Medical School, Lancaster University, Bailrigg, Lancaster, UK
| | - Emanuele Giorgi
- CHICAS Research Group, Lancaster Medical School, Lancaster University, Bailrigg, Lancaster, UK.
| |
Collapse
|
8
|
Johnson O, Diggle P, Giorgi E. A spatially discrete approximation to log-Gaussian Cox processes for modelling aggregated disease count data. Stat Med 2019; 38:4871-4887. [PMID: 31452235 DOI: 10.1002/sim.8339] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Revised: 07/16/2019] [Accepted: 07/18/2019] [Indexed: 11/08/2022]
Abstract
In this paper, we develop a computationally efficient discrete approximation to log-Gaussian Cox process (LGCP) models for the analysis of spatially aggregated disease count data. Our approach overcomes an inherent limitation of spatial models based on Markov structures, namely, that each such model is tied to a specific partition of the study area, and allows for spatially continuous prediction. We compare the predictive performance of our modelling approach with LGCP through a simulation study and an application to primary biliary cirrhosis incidence data in Newcastle upon Tyne, UK. Our results suggest that, when disease risk is assumed to be a spatially continuous process, the proposed approximation to LGCP provides reliable estimates of disease risk both on spatially continuous and aggregated scales. The proposed methodology is implemented in the open-source R package SDALGCP.
Collapse
Affiliation(s)
- Olatunji Johnson
- CHICAS, Lancaster Medical School, Lancaster University, Lancaster, UK
| | - Peter Diggle
- CHICAS, Lancaster Medical School, Lancaster University, Lancaster, UK
| | - Emanuele Giorgi
- CHICAS, Lancaster Medical School, Lancaster University, Lancaster, UK
| |
Collapse
|