1
|
John P, Varga C, Cooke M, Majowicz SE. Temporal, spatial and space-time distribution of infections caused by five major enteric pathogens, Ontario, Canada, 2010-2017. Zoonoses Public Health 2024; 71:178-190. [PMID: 37990481 DOI: 10.1111/zph.13096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 10/15/2023] [Accepted: 11/01/2023] [Indexed: 11/23/2023]
Abstract
AIMS In Canada, enteric diseases pose substantial health and economic burdens. The distribution of these diseases is uneven across both geography and time and understanding these patterns is therefore important for the prevention of future outbreaks. We evaluated temporal, spatial and space-time clustering of laboratory-confirmed cases of Campylobacter spp. (n = 28,728), non-typhoidal Salmonella spp. (n = 22,640), Shiga toxin-producing Escherichia coli (STEC; n = 1340), Yersinia spp. (n = 1674) and Listeria monocytogenes (n = 471) infections, reported between 2010 and 2017 inclusive in Ontario, the most populous province in Canada (population ~ 13,500,000 in 2016). METHODS AND RESULTS For each enteric pathogen, we calculated the mean incidence rates (IRs) for Ontario's 35 public health unit (PHU) areas and visualized them using choropleth maps. We identified temporal, spatial and space-time high infection rate clusters using retrospective Poisson scan statistics. Campylobacter and Salmonella infections had the highest IRs, while Listeria infections had the lowest. Campylobacter, Salmonella, STEC and Listeria mostly clustered temporally in the spring/summer and sometimes extended into fall, while Yersinia showed a less clear seasonal pattern. The IR visualizations and spatial and space-time scan statistics showed geographic heterogeneity of infection rates with high infection rate clusters detected mainly in PHUs across the southwestern and central-western regions of Ontario for Campylobacter, Salmonella and STEC infections, and mainly in PHUs located in the central-eastern regions for Yersinia and Listeria. A high proportion of cases in some of the significant Salmonella, STEC and Listeria infection clusters were linked to disease outbreaks. CONCLUSIONS Results from this study will inform heightened public health surveillance, and prevention and control programmes, in populations and regions of high infection rates. Further research is needed to determine the pathogen-specific socioeconomic, environmental and agricultural risk factors that may be related to the temporal and spatial disease patterns we observed in our study.
Collapse
Affiliation(s)
- Patience John
- School of Public Health Sciences, University of Waterloo, Waterloo, Ontario, Canada
| | - Csaba Varga
- School of Public Health Sciences, University of Waterloo, Waterloo, Ontario, Canada
- Department of Pathobiology, College of Veterinary Medicine, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Martin Cooke
- School of Public Health Sciences, University of Waterloo, Waterloo, Ontario, Canada
- Department of Sociology and Legal Studies, University of Waterloo, Waterloo, Ontario, Canada
| | - Shannon E Majowicz
- School of Public Health Sciences, University of Waterloo, Waterloo, Ontario, Canada
| |
Collapse
|
2
|
Russo M, Wang SV. An open-source implementation of tree-based scan statistics. Pharmacoepidemiol Drug Saf 2024; 33:e5765. [PMID: 38453354 DOI: 10.1002/pds.5765] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2023] [Revised: 01/20/2024] [Accepted: 01/22/2024] [Indexed: 03/09/2024]
Abstract
PURPOSE We develop an open-source R package to implement tree-based scan statistics (TBSS) analyses. METHODS TBSS are data mining methods used by the United States Food and Drug Administration and the Centers for Disease Control. They simultaneously screen thousands of hierarchically aggregated outcomes to identify unsuspected adverse effects of drugs or vaccines, accounting for multiple comparisons. The general structure of TBSS is highly adaptable, with four essential components: (1) a hierarchical outcome structure, (2) a test statistic to be computed for each element of the hierarchy, (3) an algorithm to generate data replicates under a null distribution, and (4) observed outcomes at the lower level of the hierarchy. We encode the general TBSS framework in a convenient R package that offers user-friendly functions for the most used TBSS methods. To illustrate the performance of our software, we evaluated two examples of archetypical TBSS analyses previously analyzed using proprietary, closed-source TreeScan™ software. The first considers the risk of congenital malformations associated with first-trimester exposure to valproate, and the second compares exposure to newly prescribed canagliflozin with a dipeptidyl peptidase 4 inhibitor in adults affected by type 2 diabetes. RESULTS The results of the original studies are replicated. CONCLUSIONS The diffusion of an open-source implementation of TBSS can enhance innovation of TBSS methods and foster collaborations. We offer an intuitive R package implementing standard TBSS methods with accompanying tutorials. Our unified object-oriented implementation allows expert users to extend the framework, introduce new features, or enhance existing ones.
Collapse
Affiliation(s)
- Massimiliano Russo
- Department of Statistics, The Ohio State University, Columbus, Ohio, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women's Hospital, Boston, Massachusetts, USA
| | - Shirley V Wang
- Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women's Hospital, Boston, Massachusetts, USA
| |
Collapse
|
3
|
Lu FW, Conway E, Liang YL, Chen YY, Gunnell D, Chang SS. Space-time self-harm and suicide clusters in two cities in Taiwan. Epidemiol Psychiatr Sci 2023; 32:e37. [PMID: 37258458 DOI: 10.1017/s2045796023000513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 06/02/2023] Open
Abstract
AIMS Suicidal acts may cluster in time and space and lead to community concerns about further imitative suicidal episodes. Although suicide clusters have been researched in previous studies, less is known about the clustering of non-fatal suicidal behaviour (self-harm). Furthermore, most previous studies used crude temporal and spatial information, e.g., numbers aggregated by month and residence area, for cluster detection analysis. This study aimed to (i) identify space-time clusters of self-harm and suicide using daily incidence data and exact address and (ii) investigate the characteristics of cluster-related suicidal acts. METHODS Data on emergency department presentations for self-harm and suicide deaths in Taipei City and New Taipei City, Taiwan, were used in this study. In all-age and age-specific analyses, self-harm and suicide clusters were identified using space-time permutation scan statistics. A cut-off of 0.10 for the p value was used to identify possible clusters. Logistic regression was used to investigate the characteristics associated with cluster-related episodes. RESULTS A total of 5,291 self-harm episodes and 1,406 suicides in Taipei City (2004-2006) and 20,531 self-harm episodes and 2,329 suicides in New Taipei City (2012-2016) were included in the analysis. In the two cities, two self-harm clusters (n [number of self-harm episodes or suicide deaths in the cluster] = 4 and 8 in Taipei City), four suicide clusters (n = 3 in Taipei City and n = 4, 11 and 4 in New Taipei City) and two self-harm and suicide combined clusters (n = 4 in Taipei City and n = 8 in New Taipei City) were identified. Space-time clusters of self-harm, suicide, and self-harm and suicide combined accounted for 0.05%, 0.59%, and 0.08% of the respective groups of suicidal acts. Cluster-related episodes of self-harm and suicide were more likely to be male (adjusted odds ratio [aOR] = 2.22, 95% confidence interval [CI] 1.26, 3.89) and young people aged 10-29 years (aOR = 2.72, 95% CI 1.43, 5.21) than their cluster-unrelated counterparts. CONCLUSIONS Space-time clusters of self-harm, suicide, and self-harm and suicide combined accounted for a relatively small proportion of suicidal acts and were associated with some sex/age characteristics. Focusing on suicide deaths alone may underestimate the size of some clusters and/or lead to some clusters being overlooked. Future research could consider combining self-harm and suicide data and use social connection information to investigate possible clusters of suicidal acts.
Collapse
Affiliation(s)
- Fang-Wen Lu
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
- Institute of Health Behaviors and Community Sciences, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Erica Conway
- Institute of Health Behaviors and Community Sciences, College of Public Health, National Taiwan University, Taipei, Taiwan
- Global Health Program, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Ya-Lun Liang
- Institute of Health Behaviors and Community Sciences, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Ying-Yeh Chen
- Taipei City Psychiatric Centre, Taipei City Hospital, Taipei, Taiwan
- Institute of Public Health and Department of Public Health, National Yang-Ming Chiao Tung University, Taipei, Taiwan
| | - David Gunnell
- Centre for Academic Mental Health, Population Health Sciences, University of Bristol, Bristol, UK
- National Institute of Health Research Biomedical Research Centre, University Hospitals Bristol and Weston National Health Service Foundation Trust, Bristol, UK
| | - Shu-Sen Chang
- Institute of Health Behaviors and Community Sciences, College of Public Health, National Taiwan University, Taipei, Taiwan
- Global Health Program, College of Public Health, National Taiwan University, Taipei, Taiwan
- Psychiatric Research Center, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan
| |
Collapse
|
4
|
Freitas LP, Lowe R, Koepp AE, Alves SV, Dondero M, Marteleto LJ. Identifying hidden Zika hotspots in Pernambuco, Brazil: a spatial analysis. Trans R Soc Trop Med Hyg 2023; 117:189-196. [PMID: 36326785 PMCID: PMC9977212 DOI: 10.1093/trstmh/trac099] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Revised: 05/23/2022] [Accepted: 10/17/2022] [Indexed: 11/06/2022] Open
Abstract
BACKGROUND Northeast Brazil has the world's highest rate of Zika-related microcephaly. However, Zika case counts cannot accurately describe burden because mandatory reporting was only established when the epidemic was declining in the region. METHODS To advance the study of the Zika epidemic, we identified hotspots of Zika in Pernambuco state, Northeast Brazil, using Aedes-borne diseases (dengue, chikungunya and Zika) and microcephaly data. We used Kulldorff's Poisson purely spatial scan statistic to detect low- and high-risk clusters for Aedes-borne diseases (2014-2017) and for microcephaly (2015-2017), separately. Municipalities were classified according to a proposed gradient of Zika burden during the epidemic, based on the combination of cluster status in each analysis and considering the strength of the evidence. RESULTS We identified 26 Aedes-borne diseases clusters (11 high-risk) and 5 microcephaly clusters (3 high-risk) in Pernambuco. According to the proposed Zika burden gradient, our results indicate that the northeast of Pernambuco and the Sertão region were hit hardest by the Zika epidemic. The first is the most populous area of Pernambuco, while the second has one of the highest rates of social and economic inequality in Brazil. CONCLUSION We successfully identified possible hidden Zika hotspots using a simple methodology combining Aedes-borne diseases and microcephaly information.
Collapse
Affiliation(s)
- Laís Picinini Freitas
- Population Research Center, University of Texas at Austin, Austin, Texas 78712-1699, USA
| | - Rachel Lowe
- Department of Earth Sciences, Barcelona Supercomputing Center (BSC), Barcelona 08034, Spain
- Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain
- Centre on Climate Change & Planetary Health and Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, London WC1E 7HT, UK
| | - Andrew E Koepp
- Population Research Center, University of Texas at Austin, Austin, Texas 78712-1699, USA
- Department of Human Development and Family Sciences, University of Texas at Austin, Austin, Texas 78712, USA
| | - Sandra Valongueiro Alves
- Post-graduation Program of Public Health, Centro de Ciências Médicas, Universidade Federal de Pernambuco, Recife, Pernambuco 50670-901, Brazil
| | - Molly Dondero
- Department of Sociology, American University, Washington, D.C. 20016-8072, USA
| | - Letícia J Marteleto
- Population Research Center, University of Texas at Austin, Austin, Texas 78712-1699, USA
- Department of Sociology, University of Texas at Austin, Austin, Texas 78712, USA
| |
Collapse
|
5
|
Lin PS. Identification of geographic clusters for temporal heterogeneity with application to dengue surveillance. Stat Med 2022; 41:146-162. [PMID: 34964513 PMCID: PMC9298438 DOI: 10.1002/sim.9227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 07/30/2021] [Accepted: 09/20/2021] [Indexed: 11/05/2022]
Abstract
Identifying transmission of hot spots with temporal trends is important for reducing infectious disease propagation. Cluster analysis is a particularly useful tool to explore underlying stochastic processes between observations by grouping items into categories by their similarity. In a study of epidemic propagation, clustering geographic regions that have similar time series could help researchers track diffusion routes from a common source of an infectious disease. In this article, we propose a two‐stage scan statistic to classify regions into various geographic clusters by their temporal heterogeneity. The proposed scan statistic is more flexible than traditional methods in that contiguous and nonproximate regions with similar temporal patterns can be identified simultaneously. A simulation study and data analysis for a dengue fever infection are also presented for illustration.
Collapse
Affiliation(s)
- Pei-Sheng Lin
- Institute of Population Health Sciences, National Health Research Institutes, Miaoli, Taiwan.,Department of Mathematics, National Chung Cheng University, Minxiong, Taiwan
| |
Collapse
|
6
|
Islam A, Sayeed MA, Rahman MK, Ferdous J, Islam S, Hassan MM. Geospatial dynamics of COVID-19 clusters and hotspots in Bangladesh. Transbound Emerg Dis 2021; 68:3643-3657. [PMID: 33386654 DOI: 10.1111/tbed.13973] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Revised: 12/24/2020] [Accepted: 12/30/2020] [Indexed: 12/20/2022]
Abstract
The coronavirus disease 2019 (COVID-19) is an emerging and rapidly evolving profound pandemic, which causes severe acute respiratory syndrome and results in significant case fatality around the world including Bangladesh. We conducted this study to assess how COVID-19 cases clustered across districts in Bangladesh and whether the pattern and duration of clusters changed following the country's containment strategy using Geographic information system (GIS) software. We calculated the epidemiological measures including incidence, case fatality rate (CFR) and spatiotemporal pattern of COVID-19. We used inverse distance weighting (IDW), Geographically weighted regression (GWR), Moran's I and Getis-Ord Gi* statistics for prediction, spatial autocorrelation and hotspot identification. We used retrospective space-time scan statistic to analyse clusters of COVID-19 cases. COVID-19 has a CFR of 1.4%. Over 50% of cases were reported among young adults (21-40 years age). The incidence varies from 0.03 - 0.95 at the end of March to 15.59-308.62 per 100,000, at the end of July. Global Moran's Index indicates a robust spatial autocorrelation of COVID-19 cases. Local Moran's I analysis stated a distinct High-High (HH) clustering of COVID-19 cases among Dhaka, Gazipur and Narayanganj districts. Twelve statistically significant high rated clusters were identified by space-time scan statistics using a discrete Poisson model. IDW predicted the cases at the undetermined area, and GWR showed a strong relationship between population density and case frequency, which was further established with Moran's I (0.734; p ≤ 0.01). Dhaka and its surrounding six districts were identified as the significant hotspot whereas Chattogram was an extended infected area, indicating the gradual spread of the virus to peripheral districts. This study provides novel insights into the geostatistical analysis of COVID-19 clusters and hotspots that might assist the policy planner to predict the spatiotemporal transmission dynamics and formulate imperative control strategies of SARS-CoV-2 in Bangladesh. The geospatial modeling tools can be used to prevent and control future epidemics and pandemics.
Collapse
Affiliation(s)
- Ariful Islam
- School of Life and Environmental Science, Centre for Integrative Ecology, Deakin University, Vic., Australia.,EcoHealth Alliance, New York City, NY, USA
| | - Md Abu Sayeed
- EcoHealth Alliance, New York City, NY, USA.,Department of Medicine, Jhenaidah Government Veterinary College, Jhenaidah, Bangladesh
| | - Md Kaisar Rahman
- EcoHealth Alliance, New York City, NY, USA.,Faculty of Veterinary Medicine, Chattogram Veterinary and Animal Sciences University, Chattogram, Bangladesh
| | | | - Shariful Islam
- EcoHealth Alliance, New York City, NY, USA.,Bangladesh Livestock Research Institute, Dhaka, Savar, Bangladesh
| | - Mohammad Mahmudul Hassan
- Faculty of Veterinary Medicine, Chattogram Veterinary and Animal Sciences University, Chattogram, Bangladesh
| |
Collapse
|
7
|
Wang S, Fan J, Pocock G, Arena ET, Eliceiri KW, Yuan M. STRUCTURED CORRELATION DETECTION WITH APPLICATION TO COLOCALIZATION ANALYSIS IN DUAL-CHANNEL FLUORESCENCE MICROSCOPIC IMAGING. Stat Sin 2021; 31:333-360. [PMID: 35046630 PMCID: PMC8765712 DOI: 10.5705/ss.202018.0230] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2023]
Abstract
Current workflows for colocalization analysis in fluorescence microscopic imaging introduce significant bias in terms of the user's choice of region of interest (ROI). In this work, we introduce an automatic, unbiased structured detection method for correlated region detection between two random processes observed on a common domain. We argue that although intuitive, using the maximum log-likelihood statistic directly suffers from potential bias and substantially reduced power. Therefore, we introduce a simple size-based normalization to overcome this problem. We show that scanning using the proposed statistic leads to optimal correlated region detection over a large collection of structured correlation detection problems.
Collapse
|
8
|
Li Z, Li X, Liu Y, Shen J, Chen H, Zhou H, Morrison AC, Boerwinkle E, Lin X. Dynamic Scan Procedure for Detecting Rare-Variant Association Regions in Whole-Genome Sequencing Studies. Am J Hum Genet 2019; 104:802-814. [PMID: 30982610 PMCID: PMC6507043 DOI: 10.1016/j.ajhg.2019.03.002] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2018] [Accepted: 03/01/2019] [Indexed: 11/19/2022] Open
Abstract
Whole-genome sequencing (WGS) studies are being widely conducted in order to identify rare variants associated with human diseases and disease-related traits. Classical single-marker association analyses for rare variants have limited power, and variant-set-based analyses are commonly used by researchers for analyzing rare variants. However, existing variant-set-based approaches need to pre-specify genetic regions for analysis; hence, they are not directly applicable to WGS data because of the large number of intergenic and intron regions that consist of a massive number of non-coding variants. The commonly used sliding-window method requires the pre-specification of fixed window sizes, which are often unknown as a priori, are difficult to specify in practice, and are subject to limitations given that the sizes of genetic-association regions are likely to vary across the genome and phenotypes. We propose a computationally efficient and dynamic scan-statistic method (Scan the Genome [SCANG]) for analyzing WGS data; this method flexibly detects the sizes and the locations of rare-variant association regions without the need to specify a prior, fixed window size. The proposed method controls for the genome-wise type I error rate and accounts for the linkage disequilibrium among genetic variants. It allows the detected sizes of rare-variant association regions to vary across the genome. Through extensive simulated studies that consider a wide variety of scenarios, we show that SCANG substantially outperforms several alternative methods for detecting rare-variant-associations while controlling for the genome-wise type I error rates. We illustrate SCANG by analyzing the WGS lipids data from the Atherosclerosis Risk in Communities (ARIC) study.
Collapse
Affiliation(s)
- Zilin Li
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | - Xihao Li
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | - Yaowu Liu
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | - Jincheng Shen
- Department of Population Health Sciences, University of Utah, Salt Lake City, UT 84108, USA
| | - Han Chen
- Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, the University of Texas Health Science Center at Houston, Houston, TX 77030, USA; Center for Precision Health, School of Public Health and School of Biomedical Informatics, the University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Hufeng Zhou
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | - Alanna C Morrison
- Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, the University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Eric Boerwinkle
- Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, the University of Texas Health Science Center at Houston, Houston, TX 77030, USA; Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Xihong Lin
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA; Department of Statistics, Harvard University, Cambridge, MA 02138, USA.
| |
Collapse
|
9
|
Alkhamis M, Hijmans RJ, Al-Enezi A, Martínez-López B, Perea AM. The Use of Spatial and Spatiotemporal Modeling for Surveillance of H5N1 Highly Pathogenic Avian Influenza in Poultry in the Middle East. Avian Dis 2017; 60:146-55. [PMID: 27309050 DOI: 10.1637/11106-042115-reg] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Since 2005, H5N1 highly pathogenic avian influenza virus (HPAIV) has severely impacted the economy and public health in the Middle East (ME) with Egypt as the most affected country. Understanding the high-risk areas and spatiotemporal distribution of the H5N1 HPAIV in poultry is prerequisite for establishing risk-based surveillance activities at a regional level in the ME. Here, we aimed to predict the geographic range of H5N1 HPAIV outbreaks in poultry in the ME using a set of environmental variables and to investigate the spatiotemporal clustering of outbreaks in the region. Data from the ME for the period 2005-14 were analyzed using maximum entropy ecological niche modeling and the permutation model of the scan statistics. The predicted range of high-risk areas (P > 0.60) for H5N1 HPAIV in poultry included parts of the ME northeastern countries, whereas the Egyptian Nile delta and valley were estimated to be the most suitable locations for occurrence of H5N1 HPAIV outbreaks. The most important environmental predictor that contributed to risk for H5N1 HPAIV was the precipitation of the warmest quarter (47.2%), followed by the type of global livestock production system (18.1%). Most significant spatiotemporal clusters (P < 0.001) were detected in Egypt, Turkey, Kuwait, Saudi Arabia, and Sudan. Results suggest that more information related to poultry holding demographics is needed to further improve prediction of risk for H5N1 HPAIV in the ME, whereas the methodology presented here may be useful in guiding the design of surveillance programs and in identifying areas in which underreporting may have occurred.
Collapse
Affiliation(s)
- Mohammad Alkhamis
- A Environmental and Life Sciences Research Center, Kuwait Institute For Scientific Research, P.O. Box 24885, Safat 13109, Kuwait.,B Veterinary Population Medicine Department, Veterinary Medical Center, University of Minnesota, St. Paul, MN 55108
| | - Robert J Hijmans
- C Department of Environmental Science and Policy, One Shields Avenue, University of California, Davis, CA 95616
| | - Abdullah Al-Enezi
- A Environmental and Life Sciences Research Center, Kuwait Institute For Scientific Research, P.O. Box 24885, Safat 13109, Kuwait
| | - Beatriz Martínez-López
- D Center for Animal Disease Modeling and Surveillance, Department of Medicine and Epidemiology, School of Veterinary Medicine, One Shields Avenue, University of California, Davis, CA 95616
| | - Andres M Perea
- B Veterinary Population Medicine Department, Veterinary Medical Center, University of Minnesota, St. Paul, MN 55108
| |
Collapse
|
10
|
Beuscart JB, Genin M, Dupont C, Verloop D, Duhamel A, Defebvre MM, Puisieux F. Potentially inappropriate medication prescribing is associated with socioeconomic factors: a spatial analysis in the French Nord-Pas-de-Calais Region. Age Ageing 2017; 46:607-613. [PMID: 28064169 DOI: 10.1093/ageing/afw245] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2016] [Indexed: 11/12/2022] Open
Abstract
Background potentially inappropriate medication (PIM) prescribing is common in older people and leads to adverse events and hospital admissions. Objective to determine whether prevalence of PIM prescribing varies according to healthcare supply and socioeconomic status. Methods all prescriptions dispensed at community pharmacies for patients aged 75 and older between 1 January and 31 March 2012 were retrieved from French Health Insurance Information System of the Nord-Pas-de-Calais Region for patients affiliated to the Social Security scheme. PIM was defined according to the French list of Laroche. The geographic distribution of PIM prescribing in this area was analysed using spatial scan statistics. Results overall, 65.6% (n = 207,979) of people aged 75 years and over living in the Nord-Pas-de-Calais Region were included. Among them, 32.6% (n = 67,863) received at least one PIM. The spatial analysis identified 16 and 10 clusters of municipalities with a high and a low prevalence of PIM prescribing, respectively. Municipalities with a low prevalence of PIM were characterised by a high socioeconomic status whereas those with a high prevalence of PIM were mainly characterised by a low socioeconomic status, such as a high unemployment rate and low household incomes. Markers of healthcare supply were weakly associated with high or low prevalence clusters. Conclusion significant geographic variation in PIM prescribing was observed in the study territory and was mainly associated with socioeconomic factors.
Collapse
Affiliation(s)
- Jean-Baptiste Beuscart
- Univ. Lille, EA 2694 - Santé publique: épidémiologie et qualité des soins, F-59000 Lille, France
- CHU Lille, Geriatric Department, F-59000 Lille, France
| | - Michael Genin
- Univ. Lille, EA 2694 - Santé publique: épidémiologie et qualité des soins, F-59000 Lille, France
| | - Corrine Dupont
- Agence Régionale de Santé Nord-Pas-de-Calais, Lille, France
| | - David Verloop
- Agence Régionale de Santé Nord-Pas-de-Calais, Lille, France
| | - Alain Duhamel
- Univ. Lille, EA 2694 - Santé publique: épidémiologie et qualité des soins, F-59000 Lille, France
| | | | | |
Collapse
|
11
|
Li L, Xi Y, Ren F. Spatio-Temporal Distribution Characteristics and Trajectory Similarity Analysis of Tuberculosis in Beijing, China. Int J Environ Res Public Health 2016; 13:ijerph13030291. [PMID: 26959048 PMCID: PMC4808954 DOI: 10.3390/ijerph13030291] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/28/2015] [Revised: 02/26/2016] [Accepted: 03/01/2016] [Indexed: 11/16/2022]
Abstract
Tuberculosis (TB) is an infectious disease with one of the highest reported incidences in China. The detection of the spatio-temporal distribution characteristics of TB is indicative of its prevention and control conditions. Trajectory similarity analysis detects variations and loopholes in prevention and provides urban public health officials and related decision makers more information for the allocation of public health resources and the formulation of prioritized health-related policies. This study analysed the spatio-temporal distribution characteristics of TB from 2009 to 2014 by utilizing spatial statistics, spatial autocorrelation analysis, and space-time scan statistics. Spatial statistics measured the TB incidence rate (TB patients per 100,000 residents) at the district level to determine its spatio-temporal distribution and to identify characteristics of change. Spatial autocorrelation analysis was used to detect global and local spatial autocorrelations across the study area. Purely spatial, purely temporal and space-time scan statistics were used to identify purely spatial, purely temporal and spatio-temporal clusters of TB at the district level. The other objective of this study was to compare the trajectory similarities between the incidence rates of TB and new smear-positive (NSP) TB patients in the resident population (NSPRP)/new smear-positive TB patients in the TB patient population (NSPTBP)/retreated smear-positive (RSP) TB patients in the resident population (RSPRP)/retreated smear-positive TB patients in the TB patient population (RSPTBP) to detect variations and loopholes in TB prevention and control among the districts in Beijing. The incidence rates in Beijing exhibited a gradual decrease from 2009 to 2014. Although global spatial autocorrelation was not detected overall across all of the districts of Beijing, individual districts did show evidence of local spatial autocorrelation: Chaoyang and Daxing were Low-Low districts over the six-year period. The purely spatial scan statistics analysis showed significant spatial clusters of high and low incidence rates; the purely temporal scan statistics showed the temporal cluster with a three-year period from 2009 to 2011 characterized by a high incidence rate; and the space-time scan statistics analysis showed significant spatio-temporal clusters. The distribution of the mean centres (MCs) showed that the general distributions of the NSPRP MCs and NSPTBP MCs were to the east of the incidence rate MCs. Conversely, the general distributions of the RSPRP MCs and the RSPTBP MCs were to the south of the incidence rate MCs. Based on the combined analysis of MC distribution characteristics and trajectory similarities, the NSP trajectory was most similar to the incidence rate trajectory. Thus, more attention should be focused on the discovery of NSP patients in the western part of Beijing, whereas the northern part of Beijing needs intensive treatment for RSP patients.
Collapse
Affiliation(s)
- Lan Li
- School of Resource and Environmental Sciences, Wuhan University, 129 Luoyu Rd., Wuhan 430079, China.
- Key Laboratory of GIS, Ministry of Education, Wuhan University, 129 Luoyu Rd., Wuhan 430079, China.
- Key Laboratory of Digital Mapping and Land information Application Engineering, National Administration of Surveying, Mapping and Geoinformation, Wuhan University, 129 Luoyu Rd., Wuhan 430079, China.
| | - Yuliang Xi
- School of Resource and Environmental Sciences, Wuhan University, 129 Luoyu Rd., Wuhan 430079, China.
- Key Laboratory of GIS, Ministry of Education, Wuhan University, 129 Luoyu Rd., Wuhan 430079, China.
- Key Laboratory of Digital Mapping and Land information Application Engineering, National Administration of Surveying, Mapping and Geoinformation, Wuhan University, 129 Luoyu Rd., Wuhan 430079, China.
| | - Fu Ren
- School of Resource and Environmental Sciences, Wuhan University, 129 Luoyu Rd., Wuhan 430079, China.
- Key Laboratory of GIS, Ministry of Education, Wuhan University, 129 Luoyu Rd., Wuhan 430079, China.
- Key Laboratory of Digital Mapping and Land information Application Engineering, National Administration of Surveying, Mapping and Geoinformation, Wuhan University, 129 Luoyu Rd., Wuhan 430079, China.
- Collaborative Innovation Center of Geospatial Technology, Wuhan University, 129 Luoyu Rd., Wuhan 430079, China.
| |
Collapse
|
12
|
Abstract
Copy number variation (CNV) detection has become an integral part many of genetic studies and new technologies promise to revolutionize our ability to detect and link them to disease. However, recent studies highlight discrepancies in the genome wide CNV profile when measured by different technologies and even by the same technology. Furthermore, the change point algorithms used to call CNVs can have substantial disagreement on the same data set. We focus this article on comparative genomic hybridization (CGH) arrays because this platform lends itself well to accurate statistical modeling. We describe some newer methodological developments in local statistics that are well suited for CNV detection and calling on CGH arrays. Then we use both simulation studies and public data to compare these new local methods with the global methods that currently dominate literature. These results offer suggestions for choosing a particular method and provide insight to the lack of reproducibility that has been seen in the field so far.
Collapse
Affiliation(s)
- Siddharth Roy
- Department of Statistics, College of Physical and Mathematical Sciences, North Carolina State University Raleigh, NC, USA
| | | |
Collapse
|
13
|
de Lima MS, Duczmal LH, Pinto LP. Spatial Scan Statistics for Models with Excess Zeros and Overdispersion. Online J Public Health Inform 2013. [PMCID: PMC3692937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Objective To propose a more realistic model for disease cluster detection, through a modification of the spatial scan statistic to account simultaneously for inflated zeros and overdispersion. Introduction Spatial Scan Statistics [1] usually assume Poisson or Binomial distributed data, which is not adequate in many disease surveillance scenarios. For example, small areas distant from hospitals may exhibit a smaller number of cases than expected in those simple models. Also, underreporting may occur in underdeveloped regions, due to inefficient data collection or the difficulty to access remote sites. Those factors generate excess zero case counts or overdispersion, inducing a violation of the statistical model and also increasing the type I error (false alarms). Overdispersion occurs when data variance is greater than the predicted by the used model. To accommodate it, an extra parameter must be included; in the Poisson model, one makes the variance equal to the mean. Methods Tools like the Generalized Poisson (GP) and the Double Poisson [2] may be a better option for this kind of problem, modeling separately the mean and variance, which could be easily adjusted by covariates. When excess zeros occur, the Zero Inflated Poisson (ZIP) model is used, although ZIP’s estimated parameters may be severely biased if nonzero counts are too dispersed, compared to the Poisson distribution. In this case the Inflated Zero models for the Generalized Poisson (ZIGP), Double Poisson (ZIDP) and Negative Binomial (ZINB) could be good alternatives to the joint modeling of excess zeros and overdispersion. By one hand, Zero Inflated Poisson (ZIP) models were proposed using the spatial scan statistic to deal with the excess zeros [3]. By the other hand, another spatial scan statistic was based on a Poisson-Gamma mixture model for overdispersion [4]. In this work we present a model which includes inflated zeros and overdispersion simultaneously, based on the ZIDP model. Let the parameter p indicate the zero inflation. As the the remaining parameters of the observed cases map and the parameter p are not independent, the likelihood maximization process is not straightforward; it becomes even more complicated when we include covariates in the analysis. To solve this problem we introduce a vector of latent variables in order to factorize the likelihood, and obtain a facilitator for the maximization process using the E-M (Expectation-Maximization) algorithm. We derive the formulas to maximize iteratively the likelihood, and implement a computer program using the E-M algorithm to estimate the parameters under null and alternative hypothesis. The p-value is obtained via the Fast Double Bootstrap Test [5]. Results Numerical simulations are conducted to assess the effectiveness of the method. We present results for Hanseniasis surveillance in the Brazilian Amazon in 2010 using this technique. We obtain the most likely spatial clusters for the Poisson, ZIP, Poisson-Gamma mixture and ZIDP models and compare the results. Conclusions The Zero Inflated Double Poisson Spatial Scan Statistic for disease cluster detection incorporates the flexibility of previous models, accounting for inflated zeros and overdispersion simultaneously. The Hanseniasis study case map, due to excess of zero cases counts in many municipalities of the Brazilian Amazon and the presence of overdispersion, was a good benchmark to test the ZIDP model. The results obtained are easier to understand compared to each of the previous spatial scan statistic models, the Zero Inflated Poisson (ZIP) model and the Poisson-Gamma mixture model for overdispersion, taken separetely. The E-M algorithm and the Fast Double Bootstrap test are computationally efficient for this type of problem.
Collapse
Affiliation(s)
| | - Luiz H. Duczmal
- Universidade Federal de Minas Gerais, Belo Horizonte, Brazil;,Luiz H. Duczmal, E-mail:
| | | |
Collapse
|
14
|
Neill DB, Kumar T. Fast Multidimensional Subset Scan for Outbreak Detection and Characterization. Online J Public Health Inform 2013. [PMCID: PMC3692941] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022] Open
Abstract
Objective Introduction Methods Results Conclusions
Collapse
|
15
|
Abstract
Objective Introduction Methods Results Conclusions
Collapse
|
16
|
Xu S, Hambidge SJ, McClure DL, Daley MF, Glanz JM. A scan statistic for identifying optimal risk windows in vaccine safety studies using self-controlled case series design. Stat Med 2013; 32:3290-9. [PMID: 23303643 DOI: 10.1002/sim.5733] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2011] [Accepted: 12/18/2012] [Indexed: 11/06/2022]
Abstract
In the examination of the association between vaccines and rare adverse events after vaccination in postlicensure observational studies, it is challenging to define appropriate risk windows because prelicensure RCTs provide little insight on the timing of specific adverse events. Past vaccine safety studies have often used prespecified risk windows based on prior publications, biological understanding of the vaccine, and expert opinion. Recently, a data-driven approach was developed to identify appropriate risk windows for vaccine safety studies that use the self-controlled case series design. This approach employs both the maximum incidence rate ratio and the linear relation between the estimated incidence rate ratio and the inverse of average person time at risk, given a specified risk window. In this paper, we present a scan statistic that can identify appropriate risk windows in vaccine safety studies using the self-controlled case series design while taking into account the dependence of time intervals within an individual and while adjusting for time-varying covariates such as age and seasonality. This approach uses the maximum likelihood ratio test based on fixed-effects models, which has been used for analyzing data from self-controlled case series design in addition to conditional Poisson models.
Collapse
Affiliation(s)
- Stanley Xu
- The Institute for Health Research, Kaiser Permanente Colorado, Denver, CO, USA.
| | | | | | | | | |
Collapse
|
17
|
Abstract
A review of some methods for analysis of space-time disease surveillance data is presented. Increasingly, surveillance systems are capturing spatial and temporal data on disease and health outcomes in a variety of public health contexts. A vast and growing suite of methods exists for detection of outbreaks and trends in surveillance data and the selection of appropriate methods in a given surveillance context is not always clear. While most reviews of methods focus on algorithm performance, in practice, a variety of factors determine what methods are appropriate for surveillance. In this review, we focus on the role of contextual factors such as scale, scope, surveillance objective, disease characteristics, and technical issues in relation to commonly used approaches to surveillance. Methods are classified as testing-based or model-based approaches. Reviewing methods in the context of factors other than algorithm performance highlights important aspects of implementing and selecting appropriate disease surveillance methods.
Collapse
Affiliation(s)
- Colin Robertson
- Spatial Pattern Analysis & Research (SPAR) Laboratory, Dept. of Geography, University of Victoria, P.O. Box 3060, Victoria, BC, Canada V8W 3R4.
| | | | | | | |
Collapse
|
18
|
Abstract
Palindromes are symmetrical words of DNA in the sense that they read exactly the same as their reverse complementary sequences. Representing the occurrences of palindromes in a DNA molecule as points on the unit interval, the scan statistics can be used to identify regions of unusually high concentration of palindromes. These regions have been associated with the replication origins on a few herpesviruses in previous studies. However, the use of scan statistics requires the assumption that the points representing the palindromes are independently and uniformly distributed on the unit interval. In this paper, we provide a mathematical basis for this assumption by showing that in randomly generated DNA sequences, the occurrences of palindromes can be approximated by a Poisson process. An easily computable upper bound on the Wasserstein distance between the palindrome process and the Poisson process is obtained. This bound is then used as a guide to choose an optimal palindrome length in the analysis of a collection of 16 herpesvirus genomes. Regions harboring significant palindrome clusters are identified and compared to known locations of replication origins. This analysis brings out a few interesting extensions of the scan statistics that can help formulate an algorithm for more accurate prediction of replication origins.
Collapse
Affiliation(s)
- Ming-Ying Leung
- Department of Mathematical Sciences, University of Texas at El Paso, El Paso, TX 79968-0514, USA.
| | | | | | | |
Collapse
|
19
|
Abstract
Researchers working on the Department of Defense Global Emerging Infections System (DoD-GEIS) pilot system, the Electronic Surveillance System for the Early Notification of Community-Based Epidemics (ESSENCE), have applied scan statistics for early outbreak detection using both traditional and nontraditional data sources. These sources include medical data indexed by International Classification of Disease, 9th Revision (ICD-9) diagnosis codes, as well as less-specific, but potentially timelier, indicators such as records of over-the-counter remedy sales and of school absenteeism. Early efforts employed the Kulldorff scan statistic as implemented in the SaTScan software of the National Cancer Institute. A key obstacle to this application is that the input data streams are typically based on time-varying factors, such as consumer behavior, rather than simply on the populations of the component subregions. We have used both modeling and recent historical data distributions to obtain background spatial distributions. Data analyses have provided guidance on how to condition and model input data to avoid excessive clustering. We have used this methodology in combining data sources for both retrospective studies of known outbreaks and surveillance of high-profile events of concern to local public health authorities. We have integrated the scan statistic capability into a Microsoft Access-based system in which we may include or exclude data sources, vary time windows separately for different data sources, censor data from subsets of individual providers or subregions, adjust the background computation method, and run retrospective or simulated studies.
Collapse
Affiliation(s)
- Howard S Burkom
- National Security Technology Department, Johns Hopkins University Applied Physics Laboratory, 11100 Johns Hopkins Road, Laurel, MD 20723, USA.
| |
Collapse
|