1
|
Featherstone LA, Ingle DJ, Wirth W, Duchene S. How does date-rounding affect phylodynamic inference for public health? PLoS Comput Biol 2025; 21:e1012900. [PMID: 40215457 PMCID: PMC11991728 DOI: 10.1371/journal.pcbi.1012900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2024] [Accepted: 02/21/2025] [Indexed: 04/14/2025] Open
Abstract
Phylodynamic analyses infer epidemiological parameters from pathogen genome sequences for enhanced genomic surveillance in public health. Pathogen genome sequences and their associated sampling dates are the essential data in every analysis. However, sampling dates are usually associated with hospitalisation or testing and can sometimes be used to identify individual patients, posing a threat to patient confidentiality. To lower this risk, sampling dates are often given with reduced date-resolution to the month or year, which can potentially bias inference. Here, we introduce a practical guideline on when date-rounding biases the inference of epidemiologically important parameters across a diverse range of empirical and simulated datasets. We show that the direction of bias varies for different parameters, datasets, and tree priors, while compounding with lower date-resolution and higher substitution rates. We also find that bias decreases for datasets with longer sampling intervals, implying that our guideline is most applicable to emerging datasets. We conclude by discussing future solutions that prioritise patient confidentiality and propose a method for safer sharing of sampling dates that translates them them uniformly by a random number.
Collapse
Affiliation(s)
- Leo A. Featherstone
- Research School of Biology, Australian National University, Canberra, Australian Capital Territory, Australia
- Department of Microbiology and Immunology at the Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, Victoria, Australia
- The Kirby Institute, UNSW Sydney, Sydney, NSW, Australia
| | - Danielle J. Ingle
- Department of Microbiology and Immunology at the Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, Victoria, Australia
| | - Wytamma Wirth
- Department of Microbiology and Immunology at the Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, Victoria, Australia
- Centre for Pathogen Genomics, University of Melbourne, Melbourne, Victoria, Australia
| | - Sebastian Duchene
- Department of Microbiology and Immunology at the Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, Victoria, Australia
- DEMI unit, Department of Computational Biology, Institut Pasteur, Paris, France
| |
Collapse
|
2
|
Jaya FR, Brito BP, Darling AE. Evaluation of recombination detection methods for viral sequencing. Virus Evol 2023; 9:vead066. [PMID: 38131005 PMCID: PMC10734630 DOI: 10.1093/ve/vead066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 08/03/2023] [Accepted: 11/15/2023] [Indexed: 12/23/2023] Open
Abstract
Recombination is a key evolutionary driver in shaping novel viral populations and lineages. When unaccounted for, recombination can impact evolutionary estimations or complicate their interpretation. Therefore, identifying signals for recombination in sequencing data is a key prerequisite to further analyses. A repertoire of recombination detection methods (RDMs) have been developed over the past two decades; however, the prevalence of pandemic-scale viral sequencing data poses a computational challenge for existing methods. Here, we assessed eight RDMs: PhiPack (Profile), 3SEQ, GENECONV, recombination detection program (RDP) (OpenRDP), MaxChi (OpenRDP), Chimaera (OpenRDP), UCHIME (VSEARCH), and gmos; to determine if any are suitable for the analysis of bulk sequencing data. To test the performance and scalability of these methods, we analysed simulated viral sequencing data across a range of sequence diversities, recombination frequencies, and sample sizes. Furthermore, we provide a practical example for the analysis and validation of empirical data. We find that RDMs need to be scalable, use an analytical approach and resolution that is suitable for the intended research application, and are accurate for the properties of a given dataset (e.g. sequence diversity and estimated recombination frequency). Analysis of simulated and empirical data revealed that the assessed methods exhibited considerable trade-offs between these criteria. Overall, we provide general guidelines for the validation of recombination detection results, the benefits and shortcomings of each assessed method, and future considerations for recombination detection methods for the assessment of large-scale viral sequencing data.
Collapse
Affiliation(s)
- Frederick R Jaya
- Australian Institute for Microbiology & Infection, University of Technology Sydney, 15 Broadway, Ultimo, New South Wales 2007, Australia
- Ecology and Evolution, Research School of Biology, Australian National University, 134 Linnaeus Way, Acton, Australian Capital Territory 2600, Australia
| | - Barbara P Brito
- Australian Institute for Microbiology & Infection, University of Technology Sydney, 15 Broadway, Ultimo, New South Wales 2007, Australia
- New South Wales Department of Primary Industries, Elizabeth Macarthur Agricultural Institute, Woodbridge Road, Menangle, New South Wales 2568, Australia
| | - Aaron E Darling
- Australian Institute for Microbiology & Infection, University of Technology Sydney, 15 Broadway, Ultimo, New South Wales 2007, Australia
- Illumina Australia Pty Ltd, Ultimo, New South Wales 2007, Australia
| |
Collapse
|
3
|
Duvvuri VR, Hicks JT, Damodaran L, Grunnill M, Braukmann T, Wu J, Gubbay JB, Patel SN, Bahl J. Comparing the transmission potential from sequence and surveillance data of 2009 North American influenza pandemic waves. Infect Dis Model 2023; 8:240-252. [PMID: 36844759 PMCID: PMC9944206 DOI: 10.1016/j.idm.2023.02.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Revised: 02/10/2023] [Accepted: 02/15/2023] [Indexed: 02/18/2023] Open
Abstract
Technological advancements in phylodynamic modeling coupled with the accessibility of real-time pathogen genetic data are increasingly important for understanding the infectious disease transmission dynamics. In this study, we compare the transmission potentials of North American influenza A(H1N1)pdm09 derived from sequence data to that derived from surveillance data. The impact of the choice of tree-priors, informative epidemiological priors, and evolutionary parameters on the transmission potential estimation is evaluated. North American Influenza A(H1N1)pdm09 hemagglutinin (HA) gene sequences are analyzed using the coalescent and birth-death tree prior models to estimate the basic reproduction number (R 0 ). Epidemiological priors gathered from published literature are used to simulate the birth-death skyline models. Path-sampling marginal likelihood estimation is conducted to assess model fit. A bibliographic search to gather surveillance-based R 0 values were consistently lower (mean ≤ 1.2) when estimated by coalescent models than by the birth-death models with informative priors on the duration of infectiousness (mean ≥ 1.3 to ≤2.88 days). The user-defined informative priors for use in the birth-death model shift the directionality of epidemiological and evolutionary parameters compared to non-informative estimates. While there was no certain impact of clock rate and tree height on the R 0 estimation, an opposite relationship was observed between coalescent and birth-death tree priors. There was no significant difference (p = 0.46) between the birth-death model and surveillance R 0 estimates. This study concludes that tree-prior methodological differences may have a substantial impact on the transmission potential estimation as well as the evolutionary parameters. The study also reports a consensus between the sequence-based R 0 estimation and surveillance-based R 0 estimates. Altogether, these outcomes shed light on the potential role of phylodynamic modeling to augment existing surveillance and epidemiological activities to better assess and respond to emerging infectious diseases.
Collapse
Affiliation(s)
- Venkata R. Duvvuri
- Public Health Ontario, Toronto, Ontario, Canada,Department of Laboratory Medicine and Pathobiology, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada,Laboratory for Industrial and Applied Mathematics, Department of Mathematics and Statistics, York University, Toronto, Ontario, Canada,Center for the Ecology of Infectious Disease, Department of Infectious Diseases, Institute of Bioinformatics, University of Georgia, Athens, Georgia,Department of Epidemiology and Biostatistics, Institute of Bioinformatics, University of Georgia, Athens, Georgia,Corresponding author. Public Health Ontario, Toronto, Ontario, Canada.
| | - Joseph T. Hicks
- Center for the Ecology of Infectious Disease, Department of Infectious Diseases, Institute of Bioinformatics, University of Georgia, Athens, Georgia,Department of Epidemiology and Biostatistics, Institute of Bioinformatics, University of Georgia, Athens, Georgia
| | - Lambodhar Damodaran
- Center for the Ecology of Infectious Disease, Department of Infectious Diseases, Institute of Bioinformatics, University of Georgia, Athens, Georgia,Department of Epidemiology and Biostatistics, Institute of Bioinformatics, University of Georgia, Athens, Georgia
| | - Martin Grunnill
- Public Health Ontario, Toronto, Ontario, Canada,Laboratory for Industrial and Applied Mathematics, Department of Mathematics and Statistics, York University, Toronto, Ontario, Canada
| | | | - Jianhong Wu
- Laboratory for Industrial and Applied Mathematics, Department of Mathematics and Statistics, York University, Toronto, Ontario, Canada
| | - Jonathan B. Gubbay
- Public Health Ontario, Toronto, Ontario, Canada,Department of Laboratory Medicine and Pathobiology, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Samir N. Patel
- Public Health Ontario, Toronto, Ontario, Canada,Department of Laboratory Medicine and Pathobiology, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Justin Bahl
- Center for the Ecology of Infectious Disease, Department of Infectious Diseases, Institute of Bioinformatics, University of Georgia, Athens, Georgia,Department of Epidemiology and Biostatistics, Institute of Bioinformatics, University of Georgia, Athens, Georgia,Duke-NUS Graduate Medical School, Singapore,Corresponding author. Center for the Ecology of Infectious Disease, Department of Infectious Diseases, Institute of Bioinformatics, University of Georgia, Athens, Georgia, USA.
| |
Collapse
|
4
|
Ghafari M, du Plessis L, Raghwani J, Bhatt S, Xu B, Pybus OG, Katzourakis A. Purifying Selection Determines the Short-Term Time Dependency of Evolutionary Rates in SARS-CoV-2 and pH1N1 Influenza. Mol Biol Evol 2022; 39:6509523. [PMID: 35038728 PMCID: PMC8826518 DOI: 10.1093/molbev/msac009] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
High-throughput sequencing enables rapid genome sequencing during infectious disease outbreaks and provides an opportunity to quantify the evolutionary dynamics of pathogens in near real-time. One difficulty of undertaking evolutionary analyses over short timescales is the dependency of the inferred evolutionary parameters on the timespan of observation. Crucially, there are an increasing number of molecular clock analyses using external evolutionary rate priors to infer evolutionary parameters. However, it is not clear which rate prior is appropriate for a given time window of observation due to the time-dependent nature of evolutionary rate estimates. Here, we characterize the molecular evolutionary dynamics of SARS-CoV-2 and 2009 pandemic H1N1 (pH1N1) influenza during the first 12 months of their respective pandemics. We use Bayesian phylogenetic methods to estimate the dates of emergence, evolutionary rates, and growth rates of SARS-CoV-2 and pH1N1 over time and investigate how varying sampling window and data set sizes affect the accuracy of parameter estimation. We further use a generalized McDonald-Kreitman test to estimate the number of segregating nonneutral sites over time. We find that the inferred evolutionary parameters for both pandemics are time dependent, and that the inferred rates of SARS-CoV-2 and pH1N1 decline by ∼50% and ∼100%, respectively, over the course of 1 year. After at least 4 months since the start of sequence sampling, inferred growth rates and emergence dates remain relatively stable and can be inferred reliably using a logistic growth coalescent model. We show that the time dependency of the mean substitution rate is due to elevated substitution rates at terminal branches which are 2-4 times higher than those of internal branches for both viruses. The elevated rate at terminal branches is strongly correlated with an increasing number of segregating nonneutral sites, demonstrating the role of purifying selection in generating the time dependency of evolutionary parameters during pandemics.
Collapse
Affiliation(s)
- Mahan Ghafari
- Department of Zoology, University of Oxford, Oxford, United Kingdom
| | - Louis du Plessis
- Department of Zoology, University of Oxford, Oxford, United Kingdom
| | - Jayna Raghwani
- Department of Zoology, University of Oxford, Oxford, United Kingdom
| | - Samir Bhatt
- MRC Centre for Global Infectious Disease Analysis, Jameel Institute for Disease and Emergency Analytics, Imperial College London, London, United Kingdom
| | - Bo Xu
- Department of Earth System Science, Tsinghua University, Beijing, China
| | - Oliver G Pybus
- Department of Zoology, University of Oxford, Oxford, United Kingdom
| | - Aris Katzourakis
- Department of Zoology, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
5
|
Van Poelvoorde LAE, Bogaerts B, Fu Q, De Keersmaecker SCJ, Thomas I, Van Goethem N, Van Gucht S, Winand R, Saelens X, Roosens N, Vanneste K. Whole-genome-based phylogenomic analysis of the Belgian 2016-2017 influenza A(H3N2) outbreak season allows improved surveillance. Microb Genom 2021; 7. [PMID: 34477544 PMCID: PMC8715427 DOI: 10.1099/mgen.0.000643] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Seasonal influenza epidemics are associated with high mortality and morbidity in the human population. Influenza surveillance is critical for providing information to national influenza programmes and for making vaccine composition predictions. Vaccination prevents viral infections, but rapid influenza evolution results in emerging mutants that differ antigenically from vaccine strains. Current influenza surveillance relies on Sanger sequencing of the haemagglutinin (HA) gene. Its classification according to World Health Organization (WHO) and European Centre for Disease Prevention and Control (ECDC) guidelines is based on combining certain genotypic amino acid mutations and phylogenetic analysis. Next-generation sequencing technologies enable a shift to whole-genome sequencing (WGS) for influenza surveillance, but this requires laboratory workflow adaptations and advanced bioinformatics workflows. In this study, 253 influenza A(H3N2) positive clinical specimens from the 2016–2017 Belgian season underwent WGS using the Illumina MiSeq system. HA-based classification according to WHO/ECDC guidelines did not allow classification of all samples. A new approach, considering the whole genome, was investigated based on using powerful phylogenomic tools including beast and Nextstrain, which substantially improved phylogenetic classification. Moreover, Bayesian inference via beast facilitated reassortment detection by both manual inspection and computational methods, detecting intra-subtype reassortants at an estimated rate of 15 %. Real-time analysis (i.e. as an outbreak is ongoing) via Nextstrain allowed positioning of the Belgian isolates into the globally circulating context. Finally, integration of patient data with phylogenetic groups and reassortment status allowed detection of several associations that would have been missed when solely considering HA, such as hospitalized patients being more likely to be infected with A(H3N2) reassortants, and the possibility to link several phylogenetic groups to disease severity indicators could be relevant for epidemiological monitoring. Our study demonstrates that WGS offers multiple advantages for influenza monitoring in (inter)national influenza surveillance, and proposes an improved methodology. This allows leveraging all information contained in influenza genomes, and allows for more accurate genetic characterization and reassortment detection.
Collapse
Affiliation(s)
- Laura A E Van Poelvoorde
- Transversal Activities in Applied Genomics, Sciensano, Juliette Wytsmanstraat 14, Brussels, Belgium.,National Influenza Centre, Sciensano, Juliette Wytsmanstraat 14, Brussels, Belgium.,Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium.,VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
| | - Bert Bogaerts
- Transversal Activities in Applied Genomics, Sciensano, Juliette Wytsmanstraat 14, Brussels, Belgium.,Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium.,Department of Information Technology, IDLab, IMEC, Ghent University, Ghent, Belgium
| | - Qiang Fu
- Transversal Activities in Applied Genomics, Sciensano, Juliette Wytsmanstraat 14, Brussels, Belgium
| | | | - Isabelle Thomas
- National Influenza Centre, Sciensano, Juliette Wytsmanstraat 14, Brussels, Belgium
| | | | - Steven Van Gucht
- National Influenza Centre, Sciensano, Juliette Wytsmanstraat 14, Brussels, Belgium
| | - Raf Winand
- Transversal Activities in Applied Genomics, Sciensano, Juliette Wytsmanstraat 14, Brussels, Belgium
| | - Xavier Saelens
- Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium.,VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
| | - Nancy Roosens
- Transversal Activities in Applied Genomics, Sciensano, Juliette Wytsmanstraat 14, Brussels, Belgium
| | - Kevin Vanneste
- Transversal Activities in Applied Genomics, Sciensano, Juliette Wytsmanstraat 14, Brussels, Belgium
| |
Collapse
|
6
|
Balak N, Inan D, Ganau M, Zoia C, Sönmez S, Kurt B, Akgül A, Tez M. A simple mathematical tool to forecast COVID-19 cumulative case numbers. CLINICAL EPIDEMIOLOGY AND GLOBAL HEALTH 2021; 12:100853. [PMID: 34395949 PMCID: PMC8352661 DOI: 10.1016/j.cegh.2021.100853] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 07/20/2021] [Accepted: 08/04/2021] [Indexed: 02/07/2023] Open
Abstract
Objective Mathematical models are known to help determine potential intervention strategies by providing an approximate idea of the transmission dynamics of infectious diseases. To develop proper responses, not only are more accurate disease spread models needed, but also those that are easy to use. Materials and methods As of July 1, 2020, we selected the 20 countries with the highest numbers of COVID-19 cases in the world. Using the Verhulst–Pearl logistic function formula, we calculated estimates for the total number of cases for each country. We compared these estimates to the actual figures given by the WHO on the same dates. Finally, the formula was tested for longer-term reliability at t = 18 and t = 40 weeks. Results The Verhulst–Pearl logistic function formula estimated the actual numbers precisely, with only a 0.5% discrepancy on average for the first month. For all countries in the study and the world at large, the estimates for the 40th week were usually overestimated, although the estimates for some countries were still relatively close to the actual numbers in the forecasting long term. The estimated number for the world in general was about 8 times that actually observed for the long term. Conclusions The Verhulst–Pearl equation has the advantage of being very straightforward and applicable in clinical use for predicting the demand on hospitals in the short term of 4–6 weeks, which is usually enough time to reschedule elective procedures and free beds for new waves of the pandemic patients.
Collapse
Affiliation(s)
- Naci Balak
- Department of Neurosurgery, Istanbul Medeniyet University, Göztepe Education and Research Hospital, Istanbul, Turkey
- School of Applied Sciences, Marmara University, Istanbul, Turkey
| | - Deniz Inan
- Department of Statistics, Faculty of Arts and Sciences, Marmara University, Istanbul, Turkey
| | - Mario Ganau
- Department of Neurosurgery, Oxford University Hospitals NHS Foundation Trust, Oxford, UK
| | - Cesare Zoia
- Department of Neurosurgery, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy
| | - Sinan Sönmez
- School of Applied Sciences, Marmara University, Istanbul, Turkey
| | - Batuhan Kurt
- School of Applied Sciences, Marmara University, Istanbul, Turkey
| | - Ahmet Akgül
- School of Applied Sciences, Marmara University, Istanbul, Turkey
| | - Müjgan Tez
- Department of Statistics, Faculty of Arts and Sciences, Marmara University, Istanbul, Turkey
| |
Collapse
|
7
|
Featherstone LA, Di Giallonardo F, Holmes EC, Vaughan TG, Duchêne S. Infectious disease phylodynamics with occurrence data. Methods Ecol Evol 2021. [DOI: 10.1111/2041-210x.13620] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Leo A. Featherstone
- Department of Microbiology and Immunology Peter Doherty Institute for Infection and Immunity University of Melbourne Melbourne Vic. Australia
| | | | - Edward C. Holmes
- Marie Bashir Institute for Infectious Diseases and BiosecurityThe University of Sydney Sydney NSW Australia
- Charles Perkins Centre School of Life and Environmental Sciences The University of Sydney Sydney NSW Australia
- School of Medical Sciences The University of Sydney Sydney NSW Australia
| | - Timothy G. Vaughan
- Department of Biosystems Science and Engineering ETH Zurich Basel Switzerland
- Swiss Institute of Bioinformatics (SIB) Lausanne Switzerland
| | - Sebastián Duchêne
- Department of Microbiology and Immunology Peter Doherty Institute for Infection and Immunity University of Melbourne Melbourne Vic. Australia
| |
Collapse
|
8
|
Duchene S, Lemey P, Stadler T, Ho SYW, Duchene DA, Dhanasekaran V, Baele G. Bayesian Evaluation of Temporal Signal in Measurably Evolving Populations. Mol Biol Evol 2021; 37:3363-3379. [PMID: 32895707 PMCID: PMC7454806 DOI: 10.1093/molbev/msaa163] [Citation(s) in RCA: 72] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Phylogenetic methods can use the sampling times of molecular sequence data to calibrate the molecular clock, enabling the estimation of evolutionary rates and timescales for rapidly evolving pathogens and data sets containing ancient DNA samples. A key aspect of such calibrations is whether a sufficient amount of molecular evolution has occurred over the sampling time window, that is, whether the data can be treated as having come from a measurably evolving population. Here, we investigate the performance of a fully Bayesian evaluation of temporal signal (BETS) in sequence data. The method involves comparing the fit to the data of two models: a model in which the data are accompanied by the actual (heterochronous) sampling times, and a model in which the samples are constrained to be contemporaneous (isochronous). We conducted simulations under a wide range of conditions to demonstrate that BETS accurately classifies data sets according to whether they contain temporal signal or not, even when there is substantial among-lineage rate variation. We explore the behavior of this classification in analyses of five empirical data sets: modern samples of A/H1N1 influenza virus, the bacterium Bordetella pertussis, coronaviruses from mammalian hosts, ancient DNA from Hepatitis B virus, and mitochondrial genomes of dog species. Our results indicate that BETS is an effective alternative to other tests of temporal signal. In particular, this method has the key advantage of allowing a coherent assessment of the entire model, including the molecular clock and tree prior which are essential aspects of Bayesian phylodynamic analyses.
Collapse
Affiliation(s)
- Sebastian Duchene
- Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, VIC, Australia
| | - Philippe Lemey
- Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven, Leuven, Belgium
| | - Tanja Stadler
- Department of Biosystems Science and Engineering, ETH Zürich, Zürich, Switzerland
| | - Simon Y W Ho
- Swiss Institute of Bioinformatics, Basel, Switzerland.,School of Life and Environmental Sciences, University of Sydney, Sydney, NSW, Australia
| | - David A Duchene
- Research School of Biology, Australian National University, Canberra, ACT, Australia
| | - Vijaykrishna Dhanasekaran
- Department of Microbiology, Biomedicine Discovery Institute, Monash University, Melbourne, VIC, Australia
| | - Guy Baele
- Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven, Leuven, Belgium
| |
Collapse
|
9
|
Optimal Sampling Regimes for Estimating Population Dynamics. STATS 2021. [DOI: 10.3390/stats4020020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Ecologists are interested in modeling the population growth of species in various ecosystems. Specifically, logistic growth arises as a common model for population growth. Studying such growth can assist environmental managers in making better decisions when collecting data. Traditionally, ecological data is recorded on a regular time frequency and is very well-documented. However, sampling can be an expensive process due to available resources, money and time. Limiting sampling makes it challenging to properly track the growth of a population. Thus, this design study proposes an approach to sampling based on the dynamics associated with logistic growth. The proposed method is demonstrated via a simulation study across various theoretical scenarios to evaluate its performance in identifying optimal designs that best estimate the curves. Markov Chain Monte Carlo sampling techniques are implemented to predict the probability of the model parameters using Bayesian inference. The intention of this study is to demonstrate a method that can minimize the amount of time ecologists spend in the field, while maximizing the information provided by the data.
Collapse
|
10
|
Abstract
Phylogenetic trees inferred from sequence data often have branch lengths measured in the expected number of substitutions and therefore, do not have divergence times estimated. These trees give an incomplete view of evolutionary histories since many applications of phylogenies require time trees. Many methods have been developed to convert the inferred branch lengths from substitution unit to time unit using calibration points, but none is universally accepted as they are challenged in both scalability and accuracy under complex models. Here, we introduce a new method that formulates dating as a nonconvex optimization problem where the variance of log-transformed rate multipliers is minimized across the tree. On simulated and real data, we show that our method, wLogDate, is often more accurate than alternatives and is more robust to various model assumptions.
Collapse
Affiliation(s)
- Uyen Mai
- Department of Computer Science and Engineering, UC, San Diego, CA
| | - Siavash Mirarab
- Department of Electrical and Computer Engineering, UC, San Diego, CA
| |
Collapse
|
11
|
Duchene S, Featherstone L, Haritopoulou-Sinanidou M, Rambaut A, Lemey P, Baele G. Temporal signal and the phylodynamic threshold of SARS-CoV-2. Virus Evol 2020; 6:veaa061. [PMID: 33235813 PMCID: PMC7454936 DOI: 10.1093/ve/veaa061] [Citation(s) in RCA: 279] [Impact Index Per Article: 55.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
The ongoing SARS-CoV-2 outbreak marks the first time that large amounts of genome sequence data have been generated and made publicly available in near real time. Early analyses of these data revealed low sequence variation, a finding that is consistent with a recently emerging outbreak, but which raises the question of whether such data are sufficiently informative for phylogenetic inferences of evolutionary rates and time scales. The phylodynamic threshold is a key concept that refers to the point in time at which sufficient molecular evolutionary change has accumulated in available genome samples to obtain robust phylodynamic estimates. For example, before the phylodynamic threshold is reached, genomic variation is so low that even large amounts of genome sequences may be insufficient to estimate the virus’s evolutionary rate and the time scale of an outbreak. We collected genome sequences of SARS-CoV-2 from public databases at eight different points in time and conducted a range of tests of temporal signal to determine if and when the phylodynamic threshold was reached, and the range of inferences that could be reliably drawn from these data. Our results indicate that by 2 February 2020, estimates of evolutionary rates and time scales had become possible. Analyses of subsequent data sets, that included between 47 and 122 genomes, converged at an evolutionary rate of about 1.1 × 10−3 subs/site/year and a time of origin of around late November 2019. Our study provides guidelines to assess the phylodynamic threshold and demonstrates that establishing this threshold constitutes a fundamental step for understanding the power and limitations of early data in outbreak genome surveillance.
Collapse
Affiliation(s)
- Sebastian Duchene
- Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, Victoria, Australia, 3000
| | - Leo Featherstone
- Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, Victoria, Australia, 3000
| | - Melina Haritopoulou-Sinanidou
- Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, Victoria, Australia, 3000
| | - Andrew Rambaut
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, UK
| | - Philippe Lemey
- Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven, Leuven, Belgium
| | - Guy Baele
- Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven, Leuven, Belgium
| |
Collapse
|
12
|
Alkhamis MA, Li C, Torremorell M. Animal Disease Surveillance in the 21st Century: Applications and Robustness of Phylodynamic Methods in Recent U.S. Human-Like H3 Swine Influenza Outbreaks. Front Vet Sci 2020; 7:176. [PMID: 32373634 PMCID: PMC7186338 DOI: 10.3389/fvets.2020.00176] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Accepted: 03/16/2020] [Indexed: 11/22/2022] Open
Abstract
Emerging and endemic animal viral diseases continue to impose substantial impacts on animal and human health. Most current and past molecular surveillance studies of animal diseases investigated spatio-temporal and evolutionary dynamics of the viruses in a disjointed analytical framework, ignoring many uncertainties and made joint conclusions from both analytical approaches. Phylodynamic methods offer a uniquely integrated platform capable of inferring complex epidemiological and evolutionary processes from the phylogeny of viruses in populations using a single Bayesian statistical framework. In this study, we reviewed and outlined basic concepts and aspects of phylodynamic methods and attempted to summarize essential components of the methodology in one analytical pipeline to facilitate the proper use of the methods by animal health researchers. Also, we challenged the robustness of the posterior evolutionary parameters, inferred by the commonly used phylodynamic models, using hemagglutinin (HA) and polymerase basic 2 (PB2) segments of the currently circulating human-like H3 swine influenza (SI) viruses isolated in the United States and multiple priors. Subsequently, we compared similarities and differences between the posterior parameters inferred from sequence data using multiple phylodynamic models. Our suggested phylodynamic approach attempts to reduce the impact of its inherent limitations to offer less biased and biologically plausible inferences about the pathogen evolutionary characteristics to properly guide intervention activities. We also pinpointed requirements and challenges for integrating phylodynamic methods in routine animal disease surveillance activities.
Collapse
Affiliation(s)
- Moh A Alkhamis
- Department of Epidemiology and Biostatistics, Faculty of Public Health, Health Sciences Center, Kuwait University, Kuwait City, Kuwait.,Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, St. Paul, MN, United States
| | - Chong Li
- Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, St. Paul, MN, United States
| | - Montserrat Torremorell
- Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, St. Paul, MN, United States
| |
Collapse
|
13
|
Dudas G, Bedford T. The ability of single genes vs full genomes to resolve time and space in outbreak analysis. BMC Evol Biol 2019; 19:232. [PMID: 31878875 PMCID: PMC6933756 DOI: 10.1186/s12862-019-1567-0] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2019] [Accepted: 12/17/2019] [Indexed: 12/12/2022] Open
Abstract
Background Inexpensive pathogen genome sequencing has had a transformative effect on the field of phylodynamics, where ever increasing volumes of data have promised real-time insight into outbreaks of infectious disease. As well as the sheer volume of pathogen isolates being sequenced, the sequencing of whole pathogen genomes, rather than select loci, has allowed phylogenetic analyses to be carried out at finer time scales, often approaching serial intervals for infections caused by rapidly evolving RNA viruses. Despite its utility, whole genome sequencing of pathogens has not been adopted universally and targeted sequencing of loci is common in some pathogen-specific fields. Results In this study we highlighted the utility of sequencing whole genomes of pathogens by re-analysing a well-characterised collection of Ebola virus sequences in the form of complete viral genomes (≈19 kb long) or the rapidly evolving glycoprotein (GP, ≈2 kb long) gene. We have quantified changes in phylogenetic, temporal, and spatial inference resolution as a result of this reduction in data and compared these to theoretical expectations. Conclusions We propose a simple intuitive metric for quantifying temporal resolution, i.e. the time scale over which sequence data might be informative of various processes as a quick back-of-the-envelope calculation of statistical power available to molecular clock analyses.
Collapse
Affiliation(s)
- Gytis Dudas
- Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, 98109, USA. .,Gothenburg Global Biodiversity Centre, Carl Skottsbergs gata 22B, Gothenburg, 413 19, Sweden.
| | - Trevor Bedford
- Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, 98109, USA
| |
Collapse
|
14
|
Hidano A, Gates MC. Assessing biases in phylodynamic inferences in the presence of super-spreaders. Vet Res 2019; 50:74. [PMID: 31558163 PMCID: PMC6764146 DOI: 10.1186/s13567-019-0692-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2019] [Accepted: 08/28/2019] [Indexed: 12/03/2022] Open
Abstract
Phylodynamic analyses using pathogen genetic data have become popular for making epidemiological inferences. However, many methods assume that the underlying host population follows homogenous mixing patterns. Nevertheless, in real disease outbreaks, a small number of individuals infect a disproportionately large number of others (super-spreaders). Our objective was to quantify the degree of bias in estimating the epidemic starting date in the presence of super-spreaders using different sample selection strategies. We simulated 100 epidemics of a hypothetical pathogen (fast evolving foot and mouth disease virus-like) over a real livestock movement network allowing the genetic mutations in pathogen sequence. Genetic sequences were sampled serially over the epidemic, which were then used to estimate the epidemic starting date using Extended Bayesian Coalescent Skyline plot (EBSP) and Birth–death skyline plot (BDSKY) models. Our results showed that the degree of bias varies over different epidemic situations, with substantial overestimations on the epidemic duration occurring in some occasions. While the accuracy and precision of BDSKY were deteriorated when a super-spreader generated a larger proportion of secondary cases, those of EBSP were deteriorated when epidemics were shorter. The accuracies of the inference were similar irrespective of whether the analysis used all sampled sequences or only a subset of them, although the former required substantially longer computational times. When phylodynamic analyses need to be performed under a time constraint to inform policy makers, we suggest multiple phylodynamics models to be used simultaneously for a subset of data to ascertain the robustness of inferences.
Collapse
Affiliation(s)
- Arata Hidano
- EpiCentre, School of Veterinary Science, Massey University, Palmerston North, New Zealand.
| | - M Carolyn Gates
- EpiCentre, School of Veterinary Science, Massey University, Palmerston North, New Zealand
| |
Collapse
|
15
|
Adam DC, Scotch M, MacIntyre CR. Phylodynamics of Influenza A/H1N1pdm09 in India Reveals Circulation Patterns and Increased Selection for Clade 6b Residues and Other High Mortality Mutants. Viruses 2019; 11:E791. [PMID: 31462006 PMCID: PMC6783925 DOI: 10.3390/v11090791] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Revised: 08/23/2019] [Accepted: 08/24/2019] [Indexed: 01/03/2023] Open
Abstract
The clinical severity and observed case fatality ratio of influenza A/H1N1pdm09 in India, particularly in 2015 and 2017 far exceeds current global estimates. Reasons for these frequent and severe epidemic waves remain unclear. We used Bayesian phylodynamic methods to uncover possible genetic explanations for this, while also identifying the transmission dynamics of A/H1N1pdm09 between 2009 and 2017 to inform future public health interventions. We reveal a disproportionate selection at haemagglutinin residue positions associated with increased morbidity and mortality in India such as position 222 and clade 6B characteristic residues, relative to equivalent isolates circulating globally. We also identify for the first time, increased selection at position 186 as potentially explaining the severity of recent A/H1N1pdm09 epidemics in India. We reveal national routes of A/H1N1pdm09 transmission, identifying Maharashtra as the most important state for the spread throughout India, while quantifying climactic, ecological, and transport factors as drivers of within-country transmission. Together these results have important implications for future A/H1N1pdm09 surveillance and control within India, but also for epidemic and pandemic risk prediction around the world.
Collapse
Affiliation(s)
- Dillon C Adam
- Biosecurity Program, The Kirby Institute, Faculty of Medicine, University of New South Wales, Sydney, NSW 2052, Australia.
| | - Matthew Scotch
- Biosecurity Program, The Kirby Institute, Faculty of Medicine, University of New South Wales, Sydney, NSW 2052, Australia
- Biodesign Center for Environmental Health Engineering, Biodesign Institute, Arizona State University, Tempe, AZ 85287, USA
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
| | - C Raina MacIntyre
- Biosecurity Program, The Kirby Institute, Faculty of Medicine, University of New South Wales, Sydney, NSW 2052, Australia
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- College of Public Service & Community Solutions, Arizona State University, Tempe, AZ 85004, USA
| |
Collapse
|
16
|
Duchene S, Bouckaert R, Duchene DA, Stadler T, Drummond AJ. Phylodynamic Model Adequacy Using Posterior Predictive Simulations. Syst Biol 2019; 68:358-364. [PMID: 29945220 PMCID: PMC6368481 DOI: 10.1093/sysbio/syy048] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2018] [Accepted: 06/15/2018] [Indexed: 11/18/2022] Open
Abstract
Rapidly evolving pathogens, such as viruses and bacteria, accumulate genetic change at a similar timescale over which their epidemiological processes occur, such that, it is possible to make inferences about their infectious spread using phylogenetic time-trees. For this purpose it is necessary to choose a phylodynamic model. However, the resulting inferences are contingent on whether the model adequately describes key features of the data. Model adequacy methods allow formal rejection of a model if it cannot generate the main features of the data. We present TreeModelAdequacy, a package for the popular BEAST2 software that allows assessing the adequacy of phylodynamic models. We illustrate its utility by analyzing phylogenetic trees from two viral outbreaks of Ebola and H1N1 influenza. The main features of the Ebola data were adequately described by the coalescent exponential-growth model, whereas the H1N1 influenza data were best described by the birth–death susceptible-infected-recovered model.
Collapse
Affiliation(s)
- Sebastian Duchene
- Department of Biochemistry and Molecular Biology, Bio21 Molecular Science and Biotechnology Institute, University of Melbourne, Melbourne, Australia
| | - Remco Bouckaert
- Centre for Computational Evolution, University of Auckland, Auckland, New Zealand.,Max Planck Institute for the Science of Human History, Jena, Germany
| | - David A Duchene
- School of Life and Environmental Sciences, University of Sydney, Sydney, Australia
| | - Tanja Stadler
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Alexei J Drummond
- Centre for Computational Evolution, University of Auckland, Auckland, New Zealand
| |
Collapse
|
17
|
Baele G, Suchard MA, Rambaut A, Lemey P. Emerging Concepts of Data Integration in Pathogen Phylodynamics. Syst Biol 2017; 66:e47-e65. [PMID: 28173504 PMCID: PMC5837209 DOI: 10.1093/sysbio/syw054] [Citation(s) in RCA: 57] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2015] [Accepted: 06/02/2016] [Indexed: 12/24/2022] Open
Abstract
Phylodynamics has become an increasingly popular statistical framework to extract evolutionary and epidemiological information from pathogen genomes. By harnessing such information, epidemiologists aim to shed light on the spatio-temporal patterns of spread and to test hypotheses about the underlying interaction of evolutionary and ecological dynamics in pathogen populations. Although the field has witnessed a rich development of statistical inference tools with increasing levels of sophistication, these tools initially focused on sequences as their sole primary data source. Integrating various sources of information, however, promises to deliver more precise insights in infectious diseases and to increase opportunities for statistical hypothesis testing. Here, we review how the emerging concept of data integration is stimulating new advances in Bayesian evolutionary inference methodology which formalize a marriage of statistical thinking and evolutionary biology. These approaches include connecting sequence to trait evolution, such as for host, phenotypic and geographic sampling information, but also the incorporation of covariates of evolutionary and epidemic processes in the reconstruction procedures. We highlight how a full Bayesian approach to covariate modeling and testing can generate further insights into sequence evolution, trait evolution, and population dynamics in pathogen populations. Specific examples demonstrate how such approaches can be used to test the impact of host on rabies and HIV evolutionary rates, to identify the drivers of influenza dispersal as well as the determinants of rabies cross-species transmissions, and to quantify the evolutionary dynamics of influenza antigenicity. Finally, we briefly discuss how data integration is now also permeating through the inference of transmission dynamics, leading to novel insights into tree-generative processes and detailed reconstructions of transmission trees. [Bayesian inference; birth–death models; coalescent models; continuous trait evolution; covariates; data integration; discrete trait evolution; pathogen phylodynamics.
Collapse
Affiliation(s)
- Guy Baele
- Department of Microbiology and Immunology, Rega Institute, KU Leuven - University of Leuven, Leuven, Belgium
| | - Marc A. Suchard
- Department of Biomathematics, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
- Department of Biostatistics, School of Public Health, University of California, Los Angeles, CA 90095, USA
| | - Andrew Rambaut
- Institute of Evolutionary Biology, University of Edinburgh, Kings Buildings, Edinburgh EH9 3FL, UK
- Centre for Immunity, Infection and Evolution, University of Edinburgh, Kings Buildings, Edinburgh EH9 3FL, UK
| | - Philippe Lemey
- Department of Microbiology and Immunology, Rega Institute, KU Leuven - University of Leuven, Leuven, Belgium
| |
Collapse
|
18
|
Otieno JR, Agoti CN, Gitahi CW, Bett A, Ngama M, Medley GF, Cane PA, Nokes DJ. Molecular Evolutionary Dynamics of Respiratory Syncytial Virus Group A in Recurrent Epidemics in Coastal Kenya. J Virol 2016; 90:4990-5002. [PMID: 26937038 PMCID: PMC4859726 DOI: 10.1128/jvi.03105-15] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2015] [Accepted: 02/25/2016] [Indexed: 11/23/2022] Open
Abstract
UNLABELLED The characteristic recurrent epidemics of human respiratory syncytial virus (RSV) within communities may result from the genetic variability of the virus and associated evolutionary adaptation, reducing the efficiency of preexisting immune responses. We analyzed the molecular evolutionary changes in the attachment (G) glycoprotein of RSV-A viruses collected over 13 epidemic seasons (2000 to 2012) in Kilifi (n = 649), Kenya, and contemporaneous sequences (n = 1,131) collected elsewhere within Kenya and 28 other countries. Genetic diversity in the G gene in Kilifi was dynamic both within and between epidemics, characterized by frequent new variant introductions and limited variant persistence between consecutive epidemics. Four RSV-A genotypes were detected in Kilifi: ON1 (11.9%), GA2 (75.5%), GA5 (12.3%), and GA3 (0.3%), with predominant genotype replacement of GA5 by GA2 and then GA2 by ON1. Within these genotypes, there was considerable variation in potential N-glycosylation sites, with GA2 and ON1 viruses showing up to 15 different patterns involving eight possible sites. Further, we identified 15 positively selected and 34 genotype-distinguishing codon sites, with six of these sites exhibiting both characteristics. The mean substitution rate of the G ectodomain for the Kilifi data set was estimated at 3.58 × 10(-3) (95% highest posterior density interval = 3.04 to 4.16) nucleotide substitutions/site/year. Kilifi viruses were interspersed in the global phylogenetic tree, clustering mostly with Kenyan and European sequences. Our findings highlight ongoing genetic evolution and high diversity of circulating RSV-A strains, locally and globally, with potential antigenic differences. Taken together, these provide a possible explanation on the nature of recurrent local RSV epidemics. IMPORTANCE The mechanisms underlying recurrent epidemics of RSV are poorly understood. We observe high genetic diversity in circulating strains within and between epidemics in both local and global settings. On longer time scales (∼7 years) there is sequential replacement of genotypes, whereas on shorter time scales (one epidemic to the next or within epidemics) there is a high turnover of variants within genotypes. Further, this genetic diversity is predicted to be associated with variation in antigenic profiles. These observations provide an explanation for recurrent RSV epidemics and have potential implications on the long-term effectiveness of vaccines.
Collapse
Affiliation(s)
- James R Otieno
- Epidemiology and Demography Department, Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya
| | - Charles N Agoti
- Epidemiology and Demography Department, Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya
- Department of Biomedical Sciences, Pwani University, Kilifi, Kenya
| | - Caroline W Gitahi
- Epidemiology and Demography Department, Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya
| | - Ann Bett
- Epidemiology and Demography Department, Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya
| | - Mwanajuma Ngama
- Epidemiology and Demography Department, Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya
| | - Graham F Medley
- Department of Global Health and Development, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | | | - D James Nokes
- Epidemiology and Demography Department, Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya
- School of Life Sciences and WIDER, University of Warwick, Coventry, United Kingdom
| |
Collapse
|
19
|
Pathogen Epidemiology. ENCYCLOPEDIA OF EVOLUTIONARY BIOLOGY 2016. [PMCID: PMC7148661 DOI: 10.1016/b978-0-12-800049-6.00228-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
20
|
Pybus OG, Tatem AJ, Lemey P. Virus evolution and transmission in an ever more connected world. Proc Biol Sci 2015; 282:20142878. [PMID: 26702033 PMCID: PMC4707738 DOI: 10.1098/rspb.2014.2878] [Citation(s) in RCA: 80] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2014] [Accepted: 04/15/2015] [Indexed: 01/03/2023] Open
Abstract
The frequency and global impact of infectious disease outbreaks, particularly those caused by emerging viruses, demonstrate the need for a better understanding of how spatial ecology and pathogen evolution jointly shape epidemic dynamics. Advances in computational techniques and the increasing availability of genetic and geospatial data are helping to address this problem, particularly when both information sources are combined. Here, we review research at the intersection of evolutionary biology, human geography and epidemiology that is working towards an integrated view of spatial incidence, host mobility and viral genetic diversity. We first discuss how empirical studies have combined viral spatial and genetic data, focusing particularly on the contribution of evolutionary analyses to epidemiology and disease control. Second, we explore the interplay between virus evolution and global dispersal in more depth for two pathogens: human influenza A virus and chikungunya virus. We discuss the opportunities for future research arising from new analyses of human transportation and trade networks, as well as the associated challenges in accessing and sharing relevant spatial and genetic data.
Collapse
Affiliation(s)
- Oliver G Pybus
- Department of Zoology, University of Oxford, South Parks Road, Oxford OX1 3PS, UK
| | - Andrew J Tatem
- Department of Geography and Environment, University of Southampton, Highfield, Southampton SO17 1BJ, UK Fogarty International Center, National Institutes of Health, Bethesda, MA, USA Flowminder Foundation, Stockholm, Sweden
| | - Philippe Lemey
- Department of Microbiology and Immunology, Rega Institute, KU Leuven-University of Leuven, Leuven, Belgium
| |
Collapse
|
21
|
Valley-Omar Z, Nindo F, Mudau M, Hsiao M, Martin DP. Phylogenetic Exploration of Nosocomial Transmission Chains of 2009 Influenza A/H1N1 among Children Admitted at Red Cross War Memorial Children's Hospital, Cape Town, South Africa in 2011. PLoS One 2015; 10:e0141744. [PMID: 26565994 PMCID: PMC4643913 DOI: 10.1371/journal.pone.0141744] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2015] [Accepted: 10/11/2015] [Indexed: 12/27/2022] Open
Abstract
Traditional modes of investigating influenza nosocomial transmission have entailed a combination of confirmatory molecular diagnostic testing and epidemiological investigation. Common hospital-acquired infections like influenza require a discerning ability to distinguish between viral isolates to accurately identify patient transmission chains. We assessed whether influenza hemagglutinin sequence phylogenies can be used to enrich epidemiological data when investigating the extent of nosocomial transmission over a four-month period within a paediatric Hospital in Cape Town South Africa. Possible transmission chains/channels were initially determined through basic patient admission data combined with Maximum likelihood and time-scaled Bayesian phylogenetic analyses. These analyses suggested that most instances of potential hospital-acquired infections resulted from multiple introductions of Influenza A into the hospital, which included instances where virus hemagglutinin sequences were identical between different patients. Furthermore, a general inability to establish epidemiological transmission linkage of patients/viral isolates implied that identified isolates could have originated from asymptomatic hospital patients, visitors or hospital staff. In contrast, a traditional epidemiological investigation that used no viral phylogenetic analyses, based on patient co-admission into specific wards during a particular time-frame, suggested that multiple hospital acquired infection instances may have stemmed from a limited number of identifiable index viral isolates/patients. This traditional epidemiological analysis by itself could incorrectly suggest linkage between unrelated cases, underestimate the number of unique infections and may overlook the possible diffuse nature of hospital transmission, which was suggested by sequencing data to be caused by multiple unique introductions of influenza A isolates into individual hospital wards. We have demonstrated a functional role for viral sequence data in nosocomial transmission investigation through its ability to enrich traditional, non-molecular observational epidemiological investigation by teasing out possible transmission pathways and working toward more accurately enumerating the number of possible transmission events.
Collapse
Affiliation(s)
- Ziyaad Valley-Omar
- Centre for Respiratory Diseases and Meningitis, Virology, National Institute for Communicable Diseases, Sandringham, Johannesburg, South Africa
- University of Cape Town, Faculty of Health Sciences, Department of Clinical Laboratory Sciences Medical Virology, Observatory, Cape Town, South Africa
- * E-mail:
| | - Fredrick Nindo
- University of Cape Town, Faculty of Health Sciences, Institute of Infectious Disease and Molecular Medicine, Computational Biology Group, Observatory, Cape Town, South Africa
| | - Maanda Mudau
- Centre for Tuberculosis, National Institute for Communicable Diseases, Sandringham, Johannesburg, South Africa
| | - Marvin Hsiao
- University of Cape Town, Faculty of Health Sciences, Department of Clinical Laboratory Sciences Medical Virology, Observatory, Cape Town, South Africa
- National Health Laboratory Service, Groote Schuur Complex, Department of Clinical Virology, Observatory, Cape Town, South Africa
| | - Darren Patrick Martin
- University of Cape Town, Faculty of Health Sciences, Institute of Infectious Disease and Molecular Medicine, Computational Biology Group, Observatory, Cape Town, South Africa
| |
Collapse
|
22
|
Duvvuri VR, Granados A, Rosenfeld P, Bahl J, Eshaghi A, Gubbay JB. Genetic diversity and evolutionary insights of respiratory syncytial virus A ON1 genotype: global and local transmission dynamics. Sci Rep 2015; 5:14268. [PMID: 26420660 PMCID: PMC4588507 DOI: 10.1038/srep14268] [Citation(s) in RCA: 102] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2015] [Accepted: 08/21/2015] [Indexed: 12/04/2022] Open
Abstract
Human respiratory syncytial virus (RSV) A ON1 genotype, first detected in 2010 in Ontario, Canada, has been documented in 21 countries to date. This study investigated persistence and transmission dynamics of ON1 by grouping 406 randomly selected RSV-positive specimens submitted to Public Health Ontario from August 2011 to August 2012; RSV-A-positive specimens were genotyped. We identified 370 RSV-A (181 NA1, 135 NA2, 51 ON1 3 GA5) and 36 RSV-B positive specimens. We aligned time-stamped second hypervariable region (330 bp) of G-gene sequence data (global, n = 483; and Ontario, n = 60) to evaluate transmission dynamics. Global data suggests that the most recent common ancestor of ON1 emerged during the 2008–2009 season. Mean evolutionary rate of the global ON1 was 4.10 × 10−3 substitutions/site/year (95% BCI 3.1–5.0 × 10−3), not significantly different to that of Ontario ON1. The estimated mean reproductive number (R0 = ∼ 1.01) from global and Ontario sequences showed no significant difference and implies stability among global RSV-A ON1. This study suggests that local epidemics exhibit similar underlying evolutionary and epidemiological dynamics to that of the persistent global RSV-A ON1 population. These findings underscore the importance of continual molecular surveillance of RSV in order to gain a better understanding of epidemics.
Collapse
Affiliation(s)
- Venkata R Duvvuri
- Public Health Ontario, Toronto, Ontario, Canada.,University of Waterloo, Waterloo, Ontario, Canada (MPH student)
| | - Andrea Granados
- Public Health Ontario, Toronto, Ontario, Canada.,University of Toronto, Ontario, Canada
| | | | - Justin Bahl
- Center for Infectious Diseases, The University of Texas School of Public Health, Houston, Texas, United States of America
| | | | - Jonathan B Gubbay
- Public Health Ontario, Toronto, Ontario, Canada.,University of Toronto, Ontario, Canada.,Mount Sinai Hospital, Toronto, Ontario, Canada.,The Hospital for Sick Children, Toronto, Ontario, Canada
| |
Collapse
|
23
|
To TH, Jung M, Lycett S, Gascuel O. Fast Dating Using Least-Squares Criteria and Algorithms. Syst Biol 2015; 65:82-97. [PMID: 26424727 PMCID: PMC4678253 DOI: 10.1093/sysbio/syv068] [Citation(s) in RCA: 243] [Impact Index Per Article: 24.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2014] [Accepted: 09/16/2015] [Indexed: 11/26/2022] Open
Abstract
Phylogenies provide a useful way to understand the evolutionary history of genetic samples, and data sets with more than a thousand taxa are becoming increasingly common, notably with viruses (e.g., human immunodeficiency virus (HIV)). Dating ancestral events is one of the first, essential goals with such data. However, current sophisticated probabilistic approaches struggle to handle data sets of this size. Here, we present very fast dating algorithms, based on a Gaussian model closely related to the Langley–Fitch molecular-clock model. We show that this model is robust to uncorrelated violations of the molecular clock. Our algorithms apply to serial data, where the tips of the tree have been sampled through times. They estimate the substitution rate and the dates of all ancestral nodes. When the input tree is unrooted, they can provide an estimate for the root position, thus representing a new, practical alternative to the standard rooting methods (e.g., midpoint). Our algorithms exploit the tree (recursive) structure of the problem at hand, and the close relationships between least-squares and linear algebra. We distinguish between an unconstrained setting and the case where the temporal precedence constraint (i.e., an ancestral node must be older that its daughter nodes) is accounted for. With rooted trees, the former is solved using linear algebra in linear computing time (i.e., proportional to the number of taxa), while the resolution of the latter, constrained setting, is based on an active-set method that runs in nearly linear time. With unrooted trees the computing time becomes (nearly) quadratic (i.e., proportional to the square of the number of taxa). In all cases, very large input trees (>10,000 taxa) can easily be processed and transformed into time-scaled trees. We compare these algorithms to standard methods (root-to-tip, r8s version of Langley–Fitch method, and BEAST). Using simulated data, we show that their estimation accuracy is similar to that of the most sophisticated methods, while their computing time is much faster. We apply these algorithms on a large data set comprising 1194 strains of Influenza virus from the pdm09 H1N1 Human pandemic. Again the results show that these algorithms provide a very fast alternative with results similar to those of other computer programs. These algorithms are implemented in the LSD software (least-squares dating), which can be downloaded from http://www.atgc-montpellier.fr/LSD/, along with all our data sets and detailed results. An Online Appendix, providing additional algorithm descriptions, tables, and figures can be found in the Supplementary Material available on Dryad at http://dx.doi.org/10.5061/dryad.968t3.
Collapse
Affiliation(s)
- Thu-Hien To
- Institut de Biologie Computationnelle, LIRMM, UMR 5506 CNRS - Université de Montpellier, France
| | - Matthieu Jung
- Institut de Biologie Computationnelle, LIRMM, UMR 5506 CNRS - Université de Montpellier, France; IGBMC (Institut de Génétique et de Biologie Moléculaire et Cellulaire), INSERM, U596, CNRS, UMR7104, Université de Strasbourg, Illkirch, France
| | - Samantha Lycett
- Institute of Evolutionary Biology, University of Edinburgh, Ashworth Laboratories, Edinburgh, UK
| | - Olivier Gascuel
- Institut de Biologie Computationnelle, LIRMM, UMR 5506 CNRS - Université de Montpellier, France;
| |
Collapse
|
24
|
Su YCF, Bahl J, Joseph U, Butt KM, Peck HA, Koay ESC, Oon LLE, Barr IG, Vijaykrishna D, Smith GJD. Phylodynamics of H1N1/2009 influenza reveals the transition from host adaptation to immune-driven selection. Nat Commun 2015; 6:7952. [PMID: 26245473 PMCID: PMC4918339 DOI: 10.1038/ncomms8952] [Citation(s) in RCA: 96] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2014] [Accepted: 06/30/2015] [Indexed: 11/25/2022] Open
Abstract
Influenza A H1N1/2009 virus that emerged from swine rapidly replaced the previous seasonal H1N1 virus. Although the early emergence and diversification of H1N1/2009 is well characterized, the ongoing evolutionary and global transmission dynamics of the virus remain poorly investigated. To address this we analyse >3,000 H1N1/2009 genomes, including 214 full genomes generated from our surveillance in Singapore, in conjunction with antigenic data. Here we show that natural selection acting on H1N1/2009 directly after introduction into humans was driven by adaptation to the new host. Since then, selection has been driven by immunological escape, with these changes corresponding to restricted antigenic diversity in the virus population. We also show that H1N1/2009 viruses have been subject to regular seasonal bottlenecks and a global reduction in antigenic and genetic diversity in 2014.
Collapse
Affiliation(s)
- Yvonne C. F. Su
- Program in Emerging Infectious Diseases, Duke-NUS Graduate Medical School, 8 College Road, Singapore 169857, Singapore
| | - Justin Bahl
- Program in Emerging Infectious Diseases, Duke-NUS Graduate Medical School, 8 College Road, Singapore 169857, Singapore
- Division of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, The University of Texas, Houston, Texas 77030, USA
| | - Udayan Joseph
- Program in Emerging Infectious Diseases, Duke-NUS Graduate Medical School, 8 College Road, Singapore 169857, Singapore
| | - Ka Man Butt
- Program in Emerging Infectious Diseases, Duke-NUS Graduate Medical School, 8 College Road, Singapore 169857, Singapore
| | - Heidi A. Peck
- World Health Organisation Collaborating Centre for Reference and Research on Influenza, Melbourne, Victoria 3000, Australia
| | - Evelyn S. C. Koay
- Molecular Diagnosis Centre, Department of Laboratory Medicine, National University Hospital, Singapore 119074, Singapore
| | - Lynette L. E. Oon
- Department of Pathology, Singapore General Hospital, Singapore 169608, Singapore
| | - Ian G. Barr
- World Health Organisation Collaborating Centre for Reference and Research on Influenza, Melbourne, Victoria 3000, Australia
| | - Dhanasekaran Vijaykrishna
- Program in Emerging Infectious Diseases, Duke-NUS Graduate Medical School, 8 College Road, Singapore 169857, Singapore
- World Health Organisation Collaborating Centre for Reference and Research on Influenza, Melbourne, Victoria 3000, Australia
- Department of Microbiology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore
| | - Gavin J. D. Smith
- Program in Emerging Infectious Diseases, Duke-NUS Graduate Medical School, 8 College Road, Singapore 169857, Singapore
- World Health Organisation Collaborating Centre for Reference and Research on Influenza, Melbourne, Victoria 3000, Australia
- Duke Global Health Institute, Duke University, Durham, North Carolina 27708, USA
| |
Collapse
|
25
|
Gambhir M, Bozio C, O'Hagan JJ, Uzicanin A, Johnson LE, Biggerstaff M, Swerdlow DL. Infectious disease modeling methods as tools for informing response to novel influenza viruses of unknown pandemic potential. Clin Infect Dis 2015; 60 Suppl 1:S11-9. [PMID: 25878297 DOI: 10.1093/cid/civ083] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The rising importance of infectious disease modeling makes this an appropriate time for a guide for public health practitioners tasked with preparing for, and responding to, an influenza pandemic. We list several questions that public health practitioners commonly ask about pandemic influenza and match these with analytical methods, giving details on when during a pandemic the methods can be used, how long it might take to implement them, and what data are required. Although software to perform these tasks is available, care needs to be taken to understand: (1) the type of data needed, (2) the implementation of the methods, and (3) the interpretation of results in terms of model uncertainty and sensitivity. Public health leaders can use this article to evaluate the modeling literature, determine which methods can provide appropriate evidence for decision-making, and to help them request modeling work from in-house teams or academic groups.
Collapse
Affiliation(s)
- Manoj Gambhir
- Epidemiological Modelling Unit, Department of Epidemiology and Preventive Medicine, Monash University, Melbourne, Australia Modeling Unit, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC) IHRC Inc
| | - Catherine Bozio
- Graduate Program in Epidemiology and Molecules to Mankind, Laney Graduate School, Emory University
| | - Justin J O'Hagan
- Modeling Unit, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC) IHRC Inc
| | - Amra Uzicanin
- Division of Global Migration and Quarantine, National Center for Emerging and Zoonotic Infectious Diseases
| | | | | | - David L Swerdlow
- Modeling Unit and Office of the Director, NCIRD, CDC, Atlanta, Georgia
| |
Collapse
|
26
|
Duchêne S, Duchêne D, Holmes EC, Ho SYW. The Performance of the Date-Randomization Test in Phylogenetic Analyses of Time-Structured Virus Data. Mol Biol Evol 2015; 32:1895-906. [PMID: 25771196 DOI: 10.1093/molbev/msv056] [Citation(s) in RCA: 124] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Rates and timescales of viral evolution can be estimated using phylogenetic analyses of time-structured molecular sequences. This involves the use of molecular-clock methods, calibrated by the sampling times of the viral sequences. However, the spread of these sampling times is not always sufficient to allow the substitution rate to be estimated accurately. We conducted Bayesian phylogenetic analyses of simulated virus data to evaluate the performance of the date-randomization test, which is sometimes used to investigate whether time-structured data sets have temporal signal. An estimate of the substitution rate passes this test if its mean does not fall within the 95% credible intervals of rate estimates obtained using replicate data sets in which the sampling times have been randomized. We find that the test sometimes fails to detect rate estimates from data with no temporal signal. This error can be minimized by using a more conservative criterion, whereby the 95% credible interval of the estimate with correct sampling times should not overlap with those obtained with randomized sampling times. We also investigated the behavior of the test when the sampling times are not uniformly distributed throughout the tree, which sometimes occurs in empirical data sets. The test performs poorly in these circumstances, such that a modification to the randomization scheme is needed. Finally, we illustrate the behavior of the test in analyses of nucleotide sequences of cereal yellow dwarf virus. Our results validate the use of the date-randomization test and allow us to propose guidelines for interpretation of its results.
Collapse
Affiliation(s)
- Sebastián Duchêne
- School of Biological Sciences, University of Sydney, Sydney, NSW, Australia
| | - David Duchêne
- Research School of Biology, Australian National University, Canberra, ACT, Australia
| | - Edward C Holmes
- School of Biological Sciences, University of Sydney, Sydney, NSW, Australia Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, Sydney Medical School, University of Sydney, Sydney, NSW, Australia
| | - Simon Y W Ho
- School of Biological Sciences, University of Sydney, Sydney, NSW, Australia
| |
Collapse
|
27
|
Meyer AG, Spielman SJ, Bedford T, Wilke CO. Time dependence of evolutionary metrics during the 2009 pandemic influenza virus outbreak. Virus Evol 2015; 1:vev006. [PMID: 26770819 PMCID: PMC4710376 DOI: 10.1093/ve/vev006] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
With the expansion of DNA sequencing technology, quantifying evolution in emerging viral outbreaks has become an important tool for scientists and public health officials. Although it is known that the degree of sequence divergence significantly affects the calculation of evolutionary metrics in viral outbreaks, the extent and duration of this effect during an actual outbreak remains unclear. We have analyzed how limited divergence time during an early viral outbreak affects the accuracy of molecular evolutionary metrics. Using sequence data from the first 25 months of the 2009 pandemic H1N1 (pH1N1) outbreak, we calculated each of three different standard evolutionary metrics-molecular clock rate (i.e., evolutionary rate), whole gene dN/dS, and site-wise dN/dS-for hemagglutinin and neuraminidase, using increasingly longer time windows, from 1 month to 25 months. For the molecular clock rate, we found that at least three to four months of temporal divergence from the start of sampling was required to make precise estimates that also agreed with long-term values. For whole gene dN/dS, we found that at least two months of data were required to generate precise estimates, but six to nine months were required for estimates to approach their long term values. For site-wise dN/dS estimates, we found that at least six months of sampling divergence was required before the majority of sites had at least one mutation and were thus evolutionarily informative. Furthermore, eight months of sampling divergence was required before the site-wise estimates appropriately reflected the distribution of values expected from known protein-structure-based evolutionary pressure in influenza. In summary, we found that evolutionary metrics calculated from gene sequence data in early outbreaks should be expected to deviate from their long-term estimates for at least several months after the initial emergence and sequencing of the virus.
Collapse
Affiliation(s)
- Austin G. Meyer
- Department of Integrative Biology, Institute for Cellular and Molecular Biology, and Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, TX, USA, 78712
- School of Medicine, Texas Tech University Health Sciences Center, Lubbock, TX, USA, 79430
| | - Stephanie J. Spielman
- Department of Integrative Biology, Institute for Cellular and Molecular Biology, and Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, TX, USA, 78712
| | - Trevor Bedford
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA, 98109
| | - Claus O. Wilke
- Department of Integrative Biology, Institute for Cellular and Molecular Biology, and Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, TX, USA, 78712
| |
Collapse
|
28
|
Abstract
The number of emerging infectious diseases is increasing. Characterizing novel or re-emerging infections is aided by the availability of pathogen genomes. In this review, we evaluate methods that exploit pathogen sequences and the contribution of genomic analysis to understand the epidemiology of recently emerged infectious diseases.
Collapse
|
29
|
Kao RR, Haydon DT, Lycett SJ, Murcia PR. Supersize me: how whole-genome sequencing and big data are transforming epidemiology. Trends Microbiol 2014; 22:282-91. [PMID: 24661923 PMCID: PMC7125769 DOI: 10.1016/j.tim.2014.02.011] [Citation(s) in RCA: 90] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2013] [Revised: 02/17/2014] [Accepted: 02/24/2014] [Indexed: 01/08/2023]
Abstract
Whole-genome sequencing is used for forensic epidemiology. Big data can transform forensic epidemiology. Clustering, biases, wildlife reservoirs, and emerging infections can all be addressed. Phylodynamics approaches to integrate epidemiological and evolutionary data have been highly successful but still face scientific challenges.
In epidemiology, the identification of ‘who infected whom’ allows us to quantify key characteristics such as incubation periods, heterogeneity in transmission rates, duration of infectiousness, and the existence of high-risk groups. Although invaluable, the existence of many plausible infection pathways makes this difficult, and epidemiological contact tracing either uncertain, logistically prohibitive, or both. The recent advent of next-generation sequencing technology allows the identification of traceable differences in the pathogen genome that are transforming our ability to understand high-resolution disease transmission, sometimes even down to the host-to-host scale. We review recent examples of the use of pathogen whole-genome sequencing for the purpose of forensic tracing of transmission pathways, focusing on the particular problems where evolutionary dynamics must be supplemented by epidemiological information on the most likely timing of events as well as possible transmission pathways. We also discuss potential pitfalls in the over-interpretation of these data, and highlight the manner in which a confluence of this technology with sophisticated mathematical and statistical approaches has the potential to produce a paradigm shift in our understanding of infectious disease transmission and control.
Collapse
Affiliation(s)
- Rowland R Kao
- Boyd Orr Centre for Population and Ecosystem Health, College of Medical Veterinary and Life Sciences, University of Glasgow, G61 1QH, UK.
| | - Daniel T Haydon
- Boyd Orr Centre for Population and Ecosystem Health, College of Medical Veterinary and Life Sciences, University of Glasgow, G61 1QH, UK
| | - Samantha J Lycett
- Boyd Orr Centre for Population and Ecosystem Health, College of Medical Veterinary and Life Sciences, University of Glasgow, G61 1QH, UK
| | - Pablo R Murcia
- Medical Research Council (MRC) Centre for Virus Research, College of Medical, Veterinary and Life Sciences, University of Glasgow, G61 1QH, UK
| |
Collapse
|