Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Vaughan TG, Drummond AJ. A stochastic simulator of birth-death master equations with application to phylodynamics. Mol Biol Evol 2013;30:1480-93. [PMID: 23505043 PMCID: PMC3649681 DOI: 10.1093/molbev/mst057] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open

For:	Vaughan TG, Drummond AJ. A stochastic simulator of birth-death master equations with application to phylodynamics. Mol Biol Evol 2013;30:1480-93. [PMID: 23505043 PMCID: PMC3649681 DOI: 10.1093/molbev/mst057] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open

Number

Cited by Other Article(s)

Thompson A, Liebeskind BJ, Scully EJ, Landis MJ. Deep Learning and Likelihood Approaches for Viral Phylogeography Converge on the Same Answers Whether the Inference Model Is Right or Wrong. Syst Biol 2024;73:183-206. [PMID: 38189575 DOI: 10.1093/sysbio/syad074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 11/22/2023] [Accepted: 01/05/2024] [Indexed: 01/09/2024] Open

Abstract

Analysis of phylogenetic trees has become an essential tool in epidemiology. Likelihood-based methods fit models to phylogenies to draw inferences about the phylodynamics and history of viral transmission. However, these methods are often computationally expensive, which limits the complexity and realism of phylodynamic models and makes them ill-suited for informing policy decisions in real-time during rapidly developing outbreaks. Likelihood-free methods using deep learning are pushing the boundaries of inference beyond these constraints. In this paper, we extend, compare, and contrast a recently developed deep learning method for likelihood-free inference from trees. We trained multiple deep neural networks using phylogenies from simulated outbreaks that spread among 5 locations and found they achieve close to the same levels of accuracy as Bayesian inference under the true simulation model. We compared robustness to model misspecification of a trained neural network to that of a Bayesian method. We found that both models had comparable performance, converging on similar biases. We also implemented a method of uncertainty quantification called conformalized quantile regression that we demonstrate has similar patterns of sensitivity to model misspecification as Bayesian highest posterior density (HPD) and greatly overlap with HPDs, but have lower precision (more conservative). Finally, we trained and tested a neural network against phylogeographic data from a recent study of the SARS-Cov-2 pandemic in Europe and obtained similar estimates of region-specific epidemiological parameters and the location of the common ancestor in Europe. Along with being as accurate and robust as likelihood-based methods, our trained neural networks are on average over 3 orders of magnitude faster after training. Our results support the notion that neural networks can be trained with simulated data to accurately mimic the good and bad statistical properties of the likelihood functions of generative phylogenetic models.

Collapse

Müller NF, Bouckaert RR, Wu CH, Bedford T. MASCOT-Skyline integrates population and migration dynamics to enhance phylogeographic reconstructions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.06.583734. [PMID: 38496513 PMCID: PMC10942421 DOI: 10.1101/2024.03.06.583734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]

Soewongsono AC, Landis MJ. A Diffusion-Based Approach for Simulating Forward-in-Time State-Dependent Speciation and Extinction Dynamics. ARXIV 2024:arXiv:2402.00246v1. [PMID: 38351931 PMCID: PMC10862938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]

Vaughan TG. ReMASTER: improved phylodynamic simulation for BEAST 2.7. Bioinformatics 2024;40:btae015. [PMID: 38195927 PMCID: PMC10796175 DOI: 10.1093/bioinformatics/btae015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 12/30/2023] [Accepted: 01/08/2024] [Indexed: 01/11/2024] Open

Weber A, Översti S, Kühnert D. Reconstructing relative transmission rates in Bayesian phylodynamics: Two-fold transmission advantage of Omicron in Berlin, Germany during December 2021. Virus Evol 2023;9:vead070. [PMID: 38107332 PMCID: PMC10725310 DOI: 10.1093/ve/vead070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Revised: 11/08/2023] [Accepted: 11/27/2023] [Indexed: 12/19/2023] Open

Johnson B, Shuai Y, Schweinsberg J, Curtius K. cloneRate: fast estimation of single-cell clonal dynamics using coalescent theory. Bioinformatics 2023;39:btad561. [PMID: 37699006 PMCID: PMC10534056 DOI: 10.1093/bioinformatics/btad561] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 08/25/2023] [Indexed: 09/14/2023] Open

Lewinsohn MA, Bedford T, Müller NF, Feder AF. State-dependent evolutionary models reveal modes of solid tumour growth. Nat Ecol Evol 2023;7:581-596. [PMID: 36894662 PMCID: PMC10089931 DOI: 10.1038/s41559-023-02000-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Accepted: 01/26/2023] [Indexed: 03/11/2023]

Danesh G, Saulnier E, Gascuel O, Choisy M, Alizon S. TiPS : Rapidly simulating trajectories and phylogenies from compartmental models. Methods Ecol Evol 2022. [DOI: 10.1111/2041-210x.14038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Didelot X, Helekal D, Kendall M, Ribeca P. Distinguishing imported cases from locally acquired cases within a geographically limited genomic sample of an infectious disease. Bioinformatics 2022;39:6849542. [PMID: 36440957 PMCID: PMC9805578 DOI: 10.1093/bioinformatics/btac761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Revised: 11/17/2022] [Accepted: 11/24/2022] [Indexed: 11/30/2022] Open

Shchur V, Spirin V, Sirotkin D, Burovski E, De Maio N, Corbett-Detig R. VGsim: Scalable viral genealogy simulator for global pandemic. PLoS Comput Biol 2022;18:e1010409. [PMID: 36001646 PMCID: PMC9447924 DOI: 10.1371/journal.pcbi.1010409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Revised: 09/06/2022] [Accepted: 07/18/2022] [Indexed: 11/24/2022] Open

Abstract

Accurate simulation of complex biological processes is an essential component of developing and validating new technologies and inference approaches. As an effort to help contain the COVID-19 pandemic, large numbers of SARS-CoV-2 genomes have been sequenced from most regions in the world. More than 5.5 million viral sequences are publicly available as of November 2021. Many studies estimate viral genealogies from these sequences, as these can provide valuable information about the spread of the pandemic across time and space. Additionally such data are a rich source of information about molecular evolutionary processes including natural selection, for example allowing the identification of new variants with transmissibility and immunity evasion advantages. To our knowledge, there is no framework that is both efficient and flexible enough to simulate the pandemic to approximate world-scale scenarios and generate viral genealogies of millions of samples. Here, we introduce a new fast simulator VGsim which addresses the problem of simulation genealogies under epidemiological models. The simulation process is split into two phases. During the forward run the algorithm generates a chain of population-level events reflecting the dynamics of the pandemic using an hierarchical version of the Gillespie algorithm. During the backward run a coalescent-like approach generates a tree genealogy of samples conditioning on the population-level events chain generated during the forward run. Our software can model complex population structure, epistasis and immunity escape.

We develop a fast and flexible simulation software package VGsim for modeling epidemiological processes and generating genealogies of large pathogen samples. The software takes into account host population structure, pathogen evolution, host immunity and some other epidemiological aspects. The computational efficiency of the package allows to simulate genealogies of tens of millions of samples, which is important, e.g., for SARS-CoV-2 genome studies.

Collapse

Robust Phylodynamic Analysis of Genetic Sequencing Data from Structured Populations. Viruses 2022;14:v14081648. [PMID: 36016270 PMCID: PMC9413058 DOI: 10.3390/v14081648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 07/22/2022] [Indexed: 02/04/2023] Open

Menardo F. Understanding drivers of phylogenetic clustering and terminal branch lengths distribution in epidemics of Mycobacterium tuberculosis. eLife 2022;11:76780. [PMID: 35762734 PMCID: PMC9239681 DOI: 10.7554/elife.76780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Accepted: 06/15/2022] [Indexed: 11/13/2022] Open

Shchur V, Spirin V, Sirotkin D, Burovski E, De Maio N, Corbett-Detig R. VGsim: scalable viral genealogy simulator for global pandemic. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2021:2021.04.21.21255891. [PMID: 33948608 PMCID: PMC8095227 DOI: 10.1101/2021.04.21.21255891] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Featherstone LA, Di Giallonardo F, Holmes EC, Vaughan TG, Duchêne S. Infectious disease phylodynamics with occurrence data. Methods Ecol Evol 2021. [DOI: 10.1111/2041-210x.13620] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Müller NF, Wagner C, Frazar CD, Roychoudhury P, Lee J, Moncla LH, Pelle B, Richardson M, Ryke E, Xie H, Shrestha L, Addetia A, Rachleff VM, Lieberman NAP, Huang ML, Gautom R, Melly G, Hiatt B, Dykema P, Adler A, Brandstetter E, Han PD, Fay K, Ilcisin M, Lacombe K, Sibley TR, Truong M, Wolf CR, Boeckh M, Englund JA, Famulare M, Lutz BR, Rieder MJ, Thompson M, Duchin JS, Starita LM, Chu HY, Shendure J, Jerome KR, Lindquist S, Greninger AL, Nickerson DA, Bedford T. Viral genomes reveal patterns of the SARS-CoV-2 outbreak in Washington State. Sci Transl Med 2021;13:eabf0202. [PMID: 33941621 PMCID: PMC8158963 DOI: 10.1126/scitranslmed.abf0202] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 01/23/2021] [Accepted: 04/25/2021] [Indexed: 12/16/2022]

Affiliation(s)

Nicola F Müller Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA.
Cassia Wagner Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
Chris D Frazar Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
Pavitra Roychoudhury Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, USA
Jover Lee Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
Louise H Moncla Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
Benjamin Pelle Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
Matthew Richardson Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
Erica Ryke Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
Hong Xie Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, USA
Lasata Shrestha Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, USA
Amin Addetia Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, USA
Victoria M Rachleff Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, USA
Nicole A P Lieberman Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, USA
Meei-Li Huang Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, USA
Romesh Gautom Washington State Department of Health, Shoreline, WA 98155, USA
Geoff Melly Washington State Department of Health, Shoreline, WA 98155, USA
Brian Hiatt Washington State Department of Health, Shoreline, WA 98155, USA
Philip Dykema Washington State Department of Health, Shoreline, WA 98155, USA
Amanda Adler Seattle Children's Research Institute, Seattle, WA 98101, USA
Elisabeth Brandstetter Department of Medicine, Division of Allergy and Infectious Diseases, University of Washington, Seattle, WA 98195, USA
Peter D Han Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
Kairsten Fay Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
Misja Ilcisin Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
Kirsten Lacombe Seattle Children's Research Institute, Seattle, WA 98101, USA
Thomas R Sibley Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
Melissa Truong Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
Caitlin R Wolf Department of Medicine, Division of Allergy and Infectious Diseases, University of Washington, Seattle, WA 98195, USA
Michael Boeckh Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA Department of Medicine, Division of Allergy and Infectious Diseases, University of Washington, Seattle, WA 98195, USA Brotman Baty Institute for Precision Medicine, Seattle, WA 98195, USA
Janet A Englund Seattle Children's Research Institute, Seattle, WA 98101, USA Department of Pediatrics, University of Washington, Seattle, WA 98105, USA
Michael Famulare Institute for Disease Modeling, Bellevue, WA 98105, USA
Barry R Lutz Brotman Baty Institute for Precision Medicine, Seattle, WA 98195, USA Department of Bioengineering, University of Washington, Seattle, WA 98105, USA
Mark J Rieder Brotman Baty Institute for Precision Medicine, Seattle, WA 98195, USA
Matthew Thompson Department of Global Health, University of Washington, Seattle, WA 98195, USA
Jeffrey S Duchin Department of Medicine, Division of Allergy and Infectious Diseases, University of Washington, Seattle, WA 98195, USA Public Health - Seattle & King County, Seattle, WA98121, USA
Lea M Starita Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA Brotman Baty Institute for Precision Medicine, Seattle, WA 98195, USA
Helen Y Chu Department of Medicine, Division of Allergy and Infectious Diseases, University of Washington, Seattle, WA 98195, USA Brotman Baty Institute for Precision Medicine, Seattle, WA 98195, USA
Jay Shendure Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA Brotman Baty Institute for Precision Medicine, Seattle, WA 98195, USA Howard Hughes Medical Institute, Seattle, WA 98195, USA
Keith R Jerome Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, USA
Scott Lindquist Washington State Department of Health, Shoreline, WA 98155, USA
Alexander L Greninger Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, USA
Deborah A Nickerson Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA Brotman Baty Institute for Precision Medicine, Seattle, WA 98195, USA
Trevor Bedford Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA. Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA Brotman Baty Institute for Precision Medicine, Seattle, WA 98195, USA

Collapse

Quantifying transmission fitness costs of multi-drug resistant tuberculosis. Epidemics 2021;36:100471. [PMID: 34256273 DOI: 10.1016/j.epidem.2021.100471] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2019] [Revised: 01/14/2020] [Accepted: 05/17/2021] [Indexed: 11/22/2022] Open

Abstract

As multi-drug resistant tuberculosis (MDR-TB) continues to spread, investigating the transmission potential of different drug-resistant strains becomes an ever more pressing topic in public health. While phylogenetic and transmission tree inferences provide valuable insight into possible transmission chains, phylodynamic inference combines evolutionary and epidemiological analyses to estimate the parameters of the underlying epidemiological processes, allowing us to describe the overall dynamics of disease spread in the population. In this study, we introduce an approach to Mycobacterium tuberculosis (M. tuberculosis) phylodynamic analysis employing an existing computationally efficient model to quantify the transmission fitness costs of drug resistance with respect to drug-sensitive strains. To determine the accuracy and precision of our approach, we first perform a simulation study, mimicking the simultaneous spread of drug-sensitive and drug-resistant tuberculosis (TB) strains. We analyse the simulated transmission trees using the phylodynamic multi-type birth-death model (MTBD, (Kühnert et al., 2016)) within the BEAST2 framework and show that this model can estimate the parameters of the epidemic well, despite the simplifying assumptions that MTBD makes compared to the complex TB transmission dynamics used for simulation. We then apply the MTBD model to an M. tuberculosis lineage 4 dataset that primarily consists of MDR sequences. Some of the MDR strains additionally exhibit resistance to pyrazinamide - an important first-line anti-tuberculosis drug. Our results support the previously proposed hypothesis that pyrazinamide resistance confers a transmission fitness cost to the bacterium, which we quantify for the given dataset. Importantly, our sensitivity analyses show that the estimates are robust to different prior distributions on the resistance acquisition rate, but are affected by the size of the dataset - i.e. we estimate a higher fitness cost when using fewer sequences for analysis. Overall, we propose that MTBD can be used to quantify the transmission fitness cost for a wide range of pathogens where the strains can be appropriately divided into two or more categories with distinct properties.

Collapse

Parag KV, Pybus OG, Wu CH. Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions? Syst Biol 2021;71:121-138. [PMID: 33989428 PMCID: PMC8677568 DOI: 10.1093/sysbio/syab037] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Revised: 05/07/2021] [Accepted: 05/08/2021] [Indexed: 11/13/2022] Open

Duchene S, Lemey P, Stadler T, Ho SYW, Duchene DA, Dhanasekaran V, Baele G. Bayesian Evaluation of Temporal Signal in Measurably Evolving Populations. Mol Biol Evol 2021;37:3363-3379. [PMID: 32895707 PMCID: PMC7454806 DOI: 10.1093/molbev/msaa163] [Citation(s) in RCA: 64] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open

Stadler T, Pybus OG, Stumpf MPH. Phylodynamics for cell biologists. Science 2021;371:371/6526/eaah6266. [PMID: 33446527 DOI: 10.1126/science.aah6266] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Accepted: 08/13/2020] [Indexed: 12/12/2022]

The Impacts of Low Diversity Sequence Data on Phylodynamic Inference during an Emerging Epidemic. Viruses 2021;13:v13010079. [PMID: 33430050 PMCID: PMC7826997 DOI: 10.3390/v13010079] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Revised: 01/05/2021] [Accepted: 01/05/2021] [Indexed: 01/06/2023] Open

Volz EM, Carsten W, Grad YH, Frost SDW, Dennis AM, Didelot X. Identification of Hidden Population Structure in Time-Scaled Phylogenies. Syst Biol 2021;69:884-896. [PMID: 32049340 PMCID: PMC8559910 DOI: 10.1093/sysbio/syaa009] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Revised: 01/09/2020] [Accepted: 01/23/2020] [Indexed: 11/13/2022] Open

Abstract

Population structure influences genealogical patterns, however, data pertaining to how populations are structured are often unavailable or not directly observable. Inference of population structure is highly important in molecular epidemiology where pathogen phylogenetics is increasingly used to infer transmission patterns and detect outbreaks. Discrepancies between observed and idealized genealogies, such as those generated by the coalescent process, can be quantified, and where significant differences occur, may reveal the action of natural selection, host population structure, or other demographic and epidemiological heterogeneities. We have developed a fast non-parametric statistical test for detection of cryptic population structure in time-scaled phylogenetic trees. The test is based on contrasting estimated phylogenies with the theoretically expected phylodynamic ordering of common ancestors in two clades within a coalescent framework. These statistical tests have also motivated the development of algorithms which can be used to quickly screen a phylogenetic tree for clades which are likely to share a distinct demographic or epidemiological history. Epidemiological applications include identification of outbreaks in vulnerable host populations or rapid expansion of genotypes with a fitness advantage. To demonstrate the utility of these methods for outbreak detection, we applied the new methods to large phylogenies reconstructed from thousands of HIV-1 partial pol sequences. This revealed the presence of clades which had grown rapidly in the recent past and was significantly concentrated in young men, suggesting recent and rapid transmission in that group. Furthermore, to demonstrate the utility of these methods for the study of antimicrobial resistance, we applied the new methods to a large phylogeny reconstructed from whole genome Neisseria gonorrhoeae sequences. We find that population structure detected using these methods closely overlaps with the appearance and expansion of mutations conferring antimicrobial resistance. [Antimicrobial resistance; coalescent; HIV; population structure.].

Collapse

Müller NF, Wagner C, Frazar CD, Roychoudhury P, Lee J, Moncla LH, Pelle B, Richardson M, Ryke E, Xie H, Shrestha L, Addetia A, Rachleff VM, Lieberman NAP, Huang ML, Gautom R, Melly G, Hiatt B, Dykema P, Adler A, Brandstetter E, Han PD, Fay K, Llcisin M, Lacombe K, Sibley TR, Truong M, Wolf CR, Boeckh M, Englund JA, Famulare M, Lutz BR, Rieder MJ, Thompson M, Duchin JS, Starita LM, Chu HY, Shendure J, Jerome KR, Lindquist S, Greninger AL, Nickerson DA, Bedford T. Viral genomes reveal patterns of the SARS-CoV-2 outbreak in Washington State. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2020:2020.09.30.20204230. [PMID: 33024981 PMCID: PMC7536883 DOI: 10.1101/2020.09.30.20204230] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]

Affiliation(s)

Nicola F Müller Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Cassia Wagner Fred Hutchinson Cancer Research Center, Seattle, WA, USA University of Washington, Seattle, WA, USA
Chris D Frazar University of Washington, Seattle, WA, USA
Pavitra Roychoudhury Fred Hutchinson Cancer Research Center, Seattle, WA, USA University of Washington, Seattle, WA, USA
Jover Lee Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Louise H Moncla Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Benjamin Pelle University of Washington, Seattle, WA, USA
Matthew Richardson University of Washington, Seattle, WA, USA
Erica Ryke University of Washington, Seattle, WA, USA
Hong Xie University of Washington, Seattle, WA, USA
Lasata Shrestha University of Washington, Seattle, WA, USA
Amin Addetia University of Washington, Seattle, WA, USA
Victoria M Rachleff Fred Hutchinson Cancer Research Center, Seattle, WA, USA University of Washington, Seattle, WA, USA
Nicole A P Lieberman University of Washington, Seattle, WA, USA
Meei-Li Huang University of Washington, Seattle, WA, USA
Romesh Gautom Washington State Department of Health, Shoreline, WA, USA
Geoff Melly Washington State Department of Health, Shoreline, WA, USA
Brian Hiatt Washington State Department of Health, Shoreline, WA, USA
Philip Dykema Washington State Department of Health, Shoreline, WA, USA
Amanda Adler Seattle Children's Research Institute, Seattle, WA, USA
Elisabeth Brandstetter University of Washington, Seattle, WA, USA
Peter D Han University of Washington, Seattle, WA, USA
Kairsten Fay Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Misja Llcisin Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Kristen Lacombe Seattle Children's Research Institute, Seattle, WA, USA
Thomas R Sibley Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Melissa Truong University of Washington, Seattle, WA, USA
Caitlin R Wolf University of Washington, Seattle, WA, USA
Michael Boeckh Fred Hutchinson Cancer Research Center, Seattle, WA, USA University of Washington, Seattle, WA, USA Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
Janet A Englund University of Washington, Seattle, WA, USA Seattle Children's Research Institute, Seattle, WA, USA
Michael Famulare Institute for Disease Modeling, Bellevue, WA, USA
Barry R Lutz University of Washington, Seattle, WA, USA Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
Mark J Rieder Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
Matthew Thompson University of Washington, Seattle, WA, USA
Jeffrey S Duchin University of Washington, Seattle, WA, USA Public Health - Seattle & King County, Seattle, WA, USA
Lea M Starita University of Washington, Seattle, WA, USA Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
Helen Y Chu University of Washington, Seattle, WA, USA Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
Jay Shendure University of Washington, Seattle, WA, USA Brotman Baty Institute for Precision Medicine, Seattle, WA, USA Howard Hughes Medical Institute, Seattle, WA, USA
Keith R Jerome Fred Hutchinson Cancer Research Center, Seattle, WA, USA University of Washington, Seattle, WA, USA
Scott Lindquist Washington State Department of Health, Shoreline, WA, USA
Alexander L Greninger Fred Hutchinson Cancer Research Center, Seattle, WA, USA University of Washington, Seattle, WA, USA
Deborah A Nickerson University of Washington, Seattle, WA, USA Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
Trevor Bedford Fred Hutchinson Cancer Research Center, Seattle, WA, USA University of Washington, Seattle, WA, USA Brotman Baty Institute for Precision Medicine, Seattle, WA, USA

Collapse

Müller NF, Rasmussen D, Stadler T. MASCOT: parameter and state inference under the marginal structured coalescent approximation. Bioinformatics 2019;34:3843-3848. [PMID: 29790921 PMCID: PMC6223361 DOI: 10.1093/bioinformatics/bty406] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2017] [Accepted: 05/16/2018] [Indexed: 11/16/2022] Open

Müller NF, Dudas G, Stadler T. Inferring time-dependent migration and coalescence patterns from genetic sequence and predictor data in structured populations. Virus Evol 2019;5:vez030. [PMID: 31428459 PMCID: PMC6693038 DOI: 10.1093/ve/vez030] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open

Bloomfield S, Vaughan T, Benschop J, Marshall J, Hayman D, Biggs P, Carter P, French N. Investigation of the validity of two Bayesian ancestral state reconstruction models for estimating Salmonella transmission during outbreaks. PLoS One 2019;14:e0214169. [PMID: 31329588 PMCID: PMC6645465 DOI: 10.1371/journal.pone.0214169] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2019] [Accepted: 07/08/2019] [Indexed: 01/24/2023] Open

Duchene S, Bouckaert R, Duchene DA, Stadler T, Drummond AJ. Phylodynamic Model Adequacy Using Posterior Predictive Simulations. Syst Biol 2019;68:358-364. [PMID: 29945220 PMCID: PMC6368481 DOI: 10.1093/sysbio/syy048] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2018] [Accepted: 06/15/2018] [Indexed: 11/18/2022] Open

Bouckaert R, Vaughan TG, Barido-Sottani J, Duchêne S, Fourment M, Gavryushkina A, Heled J, Jones G, Kühnert D, De Maio N, Matschiner M, Mendes FK, Müller NF, Ogilvie HA, du Plessis L, Popinga A, Rambaut A, Rasmussen D, Siveroni I, Suchard MA, Wu CH, Xie D, Zhang C, Stadler T, Drummond AJ. BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis. PLoS Comput Biol 2019;15:e1006650. [PMID: 30958812 PMCID: PMC6472827 DOI: 10.1371/journal.pcbi.1006650] [Citation(s) in RCA: 1552] [Impact Index Per Article: 310.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Revised: 04/18/2019] [Accepted: 02/04/2019] [Indexed: 11/18/2022] Open

Affiliation(s)

Remco Bouckaert Centre of Computational Evolution, University of Auckland, Auckland, New Zealand Max Planck Institute for the Science of Human History, Jena, Germany
Timothy G. Vaughan ETH Zürich, Department of Biosystems Science and Engineering, 4058 Basel, Switzerland Swiss Institute of Bioinformatics, Lausanne, Switzerland
Joëlle Barido-Sottani ETH Zürich, Department of Biosystems Science and Engineering, 4058 Basel, Switzerland Swiss Institute of Bioinformatics, Lausanne, Switzerland
Sebastián Duchêne Department of Biochemistry and Molecular Biology, University of Melbourne, Melbourne, Victoria, Australia
Mathieu Fourment ithree institute, University of Technology Sydney, Sydney, Australia
Alexandra Gavryushkina Department of Biochemistry, University of Otago, Dunedin 9016, New Zealand
Joseph Heled Independent researcher, Auckland, New Zealand
Graham Jones Department of Biological and Environmental Sciences, University of Gothenburg, Box 461, SE 405 30 Göteborg, Sweden
Denise Kühnert Max Planck Institute for the Science of Human History, Jena, Germany
Nicola De Maio European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridgeshire, UK
Michael Matschiner Department of Environmental Sciences, University of Basel, 4051 Basel, Switzerland
Fábio K. Mendes Centre of Computational Evolution, University of Auckland, Auckland, New Zealand
Nicola F. Müller ETH Zürich, Department of Biosystems Science and Engineering, 4058 Basel, Switzerland Swiss Institute of Bioinformatics, Lausanne, Switzerland
Huw A. Ogilvie Department of Computer Science, Rice University, Houston, TX 77005-1892, USA
Louis du Plessis Department of Zoology, University of Oxford, Oxford, OX1 3PS, UK
Alex Popinga Centre of Computational Evolution, University of Auckland, Auckland, New Zealand
Andrew Rambaut Institute of Evolutionary Biology, University of Edinburgh, Ashworth Laboratories, Edinburgh, EH9 3FL UK
David Rasmussen Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, NC 27695, USA
Igor Siveroni Department of Infectious Disease Epidemiology, Imperial College London, Norfolk Place, W2 1PG, UK
Marc A. Suchard Department of Biomathematics, David Geffen School of Medicine, University of California, Los Angeles, CA, USA
Chieh-Hsi Wu Department of Statistics, University of Oxford, OX1 3LB, UK
Dong Xie Centre of Computational Evolution, University of Auckland, Auckland, New Zealand
Chi Zhang Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing, China
Tanja Stadler ETH Zürich, Department of Biosystems Science and Engineering, 4058 Basel, Switzerland Swiss Institute of Bioinformatics, Lausanne, Switzerland
Alexei J. Drummond Centre of Computational Evolution, University of Auckland, Auckland, New Zealand

Collapse

Magiorkinis G, Karamitros T, Vasylyeva TI, Williams LD, Mbisa JL, Hatzakis A, Paraskevis D, Friedman SR. An Innovative Study Design to Assess the Community Effect of Interventions to Mitigate HIV Epidemics Using Transmission-Chain Phylodynamics. Am J Epidemiol 2018;187:2615-2622. [PMID: 30101288 DOI: 10.1093/aje/kwy160] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2018] [Accepted: 07/24/2018] [Indexed: 11/13/2022] Open

Volz EM, Didelot X. Modeling the Growth and Decline of Pathogen Effective Population Size Provides Insight into Epidemic Dynamics and Drivers of Antimicrobial Resistance. Syst Biol 2018;67:719-728. [PMID: 29432602 PMCID: PMC6005154 DOI: 10.1093/sysbio/syy007] [Citation(s) in RCA: 57] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2017] [Accepted: 02/04/2018] [Indexed: 12/15/2022] Open

Abstract

Nonparametric population genetic modeling provides a simple and flexible approach for studying demographic history and epidemic dynamics using pathogen sequence data. Existing Bayesian approaches are premised on stochastic processes with stationary increments which may provide an unrealistic prior for epidemic histories which feature extended period of exponential growth or decline. We show that nonparametric models defined in terms of the growth rate of the effective population size can provide a more realistic prior for epidemic history. We propose a nonparametric autoregressive model on the growth rate as a prior for effective population size, which corresponds to the dynamics expected under many epidemic situations. We demonstrate the use of this model within a Bayesian phylodynamic inference framework. Our method correctly reconstructs trends of epidemic growth and decline from pathogen genealogies even when genealogical data are sparse and conventional skyline estimators erroneously predict stable population size. We also propose a regression approach for relating growth rates of pathogen effective population size and time-varying variables that may impact the replicative fitness of a pathogen. The model is applied to real data from rabies virus and Staphylococcus aureus epidemics. We find a close correspondence between the estimated growth rates of a lineage of methicillin-resistant S. aureus and population-level prescription rates of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$\beta$\end{document}-lactam antibiotics. The new models are implemented in an open source R package called skygrowth which is available at https://github.com/mrc-ide/skygrowth.

Collapse

Vaughan TG. IcyTree: rapid browser-based visualization for phylogenetic trees and networks. Bioinformatics 2018;33:2392-2394. [PMID: 28407035 PMCID: PMC5860111 DOI: 10.1093/bioinformatics/btx155] [Citation(s) in RCA: 57] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2016] [Accepted: 03/21/2017] [Indexed: 01/25/2023] Open

Müller NF, Rasmussen DA, Stadler T. The Structured Coalescent and Its Approximations. Mol Biol Evol 2018;34:2970-2981. [PMID: 28666382 PMCID: PMC5850743 DOI: 10.1093/molbev/msx186] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open

Boskova V, Stadler T, Magnus C. The influence of phylodynamic model specifications on parameter estimates of the Zika virus epidemic. Virus Evol 2018;4:vex044. [PMID: 29403651 PMCID: PMC5789282 DOI: 10.1093/ve/vex044] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open

Abstract

Each new virus introduced into the human population could potentially spread and cause a worldwide epidemic. Thus, early quantification of epidemic spread is crucial. Real-time sequencing followed by Bayesian phylodynamic analysis has proven to be extremely informative in this respect. Bayesian phylodynamic analyses require a model to be chosen and prior distributions on model parameters to be specified. We study here how choices regarding the tree prior influence quantification of epidemic spread in an emerging epidemic by focusing on estimates of the parameters clock rate, tree height, and reproductive number in the currently ongoing Zika virus epidemic in the Americas. While parameter estimates are quite robust to reasonable variations in the model settings when studying the complete data set, it is impossible to obtain unequivocal estimates when reducing the data to local Zika epidemics in Brazil and Florida, USA. Beyond the empirical insights, this study highlights the conceptual differences between the so-called birth-death and coalescent tree priors: while sequence sampling times alone can strongly inform the tree height and reproductive number under a birth-death model, the coalescent tree height prior is typically only slightly influenced by this information. Such conceptual differences together with non-trivial interactions of different priors complicate proper interpretation of empirical results. Overall, our findings indicate that phylodynamic analyses of early viral spread data must be carried out with care as data sets may not necessarily be informative enough yet to provide estimates robust to prior settings. It is necessary to do a robustness check of these data sets by scanning several models and prior distributions. Only if the posterior distributions are robust to reasonable changes of the prior distribution, the parameter estimates can be trusted. Such robustness tests will help making real-time phylodynamic analyses of spreading epidemic more reliable in the future.

Collapse

McCloskey RM, Poon AFY. A model-based clustering method to detect infectious disease transmission outbreaks from sequence variation. PLoS Comput Biol 2017;13:e1005868. [PMID: 29131825 PMCID: PMC5703573 DOI: 10.1371/journal.pcbi.1005868] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2017] [Revised: 11/27/2017] [Accepted: 11/02/2017] [Indexed: 01/07/2023] Open

Abstract

Clustering infections by genetic similarity is a popular technique for identifying potential outbreaks of infectious disease, in part because sequences are now routinely collected for clinical management of many infections. A diverse number of nonparametric clustering methods have been developed for this purpose. These methods are generally intuitive, rapid to compute, and readily scale with large data sets. However, we have found that nonparametric clustering methods can be biased towards identifying clusters of diagnosis—where individuals are sampled sooner post-infection—rather than the clusters of rapid transmission that are meant to be potential foci for public health efforts. We develop a fundamentally new approach to genetic clustering based on fitting a Markov-modulated Poisson process (MMPP), which represents the evolution of transmission rates along the tree relating different infections. We evaluated this model-based method alongside five nonparametric clustering methods using both simulated and actual HIV sequence data sets. For simulated clusters of rapid transmission, the MMPP clustering method obtained higher mean sensitivity (85%) and specificity (91%) than the nonparametric methods. When we applied these clustering methods to published sequences from a study of HIV-1 genetic clusters in Seattle, USA, we found that the MMPP method categorized about half (46%) as many individuals to clusters compared to the other methods. Furthermore, the mean internal branch lengths that approximate transmission rates were significantly shorter in clusters extracted using MMPP, but not by other methods. We determined that the computing time for the MMPP method scaled linearly with the size of trees, requiring about 30 seconds for a tree of 1,000 tips and about 20 minutes for 50,000 tips on a single computer. This new approach to genetic clustering has significant implications for the application of pathogen sequence analysis to public health, where it is critical to robustly and accurately identify clusters for the most cost-effective deployment of outbreak management and prevention resources.

Many pathogens evolve so rapidly that they accumulate genetic differences within a host before becoming transmitted to the next host. Consequently, clusters of sampled infections with nearly identical genomes may reveal outbreaks of recent or ongoing transmissions. There is rapidly growing interest in using model-free genetic clustering methods to guide public health responses to epidemics in near real-time, including HIV, Ebola virus and tuberculosis. However, we show that current methods are relatively ineffective at detecting transmission outbreaks; instead, they are predominantly influenced by how infections are sampled from the population. We describe a fundamentally new approach to genetic clustering that is based on modelling changes in transmission rates during the spread of the epidemic. We use simulated and real pathogen sequence data sets to demonstrate that this model-based approach is substantially more effective for detecting transmission outbreaks, and remains fast enough for real-time applications to large sequence databases.

Collapse

Dearlove BL, Xiang F, Frost SDW. Biased phylodynamic inferences from analysing clusters of viral sequences. Virus Evol 2017;3:vex020. [PMID: 28852573 PMCID: PMC5570026 DOI: 10.1093/ve/vex020] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open

Inferring epidemiological parameters from phylogenies using regression-ABC: A comparative study. PLoS Comput Biol 2017;13:e1005416. [PMID: 28263987 PMCID: PMC5358897 DOI: 10.1371/journal.pcbi.1005416] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2016] [Revised: 03/20/2017] [Accepted: 02/16/2017] [Indexed: 02/06/2023] Open

Abstract

Inferring epidemiological parameters such as the R₀ from time-scaled phylogenies is a timely challenge. Most current approaches rely on likelihood functions, which raise specific issues that range from computing these functions to finding their maxima numerically. Here, we present a new regression-based Approximate Bayesian Computation (ABC) approach, which we base on a large variety of summary statistics intended to capture the information contained in the phylogeny and its corresponding lineage-through-time plot. The regression step involves the Least Absolute Shrinkage and Selection Operator (LASSO) method, which is a robust machine learning technique. It allows us to readily deal with the large number of summary statistics, while avoiding resorting to Markov Chain Monte Carlo (MCMC) techniques. To compare our approach to existing ones, we simulated target trees under a variety of epidemiological models and settings, and inferred parameters of interest using the same priors. We found that, for large phylogenies, the accuracy of our regression-ABC is comparable to that of likelihood-based approaches involving birth-death processes implemented in BEAST2. Our approach even outperformed these when inferring the host population size with a Susceptible-Infected-Removed epidemiological model. It also clearly outperformed a recent kernel-ABC approach when assuming a Susceptible-Infected epidemiological model with two host types. Lastly, by re-analyzing data from the early stages of the recent Ebola epidemic in Sierra Leone, we showed that regression-ABC provides more realistic estimates for the duration parameters (latency and infectiousness) than the likelihood-based method. Overall, ABC based on a large variety of summary statistics and a regression method able to perform variable selection and avoid overfitting is a promising approach to analyze large phylogenies.

Given the rapid evolution of many pathogens, analysing their genomes by means of phylogenies can inform us about how they spread. This is the focus of the field known as “phylodynamics”. Most existing methods inferring epidemiological parameters from virus phylogenies are limited by the difficulty of handling complex likelihood functions, which commonly incorporate latent variables. Here, we use an alternative method known as regression-based Approximate Bayesian Computation (ABC), which circumvents this problem by using simulations and dataset comparisons. Since phylogenies are difficult to compare to one another, we introduce many summary statistics to describe them and take advantage of current machine learning techniques able to perform variable selection. We show that the accuracy we reach is comparable to that of existing methods. This accuracy increases with phylogeny size and can even be higher than that of existing methods for some parameters. Overall, regression-based ABC opens new perspectives to infer epidemiological parameters from large phylogenies.

Collapse

Poon AFY. Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks. Virus Evol 2016;2:vew031. [PMID: 28058111 PMCID: PMC5210024 DOI: 10.1093/ve/vew031] [Citation(s) in RCA: 61] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open

Spiro A, Shapiro E. eSTGt: a programming and simulation environment for population dynamics. BMC Bioinformatics 2016;17:187. [PMID: 27117841 PMCID: PMC4847376 DOI: 10.1186/s12859-016-1004-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2015] [Accepted: 03/29/2016] [Indexed: 11/10/2022] Open

Kühnert D, Stadler T, Vaughan TG, Drummond AJ. Phylodynamics with Migration: A Computational Framework to Quantify Population Structure from Genomic Data. Mol Biol Evol 2016;33:2102-16. [PMID: 27189573 PMCID: PMC4948704 DOI: 10.1093/molbev/msw064] [Citation(s) in RCA: 80] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open

Volz EM, Frost SDW. Sampling through time and phylodynamic inference with coalescent and birth-death models. J R Soc Interface 2015;11:20140945. [PMID: 25401173 PMCID: PMC4223917 DOI: 10.1098/rsif.2014.0945] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Poon AFY. Phylodynamic Inference with Kernel ABC and Its Application to HIV Epidemiology. Mol Biol Evol 2015;32:2483-95. [PMID: 26006189 PMCID: PMC4540972 DOI: 10.1093/molbev/msv123] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open

Inferring epidemiological dynamics with Bayesian coalescent inference: the merits of deterministic and stochastic models. Genetics 2014;199:595-607. [PMID: 25527289 PMCID: PMC4317665 DOI: 10.1534/genetics.114.172791] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open

Spiro A, Cardelli L, Shapiro E. Lineage grammars: describing, simulating and analyzing population dynamics. BMC Bioinformatics 2014;15:249. [PMID: 25047682 PMCID: PMC4223406 DOI: 10.1186/1471-2105-15-249] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2014] [Accepted: 07/07/2014] [Indexed: 11/17/2022] Open

Abstract

Background

Precise description of the dynamics of biological processes would enable the mathematical analysis and computational simulation of complex biological phenomena. Languages such as Chemical Reaction Networks and Process Algebras cater for the detailed description of interactions among individuals and for the simulation and analysis of ensuing behaviors of populations. However, often knowledge of such interactions is lacking or not available. Yet complete oblivion to the environment would make the description of any biological process vacuous. Here we present a language for describing population dynamics that abstracts away detailed interaction among individuals, yet captures in broad terms the effect of the changing environment, based on environment-dependent Stochastic Tree Grammars (eSTG). It is comprised of a set of stochastic tree grammar transition rules, which are context-free and as such abstract away specific interactions among individuals. Transition rule probabilities and rates, however, can depend on global parameters such as population size, generation count, and elapsed time.

Results

We show that eSTGs conveniently describe population dynamics at multiple levels including cellular dynamics, tissue development and niches of organisms. Notably, we show the utilization of eSTG for cases in which the dynamics is regulated by environmental factors, which affect the fate and rate of decisions of the different species. eSTGs are lineage grammars, in the sense that execution of an eSTG program generates the corresponding lineage trees, which can be used to analyze the evolutionary and developmental history of the biological system under investigation. These lineage trees contain a representation of the entire events history of the system, including the dynamics that led to the existing as well as to the extinct individuals.

Conclusions

We conclude that our suggested formalism can be used to easily specify, simulate and analyze complex biological systems, and supports modular description of local biological dynamics that can be later used as “black boxes” in a larger scope, thus enabling a gradual and hierarchical definition and simulation of complex biological systems. The simple, yet robust formalism enables to target a broad class of stochastic dynamic behaviors, especially those that can be modeled using global environmental feedback regulation rather than direct interaction between individuals.

Collapse

Vaughan TG, Kühnert D, Popinga A, Welch D, Drummond AJ. Efficient Bayesian inference under the structured coalescent. Bioinformatics 2014;30:2272-9. [PMID: 24753484 PMCID: PMC4207426 DOI: 10.1093/bioinformatics/btu201] [Citation(s) in RCA: 97] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open

Affiliation(s)

Timothy G Vaughan Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North 4442, New Zealand, Institute of Integrative Biology, Swiss Federal Institute of Technology (ETH), Zurich 8092, Switzerland and Department of Computer Science, University of Auckland, Auckland 1142, New Zealand
Denise Kühnert Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North 4442, New Zealand, Institute of Integrative Biology, Swiss Federal Institute of Technology (ETH), Zurich 8092, Switzerland and Department of Computer Science, University of Auckland, Auckland 1142, New ZealandAllan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North 4442, New Zealand, Institute of Integrative Biology, Swiss Federal Institute of Technology (ETH), Zurich 8092, Switzerland and Department of Computer Science, University of Auckland, Auckland 1142, New ZealandAllan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North 4442, New Zealand, Institute of Integrative Biology, Swiss Federal Institute of Technology (ETH), Zurich 8092, Switzerland and Department of Computer Science, University of Auckland, Auckland 1142, New Zealand
Alex Popinga Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North 4442, New Zealand, Institute of Integrative Biology, Swiss Federal Institute of Technology (ETH), Zurich 8092, Switzerland and Department of Computer Science, University of Auckland, Auckland 1142, New ZealandAllan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North 4442, New Zealand, Institute of Integrative Biology, Swiss Federal Institute of Technology (ETH), Zurich 8092, Switzerland and Department of Computer Science, University of Auckland, Auckland 1142, New Zealand
David Welch Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North 4442, New Zealand, Institute of Integrative Biology, Swiss Federal Institute of Technology (ETH), Zurich 8092, Switzerland and Department of Computer Science, University of Auckland, Auckland 1142, New ZealandAllan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North 4442, New Zealand, Institute of Integrative Biology, Swiss Federal Institute of Technology (ETH), Zurich 8092, Switzerland and Department of Computer Science, University of Auckland, Auckland 1142, New Zealand
Alexei J Drummond Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North 4442, New Zealand, Institute of Integrative Biology, Swiss Federal Institute of Technology (ETH), Zurich 8092, Switzerland and Department of Computer Science, University of Auckland, Auckland 1142, New ZealandAllan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North 4442, New Zealand, Institute of Integrative Biology, Swiss Federal Institute of Technology (ETH), Zurich 8092, Switzerland and Department of Computer Science, University of Auckland, Auckland 1142, New Zealand

Collapse

Bouckaert R, Heled J, Kühnert D, Vaughan T, Wu CH, Xie D, Suchard MA, Rambaut A, Drummond AJ. BEAST 2: a software platform for Bayesian evolutionary analysis. PLoS Comput Biol 2014;10:e1003537. [PMID: 24722319 PMCID: PMC3985171 DOI: 10.1371/journal.pcbi.1003537] [Citation(s) in RCA: 3701] [Impact Index Per Article: 370.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2013] [Accepted: 01/20/2014] [Indexed: 12/15/2022] Open

Bouckaert R, Heled J, Kühnert D, Vaughan T, Wu CH, Xie D, Suchard MA, Rambaut A, Drummond AJ. BEAST 2: a software platform for Bayesian evolutionary analysis. PLoS Comput Biol 2014. [PMID: 24722319 DOI: 10.1371/journal.pcbi.1003537i] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2023] Open