51
|
Verotta D, Haagensen J, Spormann AM, Yang K. Mathematical Modeling of Biofilm Structures Using COMSTAT Data. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2017; 2017:7246286. [PMID: 29422943 PMCID: PMC5751404 DOI: 10.1155/2017/7246286] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/10/2017] [Revised: 11/14/2017] [Accepted: 11/26/2017] [Indexed: 01/26/2023]
Abstract
Mathematical modeling holds great potential for quantitatively describing biofilm growth in presence or absence of chemical agents used to limit or promote biofilm growth. In this paper, we describe a general mathematical/statistical framework that allows for the characterization of complex data in terms of few parameters and the capability to (i) compare different experiments and exposures to different agents, (ii) test different hypotheses regarding biofilm growth and interaction with different agents, and (iii) simulate arbitrary administrations of agents. The mathematical framework is divided to submodels characterizing biofilm, including new models characterizing live biofilm growth and dead cell accumulation; the interaction with agents inhibiting or stimulating growth; the kinetics of the agents. The statistical framework can take into account measurement and interexperiment variation. We demonstrate the application of (some of) the models using confocal microscopy data obtained using the computer program COMSTAT.
Collapse
Affiliation(s)
- Davide Verotta
- Department of Clinical Pharmacy, School of Pharmacy, University of California San Francisco, San Francisco, CA, USA
| | - Janus Haagensen
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kogle Alle 6, 2970 Hørsholm, Denmark
| | - Alfred M. Spormann
- Department of Civil and Environmental Engineering, James H. Clark Center, Stanford University, Rm E250, 318 Campus Drive, Stanford, CA 94305, USA
| | - Katherine Yang
- Department of Clinical Pharmacy, School of Pharmacy, University of California San Francisco, San Francisco, CA, USA
| |
Collapse
|
52
|
Gonçalves FB, Łatuszyński K, Roberts GO. Barker’s algorithm for Bayesian inference with intractable likelihoods. BRAZ J PROBAB STAT 2017. [DOI: 10.1214/17-bjps374] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
53
|
Coupling stochastic EM and approximate Bayesian computation for parameter inference in state-space models. Comput Stat 2017. [DOI: 10.1007/s00180-017-0770-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
54
|
Affiliation(s)
- Minh-Ngoc Tran
- University of Sydney Business School, University of Sydney, NSW, Australia
| | - David J. Nott
- Department of Statistics and Applied Probability, National University of Singapore, Singapore
| | - Robert Kohn
- UNSW Business School, University of New South Wales, Sydney NSW, Australia
| |
Collapse
|
55
|
Affiliation(s)
- L. F. Price
- School of Mathematical Sciences, Queensland University of Technology, Australia and Australian Research Council Centre of Excellence for Mathematical and Statistical Frontiers (ACEMS)
| | - C. C. Drovandi
- School of Mathematical Sciences, Queensland University of Technology, Australia and Australian Research Council Centre of Excellence for Mathematical and Statistical Frontiers (ACEMS)
| | - A. Lee
- Department of Statistics, University of Warwick, Coventry, UK
| | - D. J. Nott
- Department of Statistics and Applied Probability, National University of Singapore, Singapore
| |
Collapse
|
56
|
Quiroz M, Tran MN, Villani M, Kohn R. Speeding up MCMC by Delayed Acceptance and Data Subsampling. J Comput Graph Stat 2017. [DOI: 10.1080/10618600.2017.1307117] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Matias Quiroz
- Division of Statistics and Machine Learning, Linköping University, Linköping, Sweden
- Research Division, Sveriges Riksbank, Stockholm, Sweden
| | - Minh-Ngoc Tran
- Discipline of Business Analytics, University of Sydney, Camperdown NSW, Australia
| | - Mattias Villani
- Division of Statistics and Machine Learning, Linköping University, Linköping, Sweden
| | - Robert Kohn
- Australian School of Business, University of New South Wales, Sydney NSW, Australia
| |
Collapse
|
57
|
Affiliation(s)
| | | | - Anthony Lee
- Department of Statistics, University of Warwick, Coventry, UK
| |
Collapse
|
58
|
Salmona J, Heller R, Quéméré E, Chikhi L. Climate change and human colonization triggered habitat loss and fragmentation in Madagascar. Mol Ecol 2017; 26:5203-5222. [DOI: 10.1111/mec.14173] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2016] [Revised: 04/24/2017] [Accepted: 05/02/2017] [Indexed: 12/20/2022]
Affiliation(s)
- Jordi Salmona
- Instituto Gulbenkian de Ciênca; Oeiras Portugal
- Laboratoire Evolution & Diversité Biologique; UMR 5174 CNRS; Université Paul Sabatier; Toulouse France
- UMR 5174 EDB; Université de Toulouse; Toulouse France
| | - Rasmus Heller
- Department of Biology; University of Copenhagen; Copenhagen N Denmark
| | - Erwan Quéméré
- CEFS; Université de Toulouse; INRA; Castanet-Tolosan France
| | - Lounès Chikhi
- Instituto Gulbenkian de Ciênca; Oeiras Portugal
- Laboratoire Evolution & Diversité Biologique; UMR 5174 CNRS; Université Paul Sabatier; Toulouse France
- UMR 5174 EDB; Université de Toulouse; Toulouse France
| |
Collapse
|
59
|
Sherlock C, Thiery AH, Lee A. Pseudo-marginal Metropolis–Hastings sampling using averages of unbiased estimators. Biometrika 2017. [DOI: 10.1093/biomet/asx031] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Summary
We consider a pseudo-marginal Metropolis–Hastings kernel ${\mathbb{P}}_m$ that is constructed using an average of $m$ exchangeable random variables, and an analogous kernel ${\mathbb{P}}_s$ that averages $s<m$ of these same random variables. Using an embedding technique to facilitate comparisons, we provide a lower bound for the asymptotic variance of any ergodic average associated with ${\mathbb{P}}_m$ in terms of the asymptotic variance of the corresponding ergodic average associated with ${\mathbb{P}}_s$. We show that the bound is tight and disprove a conjecture that when the random variables to be averaged are independent, the asymptotic variance under ${\mathbb{P}}_m$ is never less than $s/m$ times the variance under ${\mathbb{P}}_s$. The conjecture does, however, hold for continuous-time Markov chains. These results imply that if the computational cost of the algorithm is proportional to $m$, it is often better to set $m=1$. We provide intuition as to why these findings differ so markedly from recent results for pseudo-marginal kernels employing particle filter approximations. Our results are exemplified through two simulation studies; in the first the computational cost is effectively proportional to $m$ and in the second there is a considerable start-up cost at each iteration.
Collapse
Affiliation(s)
- Chris Sherlock
- Department of Mathematics and Statistics, Lancaster University, Lancaster LA1 4YF, U.K.
| | - Alexandre H. Thiery
- Department of Statistics and Applied Probability, National University of Singapore, Singapore 117543
| | - Anthony Lee
- Department of Statistics, University of Warwick, Coventry CV4 7AL, U.K.
| |
Collapse
|
60
|
Miura O, Kanaya G, Nakai S, Itoh H, Chiba S, Makino W, Nishimura T, Kojima S, Urabe J. Ecological and genetic impact of the 2011 Tohoku Earthquake Tsunami on intertidal mud snails. Sci Rep 2017; 7:44375. [PMID: 28281698 PMCID: PMC5345064 DOI: 10.1038/srep44375] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Accepted: 02/07/2017] [Indexed: 11/13/2022] Open
Abstract
Natural disturbances often destroy local populations and can considerably affect the genetic properties of these populations. The 2011 Tohoku Earthquake Tsunami greatly damaged local populations of various coastal organisms, including the mud snail Batillaria attramentaria, which was an abundant macroinvertebrate on the tidal flats in the Tohoku region. To evaluate the impact of the tsunami on the ecology and population genetic properties of these snails, we monitored the density, shell size, and microsatellite DNA variation of B. attramentaria for more than ten years (2005–2015) throughout the disturbance event. We found that the density of snails declined immediately after the tsunami. Bayesian inference of the genetically effective population size (Ne) demonstrated that the Ne declined by 60–99% at the study sites exposed to the tsunami. However, we found that their genetic diversity was not significantly reduced after the tsunami. The maintenance of genetic diversity is essential for long-term survival of local populations, and thus, the observed genetic robustness could play a key role in the persistence of snail populations in this region which has been devastated by similar tsunamis every 500–800 years. Our findings have significant implications for understanding the sustainability of populations damaged by natural disturbances.
Collapse
Affiliation(s)
- Osamu Miura
- Faculty of Agriculture and Marine Science, Kochi University, 200 Monobe, Nankoku, Kochi 783-8502, Japan
| | - Gen Kanaya
- National Institute for Environmental Studies, 16-2 Onogawa, Tsukuba, Ibaraki 305-8506, Japan
| | - Shizuko Nakai
- Department of Marine Science and Resources, College of Bioresource Sciences Nihon University, 1866 Kameino, Fujisawa, Kanagawa 252-0880, Japan
| | - Hajime Itoh
- Atmosphere and Ocean Research Institute, The University of Tokyo, 5-1-5, Kashiwanoha, Kashiwa, Chiba 277-8564, Japan
| | - Satoshi Chiba
- Department of Environmental Life Sciences, Graduate School of Life Sciences, Tohoku University, Kawauchi 41, Aoba-ku, Sendai, Miyagi 980-0862, Japan
| | - Wataru Makino
- Division of Ecology and Evolutionary Biology, Graduate School of Life Sciences, Tohoku University, Sendai, Miyagi 980-8578, Japan
| | - Tomohiro Nishimura
- Laboratory of Aquatic Environmental Science, Faculty of Agriculture, Kochi University, Nankoku, Kochi 783-8502, Japan
| | - Shigeaki Kojima
- Atmosphere and Ocean Research Institute, The University of Tokyo, 5-1-5, Kashiwanoha, Kashiwa, Chiba 277-8564, Japan
| | - Jotaro Urabe
- Division of Ecology and Evolutionary Biology, Graduate School of Life Sciences, Tohoku University, Sendai, Miyagi 980-8578, Japan
| |
Collapse
|
61
|
Harris LN, Palstra FP, Bajno R, Gallagher CP, Howland KL, Taylor EB, Reist JD. Assessing conservation risks to populations of an anadromous Arctic salmonid, the northern Dolly Varden (Salvelinus malma malma), via estimates of effective and census population sizes and approximate Bayesian computation. CONSERV GENET 2016. [DOI: 10.1007/s10592-016-0915-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
62
|
Andrieu C, Vihola M. Establishing some order amongst exact approximations of MCMCs. ANN APPL PROBAB 2016. [DOI: 10.1214/15-aap1158] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
63
|
Wang J, Santiago E, Caballero A. Prediction and estimation of effective population size. Heredity (Edinb) 2016; 117:193-206. [PMID: 27353047 PMCID: PMC5026755 DOI: 10.1038/hdy.2016.43] [Citation(s) in RCA: 189] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2015] [Revised: 05/03/2016] [Accepted: 05/16/2016] [Indexed: 12/19/2022] Open
Abstract
Effective population size (Ne) is a key parameter in population genetics. It has important applications in evolutionary biology, conservation genetics and plant and animal breeding, because it measures the rates of genetic drift and inbreeding and affects the efficacy of systematic evolutionary forces, such as mutation, selection and migration. We review the developments in predictive equations and estimation methodologies of effective size. In the prediction part, we focus on the equations for populations with different modes of reproduction, for populations under selection for unlinked or linked loci and for the specific applications to conservation genetics. In the estimation part, we focus on methods developed for estimating the current or recent effective size from molecular marker or sequence data. We discuss some underdeveloped areas in predicting and estimating Ne for future research.
Collapse
Affiliation(s)
- J Wang
- Institute of Zoology, Zoological Society of London, London, UK
| | - E Santiago
- Departamento de Biología Funcional, Facultad de Biología, Universidad de Oviedo, Oviedo, Spain
| | - A Caballero
- Departamento de Bioquímica, Genética e Inmunología, Facultad de Biología, Universidad de Vigo, Vigo, Spain
| |
Collapse
|
64
|
Gingrich TR, Rotskoff GM, Crooks GE, Geissler PL. Near-optimal protocols in complex nonequilibrium transformations. Proc Natl Acad Sci U S A 2016; 113:10263-8. [PMID: 27573816 PMCID: PMC5027427 DOI: 10.1073/pnas.1606273113] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The development of sophisticated experimental means to control nanoscale systems has motivated efforts to design driving protocols that minimize the energy dissipated to the environment. Computational models are a crucial tool in this practical challenge. We describe a general method for sampling an ensemble of finite-time, nonequilibrium protocols biased toward a low average dissipation. We show that this scheme can be carried out very efficiently in several limiting cases. As an application, we sample the ensemble of low-dissipation protocols that invert the magnetization of a 2D Ising model and explore how the diversity of the protocols varies in response to constraints on the average dissipation. In this example, we find that there is a large set of protocols with average dissipation close to the optimal value, which we argue is a general phenomenon.
Collapse
Affiliation(s)
- Todd R Gingrich
- Physics of Living Systems Group, Department of Physics, Massachusetts Institute of Technology, Cambridge, MA 02139; Department of Chemistry, University of California, Berkeley, CA 94720;
| | - Grant M Rotskoff
- Biophysics Graduate Group, University of California, Berkeley, CA 94720
| | - Gavin E Crooks
- Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720; Kavli Energy NanoSciences Institute, Berkeley, CA 94720
| | - Phillip L Geissler
- Department of Chemistry, University of California, Berkeley, CA 94720; Biophysics Graduate Group, University of California, Berkeley, CA 94720; Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720; Chemical Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720
| |
Collapse
|
65
|
|
66
|
Bryant JV, Gottelli D, Zeng X, Hong X, Chan BPL, Fellowes JR, Zhang Y, Luo J, Durrant C, Geissmann T, Chatterjee HJ, Turvey ST. Assessing current genetic status of the Hainan gibbon using historical and demographic baselines: implications for conservation management of species of extreme rarity. Mol Ecol 2016; 25:3540-56. [PMID: 27273107 DOI: 10.1111/mec.13716] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2015] [Revised: 05/13/2016] [Accepted: 05/31/2016] [Indexed: 12/30/2022]
Abstract
Evidence-based conservation planning is crucial for informing management decisions for species of extreme rarity, but collection of robust data on genetic status or other parameters can be extremely challenging for such species. The Hainan gibbon, possibly the world's rarest mammal, consists of a single population of ~25 individuals restricted to one protected area on Hainan Island, China, and has persisted for over 30 years at exceptionally low population size. Analysis of genotypes at 11 microsatellite loci from faecal samples for 36% of the current global population and tissue samples from 62% of existing historical museum specimens demonstrates limited current genetic diversity (Na = 2.27, Ar = 2.24, He = 0.43); diversity has declined since the 19th century and even further within the last 30 years, representing declines of ~30% from historical levels (Na = 3.36, Ar = 3.29, He = 0.63). Significant differentiation is seen between current and historical samples (FST = 0.156, P = 0.0315), and the current population exhibits extremely small Ne (current Ne = 2.16). There is evidence for both a recent population bottleneck and an earlier bottleneck, with population size already reasonably low by the late 19th century (historical Ne = 1162.96). Individuals in the current population are related at the level of half- to full-siblings between social groups, and full-siblings or parent-offspring within a social group, suggesting that inbreeding is likely to increase in the future. The species' current reduced genetic diversity must be considered during conservation planning, particularly for expectations of likely population recovery, indicating that intensive, carefully planned management is essential.
Collapse
Affiliation(s)
- J V Bryant
- Institute of Zoology, Zoological Society of London, Regent's Park, London, NW1 4RY, UK.,Department of Genetics, Evolution & Environment, University College London, Gower Street, London, WC1E 6BT, UK
| | - D Gottelli
- Institute of Zoology, Zoological Society of London, Regent's Park, London, NW1 4RY, UK
| | - X Zeng
- Bawangling National Nature Reserve Management Office, Changjiang Lizu Autonomous County, Hainan, 572722, China
| | - X Hong
- Bawangling National Nature Reserve Management Office, Changjiang Lizu Autonomous County, Hainan, 572722, China
| | - B P L Chan
- Kadoorie Conservation China, Kadoorie Farm and Botanic Garden, Lam Kam Road, Tai Po, New Territories, Hong Kong
| | - J R Fellowes
- The Kadoorie Institute, University of Hong Kong, Pokfulam Road, Hong Kong
| | - Y Zhang
- Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China.,College of Life Sciences, Yunnan University, Kunming, 650091, China
| | - J Luo
- College of Life Sciences, Yunnan University, Kunming, 650091, China
| | - C Durrant
- Institute of Zoology, Zoological Society of London, Regent's Park, London, NW1 4RY, UK
| | - T Geissmann
- Anthropological Institute, University Zurich-Irchel, Winterthurerstrasse 190, Zurich, CH-8057, Switzerland
| | - H J Chatterjee
- Department of Genetics, Evolution & Environment, University College London, Gower Street, London, WC1E 6BT, UK
| | - S T Turvey
- Institute of Zoology, Zoological Society of London, Regent's Park, London, NW1 4RY, UK
| |
Collapse
|
67
|
Riquet F, Le Cam S, Fonteneau E, Viard F. Moderate genetic drift is driven by extreme recruitment events in the invasive mollusk Crepidula fornicata. Heredity (Edinb) 2016; 117:42-50. [PMID: 27118155 PMCID: PMC4901356 DOI: 10.1038/hdy.2016.24] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2015] [Revised: 03/07/2016] [Accepted: 03/11/2016] [Indexed: 11/09/2022] Open
Abstract
Effective population size (Ne) is a measure of genetic drift and is thus a central parameter in evolution, conservation genetics and invasion biology. Interestingly, in native marine species, Ne is typically several orders of magnitude lower than the census size. This pattern has often been explained by high fecundity, variation in reproductive success and pronounced early mortality, resulting in genetic drift across generations. Data documenting genetic drift and/or Ne in marine invasive species are, however, still scarce. We examined the importance of genetic drift in the invasive species Crepidula fornicata by genotyping 681 juveniles sampled during each annual recruitment peak over nine consecutive years in the Bay of Morlaix (Brittany, France). Observed variations in genetic diversity were partially explained by variation in recruitment intensity. In addition, substantial temporal genetic differentiation was documented (that is, genetic drift), and was attributed to nonrandom variance in the reproductive success of different breeding groups across years in the study species. Using a set of single-sample and temporal estimators for Ne, we estimated Ne to be three or four orders of magnitude smaller than the census size (Nc). On one hand, this reduction in Ne relative to Nc appeared congruent with, although slight higher than, values commonly observed in native marine species. Particular life-history traits of this invasive species may play an important role in buffering genetic drift. On the other hand, Ne still remained far below Nc, hence, possibly reducing the efficiency of selection effects.
Collapse
Affiliation(s)
- F Riquet
- UPMC Univ Paris 06, UMR 7144, Team Diversity and Connectivity in Coastal Marine Landscapes, Station Biologique de Roscoff, Roscoff, France
- CNRS, UMR 7144, Laboratory Adaptation and Diversity in the Marine Environment, Station Biologique de Roscoff, Roscoff, France
| | - S Le Cam
- UPMC Univ Paris 06, UMR 7144, Team Diversity and Connectivity in Coastal Marine Landscapes, Station Biologique de Roscoff, Roscoff, France
- CNRS, UMR 7144, Laboratory Adaptation and Diversity in the Marine Environment, Station Biologique de Roscoff, Roscoff, France
| | - E Fonteneau
- UPMC Univ Paris 06, UMR 7144, Team Diversity and Connectivity in Coastal Marine Landscapes, Station Biologique de Roscoff, Roscoff, France
- CNRS, UMR 7144, Laboratory Adaptation and Diversity in the Marine Environment, Station Biologique de Roscoff, Roscoff, France
| | - F Viard
- UPMC Univ Paris 06, UMR 7144, Team Diversity and Connectivity in Coastal Marine Landscapes, Station Biologique de Roscoff, Roscoff, France
- CNRS, UMR 7144, Laboratory Adaptation and Diversity in the Marine Environment, Station Biologique de Roscoff, Roscoff, France
| |
Collapse
|
68
|
Georgoulas A, Hillston J, Sanguinetti G. Unbiased Bayesian inference for population Markov jump processes via random truncations. STATISTICS AND COMPUTING 2016; 27:991-1002. [PMID: 28690370 PMCID: PMC5477715 DOI: 10.1007/s11222-016-9667-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/28/2015] [Accepted: 05/02/2016] [Indexed: 05/24/2023]
Abstract
We consider continuous time Markovian processes where populations of individual agents interact stochastically according to kinetic rules. Despite the increasing prominence of such models in fields ranging from biology to smart cities, Bayesian inference for such systems remains challenging, as these are continuous time, discrete state systems with potentially infinite state-space. Here we propose a novel efficient algorithm for joint state/parameter posterior sampling in population Markov Jump processes. We introduce a class of pseudo-marginal sampling algorithms based on a random truncation method which enables a principled treatment of infinite state spaces. Extensive evaluation on a number of benchmark models shows that this approach achieves considerable savings compared to state of the art methods, retaining accuracy and fast convergence. We also present results on a synthetic biology data set showing the potential for practical usefulness of our work.
Collapse
Affiliation(s)
| | - Jane Hillston
- School of Informatics, University of Edinburgh, Edinburgh, UK
| | - Guido Sanguinetti
- School of Informatics, University of Edinburgh, Edinburgh, UK
- Synthetic and Systems Biology, University of Edinburgh, Edinburgh, UK
| |
Collapse
|
69
|
Koepke AA, Longini IM, Halloran ME, Wakefield J, Minin VN. PREDICTIVE MODELING OF CHOLERA OUTBREAKS IN BANGLADESH. Ann Appl Stat 2016; 10:575-595. [PMID: 27746850 PMCID: PMC5061460 DOI: 10.1214/16-aoas908] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Despite seasonal cholera outbreaks in Bangladesh, little is known about the relationship between environmental conditions and cholera cases. We seek to develop a predictive model for cholera outbreaks in Bangladesh based on environmental predictors. To do this, we estimate the contribution of environmental variables, such as water depth and water temperature, to cholera outbreaks in the context of a disease transmission model. We implement a method which simultaneously accounts for disease dynamics and environmental variables in a Susceptible-Infected-Recovered-Susceptible (SIRS) model. The entire system is treated as a continuous-time hidden Markov model, where the hidden Markov states are the numbers of people who are susceptible, infected, or recovered at each time point, and the observed states are the numbers of cholera cases reported. We use a Bayesian framework to fit this hidden SIRS model, implementing particle Markov chain Monte Carlo methods to sample from the posterior distribution of the environmental and transmission parameters given the observed data. We test this method using both simulation and data from Mathbaria, Bangladesh. Parameter estimates are used to make short-term predictions that capture the formation and decline of epidemic peaks. We demonstrate that our model can successfully predict an increase in the number of infected individuals in the population weeks before the observed number of cholera cases increases, which could allow for early notification of an epidemic and timely allocation of resources.
Collapse
|
70
|
Coalescent Inference Using Serially Sampled, High-Throughput Sequencing Data from Intrahost HIV Infection. Genetics 2016; 202:1449-72. [PMID: 26857628 DOI: 10.1534/genetics.115.177931] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2015] [Accepted: 01/31/2016] [Indexed: 01/11/2023] Open
Abstract
Human immunodeficiency virus (HIV) is a rapidly evolving pathogen that causes chronic infections, so genetic diversity within a single infection can be very high. High-throughput "deep" sequencing can now measure this diversity in unprecedented detail, particularly since it can be performed at different time points during an infection, and this offers a potentially powerful way to infer the evolutionary dynamics of the intrahost viral population. However, population genomic inference from HIV sequence data is challenging because of high rates of mutation and recombination, rapid demographic changes, and ongoing selective pressures. In this article we develop a new method for inference using HIV deep sequencing data, using an approach based on importance sampling of ancestral recombination graphs under a multilocus coalescent model. The approach further extends recent progress in the approximation of so-called conditional sampling distributions, a quantity of key interest when approximating coalescent likelihoods. The chief novelties of our method are that it is able to infer rates of recombination and mutation, as well as the effective population size, while handling sampling over different time points and missing data without extra computational difficulty. We apply our method to a data set of HIV-1, in which several hundred sequences were obtained from an infected individual at seven time points over 2 years. We find mutation rate and effective population size estimates to be comparable to those produced by the software BEAST. Additionally, our method is able to produce local recombination rate estimates. The software underlying our method, Coalescenator, is freely available.
Collapse
|
71
|
Malaspinas AS. Methods to characterize selective sweeps using time serial samples: an ancient DNA perspective. Mol Ecol 2015; 25:24-41. [DOI: 10.1111/mec.13492] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2015] [Revised: 11/08/2015] [Accepted: 11/10/2015] [Indexed: 01/20/2023]
Affiliation(s)
- Anna-Sapfo Malaspinas
- Institute of Ecology and Evolution; University of Bern; Baltzerstrasse 6 CH-3012 Bern Switzerland
- Centre for GeoGenetics; Natural History Museum of Denmark; University of Copenhagen; Øster Voldgade 5-7 1350 Copenhagen Denmark
| |
Collapse
|
72
|
Marjoram P, Hamblin S, Foley B. Simulation-based Bayesian Analysis of Complex Data. SUMMER COMPUTER SIMULATION CONFERENCE : (SCSC 2015) : 2015 SUMMER SIMULATION MULTI-CONFERENCE (SUMMERSIM'15) : CHICAGO, ILLINOIS, USA, 26-29 JULY 2015. SUMMER COMPUTER SIMULATION CONFERENCE (2015 : CHICAGO, ILLINOIS) 2015; 47:176-183. [PMID: 27840859 PMCID: PMC5102508] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Our ability to collect large datasets is growing rapidly. Such richness of data offers great promise in terms of addressing detailed scientific questions in great depth. However, this benefit is not without scientific difficulty: many traditional analysis methods become computationally intractable for very large datasets. However, one can frequently still simulate data from scientific models for which direct calculation is no longer possible. In this paper we propose a Bayesian perspective for such analyses, and argue for the advantage of a simulation-based approximate Bayesian method that remains tractable when tractability of other methods is lost. This method, which is known as "approximate Bayesian computation" [ABC], has now been used in a variety of contexts, such as the analysis of tumor data (a tumor being a complex population of cells), and the analysis of human genetic variation data (which arise from a population of individual people). We review a number of ABC methods, with specific attention to the use of ABC in agent-based models, and give pointers to software that allows straightforward implementation of the ABC approach. In this way we demonstrate the utility of simulation-based analyses of large datasets within a rigorous statistical framework.
Collapse
Affiliation(s)
- Paul Marjoram
- University of Southern California, Dept of Preventive Medicine, Keck School of Medicine, Los Angeles, CA
| | - Steven Hamblin
- University of Southern California, Dept. of Molecular and Computational Biology, Los Angeles, CA
| | - Brad Foley
- University of Southern California, Dept. of Molecular and Computational Biology, Los Angeles, CA
| |
Collapse
|
73
|
Medina-Aguayo FJ, Lee A, Roberts GO. Stability of noisy Metropolis-Hastings. STATISTICS AND COMPUTING 2015; 26:1187-1211. [PMID: 32055107 PMCID: PMC6991990 DOI: 10.1007/s11222-015-9604-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/23/2015] [Accepted: 10/15/2015] [Indexed: 06/10/2023]
Abstract
Pseudo-marginal Markov chain Monte Carlo methods for sampling from intractable distributions have gained recent interest and have been theoretically studied in considerable depth. Their main appeal is that they are exact, in the sense that they target marginally the correct invariant distribution. However, the pseudo-marginal Markov chain can exhibit poor mixing and slow convergence towards its target. As an alternative, a subtly different Markov chain can be simulated, where better mixing is possible but the exactness property is sacrificed. This is the noisy algorithm, initially conceptualised as Monte Carlo within Metropolis, which has also been studied but to a lesser extent. The present article provides a further characterisation of the noisy algorithm, with a focus on fundamental stability properties like positive recurrence and geometric ergodicity. Sufficient conditions for inheriting geometric ergodicity from a standard Metropolis-Hastings chain are given, as well as convergence of the invariant distribution towards the true target distribution.
Collapse
Affiliation(s)
| | - A. Lee
- Department of Statistics, University of Warwick, Coventry, CV4 7AL UK
| | - G. O. Roberts
- Department of Statistics, University of Warwick, Coventry, CV4 7AL UK
| |
Collapse
|
74
|
Lyne AM, Girolami M, Atchadé Y, Strathmann H, Simpson D. On Russian Roulette Estimates for Bayesian Inference with Doubly-Intractable Likelihoods. Stat Sci 2015. [DOI: 10.1214/15-sts523] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
75
|
Sherlock C. Optimal Scaling for the Pseudo-Marginal Random Walk Metropolis: Insensitivity to the Noise Generating Mechanism. Methodol Comput Appl Probab 2015. [DOI: 10.1007/s11009-015-9471-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
76
|
Green GM, Schlafly EF, Finkbeiner DP, Rix HW, Martin N, Burgett W, Draper PW, Flewelling H, Hodapp K, Kaiser N, Kudritzki RP, Magnier E, Metcalfe N, Price P, Tonry J, Wainscoat R. A THREE-DIMENSIONAL MAP OF MILKY WAY DUST. ACTA ACUST UNITED AC 2015. [DOI: 10.1088/0004-637x/810/1/25] [Citation(s) in RCA: 369] [Impact Index Per Article: 36.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
77
|
Gilbert KJ, Whitlock MC. Evaluating methods for estimating local effective population size with and without migration. Evolution 2015; 69:2154-66. [DOI: 10.1111/evo.12713] [Citation(s) in RCA: 114] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2015] [Revised: 06/16/2015] [Accepted: 06/20/2015] [Indexed: 01/20/2023]
Affiliation(s)
- Kimberly J. Gilbert
- Department of Zoology; University of British Columbia; Vancouver BC V6T 1Z4 Canada
| | - Michael C. Whitlock
- Department of Zoology; University of British Columbia; Vancouver BC V6T 1Z4 Canada
| |
Collapse
|
78
|
Muchmore P, Marjoram P. Exact likelihood-free Markov chain Monte Carlo for elliptically contoured distributions. Stat Appl Genet Mol Biol 2015; 14:317-32. [PMID: 26167984 DOI: 10.1515/sagmb-2014-0063] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Recent results in Markov chain Monte Carlo (MCMC) show that a chain based on an unbiased estimator of the likelihood can have a stationary distribution identical to that of a chain based on exact likelihood calculations. In this paper we develop such an estimator for elliptically contoured distributions, a large family of distributions that includes and generalizes the multivariate normal. We then show how this estimator, combined with pseudorandom realizations of an elliptically contoured distribution, can be used to run MCMC in a way that replicates the stationary distribution of a likelihood based chain, but does not require explicit likelihood calculations. Because many elliptically contoured distributions do not have closed form densities, our simulation based approach enables exact MCMC based inference in a range of cases where previously it was impossible.
Collapse
|
79
|
Haario H, Kalachev L, Hakkarainen J. Generalized correlation integral vectors: A distance concept for chaotic dynamical systems. CHAOS (WOODBURY, N.Y.) 2015; 25:063102. [PMID: 26117096 DOI: 10.1063/1.4921939] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Several concepts of fractal dimension have been developed to characterise properties of attractors of chaotic dynamical systems. Numerical approximations of them must be calculated by finite samples of simulated trajectories. In principle, the quantities should not depend on the choice of the trajectory, as long as it provides properly distributed samples of the underlying attractor. In practice, however, the trajectories are sensitive with respect to varying initial values, small changes of the model parameters, to the choice of a solver, numeric tolerances, etc. The purpose of this paper is to present a statistically sound approach to quantify this variability. We modify the concept of correlation integral to produce a vector that summarises the variability at all selected scales. The distribution of this stochastic vector can be estimated, and it provides a statistical distance concept between trajectories. Here, we demonstrate the use of the distance for the purpose of estimating model parameters of a chaotic dynamic model. The methodology is illustrated using computational examples for the Lorenz 63 and Lorenz 95 systems, together with a framework for Markov chain Monte Carlo sampling to produce posterior distributions of model parameters.
Collapse
Affiliation(s)
- Heikki Haario
- School of Engineering Science, Lappeenranta University of Technology, Lappeenranta, Finland
| | - Leonid Kalachev
- Department of Mathematical Sciences, University of Montana, Missoula, Montana 59812-0864, USA
| | - Janne Hakkarainen
- Earth Observation Unit, Finnish Meteorological Institute, Helsinki, Finland
| |
Collapse
|
80
|
Affiliation(s)
- Mathieu Gerber
- Université de Lausanne; Switzerland
- Centre de Recherche en Economie et Statistique; Paris France
| | - Nicolas Chopin
- Centre de Recherche en Economie et Statistique; Paris France
- Ecole Nationale de la Statistique et de l'Administration Economique; Paris France
| |
Collapse
|
81
|
|
82
|
Andrieu C, Vihola M. Convergence properties of pseudo-marginal Markov chain Monte Carlo algorithms. ANN APPL PROBAB 2015. [DOI: 10.1214/14-aap1022] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
83
|
Doucet A, Pitt MK, Deligiannidis G, Kohn R. Efficient implementation of Markov chain Monte Carlo when using an unbiased likelihood estimator. Biometrika 2015. [DOI: 10.1093/biomet/asu075] [Citation(s) in RCA: 121] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
84
|
Palstra FP, Heyer E, Austerlitz F. Statistical inference on genetic data reveals the complex demographic history of human populations in central Asia. Mol Biol Evol 2015; 32:1411-24. [PMID: 25678589 DOI: 10.1093/molbev/msv030] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
The demographic history of modern humans constitutes a combination of expansions, colonizations, contractions, and remigrations. The advent of large scale genetic data combined with statistically refined methods facilitates inference of this complex history. Here we study the demographic history of two genetically admixed ethnic groups in Central Asia, an area characterized by high levels of genetic diversity and a history of recurrent immigration. Using Approximate Bayesian Computation, we infer that the timing of admixture markedly differs between the two groups. Admixture in the traditionally agricultural Tajiks could be dated back to the onset of the Neolithic transition in the region, whereas admixture in Kyrgyz is more recent, and may have involved the westward movement of Turkic peoples. These results are confirmed by a coalescent method that fits an isolation-with-migration model to the genetic data, with both Central Asian groups having received gene flow from the extremities of Eurasia. Interestingly, our analyses also uncover signatures of gene flow from Eastern to Western Eurasia during Paleolithic times. In conclusion, the high genetic diversity currently observed in these two Central Asian peoples most likely reflects the effects of recurrent immigration that likely started before historical times. Conversely, conquests during historical times may have had a relatively limited genetic impact. These results emphasize the need for a better understanding of the genetic consequences of transmission of culture and technological innovations, as well as those of invasions and conquests.
Collapse
Affiliation(s)
- Friso P Palstra
- Laboratoire d'Eco-Anthropologie et Ethnobiologie, UMR 7206, Muséum National d'Histoire Naturelle-Centre National de la Recherche Scientifique-Université Paris 7 Diderot, Paris, France
| | - Evelyne Heyer
- Laboratoire d'Eco-Anthropologie et Ethnobiologie, UMR 7206, Muséum National d'Histoire Naturelle-Centre National de la Recherche Scientifique-Université Paris 7 Diderot, Paris, France
| | - Frédéric Austerlitz
- Laboratoire d'Eco-Anthropologie et Ethnobiologie, UMR 7206, Muséum National d'Histoire Naturelle-Centre National de la Recherche Scientifique-Université Paris 7 Diderot, Paris, France
| |
Collapse
|
85
|
Lindsten F, Douc R, Moulines E. Uniform Ergodicity of the Particle Gibbs Sampler. Scand Stat Theory Appl 2015. [DOI: 10.1111/sjos.12136] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Affiliation(s)
- Fredrik Lindsten
- Department of Engineering; The University of Cambridge; Cambridge, CB2 1PZ UK
- Division of Automatic Control; Linköping University; Linköping 58183 Sweden
| | - Randal Douc
- Département CITI; Télécom SudParis; Evry France
| | - Eric Moulines
- Institut Mines-Télécom; Télécom ParisTech; CNRS LTCI, Paris France
| |
Collapse
|
86
|
Sherlock C, Thiery AH, Roberts GO, Rosenthal JS. On the efficiency of pseudo-marginal random walk Metropolis algorithms. Ann Stat 2015. [DOI: 10.1214/14-aos1278] [Citation(s) in RCA: 86] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
87
|
Inferring epidemiological dynamics with Bayesian coalescent inference: the merits of deterministic and stochastic models. Genetics 2014; 199:595-607. [PMID: 25527289 PMCID: PMC4317665 DOI: 10.1534/genetics.114.172791] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Estimation of epidemiological and population parameters from molecular sequence data has become central to the understanding of infectious disease dynamics. Various models have been proposed to infer details of the dynamics that describe epidemic progression. These include inference approaches derived from Kingman’s coalescent theory. Here, we use recently described coalescent theory for epidemic dynamics to develop stochastic and deterministic coalescent susceptible–infected–removed (SIR) tree priors. We implement these in a Bayesian phylogenetic inference framework to permit joint estimation of SIR epidemic parameters and the sample genealogy. We assess the performance of the two coalescent models and also juxtapose results obtained with a recently published birth–death-sampling model for epidemic inference. Comparisons are made by analyzing sets of genealogies simulated under precisely known epidemiological parameters. Additionally, we analyze influenza A (H1N1) sequence data sampled in the Canterbury region of New Zealand and HIV-1 sequence data obtained from known United Kingdom infection clusters. We show that both coalescent SIR models are effective at estimating epidemiological parameters from data with large fundamental reproductive number R0 and large population size S0. Furthermore, we find that the stochastic variant generally outperforms its deterministic counterpart in terms of error, bias, and highest posterior density coverage, particularly for smaller R0 and S0. However, each of these inference models is shown to have undesirable properties in certain circumstances, especially for epidemic outbreaks with R0 close to one or with small effective susceptible populations.
Collapse
|
88
|
Kobayashi G. A transdimensional approximate Bayesian computation using the pseudo-marginal approach for model choice. Comput Stat Data Anal 2014. [DOI: 10.1016/j.csda.2014.06.025] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
89
|
Filippone M, Girolami M. Pseudo-Marginal Bayesian Inference for Gaussian Processes. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2014; 36:2214-2226. [PMID: 26353062 DOI: 10.1109/tpami.2014.2316530] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
The main challenges that arise when adopting Gaussian process priors in probabilistic modeling are how to carry out exact Bayesian inference and how to account for uncertainty on model parameters when making model-based predictions on out-of-sample data. Using probit regression as an illustrative working example, this paper presents a general and effective methodology based on the pseudo-marginal approach to Markov chain Monte Carlo that efficiently addresses both of these issues. The results presented in this paper show improvements over existing sampling methods to simulate from the posterior distribution over the parameters defining the covariance function of the Gaussian Process prior. This is particularly important as it offers a powerful tool to carry out full Bayesian inference of Gaussian Process based hierarchic statistical models in general. The results also demonstrate that Monte Carlo based integration of all model parameters is actually feasible in this class of models providing a superior quantification of uncertainty in predictions. Extensive comparisons with respect to state-of-the-art probabilistic classifiers confirm this assertion.
Collapse
|
90
|
Herbei R, Berliner LM. Estimating Ocean Circulation: An MCMC Approach With Approximated Likelihoods via the Bernoulli Factory. J Am Stat Assoc 2014. [DOI: 10.1080/01621459.2014.914439] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
91
|
Buzbas EO, Rosenberg NA. AABC: approximate approximate Bayesian computation for inference in population-genetic models. Theor Popul Biol 2014; 99:31-42. [PMID: 25261426 DOI: 10.1016/j.tpb.2014.09.002] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2014] [Revised: 09/14/2014] [Accepted: 09/15/2014] [Indexed: 01/01/2023]
Abstract
Approximate Bayesian computation (ABC) methods perform inference on model-specific parameters of mechanistically motivated parametric models when evaluating likelihoods is difficult. Central to the success of ABC methods, which have been used frequently in biology, is computationally inexpensive simulation of data sets from the parametric model of interest. However, when simulating data sets from a model is so computationally expensive that the posterior distribution of parameters cannot be adequately sampled by ABC, inference is not straightforward. We present "approximate approximate Bayesian computation" (AABC), a class of computationally fast inference methods that extends ABC to models in which simulating data is expensive. In AABC, we first simulate a number of data sets small enough to be computationally feasible to simulate from the parametric model. Conditional on these data sets, we use a statistical model that approximates the correct parametric model and enables efficient simulation of a large number of data sets. We show that under mild assumptions, the posterior distribution obtained by AABC converges to the posterior distribution obtained by ABC, as the number of data sets simulated from the parametric model and the sample size of the observed data set increase. We demonstrate the performance of AABC on a population-genetic model of natural selection, as well as on a model of the admixture history of hybrid populations. This latter example illustrates how, in population genetics, AABC is of particular utility in scenarios that rely on conceptually straightforward but potentially slow forward-in-time simulations.
Collapse
Affiliation(s)
- Erkan O Buzbas
- Department of Biology, Stanford University, Stanford, CA 94305-5020, USA; Department of Statistical Science, University of Idaho, Moscow, ID 84844-1104, USA.
| | - Noah A Rosenberg
- Department of Biology, Stanford University, Stanford, CA 94305-5020, USA.
| |
Collapse
|
92
|
Abstract
Longitudinal allele frequency data are becoming increasingly prevalent. Such samples permit statistical inference of the population genetics parameters that influence the fate of mutant variants. To infer these parameters by maximum likelihood, the mutant frequency is often assumed to evolve according to the Wright–Fisher model. For computational reasons, this discrete model is commonly approximated by a diffusion process that requires the assumption that the forces of natural selection and mutation are weak. This assumption is not always appropriate. For example, mutations that impart drug resistance in pathogens may evolve under strong selective pressure. Here, we present an alternative approximation to the mutant-frequency distribution that does not make any assumptions about the magnitude of selection or mutation and is much more computationally efficient than the standard diffusion approximation. Simulation studies are used to compare the performance of our method to that of the Wright–Fisher and Gaussian diffusion approximations. For large populations, our method is found to provide a much better approximation to the mutant-frequency distribution when selection is strong, while all three methods perform comparably when selection is weak. Importantly, maximum-likelihood estimates of the selection coefficient are severely attenuated when selection is strong under the two diffusion models, but not when our method is used. This is further demonstrated with an application to mutant-frequency data from an experimental study of bacteriophage evolution. We therefore recommend our method for estimating the selection coefficient when the effective population size is too large to utilize the discrete Wright–Fisher model.
Collapse
|
93
|
Skoglund P, Sjödin P, Skoglund T, Lascoux M, Jakobsson M. Investigating population history using temporal genetic differentiation. Mol Biol Evol 2014; 31:2516-27. [PMID: 24939468 PMCID: PMC4137715 DOI: 10.1093/molbev/msu192] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
The rapid advance of sequencing technology, coupled with improvements in molecular methods for obtaining genetic data from ancient sources, holds the promise of producing a wealth of genomic data from time-separated individuals. However, the population-genetic properties of time-structured samples have not been extensively explored. Here, we consider the implications of temporal sampling for analyses of genetic differentiation and use a temporal coalescent framework to show that complex historical events such as size reductions, population replacements, and transient genetic barriers between populations leave a footprint of genetic differentiation that can be traced through history using temporal samples. Our results emphasize explicit consideration of the temporal structure when making inferences and indicate that genomic data from ancient individuals will greatly increase our ability to reconstruct population history.
Collapse
Affiliation(s)
- Pontus Skoglund
- Department of Evolutionary Biology, Uppsala University, Uppsala, Sweden
| | - Per Sjödin
- Department of Evolutionary Biology, Uppsala University, Uppsala, Sweden
| | - Tobias Skoglund
- Department of Evolutionary Biology, Uppsala University, Uppsala, SwedenDepartment of Information Technology, Uppsala University, Uppsala, Sweden
| | - Martin Lascoux
- Department of Ecology and Genetics, Program in Plant Ecology and Evolution, Uppsala University, Uppsala, SwedenScience for Life Laboratory, Uppsala, Sweden
| | - Mattias Jakobsson
- Department of Evolutionary Biology, Uppsala University, Uppsala, SwedenScience for Life Laboratory, Uppsala, Sweden
| |
Collapse
|
94
|
Spatial genetic architecture of the critically-endangered Maui Parrotbill (Pseudonestor xanthophrys): management considerations for reintroduction strategies. CONSERV GENET 2014. [DOI: 10.1007/s10592-014-0641-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
95
|
Lee A, atuszy ski K. Variance bounding and geometric ergodicity of Markov chain Monte Carlo kernels for approximate Bayesian computation. Biometrika 2014. [DOI: 10.1093/biomet/asu027] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
96
|
Maire F, Douc R, Olsson J. Comparison of asymptotic variances of inhomogeneous Markov chains with application to Markov chain Monte Carlo methods. Ann Stat 2014. [DOI: 10.1214/14-aos1209] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
97
|
Static-parameter estimation in piecewise deterministic processes using particle Gibbs samplers. ANN I STAT MATH 2014. [DOI: 10.1007/s10463-014-0455-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
98
|
McKinley TJ, Ross JV, Deardon R, Cook AR. Simulation-based Bayesian inference for epidemic models. Comput Stat Data Anal 2014. [DOI: 10.1016/j.csda.2012.12.012] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
99
|
Kühnert D, Stadler T, Vaughan TG, Drummond AJ. Simultaneous reconstruction of evolutionary history and epidemiological dynamics from viral sequences with the birth-death SIR model. J R Soc Interface 2014; 11:20131106. [PMID: 24573331 PMCID: PMC3973358 DOI: 10.1098/rsif.2013.1106] [Citation(s) in RCA: 89] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
The evolution of RNA viruses, such as human immunodeficiency virus (HIV), hepatitis C virus and influenza virus, occurs so rapidly that the viruses' genomes contain information on past ecological dynamics. Hence, we develop a phylodynamic method that enables the joint estimation of epidemiological parameters and phylogenetic history. Based on a compartmental susceptible–infected–removed (SIR) model, this method provides separate information on incidence and prevalence of infections. Detailed information on the interaction of host population dynamics and evolutionary history can inform decisions on how to contain or entirely avoid disease outbreaks. We apply our birth–death SIR method to two viral datasets. First, five HIV type 1 clusters sampled in the UK between 1999 and 2003 are analysed. The estimated basic reproduction ratios range from 1.9 to 3.2 among the clusters. All clusters show a decline in the growth rate of the local epidemic in the middle or end of the 1990s. The analysis of a hepatitis C virus genotype 2c dataset shows that the local epidemic in the Córdoban city Cruz del Eje originated around 1906 (median), coinciding with an immigration wave from Europe to central Argentina that dates from 1880 to 1920. The estimated time of epidemic peak is around 1970.
Collapse
Affiliation(s)
- Denise Kühnert
- Department of Computer Science, University of Auckland, , Auckland, New Zealand
| | | | | | | |
Collapse
|
100
|
Gasca-Pineda J, Cassaigne I, Alonso RA, Eguiarte LE. Effective population size, genetic variation, and their relevance for conservation: the bighorn sheep in Tiburon Island and comparisons with managed artiodactyls. PLoS One 2013; 8:e78120. [PMID: 24147115 PMCID: PMC3795651 DOI: 10.1371/journal.pone.0078120] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2012] [Accepted: 09/18/2013] [Indexed: 12/03/2022] Open
Abstract
The amount of genetic diversity in a finite biological population mostly depends on the interactions among evolutionary forces and the effective population size (N(e)) as well as the time since population establishment. Because the N(e) estimation helps to explore population demographic history, and allows one to predict the behavior of genetic diversity through time, N(e) is a key parameter for the genetic management of small and isolated populations. Here, we explored an N(e)-based approach using a bighorn sheep population on Tiburon Island, Mexico (TI) as a model. We estimated the current (N(crnt)) and ancestral stable (N(stbl)) inbreeding effective population sizes as well as summary statistics to assess genetic diversity and the demographic scenarios that could explain such diversity. Then, we evaluated the feasibility of using TI as a source population for reintroduction programs. We also included data from other bighorn sheep and artiodactyl populations in the analysis to compare their inbreeding effective size estimates. The TI population showed high levels of genetic diversity with respect to other managed populations. However, our analysis suggested that TI has been under a genetic bottleneck, indicating that using individuals from this population as the only source for reintroduction could lead to a severe genetic diversity reduction. Analyses of the published data did not show a strict correlation between H(E) and N(crnt) estimates. Moreover, we detected that ancient anthropogenic and climatic pressures affected all studied populations. We conclude that the estimation of N(crnt) and N(stbl) are informative genetic diversity estimators and should be used in addition to summary statistics for conservation and population management planning.
Collapse
Affiliation(s)
- Jaime Gasca-Pineda
- Departamento de Ecología Evolutiva, Instituto de Ecología, Universidad Nacional Autónoma de México, México City, México
| | - Ivonne Cassaigne
- Departamento de Ecología Evolutiva, Instituto de Ecología, Universidad Nacional Autónoma de México, México City, México
| | - Rogelio A. Alonso
- Facultad de Medicina Veterinaria and Zootecnia, Universidad Nacional Autónoma de México, México City, México
| | - Luis E. Eguiarte
- Departamento de Ecología Evolutiva, Instituto de Ecología, Universidad Nacional Autónoma de México, México City, México
| |
Collapse
|