1
|
Parreira BR, Gopalakrishnan S, Chikhi L. Effects of Social Structure on Effective Population Size Change Estimates. Evol Appl 2025; 18:e70063. [PMID: 39816161 PMCID: PMC11732743 DOI: 10.1111/eva.70063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Revised: 11/26/2024] [Accepted: 12/03/2024] [Indexed: 01/18/2025] Open
Abstract
Most methods currently used to infer the "demographic history of species" interpret this expression as a history of population size changes. The detection, quantification, and dating of demographic changes often rely on the assumption that population structure can be neglected. However, most vertebrates are typically organized in populations subdivided into social groups that are usually ignored in the interpretation of genetic data. This could be problematic since an increasing number of studies have shown that population structure can generate spurious signatures of population size change. Here, we simulate microsatellite data from a species subdivided into social groups where reproduction occurs according to different mating systems (monogamy, polygynandry, and polygyny). We estimate the effective population size (N e) and quantify the effect of social structure on estimates of changes in N e. We analyze the simulated data with two widely used methods for demographic inference. The first approach, BOTTLENECK, tests whether the samples are at mutation-drift equilibrium and thus whether a single N e can be estimated. The second approach, msvar, aims at quantifying and dating changes in N e. We find that social structure may lead to signals of departure from mutation-drift equilibrium including signals of expansion and bottlenecks. We also find that expansion signals may be observed under simple stationary Wright-Fisher models with low diversity. Since small populations tend to characterize many endangered species, we stress that methods trying to infer N e should be interpreted with care and validated with simulated data incorporating information about structure. Spurious expansion signals due to social structure can mask critical population size changes. These can obscure true bottleneck events and be particularly problematic in endangered species.
Collapse
Affiliation(s)
- Bárbara Ribeiro Parreira
- Center for Evolutionary HologenomicsGlobe Institute, University of CopenhagenCopenhagenDenmark
- Instituto Gulbenkian de CiênciaOeirasPortugal
| | - Shyam Gopalakrishnan
- Center for Evolutionary HologenomicsGlobe Institute, University of CopenhagenCopenhagenDenmark
| | - Lounès Chikhi
- Instituto Gulbenkian de CiênciaOeirasPortugal
- Centre de Recherche sur la Biodiversité et l'Environnement (CRBE) UMR 5300Université de Toulouse, CNRS, IRD, Toulouse INP, Université Toulouse 3 Paul Sabatier (UT3)ToulouseFrance
- Centre for Ecology, Evolution and Environmental Changes (cE3c)Faculdade de Ciências da Universidade de LisboaLisboaPortugal
| |
Collapse
|
2
|
Pflug FG, Haendeler S, Esk C, Lindenhofer D, Knoblich JA, von Haeseler A. Neutral competition explains the clonal composition of neural organoids. PLoS Comput Biol 2024; 20:e1012054. [PMID: 38648250 PMCID: PMC11065252 DOI: 10.1371/journal.pcbi.1012054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 05/02/2024] [Accepted: 04/03/2024] [Indexed: 04/25/2024] Open
Abstract
Neural organoids model the development of the human brain and are an indispensable tool for studying neurodevelopment. Whole-organoid lineage tracing has revealed the number of progenies arising from each initial stem cell to be highly diverse, with lineage sizes ranging from one to more than 20,000 cells. This high variability exceeds what can be explained by existing stochastic models of corticogenesis and indicates the existence of an additional source of stochasticity. To explain this variability, we introduce the SAN model which distinguishes Symmetrically diving, Asymmetrically dividing, and Non-proliferating cells. In the SAN model, the additional source of stochasticity is the survival time of a lineage's pool of symmetrically dividing cells. These survival times result from neutral competition within the sub-population of all symmetrically dividing cells. We demonstrate that our model explains the experimentally observed variability of lineage sizes and derive the quantitative relationship between survival time and lineage size. We also show that our model implies the existence of a regulatory mechanism which keeps the size of the symmetrically dividing cell population constant. Our results provide quantitative insight into the clonal composition of neural organoids and how it arises. This is relevant for many applications of neural organoids, and similar processes may occur in other developing tissues both in vitro and in vivo.
Collapse
Affiliation(s)
- Florian G. Pflug
- Biological Complexity Unit, Okinawa Institute of Science and Technology Graduate University (OIST), Onna, Okinawa, Japan
- Center for Integrative Bioinformatics Vienna (CIBIV), Max Perutz Labs, University of Vienna and Medical University of Vienna, Vienna BioCenter (VBC), Vienna, Austria
| | - Simon Haendeler
- Center for Integrative Bioinformatics Vienna (CIBIV), Max Perutz Labs, University of Vienna and Medical University of Vienna, Vienna BioCenter (VBC), Vienna, Austria
- Vienna Biocenter (VBC) PhD Program, a Doctoral School of the University of Vienna and the Medical University of Vienna, Vienna, Austria
| | - Christopher Esk
- Institute of Molecular Biotechnology of the Austrian Academy of Science (IMBA), Vienna BioCenter (VBC), Vienna, Austria
- Institute of Molecular Biology, University of Innsbruck, Innsbruck, Austria
| | - Dominik Lindenhofer
- Institute of Molecular Biotechnology of the Austrian Academy of Science (IMBA), Vienna BioCenter (VBC), Vienna, Austria
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Jürgen A. Knoblich
- Institute of Molecular Biotechnology of the Austrian Academy of Science (IMBA), Vienna BioCenter (VBC), Vienna, Austria
- Department of Neurology, Medical University of Vienna, Vienna, Austria
| | - Arndt von Haeseler
- Center for Integrative Bioinformatics Vienna (CIBIV), Max Perutz Labs, University of Vienna and Medical University of Vienna, Vienna BioCenter (VBC), Vienna, Austria
- Faculty of Computer Science Bioinformatics and Computational Biology, University of Vienna, Vienna, Austria
| |
Collapse
|
3
|
Vandecasteele H, Samaey G. Pseudo-marginal approximation to the free energy in a micro-macro Markov chain Monte Carlo method. J Chem Phys 2024; 160:104702. [PMID: 38465681 DOI: 10.1063/5.0199562] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 02/13/2024] [Indexed: 03/12/2024] Open
Abstract
We introduce a generalized micro-macro Markov chain Monte Carlo (mM-MCMC) method with pseudo-marginal approximation to the free energy that is able to accelerate sampling of the microscopic Gibbs distributions when there is a time-scale separation between the macroscopic dynamics of a reaction coordinate and the remaining microscopic degrees of freedom. The mM-MCMC method attains this efficiency by iterating four steps: (i) propose a new value of the reaction coordinate, (ii) accept or reject the macroscopic sample, (iii) run a biased simulation that creates a microscopic molecular instance that lies close to the newly sampled macroscopic reaction coordinate value, and (iv) microscopic accept/reject step for the new microscopic sample. In the present paper, we eliminate the main computational bottleneck of earlier versions of this method: the necessity to have an accurate approximation of free energy. We show that the introduction of a pseudo-marginal approximation significantly reduces the computational cost of the microscopic accept/reject step while still providing unbiased samples. We illustrate the method's behavior on several molecular systems with low-dimensional reaction coordinates.
Collapse
Affiliation(s)
- Hannes Vandecasteele
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 N. Charles Street Baltimore, Maryland 21218, USA
- Department of Computer Science, KU Leuven, Celestijnenlaan 200A, 3001 Leuven, Belgium
| | - Giovanni Samaey
- Department of Computer Science, KU Leuven, Celestijnenlaan 200A, 3001 Leuven, Belgium
| |
Collapse
|
4
|
Liang X, Livingstone S, Griffin J. Adaptive MCMC for Bayesian Variable Selection in Generalised Linear Models and Survival Models. ENTROPY (BASEL, SWITZERLAND) 2023; 25:1310. [PMID: 37761609 PMCID: PMC10528396 DOI: 10.3390/e25091310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 08/30/2023] [Accepted: 09/05/2023] [Indexed: 09/29/2023]
Abstract
Developing an efficient computational scheme for high-dimensional Bayesian variable selection in generalised linear models and survival models has always been a challenging problem due to the absence of closed-form solutions to the marginal likelihood. The Reversible Jump Markov Chain Monte Carlo (RJMCMC) approach can be employed to jointly sample models and coefficients, but the effective design of the trans-dimensional jumps of RJMCMC can be challenging, making it hard to implement. Alternatively, the marginal likelihood can be derived conditional on latent variables using a data-augmentation scheme (e.g., Pólya-gamma data augmentation for logistic regression) or using other estimation methods. However, suitable data-augmentation schemes are not available for every generalised linear model and survival model, and estimating the marginal likelihood using a Laplace approximation or a correlated pseudo-marginal method can be computationally expensive. In this paper, three main contributions are presented. Firstly, we present an extended Point-wise implementation of Adaptive Random Neighbourhood Informed proposal (PARNI) to efficiently sample models directly from the marginal posterior distributions of generalised linear models and survival models. Secondly, in light of the recently proposed approximate Laplace approximation, we describe an efficient and accurate estimation method for marginal likelihood that involves adaptive parameters. Additionally, we describe a new method to adapt the algorithmic tuning parameters of the PARNI proposal by replacing Rao-Blackwellised estimates with the combination of a warm-start estimate and the ergodic average. We present numerous numerical results from simulated data and eight high-dimensional genetic mapping data-sets to showcase the efficiency of the novel PARNI proposal compared with the baseline add-delete-swap proposal.
Collapse
Affiliation(s)
- Xitong Liang
- Department of Statistical Science, University College London, London WC1E 6BT, UK; (S.L.); (J.G.)
| | | | | |
Collapse
|
5
|
Dove S, Böhm M, Freeman R, McRae L, Murrell DJ. Quantifying reliability and data deficiency in global vertebrate population trends using the Living Planet Index. GLOBAL CHANGE BIOLOGY 2023; 29:4966-4982. [PMID: 37376728 DOI: 10.1111/gcb.16841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/18/2023] [Accepted: 04/15/2023] [Indexed: 06/29/2023]
Abstract
Global biodiversity is facing a crisis, which must be solved through effective policies and on-the-ground conservation. But governments, NGOs, and scientists need reliable indicators to guide research, conservation actions, and policy decisions. Developing reliable indicators is challenging because the data underlying those tools is incomplete and biased. For example, the Living Planet Index tracks the changing status of global vertebrate biodiversity, but taxonomic, geographic and temporal gaps and biases are present in the aggregated data used to calculate trends. However, without a basis for real-world comparison, there is no way to directly assess an indicator's accuracy or reliability. Instead, a modelling approach can be used. We developed a model of trend reliability, using simulated datasets as stand-ins for the "real world", degraded samples as stand-ins for indicator datasets (e.g., the Living Planet Database), and a distance measure to quantify reliability by comparing partially sampled to fully sampled trends. The model revealed that the proportion of species represented in the database is not always indicative of trend reliability. Important factors are the number and length of time series, as well as their mean growth rates and variance in their growth rates, both within and between time series. We found that many trends in the Living Planet Index need more data to be considered reliable, particularly trends across the global south. In general, bird trends are the most reliable, while reptile and amphibian trends are most in need of additional data. We simulated three different solutions for reducing data deficiency, and found that collating existing data (where available) is the most efficient way to improve trend reliability, whereas revisiting previously studied populations is a quick and efficient way to improve trend reliability until new long-term studies can be completed and made available.
Collapse
Affiliation(s)
- Shawn Dove
- Centre for Biodiversity and Environment Research, University College London, London, UK
- Institute of Zoology, Zoological Society of London, London, UK
| | - Monika Böhm
- Institute of Zoology, Zoological Society of London, London, UK
- Global Center for Species Survival, Indianapolis Zoo, Indianapolis, Indiana, USA
| | - Robin Freeman
- Institute of Zoology, Zoological Society of London, London, UK
| | - Louise McRae
- Institute of Zoology, Zoological Society of London, London, UK
| | - David J Murrell
- Centre for Biodiversity and Environment Research, University College London, London, UK
| |
Collapse
|
6
|
Friedli L, Linde N. Solving Geophysical Inversion Problems with Intractable Likelihoods: Linearized Gaussian Approximations Versus the Correlated Pseudo-marginal Method. MATHEMATICAL GEOSCIENCES 2023; 56:55-75. [PMID: 38283870 PMCID: PMC10817994 DOI: 10.1007/s11004-023-10064-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Accepted: 04/02/2023] [Indexed: 01/30/2024]
Abstract
A geophysical Bayesian inversion problem may target the posterior distribution of geological or hydrogeological parameters given geophysical data. To account for the scatter in the petrophysical relationship linking the target parameters to the geophysical properties, this study treats the intermediate geophysical properties as latent (unobservable) variables. To perform inversion in such a latent variable model, the intractable likelihood function of the (hydro)geological parameters given the geophysical data needs to be estimated. This can be achieved by approximation with a Gaussian probability density function based on local linearization of the geophysical forward operator, thereby, accounting for the noise in the petrophysical relationship by a corresponding addition to the data covariance matrix. The new approximate method is compared against the general correlated pseudo-marginal method, which estimates the likelihood by Monte Carlo averaging over samples of the latent variable. First, the performances of the two methods are tested on a synthetic test example, in which a multivariate Gaussian porosity field is inferred using crosshole ground-penetrating radar first-arrival travel times. For this example with rather small petrophysical uncertainty, the two methods provide near-identical estimates, while an inversion that ignores petrophysical uncertainty leads to biased estimates. The results of a sensitivity analysis are then used to suggest that the linearized Gaussian approach, while attractive due to its relative computational speed, suffers from a decreasing accuracy with increasing scatter in the petrophysical relationship. The computationally more expensive correlated pseudo-marginal method performs very well even for settings with high petrophysical uncertainty.
Collapse
Affiliation(s)
- Lea Friedli
- Institute of Earth Sciences, University of Lausanne, Lausanne, Switzerland
| | - Niklas Linde
- Institute of Earth Sciences, University of Lausanne, Lausanne, Switzerland
| |
Collapse
|
7
|
Baey C, Smith HG, Rundlöf M, Olsson O, Clough Y, Sahlin U. Calibration of a bumble bee foraging model using Approximate Bayesian Computation. Ecol Modell 2023. [DOI: 10.1016/j.ecolmodel.2022.110251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
|
8
|
Martin GM, Frazier DT, Robert CP. Computing Bayes: From Then ‘Til Now. Stat Sci 2023. [DOI: 10.1214/22-sts876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
Affiliation(s)
- Gael M. Martin
- Gael M. Martin is Professor, Department of Econometrics and Business Statistics, Monash University, Melbourne, Australia
| | - David T. Frazier
- David T. Frazier is Associate Professor, Department of Econometrics and Business Statistics, Monash University, Melbourne, Australia
| | - Christian P. Robert
- Christian P. Robert is Professor, Ceremade, Université Paris-Dauphine, Paris, France, and Department of Statistics, Warwick University, Coventry, UK
| |
Collapse
|
9
|
Martin GM, Frazier DT, Robert CP. Approximating Bayes in the 21st Century. Stat Sci 2023. [DOI: 10.1214/22-sts875] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]
Affiliation(s)
- Gael M. Martin
- Gael M. Martin is Professor, Department of Econometrics and Business Statistics, Monash University, Melbourne, Australia
| | - David T. Frazier
- David T. Frazier is Associate Professor, Department of Econometrics and Business Statistics, Monash University, Melbourne, Australia
| | | |
Collapse
|
10
|
Viaud G, Chen Y, Cournède PH. Full Bayesian inference in hidden Markov models of plant growth. Ann Appl Stat 2022. [DOI: 10.1214/21-aoas1594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Gautier Viaud
- Université Paris-Saclay, CentraleSupélec, Mathématiques et Informatique pour la Complexité et les Systèmes
| | - Yuting Chen
- Energy Analysis and Environmental Impacts Division, Lawrence Berkeley National Laboratory
| | - Paul-Henry Cournède
- Université Paris-Saclay, CentraleSupélec, Mathématiques et Informatique pour la Complexité et les Systèmes
| |
Collapse
|
11
|
Gloaguen P, Le Corff S, Olsson J. A pseudo-marginal sequential Monte Carlo online smoothing algorithm. BERNOULLI 2022. [DOI: 10.3150/21-bej1431] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
| | | | - Jimmy Olsson
- KTH Royal Institute of Technology, Stockholm, Sweden
| |
Collapse
|
12
|
Wang J. MLNe: Simulating and Estimating Effective Size and Migration Rate from Temporal Changes in Allele Frequencies. J Hered 2022; 113:563-567. [PMID: 35932284 DOI: 10.1093/jhered/esac039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Accepted: 08/03/2022] [Indexed: 11/12/2022] Open
Abstract
In studies of molecular ecology, conservation biology and evolutionary biology, the current or recent effective size (Ne) of a population is frequently estimated from the marker genotype data of two or more temporally spaced samples of individuals taken from the population. Despite the developments of numerous Bayesian, likelihood and moment estimators, only a couple of them can use both temporally and spatially spaced samples of individuals to estimate jointly the effective size (Ne) of and the migration rate (m) into a population. In this note I describe new implementations of these joint estimators of Ne and m in software MLNe which runs on multiple platforms (Windows, Mac, Linux) with or without a graphical user interface (GUI), has an integrated simulation module to simulate genotype data for investigating the impacts of various factors (such as sample size and sampling interval) on estimation precision and accuracy, exploits both Message Passing Interface (MPI) and openMP for parallel computations using multiple cores and nodes to speed up analysis. The program does not require data pre-processing and accepts multiple formats of a file of original genotype data and a file of parameters as input. The GUI facilitates data and parameter inputs and produces publication-quality output graphs, while the non-GUI version of software is convenient for batch analysis of multiple datasets as in simulations. MLNe will help advance the analysis of temporal genetic marker data for estimating Ne of and m between populations, which are important parameters that will help biologists for the conservation management of natural and managed populations. MLNe can be downloaded free from the website http://www.zsl.org/science/research/software/.
Collapse
Affiliation(s)
- Jinliang Wang
- Institute of Zoology, Zoological Society of London, London NW1 4RY, United Kingdom
| |
Collapse
|
13
|
Roy A, Shen L, Balasubramanian K, Ghadimi S. Stochastic zeroth-order discretizations of Langevin diffusions for Bayesian inference. BERNOULLI 2022. [DOI: 10.3150/21-bej1400] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Abhishek Roy
- Department of Statistics, University of California, Davis, Davis, CA 95616, USA
| | - Lingqing Shen
- Tepper School of Business, Carnegie Mellon University, Pittsburgh, PA 15213
| | | | - Saeed Ghadimi
- Department of Management Sciences, University of Waterloo, Waterloo, ON N2L 3G1, Canada
| |
Collapse
|
14
|
Dunne M, Mohammadi H, Challenor P, Borgo R, Porphyre T, Vernon I, Firat EE, Turkay C, Torsney-Weir T, Goldstein M, Reeve R, Fang H, Swallow B. Complex model calibration through emulation, a worked example for a stochastic epidemic model. Epidemics 2022; 39:100574. [PMID: 35617882 PMCID: PMC9109972 DOI: 10.1016/j.epidem.2022.100574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 04/22/2022] [Accepted: 04/29/2022] [Indexed: 12/03/2022] Open
Abstract
Uncertainty quantification is a formal paradigm of statistical estimation that aims to account for all uncertainties inherent in the modelling process of real-world complex systems. The methods are directly applicable to stochastic models in epidemiology, however they have thus far not been widely used in this context. In this paper, we provide a tutorial on uncertainty quantification of stochastic epidemic models, aiming to facilitate the use of the uncertainty quantification paradigm for practitioners with other complex stochastic simulators of applied systems. We provide a formal workflow including the important decisions and considerations that need to be taken, and illustrate the methods over a simple stochastic epidemic model of UK SARS-CoV-2 transmission and patient outcome. We also present new approaches to visualisation of outputs from sensitivity analyses and uncertainty quantification more generally in high input and/or output dimensions.
Collapse
Affiliation(s)
- Michael Dunne
- College of Engineering, Mathematics and Physical Sciences, University of Exeter, Exeter, UK
| | - Hossein Mohammadi
- College of Engineering, Mathematics and Physical Sciences, University of Exeter, Exeter, UK
| | - Peter Challenor
- College of Engineering, Mathematics and Physical Sciences, University of Exeter, Exeter, UK
| | - Rita Borgo
- Department of Informatics, King's College London, London, UK
| | - Thibaud Porphyre
- Laboratoire de Biométrie et Biologie Evolutive, VetAgro Sup, Marcy l'Etoile, France
| | - Ian Vernon
- Department of Mathematical Sciences, Durham University, Durham, UK
| | - Elif E Firat
- Department of Computer Science, University of Nottingham, Nottingham, UK
| | - Cagatay Turkay
- Centre for Interdisciplinary Methodologies, University of Warwick, Coventry, UK
| | - Thomas Torsney-Weir
- VRVis Zentrum für Virtual Reality und Visualisierung Forschungs-GmbH, Vienna, Austria
| | | | - Richard Reeve
- Boyd Orr Centre for Population and Ecosystem Health, Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UK
| | - Hui Fang
- Department of Computer Science, Loughborough University, Loughborough, UK
| | - Ben Swallow
- School of Mathematics and Statistics, University of Glasgow, Glasgow, UK.
| |
Collapse
|
15
|
Borowska A, King R. Semi-Complete Data Augmentation for Efficient State Space Model Fitting. J Comput Graph Stat 2022. [DOI: 10.1080/10618600.2022.2077350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
| | - Ruth King
- School of Mathematics, University of Edinburgh, UK
| |
Collapse
|
16
|
Medina-Aguayo FJ, Christen JA. Penalised t-walk MCMC. J Stat Plan Inference 2022. [DOI: 10.1016/j.jspi.2022.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
17
|
Kaji T, Ročková V. Metropolis-Hastings via Classification. J Am Stat Assoc 2022. [DOI: 10.1080/01621459.2022.2060836] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
18
|
Nadachowska‐Brzyska K, Konczal M, Babik W. Navigating the temporal continuum of effective population size. Methods Ecol Evol 2021. [DOI: 10.1111/2041-210x.13740] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Affiliation(s)
| | | | - Wieslaw Babik
- Jagiellonian University in Kraków Faculty of Biology Institute of Environmental Sciences Kraków Poland
| |
Collapse
|
19
|
Sherlock C, Thiery AH, Golightly A. Efficiency of delayed-acceptance random walk Metropolis algorithms. Ann Stat 2021. [DOI: 10.1214/21-aos2068] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Chris Sherlock
- Department of Mathematics and Statistics, Lancaster University
| | - Alexandre H. Thiery
- Department of Statistics and Applied Probability, National University of Singapore
| | - Andrew Golightly
- School of Mathematics, Statistics and Physics, Newcastle University
| |
Collapse
|
20
|
Kidner J, Theodorou P, Engler JO, Taubert M, Husemann M. A brief history and popularity of methods and tools used to estimate micro-evolutionary forces. Ecol Evol 2021; 11:13723-13743. [PMID: 34707813 PMCID: PMC8525119 DOI: 10.1002/ece3.8076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Revised: 07/12/2021] [Accepted: 08/12/2021] [Indexed: 11/30/2022] Open
Abstract
Population genetics is a field of research that predates the current generations of sequencing technology. Those approaches, that were established before massively parallel sequencing methods, have been adapted to these new marker systems (in some cases involving the development of new methods) that allow genome-wide estimates of the four major micro-evolutionary forces-mutation, gene flow, genetic drift, and selection. Nevertheless, classic population genetic markers are still commonly used and a plethora of analysis methods and programs is available for these and high-throughput sequencing (HTS) data. These methods employ various and diverse theoretical and statistical frameworks, to varying degrees of success, to estimate similar evolutionary parameters making it difficult to get a concise overview across the available approaches. Presently, reviews on this topic generally focus on a particular class of methods to estimate one or two evolutionary parameters. Here, we provide a brief history of methods and a comprehensive list of available programs for estimating micro-evolutionary forces. We furthermore analyzed their usage within the research community based on popularity (citation bias) and discuss the implications of this bias for the software community. We found that a few programs received the majority of citations, with program success being independent of both the parameters estimated and the computing platform. The only deviation from a model of exponential growth in the number of citations was found for the presence of a graphical user interface (GUI). Interestingly, no relationship was found for the impact factor of the journals, when the tools were published, suggesting accessibility might be more important than visibility.
Collapse
Affiliation(s)
- Jonathan Kidner
- General Zoology Institute for Biology Martin Luther University Halle-Wittenberg Halle (Saale) Germany
| | - Panagiotis Theodorou
- General Zoology Institute for Biology Martin Luther University Halle-Wittenberg Halle (Saale) Germany
| | - Jan O Engler
- Terrestrial Ecology Unit Department of Biology Ghent University Ghent Belgium
| | - Martin Taubert
- Aquatic Geomicrobiology Institute for Biodiversity Friedrich Schiller University Jena Jena Germany
| | - Martin Husemann
- General Zoology Institute for Biology Martin Luther University Halle-Wittenberg Halle (Saale) Germany
- Centrum für Naturkunde University of Hamburg Hamburg Germany
| |
Collapse
|
21
|
Jacob PE, Gong R, Edlefsen PT, Dempster AP. A Gibbs sampler for a class of random convex polytopes. J Am Stat Assoc 2021; 116:1181-1192. [PMID: 35340357 PMCID: PMC8945543 DOI: 10.1080/01621459.2021.1945458] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 06/16/2021] [Indexed: 10/20/2022]
Abstract
We present a Gibbs sampler for the Dempster-Shafer (DS) approach to statistical inference for Categorical distributions. The DS framework extends the Bayesian approach, allows in particular the use of partial prior information, and yields three-valued uncertainty assessments representing probabilities "for", "against", and "don't know" about formal assertions of interest. The proposed algorithm targets the distribution of a class of random convex polytopes which encapsulate the DS inference. The sampler relies on an equivalence between the iterative constraints of the vertex configuration and the non-negativity of cycles in a fully connected directed graph. Illustrations include the testing of independence in 2 × 2 contingency tables and parameter estimation of the linkage model.
Collapse
Affiliation(s)
| | - Ruobin Gong
- Department of Statistics, Rutgers University
| | | | | |
Collapse
|
22
|
Pseudo-marginal Bayesian inference for Gaussian process latent variable models. Mach Learn 2021. [DOI: 10.1007/s10994-021-05971-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
23
|
Crouse WL, Kelada SNP, Valdar W. Inferring the Allelic Series at QTL in Multiparental Populations. Genetics 2020; 216:957-983. [PMID: 33082282 PMCID: PMC7768242 DOI: 10.1534/genetics.120.303393] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2020] [Accepted: 10/12/2020] [Indexed: 12/25/2022] Open
Abstract
Multiparental populations (MPPs) are experimental populations in which the genome of every individual is a mosaic of known founder haplotypes. These populations are useful for detecting quantitative trait loci (QTL) because tests of association can leverage inferred founder haplotype descent. It is difficult, however, to determine how haplotypes at a locus group into distinct functional alleles, termed the allelic series. The allelic series is important because it provides information about the number of causal variants at a QTL and their combined effects. In this study, we introduce a fully Bayesian model selection framework for inferring the allelic series. This framework accounts for sources of uncertainty found in typical MPPs, including the number and composition of functional alleles. Our prior distribution for the allelic series is based on the Chinese restaurant process, a relative of the Dirichlet process, and we leverage its connection to the coalescent to introduce additional prior information about haplotype relatedness via a phylogenetic tree. We evaluate our approach via simulation and apply it to QTL from two MPPs: the Collaborative Cross (CC) and the Drosophila Synthetic Population Resource (DSPR). We find that, although posterior inference of the exact allelic series is often uncertain, we are able to distinguish biallelic QTL from more complex multiallelic cases. Additionally, our allele-based approach improves haplotype effect estimation when the true number of functional alleles is small. Our method, Tree-Based Inference of Multiallelism via Bayesian Regression (TIMBR), provides new insight into the genetic architecture of QTL in MPPs.
Collapse
Affiliation(s)
- Wesley L Crouse
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, North Carolina 27599
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina 27599
| | - Samir N P Kelada
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina 27599
- Marsico Lung Institute, University of North Carolina, Chapel Hill, North Carolina 27599
| | - William Valdar
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina 27599
- Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 27599
| |
Collapse
|
24
|
He Z, Dai X, Beaumont M, Yu F. Detecting and Quantifying Natural Selection at Two Linked Loci from Time Series Data of Allele Frequencies with Forward-in-Time Simulations. Genetics 2020; 216:521-541. [PMID: 32826299 PMCID: PMC7536848 DOI: 10.1534/genetics.120.303463] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Accepted: 08/15/2020] [Indexed: 12/16/2022] Open
Abstract
Recent advances in DNA sequencing techniques have made it possible to monitor genomes in great detail over time. This improvement provides an opportunity for us to study natural selection based on time serial samples of genomes while accounting for genetic recombination effect and local linkage information. Such time series genomic data allow for more accurate estimation of population genetic parameters and hypothesis testing on the recent action of natural selection. In this work, we develop a novel Bayesian statistical framework for inferring natural selection at a pair of linked loci by capitalising on the temporal aspect of DNA data with the additional flexibility of modeling the sampled chromosomes that contain unknown alleles. Our approach is built on a hidden Markov model where the underlying process is a two-locus Wright-Fisher diffusion with selection, which enables us to explicitly model genetic recombination and local linkage. The posterior probability distribution for selection coefficients is computed by applying the particle marginal Metropolis-Hastings algorithm, which allows us to efficiently calculate the likelihood. We evaluate the performance of our Bayesian inference procedure through extensive simulations, showing that our approach can deliver accurate estimates of selection coefficients, and the addition of genetic recombination and local linkage brings about significant improvement in the inference of natural selection. We also illustrate the utility of our method on real data with an application to ancient DNA data associated with white spotting patterns in horses.
Collapse
Affiliation(s)
- Zhangyi He
- School of Mathematics, University of Bristol, BS8 1UG, United Kingdom
| | - Xiaoyang Dai
- School of Biological Sciences, University of Bristol, BS8 1TQ, United Kingdom
| | - Mark Beaumont
- School of Biological Sciences, University of Bristol, BS8 1TQ, United Kingdom
| | - Feng Yu
- School of Mathematics, University of Bristol, BS8 1UG, United Kingdom
| |
Collapse
|
25
|
Sethuraman A, Janzen FJ, Weisrock DW, Obrycki JJ. Insights from Population Genomics to Enhance and Sustain Biological Control of Insect Pests. INSECTS 2020; 11:E462. [PMID: 32708047 PMCID: PMC7469154 DOI: 10.3390/insects11080462] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Revised: 07/15/2020] [Accepted: 07/17/2020] [Indexed: 01/25/2023]
Abstract
Biological control-the use of organisms (e.g., nematodes, arthropods, bacteria, fungi, viruses) for the suppression of insect pest species-is a well-established, ecologically sound and economically profitable tactic for crop protection. This approach has served as a sustainable solution for many insect pest problems for over a century in North America. However, all pest management tactics have associated risks. Specifically, the ecological non-target effects of biological control have been examined in numerous systems. In contrast, the need to understand the short- and long-term evolutionary consequences of human-mediated manipulation of biological control organisms for importation, augmentation and conservation biological control has only recently been acknowledged. Particularly, population genomics presents exceptional opportunities to study adaptive evolution and invasiveness of pests and biological control organisms. Population genomics also provides insights into (1) long-term biological consequences of releases, (2) the ecological success and sustainability of this pest management tactic and (3) non-target effects on native species, populations and ecosystems. Recent advances in genomic sequencing technology and model-based statistical methods to analyze population-scale genomic data provide a much needed impetus for biological control programs to benefit by incorporating a consideration of evolutionary consequences. Here, we review current technology and methods in population genomics and their applications to biological control and include basic guidelines for biological control researchers for implementing genomic technology and statistical modeling.
Collapse
Affiliation(s)
- Arun Sethuraman
- Department of Biological Sciences, California State University San Marcos, San Marcos, CA 92096, USA
| | - Fredric J Janzen
- Department of Ecology, Evolution, & Organismal Biology, Iowa State University, Ames, IA 50010, USA
- Kellogg Biological Station, Michigan State University, Hickory Corners, MI 49060, USA
| | - David W Weisrock
- Department of Biology, University of Kentucky, Lexington, KY 40506, USA
| | - John J Obrycki
- Department of Entomology, University of Kentucky, Lexington, KY 40506, USA
| |
Collapse
|
26
|
Walker CR, Black AJ, Ross JV. Bayesian model discrimination for partially-observed epidemic models. Math Biosci 2019; 317:108266. [PMID: 31589881 DOI: 10.1016/j.mbs.2019.108266] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Revised: 08/22/2019] [Accepted: 09/27/2019] [Indexed: 10/25/2022]
Abstract
An efficient method for Bayesian model selection is presented for a broad class of continuous-time Markov chain models and is subsequently applied to two important problems in epidemiology. The first problem is to identify the shape of the infectious period distribution; the second problem is to determine whether individuals display symptoms before, at the same time, or after they become infectious. In both cases we show that the correct model can be identified, in the majority of cases, from symptom onset data generated from multiple outbreaks in small populations. The method works by evaluating the likelihood using a particle filter that incorporates a novel importance sampling algorithm designed for partially-observed continuous-time Markov chains. This is combined with another importance sampling method to unbiasedly estimate the model evidence. These come with estimates of precision, which allow for stopping criterion to be employed. Our method is general and can be applied to a wide range of model selection problems in biological and epidemiological systems with intractable likelihood functions.
Collapse
Affiliation(s)
- Camelia R Walker
- School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia; ACEMS, School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia.
| | - Andrew J Black
- School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia; ACEMS, School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia
| | - Joshua V Ross
- School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia; ACEMS, School of Mathematical Sciences, University of Adelaide, Adelaide, SA 5005, Australia
| |
Collapse
|
27
|
Tiberi S, Walsh M, Cavallaro M, Hebenstreit D, Finkenstädt B. Bayesian inference on stochastic gene transcription from flow cytometry data. Bioinformatics 2019; 34:i647-i655. [PMID: 30423089 PMCID: PMC6129284 DOI: 10.1093/bioinformatics/bty568] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Motivation Transcription in single cells is an inherently stochastic process as mRNA levels vary greatly between cells, even for genetically identical cells under the same experimental and environmental conditions. We present a stochastic two-state switch model for the population of mRNA molecules in single cells where genes stochastically alternate between a more active ON state and a less active OFF state. We prove that the stationary solution of such a model can be written as a mixture of a Poisson and a Poisson-beta probability distribution. This finding facilitates inference for single cell expression data, observed at a single time point, from flow cytometry experiments such as FACS or fluorescence in situ hybridization (FISH) as it allows one to sample directly from the equilibrium distribution of the mRNA population. We hence propose a Bayesian inferential methodology using a pseudo-marginal approach and a recent approximation to integrate over unobserved states associated with measurement error. Results We provide a general inferential framework which can be widely used to study transcription in single cells from the kind of data arising in flow cytometry experiments. The approach allows us to separate between the intrinsic stochasticity of the molecular dynamics and the measurement noise. The methodology is tested in simulation studies and results are obtained for experimental multiple single cell expression data from FISH flow cytometry experiments. Availability and implementation All analyses were implemented in R. Source code and the experimental data are available at https://github.com/SimoneTiberi/Bayesian-inference-on-stochastic-gene-transcription-from-flow-cytometry-data. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Simone Tiberi
- Institute of Molecular Life Sciences, University of Zürich, Zürich, Switzerland.,Swiss Institue of Bioinformatics, University of Zürich, Zürich, Switzerland.,Department of Statistics, University of Warwick, Coventry, UK
| | - Mark Walsh
- School of Life Sciences, University of Warwick, Coventry, UK
| | - Massimo Cavallaro
- Department of Statistics, University of Warwick, Coventry, UK.,School of Life Sciences, University of Warwick, Coventry, UK
| | | | | |
Collapse
|
28
|
Inbreeding reduces long-term growth of Alpine ibex populations. Nat Ecol Evol 2019; 3:1359-1364. [PMID: 31477848 DOI: 10.1038/s41559-019-0968-1] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2018] [Accepted: 07/26/2019] [Indexed: 11/08/2022]
Abstract
Many studies document negative inbreeding effects on individuals, and conservation efforts to preserve rare species routinely employ strategies to reduce inbreeding. Despite this, there are few clear examples in nature of inbreeding decreasing the growth rates of populations, and the extent of population-level effects of inbreeding in the wild remains controversial. Here, we take advantage of a long-term dataset of 26 reintroduced Alpine ibex (Capra ibex ibex) populations spanning nearly 100 years to show that inbreeding substantially reduced per capita population growth rates, particularly for populations in harsher environments. Populations with high average inbreeding (F ≈ 0.2) had population growth rates reduced by 71% compared with populations with no inbreeding. Our results show that inbreeding can have long-term demographic consequences even when environmental variation is large and deleterious alleles may have been purged during bottlenecks. Thus, efforts to guard against inbreeding effects in populations of endangered species have not been misplaced.
Collapse
|
29
|
Golightly A, Bradley E, Lowe T, Gillespie CS. Correlated pseudo-marginal schemes for time-discretised stochastic kinetic models. Comput Stat Data Anal 2019. [DOI: 10.1016/j.csda.2019.01.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
30
|
Affiliation(s)
- Jaewoo Park
- Department of Statistics, The Pennsylvania State University, University Park, PA
| | - Murali Haran
- Department of Statistics, The Pennsylvania State University, University Park, PA
| |
Collapse
|
31
|
Chen Z, Doss H. Inference for the Number of Topics in the Latent Dirichlet Allocation Model via Bayesian Mixture Modeling. J Comput Graph Stat 2019. [DOI: 10.1080/10618600.2018.1558063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Zhe Chen
- Department of Statistics, University of Florida, Gainesville, FL
| | - Hani Doss
- Department of Statistics, University of Florida, Gainesville, FL
| |
Collapse
|
32
|
Williams JP, Hannig J. Nonpenalized variable selection in high-dimensional linear model settings via generalized fiducial inference. Ann Stat 2019. [DOI: 10.1214/18-aos1733] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
33
|
Abstract
Summary
We consider the problem of approximating the product of $n$ expectations with respect to a common probability distribution $\mu$. Such products routinely arise in statistics as values of the likelihood in latent variable models. Motivated by pseudo-marginal Markov chain Monte Carlo schemes, we focus on unbiased estimators of such products. The standard approach is to sample $N$ particles from $\mu$ and assign each particle to one of the expectations; this is wasteful and typically requires the number of particles to grow quadratically with the number of expectations. We propose an alternative estimator that approximates each expectation using most of the particles while preserving unbiasedness, which is computationally more efficient when the cost of simulations greatly exceeds the cost of likelihood evaluations. We carefully study the properties of our proposed estimator, showing that in latent variable contexts it needs only ${O} (n)$ particles to match the performance of the standard approach with ${O}(n^{2})$ particles. We demonstrate the procedure on two latent variable examples from approximate Bayesian computation and single-cell gene expression analysis, observing computational gains by factors of about 25 and 450, respectively.
Collapse
Affiliation(s)
- A Lee
- School of Mathematics, University of Bristol, University Walk, Bristol BS8 1TW, UK
| | - S Tiberi
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - G Zanella
- Department of Decision Sciences, BIDSA and IGIER, Bocconi University, Via Roentgen 1, 20136 Milan, Italy
| |
Collapse
|
34
|
Quiroz M, Villani M, Kohn R, Tran MN, Dang KD. Subsampling MCMC - an Introduction for the Survey Statistician. SANKHYA A 2018. [DOI: 10.1007/s13171-018-0153-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
35
|
|
36
|
Wijeyakulasuriya DA, Hanks EM, Shaby BA, Cross PC. Extreme Value-Based Methods for Modeling Elk Yearly Movements. JOURNAL OF AGRICULTURAL, BIOLOGICAL AND ENVIRONMENTAL STATISTICS 2018. [DOI: 10.1007/s13253-018-00342-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
37
|
|
38
|
Stoehr J, Benson A, Friel N. Noisy Hamiltonian Monte Carlo for Doubly Intractable Distributions. J Comput Graph Stat 2018. [DOI: 10.1080/10618600.2018.1506346] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Affiliation(s)
- Julien Stoehr
- School of Mathematics and Statistics, University College Dublin and Insight Centre for Data Analytics, Dublin, Ireland
| | - Alan Benson
- School of Mathematics and Statistics, University College Dublin and Insight Centre for Data Analytics, Dublin, Ireland
| | - Nial Friel
- School of Mathematics and Statistics, University College Dublin and Insight Centre for Data Analytics, Dublin, Ireland
| |
Collapse
|
39
|
Affiliation(s)
- Jaewoo Park
- Department of Statistics, Pennsylvania State University, Pennsylvania, PA
| | - Murali Haran
- Department of Statistics, Pennsylvania State University, Pennsylvania, PA
| |
Collapse
|
40
|
Koskela J, Jenkins PA, Spanò D. Bayesian non-parametric inference for $\Lambda$-coalescents: Posterior consistency and a parametric method. BERNOULLI 2018. [DOI: 10.3150/16-bej923] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
41
|
|
42
|
da Silva FM, Miño CI, Izbicki R, Del Lama SN. Considerations for monitoring population trends of colonial waterbirds using the effective number of breeders and census estimates. Ecol Evol 2018; 8:8088-8101. [PMID: 30250686 PMCID: PMC6144984 DOI: 10.1002/ece3.4347] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2017] [Revised: 04/04/2018] [Accepted: 06/15/2018] [Indexed: 11/24/2022] Open
Abstract
Detecting trends in population size fluctuations is a major focus in ecology, evolution, and conservation biology. Populations of colonial waterbirds have been monitored using demographic approaches to determine annual census size (Na). We propose the addition of genetic estimates of the effective number of breeders (Nb) as indirect measures of the risk of loss of genetic diversity to improve the evaluation of demographics and increase the accuracy of trend estimates in breeding colonies. Here, we investigated which methods of the estimation of Nb are more precise under conditions of moderate genetic diversity, limited sample sizes and few microsatellite loci, as often occurs with natural populations. We used the wood stork as a model species and we offered a workflow that researchers can follow for monitoring bird breeding colonies. Our approach started with simulations using five estimators of Nb and the theoretical results were validated with empirical data collected from breeding colonies settled in the Brazilian Pantanal wetland. In parallel, we estimated census size using a corrected method based on counting active nests. Both in simulations and in natural populations, the approximate Bayesian computation (ABC) and sibship assignment (SA) methods yielded more precise estimates than the linkage disequilibrium, heterozygosity excess, and molecular coancestry methods. In particular, the ABC method performed best with few loci and small sample sizes, while the other estimators required larger sample sizes and at least 13 loci to not underestimate Nb. Moreover, according to our Nb/Na estimates (values were often ≤0.1), the wood stork colonies evaluated could be facing the loss of genetic diversity. We demonstrate that the combination of genetic and census estimates is a useful approach for monitoring natural breeding bird populations. This methodology has been recommended for populations of rare species or with a known history of population decline to support conservation efforts.
Collapse
Affiliation(s)
- Fagner M. da Silva
- Departamento de Genética e EvoluçãoUniversidade Federal de São CarlosSão CarlosSão PauloBrazil
| | - Carolina I. Miño
- Instituto de Biología Subtropical (IBS)Universidad Nacional de MisionesCONICETPosadasMisionesArgentina
| | - Rafael Izbicki
- Departamento de EstatísticaUniversidade Federal de São CarlosSão CarlosSão PauloBrazil
| | - Silvia N. Del Lama
- Departamento de Genética e EvoluçãoUniversidade Federal de São CarlosSão CarlosSão PauloBrazil
| |
Collapse
|
43
|
Deligiannidis G, Doucet A, Pitt MK. The correlated pseudomarginal method. J R Stat Soc Series B Stat Methodol 2018. [DOI: 10.1111/rssb.12280] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
44
|
Tak H, Meng XL, van Dyk DA. A Repelling–Attracting Metropolis Algorithm for Multimodality. J Comput Graph Stat 2018. [DOI: 10.1080/10618600.2017.1415911] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Hyungsuk Tak
- Statistical and Applied Mathematical Sciences Institute, Durham, NC
| | - Xiao-Li Meng
- Department of Statistics, Harvard University, Cambridge, MA
| | - David A. van Dyk
- Statistics Section, Department of Mathematics, Imperial College London, London, UK
| |
Collapse
|
45
|
Affiliation(s)
- Matias Quiroz
- Australian School of Business, University of New South Wales, Sydney, Australia
| | - Robert Kohn
- Australian School of Business, University of New South Wales, Sydney, Australia
| | - Mattias Villani
- Division of Statistics and Machine Learning, Department of Computer and Information Science, Linköping University, Linköping, Sweden
| | - Minh-Ngoc Tran
- Discipline of Business Analytics, University of Sydney, Sydney, Australia
| |
Collapse
|
46
|
Elleouet JS, Aitken SN. Exploring Approximate Bayesian Computation for inferring recent demographic history with genomic markers in nonmodel species. Mol Ecol Resour 2018; 18:525-540. [DOI: 10.1111/1755-0998.12758] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2017] [Accepted: 01/16/2018] [Indexed: 01/11/2023]
Affiliation(s)
- Joane S. Elleouet
- Department of Forest and Conservation Sciences; Faculty of Forestry; University of British Columbia; Vancouver BC Canada
| | - Sally N. Aitken
- Department of Forest and Conservation Sciences; Faculty of Forestry; University of British Columbia; Vancouver BC Canada
| |
Collapse
|
47
|
Drovandi CC, Moores MT, Boys RJ. Accelerating pseudo-marginal MCMC using Gaussian processes. Comput Stat Data Anal 2018. [DOI: 10.1016/j.csda.2017.09.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
48
|
McKinley TJ, Vernon I, Andrianakis I, McCreesh N, Oakley JE, Nsubuga RN, Goldstein M, White RG. Approximate Bayesian Computation and Simulation-Based Inference for Complex Stochastic Epidemic Models. Stat Sci 2018. [DOI: 10.1214/17-sts618] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
49
|
Inferring sex-specific demographic history from SNP data. PLoS Genet 2018; 14:e1007191. [PMID: 29385127 PMCID: PMC5809101 DOI: 10.1371/journal.pgen.1007191] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Revised: 02/12/2018] [Accepted: 01/08/2018] [Indexed: 12/04/2022] Open
Abstract
The relative female and male contributions to demography are of great importance to better understand the history and dynamics of populations. While earlier studies relied on uniparental markers to investigate sex-specific questions, the increasing amount of sequence data now enables us to take advantage of tens to hundreds of thousands of independent loci from autosomes and the X chromosome. Here, we develop a novel method to estimate effective sex ratios or ESR (defined as the female proportion of the effective population) from allele count data for each branch of a rooted tree topology that summarizes the history of the populations of interest. Our method relies on Kimura’s time-dependent diffusion approximation for genetic drift, and is based on a hierarchical Bayesian model to integrate over the allele frequencies along the branches. We show via simulations that parameters are inferred robustly, even under scenarios that violate some of the model assumptions. Analyzing bovine SNP data, we infer a strongly female-biased ESR in both dairy and beef cattle, as expected from the underlying breeding scheme. Conversely, we observe a strongly male-biased ESR in early domestication times, consistent with an easier taming and management of cows, and/or introgression from wild auroch males, that would both cause a relative increase in male effective population size. In humans, analyzing a subsample of non-African populations, we find a male-biased ESR in Oceanians that may reflect complex marriage patterns in Aboriginal Australians. Because our approach relies on allele count data, it may be applied on a wide range of species. The history of populations and their social organization is often intricate due to breeding structures, migration patterns or population bottlenecks. Estimation of the female proportion of the effective population (sex ratio) is therefore important to better understand this underlying social structure and dynamics. This question has been mainly investigated so far by comparing genetic variation of mitochondrial DNA and the Y chromosome, two uniparentally inherited markers that reflect the demographic history of females and males, respectively. To overcome the intrinsic limitations of these genetic markers, and to take advantage of the increasing amount of sequence data, we propose a new approach that uses large numbers of independent polymorphisms from autosomes and the X chromosome to estimate sex ratios, throughout the history of populations. This method allows us to confirm a strongly female-biased sex ratio in modern dairy and beef cattle breeds. Yet, we find a strongly male-biased sex ratio during domestication times, consistent with an easier taming and management of cows, and/or introgression from wild auroch males. Analyzing human data from a sample of non-African populations, we find a male bias in Oceanians, possibly indicating complex marriage patterns among Aboriginal Australian groups.
Collapse
|
50
|
Karabatsos G, Leisen F. An approximate likelihood perspective on ABC methods. STATISTICS SURVEYS 2018. [DOI: 10.1214/18-ss120] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|