1
|
Raymond M, Descary MH, Beaulac C, Larribe F. Constructing ancestral recombination graphs through reinforcement learning. Front Genet 2025; 16:1569358. [PMID: 40364947 PMCID: PMC12069460 DOI: 10.3389/fgene.2025.1569358] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2025] [Accepted: 04/16/2025] [Indexed: 05/15/2025] Open
Abstract
Introduction Over the years, many approaches have been proposed to build ancestral recombination graphs (ARGs), graphs used to represent the genetic relationship between individuals. Among these methods, many rely on the assumption that the most likely graph is among those with the fewest recombination events. In this paper, we propose a new approach to build maximum parsimony ARGs: Reinforcement Learning (RL). Methods We exploit the similarities between finding the shortest path between a set of genetic sequences and their most recent common ancestor and finding the shortest path between the entrance and exit of a maze, a classic RL problem. In the maze problem, the learner, called the agent, must learn the directions to take in order to escape as quickly as possible, whereas in our problem, the agent must learn the actions to take between coalescence, mutation, and recombination in order to reach the most recent common ancestor as quickly as possible. Results Our results show that RL can be used to build ARGs with as few recombination events as those built with a heuristic algorithm optimized to build minimal ARGs, and sometimes even fewer. Moreover, our method allows to build a distribution of ARGs with few recombination events for a given sample, and can also generalize learning to new samples not used during the learning process. Discussion RL is a promising and innovative approach to build ARGs. By learning to construct ARGs just from the data, our method differs from conventional methods that rely on heuristic rules or complex theoretical models.
Collapse
Affiliation(s)
- Mélanie Raymond
- Department of Mathematics, Université du Québec à Montréal, Montréal, QC, Canada
| | | | | | | |
Collapse
|
2
|
Cousins T, Scally A, Durbin R. A structured coalescent model reveals deep ancestral structure shared by all modern humans. Nat Genet 2025; 57:856-864. [PMID: 40102687 PMCID: PMC11985351 DOI: 10.1038/s41588-025-02117-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2024] [Accepted: 02/05/2025] [Indexed: 03/20/2025]
Abstract
Understanding the history of admixture events and population size changes leading to modern humans is central to human evolutionary genetics. Here we introduce a coalescence-based hidden Markov model, cobraa, that explicitly represents an ancestral population split and rejoin, and demonstrate its application on simulated and real data across multiple species. Using cobraa, we present evidence for an extended period of structure in the history of all modern humans, in which two ancestral populations that diverged ~1.5 million years ago came together in an admixture event ~300 thousand years ago, in a ratio of ~80:20%. Immediately after their divergence, we detect a strong bottleneck in the major ancestral population. We inferred regions of the present-day genome derived from each ancestral population, finding that material from the minority correlates strongly with distance to coding sequence, suggesting it was deleterious against the majority background. Moreover, we found a strong correlation between regions of majority ancestry and human-Neanderthal or human-Denisovan divergence, suggesting the majority population was also ancestral to those archaic humans.
Collapse
Affiliation(s)
- Trevor Cousins
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Aylwyn Scally
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Richard Durbin
- Department of Genetics, University of Cambridge, Cambridge, UK.
| |
Collapse
|
3
|
Ishigohoka J, Liedvogel M. High-recombining genomic regions affect demography inference based on ancestral recombination graphs. Genetics 2025; 229:iyaf004. [PMID: 39790013 PMCID: PMC11912872 DOI: 10.1093/genetics/iyaf004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Accepted: 12/23/2024] [Indexed: 01/12/2025] Open
Abstract
Multiple methods of demography inference are based on the ancestral recombination graph. This powerful approach uses observed mutations to model local genealogies changing along chromosomes by historical recombination events. However, inference of underlying genealogies is difficult in regions with high recombination rate relative to mutation rate due to the lack of mutations representing genealogies. Despite the prevalence of high-recombining genomic regions in some organisms, such as birds, its impact on demography inference based on ancestral recombination graphs has not been well studied. Here, we use population genomic simulations to investigate the impact of high-recombining regions on demography inference based on ancestral recombination graphs. We demonstrate that inference of effective population size and the time of population split events is systematically affected when high-recombining regions cover wide breadths of the chromosomes. Excluding high-recombining genomic regions can practically mitigate this impact, and population genomic inference of recombination maps is informative in defining such regions although the estimated values of local recombination rate can be biased. Finally, we confirm the relevance of our findings in empirical analysis by contrasting demography inferences applied for a bird species, the Eurasian blackcap (Sylvia atricapilla), using different parts of the genome with high and low recombination rates. Our results suggest that demography inference methods based on ancestral recombination graphs should be carried out with caution when applied in species whose genomes contain long stretches of high-recombining regions.
Collapse
Affiliation(s)
- Jun Ishigohoka
- Max Planck Research Group Behavioural Genomics, Max Planck Institute for Evolutionary Biology, August-Thienemann-Straße 2, Plön 24306, Germany
| | - Miriam Liedvogel
- Max Planck Research Group Behavioural Genomics, Max Planck Institute for Evolutionary Biology, August-Thienemann-Straße 2, Plön 24306, Germany
- Institute of Avian Research, An der Vogelwarte 21, Wilhelmshaven 26386, Germany
- Department of Biology and Environmental Sciences, Carl von Ossietzky Universität Oldenburg, Ammerländer Heerstraße 114-118, Oldenburg 26129, Germany
| |
Collapse
|
4
|
Ragsdale AP. Archaic introgression and the distribution of shared variation under stabilizing selection. PLoS Genet 2025; 21:e1011623. [PMID: 40163477 PMCID: PMC11964463 DOI: 10.1371/journal.pgen.1011623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2024] [Revised: 04/02/2025] [Accepted: 02/14/2025] [Indexed: 04/02/2025] Open
Abstract
Many phenotypic traits are under stabilizing selection, which maintains a population's mean phenotypic value near some optimum. The dynamics of traits and trait architectures under stabilizing selection have been extensively studied for single populations at steady state. However, natural populations are seldom at steady state and are often structured in some way. Admixture and introgression events may be common, including over human evolutionary history. Because stabilizing selection results in selection against the minor allele at a trait-affecting locus, alleles from the minor parental ancestry will be selected against after admixture. We show that the site-frequency spectrum can be used to model the genetic architecture of such traits, allowing for the study of trait architecture dynamics in complex multi-population settings. We use a simple deterministic two-locus model to predict the reduction of introgressed ancestry around trait-contributing loci. From this and individual-based simulations, we show that introgressed-ancestry is depleted around such loci. When introgression between two diverged populations occurs in both directions, as has been inferred between humans and Neanderthals, the locations of such regions with depleted introgressed ancestry will tend to be shared across populations. We argue that stabilizing selection for shared phenotypic optima may explain recent observations in which regions of depleted human-introgressed ancestry in the Neanderthal genome overlap with Neanderthal-ancestry deserts in humans.
Collapse
Affiliation(s)
- Aaron P Ragsdale
- Department of Integrative Biology, University of Wisconsin–Madison, Madison, Wisconsin, United States of America
| |
Collapse
|
5
|
Bisschop G, Kelleher J, Ralph P. Likelihoods for a general class of ARGs under the SMC. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.02.24.639977. [PMID: 40060524 PMCID: PMC11888268 DOI: 10.1101/2025.02.24.639977] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 03/22/2025]
Abstract
Ancestral recombination graphs (ARGs) are the focus of much ongoing research interest. Recent progress in inference has made ARG-based approaches feasible across of range of applications, and many new methods using inferred ARGs as input have appeared. This progress on the long-standing problem of ARG inference has proceeded in two distinct directions. First, the Bayesian inference of ARGs under the Sequentially Markov Coalescent (SMC), is now practical for tens-to-hundreds of samples. Second, approximate models and heuristics can now scale to sample sizes two to three orders of magnitude larger. Although these heuristic methods are reasonably accurate under many metrics, one significant drawback is that the ARGs they estimate do not have the topological properties required to compute a likelihood under models such as the SMC under present-day formulations. In particular, heuristic inference methods typically do not estimate precise details about recombination events, which are currently required to compute a likelihood. In this paper we present a backwards-time formulation of the SMC and derive a straightforward definition of the likelihood of a general class of ARG under this model. We show that this formulation does not require precise details of recombination events to be estimated, and is robust to the presence of polytomies. We discuss the possibilities for inference that this opens.
Collapse
|
6
|
Tajima Y, Vargas CDM, Ito K, Wang W, Luo JD, Xing J, Kuru N, Machado LC, Siepel A, Carroll TS, Jarvis ED, Darnell RB. A humanized NOVA1 splicing factor alters mouse vocal communications. Nat Commun 2025; 16:1542. [PMID: 39966351 PMCID: PMC11836289 DOI: 10.1038/s41467-025-56579-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Accepted: 01/21/2025] [Indexed: 02/20/2025] Open
Abstract
NOVA1, a neuronal RNA-binding protein expressed in the central nervous system, is essential for survival in mice and normal development in humans. A single amino acid change (I197V) in NOVA1's second RNA binding domain is unique to modern humans. To study its physiological effects, we generated mice carrying the human-specific I197V variant (Nova1hu/hu) and analyzed the molecular and behavioral consequences. While the I197V substitution had minimal impact on NOVA1's RNA binding capacity, it led to specific effects on alternative splicing, and CLIP revealed multiple binding peaks in mouse brain transcripts involved in vocalization. These molecular findings were associated with behavioral differences in vocalization patterns in Nova1hu/hu mice as pups and adults. Our findings suggest that this human-specific NOVA1 substitution may have been part of an ancient evolutionary selective sweep in a common ancestral population of Homo sapiens, possibly contributing to the development of spoken language through differential RNA regulation during brain development.
Collapse
Affiliation(s)
- Yoko Tajima
- The Laboratory of Molecular Neuro-oncology, The Rockefeller University, New York, NY, USA.
| | - César D M Vargas
- The Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
| | - Keiichi Ito
- The Laboratory of Biochemistry and Molecular Biology, The Rockefeller University, New York, NY, USA
| | - Wei Wang
- Bioinformatics Resource Center, The Rockefeller University, New York, NY, USA
| | - Ji-Dung Luo
- Bioinformatics Resource Center, The Rockefeller University, New York, NY, USA
| | - Jiawei Xing
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, NY, USA
| | - Nurdan Kuru
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, NY, USA
| | - Luiz Carlos Machado
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, NY, USA
| | - Adam Siepel
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, NY, USA
| | - Thomas S Carroll
- Bioinformatics Resource Center, The Rockefeller University, New York, NY, USA
| | - Erich D Jarvis
- The Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- Howard Hughes Medical Institute, The Rockefeller University, New York, NY, USA
| | - Robert B Darnell
- The Laboratory of Molecular Neuro-oncology, The Rockefeller University, New York, NY, USA.
- Howard Hughes Medical Institute, The Rockefeller University, New York, NY, USA.
| |
Collapse
|
7
|
Tagore D, Akey JM. Archaic hominin admixture and its consequences for modern humans. Curr Opin Genet Dev 2025; 90:102280. [PMID: 39577372 PMCID: PMC11770379 DOI: 10.1016/j.gde.2024.102280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Revised: 10/18/2024] [Accepted: 10/22/2024] [Indexed: 11/24/2024]
Abstract
As anatomically modern humans dispersed out of Africa, they encountered and mated with now extinct hominins, including Neanderthals and Denisovans. It is now well established that all non-African individuals derive approximately 2% of their genome from Neanderthal ancestors and individuals of Melanesian and Australian aboriginal ancestry inherited an additional 2%-5% of their genomes from Denisovan ancestors. Attention has started to shift from documenting amounts of archaic admixture and identifying introgressed segments to understanding their molecular, phenotypic, and evolutionary consequences and refining models of human history. Here, we review recent insights into admixture between modern and archaic humans, emphasizing methodological innovations and the functional and phenotypic effects Neanderthal and Denisovan sequences have in contemporary individuals.
Collapse
Affiliation(s)
- Debashree Tagore
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton 08540, USA. https://twitter.com/@TagoreDebashree
| | - Joshua M Akey
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton 08540, USA.
| |
Collapse
|
8
|
Higashino A, Nakamura K, Osada N. Population Genomics of Japanese Macaques (Macaca fuscata): Insights Into Deep Population Divergence and Multiple Merging Histories. Genome Biol Evol 2025; 17:evaf001. [PMID: 39763347 PMCID: PMC11735745 DOI: 10.1093/gbe/evaf001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Revised: 12/19/2024] [Accepted: 12/27/2024] [Indexed: 01/18/2025] Open
Abstract
The influence of long-term climatic changes such as glacial cycles on the history of living organisms has been a subject of research for decades, but the detailed population dynamics during the environmental fluctuations and their effects on genetic diversity and genetic load are not well understood on a genome-wide scale. The Japanese macaque (Macaca fuscata) is a unique primate adapted to the cold environments of the Japanese archipelago. Despite the past intensive research for the Japanese macaque population genetics, the genetic background of Japanese macaques at the whole-genome level has been limited to a few individuals, and the comprehensive demographic history and genetic differentiation of Japanese macaques have been underexplored. We conducted whole-genome sequencing of 64 Japanese macaque individuals from 5 different regions, revealing significant genetic differentiation and functional variant diversity across populations. In particular, Japanese macaques have low genetic diversity and harbor many shared and population-specific gene loss, which might contribute to population-specific phenotypes. Our estimation of population demography using phased haplotypes suggested that, after the strong population bottleneck shared among all populations around 400 to 500 kya, the divergence among populations initiated around 150 to 200 kya, but there has been the time with strong gene flow between some populations after the split, indicating multiple population split and merge events probably due to habitat fragmentation and fusion during glacial cycles. These findings not only present a complex population history of Japanese macaques but also enhance their value as research models, particularly in neuroscience and behavioral studies. This comprehensive genomic analysis sheds light on the adaptation and evolution of Japanese macaques, contributing valuable insights to both evolutionary biology and biomedical research.
Collapse
Affiliation(s)
- Atsunori Higashino
- Center for the Evolutionary Origins of Human Behavior, Kyoto University, Inuyama, Aichi 484-8506, Japan
| | - Katsuki Nakamura
- Center for the Evolutionary Origins of Human Behavior, Kyoto University, Inuyama, Aichi 484-8506, Japan
| | - Naoki Osada
- Faculty of Information Science and Technology, Hokkaido University, Sapporo, Hokkaido 060-0814, Japan
| |
Collapse
|
9
|
Nielsen R, Vaughn AH, Deng Y. Inference and applications of ancestral recombination graphs. Nat Rev Genet 2025; 26:47-58. [PMID: 39349760 PMCID: PMC12036574 DOI: 10.1038/s41576-024-00772-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/16/2024] [Indexed: 12/15/2024]
Abstract
Ancestral recombination graphs (ARGs) summarize the complex genealogical relationships between individuals represented in a sample of DNA sequences. Their use is currently revolutionizing the field of population genetics and is leading to the development of powerful new methods to elucidate individual and population genetic processes, including population size history, migration, admixture, recombination, mutation and selection. In this Review, we introduce the readers to the structure of ARGs and discuss how they relate to processes such as recombination and genetic drift. We explore differences and similarities between methods of estimating ARGs and provide concrete illustrative examples of how ARGs can be used to elucidate population-level processes.
Collapse
Affiliation(s)
- Rasmus Nielsen
- Department of Integrative Biology and Department of Statistics, UC Berkeley, Berkeley, CA, USA.
- GLOBE Institute, University of Copenhagen, Copenhagen, Denmark.
- Center for Computational Biology, UC Berkeley, Berkeley, CA, USA.
| | - Andrew H Vaughn
- Center for Computational Biology, UC Berkeley, Berkeley, CA, USA
| | - Yun Deng
- Center for Computational Biology, UC Berkeley, Berkeley, CA, USA
| |
Collapse
|
10
|
Villanea FA, Peede D, Kaufman EJ, Añorve-Garibay V, Chevy ET, Villa-Islas V, Witt KE, Zeloni R, Marnetto D, Moorjani P, Jay F, Valdmanis PN, Ávila-Arcos MC, Huerta-Sánchez E. The MUC19 gene in Denisovans, Neanderthals, and Modern Humans: An Evolutionary History of Recurrent Introgression and Natural Selection. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.09.25.559202. [PMID: 37808839 PMCID: PMC10557577 DOI: 10.1101/2023.09.25.559202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/10/2023]
Abstract
We study the gene MUC19, for which modern humans carry a Denisovan-like haplotype. MUC19 is a mucin, a glycoprotein that forms gels with various biological functions. We find the diagnostic variants for the Denisovan-like MUC19 haplotype at high frequencies in admixed Latin American individuals among global populations, and at highest frequency in 23 ancient Indigenous American individuals, all predating population admixture with Europeans and Africans. We find that the Denisovan-like MUC19 haplotype carries a higher copy number of a 30 base-pair variable number tandem repeat, and that copy numbers of this repeat are exceedingly high in American populations and are under positive selection. This study provides the first example of positive selection acting on archaic alleles at coding sites and VNTRs. Finally, we find that some Neanderthals carry the Denisovan-like MUC19 haplotype, and that it was likely introgressed into human populations through Neanderthal introgression rather than Denisovan introgression.
Collapse
Affiliation(s)
| | - David Peede
- Department of Ecology, Evolution, and Organismal Biology, Brown University
- Center for Computational Molecular Biology, Brown University
- Institute at Brown for Environment and Society, Brown University
| | - Eli J Kaufman
- Division of Medical Genetics, Department of Medicine, University of Washington School of Medicine
| | - Valeria Añorve-Garibay
- Center for Computational Molecular Biology, Brown University
- International Laboratory for Human Genome Research, Universidad Nacional Autónoma de México
| | | | - Viridiana Villa-Islas
- International Laboratory for Human Genome Research, Universidad Nacional Autónoma de México
| | - Kelsey E Witt
- Center for Human Genetics and Department of Genetics and Biochemistry, Clemson University
| | - Roberta Zeloni
- Department of Neurosciences "Rita Levi Montalcini", University of Turin
| | - Davide Marnetto
- Department of Neurosciences "Rita Levi Montalcini", University of Turin
| | - Priya Moorjani
- Department of Molecular and Cell Biology, University of California, Berkeley
- Center for Computational Biology, University of California, Berkeley
| | - Flora Jay
- Université Paris-Saclay, CNRS, INRIA, Laboratoire Interdisciplinaire des Sciences du Numérique, 91400, Orsay, France
| | - Paul N Valdmanis
- Division of Medical Genetics, Department of Medicine, University of Washington School of Medicine
| | - María C Ávila-Arcos
- International Laboratory for Human Genome Research, Universidad Nacional Autónoma de México
| | - Emilia Huerta-Sánchez
- Department of Ecology, Evolution, and Organismal Biology, Brown University
- Center for Computational Molecular Biology, Brown University
- Smurfit Institute of Genetics, Trinity College Dublin, Dublin 2, Ireland
| |
Collapse
|
11
|
Witt KE, Villanea FA. Computational Genomics and Its Applications to Anthropological Questions. AMERICAN JOURNAL OF BIOLOGICAL ANTHROPOLOGY 2024; 186 Suppl 78:e70010. [PMID: 40071816 PMCID: PMC11898561 DOI: 10.1002/ajpa.70010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/24/2024] [Revised: 10/14/2024] [Accepted: 12/19/2024] [Indexed: 03/15/2025]
Abstract
The advent of affordable genome sequencing and the development of new computational tools have established a new era of genomic knowledge. Sequenced human genomes number in the tens of thousands, including thousands of ancient human genomes. The abundance of data has been met with new analysis tools that can be used to understand populations' demographic and evolutionary histories. Thus, a variety of computational methods now exist that can be leveraged to answer anthropological questions. This includes novel likelihood and Bayesian methods, machine learning techniques, and a vast array of population simulators. These computational tools provide powerful insights gained from genomic datasets, although they are generally inaccessible to those with less computational experience. Here, we outline the theoretical workings behind computational genomics methods, limitations and other considerations when applying these computational methods, and examples of how computational methods have already been applied to anthropological questions. We hope this review will empower other anthropologists to utilize these powerful tools in their own research.
Collapse
Affiliation(s)
- Kelsey E. Witt
- Department of Genetics and Biochemistry and Center for Human GeneticsClemson UniversityClemsonSouth CarolinaUSA
| | | |
Collapse
|
12
|
Schwob MR, Hooten MB, Narasimhan V. Composite dyadic models for spatio-temporal data. Biometrics 2024; 80:ujae107. [PMID: 39360904 DOI: 10.1093/biomtc/ujae107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Revised: 08/10/2024] [Accepted: 09/11/2024] [Indexed: 10/05/2024]
Abstract
Mechanistic statistical models are commonly used to study the flow of biological processes. For example, in landscape genetics, the aim is to infer spatial mechanisms that govern gene flow in populations. Existing statistical approaches in landscape genetics do not account for temporal dependence in the data and may be computationally prohibitive. We infer mechanisms with a Bayesian hierarchical dyadic model that scales well with large data sets and that accounts for spatial and temporal dependence. We construct a fully connected network comprising spatio-temporal data for the dyadic model and use normalized composite likelihoods to account for the dependence structure in space and time. We develop a dyadic model to account for physical mechanisms commonly found in physical-statistical models and apply our methods to ancient human DNA data to infer the mechanisms that affected human movement in Bronze Age Europe.
Collapse
Affiliation(s)
- Michael R Schwob
- Department of Statistics and Data Sciences, The University of Texas at Austin, Austin, TX 78712, United States
| | - Mevin B Hooten
- Department of Statistics and Data Sciences, The University of Texas at Austin, Austin, TX 78712, United States
| | - Vagheesh Narasimhan
- Department of Statistics and Data Sciences, The University of Texas at Austin, Austin, TX 78712, United States
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX 78712, United States
- Department of Population Health, Dell Medical School, Austin, TX 78712, United States
| |
Collapse
|
13
|
Sellinger T, Johannes F, Tellier A. Improved inference of population histories by integrating genomic and epigenomic data. eLife 2024; 12:RP89470. [PMID: 39264367 PMCID: PMC11392530 DOI: 10.7554/elife.89470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/13/2024] Open
Abstract
With the availability of high-quality full genome polymorphism (SNPs) data, it becomes feasible to study the past demographic and selective history of populations in exquisite detail. However, such inferences still suffer from a lack of statistical resolution for recent, for example bottlenecks, events, and/or for populations with small nucleotide diversity. Additional heritable (epi)genetic markers, such as indels, transposable elements, microsatellites, or cytosine methylation, may provide further, yet untapped, information on the recent past population history. We extend the Sequential Markovian Coalescent (SMC) framework to jointly use SNPs and other hyper-mutable markers. We are able to (1) improve the accuracy of demographic inference in recent times, (2) uncover past demographic events hidden to SNP-based inference methods, and (3) infer the hyper-mutable marker mutation rates under a finite site model. As a proof of principle, we focus on demographic inference in Arabidopsis thaliana using DNA methylation diversity data from 10 European natural accessions. We demonstrate that segregating single methylated polymorphisms (SMPs) satisfy the modeling assumptions of the SMC framework, while differentially methylated regions (DMRs) are not suitable as their length exceeds that of the genomic distance between two recombination events. Combining SNPs and SMPs while accounting for site- and region-level epimutation processes, we provide new estimates of the glacial age bottleneck and post-glacial population expansion of the European A. thaliana population. Our SMC framework readily accounts for a wide range of heritable genomic markers, thus paving the way for next-generation inference of evolutionary history by combining information from several genetic and epigenetic markers.
Collapse
Affiliation(s)
- Thibaut Sellinger
- Professorship for Population Genetics, Department of Life Science Systems, Technical University of Munich, Munich, Germany
- Department of Environment and Biodiversity, Paris Lodron University of Salzburg, Salzburg, Austria
| | - Frank Johannes
- Professorship for Plant Epigenomics, Department of Molecular Life Sciences, Technical University of Munich, Freising, Germany
| | - Aurélien Tellier
- Professorship for Population Genetics, Department of Life Science Systems, Technical University of Munich, Munich, Germany
| |
Collapse
|
14
|
Wong Y, Ignatieva A, Koskela J, Gorjanc G, Wohns AW, Kelleher J. A general and efficient representation of ancestral recombination graphs. Genetics 2024; 228:iyae100. [PMID: 39013109 PMCID: PMC11373519 DOI: 10.1093/genetics/iyae100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Accepted: 06/05/2024] [Indexed: 07/18/2024] Open
Abstract
As a result of recombination, adjacent nucleotides can have different paths of genetic inheritance and therefore the genealogical trees for a sample of DNA sequences vary along the genome. The structure capturing the details of these intricately interwoven paths of inheritance is referred to as an ancestral recombination graph (ARG). Classical formalisms have focused on mapping coalescence and recombination events to the nodes in an ARG. However, this approach is out of step with some modern developments, which do not represent genetic inheritance in terms of these events or explicitly infer them. We present a simple formalism that defines an ARG in terms of specific genomes and their intervals of genetic inheritance, and show how it generalizes these classical treatments and encompasses the outputs of recent methods. We discuss nuances arising from this more general structure, and argue that it forms an appropriate basis for a software standard in this rapidly growing field.
Collapse
Affiliation(s)
- Yan Wong
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, UK
| | - Anastasia Ignatieva
- School of Mathematics and Statistics, University of Glasgow, Glasgow G12 8TA, UK
- Department of Statistics, University of Oxford, Oxford OX1 3LB, UK
| | - Jere Koskela
- School of Mathematics, Statistics and Physics, Newcastle University, Newcastle NE1 7RU, UK
- Department of Statistics, University of Warwick, Coventry CV4 7AL, UK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh EH25 9RG, UK
| | - Anthony W Wohns
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305-5101, USA
| | - Jerome Kelleher
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, UK
| |
Collapse
|
15
|
Pfennig A, Lachance J. The evolutionary fate of Neanderthal DNA in 30,780 admixed genomes with recent African-like ancestry. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.25.605203. [PMID: 39091830 PMCID: PMC11291122 DOI: 10.1101/2024.07.25.605203] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/04/2024]
Abstract
Following introgression, Neanderthal DNA was initially purged from non-African genomes, but the evolutionary fate of remaining introgressed DNA has not been explored yet. To fill this gap, we analyzed 30,780 admixed genomes with African-like ancestry from the All of Us research program, in which Neanderthal alleles encountered novel genetic backgrounds during the last 15 generations. Observed amounts of Neanderthal DNA approximately match expectations based on ancestry proportions, suggesting neutral evolution. Nevertheless, we identified genomic regions that have significantly less or more Neanderthal ancestry than expected and are associated with spermatogenesis, innate immunity, and other biological processes. We also identified three novel introgression desert-like regions in recently admixed genomes, whose genetic features are compatible with hybrid incompatibilities and intrinsic negative selection. Overall, we find that much of the remaining Neanderthal DNA in human genomes is not under strong selection, and complex evolutionary dynamics have shaped introgression landscapes in our species.
Collapse
Affiliation(s)
- Aaron Pfennig
- School of Biological Sciences, Georgia Institute of Technology, 950 Atlantic Dr, Atlanta, 30332, GA, USA
| | - Joseph Lachance
- School of Biological Sciences, Georgia Institute of Technology, 950 Atlantic Dr, Atlanta, 30332, GA, USA
| |
Collapse
|
16
|
Li L, Comi TJ, Bierman RF, Akey JM. Recurrent gene flow between Neanderthals and modern humans over the past 200,000 years. Science 2024; 385:eadi1768. [PMID: 38991054 DOI: 10.1126/science.adi1768] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 05/14/2024] [Indexed: 07/13/2024]
Abstract
Although it is well known that the ancestors of modern humans and Neanderthals admixed, the effects of gene flow on the Neanderthal genome are not well understood. We develop methods to estimate the amount of human-introgressed sequences in Neanderthals and apply it to whole-genome sequence data from 2000 modern humans and three Neanderthals. We estimate that Neanderthals have 2.5 to 3.7% human ancestry, and we leverage human-introgressed sequences in Neanderthals to revise estimates of Neanderthal ancestry in modern humans, show that Neanderthal population sizes were significantly smaller than previously estimated, and identify two distinct waves of modern human gene flow into Neanderthals. Our data provide insights into the genetic legacy of recurrent gene flow between modern humans and Neanderthals.
Collapse
Affiliation(s)
- Liming Li
- Department of Medical Genetics and Developmental Biology, School of Medicine, The Key Laboratory of Developmental Genes and Human Diseases, Ministry of Education, Southeast University, Nanjing 210009, China
- The Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA
| | - Troy J Comi
- The Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA
| | - Rob F Bierman
- The Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA
| | - Joshua M Akey
- The Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA
| |
Collapse
|
17
|
Wong Y, Ignatieva A, Koskela J, Gorjanc G, Wohns AW, Kelleher J. A general and efficient representation of ancestral recombination graphs. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.03.565466. [PMID: 37961279 PMCID: PMC10635123 DOI: 10.1101/2023.11.03.565466] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
As a result of recombination, adjacent nucleotides can have different paths of genetic inheritance and therefore the genealogical trees for a sample of DNA sequences vary along the genome. The structure capturing the details of these intricately interwoven paths of inheritance is referred to as an ancestral recombination graph (ARG). Classical formalisms have focused on mapping coalescence and recombination events to the nodes in an ARG. This approach is out of step with modern developments, which do not represent genetic inheritance in terms of these events or explicitly infer them. We present a simple formalism that defines an ARG in terms of specific genomes and their intervals of genetic inheritance, and show how it generalises these classical treatments and encompasses the outputs of recent methods. We discuss nuances arising from this more general structure, and argue that it forms an appropriate basis for a software standard in this rapidly growing field.
Collapse
Affiliation(s)
- Yan Wong
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, UK
| | - Anastasia Ignatieva
- School of Mathematics and Statistics, University of Glasgow, UK
- Department of Statistics, University of Oxford, UK
| | - Jere Koskela
- School of Mathematics, Statistics and Physics, Newcastle University, UK
- Department of Statistics, University of Warwick, UK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, UK
| | - Anthony W. Wohns
- Broad Institute of MIT and Harvard, Cambridge, USA
- Department of Genetics, Stanford University School of Medicine, Stanford, USA
| | - Jerome Kelleher
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, UK
| |
Collapse
|
18
|
Brandt DYC, Huber CD, Chiang CWK, Ortega-Del Vecchyo D. The Promise of Inferring the Past Using the Ancestral Recombination Graph. Genome Biol Evol 2024; 16:evae005. [PMID: 38242694 PMCID: PMC10834162 DOI: 10.1093/gbe/evae005] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 12/11/2023] [Accepted: 12/17/2023] [Indexed: 01/21/2024] Open
Abstract
The ancestral recombination graph (ARG) is a structure that represents the history of coalescent and recombination events connecting a set of sequences (Hudson RR. In: Futuyma D, Antonovics J, editors. Gene genealogies and the coalescent process. In: Oxford Surveys in Evolutionary Biology; 1991. p. 1 to 44.). The full ARG can be represented as a set of genealogical trees at every locus in the genome, annotated with recombination events that change the topology of the trees between adjacent loci and the mutations that occurred along the branches of those trees (Griffiths RC, Marjoram P. An ancestral recombination graph. In: Donnelly P, Tavare S, editors. Progress in population genetics and human evolution. Springer; 1997. p. 257 to 270.). Valuable insights can be gained into past evolutionary processes, such as demographic events or the influence of natural selection, by studying the ARG. It is regarded as the "holy grail" of population genetics (Hubisz M, Siepel A. Inference of ancestral recombination graphs using ARGweaver. In: Dutheil JY, editors. Statistical population genomics. New York, NY: Springer US; 2020. p. 231-266.) since it encodes the processes that generate all patterns of allelic and haplotypic variation from which all commonly used summary statistics in population genetic research (e.g. heterozygosity and linkage disequilibrium) can be derived. Many previous evolutionary inferences relied on summary statistics extracted from the genotype matrix. Evolutionary inferences using the ARG represent a significant advancement as the ARG is a representation of the evolutionary history of a sample that shows the past history of recombination, coalescence, and mutation events across a particular sequence. This representation in theory contains as much information, if not more, than the combination of all independent summary statistics that could be derived from the genotype matrix. Consistent with this idea, some of the first ARG-based analyses have proven to be more powerful than summary statistic-based analyses (Speidel L, Forest M, Shi S, Myers SR. A method for genome-wide genealogy estimation for thousands of samples. Nat Genet. 2019:51(9):1321 to 1329.; Stern AJ, Wilton PR, Nielsen R. An approximate full-likelihood method for inferring selection and allele frequency trajectories from DNA sequence data. PLoS Genet. 2019:15(9):e1008384.; Hubisz MJ, Williams AL, Siepel A. Mapping gene flow between ancient hominins through demography-aware inference of the ancestral recombination graph. PLoS Genet. 2020:16(8):e1008895.; Fan C, Mancuso N, Chiang CWK. A genealogical estimate of genetic relationships. Am J Hum Genet. 2022:109(5):812-824.; Fan C, Cahoon JL, Dinh BL, Ortega-Del Vecchyo D, Huber C, Edge MD, Mancuso N, Chiang CWK. A likelihood-based framework for demographic inference from genealogical trees. bioRxiv. 2023.10.10.561787. 2023.; Hejase HA, Mo Z, Campagna L, Siepel A. A deep-learning approach for inference of selective sweeps from the ancestral recombination graph. Mol Biol Evol. 2022:39(1):msab332.; Link V, Schraiber JG, Fan C, Dinh B, Mancuso N, Chiang CWK, Edge MD. Tree-based QTL mapping with expected local genetic relatedness matrices. bioRxiv. 2023.04.07.536093. 2023.; Zhang BC, Biddanda A, Gunnarsson ÁF, Cooper F, Palamara PF. Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits. Nat Genet. 2023:55(5):768-776.). As such, there has been significant interest in the field to investigate 2 main problems related to the ARG: (i) How can we estimate the ARG based on genomic data, and (ii) how can we extract information of past evolutionary processes from the ARG? In this perspective, we highlight 3 topics that pertain to these main issues: The development of computational innovations that enable the estimation of the ARG; remaining challenges in estimating the ARG; and methodological advances for deducing evolutionary forces and mechanisms using the ARG. This perspective serves to introduce the readers to the types of questions that can be explored using the ARG and to highlight some of the most pressing issues that must be addressed in order to make ARG-based inference an indispensable tool for evolutionary research.
Collapse
Affiliation(s)
- Débora Y C Brandt
- Department of Genetics Evolution and Environment, University College London, London, UK
| | - Christian D Huber
- Department of Biology, Pennsylvania State University, University Park, PA, USA
| | - Charleston W K Chiang
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Diego Ortega-Del Vecchyo
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma De México, Querétaro, Querétaro, Mexico
| |
Collapse
|
19
|
Peyrégne S, Slon V, Kelso J. More than a decade of genetic research on the Denisovans. Nat Rev Genet 2024; 25:83-103. [PMID: 37723347 DOI: 10.1038/s41576-023-00643-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/19/2023] [Indexed: 09/20/2023]
Abstract
Denisovans, a group of now extinct humans who lived in Eastern Eurasia in the Middle and Late Pleistocene, were first identified from DNA sequences just over a decade ago. Only ten fragmentary remains from two sites have been attributed to Denisovans based entirely on molecular information. Nevertheless, there has been great interest in using genetic data to understand Denisovans and their place in human history. From the reconstruction of a single high-quality genome, it has been possible to infer their population history, including events of admixture with other human groups. Additionally, the identification of Denisovan DNA in the genomes of present-day individuals has provided insights into the timing and routes of dispersal of ancient modern humans into Asia and Oceania, as well as the contributions of archaic DNA to the physiology of present-day people. In this Review, we synthesize more than a decade of research on Denisovans, reconcile controversies and summarize insights into their population history and phenotype. We also highlight how our growing knowledge about Denisovans has provided insights into our own evolutionary history.
Collapse
Affiliation(s)
- Stéphane Peyrégne
- Department of Evolutionary Genetics, Max-Planck-Institute for Evolutionary Anthropology, Leipzig, Germany.
| | - Viviane Slon
- Department of Evolutionary Genetics, Max-Planck-Institute for Evolutionary Anthropology, Leipzig, Germany
- Department of Anatomy and Anthropology, Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Department of Human Molecular Genetics and Biochemistry, Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- The Dan David Center for Human Evolution and Biohistory Research, Tel Aviv University, Tel Aviv, Israel
| | - Janet Kelso
- Department of Evolutionary Genetics, Max-Planck-Institute for Evolutionary Anthropology, Leipzig, Germany.
| |
Collapse
|
20
|
Lewanski AL, Grundler MC, Bradburd GS. The era of the ARG: An introduction to ancestral recombination graphs and their significance in empirical evolutionary genomics. PLoS Genet 2024; 20:e1011110. [PMID: 38236805 PMCID: PMC10796009 DOI: 10.1371/journal.pgen.1011110] [Citation(s) in RCA: 26] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2024] Open
Abstract
In the presence of recombination, the evolutionary relationships between a set of sampled genomes cannot be described by a single genealogical tree. Instead, the genomes are related by a complex, interwoven collection of genealogies formalized in a structure called an ancestral recombination graph (ARG). An ARG extensively encodes the ancestry of the genome(s) and thus is replete with valuable information for addressing diverse questions in evolutionary biology. Despite its potential utility, technological and methodological limitations, along with a lack of approachable literature, have severely restricted awareness and application of ARGs in evolution research. Excitingly, recent progress in ARG reconstruction and simulation have made ARG-based approaches feasible for many questions and systems. In this review, we provide an accessible introduction and exploration of ARGs, survey recent methodological breakthroughs, and describe the potential for ARGs to further existing goals and open avenues of inquiry that were previously inaccessible in evolutionary genomics. Through this discussion, we aim to more widely disseminate the promise of ARGs in evolutionary genomics and encourage the broader development and adoption of ARG-based inference.
Collapse
Affiliation(s)
- Alexander L. Lewanski
- Department of Integrative Biology, Michigan State University, East Lansing, Michigan, United States of America
- W.K. Kellogg Biological Station, Michigan State University, Hickory Corners, Michigan, United States of America
- Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, Michigan, United States of America
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Michael C. Grundler
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Gideon S. Bradburd
- W.K. Kellogg Biological Station, Michigan State University, Hickory Corners, Michigan, United States of America
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan, United States of America
| |
Collapse
|
21
|
Wroblewski TH, Witt KE, Lee SB, Malhi RS, Peede D, Huerta-Sánchez E, Villanea FA, Claw KG. Pharmacogenetic Variation in Neanderthals and Denisovans and Implications for Human Health and Response to Medications. Genome Biol Evol 2023; 15:evad222. [PMID: 38051947 PMCID: PMC10727477 DOI: 10.1093/gbe/evad222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 11/08/2023] [Accepted: 11/16/2023] [Indexed: 12/07/2023] Open
Abstract
Modern humans carry both Neanderthal and Denisovan (archaic) genome elements that are part of the human gene pool and affect the life and health of living individuals. The impact of archaic DNA may be particularly evident in pharmacogenes-genes responsible for the processing of exogenous substances such as food, pollutants, and medications-as these can relate to changing environmental effects, and beneficial variants may have been retained as modern humans encountered new environments. However, the health implications and contribution of archaic ancestry in pharmacogenes of modern humans remain understudied. Here, we explore 11 key cytochrome P450 genes (CYP450) involved in 75% of all drug metabolizing reactions in three Neanderthal and one Denisovan individuals and examine archaic introgression in modern human populations. We infer the metabolizing efficiency of these 11 CYP450 genes in archaic individuals and find important predicted phenotypic differences relative to modern human variants. We identify several single nucleotide variants shared between archaic and modern humans in each gene, including some potentially function-altering mutations in archaic CYP450 genes, which may result in altered metabolism in living people carrying these variants. We also identified several variants in the archaic CYP450 genes that are novel and unique to archaic humans as well as one gene, CYP2B6, that shows evidence for a gene duplication found only in Neanderthals and modern Africans. Finally, we highlight CYP2A6, CYP2C9, and CYP2J2, genes which show evidence for archaic introgression into modern humans and posit evolutionary hypotheses that explain their allele frequencies in modern populations.
Collapse
Affiliation(s)
- Tadeusz H Wroblewski
- Department of Biomedical Informatics, Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA
| | - Kelsey E Witt
- Center for Human Genetics and Department of Genetics and Biochemistry, Clemson University, South Carolina, USA
| | - Seung-been Lee
- Precision Medicine Institute, Macrogen Inc., Seoul, Republic of Korea
| | - Ripan S Malhi
- Department of Anthropology and Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Illinois, USA
| | - David Peede
- Department of Ecology, Evolution, and Organismal Biology and Center for Computational and Molecular Biology, Brown University, Providence, Rhode Island, USA
- Institute at Brown for Environment and Society, Brown University, Providence, Rhode Island, USA
| | - Emilia Huerta-Sánchez
- Department of Ecology, Evolution, and Organismal Biology and Center for Computational and Molecular Biology, Brown University, Providence, Rhode Island, USA
| | | | - Katrina G Claw
- Department of Biomedical Informatics, Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA
| |
Collapse
|
22
|
Ragsdale AP. Human evolution: Neanderthal footprints in African genomes. Curr Biol 2023; 33:R1197-R1200. [PMID: 37989099 DOI: 10.1016/j.cub.2023.10.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2023]
Abstract
Human and Neanderthal populations met and mixed on multiple occasions over evolutionary time, resulting in the exchange of genetic material. New genomic analyses of diverse African populations reveal a history of bidirectional gene flow and selection acting on introgressed alleles.
Collapse
Affiliation(s)
- Aaron P Ragsdale
- Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI 53706, USA.
| |
Collapse
|
23
|
Harris DN, Platt A, Hansen MEB, Fan S, McQuillan MA, Nyambo T, Mpoloka SW, Mokone GG, Belay G, Fokunang C, Njamnshi AK, Tishkoff SA. Diverse African genomes reveal selection on ancient modern human introgressions in Neanderthals. Curr Biol 2023; 33:4905-4916.e5. [PMID: 37837965 PMCID: PMC10841429 DOI: 10.1016/j.cub.2023.09.066] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Revised: 07/18/2023] [Accepted: 09/26/2023] [Indexed: 10/16/2023]
Abstract
Comparisons of Neanderthal genomes to anatomically modern human (AMH) genomes show a history of Neanderthal-to-AMH introgression stemming from interbreeding after the migration of AMHs from Africa to Eurasia. All non-sub-Saharan African AMHs have genomic regions genetically similar to Neanderthals that descend from this introgression. Regions of the genome with Neanderthal similarities have also been identified in sub-Saharan African populations, but their origins have been unclear. To better understand how these regions are distributed across sub-Saharan Africa, the source of their origin, and what their distribution within the genome tells us about early AMH and Neanderthal evolution, we analyzed a dataset of high-coverage, whole-genome sequences from 180 individuals from 12 diverse sub-Saharan African populations. In sub-Saharan African populations with non-sub-Saharan African ancestry, as much as 1% of their genomes can be attributed to Neanderthal sequence introduced by recent migration, and subsequent admixture, of AMH populations originating from the Levant and North Africa. However, most Neanderthal homologous regions in sub-Saharan African populations originate from migration of AMH populations from Africa to Eurasia ∼250 kya, and subsequent admixture with Neanderthals, resulting in ∼6% AMH ancestry in Neanderthals. These results indicate that there have been multiple migration events of AMHs out of Africa and that Neanderthal and AMH gene flow has been bi-directional. Observing that genomic regions where AMHs show a depletion of Neanderthal introgression are also regions where Neanderthal genomes show a depletion of AMH introgression points to deleterious interactions between introgressed variants and background genomes in both groups-a hallmark of incipient speciation.
Collapse
Affiliation(s)
- Daniel N Harris
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Alexander Platt
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Matthew E B Hansen
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Shaohua Fan
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, School of Life Science, Fudan University, Shanghai 200438, China
| | - Michael A McQuillan
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Thomas Nyambo
- Department of Biochemistry and Molecular Biology, Hubert Kairuki Memorial University, Dar es Salaam, Tanzania
| | - Sununguko Wata Mpoloka
- Department of Biological Sciences, Faculty of Science, University of Botswana, Private Bag UB 0022, Gaborone, Botswana
| | - Gaonyadiwe George Mokone
- Department of Biomedical Sciences, Faculty of Medicine, University of Botswana, Private Bag UB 0022, Gaborone, Botswana
| | - Gurja Belay
- Department of Microbial Cellular and Molecular Biology, Addis Ababa University, P.O. Box 1176, Addis Ababa, Ethiopia
| | - Charles Fokunang
- Department of Pharmacotoxicology and Pharmacokinetics, Faculty of Medicine and Biomedical Sciences, The University of Yaoundé I, P.O. Box 337, Yaoundé, Cameroon
| | - Alfred K Njamnshi
- Brain Research Africa Initiative (BRAIN), P.O. Box 25625, Yaoundé, Cameroon; Neuroscience Lab, Faculty of Medicine and Biomedical Sciences, The University of Yaoundé I, Yaoundé, Cameroon
| | - Sarah A Tishkoff
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA.
| |
Collapse
|
24
|
Rasmussen DA, Guo F. Espalier: Efficient Tree Reconciliation and Ancestral Recombination Graphs Reconstruction Using Maximum Agreement Forests. Syst Biol 2023; 72:1154-1170. [PMID: 37458753 PMCID: PMC10627558 DOI: 10.1093/sysbio/syad040] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Revised: 06/26/2023] [Accepted: 06/30/2023] [Indexed: 11/08/2023] Open
Abstract
In the presence of recombination individuals may inherit different regions of their genome from different ancestors, resulting in a mosaic of phylogenetic histories across their genome. Ancestral recombination graphs (ARGs) can capture how phylogenetic relationships vary across the genome due to recombination, but reconstructing ARGs from genomic sequence data is notoriously difficult. Here, we present a method for reconciling discordant phylogenetic trees and reconstructing ARGs using maximum agreement forests (MAFs). Given two discordant trees, a MAF identifies the smallest possible set of topologically concordant subtrees present in both trees. We show how discordant trees can be reconciled through their MAF in a way that retains discordances strongly supported by sequence data while eliminating conflicts likely attributable to phylogenetic noise. We further show how MAFs and our reconciliation approach can be combined to select a path of local trees across the genome that maximizes the likelihood of the genomic sequence data, minimizes discordance between neighboring local trees, and identifies the recombination events necessary to explain remaining discordances to obtain a fully connected ARG. While heuristic, our ARG reconstruction approach is often as accurate as more exact methods while being much more computationally efficient. Moreover, important demographic parameters such as recombination rates can be accurately estimated from reconstructed ARGs. Finally, we apply our approach to plant infecting RNA viruses in the genus Potyvirus to demonstrate how true recombination events can be disentangled from phylogenetic noise using our ARG reconstruction methods.
Collapse
Affiliation(s)
- David A Rasmussen
- Department of Entomology and Plant Pathology, North Carolina State University, Campus Box 7613, Raleigh, NC 27695, USA
- Bioinformatics Research Center, North Carolina State University, Campus Box 7566, Raleigh, NC 27695, USA
| | - Fangfang Guo
- Department of Entomology and Plant Pathology, North Carolina State University, Campus Box 7613, Raleigh, NC 27695, USA
| |
Collapse
|
25
|
Lewanski AL, Grundler MC, Bradburd GS. The era of the ARG: an empiricist's guide to ancestral recombination graphs. ARXIV 2023:arXiv:2310.12070v1. [PMID: 37904740 PMCID: PMC10614969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 11/01/2023]
Abstract
In the presence of recombination, the evolutionary relationships between a set of sampled genomes cannot be described by a single genealogical tree. Instead, the genomes are related by a complex, interwoven collection of genealogies formalized in a structure called an ancestral recombination graph (ARG). An ARG extensively encodes the ancestry of the genome(s) and thus is replete with valuable information for addressing diverse questions in evolutionary biology. Despite its potential utility, technological and methodological limitations, along with a lack of approachable literature, have severely restricted awareness and application of ARGs in empirical evolution research. Excitingly, recent progress in ARG reconstruction and simulation have made ARG-based approaches feasible for many questions and systems. In this review, we provide an accessible introduction and exploration of ARGs, survey recent methodological breakthroughs, and describe the potential for ARGs to further existing goals and open avenues of inquiry that were previously inaccessible in evolutionary genomics. Through this discussion, we aim to more widely disseminate the promise of ARGs in evolutionary genomics and encourage the broader development and adoption of ARG-based inference.
Collapse
Affiliation(s)
- Alexander L Lewanski
- Department of Integrative Biology, Michigan State University, East Lansing, MI, US
- W.K. Kellogg Biological Station, Michigan State University, Hickory Corners, MI, US
- Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, MI, US
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, US
| | - Michael C Grundler
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, US
| | - Gideon S Bradburd
- W.K. Kellogg Biological Station, Michigan State University, Hickory Corners, MI, US
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, US
| |
Collapse
|
26
|
Roca-Umbert A, Garcia-Calleja J, Vogel-González M, Fierro-Villegas A, Ill-Raga G, Herrera-Fernández V, Bosnjak A, Muntané G, Gutiérrez E, Campelo F, Vicente R, Bosch E. Human genetic adaptation related to cellular zinc homeostasis. PLoS Genet 2023; 19:e1010950. [PMID: 37747921 PMCID: PMC10553801 DOI: 10.1371/journal.pgen.1010950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 10/05/2023] [Accepted: 08/31/2023] [Indexed: 09/27/2023] Open
Abstract
SLC30A9 encodes a ubiquitously zinc transporter (ZnT9) and has been consistently suggested as a candidate for positive selection in humans. However, no direct adaptive molecular phenotype has been demonstrated. Our results provide evidence for directional selection operating in two major complementary haplotypes in Africa and East Asia. These haplotypes are associated with differential gene expression but also differ in the Met50Val substitution (rs1047626) in ZnT9, which we show is found in homozygosis in the Denisovan genome and displays accompanying signatures suggestive of archaic introgression. Although we found no significant differences in systemic zinc content between individuals with different rs1047626 genotypes, we demonstrate that the expression of the derived isoform (ZnT9 50Val) in HEK293 cells shows a gain of function when compared with the ancestral (ZnT9 50Met) variant. Notably, the ZnT9 50Val variant was found associated with differences in zinc handling by the mitochondria and endoplasmic reticulum, with an impact on mitochondrial metabolism. Given the essential role of the mitochondria in skeletal muscle and since the derived allele at rs1047626 is known to be associated with greater susceptibility to several neuropsychiatric traits, we propose that adaptation to cold may have driven this selection event, while also impacting predisposition to neuropsychiatric disorders in modern humans.
Collapse
Affiliation(s)
- Ana Roca-Umbert
- Institut de Biologia Evolutiva (UPF-CSIC), Departament de Medicina i Ciències de la Vida, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, Barcelona, Spain
| | - Jorge Garcia-Calleja
- Institut de Biologia Evolutiva (UPF-CSIC), Departament de Medicina i Ciències de la Vida, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, Barcelona, Spain
| | - Marina Vogel-González
- Laboratory of Molecular Physiology, Department of Medicine and Life Sciences (MELIS), Universitat Pompeu Fabra, Barcelona, Spain
| | - Alejandro Fierro-Villegas
- Laboratory of Molecular Physiology, Department of Medicine and Life Sciences (MELIS), Universitat Pompeu Fabra, Barcelona, Spain
| | - Gerard Ill-Raga
- Laboratory of Molecular Physiology, Department of Medicine and Life Sciences (MELIS), Universitat Pompeu Fabra, Barcelona, Spain
| | - Víctor Herrera-Fernández
- Laboratory of Molecular Physiology, Department of Medicine and Life Sciences (MELIS), Universitat Pompeu Fabra, Barcelona, Spain
| | - Anja Bosnjak
- Laboratory of Molecular Physiology, Department of Medicine and Life Sciences (MELIS), Universitat Pompeu Fabra, Barcelona, Spain
| | - Gerard Muntané
- Institut de Biologia Evolutiva (UPF-CSIC), Departament de Medicina i Ciències de la Vida, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, Barcelona, Spain
- Hospital Universitari Institut Pere Mata, IISPV, Universitat Rovira i Virgili, Reus, Spain
- Centro de Investigación Biomédica en Red de Salud Mental, Instituto de Salud Carlos III, Madrid, Spain
| | - Esteban Gutiérrez
- Laboratory of Molecular Physiology, Department of Medicine and Life Sciences (MELIS), Universitat Pompeu Fabra, Barcelona, Spain
| | - Felix Campelo
- ICFO-Institut de Ciencies Fotoniques, The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Rubén Vicente
- Laboratory of Molecular Physiology, Department of Medicine and Life Sciences (MELIS), Universitat Pompeu Fabra, Barcelona, Spain
| | - Elena Bosch
- Institut de Biologia Evolutiva (UPF-CSIC), Departament de Medicina i Ciències de la Vida, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, Barcelona, Spain
- Centro de Investigación Biomédica en Red de Salud Mental, Instituto de Salud Carlos III, Madrid, Spain
| |
Collapse
|
27
|
Flegontov P, Işıldak U, Maier R, Yüncü E, Changmai P, Reich D. Modeling of African population history using f-statistics is biased when applying all previously proposed SNP ascertainment schemes. PLoS Genet 2023; 19:e1010931. [PMID: 37676865 PMCID: PMC10508636 DOI: 10.1371/journal.pgen.1010931] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2023] [Revised: 09/19/2023] [Accepted: 08/21/2023] [Indexed: 09/09/2023] Open
Abstract
f-statistics have emerged as a first line of analysis for making inferences about demographic history from genome-wide data. Not only are they guaranteed to allow robust tests of the fits of proposed models of population history to data when analyzing full genome sequencing data-that is, all single nucleotide polymorphisms (SNPs) in the individuals being analyzed-but they are also guaranteed to allow robust tests of models for SNPs ascertained as polymorphic in a population that is an outgroup in a phylogenetic sense to all groups being analyzed. True "outgroup ascertainment" is in practice impossible in humans because our species has arisen from a substructured ancestral population that does not descend from a homogeneous ancestral population going back many hundreds of thousands of years into the past. However, initial studies suggested that non-outgroup-ascertainment schemes might produce robust enough results using f-statistics, and that motivated widespread fitting of models to data using non-outgroup-ascertained SNP panels such as the "Affymetrix Human Origins array" which has been genotyped on thousands of modern individuals from hundreds of populations, or the "1240k" in-solution enrichment reagent which has been the source of about 70% of published genome-wide data for ancient humans. In this study, we show that while analyses of population history using such panels work well for studies of relationships among non-African populations and one African outgroup, when co-modeling more than one sub-Saharan African and/or archaic human groups (Neanderthals and Denisovans), fitting of f-statistics to such SNP sets is expected to frequently lead to false rejection of true demographic histories, and failure to reject incorrect models. Analyzing panels of SNPs polymorphic in archaic humans, which has been suggested as a solution for the ascertainment problem, has limited statistical power and retains important biases. However, by carrying out simulations of diverse demographic histories, we show that bias in inferences based on f-statistics can be minimized by ascertaining on variants common in a union of diverse African groups; such ascertainment retains high statistical power while allowing co-analysis of archaic and modern groups.
Collapse
Affiliation(s)
- Pavel Flegontov
- Department of Human Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia
- Kalmyk Research Center of the Russian Academy of Sciences, Elista, Russia
| | - Ulaş Işıldak
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia
| | - Robert Maier
- Department of Human Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Eren Yüncü
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia
| | - Piya Changmai
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia
| | - David Reich
- Department of Human Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
- Howard Hughes Medical Institute, Harvard Medical School, Boston, Massachusetts, United States of America
- Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
| |
Collapse
|
28
|
Salehi Nowbandegani P, Wohns AW, Ballard JL, Lander ES, Bloemendal A, Neale BM, O'Connor LJ. Extremely sparse models of linkage disequilibrium in ancestrally diverse association studies. Nat Genet 2023; 55:1494-1502. [PMID: 37640881 DOI: 10.1038/s41588-023-01487-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Accepted: 07/24/2023] [Indexed: 08/31/2023]
Abstract
Linkage disequilibrium (LD) is the correlation among nearby genetic variants. In genetic association studies, LD is often modeled using large correlation matrices, but this approach is inefficient, especially in ancestrally diverse studies. In the present study, we introduce LD graphical models (LDGMs), which are an extremely sparse and efficient representation of LD. LDGMs are derived from genome-wide genealogies; statistical relationships among alleles in the LDGM correspond to genealogical relationships among haplotypes. We published LDGMs and ancestry-specific LDGM precision matrices for 18 million common variants (minor allele frequency >1%) in five ancestry groups, validated their accuracy and demonstrated order-of-magnitude improvements in runtime for commonly used LD matrix computations. We implemented an extremely fast multiancestry polygenic prediction method, BLUPx-ldgm, which performs better than a similar method based on the reference LD correlation matrix. LDGMs will enable sophisticated methods that scale to ancestrally diverse genetic association data across millions of variants and individuals.
Collapse
Affiliation(s)
- Pouria Salehi Nowbandegani
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| | - Anthony Wilder Wohns
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
- Stanford University School of Medicine, Stanford, CA, USA.
| | - Jenna L Ballard
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Graduate Group in Genomics and Computational Biology, University of Pennsylvania, Philadelphia, PA, USA
| | - Eric S Lander
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Biology, MIT, Cambridge, MA, USA
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Alex Bloemendal
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- The Novo Nordisk Foundation Center for Genomic Mechanisms of Disease, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Benjamin M Neale
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- The Novo Nordisk Foundation Center for Genomic Mechanisms of Disease, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Luke J O'Connor
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
29
|
Ragsdale AP, Thornton KR. Multiple Sources of Uncertainty Confound Inference of Historical Human Generation Times. Mol Biol Evol 2023; 40:msad160. [PMID: 37450583 PMCID: PMC10404577 DOI: 10.1093/molbev/msad160] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 07/05/2023] [Accepted: 07/07/2023] [Indexed: 07/18/2023] Open
Abstract
Wang et al. (2023) recently proposed an approach to infer the history of human generation intervals from changes in mutation profiles over time. As the relative proportions of different mutation types depend on the ages of parents, binning variants by the time they arose allows for the inference of changes in average paternal and maternal generation intervals. Applying this approach to published allele age estimates, Wang et al. (2023) inferred long-lasting sex differences in average generation times and surprisingly found that ancestral generation times of West African populations remained substantially higher than those of Eurasian populations extending tens of thousands of generations into the past. Here, we argue that the results and interpretations in Wang et al. (2023) are primarily driven by noise and biases in input data and a lack of validation using independent approaches for estimating allele ages. With the recent development of methods to reconstruct genome-wide gene genealogies, coalescence times, and allele ages, we caution that downstream analyses may be strongly influenced by uncharacterized biases in their output.
Collapse
Affiliation(s)
- Aaron P Ragsdale
- Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI, USA
| | - Kevin R Thornton
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA, USA
| |
Collapse
|
30
|
Chevy ET, Huerta-Sánchez E, Ramachandran S. Integrating sex-bias into studies of archaic introgression on chromosome X. PLoS Genet 2023; 19:e1010399. [PMID: 37578977 PMCID: PMC10449224 DOI: 10.1371/journal.pgen.1010399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 08/24/2023] [Accepted: 07/10/2023] [Indexed: 08/16/2023] Open
Abstract
Evidence of interbreeding between archaic hominins and humans comes from methods that infer the locations of segments of archaic haplotypes, or 'archaic coverage' using the genomes of people living today. As more estimates of archaic coverage have emerged, it has become clear that most of this coverage is found on the autosomes- very little is retained on chromosome X. Here, we summarize published estimates of archaic coverage on autosomes and chromosome X from extant human samples. We find on average 7 times more archaic coverage on autosomes than chromosome X, and identify broad continental patterns in this ratio: greatest in European samples, and least in South Asian samples. We also perform extensive simulation studies to investigate how the amount of archaic coverage, lengths of coverage, and rates of purging of archaic coverage are affected by sex-bias caused by an unequal sex ratio within the archaic introgressors. Our results generally confirm that, with increasing male sex-bias, less archaic coverage is retained on chromosome X. Ours is the first study to explicitly model such sex-bias and its potential role in creating the dearth of archaic coverage on chromosome X.
Collapse
Affiliation(s)
- Elizabeth T. Chevy
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, United States of America
| | - Emilia Huerta-Sánchez
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, United States of America
- Department of Ecology, Evolution, and Organismal Biology, Brown University, Providence, Rhode Island, United States of America
| | - Sohini Ramachandran
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, United States of America
- Department of Ecology, Evolution, and Organismal Biology, Brown University, Providence, Rhode Island, United States of America
- Data Science Initiative, Brown University, Providence, Rhode Island, United States of America
| |
Collapse
|
31
|
Skov L, Coll Macià M, Lucotte EA, Cavassim MIA, Castellano D, Schierup MH, Munch K. Extraordinary selection on the human X chromosome associated with archaic admixture. CELL GENOMICS 2023; 3:100274. [PMID: 36950386 PMCID: PMC10025451 DOI: 10.1016/j.xgen.2023.100274] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 09/15/2022] [Accepted: 01/26/2023] [Indexed: 03/04/2023]
Abstract
The X chromosome in non-African humans shows less diversity and less Neanderthal introgression than expected from neutral evolution. Analyzing 162 human male X chromosomes worldwide, we identified fourteen chromosomal regions where nearly identical haplotypes spanning several hundred kilobases are found at high frequencies in non-Africans. Genetic drift alone cannot explain the existence of these haplotypes, which must have been associated with strong positive selection in partial selective sweeps. Moreover, the swept haplotypes are entirely devoid of archaic ancestry as opposed to the non-swept haplotypes in the same genomic regions. The ancient Ust'-Ishim male dated at 45,000 before the present (BP) also carries the swept haplotypes, implying that selection on the haplotypes must have occurred between 45,000 and 55,000 years ago. Finally, we find that the chromosomal positions of sweeps overlap previously reported hotspots of selective sweeps in great ape evolution, suggesting a mechanism of selection unique to X chromosomes.
Collapse
Affiliation(s)
- Laurits Skov
- Department of Molecular and Cell Biology, University of California, Berkeley, CA 94720-5800, USA
| | - Moisès Coll Macià
- Bioinformatics Research Centre, Aarhus University, 8000 Aarhus, Denmark
| | - Elise Anne Lucotte
- Ecologie Systématique Evolution, Univ. Paris-Sud, AgroParisTech, CNRS, Université Paris-Saclay, Gif-sur-Yvette, France
| | | | - David Castellano
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | | | - Kasper Munch
- Bioinformatics Research Centre, Aarhus University, 8000 Aarhus, Denmark
- Corresponding author
| |
Collapse
|
32
|
Shipilina D, Pal A, Stankowski S, Chan YF, Barton NH. On the origin and structure of haplotype blocks. Mol Ecol 2023; 32:1441-1457. [PMID: 36433653 PMCID: PMC10946714 DOI: 10.1111/mec.16793] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Revised: 11/16/2022] [Accepted: 11/18/2022] [Indexed: 11/27/2022]
Abstract
The term "haplotype block" is commonly used in the developing field of haplotype-based inference methods. We argue that the term should be defined based on the structure of the Ancestral Recombination Graph (ARG), which contains complete information on the ancestry of a sample. We use simulated examples to demonstrate key features of the relationship between haplotype blocks and ancestral structure, emphasizing the stochasticity of the processes that generate them. Even the simplest cases of neutrality or of a "hard" selective sweep produce a rich structure, often missed by commonly used statistics. We highlight a number of novel methods for inferring haplotype structure, based on the full ARG, or on a sequence of trees, and illustrate how they can be used to define haplotype blocks using an empirical data set. While the advent of new, computationally efficient methods makes it possible to apply these concepts broadly, they (and additional new methods) could benefit from adding features to explore haplotype blocks, as we define them. Understanding and applying the concept of the haplotype block will be essential to fully exploit long and linked-read sequencing technologies.
Collapse
Affiliation(s)
- Daria Shipilina
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG)Uppsala UniversityUppsalaSweden
- Institute of Science and Technology AustriaKlosterneuburgAustria
- Swedish Collegium for Advanced StudyUppsalaSweden
| | - Arka Pal
- Institute of Science and Technology AustriaKlosterneuburgAustria
| | - Sean Stankowski
- Institute of Science and Technology AustriaKlosterneuburgAustria
| | | | | |
Collapse
|
33
|
Flegontov P, Işıldak U, Maier R, Yüncü E, Changmai P, Reich D. Modeling of African population history using f -statistics can be highly biased and is not addressed by previously suggested SNP ascertainment schemes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.22.525077. [PMID: 36711923 PMCID: PMC9882349 DOI: 10.1101/2023.01.22.525077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
f -statistics have emerged as a first line of analysis for making inferences about demographic history from genome-wide data. These statistics can provide strong evidence for either admixture or cladality, which can be robust to substantial rates of errors or missing data. f -statistics are guaranteed to be unbiased under "SNP ascertainment" (analyzing non-randomly chosen subsets of single nucleotide polymorphisms) only if it relies on a population that is an outgroup for all groups analyzed. However, ascertainment on a true outgroup that is not co-analyzed with other populations is often impractical and uncommon in the literature. In this study focused on practical rather than theoretical aspects of SNP ascertainment, we show that many non-outgroup ascertainment schemes lead to false rejection of true demographic histories, as well as to failure to reject incorrect models. But the bias introduced by common ascertainments such as the 1240K panel is mostly limited to situations when more than one sub-Saharan African and/or archaic human groups (Neanderthals and Denisovans) or non-human outgroups are co-modelled, for example, f 4 -statistics involving one non-African group, two African groups, and one archaic group. Analyzing panels of SNPs polymorphic in archaic humans, which has been suggested as a solution for the ascertainment problem, cannot fix all these problems since for some classes of f -statistics it is not a clean outgroup ascertainment, and in other cases it demonstrates relatively low power to reject incorrect demographic models since it provides a relatively small number of variants common in anatomically modern humans. And due to the paucity of high-coverage archaic genomes, archaic individuals used for ascertainment often act as sole representatives of the respective groups in an analysis, and we show that this approach is highly problematic. By carrying out large numbers of simulations of diverse demographic histories, we find that bias in inferences based on f -statistics introduced by non-outgroup ascertainment can be minimized if the derived allele frequency spectrum in the population used for ascertainment approaches the spectrum that existed at the root of all groups being co-analyzed. Ascertaining on sites with variants common in a diverse group of African individuals provides a good approximation to such a set of SNPs, addressing the great majority of biases and also retaining high statistical power for studying population history. Such a "pan-African" ascertainment, although not completely problem-free, allows unbiased exploration of demographic models for the widest set of archaic and modern human populations, as compared to the other ascertainment schemes we explored.
Collapse
Affiliation(s)
- Pavel Flegontov
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia
- Kalmyk Research Center of the Russian Academy of Sciences, Elista, Russia
| | - Ulaş Işıldak
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia
| | - Robert Maier
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Eren Yüncü
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia
| | - Piya Changmai
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia
| | - David Reich
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| |
Collapse
|
34
|
Reilly PF, Tjahjadi A, Miller SL, Akey JM, Tucci S. The contribution of Neanderthal introgression to modern human traits. Curr Biol 2022; 32:R970-R983. [PMID: 36167050 PMCID: PMC9741939 DOI: 10.1016/j.cub.2022.08.027] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Neanderthals, our closest extinct relatives, lived in western Eurasia from 400,000 years ago until they went extinct around 40,000 years ago. DNA retrieved from ancient specimens revealed that Neanderthals mated with modern human contemporaries. As a consequence, introgressed Neanderthal DNA survives scattered across the human genome such that 1-4% of the genome of present-day people outside Africa are inherited from Neanderthal ancestors. Patterns of Neanderthal introgressed genomic sequences suggest that Neanderthal alleles had distinct fates in the modern human genetic background. Some Neanderthal alleles facilitated human adaptation to new environments such as novel climate conditions, UV exposure levels and pathogens, while others had deleterious consequences. Here, we review the body of work on Neanderthal introgression over the past decade. We describe how evolutionary forces shaped the genomic landscape of Neanderthal introgression and highlight the impact of introgressed alleles on human biology and phenotypic variation.
Collapse
Affiliation(s)
| | - Audrey Tjahjadi
- Department of Anthropology, Yale University, New Haven, CT, USA
| | | | - Joshua M Akey
- Lewis Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA.
| | - Serena Tucci
- Department of Anthropology, Yale University, New Haven, CT, USA; Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA.
| |
Collapse
|
35
|
Guo F, Carbone I, Rasmussen DA. Recombination-aware phylogeographic inference using the structured coalescent with ancestral recombination. PLoS Comput Biol 2022; 18:e1010422. [PMID: 35984849 PMCID: PMC9447913 DOI: 10.1371/journal.pcbi.1010422] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 09/06/2022] [Accepted: 07/21/2022] [Indexed: 11/19/2022] Open
Abstract
Movement of individuals between populations or demes is often restricted, especially between geographically isolated populations. The structured coalescent provides an elegant theoretical framework for describing how movement between populations shapes the genealogical history of sampled individuals and thereby structures genetic variation within and between populations. However, in the presence of recombination an individual may inherit different regions of their genome from different parents, resulting in a mosaic of genealogical histories across the genome, which can be represented by an Ancestral Recombination Graph (ARG). In this case, different genomic regions may have different ancestral histories and so different histories of movement between populations. Recombination therefore poses an additional challenge to phylogeographic methods that aim to reconstruct the movement of individuals from genealogies, although also a potential benefit in that different loci may contain additional information about movement. Here, we introduce the Structured Coalescent with Ancestral Recombination (SCAR) model, which builds on recent approximations to the structured coalescent by incorporating recombination into the ancestry of sampled individuals. The SCAR model allows us to infer how the migration history of sampled individuals varies across the genome from ARGs, and improves estimation of key population genetic parameters such as population sizes, recombination rates and migration rates. Using the SCAR model, we explore the potential and limitations of phylogeographic inference using full ARGs. We then apply the SCAR to lineages of the recombining fungus Aspergillus flavus sampled across the United States to explore patterns of recombination and migration across the genome.
Collapse
Affiliation(s)
- Fangfang Guo
- Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Ignazio Carbone
- Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, North Carolina, United States of America
- Center for Integrated Fungal Research, North Carolina State University, Raleigh, North Carolina, United States of America
| | - David A. Rasmussen
- Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, North Carolina, United States of America
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, United States of America
| |
Collapse
|
36
|
Peyrégne S, Kelso J, Peter BM, Pääbo S. The evolutionary history of human spindle genes includes back-and-forth gene flow with Neandertals. eLife 2022; 11:e75464. [PMID: 35816093 PMCID: PMC9273211 DOI: 10.7554/elife.75464] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Accepted: 06/14/2022] [Indexed: 12/13/2022] Open
Abstract
Proteins associated with the spindle apparatus, a cytoskeletal structure that ensures the proper segregation of chromosomes during cell division, experienced an unusual number of amino acid substitutions in modern humans after the split from the ancestors of Neandertals and Denisovans. Here, we analyze the history of these substitutions and show that some of the genes in which they occur may have been targets of positive selection. We also find that the two changes in the kinetochore scaffold 1 (KNL1) protein, previously believed to be specific to modern humans, were present in some Neandertals. We show that the KNL1 gene of these Neandertals shared a common ancestor with present-day Africans about 200,000 years ago due to gene flow from the ancestors (or relatives) of modern humans into Neandertals. Subsequently, some non-Africans inherited this modern human-like gene variant from Neandertals, but none inherited the ancestral gene variants. These results add to the growing evidence of early contacts between modern humans and archaic groups in Eurasia and illustrate the intricate relationships among these groups.
Collapse
Affiliation(s)
- Stéphane Peyrégne
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary AnthropologyLeipzigGermany
| | - Janet Kelso
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary AnthropologyLeipzigGermany
| | - Benjamin M Peter
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary AnthropologyLeipzigGermany
| | - Svante Pääbo
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary AnthropologyLeipzigGermany
| |
Collapse
|
37
|
Andirkó A, Moriano J, Vitriolo A, Kuhlwilm M, Testa G, Boeckx C. Temporal mapping of derived high-frequency gene variants supports the mosaic nature of the evolution of Homo sapiens. Sci Rep 2022; 12:9937. [PMID: 35705575 PMCID: PMC9200848 DOI: 10.1038/s41598-022-13589-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Accepted: 05/25/2022] [Indexed: 11/25/2022] Open
Abstract
Large-scale estimations of the time of emergence of variants are essential to examine hypotheses concerning human evolution with precision. Using an open repository of genetic variant age estimations, we offer here a temporal evaluation of various evolutionarily relevant datasets, such as Homo sapiens-specific variants, high-frequency variants found in genetic windows under positive selection, introgressed variants from extinct human species, as well as putative regulatory variants specific to various brain regions. We find a recurrent bimodal distribution of high-frequency variants, but also evidence for specific enrichments of gene categories in distinct time windows, pointing to different periods of phenotypic changes, resulting in a mosaic. With a temporal classification of genetic mutations in hand, we then applied a machine learning tool to predict what genes have changed more in certain time windows, and which tissues these genes may have impacted more. Overall, we provide a fine-grained temporal mapping of derived variants in Homo sapiens that helps to illuminate the intricate evolutionary history of our species.
Collapse
Affiliation(s)
- Alejandro Andirkó
- Universitat de Barcelona, Barcelona, Spain
- Universitat de Barcelona Institute of Complex Systems (UBICS), Barcelona, Spain
| | - Juan Moriano
- Universitat de Barcelona, Barcelona, Spain
- Universitat de Barcelona Institute of Complex Systems (UBICS), Barcelona, Spain
| | - Alessandro Vitriolo
- University of Milan, Milan, Italy
- European Institute of Oncology (IEO), Milan, Italy
- Human Technopole, Milan, Italy
| | - Martin Kuhlwilm
- University of Vienna, Vienna, Austria
- Human Evolution and Archaeological Sciences (HEAS), University of Vienna, Vienna, Austria
| | - Giuseppe Testa
- University of Milan, Milan, Italy
- European Institute of Oncology (IEO), Milan, Italy
- Human Technopole, Milan, Italy
| | - Cedric Boeckx
- Universitat de Barcelona, Barcelona, Spain.
- Universitat de Barcelona Institute of Complex Systems (UBICS), Barcelona, Spain.
- Catalan Institute for Research and Advanced Studies (ICREA), Catalonia, Spain.
| |
Collapse
|
38
|
Abstract
The polar bear (Ursus maritimus) has become a symbol of the threat to biodiversity from climate change. Understanding polar bear evolutionary history may provide insights into apex carnivore responses and prospects during periods of extreme environmental perturbations. In recent years, genomic studies have examined bear speciation and population history, including evidence for ancient admixture between polar bears and brown bears (Ursus arctos). Here, we extend our earlier studies of a 130,000- to 115,000-y-old polar bear from the Svalbard Archipelago using a 10× coverage genome sequence and 10 new genomes of polar and brown bears from contemporary zones of overlap in northern Alaska. We demonstrate a dramatic decline in effective population size for this ancient polar bear’s lineage, followed by a modest increase just before its demise. A slightly higher genetic diversity in the ancient polar bear suggests a severe genetic erosion over a prolonged bottleneck in modern polar bears. Statistical fitting of data to alternative admixture graph scenarios favors at least one ancient introgression event from brown bears into the ancestor of polar bears, possibly dating back over 150,000 y. Gene flow was likely bidirectional, but allelic transfer from brown into polar bear is the strongest detected signal, which contrasts with other published work. These findings may have implications for our understanding of climate change impacts: Polar bears, a specialist Arctic lineage, may not only have undergone severe genetic bottlenecks but also been the recipient of generalist, boreal genetic variants from brown bears during critical phases of Northern Hemisphere glacial oscillations.
Collapse
|
39
|
Y. C. Brandt D, Wei X, Deng Y, Vaughn AH, Nielsen R. Evaluation of methods for estimating coalescence times using ancestral recombination graphs. Genetics 2022; 221:iyac044. [PMID: 35333304 PMCID: PMC9071567 DOI: 10.1093/genetics/iyac044] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2021] [Accepted: 03/08/2022] [Indexed: 11/12/2022] Open
Abstract
The ancestral recombination graph is a structure that describes the joint genealogies of sampled DNA sequences along the genome. Recent computational methods have made impressive progress toward scalably estimating whole-genome genealogies. In addition to inferring the ancestral recombination graph, some of these methods can also provide ancestral recombination graphs sampled from a defined posterior distribution. Obtaining good samples of ancestral recombination graphs is crucial for quantifying statistical uncertainty and for estimating population genetic parameters such as effective population size, mutation rate, and allele age. Here, we use standard neutral coalescent simulations to benchmark the estimates of pairwise coalescence times from 3 popular ancestral recombination graph inference programs: ARGweaver, Relate, and tsinfer+tsdate. We compare (1) the true coalescence times to the inferred times at each locus; (2) the distribution of coalescence times across all loci to the expected exponential distribution; (3) whether the sampled coalescence times have the properties expected of a valid posterior distribution. We find that inferred coalescence times at each locus are most accurate in ARGweaver, and often more accurate in Relate than in tsinfer+tsdate. However, all 3 methods tend to overestimate small coalescence times and underestimate large ones. Lastly, the posterior distribution of ARGweaver is closer to the expected posterior distribution than Relate's, but this higher accuracy comes at a substantial trade-off in scalability. The best choice of method will depend on the number and length of input sequences and on the goal of downstream analyses, and we provide guidelines for the best practices.
Collapse
Affiliation(s)
- Débora Y. C. Brandt
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA
| | - Xinzhu Wei
- Department of Computational Biology, Cornell University, Ithaca, NY 14850, USA
| | - Yun Deng
- Center for Computational Biology, University of California, Berkeley, CA 94720, USA
| | - Andrew H Vaughn
- Center for Computational Biology, University of California, Berkeley, CA 94720, USA
| | - Rasmus Nielsen
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA
- Center for Computational Biology, University of California, Berkeley, CA 94720, USA
- Department of Statistics, University of California Berkeley, Berkeley, CA 94720, USA
- GLOBE Institute, University of Copenhagen, Copenhagen K 1350, Denmark
| |
Collapse
|
40
|
Estimating bonobo ( Pan paniscus) and chimpanzee ( Pan troglodytes) evolutionary history from nucleotide site patterns. Proc Natl Acad Sci U S A 2022; 119:e2200858119. [PMID: 35452306 PMCID: PMC9170072 DOI: 10.1073/pnas.2200858119] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
There is genomic evidence of widespread admixture in deep time between many closely related species, including humans. Our closest living relatives, bonobos and chimpanzees, may also exhibit such patterns. However, assessing the exact degree of interbreeding remains challenging because previous studies have resulted in multiple inconsistent demographic models. We use an approach that addresses these gaps by analyzing all lineages, simultaneously estimating parameters, and comparing previously models. We find evidence of considerable introgression from western into eastern chimpanzees. We also show more breeding females than males and evidence of male-biased dispersal in western chimpanzees. These findings highlight the extent of admixture in bonobo and chimpanzee evolutionary history and are consistent with substantial differences between past and present chimpanzee biogeography. Admixture appears increasingly ubiquitous in the evolutionary history of various taxa, including humans. Such gene flow likely also occurred among our closest living relatives: bonobos (Pan paniscus) and chimpanzees (Pan troglodytes). However, our understanding of their evolutionary history has been limited by studies that do not consider all Pan lineages or do not analyze all lineages simultaneously, resulting in conflicting demographic models. Here, we investigate this gap in knowledge using nucleotide site patterns calculated from whole-genome sequences from the autosomes of 71 bonobos and chimpanzees, representing all five extant Pan lineages. We estimated demographic parameters and compared all previously proposed demographic models for this clade. We further considered sex bias in Pan evolutionary history by analyzing the site patterns from the X chromosome. We show that 1) 21% of autosomal DNA in eastern chimpanzees derives from western chimpanzee introgression and that 2) all four chimpanzee lineages share a common ancestor about 987,000 y ago, much earlier than previous estimates. In addition, we suggest that 3) there was male reproductive skew throughout Pan evolutionary history and find evidence of 4) male-biased dispersal from western to eastern chimpanzees. Collectively, these results offer insight into bonobo and chimpanzee evolutionary history and suggest considerable differences between current and historic chimpanzee biogeography.
Collapse
|
41
|
Wohns AW, Wong Y, Jeffery B, Akbari A, Mallick S, Pinhasi R, Patterson N, Reich D, Kelleher J, McVean G. A unified genealogy of modern and ancient genomes. Science 2022; 375:eabi8264. [PMID: 35201891 PMCID: PMC10027547 DOI: 10.1126/science.abi8264] [Citation(s) in RCA: 63] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
The sequencing of modern and ancient genomes from around the world has revolutionized our understanding of human history and evolution. However, the problem of how best to characterize ancestral relationships from the totality of human genomic variation remains unsolved. Here, we address this challenge with nonparametric methods that enable us to infer a unified genealogy of modern and ancient humans. This compact representation of multiple datasets explores the challenges of missing and erroneous data and uses ancient samples to constrain and date relationships. We demonstrate the power of the method to recover relationships between individuals and populations as well as to identify descendants of ancient samples. Finally, we introduce a simple nonparametric estimator of the geographical location of ancestors that recapitulates key events in human history.
Collapse
Affiliation(s)
- Anthony Wilder Wohns
- Broad Institute of MIT and Harvard; Cambridge, MA 02142, USA
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford; Oxford OX3 7LF, UK
| | - Yan Wong
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford; Oxford OX3 7LF, UK
| | - Ben Jeffery
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford; Oxford OX3 7LF, UK
| | - Ali Akbari
- Broad Institute of MIT and Harvard; Cambridge, MA 02142, USA
- Department of Human Evolutionary Biology, Harvard University; Cambridge, MA 02138, USA
- Department of Genetics, Harvard Medical School; Boston, MA 02115, USA
| | - Swapan Mallick
- Broad Institute of MIT and Harvard; Cambridge, MA 02142, USA
- Howard Hughes Medical Institute; Boston, MA 02115, USA
| | - Ron Pinhasi
- Department of Evolutionary Anthropology, University of Vienna; 1090 Vienna, Austria
| | - Nick Patterson
- Broad Institute of MIT and Harvard; Cambridge, MA 02142, USA
- Department of Human Evolutionary Biology, Harvard University; Cambridge, MA 02138, USA
- Howard Hughes Medical Institute; Boston, MA 02115, USA
- Department of Genetics, Harvard Medical School; Boston, MA 02115, USA
| | - David Reich
- Broad Institute of MIT and Harvard; Cambridge, MA 02142, USA
- Department of Human Evolutionary Biology, Harvard University; Cambridge, MA 02138, USA
- Howard Hughes Medical Institute; Boston, MA 02115, USA
- Department of Genetics, Harvard Medical School; Boston, MA 02115, USA
| | - Jerome Kelleher
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford; Oxford OX3 7LF, UK
| | - Gil McVean
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford; Oxford OX3 7LF, UK
- Corresponding author.
| |
Collapse
|
42
|
Slimak L, Zanolli C, Higham T, Frouin M, Schwenninger JL, Arnold LJ, Demuro M, Douka K, Mercier N, Guérin G, Valladas H, Yvorra P, Giraud Y, Seguin-Orlando A, Orlando L, Lewis JE, Muth X, Camus H, Vandevelde S, Buckley M, Mallol C, Stringer C, Metz L. Modern human incursion into Neanderthal territories 54,000 years ago at Mandrin, France. SCIENCE ADVANCES 2022; 8:eabj9496. [PMID: 35138885 PMCID: PMC8827661 DOI: 10.1126/sciadv.abj9496] [Citation(s) in RCA: 48] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Determining the extent of overlap between modern humans and other hominins in Eurasia, such as Neanderthals and Denisovans, is fundamental to understanding the nature of their interactions and what led to the disappearance of archaic hominins. Apart from a possible sporadic pulse recorded in Greece during the Middle Pleistocene, the first settlements of modern humans in Europe have been constrained to ~45,000 to 43,000 years ago. Here, we report hominin fossils from Grotte Mandrin in France that reveal the earliest known presence of modern humans in Europe between 56,800 and 51,700 years ago. This early modern human incursion in the Rhône Valley is associated with technologies unknown in any industry of that age outside Africa or the Levant. Mandrin documents the first alternating occupation of Neanderthals and modern humans, with a modern human fossil and associated Neronian lithic industry found stratigraphically between layers containing Neanderthal remains associated with Mousterian industries.
Collapse
Affiliation(s)
- Ludovic Slimak
- CNRS, UMR 5608, TRACES, Université de Toulouse Jean Jaurès, 5 Allées Antonio Machado, 31058 Toulouse Cedex 9, France
- Corresponding author. (L.S.); (C.Z.)
| | - Clément Zanolli
- Université de Bordeaux, CNRS, MCC, PACEA, UMR 5199, 33600 Pessac, France
- Corresponding author. (L.S.); (C.Z.)
| | - Tom Higham
- Research Laboratory for Archaeology and the History of Art, University of Oxford, Dyson Perrins Building, South Parks Road, Oxford OX1 3QY, UK
- Department of Evolutionary Anthropology, University of Vienna, University Biology Building, Djerassiplatz 1, A-1030 Vienna, Austria
| | - Marine Frouin
- Research Laboratory for Archaeology and the History of Art, University of Oxford, Dyson Perrins Building, South Parks Road, Oxford OX1 3QY, UK
- Department of Geosciences, Stony Brook University, 255 Earth and Space Sciences Building, Stony Brook, NY 11794-2100, USA
- Turkana Basin Institute, Stony Brook University, Stony Brook, NY 11794-4364, USA
| | - Jean-Luc Schwenninger
- Research Laboratory for Archaeology and the History of Art, University of Oxford, Dyson Perrins Building, South Parks Road, Oxford OX1 3QY, UK
| | - Lee J. Arnold
- School of Physical Sciences, Environment Institute, Institute for Photonics and Advanced Sensing (IPAS), University of Adelaide, North Terrace Campus, Adelaide, SA 5005, Australia
| | - Martina Demuro
- School of Physical Sciences, Environment Institute, Institute for Photonics and Advanced Sensing (IPAS), University of Adelaide, North Terrace Campus, Adelaide, SA 5005, Australia
| | - Katerina Douka
- Department of Evolutionary Anthropology, University of Vienna, University Biology Building, Djerassiplatz 1, A-1030 Vienna, Austria
- Department of Archaeology, Max Planck Institute for the Science of Human History, Kahlaische, Str. 10, 07745 Jena, Germany
| | - Norbert Mercier
- CNRS, UMR 5060, Institut de Recherche sur les Archéomatériaux and Centre de Recherche en Physique Appliquée à l’Archéologie (CRP2A), Maison de l’Archéologie, Université Bordeaux Montaigne, 33607 Pessac, France
| | - Gilles Guérin
- Laboratoire des Sciences du Climat et de l’Environnement, LSCE/IPSL, UMR 8212 CEA CNRS UVSQ, Université Paris-Saclay, 91191 Gif-sur-Yvette, France
| | - Hélène Valladas
- Laboratoire des Sciences du Climat et de l’Environnement, LSCE/IPSL, UMR 8212 CEA CNRS UVSQ, Université Paris-Saclay, 91191 Gif-sur-Yvette, France
| | - Pascale Yvorra
- Aix-Marseille Université, CNRS, Min. Culture, UMR 7269, LAMPEA, Maison Méditerranéenne des Sciences de l’Homme, BP 647, 5 rue du Château de l’Horloge, F-13094, Aix-en-Provence Cedex 2, France
| | - Yves Giraud
- Aix-Marseille Université, CNRS, Min. Culture, UMR 7269, LAMPEA, Maison Méditerranéenne des Sciences de l’Homme, BP 647, 5 rue du Château de l’Horloge, F-13094, Aix-en-Provence Cedex 2, France
| | | | - Ludovic Orlando
- CNRS, UMR 5288, CAGT, Université Toulouse III Paul Sabatier, Toulouse, France
| | - Jason E. Lewis
- Turkana Basin Institute, Stony Brook University, Stony Brook, NY 11794-4364, USA
- Department of Anthropology, Stony Brook University, Stony Brook, NY 11794-4364, USA
| | | | - Hubert Camus
- PROTEE-EXPERT, 4 rue des Aspholdèles, 34750 Villeneuve-lès-Maguelone, France
| | - Ségolène Vandevelde
- Laboratoire des Sciences du Climat et de l’Environnement, LSCE/IPSL, UMR 8212 CEA CNRS UVSQ, Université Paris-Saclay, 91191 Gif-sur-Yvette, France
- Université Paris 1–Panthéon-Sorbonne, Équipe Archéologies Environnementales, UMR 7041, ArScAn, Équipe Archéologies Environnementales, 21 allée de l’Université, 92023 Nanterre Cedex, France
| | - Mike Buckley
- Department of Earth and Environmental Sciences, University of Manchester, Oxford Road, Manchester M13 9PL, UK
| | - Carolina Mallol
- Archaeological Micromorphology and Biomarkers Laboratory (AMBI Lab), Instituto Universitario de Bio-Orgánica Antonio González, Departamento de Geografía e Historia, UDI Prehistoria, Arqueología e Historia Antigua, Facultad de Geografía e Historia, Universidad de La Laguna, Tenerife, Spain
| | - Chris Stringer
- Centre for Human Evolution Research (CHER), Department of Earth Sciences, Natural History Museum, London SW7 5BD, UK
| | - Laure Metz
- Aix-Marseille Université, CNRS, Min. Culture, UMR 7269, LAMPEA, Maison Méditerranéenne des Sciences de l’Homme, BP 647, 5 rue du Château de l’Horloge, F-13094, Aix-en-Provence Cedex 2, France
- College of Liberal Arts and Sciences, University of Connecticut, 215 Glenbrook Road, U-4098, Storrs, CT 06269-4098, USA
| |
Collapse
|
43
|
Simonin-Wilmer I, Orozco-del-Pino P, Bishop DT, Iles MM, Robles-Espinoza CD. An Overview of Strategies for Detecting Genotype-Phenotype Associations Across Ancestrally Diverse Populations. Front Genet 2021; 12:703901. [PMID: 34804113 PMCID: PMC8602802 DOI: 10.3389/fgene.2021.703901] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Accepted: 10/14/2021] [Indexed: 11/13/2022] Open
Abstract
Genome-wide association studies (GWAS) have been very successful at identifying genetic variants influencing a large number of traits. Although the great majority of these studies have been performed in European-descent individuals, it has been recognised that including populations with differing ancestries enhances the potential for identifying causal SNPs due to their differing patterns of linkage disequilibrium. However, when individuals from distinct ethnicities are included in a GWAS, it is necessary to implement a number of control steps to ensure that the identified associations are real genotype-phenotype relationships. In this Review, we discuss the analyses that are required when performing multi-ethnic studies, including methods for determining ancestry at the global and local level for sample exclusion, controlling for ancestry in association testing, and post-GWAS interrogation methods such as genomic control and meta-analysis. We hope that this overview provides a primer for those researchers interested in including distinct populations in their studies.
Collapse
Affiliation(s)
- Irving Simonin-Wilmer
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Campus Juriquilla, Queretaro, Mexico
| | | | - D. Timothy Bishop
- Leeds Institute for Data Analytics and Leeds Institute of Medical Research at St. James’s, University of Leeds, Leeds, United Kingdom
| | - Mark M. Iles
- Leeds Institute for Data Analytics and Leeds Institute of Medical Research at St. James’s, University of Leeds, Leeds, United Kingdom
| | - Carla Daniela Robles-Espinoza
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Campus Juriquilla, Queretaro, Mexico
- Wellcome Sanger Institute, Hinxton, Cambridge, United Kingdom
| |
Collapse
|
44
|
Li Y, Wu DD. Finding unknown species in the genomes of extant species. J Genet Genomics 2021; 48:867-871. [PMID: 34509382 DOI: 10.1016/j.jgg.2021.05.013] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 05/21/2021] [Accepted: 05/23/2021] [Indexed: 11/18/2022]
Abstract
Although many species have gone extinct, their genetic components might exist in extant species because of ancient hybridization. Via advances in genome sequencing and development of modern population genetics, one can find the legacy of unknown or extinct species in the context of available genomes from extant species. Such discovery can be used as a strategy to search for hidden species or fossils in conservation biology and archeology, gain novel insight into complex evolutionary history, and provide the new sources of genetic variation for breeding and trait improvement in agriculture.
Collapse
Affiliation(s)
- Yan Li
- State Key Laboratory for Conservation and Utilization of Bio-resource in Yunnan and School of Life Science & School of Ecology and Environmental Science, Yunnan University, Kunming 650091, China.
| | - Dong-Dong Wu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China
| |
Collapse
|
45
|
Beyer RM, Krapp M, Eriksson A, Manica A. Climatic windows for human migration out of Africa in the past 300,000 years. Nat Commun 2021; 12:4889. [PMID: 34429408 PMCID: PMC8384873 DOI: 10.1038/s41467-021-24779-1] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Accepted: 07/02/2021] [Indexed: 11/17/2022] Open
Abstract
Whilst an African origin of modern humans is well established, the timings and routes of their expansions into Eurasia are the subject of heated debate, due to the scarcity of fossils and the lack of suitably old ancient DNA. Here, we use high-resolution palaeoclimate reconstructions to estimate how difficult it would have been for humans in terms of rainfall availability to leave the African continent in the past 300k years. We then combine these results with an anthropologically and ecologically motivated estimate of the minimum level of rainfall required by hunter-gatherers to survive, allowing us to reconstruct when, and along which geographic paths, expansions out of Africa would have been climatically feasible. The estimated timings and routes of potential contact with Eurasia are compatible with archaeological and genetic evidence of human expansions out of Africa, highlighting the key role of palaeoclimate variability for modern human dispersals.
Collapse
Affiliation(s)
- Robert M Beyer
- Department of Zoology, University of Cambridge, Cambridge, UK.
- Potsdam Institute for Climate Impact Research (PIK), Member of the Leibniz Association, Potsdam, Germany.
| | - Mario Krapp
- Department of Zoology, University of Cambridge, Cambridge, UK
- GNS Science, Lower Hutt, New Zealand
| | - Anders Eriksson
- cGEM, cGEM, Institute of Genomics, University of Tartu, Tartu, Estonia
- Department of Medical and Molecular Genetics, King's College London, London, UK
| | - Andrea Manica
- Department of Zoology, University of Cambridge, Cambridge, UK.
| |
Collapse
|
46
|
Middle Pleistocene fire use: The first signal of widespread cultural diffusion in human evolution. Proc Natl Acad Sci U S A 2021; 118:2101108118. [PMID: 34301807 PMCID: PMC8346817 DOI: 10.1073/pnas.2101108118] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Control of fire is one of the most important technological innovations within the evolution of humankind. The archaeological signal of fire use becomes very visible from around 400,000 y ago onward. Interestingly, this occurs at a geologically similar time over major parts of the Old World, in Africa, as well as in western Eurasia, and in different subpopulations of the wider hominin metapopulation. We interpret this spatiotemporal pattern as the result of cultural diffusion, and as representing the earliest clear-cut case of widespread cultural change resulting from diffusion in human evolution. This fire-use pattern is followed slightly later by a similar spatiotemporal distribution of Levallois technology, at the beginning of the African Middle Stone Age and the western Eurasian Middle Paleolithic. These archaeological data, as well as studies of ancient genomes, lead us to hypothesize that at the latest by 400,000 y ago, hominin subpopulations encountered one another often enough and were sufficiently tolerant toward one another to transmit ideas and techniques over large regions within relatively short time periods. Furthermore, it is likely that the large-scale social networks necessary to transmit complicated skills were also in place. Most importantly, this suggests a form of cultural behavior significantly more similar to that of extant Homo sapiens than to our great ape relatives.
Collapse
|
47
|
Bourgeois Y, Fields P, Bento G, Ebert D. Balancing selection for pathogen resistance reveals an intercontinental signature of Red Queen coevolution. Mol Biol Evol 2021; 38:4918-4933. [PMID: 34289047 PMCID: PMC8557431 DOI: 10.1093/molbev/msab217] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The link between long-term host–parasite coevolution and genetic diversity is key to understanding genetic epidemiology and the evolution of resistance. The model of Red Queen host–parasite coevolution posits that high genetic diversity is maintained when rare host resistance variants have a selective advantage, which is believed to be the mechanistic basis for the extraordinarily high levels of diversity at disease-related genes such as the major histocompatibility complex in jawed vertebrates and R-genes in plants. The parasites that drive long-term coevolution are, however, often elusive. Here we present evidence for long-term balancing selection at the phenotypic (variation in resistance) and genomic (resistance locus) level in a particular host–parasite system: the planktonic crustacean Daphnia magna and the bacterium Pasteuria ramosa. The host shows widespread polymorphisms for pathogen resistance regardless of geographic distance, even though there is a clear genome-wide pattern of isolation by distance at other sites. In the genomic region of a previously identified resistance supergene, we observed consistent molecular signals of balancing selection, including higher genetic diversity, older coalescence times, and lower differentiation between populations, which set this region apart from the rest of the genome. We propose that specific long-term coevolution by negative-frequency-dependent selection drives this elevated diversity at the host's resistance loci on an intercontinental scale and provide an example of a direct link between the host’s resistance to a virulent pathogen and the large-scale diversity of its underlying genes.
Collapse
Affiliation(s)
- Yann Bourgeois
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, 4051 Basel, Switzerland
| | - Peter Fields
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, 4051 Basel, Switzerland
| | - Gilberto Bento
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, 4051 Basel, Switzerland
| | - Dieter Ebert
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, 4051 Basel, Switzerland
| |
Collapse
|
48
|
Ahlquist KD, Bañuelos MM, Funk A, Lai J, Rong S, Villanea FA, Witt KE. Our Tangled Family Tree: New Genomic Methods Offer Insight into the Legacy of Archaic Admixture. Genome Biol Evol 2021; 13:evab115. [PMID: 34028527 PMCID: PMC8480178 DOI: 10.1093/gbe/evab115] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Revised: 05/07/2021] [Accepted: 05/22/2021] [Indexed: 11/30/2022] Open
Abstract
The archaic ancestry present in the human genome has captured the imagination of both scientists and the wider public in recent years. This excitement is the result of new studies pushing the envelope of what we can learn from the archaic genetic information that has survived for over 50,000 years in the human genome. Here, we review the most recent ten years of literature on the topic of archaic introgression, including the current state of knowledge on Neanderthal and Denisovan introgression, as well as introgression from other as-yet unidentified archaic populations. We focus this review on four topics: 1) a reimagining of human demographic history, including evidence for multiple admixture events between modern humans, Neanderthals, Denisovans, and other archaic populations; 2) state-of-the-art methods for detecting archaic ancestry in population-level genomic data; 3) how these novel methods can detect archaic introgression in modern African populations; and 4) the functional consequences of archaic gene variants, including how those variants were co-opted into novel function in modern human populations. The goal of this review is to provide a simple-to-access reference for the relevant methods and novel data, which has changed our understanding of the relationship between our species and its siblings. This body of literature reveals the large degree to which the genetic legacy of these extinct hominins has been integrated into the human populations of today.
Collapse
Affiliation(s)
- K D Ahlquist
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, Rhode Island, USA
| | - Mayra M Bañuelos
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, Rhode Island, USA
| | - Alyssa Funk
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, Rhode Island, USA
| | - Jiaying Lai
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Brown Center for Biomedical Informatics, Brown University, Providence, Rhode Island, USA
| | - Stephen Rong
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, Rhode Island, USA
| | - Fernando A Villanea
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Anthropology, University of Colorado Boulder, Colorado, USA
| | - Kelsey E Witt
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Ecology and Evolutionary Biology, Brown University, Providence, Rhode Island, USA
| |
Collapse
|
49
|
Deng Y, Song YS, Nielsen R. The distribution of waiting distances in ancestral recombination graphs. Theor Popul Biol 2021; 141:34-43. [PMID: 34186053 DOI: 10.1016/j.tpb.2021.06.003] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 06/11/2021] [Accepted: 06/16/2021] [Indexed: 11/25/2022]
Abstract
The ancestral recombination graph (ARG) contains the full genealogical information of the sample, and many population genetic inference problems can be solved using inferred or sampled ARGs. In particular, the waiting distance between tree changes along the genome can be used to make inference about the distribution and evolution of recombination rates. To this end, we here derive an analytic expression for the distribution of waiting distances between tree changes under the sequentially Markovian coalescent model and obtain an accurate approximation to the distribution of waiting distances for topology changes. We use these results to show that some of the recently proposed methods for inferring sequences of trees along the genome provide strongly biased distributions of waiting distances. In addition, we provide a correction to an undercounting problem facing all available ARG inference methods, thereby facilitating the use of ARG inference methods to estimate temporal changes in the recombination rate.
Collapse
Affiliation(s)
- Yun Deng
- Center for Computational Biology, University of California, Berkeley, CA 94720, United States of America.
| | - Yun S Song
- Department of Statistics, University of California, Berkeley, CA 94720, United States of America; Computer Science Division, University of California, Berkeley, CA 94720, United States of America; Chan Zuckerberg Biohub, San Francisco, CA 94158, United States of America
| | - Rasmus Nielsen
- Department of Statistics, University of California, Berkeley, CA 94720, United States of America; Department of Integrative biology, University of California, Berkeley, CA 94720, United States of America.
| |
Collapse
|
50
|
Hershkovitz I, May H, Sarig R, Pokhojaev A, Grimaud-Hervé D, Bruner E, Fornai C, Quam R, Arsuaga JL, Krenn VA, Martinón-Torres M, de Castro JMB, Martín-Francés L, Slon V, Albessard-Ball L, Vialet A, Schüler T, Manzi G, Profico A, Di Vincenzo F, Weber GW, Zaidner Y. A Middle Pleistocene
Homo
from Nesher Ramla, Israel. Science 2021. [DOI: 10.1126/science.abh3169] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Affiliation(s)
- Israel Hershkovitz
- Department of Anatomy and Anthropology, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- The Shmunis Family Anthropology Institute, the Dan David Center for Human Evolution and Biohistory Research, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Hila May
- Department of Anatomy and Anthropology, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- The Shmunis Family Anthropology Institute, the Dan David Center for Human Evolution and Biohistory Research, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Rachel Sarig
- The Shmunis Family Anthropology Institute, the Dan David Center for Human Evolution and Biohistory Research, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Department of Oral Biology, the Goldschleger School of Dental Medicine, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Ariel Pokhojaev
- Department of Anatomy and Anthropology, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- The Shmunis Family Anthropology Institute, the Dan David Center for Human Evolution and Biohistory Research, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Department of Oral Biology, the Goldschleger School of Dental Medicine, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Dominique Grimaud-Hervé
- UMR7194, HNHP, Département Homme et Environnement, Muséum national d’Histoire naturelle, CNRS, UPVD, Paris, France
| | - Emiliano Bruner
- CENIEH (National Research Center on Human Evolution), Burgos, Spain
| | - Cinzia Fornai
- Institute of Evolutionary Medicine, University of Zurich, Zurich, Switzerland
- Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria
| | - Rolf Quam
- Department of Anthropology, Binghamton University (SUNY), Binghamton, NY, USA
- Centro UCM-ISCIII de Evolución y Comportamiento Humanos, Madrid, Spain
- Division of Anthropology, American Museum of Natural History, New York, NY, USA
| | - Juan Luis Arsuaga
- Centro UCM-ISCIII de Evolución y Comportamiento Humanos, Madrid, Spain
- Departamento de Geodináica, Estratigrafía y Paleontología, Facultad de Ciencias Geológicas, Universidad Complutense de Madrid, Ciudad Universitaria s/n, 28040, Madrid, Spain
| | - Viktoria A. Krenn
- Institute of Evolutionary Medicine, University of Zurich, Zurich, Switzerland
- Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria
| | - Maria Martinón-Torres
- CENIEH (National Research Center on Human Evolution), Burgos, Spain
- Department of Anthropology, University College London, London, UK
| | - José María Bermúdez de Castro
- CENIEH (National Research Center on Human Evolution), Burgos, Spain
- Department of Anthropology, University College London, London, UK
| | - Laura Martín-Francés
- CENIEH (National Research Center on Human Evolution), Burgos, Spain
- Department of Anthropology, University College London, London, UK
| | - Viviane Slon
- Department of Anatomy and Anthropology, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- The Shmunis Family Anthropology Institute, the Dan David Center for Human Evolution and Biohistory Research, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Lou Albessard-Ball
- UMR7194, HNHP, Département Homme et Environnement, Muséum national d’Histoire naturelle, CNRS, UPVD, Paris, France
- PalaeoHub, Department of Archaeology, University of York, York, UK
| | - Amélie Vialet
- UMR7194, HNHP, Département Homme et Environnement, Muséum national d’Histoire naturelle, CNRS, UPVD, Paris, France
| | - Tim Schüler
- Thuringian State Office for the Preservation of Historical Monuments and Archaeology Weimar, Germany
| | - Giorgio Manzi
- Department of Environmental Biology, Sapienza University of Rome, Roma, Italy
| | - Antonio Profico
- PalaeoHub, Department of Archaeology, University of York, York, UK
- Department of Environmental Biology, Sapienza University of Rome, Roma, Italy
| | - Fabio Di Vincenzo
- Department of Environmental Biology, Sapienza University of Rome, Roma, Italy
| | - Gerhard W. Weber
- Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria
- Core Facility for Micro-Computed Tomography, University of Vienna, Vienna, Austria
| | - Yossi Zaidner
- Institute of Archaeology, The Hebrew University of Jerusalem, Jerusalem, Israel
- Zinman Institute of Archaeology, University of Haifa, Haifa, Mount Carmel, Israel
| |
Collapse
|