1
|
Ferrando-Bernal M, Brand CM, Capra JA. Inferring human phenotypes using ancient DNA: from molecules to populations. Curr Opin Genet Dev 2025; 90:102283. [PMID: 39612613 DOI: 10.1016/j.gde.2024.102283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2024] [Revised: 10/04/2024] [Accepted: 11/04/2024] [Indexed: 12/01/2024]
Abstract
The increasing availability of ancient DNA (aDNA) from human groups across space and time has yielded deep insights into the movements of our species. However, given the challenges of mapping from genotype to phenotype, aDNA has revealed less about the phenotypes of ancient individuals. In this review, we highlight recent advances in inferring ancient phenotypes - from the molecular to population scale - with a focus on applications enabled by new machine learning approaches. The genetic architecture of complex traits across human groups suggests that the prediction of individual-level complex traits, like behavior or disease risk, is often challenging across the relevant evolutionary distances. Thus, we propose an approach that integrates predictions of molecular phenotypes, whose mechanisms are more conserved, with nongenetic data.
Collapse
Affiliation(s)
- Manuel Ferrando-Bernal
- Bakar Computational Health Science Institute, University of California San Francisco, San Francisco, CA, USA; Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, CA, USA.
| | - Colin M Brand
- Bakar Computational Health Science Institute, University of California San Francisco, San Francisco, CA, USA; Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, CA, USA.
| | - John A Capra
- Bakar Computational Health Science Institute, University of California San Francisco, San Francisco, CA, USA; Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, CA, USA; Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA.
| |
Collapse
|
2
|
Rong S, Root E, Reilly SK. Massively parallel approaches for characterizing noncoding functional variation in human evolution. Curr Opin Genet Dev 2024; 88:102256. [PMID: 39217658 PMCID: PMC11648527 DOI: 10.1016/j.gde.2024.102256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Revised: 08/02/2024] [Accepted: 08/16/2024] [Indexed: 09/04/2024]
Abstract
The genetic differences underlying unique phenotypes in humans compared to our closest primate relatives have long remained a mystery. Similarly, the genetic basis of adaptations between human groups during our expansion across the globe is poorly characterized. Uncovering the downstream phenotypic consequences of these genetic variants has been difficult, as a substantial portion lies in noncoding regions, such as cis-regulatory elements (CREs). Here, we review recent high-throughput approaches to measure the functions of CREs and the impact of variation within them. CRISPR screens can directly perturb CREs in the genome to understand downstream impacts on gene expression and phenotypes, while massively parallel reporter assays can decipher the regulatory impact of sequence variants. Machine learning has begun to be able to predict regulatory function from sequence alone, further scaling our ability to characterize genome function. Applying these tools across diverse phenotypes, model systems, and ancestries is beginning to revolutionize our understanding of noncoding variation underlying human evolution.
Collapse
Affiliation(s)
- Stephen Rong
- Department of Genetics, Yale University, New Haven, CT, USA.
| | - Elise Root
- Department of Genetics, Yale University, New Haven, CT, USA
| | - Steven K Reilly
- Department of Genetics, Yale University, New Haven, CT, USA; Wu Tsai Institute, Yale University, New Haven, CT, USA.
| |
Collapse
|
3
|
Williams MP, Flegontov P, Maier R, Huber CD. Testing times: disentangling admixture histories in recent and complex demographies using ancient DNA. Genetics 2024; 228:iyae110. [PMID: 39013011 PMCID: PMC11373510 DOI: 10.1093/genetics/iyae110] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Revised: 04/08/2024] [Accepted: 06/11/2024] [Indexed: 07/18/2024] Open
Abstract
Our knowledge of human evolutionary history has been greatly advanced by paleogenomics. Since the 2020s, the study of ancient DNA has increasingly focused on reconstructing the recent past. However, the accuracy of paleogenomic methods in resolving questions of historical and archaeological importance amidst the increased demographic complexity and decreased genetic differentiation remains an open question. We evaluated the performance and behavior of two commonly used methods, qpAdm and the f3-statistic, on admixture inference under a diversity of demographic models and data conditions. We performed two complementary simulation approaches-firstly exploring a wide demographic parameter space under four simple demographic models of varying complexities and configurations using branch-length data from two chromosomes-and secondly, we analyzed a model of Eurasian history composed of 59 populations using whole-genome data modified with ancient DNA conditions such as SNP ascertainment, data missingness, and pseudohaploidization. We observe that population differentiation is the primary factor driving qpAdm performance. Notably, while complex gene flow histories influence which models are classified as plausible, they do not reduce overall performance. Under conditions reflective of the historical period, qpAdm most frequently identifies the true model as plausible among a small candidate set of closely related populations. To increase the utility for resolving fine-scaled hypotheses, we provide a heuristic for further distinguishing between candidate models that incorporates qpAdm model P-values and f3-statistics. Finally, we demonstrate a significant performance increase for qpAdm using whole-genome branch-length f2-statistics, highlighting the potential for improved demographic inference that could be achieved with future advancements in f-statistic estimations.
Collapse
Affiliation(s)
- Matthew P Williams
- Department of Biology, Pennsylvania State University, University Park, PA 16802, USA
| | - Pavel Flegontov
- Department of Biology and Ecology, University of Ostrava, Ostrava 701 03, Czechia
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | - Robert Maier
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | - Christian D Huber
- Department of Biology, Pennsylvania State University, University Park, PA 16802, USA
| |
Collapse
|
4
|
Meiri M, Bar-Oz G. Unraveling the diversity and cultural heritage of fruit crops through paleogenomics. Trends Genet 2024; 40:398-409. [PMID: 38423916 PMCID: PMC11079635 DOI: 10.1016/j.tig.2024.02.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 02/06/2024] [Accepted: 02/06/2024] [Indexed: 03/02/2024]
Abstract
Abundant and plentiful fruit crops are threatened by the loss of diverse legacy cultivars which are being replaced by a limited set of high-yielding ones. This article delves into the potential of paleogenomics that utilizes ancient DNA analysis to revive lost diversity. By focusing on grapevines, date palms, and tomatoes, recent studies showcase the effectiveness of paleogenomic techniques in identifying and understanding genetic traits crucial for crop resilience, disease resistance, and nutritional value. The approach not only tracks landrace dispersal and introgression but also sheds light on domestication events. In the face of major future environmental challenges, integrating paleogenomics with modern breeding strategies emerges as a promising avenue to significantly bolster fruit crop sustainability.
Collapse
Affiliation(s)
- Meirav Meiri
- The Steinhardt Museum of Natural History and Israel National Center for Biodiversity Studies, Tel Aviv University, Tel Aviv 6997801, Israel.
| | - Guy Bar-Oz
- School of Archaeology and Maritime Cultures, University of Haifa, Haifa, 3498837 Mount Carmel, Israel
| |
Collapse
|
5
|
Padilla-Iglesias C, Derkx I. Hunter-gatherer genetics research: Importance and avenues. EVOLUTIONARY HUMAN SCIENCES 2024; 6:e15. [PMID: 38516374 PMCID: PMC10955370 DOI: 10.1017/ehs.2024.7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2023] [Revised: 01/17/2024] [Accepted: 02/02/2024] [Indexed: 03/23/2024] Open
Abstract
Major developments in the field of genetics in the past few decades have revolutionised notions of what it means to be human. Although currently only a few populations around the world practise a hunting and gathering lifestyle, this mode of subsistence has characterised members of our species since its very origins and allowed us to migrate across the planet. Therefore, the geographical distribution of hunter-gatherer populations, dependence on local ecosystems and connections to past populations and neighbouring groups have provided unique insights into our evolutionary origins. However, given the vulnerable status of hunter-gatherers worldwide, the development of the field of anthropological genetics requires that we reevaluate how we conduct research with these communities. Here, we review how the inclusion of hunter-gatherer populations in genetics studies has advanced our understanding of human origins, ancient population migrations and interactions as well as phenotypic adaptations and adaptability to different environments, and the important scientific and medical applications of these advancements. At the same time, we highlight the necessity to address yet unresolved questions and identify areas in which the field may benefit from improvements.
Collapse
Affiliation(s)
| | - Inez Derkx
- Department of Evolutionary Anthropology, University of Zurich, Zurich, Switzerland
| |
Collapse
|
6
|
Izarraras-Gomez A, Ortega-Del Vecchyo D. Ancient DNA uncovers past migrations in California. Nature 2023; 624:43-44. [PMID: 37993617 DOI: 10.1038/d41586-023-03503-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2023]
|
7
|
Williams MP, Flegontov P, Maier R, Huber CD. Testing Times: Challenges in Disentangling Admixture Histories in Recent and Complex Demographies. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.13.566841. [PMID: 38014190 PMCID: PMC10680674 DOI: 10.1101/2023.11.13.566841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Paleogenomics has expanded our knowledge of human evolutionary history. Since the 2020s, the study of ancient DNA has increased its focus on reconstructing the recent past. However, the accuracy of paleogenomic methods in answering questions of historical and archaeological importance amidst the increased demographic complexity and decreased genetic differentiation within the historical period remains an open question. We used two simulation approaches to evaluate the limitations and behavior of commonly used methods, qpAdm and the f3-statistic, on admixture inference. The first is based on branch-length data simulated from four simple demographic models of varying complexities and configurations. The second, an analysis of Eurasian history composed of 59 populations using whole-genome data modified with ancient DNA conditions such as SNP ascertainment, data missingness, and pseudo-haploidization. We show that under conditions resembling historical populations, qpAdm can identify a small candidate set of true sources and populations closely related to them. However, in typical ancient DNA conditions, qpAdm is unable to further distinguish between them, limiting its utility for resolving fine-scaled hypotheses. Notably, we find that complex gene-flow histories generally lead to improvements in the performance of qpAdm and observe no bias in the estimation of admixture weights. We offer a heuristic for admixture inference that incorporates admixture weight estimate and P-values of qpAdm models, and f3-statistics to enhance the power to distinguish between multiple plausible candidates. Finally, we highlight the future potential of qpAdm through whole-genome branch-length f2-statistics, demonstrating the improved demographic inference that could be achieved with advancements in f-statistic estimations.
Collapse
Affiliation(s)
- Matthew P. Williams
- Pennsylvania State University, Department of Biology, University Park, PA 16802, USA
| | - Pavel Flegontov
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Robert Maier
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Christian D. Huber
- Pennsylvania State University, Department of Biology, University Park, PA 16802, USA
| |
Collapse
|