1
|
Shen CC, Miura I, Lin TH, Toda M, Nguyen HN, Tseng HY, Lin SM. Exploring Mitonuclear Discordance: Ghost Introgression From an Ancient Extinction Lineage in the Odorrana swinhoana Complex. Mol Ecol 2025; 34:e17763. [PMID: 40219663 DOI: 10.1111/mec.17763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Revised: 03/11/2025] [Accepted: 03/31/2025] [Indexed: 04/14/2025]
Abstract
Mitonuclear discordance, the incongruence between mitochondrial DNA (mtDNA) and nuclear DNA (nuDNA), is a well-documented phenomenon with various potential explanations. One emerging hypothesis, ghost introgression, refers to the genetic contribution of an ancient, extinct or unsampled lineage and can now be tested using modern genomic data and demographic models. In this study, we investigated the evolutionary history of the Odorrana swinhoana complex (Anura: Ranidae), which includes O. swinhoana, O. utsunomiyaorum and an unidentified population with highly divergent mtDNA. While mitochondrial phylogeny suggested this population as a basal lineage, nuclear data from ddRADseq revealed it as a mixture of the most derived O. swinhoana nuclear sequences combined with ancient mtDNA. Demographic modelling further supported ghost introgression, as all models incorporating a ghost population outperformed those without it. These findings suggest that an eastward expansion of western O. swinhoana replaced an ancient Odorrana lineage, leaving only its mtDNA and fragments of its nuclear genome in the hybrid population. Our results provide one of the first documented cases of ghost introgression in amphibians and highlight its potential as a widespread evolutionary process. This study also underscores the risks of relying solely on mtDNA for phylogenetic reconstruction and species delimitation.
Collapse
Affiliation(s)
- Chin-Chia Shen
- School of Life Science, National Taiwan Normal University, Taipei, Taiwan
| | - Ikuo Miura
- Amphibian Research Center, Hiroshima University, Higashi-Hiroshima, Japan
| | - Tzong-Han Lin
- School of Life Science, National Taiwan Normal University, Taipei, Taiwan
| | - Mamoru Toda
- Tropical Biosphere Research Center, University of the Ryukyus, Okinawa, Japan
| | - Hung Ngoc Nguyen
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Chiba-ken, Japan
| | - Hui-Yun Tseng
- Department of Entomology, National Taiwan University, Taipei, Taiwan
| | - Si-Min Lin
- School of Life Science, National Taiwan Normal University, Taipei, Taiwan
| |
Collapse
|
2
|
Hackl J, Huang X. Revisiting adaptive introgression at the HLA genes in Lithuanian genomes with machine learning. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2025; 127:105708. [PMID: 39732272 DOI: 10.1016/j.meegid.2024.105708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2024] [Revised: 12/17/2024] [Accepted: 12/20/2024] [Indexed: 12/30/2024]
Affiliation(s)
- Josef Hackl
- Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria
| | - Xin Huang
- Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria; Human Evolution and Archaeological Sciences (HEAS), University of Vienna, Vienna, Austria.
| |
Collapse
|
3
|
Witt KE, Villanea FA. Computational Genomics and Its Applications to Anthropological Questions. AMERICAN JOURNAL OF BIOLOGICAL ANTHROPOLOGY 2024; 186 Suppl 78:e70010. [PMID: 40071816 PMCID: PMC11898561 DOI: 10.1002/ajpa.70010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/24/2024] [Revised: 10/14/2024] [Accepted: 12/19/2024] [Indexed: 03/15/2025]
Abstract
The advent of affordable genome sequencing and the development of new computational tools have established a new era of genomic knowledge. Sequenced human genomes number in the tens of thousands, including thousands of ancient human genomes. The abundance of data has been met with new analysis tools that can be used to understand populations' demographic and evolutionary histories. Thus, a variety of computational methods now exist that can be leveraged to answer anthropological questions. This includes novel likelihood and Bayesian methods, machine learning techniques, and a vast array of population simulators. These computational tools provide powerful insights gained from genomic datasets, although they are generally inaccessible to those with less computational experience. Here, we outline the theoretical workings behind computational genomics methods, limitations and other considerations when applying these computational methods, and examples of how computational methods have already been applied to anthropological questions. We hope this review will empower other anthropologists to utilize these powerful tools in their own research.
Collapse
Affiliation(s)
- Kelsey E. Witt
- Department of Genetics and Biochemistry and Center for Human GeneticsClemson UniversityClemsonSouth CarolinaUSA
| | | |
Collapse
|
4
|
Kato S, Arakaki S, Nagano AJ, Kikuchi K, Hirase S. Genomic landscape of introgression from the ghost lineage in a gobiid fish uncovers the generality of forces shaping hybrid genomes. Mol Ecol 2024; 33:e17216. [PMID: 38047388 DOI: 10.1111/mec.17216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 09/23/2023] [Accepted: 10/26/2023] [Indexed: 12/05/2023]
Abstract
Extinct lineages can leave legacies in the genomes of extant lineages through ancient introgressive hybridization. The patterns of genomic survival of these extinct lineages provide insight into the role of extinct lineages in current biodiversity. However, our understanding on the genomic landscape of introgression from extinct lineages remains limited due to challenges associated with locating the traces of unsampled 'ghost' extinct lineages without ancient genomes. Herein, we conducted population genomic analyses on the East China Sea (ECS) lineage of Chaenogobius annularis, which was suspected to have originated from ghost introgression, with the aim of elucidating its genomic origins and characterizing its landscape of introgression. By combining phylogeographic analysis and demographic modelling, we demonstrated that the ECS lineage originated from ancient hybridization with an extinct ghost lineage. Forward simulations based on the estimated demography indicated that the statistic γ of the HyDe analysis can be used to distinguish the differences in local introgression rates in our data. Consistent with introgression between extant organisms, we found reduced introgression from extinct lineage in regions with low recombination rates and with functional importance, thereby suggesting a role of linked selection that has eliminated the extinct lineage in shaping the hybrid genome. Moreover, we identified enrichment of repetitive elements in regions associated with ghost introgression, which was hitherto little known but was also observed in the re-analysis of published data on introgression between extant organisms. Overall, our findings underscore the unexpected similarities in the characteristics of introgression landscapes across different taxa, even in cases of ghost introgression.
Collapse
Affiliation(s)
- Shuya Kato
- Fisheries Laboratory, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Hamamatsu, Shizuoka, Japan
| | - Seiji Arakaki
- Amakusa Marine Biological Laboratory, Kyushu University, Amakusa, Kumamoto, Japan
| | - Atsushi J Nagano
- Department of Life Sciences, Faculty of Agriculture, Ryukoku University, Ōtsu, Shiga, Japan
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata, Japan
| | - Kiyoshi Kikuchi
- Fisheries Laboratory, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Hamamatsu, Shizuoka, Japan
| | - Shotaro Hirase
- Fisheries Laboratory, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Hamamatsu, Shizuoka, Japan
| |
Collapse
|
5
|
Reyna-Blanco CS, Caduff M, Galimberti M, Leuenberger C, Wegmann D. Inference of Locus-Specific Population Mixtures from Linked Genome-Wide Allele Frequencies. Mol Biol Evol 2024; 41:msae137. [PMID: 38958167 PMCID: PMC11255385 DOI: 10.1093/molbev/msae137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 06/26/2024] [Accepted: 06/27/2024] [Indexed: 07/04/2024] Open
Abstract
Admixture between populations and species is common in nature. Since the influx of new genetic material might be either facilitated or hindered by selection, variation in mixture proportions along the genome is expected in organisms undergoing recombination. Various graph-based models have been developed to better understand these evolutionary dynamics of population splits and mixtures. However, current models assume a single mixture rate for the entire genome and do not explicitly account for linkage. Here, we introduce TreeSwirl, a novel method for inferring branch lengths and locus-specific mixture proportions by using genome-wide allele frequency data, assuming that the admixture graph is known or has been inferred. TreeSwirl builds upon TreeMix that uses Gaussian processes to estimate the presence of gene flow between diverged populations. However, in contrast to TreeMix, our model infers locus-specific mixture proportions employing a hidden Markov model that accounts for linkage. Through simulated data, we demonstrate that TreeSwirl can accurately estimate locus-specific mixture proportions and handle complex demographic scenarios. It also outperforms related D- and f-statistics in terms of accuracy and sensitivity to detect introgressed loci.
Collapse
Affiliation(s)
- Carlos S Reyna-Blanco
- Department of Biology, University of Fribourg, Fribourg 1700, Switzerland
- Swiss Institute of Bioinformatics, Fribourg 1700, Switzerland
| | - Madleina Caduff
- Department of Biology, University of Fribourg, Fribourg 1700, Switzerland
- Swiss Institute of Bioinformatics, Fribourg 1700, Switzerland
| | - Marco Galimberti
- Department of Biology, University of Fribourg, Fribourg 1700, Switzerland
- Swiss Institute of Bioinformatics, Fribourg 1700, Switzerland
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA
- Veterans Affairs Connecticut Healthcare System, West Haven, CT, USA
| | | | - Daniel Wegmann
- Department of Biology, University of Fribourg, Fribourg 1700, Switzerland
- Swiss Institute of Bioinformatics, Fribourg 1700, Switzerland
| |
Collapse
|
6
|
Ray DD, Flagel L, Schrider DR. IntroUNET: Identifying introgressed alleles via semantic segmentation. PLoS Genet 2024; 20:e1010657. [PMID: 38377104 PMCID: PMC10906877 DOI: 10.1371/journal.pgen.1010657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 03/01/2024] [Accepted: 01/29/2024] [Indexed: 02/22/2024] Open
Abstract
A growing body of evidence suggests that gene flow between closely related species is a widespread phenomenon. Alleles that introgress from one species into a close relative are typically neutral or deleterious, but sometimes confer a significant fitness advantage. Given the potential relevance to speciation and adaptation, numerous methods have therefore been devised to identify regions of the genome that have experienced introgression. Recently, supervised machine learning approaches have been shown to be highly effective for detecting introgression. One especially promising approach is to treat population genetic inference as an image classification problem, and feed an image representation of a population genetic alignment as input to a deep neural network that distinguishes among evolutionary models (i.e. introgression or no introgression). However, if we wish to investigate the full extent and fitness effects of introgression, merely identifying genomic regions in a population genetic alignment that harbor introgressed loci is insufficient-ideally we would be able to infer precisely which individuals have introgressed material and at which positions in the genome. Here we adapt a deep learning algorithm for semantic segmentation, the task of correctly identifying the type of object to which each individual pixel in an image belongs, to the task of identifying introgressed alleles. Our trained neural network is thus able to infer, for each individual in a two-population alignment, which of those individual's alleles were introgressed from the other population. We use simulated data to show that this approach is highly accurate, and that it can be readily extended to identify alleles that are introgressed from an unsampled "ghost" population, performing comparably to a supervised learning method tailored specifically to that task. Finally, we apply this method to data from Drosophila, showing that it is able to accurately recover introgressed haplotypes from real data. This analysis reveals that introgressed alleles are typically confined to lower frequencies within genic regions, suggestive of purifying selection, but are found at much higher frequencies in a region previously shown to be affected by adaptive introgression. Our method's success in recovering introgressed haplotypes in challenging real-world scenarios underscores the utility of deep learning approaches for making richer evolutionary inferences from genomic data.
Collapse
Affiliation(s)
- Dylan D. Ray
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Lex Flagel
- Division of Data Science, Gencove Inc., New York, New York, United States of America
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - Daniel R. Schrider
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| |
Collapse
|
7
|
Ray DD, Flagel L, Schrider DR. IntroUNET: identifying introgressed alleles via semantic segmentation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.02.07.527435. [PMID: 36865105 PMCID: PMC9979274 DOI: 10.1101/2023.02.07.527435] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
A growing body of evidence suggests that gene flow between closely related species is a widespread phenomenon. Alleles that introgress from one species into a close relative are typically neutral or deleterious, but sometimes confer a significant fitness advantage. Given the potential relevance to speciation and adaptation, numerous methods have therefore been devised to identify regions of the genome that have experienced introgression. Recently, supervised machine learning approaches have been shown to be highly effective for detecting introgression. One especially promising approach is to treat population genetic inference as an image classification problem, and feed an image representation of a population genetic alignment as input to a deep neural network that distinguishes among evolutionary models (i.e. introgression or no introgression). However, if we wish to investigate the full extent and fitness effects of introgression, merely identifying genomic regions in a population genetic alignment that harbor introgressed loci is insufficient-ideally we would be able to infer precisely which individuals have introgressed material and at which positions in the genome. Here we adapt a deep learning algorithm for semantic segmentation, the task of correctly identifying the type of object to which each individual pixel in an image belongs, to the task of identifying introgressed alleles. Our trained neural network is thus able to infer, for each individual in a two-population alignment, which of those individual's alleles were introgressed from the other population. We use simulated data to show that this approach is highly accurate, and that it can be readily extended to identify alleles that are introgressed from an unsampled "ghost" population, performing comparably to a supervised learning method tailored specifically to that task. Finally, we apply this method to data from Drosophila, showing that it is able to accurately recover introgressed haplotypes from real data. This analysis reveals that introgressed alleles are typically confined to lower frequencies within genic regions, suggestive of purifying selection, but are found at much higher frequencies in a region previously shown to be affected by adaptive introgression. Our method's success in recovering introgressed haplotypes in challenging real-world scenarios underscores the utility of deep learning approaches for making richer evolutionary inferences from genomic data.
Collapse
Affiliation(s)
- Dylan D. Ray
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Lex Flagel
- Division of Data Science, Gencove Inc., New York, NY 11101, USA
- Department of Plant and Microbial Biology, University of Minnesota, St Paul MN, 55108, USA
| | - Daniel R. Schrider
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
8
|
Huang X, Rymbekova A, Dolgova O, Lao O, Kuhlwilm M. Harnessing deep learning for population genetic inference. Nat Rev Genet 2024; 25:61-78. [PMID: 37666948 DOI: 10.1038/s41576-023-00636-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/11/2023] [Indexed: 09/06/2023]
Abstract
In population genetics, the emergence of large-scale genomic data for various species and populations has provided new opportunities to understand the evolutionary forces that drive genetic diversity using statistical inference. However, the era of population genomics presents new challenges in analysing the massive amounts of genomes and variants. Deep learning has demonstrated state-of-the-art performance for numerous applications involving large-scale data. Recently, deep learning approaches have gained popularity in population genetics; facilitated by the advent of massive genomic data sets, powerful computational hardware and complex deep learning architectures, they have been used to identify population structure, infer demographic history and investigate natural selection. Here, we introduce common deep learning architectures and provide comprehensive guidelines for implementing deep learning models for population genetic inference. We also discuss current challenges and future directions for applying deep learning in population genetics, focusing on efficiency, robustness and interpretability.
Collapse
Affiliation(s)
- Xin Huang
- Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria.
- Human Evolution and Archaeological Sciences (HEAS), University of Vienna, Vienna, Austria.
| | - Aigerim Rymbekova
- Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria
- Human Evolution and Archaeological Sciences (HEAS), University of Vienna, Vienna, Austria
| | - Olga Dolgova
- Integrative Genomics Laboratory, CIC bioGUNE - Centro de Investigación Cooperativa en Biociencias, Derio, Biscaya, Spain
| | - Oscar Lao
- Institute of Evolutionary Biology, CSIC-Universitat Pompeu Fabra, Barcelona, Spain.
| | - Martin Kuhlwilm
- Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria.
- Human Evolution and Archaeological Sciences (HEAS), University of Vienna, Vienna, Austria.
| |
Collapse
|
9
|
Urnikyte A, Masiulyte A, Pranckeniene L, Kučinskas V. Disentangling archaic introgression and genomic signatures of selection at human immunity genes. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2023; 116:105528. [PMID: 37977419 DOI: 10.1016/j.meegid.2023.105528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 11/04/2023] [Accepted: 11/14/2023] [Indexed: 11/19/2023]
Abstract
Pathogens and infectious diseases have imposed exceptionally strong selective pressure on ancient and modern human genomes and contributed to the current variation in many genes. There is evidence that modern humans acquired immune variants through interbreeding with ancient hominins, but the impact of such variants on human traits is not fully understood. The main objectives of this research were to infer the genetic signatures of positive selection that may be involved in adaptation to infectious diseases and to investigate the function of Neanderthal alleles identified within a set of 50 Lithuanian genomes. Introgressed regions were identified using the machine learning tool ArchIE. Recent positive selection signatures were analysed using iHS. We detected high-scoring signals of positive selection at innate immunity genes (EMB, PARP8, HLAC, and CDSN) and evaluated their interactions with the structural proteins of pathogens. Interactions with human immunodeficiency virus (HIV) 1 and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) were identified. Overall, genomic regions introgressed from Neanderthals were shown to be enriched in genes related to immunity, keratinocyte differentiation, and sensory perception.
Collapse
Affiliation(s)
- Alina Urnikyte
- Faculty of Medicine, Department of Human and Medical Genetics, Institute of Biomedical Sciences, Vilnius University, Santariskiu Street 2, Vilnius LT-08661, Lithuania.
| | - Abigaile Masiulyte
- Faculty of Medicine, Department of Human and Medical Genetics, Institute of Biomedical Sciences, Vilnius University, Santariskiu Street 2, Vilnius LT-08661, Lithuania
| | - Laura Pranckeniene
- Faculty of Medicine, Department of Human and Medical Genetics, Institute of Biomedical Sciences, Vilnius University, Santariskiu Street 2, Vilnius LT-08661, Lithuania.
| | - Vaidutis Kučinskas
- Faculty of Medicine, Department of Human and Medical Genetics, Institute of Biomedical Sciences, Vilnius University, Santariskiu Street 2, Vilnius LT-08661, Lithuania.
| |
Collapse
|
10
|
Kinneberg VB, Lü DS, Peris D, Ravinet M, Skrede I. Introgression between highly divergent fungal sister species. J Evol Biol 2023; 36:1133-1149. [PMID: 37363874 DOI: 10.1111/jeb.14190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2022] [Revised: 05/15/2023] [Accepted: 05/17/2023] [Indexed: 06/28/2023]
Abstract
To understand how species evolve and adapt to changing environments, it is important to study gene flow and introgression due to their influence on speciation and radiation events. Here, we apply a novel experimental system for investigating these mechanisms using natural populations. The system is based on two fungal sister species with morphological and ecological similarities occurring in overlapping habitats. We examined introgression between these species by conducting whole genome sequencing of individuals from populations in North America and Europe. We assessed genome-wide nucleotide divergence and performed crossing experiments to study reproductive barriers. We further used ABBA-BABA statistics together with a network analysis to investigate introgression, and conducted demographic modelling to gain insight into divergence times and introgression events. The results revealed that the species are highly divergent and incompatible in vitro. Despite this, small regions of introgression were scattered throughout the genomes and one introgression event likely involves a ghost population (extant or extinct). This study demonstrates that introgression can be found among divergent species and that population histories can be studied without collections of all the populations involved. Moreover, the experimental system is shown to be a useful tool for research on reproductive isolation in natural populations.
Collapse
Affiliation(s)
- Vilde Bruhn Kinneberg
- Section for Genetics and Evolutionary Biology, Department of Biosciences, University of Oslo, Oslo, Norway
- Evolution and Paleobiology, Natural History Museum, University of Oslo, Oslo, Norway
| | - Dabao Sun Lü
- Section for Genetics and Evolutionary Biology, Department of Biosciences, University of Oslo, Oslo, Norway
| | - David Peris
- Section for Genetics and Evolutionary Biology, Department of Biosciences, University of Oslo, Oslo, Norway
- Department of Food Biotechnology, Institute of Agrochemistry and Food Technology (IATA), CSIC, Valencia, Spain
| | - Mark Ravinet
- School of Life Sciences, University of Nottingham, Nottingham, UK
| | - Inger Skrede
- Section for Genetics and Evolutionary Biology, Department of Biosciences, University of Oslo, Oslo, Norway
| |
Collapse
|
11
|
Scerri EML. One species, many roots? Nat Ecol Evol 2023; 7:975-976. [PMID: 37198291 DOI: 10.1038/s41559-023-02080-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Affiliation(s)
- Eleanor M L Scerri
- Pan-African Evolution Research Group, Max Planck Institute of Geoanthropology, Jena, Germany.
- Department of Prehistory, University of Cologne, Cologne, Germany.
- Department of Classics and Archaeology, University of Malta, Msida, Malta.
| |
Collapse
|
12
|
Zhang X, Kim B, Singh A, Sankararaman S, Durvasula A, Lohmueller KE. MaLAdapt Reveals Novel Targets of Adaptive Introgression From Neanderthals and Denisovans in Worldwide Human Populations. Mol Biol Evol 2023; 40:msad001. [PMID: 36617238 PMCID: PMC9887621 DOI: 10.1093/molbev/msad001] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 12/25/2022] [Accepted: 12/28/2022] [Indexed: 01/09/2023] Open
Abstract
Adaptive introgression (AI) facilitates local adaptation in a wide range of species. Many state-of-the-art methods detect AI with ad-hoc approaches that identify summary statistic outliers or intersect scans for positive selection with scans for introgressed genomic regions. Although widely used, approaches intersecting outliers are vulnerable to a high false-negative rate as the power of different methods varies, especially for complex introgression events. Moreover, population genetic processes unrelated to AI, such as background selection or heterosis, may create similar genomic signals to AI, compromising the reliability of methods that rely on neutral null distributions. In recent years, machine learning (ML) methods have been increasingly applied to population genetic questions. Here, we present a ML-based method called MaLAdapt for identifying AI loci from genome-wide sequencing data. Using an Extra-Trees Classifier algorithm, our method combines information from a large number of biologically meaningful summary statistics to capture a powerful composite signature of AI across the genome. In contrast to existing methods, MaLAdapt is especially well-powered to detect AI with mild beneficial effects, including selection on standing archaic variation, and is robust to non-AI selective sweeps, heterosis from deleterious mutations, and demographic misspecification. Furthermore, MaLAdapt outperforms existing methods for detecting AI based on the analysis of simulated data and the validation of empirical signals through visual inspection of haplotype patterns. We apply MaLAdapt to the 1000 Genomes Project human genomic data and discover novel AI candidate regions in non-African populations, including genes that are enriched in functionally important biological pathways regulating metabolism and immune responses.
Collapse
Affiliation(s)
- Xinjun Zhang
- Department of Ecology and Evolutionary Biology, UCLA, Los Angeles, CA
| | - Bernard Kim
- Department of Biology, Stanford University, Palo Alto, CA
| | - Armaan Singh
- Department of Computer Science, UCLA, Los Angeles, CA
| | - Sriram Sankararaman
- Department of Computer Science, UCLA, Los Angeles, CA
- Department of Computational Medicine, UCLA, Los Angeles, CA
- Department of Human Genetics, David Geffen School of Medicine, UCLA, Los Angeles, CA
| | - Arun Durvasula
- Department of Genetics, Harvard Medical School, Boston, MA
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, UCLA, Los Angeles, CA
- Department of Human Genetics, David Geffen School of Medicine, UCLA, Los Angeles, CA
| |
Collapse
|
13
|
Koller D, Wendt FR, Pathak GA, De Lillo A, De Angelis F, Cabrera-Mendoza B, Tucci S, Polimanti R. Denisovan and Neanderthal archaic introgression differentially impacted the genetics of complex traits in modern populations. BMC Biol 2022; 20:249. [PMID: 36344982 PMCID: PMC9641937 DOI: 10.1186/s12915-022-01449-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 10/24/2022] [Indexed: 11/09/2022] Open
Abstract
BACKGROUND Introgression from extinct Neanderthal and Denisovan human species has been shown to contribute to the genetic pool of modern human populations and their phenotypic spectrum. Evidence of how Neanderthal introgression shaped the genetics of human traits and diseases has been extensively studied in populations of European descent, with signatures of admixture reported for instance in genes associated with pigmentation, immunity, and metabolic traits. However, limited information is currently available about the impact of archaic introgression on other ancestry groups. Additionally, to date, no study has been conducted with respect to the impact of Denisovan introgression on the health and disease of modern populations. Here, we compare the way evolutionary pressures shaped the genetics of complex traits in East Asian and European populations, and provide evidence of the impact of Denisovan introgression on the health of East Asian and Central/South Asian populations. RESULTS Leveraging genome-wide association statistics from the Biobank Japan and UK Biobank, we assessed whether Denisovan and Neanderthal introgression together with other evolutionary genomic signatures were enriched for the heritability of physiological and pathological conditions in populations of East Asian and European descent. In EAS, Denisovan-introgressed loci were enriched for coronary artery disease heritability (1.69-fold enrichment, p=0.003). No enrichment for archaic introgression was observed in EUR. We also performed a phenome-wide association study of Denisovan and Neanderthal alleles in six ancestry groups available in the UK Biobank. In EAS, the Denisovan-introgressed SNP rs62391664 in the major histocompatibility complex region was associated with albumin/globulin ratio (beta=-0.17, p=3.57×10-7). Neanderthal-introgressed alleles were associated with psychiatric and cognitive traits in EAS (e.g., "No Bipolar or Depression"-rs79043717 beta=-1.5, p=1.1×10-7), and with blood biomarkers (e.g., alkaline phosphatase-rs11244089 beta=0.1, p=3.69×10-116) and red hair color (rs60733936 beta=-0.86, p=4.49×10-165) in EUR. In the other ancestry groups, Neanderthal alleles were associated with several traits, also including the use of certain medications (e.g., Central/South East Asia: indapamide - rs732632 beta=-2.38, p=5.22×10-7). CONCLUSIONS Our study provides novel evidence regarding the impact of archaic introgression on the genetics of complex traits in worldwide populations, highlighting the specific contribution of Denisovan introgression in EAS populations.
Collapse
Affiliation(s)
- Dora Koller
- Department of Psychiatry, Yale University School of Medicine, West Haven, CT, 06516, USA
- VA CT Healthcare Center, West Haven, CT, 06516, USA
- Department of Genetics, Microbiology and Statistics, Faculty of Biology, University of Barcelona, Barcelona, Catalonia, 08028, Spain
| | - Frank R Wendt
- Department of Psychiatry, Yale University School of Medicine, West Haven, CT, 06516, USA
- VA CT Healthcare Center, West Haven, CT, 06516, USA
| | - Gita A Pathak
- Department of Psychiatry, Yale University School of Medicine, West Haven, CT, 06516, USA
- VA CT Healthcare Center, West Haven, CT, 06516, USA
| | - Antonella De Lillo
- Department of Psychiatry, Yale University School of Medicine, West Haven, CT, 06516, USA
- Department of Biology, University of Rome "Tor Vergata", Rome, 00133, Italy
| | - Flavio De Angelis
- Department of Psychiatry, Yale University School of Medicine, West Haven, CT, 06516, USA
- VA CT Healthcare Center, West Haven, CT, 06516, USA
- Department of Biology, University of Rome "Tor Vergata", Rome, 00133, Italy
| | - Brenda Cabrera-Mendoza
- Department of Psychiatry, Yale University School of Medicine, West Haven, CT, 06516, USA
- VA CT Healthcare Center, West Haven, CT, 06516, USA
| | - Serena Tucci
- Department of Anthropology, Yale University, New Haven, CT, 06511, USA
| | - Renato Polimanti
- Department of Psychiatry, Yale University School of Medicine, West Haven, CT, 06516, USA.
- VA CT Healthcare Center, West Haven, CT, 06516, USA.
| |
Collapse
|
14
|
Reilly PF, Tjahjadi A, Miller SL, Akey JM, Tucci S. The contribution of Neanderthal introgression to modern human traits. Curr Biol 2022; 32:R970-R983. [PMID: 36167050 PMCID: PMC9741939 DOI: 10.1016/j.cub.2022.08.027] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Neanderthals, our closest extinct relatives, lived in western Eurasia from 400,000 years ago until they went extinct around 40,000 years ago. DNA retrieved from ancient specimens revealed that Neanderthals mated with modern human contemporaries. As a consequence, introgressed Neanderthal DNA survives scattered across the human genome such that 1-4% of the genome of present-day people outside Africa are inherited from Neanderthal ancestors. Patterns of Neanderthal introgressed genomic sequences suggest that Neanderthal alleles had distinct fates in the modern human genetic background. Some Neanderthal alleles facilitated human adaptation to new environments such as novel climate conditions, UV exposure levels and pathogens, while others had deleterious consequences. Here, we review the body of work on Neanderthal introgression over the past decade. We describe how evolutionary forces shaped the genomic landscape of Neanderthal introgression and highlight the impact of introgressed alleles on human biology and phenotypic variation.
Collapse
Affiliation(s)
| | - Audrey Tjahjadi
- Department of Anthropology, Yale University, New Haven, CT, USA
| | | | - Joshua M Akey
- Lewis Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA.
| | - Serena Tucci
- Department of Anthropology, Yale University, New Haven, CT, USA; Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA.
| |
Collapse
|
15
|
Simonin-Wilmer I, Orozco-del-Pino P, Bishop DT, Iles MM, Robles-Espinoza CD. An Overview of Strategies for Detecting Genotype-Phenotype Associations Across Ancestrally Diverse Populations. Front Genet 2021; 12:703901. [PMID: 34804113 PMCID: PMC8602802 DOI: 10.3389/fgene.2021.703901] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Accepted: 10/14/2021] [Indexed: 11/13/2022] Open
Abstract
Genome-wide association studies (GWAS) have been very successful at identifying genetic variants influencing a large number of traits. Although the great majority of these studies have been performed in European-descent individuals, it has been recognised that including populations with differing ancestries enhances the potential for identifying causal SNPs due to their differing patterns of linkage disequilibrium. However, when individuals from distinct ethnicities are included in a GWAS, it is necessary to implement a number of control steps to ensure that the identified associations are real genotype-phenotype relationships. In this Review, we discuss the analyses that are required when performing multi-ethnic studies, including methods for determining ancestry at the global and local level for sample exclusion, controlling for ancestry in association testing, and post-GWAS interrogation methods such as genomic control and meta-analysis. We hope that this overview provides a primer for those researchers interested in including distinct populations in their studies.
Collapse
Affiliation(s)
- Irving Simonin-Wilmer
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Campus Juriquilla, Queretaro, Mexico
| | | | - D. Timothy Bishop
- Leeds Institute for Data Analytics and Leeds Institute of Medical Research at St. James’s, University of Leeds, Leeds, United Kingdom
| | - Mark M. Iles
- Leeds Institute for Data Analytics and Leeds Institute of Medical Research at St. James’s, University of Leeds, Leeds, United Kingdom
| | - Carla Daniela Robles-Espinoza
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Campus Juriquilla, Queretaro, Mexico
- Wellcome Sanger Institute, Hinxton, Cambridge, United Kingdom
| |
Collapse
|
16
|
Pathak GA, Wendt FR, Goswami A, Koller D, De Angelis F, COVID-19 Host Genetics Initiative, Polimanti R. ACE2 Netlas: In silico Functional Characterization and Drug-Gene Interactions of ACE2 Gene Network to Understand Its Potential Involvement in COVID-19 Susceptibility. Front Genet 2021; 12:698033. [PMID: 34512723 PMCID: PMC8429844 DOI: 10.3389/fgene.2021.698033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Accepted: 07/29/2021] [Indexed: 12/15/2022] Open
Abstract
Angiotensin-converting enzyme-2 (ACE2) receptor has been identified as the key adhesion molecule for the transmission of the SARS-CoV-2. However, there is no evidence that human genetic variation in ACE2 is singularly responsible for COVID-19 susceptibility. Therefore, we performed an integrative multi-level characterization of genes that interact with ACE2 (ACE2-gene network) for their statistically enriched biological properties in the context of COVID-19. The phenome-wide association of 51 genes including ACE2 with 4,756 traits categorized into 26 phenotype categories, showed enrichment of immunological, respiratory, environmental, skeletal, dermatological, and metabolic domains (p < 4e-4). Transcriptomic regulation of ACE2-gene network was enriched for tissue-specificity in kidney, small intestine, and colon (p < 4.7e-4). Leveraging the drug-gene interaction database we identified 47 drugs, including dexamethasone and spironolactone, among others. Considering genetic variants within ± 10 kb of ACE2-network genes we identified miRNAs whose binding sites may be altered as a consequence of genetic variation. The identified miRNAs revealed statistical over-representation of inflammation, aging, diabetes, and heart conditions. The genetic variant associations in RORA, SLC12A6, and SLC6A19 genes were observed in genome-wide association study (GWAS) of COVID-19 susceptibility. We also report the GWAS-identified variant in 3p21.31 locus, serves as trans-QTL for RORA and RORC genes. Overall, functional characterization of ACE2-gene network highlights several potential mechanisms in COVID-19 susceptibility. The data can also be accessed at https://gpwhiz.github.io/ACE2Netlas/.
Collapse
Affiliation(s)
- Gita A. Pathak
- Division of Human Genetics, Department of Psychiatry, Yale School of Medicine, New Haven, CT, United States
- Veteran Affairs Connecticut Healthcare System, West Haven, CT, United States
| | - Frank R. Wendt
- Division of Human Genetics, Department of Psychiatry, Yale School of Medicine, New Haven, CT, United States
- Veteran Affairs Connecticut Healthcare System, West Haven, CT, United States
| | - Aranyak Goswami
- Division of Human Genetics, Department of Psychiatry, Yale School of Medicine, New Haven, CT, United States
- Veteran Affairs Connecticut Healthcare System, West Haven, CT, United States
| | - Dora Koller
- Division of Human Genetics, Department of Psychiatry, Yale School of Medicine, New Haven, CT, United States
- Veteran Affairs Connecticut Healthcare System, West Haven, CT, United States
| | - Flavio De Angelis
- Division of Human Genetics, Department of Psychiatry, Yale School of Medicine, New Haven, CT, United States
- Veteran Affairs Connecticut Healthcare System, West Haven, CT, United States
| | | | - Renato Polimanti
- Division of Human Genetics, Department of Psychiatry, Yale School of Medicine, New Haven, CT, United States
- Veteran Affairs Connecticut Healthcare System, West Haven, CT, United States
| |
Collapse
|
17
|
Ahlquist KD, Bañuelos MM, Funk A, Lai J, Rong S, Villanea FA, Witt KE. Our Tangled Family Tree: New Genomic Methods Offer Insight into the Legacy of Archaic Admixture. Genome Biol Evol 2021; 13:evab115. [PMID: 34028527 PMCID: PMC8480178 DOI: 10.1093/gbe/evab115] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Revised: 05/07/2021] [Accepted: 05/22/2021] [Indexed: 11/30/2022] Open
Abstract
The archaic ancestry present in the human genome has captured the imagination of both scientists and the wider public in recent years. This excitement is the result of new studies pushing the envelope of what we can learn from the archaic genetic information that has survived for over 50,000 years in the human genome. Here, we review the most recent ten years of literature on the topic of archaic introgression, including the current state of knowledge on Neanderthal and Denisovan introgression, as well as introgression from other as-yet unidentified archaic populations. We focus this review on four topics: 1) a reimagining of human demographic history, including evidence for multiple admixture events between modern humans, Neanderthals, Denisovans, and other archaic populations; 2) state-of-the-art methods for detecting archaic ancestry in population-level genomic data; 3) how these novel methods can detect archaic introgression in modern African populations; and 4) the functional consequences of archaic gene variants, including how those variants were co-opted into novel function in modern human populations. The goal of this review is to provide a simple-to-access reference for the relevant methods and novel data, which has changed our understanding of the relationship between our species and its siblings. This body of literature reveals the large degree to which the genetic legacy of these extinct hominins has been integrated into the human populations of today.
Collapse
Affiliation(s)
- K D Ahlquist
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, Rhode Island, USA
| | - Mayra M Bañuelos
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, Rhode Island, USA
| | - Alyssa Funk
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, Rhode Island, USA
| | - Jiaying Lai
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Brown Center for Biomedical Informatics, Brown University, Providence, Rhode Island, USA
| | - Stephen Rong
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, Rhode Island, USA
| | - Fernando A Villanea
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Anthropology, University of Colorado Boulder, Colorado, USA
| | - Kelsey E Witt
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Ecology and Evolutionary Biology, Brown University, Providence, Rhode Island, USA
| |
Collapse
|
18
|
Gower G, Picazo PI, Fumagalli M, Racimo F. Detecting adaptive introgression in human evolution using convolutional neural networks. eLife 2021; 10:64669. [PMID: 34032215 PMCID: PMC8192126 DOI: 10.7554/elife.64669] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 05/24/2021] [Indexed: 01/10/2023] Open
Abstract
Studies in a variety of species have shown evidence for positively selected variants introduced into a population via introgression from another, distantly related population—a process known as adaptive introgression. However, there are few explicit frameworks for jointly modelling introgression and positive selection, in order to detect these variants using genomic sequence data. Here, we develop an approach based on convolutional neural networks (CNNs). CNNs do not require the specification of an analytical model of allele frequency dynamics and have outperformed alternative methods for classification and parameter estimation tasks in various areas of population genetics. Thus, they are potentially well suited to the identification of adaptive introgression. Using simulations, we trained CNNs on genotype matrices derived from genomes sampled from the donor population, the recipient population and a related non-introgressed population, in order to distinguish regions of the genome evolving under adaptive introgression from those evolving neutrally or experiencing selective sweeps. Our CNN architecture exhibits 95% accuracy on simulated data, even when the genomes are unphased, and accuracy decreases only moderately in the presence of heterosis. As a proof of concept, we applied our trained CNNs to human genomic datasets—both phased and unphased—to detect candidates for adaptive introgression that shaped our evolutionary history.
Collapse
Affiliation(s)
- Graham Gower
- Lundbeck GeoGenetics Centre, Globe Institute, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Pablo Iáñez Picazo
- Lundbeck GeoGenetics Centre, Globe Institute, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Matteo Fumagalli
- Department of Life Sciences, Silwood Park Campus, Imperial College London, London, United Kingdom
| | - Fernando Racimo
- Lundbeck GeoGenetics Centre, Globe Institute, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
19
|
Gopalan S, Atkinson EG, Buck LT, Weaver TD, Henn BM. Inferring archaic introgression from hominin genetic data. Evol Anthropol 2021; 30:199-220. [PMID: 33951239 PMCID: PMC8360192 DOI: 10.1002/evan.21895] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Revised: 08/03/2020] [Accepted: 03/29/2021] [Indexed: 01/05/2023]
Abstract
Questions surrounding the timing, extent, and evolutionary consequences of archaic admixture into human populations have a long history in evolutionary anthropology. More recently, advances in human genetics, particularly in the field of ancient DNA, have shed new light on the question of whether or not Homo sapiens interbred with other hominin groups. By the late 1990s, published genetic work had largely concluded that archaic groups made no lasting genetic contribution to modern humans; less than a decade later, this conclusion was reversed following the successful DNA sequencing of an ancient Neanderthal. This reversal of consensus is noteworthy, but the reasoning behind it is not widely understood across all academic communities. There remains a communication gap between population geneticists and paleoanthropologists. In this review, we endeavor to bridge this gap by outlining how technological advancements, new statistical methods, and notable controversies ultimately led to the current consensus.
Collapse
Affiliation(s)
- Shyamalika Gopalan
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York, USA.,Department of Evolutionary Anthropology, Duke University, Durham, North Carolina, USA
| | - Elizabeth G Atkinson
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York, USA.,Analytic and Translational Genetics Unit, Massachusetts General Hospital and Stanley Center for Psychiatric Research, Broad Institute, Boston, Massachusetts, USA
| | - Laura T Buck
- Research Centre in Evolutionary Anthropology and Palaeoecology, Liverpool John Moores University, Liverpool, UK
| | - Timothy D Weaver
- Department of Anthropology, University of California, Davis, California, USA
| | - Brenna M Henn
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York, USA.,Department of Anthropology, University of California, Davis, California, USA.,UC Davis Genome Center, University of California, Davis, California, USA
| |
Collapse
|
20
|
Abstract
Throughout human history, large-scale migrations have facilitated the formation of populations with ancestry from multiple previously separated populations. This process leads to subsequent shuffling of genetic ancestry through recombination, producing variation in ancestry between populations, among individuals in a population, and along the genome within an individual. Recent methodological and empirical developments have elucidated the genomic signatures of this admixture process, bringing previously understudied admixed populations to the forefront of population and medical genetics. Under this theme, we present a collection of recent PLOS Genetics publications that exemplify recent progress in human genetic admixture studies, and we discuss potential areas for future work.
Collapse
Affiliation(s)
- Katharine L. Korunes
- Department of Evolutionary Anthropology, Duke University, Durham, North Carolina, United States of America
| | - Amy Goldberg
- Department of Evolutionary Anthropology, Duke University, Durham, North Carolina, United States of America
| |
Collapse
|
21
|
Wendt FR, Pathak GA, Overstreet C, Tylee DS, Gelernter J, Atkinson EG, Polimanti R. Characterizing the effect of background selection on the polygenicity of brain-related traits. Genomics 2021; 113:111-119. [PMID: 33278486 PMCID: PMC7855394 DOI: 10.1016/j.ygeno.2020.11.032] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Revised: 11/20/2020] [Accepted: 11/30/2020] [Indexed: 01/10/2023]
Abstract
BACKGROUND Genome-wide association studies (GWAS) have demonstrated that psychopathology phenotypes are affected by many risk alleles with small effect (polygenicity). It is unclear how ubiquitously evolutionary pressures influence the genetic architecture of these traits. METHODS We partitioned SNP heritability to assess the contribution of background (BGS) and positive selection, Neanderthal local ancestry, functional significance, and genotype networks in 75 brain-related traits (8411 ≤ N ≤ 1,131,181, mean N = 205,289). We applied binary annotations by dichotomizing each measure based on top 2%, 1%, and 0.5% of all scores genome-wide. Effect size distribution features were calculated using GENESIS. We tested the relationship between effect size distribution descriptive statistics and natural selection. In a subset of traits, we explore the inclusion of diagnostic heterogeneity (e.g., number of diagnostic combinations and total symptoms) in the tested relationship. RESULTS SNP-heritability was enriched (false discovery rate q < 0.05) for loci with elevated BGS (7 phenotypes) and in genic (34 phenotypes) and loss-of-function (LoF)-intolerant regions (67 phenotypes). These effects were strongest in GWAS of schizophrenia (1.90-fold BGS, 1.16-fold genic, and 1.92-fold LoF), educational attainment (1.86-fold BGS, 1.12-fold genic, and 1.79-fold LoF), and cognitive performance (2.29-fold BGS, 1.12-fold genic, and 1.79-fold LoF). BGS (top 2%) significantly predicted effect size variance for trait-associated loci (σ2 parameter) in 75 brain-related traits (β = 4.39 × 10-5, p = 1.43 × 10-5, model r2 = 0.548). Considering the number of DSM-5 diagnostic combinations per psychiatric disorder improved model fit (σ2 ~ BTop2% × Genic × diagnostic combinations; model r2 = 0.661). CONCLUSIONS Brain-related phenotypes with larger variance in risk locus effect sizes are associated with loci under BGS. We show exploratory results suggesting that diagnostic complexity may also contribute to the increased polygenicity of psychiatric disorders.
Collapse
Affiliation(s)
- Frank R Wendt
- Department of Psychiatry, Yale School of Medicine and VA CT Healthcare System, West Haven, CT 06516, USA
| | - Gita A Pathak
- Department of Psychiatry, Yale School of Medicine and VA CT Healthcare System, West Haven, CT 06516, USA
| | - Cassie Overstreet
- National Center for Posttraumatic Stress Disorder, Clinical Neurosciences Division, VA CT Healthcare System and Department of Psychiatry, Yale University School of Medicine, USA
| | - Daniel S Tylee
- Department of Psychiatry, Yale School of Medicine and VA CT Healthcare System, West Haven, CT 06516, USA
| | - Joel Gelernter
- Department of Psychiatry, Yale School of Medicine and VA CT Healthcare System, West Haven, CT 06516, USA; Departments of Genetics and Neuroscience, Yale University School of Medicine, New Haven, CT 06510, USA
| | - Elizabeth G Atkinson
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Renato Polimanti
- Department of Psychiatry, Yale School of Medicine and VA CT Healthcare System, West Haven, CT 06516, USA.
| |
Collapse
|
22
|
Pathak GA, Wendt FR, Goswami A, Angelis FD, Polimanti R. ACE2 Netlas: In-silico functional characterization and drug-gene interactions of ACE2 gene network to understand its potential involvement in COVID-19 susceptibility. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2020:2020.10.27.20220665. [PMID: 33140059 PMCID: PMC7605570 DOI: 10.1101/2020.10.27.20220665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Angiotensin-converting enzyme-2 ( ACE2 ) receptor has been identified as the key adhesion molecule for the transmission of the SARS-CoV-2. However, there is no evidence that human genetic variation in ACE2 is singularly responsible for COVID-19 susceptibility. Therefore, we performed a multi-level characterization of genes that interact with ACE2 (ACE2-gene network) for their over-represented biological properties in the context of COVID-19. The phenome-wide association of 51 genes including ACE2 with 4,756 traits categorized into 26 phenotype categories, showed enrichment of immunological, respiratory, environmental, skeletal, dermatological, and metabolic domains (p<4e-4). Transcriptomic regulation of ACE2-gene network was enriched for tissue-specificity in kidney, small intestine, and colon (p<4.7e-4). Leveraging the drug-gene interaction database we identified 47 drugs, including dexamethasone and spironolactone, among others. Considering genetic variants within ± 10 kb of ACE2-network genes we characterized functional consequences (among others) using miRNA binding-site targets. MiRNAs affected by ACE2-network variants revealed statistical over-representation of inflammation, aging, diabetes, and heart conditions. With respect to variants mapped to the ACE2-network, we observed COVID-19 related associations in RORA, SLC12A6 and SLC6A19 genes. Overall, functional characterization of ACE2-gene network highlights several potential mechanisms in COVID-19 susceptibility. The data can also be accessed at https://gpwhiz.github.io/ACE2Netlas/.
Collapse
Affiliation(s)
- Gita A Pathak
- Yale School of Medicine, Department of Psychiatry, Division of Human Genetics, New Haven, CT Veteran Affairs Connecticut Healthcare System, West Haven, CT
| | - Frank R Wendt
- Yale School of Medicine, Department of Psychiatry, Division of Human Genetics, New Haven, CT Veteran Affairs Connecticut Healthcare System, West Haven, CT
| | - Aranyak Goswami
- Yale School of Medicine, Department of Psychiatry, Division of Human Genetics, New Haven, CT Veteran Affairs Connecticut Healthcare System, West Haven, CT
| | - Flavio De Angelis
- Yale School of Medicine, Department of Psychiatry, Division of Human Genetics, New Haven, CT Veteran Affairs Connecticut Healthcare System, West Haven, CT
| | - Renato Polimanti
- Yale School of Medicine, Department of Psychiatry, Division of Human Genetics, New Haven, CT Veteran Affairs Connecticut Healthcare System, West Haven, CT
| |
Collapse
|
23
|
VolcanoFinder: Genomic scans for adaptive introgression. PLoS Genet 2020; 16:e1008867. [PMID: 32555579 PMCID: PMC7326285 DOI: 10.1371/journal.pgen.1008867] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2019] [Revised: 06/30/2020] [Accepted: 05/18/2020] [Indexed: 12/16/2022] Open
Abstract
Recent research shows that introgression between closely-related species is an important source of adaptive alleles for a wide range of taxa. Typically, detection of adaptive introgression from genomic data relies on comparative analyses that require sequence data from both the recipient and the donor species. However, in many cases, the donor is unknown or the data is not currently available. Here, we introduce a genome-scan method—VolcanoFinder—to detect recent events of adaptive introgression using polymorphism data from the recipient species only. VolcanoFinder detects adaptive introgression sweeps from the pattern of excess intermediate-frequency polymorphism they produce in the flanking region of the genome, a pattern which appears as a volcano-shape in pairwise genetic diversity. Using coalescent theory, we derive analytical predictions for these patterns. Based on these results, we develop a composite-likelihood test to detect signatures of adaptive introgression relative to the genomic background. Simulation results show that VolcanoFinder has high statistical power to detect these signatures, even for older sweeps and for soft sweeps initiated by multiple migrant haplotypes. Finally, we implement VolcanoFinder to detect archaic introgression in European and sub-Saharan African human populations, and uncovered interesting candidates in both populations, such as TSHR in Europeans and TCHH-RPTN in Africans. We discuss their biological implications and provide guidelines for identifying and circumventing artifactual signals during empirical applications of VolcanoFinder. The process by which beneficial alleles are introduced into a species from a closely-related species is termed adaptive introgression. We present an analytically-tractable model for the effects of adaptive introgression on non-adaptive genetic variation in the genomic region surrounding the beneficial allele. The result we describe is a characteristic volcano-shaped pattern of increased variability that arises around the positively-selected site, and we introduce an open-source method VolcanoFinder to detect this signal in genomic data. Importantly, VolcanoFinder is a population-genetic likelihood-based approach, rather than a comparative-genomic approach, and can therefore probe genomic variation data from a single population for footprints of adaptive introgression, even from a priori unknown and possibly extinct donor species.
Collapse
|
24
|
Battey CJ, Ralph PL, Kern AD. Predicting geographic location from genetic variation with deep neural networks. eLife 2020; 9:e54507. [PMID: 32511092 PMCID: PMC7324158 DOI: 10.7554/elife.54507] [Citation(s) in RCA: 56] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 06/03/2020] [Indexed: 12/12/2022] Open
Abstract
Most organisms are more closely related to nearby than distant members of their species, creating spatial autocorrelations in genetic data. This allows us to predict the location of origin of a genetic sample by comparing it to a set of samples of known geographic origin. Here, we describe a deep learning method, which we call Locator, to accomplish this task faster and more accurately than existing approaches. In simulations, Locator infers sample location to within 4.1 generations of dispersal and runs at least an order of magnitude faster than a recent model-based approach. We leverage Locator's computational efficiency to predict locations separately in windows across the genome, which allows us to both quantify uncertainty and describe the mosaic ancestry and patterns of geographic mixing that characterize many populations. Applied to whole-genome sequence data from Plasmodium parasites, Anopheles mosquitoes, and global human populations, this approach yields median test errors of 16.9km, 5.7km, and 85km, respectively.
Collapse
Affiliation(s)
- CJ Battey
- University of Oregon, Institute of Ecology and EvolutionEugeneUnited States
| | - Peter L Ralph
- University of Oregon, Institute of Ecology and EvolutionEugeneUnited States
| | - Andrew D Kern
- University of Oregon, Institute of Ecology and EvolutionEugeneUnited States
| |
Collapse
|
25
|
Sankararaman S. Methods for detecting introgressed archaic sequences. Curr Opin Genet Dev 2020; 62:85-90. [PMID: 32717667 PMCID: PMC7484293 DOI: 10.1016/j.gde.2020.05.026] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Revised: 05/12/2020] [Accepted: 05/22/2020] [Indexed: 11/16/2022]
Abstract
Analysis of genome sequences from archaic and modern humans have revealed multiple episodes of admixture between highly-diverged population groups. Statistical methods that attempt to localize DNA segments introduced by these events offer a powerful tool to investigate recent human evolution. We review recent advances in methods for detecting introgressed sequences.
Collapse
Affiliation(s)
- Sriram Sankararaman
- Department of Computer Science, University of California, Los Angeles, CA 90095, United States; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, United States; Department of Computational Medicine, University of California, Los Angeles, CA 90095, United States.
| |
Collapse
|
26
|
Ottenburghs J. Ghost Introgression: Spooky Gene Flow in the Distant Past. Bioessays 2020; 42:e2000012. [DOI: 10.1002/bies.202000012] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2020] [Revised: 02/25/2020] [Indexed: 01/25/2023]
Affiliation(s)
- Jente Ottenburghs
- Department of Evolutionary Biology, Evolutionary Biology Centre Uppsala University Norbyvägen 18D Uppsala SE‐752 36 Sweden
- Wildlife Ecology and Conservation Group Wageningen University Droevendaalsesteeg 3a Wageningen 6708 PB The Netherlands
- Forest Ecology and Forest Management Group Wageningen University Droevendaalsesteeg 3a Wageningen 6708 PB The Netherlands
| |
Collapse
|
27
|
Durvasula A, Sankararaman S. Recovering signals of ghost archaic introgression in African populations. SCIENCE ADVANCES 2020; 6:eaax5097. [PMID: 32095519 PMCID: PMC7015685 DOI: 10.1126/sciadv.aax5097] [Citation(s) in RCA: 68] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 12/03/2019] [Indexed: 05/18/2023]
Abstract
While introgression from Neanderthals and Denisovans has been documented in modern humans outside Africa, the contribution of archaic hominins to the genetic variation of present-day Africans remains poorly understood. We provide complementary lines of evidence for archaic introgression into four West African populations. Our analyses of site frequency spectra indicate that these populations derive 2 to 19% of their genetic ancestry from an archaic population that diverged before the split of Neanderthals and modern humans. Using a method that can identify segments of archaic ancestry without the need for reference archaic genomes, we built genome-wide maps of archaic ancestry in the Yoruba and the Mende populations. Analyses of these maps reveal segments of archaic ancestry at high frequency in these populations that represent potential targets of adaptive introgression. Our results reveal the substantial contribution of archaic ancestry in shaping the gene pool of present-day West African populations.
Collapse
Affiliation(s)
- Arun Durvasula
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Sriram Sankararaman
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| |
Collapse
|
28
|
Abstract
Context: Africa's role in the narrative of human evolution is indisputably emphasised in the emergence of Homo sapiens. However, once humans dispersed beyond Africa, the history of those who stayed remains vastly under-studied, lacking the proper attention the birthplace of both modern and archaic humans deserves. The sequencing of Neanderthal and Denisovan genomes has elucidated evidence of admixture between archaic and modern humans outside of Africa, but has not aided efforts in answering whether archaic admixture happened within Africa. Objectives: This article reviews the state of research for archaic introgression in African populations and discusses recent insights into this topic. Methods: Gathering published sources and recently released preprints, this review reports on the different methods developed for detecting archaic introgression. Particularly it discusses how relevant these are when implemented on African populations and what findings these studies have shown so far. Results: Methods for detecting archaic introgression have been predominantly developed and implemented on non-African populations. Recent preprints present new methods considering African populations. While a number of studies using these methods suggest archaic introgression in Africa, without an African archaic genome to validate these results, such findings remain as putative archaic introgression. Conclusion: In light of the caveats with implementing current archaic introgression detection methods in Africa, we recommend future studies to concentrate on unravelling the complicated demographic history of Africa through means of ancient DNA where possible and through more focused efforts to sequence modern DNA from more representative populations across the African continent.
Collapse
Affiliation(s)
- Cindy Santander
- a Department of Zoology , University of Oxford , Oxford , UK
| | - Francesco Montinaro
- a Department of Zoology , University of Oxford , Oxford , UK.,b Estonian Biocentre , University of Tartu , Tartu , Estonia
| | | |
Collapse
|