1
|
Venkatesh SS, Wittemans LBL, Palmer DS, Baya NA, Ferreira T, Hill B, Lassen FH, Parker MJ, Reibe S, Elhakeem A, Banasik K, Bruun MT, Erikstrup C, Aagard Jensen B, Juul A, Mikkelsen C, Nielsen HS, Ostrowski SR, Pedersen OB, Rohde PD, Sørensen E, Ullum H, Westergaard D, Haraldsson A, Holm H, Jonsdottir I, Olafsson I, Steingrimsdottir T, Steinthorsdottir V, Thorleifsson G, Figueredo J, Karjalainen MK, Pasanen A, Jacobs BM, Kalantzis G, Hubers N, Lippincott M, Fraser A, Lawlor DA, Timpson NJ, Nyegaard M, Stefansson K, Magi R, Laivuori H, van Heel DA, Boomsma DI, Balasubramanian R, Seminara SB, Chan YM, Laisk T, Lindgren CM. Genome-wide analyses identify 25 infertility loci and relationships with reproductive traits across the allele frequency spectrum. Nat Genet 2025:10.1038/s41588-025-02156-8. [PMID: 40229599 DOI: 10.1038/s41588-025-02156-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Accepted: 03/07/2025] [Indexed: 04/16/2025]
Abstract
Genome-wide association studies (GWASs) may help inform the etiology of infertility. Here, we perform GWAS meta-analyses across seven cohorts in up to 42,629 cases and 740,619 controls and identify 25 genetic risk loci for male and female infertility. We additionally identify up to 269 genetic loci associated with follicle-stimulating hormone, luteinizing hormone, estradiol and testosterone through sex-specific GWAS meta-analyses (n = 6,095-246,862). Exome sequencing analyses reveal that women carrying testosterone-lowering rare variants in some genes are at risk of infertility. However, we find no local or genome-wide genetic correlation between female infertility and reproductive hormones. While infertility is genetically correlated with endometriosis and polycystic ovary syndrome, we find limited genetic overlap between infertility and obesity. Finally, we show that the evolutionary persistence of infertility-risk alleles may be explained by directional selection. Taken together, we provide a comprehensive view of the genetic determinants of infertility across multiple diagnostic criteria.
Collapse
Affiliation(s)
- Samvida S Venkatesh
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK.
- Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, UK.
| | - Laura B L Wittemans
- Novo Nordisk Research Centre Oxford, Oxford, UK
- Nuffield Department of Women's and Reproductive Health, Medical Sciences Division, University of Oxford, Oxford, UK
| | - Duncan S Palmer
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
- Nuffield Department of Population Health, Medical Sciences Division, University of Oxford, Oxford, UK
| | - Nikolas A Baya
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
- Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Teresa Ferreira
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
| | - Barney Hill
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
- Nuffield Department of Population Health, Medical Sciences Division, University of Oxford, Oxford, UK
| | - Frederik Heymann Lassen
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
- Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Melody J Parker
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
- Nuffield Department of Clinical Medicine, University of Oxford, John Radcliffe Hospital, Oxford, UK
| | - Saskia Reibe
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
- Nuffield Department of Population Health, Medical Sciences Division, University of Oxford, Oxford, UK
| | - Ahmed Elhakeem
- MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, UK
| | - Karina Banasik
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
- Department of Obstetrics and Gynecology, Copenhagen University Hospital, Hvidovre, Copenhagen, Denmark
| | - Mie T Bruun
- Department of Clinical Immunology, Odense University Hospital, Odense, Denmark
| | - Christian Erikstrup
- Department of Clinical Immunology, Aarhus University Hospital, Aarhus, Denmark
- Department of Clinical Medicine, Health, Aarhus University, Aarhus, Denmark
| | - Bitten Aagard Jensen
- Department of Clinical Immunology, Aalborg University Hospital, Aalborg, Denmark
| | - Anders Juul
- Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
- Department of Growth and Reproduction, Copenhagen University Hospital-Rigshospitalet, Copenhagen, Denmark
| | - Christina Mikkelsen
- Department of Clinical Immunology, Copenhagen University Hospital-Rigshospitalet, Copenhagen, Denmark
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Science, Copenhagen University, Copenhagen, Denmark
| | - Henriette S Nielsen
- Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
- Department of Obstetrics and Gynecology, The Fertility Clinic, Hvidovre University Hospital, Copenhagen, Denmark
| | - Sisse R Ostrowski
- Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
- Department of Clinical Immunology, Copenhagen University Hospital-Rigshospitalet, Copenhagen, Denmark
| | - Ole B Pedersen
- Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
- Department of Clinical Immunology, Zealand University Hospital-Køge, Køge, Denmark
| | - Palle Duun Rohde
- Genomic Medicine, Department of Health Science and Technology, Aalborg University, Aalborg, Denmark
| | - Erik Sørensen
- Department of Clinical Immunology, Copenhagen University Hospital-Rigshospitalet, Copenhagen, Denmark
| | | | - David Westergaard
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
- Department of Obstetrics and Gynecology, Copenhagen University Hospital, Hvidovre, Copenhagen, Denmark
| | - Asgeir Haraldsson
- Faculty of Medicine, University of Iceland, Reykjavik, Iceland
- Children's Hospital Iceland, Landspitali University Hospital, Reykjavik, Iceland
| | - Hilma Holm
- deCODE genetics/Amgen Inc., Reykjavik, Iceland
| | - Ingileif Jonsdottir
- Faculty of Medicine, University of Iceland, Reykjavik, Iceland
- deCODE genetics/Amgen Inc., Reykjavik, Iceland
| | - Isleifur Olafsson
- Department of Clinical Biochemistry, Landspitali University Hospital, Reykjavik, Iceland
| | - Thora Steingrimsdottir
- Faculty of Medicine, University of Iceland, Reykjavik, Iceland
- Department of Obstetrics and Gynecology, Landspitali University Hospital, Reykjavik, Iceland
| | | | | | - Jessica Figueredo
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Minna K Karjalainen
- Institute for Molecular Medicine Finland, Helsinki Institute of Life Science, University of Helsinki, Helsinki, Finland
- Research Unit of Population Health, Faculty of Medicine, University of Oulu, Oulu, Finland
- Northern Finland Birth Cohorts, Arctic Biobank, Infrastructure for Population Studies, Faculty of Medicine, University of Oulu, Oulu, Finland
| | - Anu Pasanen
- Research Unit of Clinical Medicine, Medical Research Center Oulu, University of Oulu, and Department of Children and Adolescents, Oulu University Hospital, Oulu, Finland
| | - Benjamin M Jacobs
- Centre for Preventive Neurology, Wolfson Institute of Population Health, Queen Mary University London, London, UK
| | | | - Nikki Hubers
- Department of Biological Psychology, Netherlands Twin Register, Vrije Universiteit, Amsterdam, The Netherlands
- Amsterdam Reproduction and Development Institute, Amsterdam, The Netherlands
- Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive Research, Vrije Universiteit, Amsterdam, The Netherlands
| | - Margaret Lippincott
- Reproductive Endocrine Unit, Massachusetts General Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Abigail Fraser
- MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, UK
| | - Deborah A Lawlor
- MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, UK
| | - Nicholas J Timpson
- MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, UK
| | - Mette Nyegaard
- Genomic Medicine, Department of Health Science and Technology, Aalborg University, Aalborg, Denmark
- Statens Serum Institut, Copenhagen, Denmark
| | - Kari Stefansson
- Faculty of Medicine, University of Iceland, Reykjavik, Iceland
- deCODE genetics/Amgen Inc., Reykjavik, Iceland
| | - Reedik Magi
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Hannele Laivuori
- Institute for Molecular Medicine Finland, Helsinki Institute of Life Science, University of Helsinki, Helsinki, Finland
- Medical and Clinical Genetics, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
- Department of Obstetrics and Gynecology, Tampere University Hospital, The Wellbeing Services County of Pirkanmaa, Tampere, Finland
- Center for Child, Adolescent, and Maternal Health Research, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | | | - Dorret I Boomsma
- Amsterdam Reproduction and Development Institute, Amsterdam, The Netherlands
- Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive Research, Vrije Universiteit, Amsterdam, The Netherlands
| | - Ravikumar Balasubramanian
- Reproductive Endocrine Unit, Massachusetts General Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Stephanie B Seminara
- Reproductive Endocrine Unit, Massachusetts General Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Yee-Ming Chan
- Harvard Medical School, Boston, MA, USA
- Division of Endocrinology, Department of Pediatrics, Boston Children's Hospital, Boston, MA, USA
| | - Triin Laisk
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Cecilia M Lindgren
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK.
- Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, UK.
- Nuffield Department of Women's and Reproductive Health, Medical Sciences Division, University of Oxford, Oxford, UK.
- Broad Institute of Harvard and MIT, Cambridge, MA, USA.
| |
Collapse
|
2
|
Zhang H, Sun S, Liu J, Guo Q, Meng L, Chen J, Xiang X, Zhou Y, Zhang N, Liu H, Liu Y, Yan G, Ji Q, He L, Cai S, Cai C, Huang X, Xu S, Xiao Y, Zhang Y, Wang K, Liu Y, Chen H, Yue Z, He S, Wang J, Yang H, Liu X, Seim I, Gu Y, Li Q, Zhang G, Lee SMY, Kristiansen K, Xu X, Liu S, Fan G. The amphipod genome reveals population dynamics and adaptations to hadal environment. Cell 2025; 188:1378-1392.e18. [PMID: 40054448 DOI: 10.1016/j.cell.2025.01.030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Revised: 12/16/2024] [Accepted: 01/20/2025] [Indexed: 05/13/2025]
Abstract
The amphipod Hirondellea gigas is a dominant species inhabiting the deepest part of the ocean (∼6,800-11,000 m), but little is known about its genetic adaptation and population dynamics. Here, we present a chromosome-level genome of H. gigas, characterized by a large genome size of 13.92 Gb. Whole-genome sequencing of 510 individuals from the Mariana Trench indicates no population differentiation across depths, suggesting its capacity to tolerate hydrostatic pressure across wide ranges. H. gigas in the West Philippine Basin is genetically divergent from the Mariana and Yap Trenches, suggesting genetic isolation attributed to the geographic separation of hadal features. A drastic reduction in effective population size potentially reflects glacial-interglacial changes. By integrating multi-omics analysis, we propose host-symbiotic microbial interactions may be crucial in the adaptation of H. gigas to the extremely high-pressure and food-limited environment. Our findings provide clues for adaptation to the hadal zone and population genetics.
Collapse
Affiliation(s)
- Haibin Zhang
- Institute of Deep-sea Science and Engineering, Chinese Academy of Sciences, Sanya 572000, China; Institution of Deep-sea Life Sciences, IDSSE-BGI, Hainan Deep-sea Technology Laboratory, Sanya 57200, China.
| | - Shuai Sun
- BGI Research, Qingdao 266555, China; State Key Laboratory of Genome and Multi-omics Technologies, BGI Research, Shenzhen 518083, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China; Shenzhen Key Laboratory of Marine Biology Genomics, BGI Research, Shenzhen 518083, China
| | - Jun Liu
- Institute of Deep-sea Science and Engineering, Chinese Academy of Sciences, Sanya 572000, China
| | - Qunfei Guo
- State Key Laboratory of Genome and Multi-omics Technologies, BGI Research, Shenzhen 518083, China; BGI Research, Wuhan 430074, China
| | - Liang Meng
- BGI Research, Qingdao 266555, China; BGI Research, Sanya 572025, China; Institution of Deep-sea Life Sciences, IDSSE-BGI, Hainan Deep-sea Technology Laboratory, Sanya 57200, China
| | - Jianwei Chen
- BGI Research, Qingdao 266555, China; Qingdao Key Laboratory of Marine Genomics and Qingdao-Europe Advanced Institute for Life Sciences, BGI Research, Qingdao 266555, China; Laboratory of Integrative Biomedicine, Department of Biology, University of Copenhagen, Copenhagen 2100, Denmark
| | - Xueyan Xiang
- State Key Laboratory of Genome and Multi-omics Technologies, BGI Research, Shenzhen 518083, China; BGI Research, Wuhan 430074, China; Shenzhen Key Laboratory of Marine Biology Genomics, BGI Research, Shenzhen 518083, China
| | - Yang Zhou
- Institute of Deep-sea Science and Engineering, Chinese Academy of Sciences, Sanya 572000, China
| | - Nannan Zhang
- BGI Research, Qingdao 266555, China; BGI Research, Sanya 572025, China
| | - Helu Liu
- Institute of Deep-sea Science and Engineering, Chinese Academy of Sciences, Sanya 572000, China
| | | | - Guoyong Yan
- Institute of Deep-sea Science and Engineering, Chinese Academy of Sciences, Sanya 572000, China
| | | | - Lisheng He
- Institute of Deep-sea Science and Engineering, Chinese Academy of Sciences, Sanya 572000, China
| | - Shanya Cai
- Institute of Deep-sea Science and Engineering, Chinese Academy of Sciences, Sanya 572000, China
| | | | - Xin Huang
- Institute of Deep-sea Science and Engineering, Chinese Academy of Sciences, Sanya 572000, China
| | - Shiyu Xu
- BGI Research, Qingdao 266555, China
| | - Yunlu Xiao
- Institute of Deep-sea Science and Engineering, Chinese Academy of Sciences, Sanya 572000, China
| | | | - Kun Wang
- School of Ecology and Environment, Northwestern Polytechnical University, Xi'an 710072, China
| | | | - Haixin Chen
- BGI Research, Sanya 572025, China; Institution of Deep-sea Life Sciences, IDSSE-BGI, Hainan Deep-sea Technology Laboratory, Sanya 57200, China
| | - Zhen Yue
- BGI Research, Sanya 572025, China; Institution of Deep-sea Life Sciences, IDSSE-BGI, Hainan Deep-sea Technology Laboratory, Sanya 57200, China
| | - Shunping He
- Institute of Deep-sea Science and Engineering, Chinese Academy of Sciences, Sanya 572000, China
| | | | - Huanming Yang
- State Key Laboratory of Genome and Multi-omics Technologies, BGI Research, Shenzhen 518083, China
| | - Xin Liu
- State Key Laboratory of Genome and Multi-omics Technologies, BGI Research, Shenzhen 518083, China; Qingdao Key Laboratory of Marine Genomics and Qingdao-Europe Advanced Institute for Life Sciences, BGI Research, Qingdao 266555, China
| | - Inge Seim
- Institute of Deep-sea Science and Engineering, Chinese Academy of Sciences, Sanya 572000, China
| | - Ying Gu
- State Key Laboratory of Genome and Multi-omics Technologies, BGI Research, Shenzhen 518083, China
| | - Qiye Li
- State Key Laboratory of Genome and Multi-omics Technologies, BGI Research, Shenzhen 518083, China
| | - Guojie Zhang
- Center of Evolutionary & Organismal Biology and Women's Hospital, Zhejiang University School of Medicine, Hangzhou 310058, China; Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou 311121, China
| | - Simon Ming-Yuen Lee
- Department of Food Science and Nutrition and PolyU-BGI Joint Research Centre for Genomics and Synthetic Biology in Global Ocean Resources, The Hong Kong Polytechnic University, Hong Kong, China
| | - Karsten Kristiansen
- Qingdao Key Laboratory of Marine Genomics and Qingdao-Europe Advanced Institute for Life Sciences, BGI Research, Qingdao 266555, China; Laboratory of Integrative Biomedicine, Department of Biology, University of Copenhagen, Copenhagen 2100, Denmark
| | - Xun Xu
- State Key Laboratory of Genome and Multi-omics Technologies, BGI Research, Shenzhen 518083, China; Qingdao Key Laboratory of Marine Genomics and Qingdao-Europe Advanced Institute for Life Sciences, BGI Research, Qingdao 266555, China; BGI Research, Hangzhou 310030, China.
| | - Shanshan Liu
- State Key Laboratory of Genome and Multi-omics Technologies, BGI Research, Shenzhen 518083, China; Institution of Deep-sea Life Sciences, IDSSE-BGI, Hainan Deep-sea Technology Laboratory, Sanya 57200, China; BGI, Shenzhen 518083, China; Shenzhen Key Laboratory of Marine Biology Genomics, BGI Research, Shenzhen 518083, China.
| | - Guangyi Fan
- BGI Research, Qingdao 266555, China; State Key Laboratory of Genome and Multi-omics Technologies, BGI Research, Shenzhen 518083, China; Qingdao Key Laboratory of Marine Genomics and Qingdao-Europe Advanced Institute for Life Sciences, BGI Research, Qingdao 266555, China; BGI Research, Sanya 572025, China; Department of Food Science and Nutrition and PolyU-BGI Joint Research Centre for Genomics and Synthetic Biology in Global Ocean Resources, The Hong Kong Polytechnic University, Hong Kong, China; Shenzhen Key Laboratory of Marine Biology Genomics, BGI Research, Shenzhen 518083, China.
| |
Collapse
|
3
|
Herzog T, Larena M, Kutanan W, Lukas H, Fieder M, Schaschl H. Natural selection and adaptive traits in the Maniq, a nomadic hunter-gatherer society from Mainland Southeast Asia. Sci Rep 2025; 15:4809. [PMID: 39924514 PMCID: PMC11808089 DOI: 10.1038/s41598-024-83657-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Accepted: 12/16/2024] [Indexed: 02/11/2025] Open
Abstract
Asia is home to diverse hunter-gatherer populations characterized by significant morphological, anthropological, cultural, and linguistic diversity. Despite their importance in understanding ancestral human subsistence, little is known about the essential genetic adaptations of these groups. This study investigates the evolutionary pressures shaping the genome of the Maniq population, a nomadic hunter-gatherer group inhabiting the rainforests of southern Thailand. Using genome-wide approaches, including iHS, xp-EHH, PBE, and beta statistics, we identified signatures of positive and balancing selection. Genes under positive selection were enriched in pathways related to immunity, metabolic regulation, structural adaptation, cardiovascular performance, and neuromodulatory traits. Several genes associated with the Southeast Asian 'negrito-like' phenotype were also under positive selection. Balancing selection was primarily detected in immune-related genes, particularly within the HLA region, underscoring the critical role of genetic diversity in surviving pathogen-rich environments. Additionally, balancing selection in olfactory receptor genes highlights their importance in environmental sensing and adaptation. These results reveal the intricate interplay of positive and balancing selection in shaping the genetic landscape of the Maniq population and highlight their adaptations to the ecological and lifestyle challenges of life in the rainforest. This study contributes to our understanding of human evolutionary processes in tropical environments and hunter-gatherer societies.
Collapse
Affiliation(s)
- Tobias Herzog
- Department of Evolutionary Anthropology, Faculty of Life Sciences, University of Vienna, Djerassiplatz 1, Vienna, 1030, Austria.
| | - Maximilian Larena
- Human Evolution, Department of Organismal Biology, Uppsala University, Norbyvägen 18C, Uppsala, 75236, Sweden
| | - Wibhu Kutanan
- Department of Biology, Faculty of Science, Naresuan University, Phitsanulok, 65000, Thailand
| | - Helmut Lukas
- Institute for Social Anthropology, Austrian Academy of Sciences, Georg-Coch-Platz 2, Vienna, 1010, Austria
| | - Martin Fieder
- Department of Evolutionary Anthropology, Faculty of Life Sciences, University of Vienna, Djerassiplatz 1, Vienna, 1030, Austria
| | - Helmut Schaschl
- Department of Evolutionary Anthropology, Faculty of Life Sciences, University of Vienna, Djerassiplatz 1, Vienna, 1030, Austria.
| |
Collapse
|
4
|
Witt KE, Villanea FA. Computational Genomics and Its Applications to Anthropological Questions. AMERICAN JOURNAL OF BIOLOGICAL ANTHROPOLOGY 2024; 186 Suppl 78:e70010. [PMID: 40071816 PMCID: PMC11898561 DOI: 10.1002/ajpa.70010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/24/2024] [Revised: 10/14/2024] [Accepted: 12/19/2024] [Indexed: 03/15/2025]
Abstract
The advent of affordable genome sequencing and the development of new computational tools have established a new era of genomic knowledge. Sequenced human genomes number in the tens of thousands, including thousands of ancient human genomes. The abundance of data has been met with new analysis tools that can be used to understand populations' demographic and evolutionary histories. Thus, a variety of computational methods now exist that can be leveraged to answer anthropological questions. This includes novel likelihood and Bayesian methods, machine learning techniques, and a vast array of population simulators. These computational tools provide powerful insights gained from genomic datasets, although they are generally inaccessible to those with less computational experience. Here, we outline the theoretical workings behind computational genomics methods, limitations and other considerations when applying these computational methods, and examples of how computational methods have already been applied to anthropological questions. We hope this review will empower other anthropologists to utilize these powerful tools in their own research.
Collapse
Affiliation(s)
- Kelsey E. Witt
- Department of Genetics and Biochemistry and Center for Human GeneticsClemson UniversityClemsonSouth CarolinaUSA
| | | |
Collapse
|
5
|
Cheng X, Steinrücken M. Population Genomic Scans for Natural Selection and Demography. Annu Rev Genet 2024; 58:319-339. [PMID: 39227130 DOI: 10.1146/annurev-genet-111523-102651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/05/2024]
Abstract
Uncovering the fundamental processes that shape genomic variation in natural populations is a primary objective of population genetics. These processes include demographic effects such as past changes in effective population size or gene flow between structured populations. Furthermore, genomic variation is affected by selection on nonneutral genetic variants, for example, through the adaptation of beneficial alleles or balancing selection that maintains genetic variation. In this article, we discuss the characterization of these processes using population genetic models, and we review methods developed on the basis of these models to unravel the underlying processes from modern population genomic data sets. We briefly discuss the conditions in which these approaches can be used to infer demography or identify specific nonneutral genetic variants and cases in which caution is warranted. Moreover, we summarize the challenges of jointly inferring demography and selective processes that affect neutral variation genome-wide.
Collapse
Affiliation(s)
- Xiaoheng Cheng
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois, USA;
| | - Matthias Steinrücken
- Department of Human Genetics, University of Chicago, Chicago, Illinois, USA
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois, USA;
| |
Collapse
|
6
|
Braichenko S, Borges R, Kosiol C. Polymorphism-Aware Models in RevBayes: Species Trees, Disentangling Balancing Selection, and GC-Biased Gene Conversion. Mol Biol Evol 2024; 41:msae138. [PMID: 38980178 PMCID: PMC11272101 DOI: 10.1093/molbev/msae138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 04/19/2024] [Accepted: 07/06/2024] [Indexed: 07/10/2024] Open
Abstract
The role of balancing selection is a long-standing evolutionary puzzle. Balancing selection is a crucial evolutionary process that maintains genetic variation (polymorphism) over extended periods of time; however, detecting it poses a significant challenge. Building upon the Polymorphism-aware phylogenetic Models (PoMos) framework rooted in the Moran model, we introduce a PoMoBalance model. This novel approach is designed to disentangle the interplay of mutation, genetic drift, and directional selection (GC-biased gene conversion), along with the previously unexplored balancing selection pressures on ultra-long timescales comparable with species divergence times by analyzing multi-individual genomic and phylogenetic divergence data. Implemented in the open-source RevBayes Bayesian framework, PoMoBalance offers a versatile tool for inferring phylogenetic trees as well as quantifying various selective pressures. The novel aspect of our approach in studying balancing selection lies in polymorphism-aware phylogenetic models' ability to account for ancestral polymorphisms and incorporate parameters that measure frequency-dependent selection, allowing us to determine the strength of the effect and exact frequencies under selection. We implemented validation tests and assessed the model on the data simulated with SLiM and a custom Moran model simulator. Real sequence analysis of Drosophila populations reveals insights into the evolutionary dynamics of regions subject to frequency-dependent balancing selection, particularly in the context of sex-limited color dimorphism in Drosophila erecta.
Collapse
Affiliation(s)
- Svitlana Braichenko
- Centre for Biological Diversity, School of Biology, University of St Andrews, Fife KY16 9TH, UK
- Institute of Genetics and Cancer, University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Rui Borges
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien 1210, Austria
| | - Carolin Kosiol
- Centre for Biological Diversity, School of Biology, University of St Andrews, Fife KY16 9TH, UK
| |
Collapse
|
7
|
Soni V, Jensen JD. Temporal challenges in detecting balancing selection from population genomic data. G3 (BETHESDA, MD.) 2024; 14:jkae069. [PMID: 38551137 DOI: 10.1093/g3journal/jkae069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 12/21/2023] [Accepted: 03/19/2024] [Indexed: 04/28/2024]
Abstract
The role of balancing selection in maintaining genetic variation remains an open question in population genetics. Recent years have seen numerous studies identifying candidate loci potentially experiencing balancing selection, most predominantly in human populations. There are however numerous alternative evolutionary processes that may leave similar patterns of variation, thereby potentially confounding inference, and the expected signatures of balancing selection additionally change in a temporal fashion. Here we use forward-in-time simulations to quantify expected statistical power to detect balancing selection using both site frequency spectrum- and linkage disequilibrium-based methods under a variety of evolutionarily realistic null models. We find that whilst site frequency spectrum-based methods have little power immediately after a balanced mutation begins segregating, power increases with time since the introduction of the balanced allele. Conversely, linkage disequilibrium-based methods have considerable power whilst the allele is young, and power dissipates rapidly as the time since introduction increases. Taken together, this suggests that site frequency spectrum-based methods are most effective at detecting long-term balancing selection (>25N generations since the introduction of the balanced allele) whilst linkage disequilibrium-based methods are effective over much shorter timescales (<1N generations), thereby leaving a large time frame over which current methods have little power to detect the action of balancing selection. Finally, we investigate the extent to which alternative evolutionary processes may mimic these patterns, and demonstrate the need for caution in attempting to distinguish the signatures of balancing selection from those of both neutral processes (e.g. population structure and admixture) as well as of alternative selective processes (e.g. partial selective sweeps).
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ 85281, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ 85281, USA
| |
Collapse
|
8
|
Venkatesh SS, Wittemans LBL, Palmer DS, Baya NA, Ferreira T, Hill B, Lassen FH, Parker MJ, Reibe S, Elhakeem A, Banasik K, Bruun MT, Erikstrup C, Jensen BA, Juul A, Mikkelsen C, Nielsen HS, Ostrowski SR, Pedersen OB, Rohde PD, Sorensen E, Ullum H, Westergaard D, Haraldsson A, Holm H, Jonsdottir I, Olafsson I, Steingrimsdottir T, Steinthorsdottir V, Thorleifsson G, Figueredo J, Karjalainen MK, Pasanen A, Jacobs BM, Hubers N, Lippincott M, Fraser A, Lawlor DA, Timpson NJ, Nyegaard M, Stefansson K, Magi R, Laivuori H, van Heel DA, Boomsma DI, Balasubramanian R, Seminara SB, Chan YM, Laisk T, Lindgren CM. Genome-wide analyses identify 21 infertility loci and over 400 reproductive hormone loci across the allele frequency spectrum. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.03.19.24304530. [PMID: 38562841 PMCID: PMC10984039 DOI: 10.1101/2024.03.19.24304530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Genome-wide association studies (GWASs) may help inform treatments for infertility, whose causes remain unknown in many cases. Here we present GWAS meta-analyses across six cohorts for male and female infertility in up to 41,200 cases and 687,005 controls. We identified 21 genetic risk loci for infertility (P≤5E-08), of which 12 have not been reported for any reproductive condition. We found positive genetic correlations between endometriosis and all-cause female infertility (r g=0.585, P=8.98E-14), and between polycystic ovary syndrome and anovulatory infertility (r g=0.403, P=2.16E-03). The evolutionary persistence of female infertility-risk alleles in EBAG9 may be explained by recent directional selection. We additionally identified up to 269 genetic loci associated with follicle-stimulating hormone (FSH), luteinising hormone, oestradiol, and testosterone through sex-specific GWAS meta-analyses (N=6,095-246,862). While hormone-associated variants near FSHB and ARL14EP colocalised with signals for anovulatory infertility, we found no r g between female infertility and reproductive hormones (P>0.05). Exome sequencing analyses in the UK Biobank (N=197,340) revealed that women carrying testosterone-lowering rare variants in GPC2 were at higher risk of infertility (OR=2.63, P=1.25E-03). Taken together, our results suggest that while individual genes associated with hormone regulation may be relevant for fertility, there is limited genetic evidence for correlation between reproductive hormones and infertility at the population level. We provide the first comprehensive view of the genetic architecture of infertility across multiple diagnostic criteria in men and women, and characterise its relationship to other health conditions.
Collapse
Affiliation(s)
- Samvida S Venkatesh
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, United Kingdom
- Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford OX3 7BN, United Kingdom
| | - Laura B L Wittemans
- Novo Nordisk Research Centre Oxford, Oxford, United Kingdom
- Nuffield Department of Women's and Reproductive Health, Medical Sciences Division, University of Oxford, United Kingdom
| | - Duncan S Palmer
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, United Kingdom
- Nuffield Department of Population Health, Medical Sciences Division, University of Oxford, Oxford, United Kingdom
| | - Nikolas A Baya
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, United Kingdom
- Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford OX3 7BN, United Kingdom
| | - Teresa Ferreira
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, United Kingdom
| | - Barney Hill
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, United Kingdom
- Nuffield Department of Population Health, Medical Sciences Division, University of Oxford, Oxford, United Kingdom
| | - Frederik Heymann Lassen
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, United Kingdom
- Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford OX3 7BN, United Kingdom
| | - Melody J Parker
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, United Kingdom
- Nuffield Department of Clinical Medicine, University of Oxford, John Radcliffe Hospital, Oxford, United Kingdom
| | - Saskia Reibe
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, United Kingdom
- Nuffield Department of Population Health, Medical Sciences Division, University of Oxford, Oxford, United Kingdom
| | - Ahmed Elhakeem
- MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, United Kingdom
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, United Kingdom
| | - Karina Banasik
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
- Department of Obstetrics and Gynecology, Copenhagen University Hospital, Hvidovre, Copenhagen, Denmark
| | - Mie T Bruun
- Department of Clinical Immunology, Odense University Hospital, Odense, Denmark
| | - Christian Erikstrup
- Department of Clinical Immunology, Aarhus University Hospital, Aarhus, Denmark
- Department of Clinical Medicine, Health, Aarhus University, Aarhus, Denmark
| | - Bitten A Jensen
- Department of Clinical Immunology, Aalborg University Hospital, Aalborg, Denmark
| | - Anders Juul
- Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen; Copenhagen, Denmark
- Department of Growth and Reproduction, Copenhagen University Hospital-Rigshospitalet, Copenhagen, Denmark
| | - Christina Mikkelsen
- Department of Clinical Immunology, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Science, Copenhagen University, Copenhagen, Denmark
| | - Henriette S Nielsen
- Department of Obstetrics and Gynecology, The Fertility Clinic, Hvidovre University Hospital, Copenhagen, Denmark
- Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Sisse R Ostrowski
- Department of Clinical Immunology, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
- Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Ole B Pedersen
- Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
- Department of Clinical Immunology, Zealand University Hospital, Kge, Denmark
| | - Palle D Rohde
- Genomic Medicine, Department of Health Science and Technology, Aalborg University, Aalborg, Denmark
| | - Erik Sorensen
- Department of Clinical Immunology, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | | | - David Westergaard
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
- Department of Obstetrics and Gynecology, Copenhagen University Hospital, Hvidovre, Copenhagen, Denmark
| | - Asgeir Haraldsson
- Faculty of Medicine, University of Iceland, Reykjavik, Iceland
- Children's Hospital Iceland, Landspitali University Hospital, Reykjavik, Iceland
| | - Hilma Holm
- deCODE genetics/Amgen, Inc., Reykjavik, Iceland
| | - Ingileif Jonsdottir
- Faculty of Medicine, University of Iceland, Reykjavik, Iceland
- deCODE genetics/Amgen, Inc., Reykjavik, Iceland
| | - Isleifur Olafsson
- Department of Clinical Biochemistry, Landspitali University Hospital, Reykjavik, Iceland
| | - Thora Steingrimsdottir
- Faculty of Medicine, University of Iceland, Reykjavik, Iceland
- Department of Obstetrics and Gynecology, Landspitali University Hospital, Reykjavik, Iceland
| | | | | | - Jessica Figueredo
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Minna K Karjalainen
- Institute for Molecular Medicine Finland, Helsinki Institute of Life Science, University of Helsinki, Helsinki, Finland
- Research Unit of Population Health, Faculty of Medicine, University of Oulu, Finland
- Northern Finland Birth Cohorts, Arctic Biobank, Infrastructure for Population Studies, Faculty of Medicine, University of Oulu, Oulu, Finland
| | - Anu Pasanen
- Research Unit of Clinical Medicine, Medical Research Center Oulu, University of Oulu, and Department of Children and Adolescents, Oulu University Hospital, Oulu, Finland
| | - Benjamin M Jacobs
- Centre for Preventive Neurology, Wolfson Institute of Population Health, Queen Mary University London, London, EC1M 6BQ, United Kingdom
| | - Nikki Hubers
- Department of Biological Psychology, Netherlands Twin Register, Vrije Universiteit, Amsterdam, The Netherlands
- Amsterdam Reproduction and Development Institute, Amsterdam, The Netherlands
| | - Margaret Lippincott
- Harvard Reproductive Sciences Center and Reproductive Endocrine Unit, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
| | - Abigail Fraser
- MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, United Kingdom
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, United Kingdom
| | - Deborah A Lawlor
- MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, United Kingdom
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, United Kingdom
| | - Nicholas J Timpson
- MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, United Kingdom
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, United Kingdom
| | - Mette Nyegaard
- Genomic Medicine, Department of Health Science and Technology, Aalborg University, Aalborg, Denmark
| | - Kari Stefansson
- Faculty of Medicine, University of Iceland, Reykjavik, Iceland
- deCODE genetics/Amgen, Inc., Reykjavik, Iceland
| | - Reedik Magi
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Hannele Laivuori
- Institute for Molecular Medicine Finland, Helsinki Institute of Life Science, University of Helsinki, Helsinki, Finland
- Medical and Clinical Genetics, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
- Department of Obstetrics and Gynecology, Tampere University Hospital, Finland
- Center for Child, Adolescent, and Maternal Health Research, Faculty of Medicine and Health Technology, Tampere University, Finland
| | - David A van Heel
- Blizard Institute, Queen Mary University London, London, E1 2AT, United Kingdom
| | - Dorret I Boomsma
- Department of Biological Psychology, Netherlands Twin Register, Vrije Universiteit, Amsterdam, The Netherlands
- Amsterdam Reproduction and Development Institute, Amsterdam, The Netherlands
| | - Ravikumar Balasubramanian
- Harvard Reproductive Sciences Center and Reproductive Endocrine Unit, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
| | - Stephanie B Seminara
- Harvard Reproductive Sciences Center and Reproductive Endocrine Unit, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
| | - Yee-Ming Chan
- Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Endocrinology, Department of Pediatrics, Boston Children's Hospital, Boston, Massachusetts, United States of America
| | - Triin Laisk
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Cecilia M Lindgren
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, United Kingdom
- Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford OX3 7BN, United Kingdom
- Nuffield Department of Women's and Reproductive Health, Medical Sciences Division, University of Oxford, United Kingdom
- Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
| |
Collapse
|
9
|
Lu Y, Luo F, Zhou A, Yi C, Chen H, Li J, Guo Y, Xie Y, Zhang W, Lin D, Yang Y, Wu Z, Zhang Y, Xu S, Hu W. Whole-genome sequencing of the invasive golden apple snail Pomacea canaliculata from Asia reveals rapid expansion and adaptive evolution. Gigascience 2024; 13:giae064. [PMID: 39311763 PMCID: PMC11417965 DOI: 10.1093/gigascience/giae064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 02/08/2024] [Accepted: 08/07/2024] [Indexed: 09/26/2024] Open
Abstract
Pomacea canaliculata, an invasive species native to South America, is recognized for its broad geographic distribution and adaptability to a variety of ecological conditions. The details concerning the evolution and adaptation of P. canaliculate remain unclear due to a lack of whole-genome resequencing data. We examined 173 P. canaliculata genomes representing 17 geographic populations in East and Southeast Asia. Interestingly, P. canaliculata showed a higher level of genetic diversity than other mollusks, and our analysis suggested that the dispersal of P. canaliculata could have been driven by climate changes and human activities. Notably, we identified a set of genes associated with low temperature adaptation, including Csde1, a cold shock protein coding gene. Further RNA sequencing analysis and reverse transcription quantitative polymerase chain reaction experiments demonstrated the gene's dynamic pattern and biological functions during cold exposure. Moreover, both positive selection and balancing selection are likely to have contributed to the rapid environmental adaptation of P. canaliculata populations. In particular, genes associated with energy metabolism and stress response were undergoing positive selection, while a large number of immune-related genes showed strong signatures of balancing selection. Our study has advanced our understanding of the evolution of P. canaliculata and has provided a valuable resource concerning an invasive species.
Collapse
Affiliation(s)
- Yan Lu
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China
- Center for Evolutionary Biology, Ministry of Education Key Laboratory of Contemporary Anthropology, Fudan University, Shanghai 200438, China
| | - Fang Luo
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China
| | - An Zhou
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China
- Center for Evolutionary Biology, Ministry of Education Key Laboratory of Contemporary Anthropology, Fudan University, Shanghai 200438, China
| | - Cun Yi
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China
- Joint Research Laboratory of Genetics and Ecology on Parasite-host Interaction, Chinese Center for Disease Control and Prevention & Fudan University, Shanghai 200438, China
| | - Hao Chen
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Jian Li
- China Basic Medical College, Guangxi Traditional Chinese Medical University, Nanning 530005, China
| | - Yunhai Guo
- National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention (Chinese Center for Tropical Diseases Research); NHC Key Laboratory of Parasite and Vector Biology; WHO Collaborating Centre for Tropical Diseases; National Center for International Research on Tropical Diseases, Shanghai 200025, China
| | - Yuxiang Xie
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China
- Joint Research Laboratory of Genetics and Ecology on Parasite-host Interaction, Chinese Center for Disease Control and Prevention & Fudan University, Shanghai 200438, China
| | - Wei Zhang
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China
- Joint Research Laboratory of Genetics and Ecology on Parasite-host Interaction, Chinese Center for Disease Control and Prevention & Fudan University, Shanghai 200438, China
| | - Datao Lin
- Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China
| | - Yaming Yang
- Yunnan Institute of Parasitic Diseases, Yunnan 665000, China
| | - Zhongdao Wu
- Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China
| | - Yi Zhang
- National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention (Chinese Center for Tropical Diseases Research); NHC Key Laboratory of Parasite and Vector Biology; WHO Collaborating Centre for Tropical Diseases; National Center for International Research on Tropical Diseases, Shanghai 200025, China
| | - Shuhua Xu
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China
- Center for Evolutionary Biology, Ministry of Education Key Laboratory of Contemporary Anthropology, Fudan University, Shanghai 200438, China
| | - Wei Hu
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China
- Joint Research Laboratory of Genetics and Ecology on Parasite-host Interaction, Chinese Center for Disease Control and Prevention & Fudan University, Shanghai 200438, China
- College of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| |
Collapse
|
10
|
Pivirotto AM, Platt A, Patel R, Kumar S, Hey J. Analyses of allele age and fitness impact reveal human beneficial alleles to be older than neutral controls. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.09.561569. [PMID: 37873438 PMCID: PMC10592680 DOI: 10.1101/2023.10.09.561569] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
A classic population genetic prediction is that alleles experiencing directional selection should swiftly traverse allele frequency space, leaving detectable reductions in genetic variation in linked regions. However, despite this expectation, identifying clear footprints of beneficial allele passage has proven to be surprisingly challenging. We addressed the basic premise underlying this expectation by estimating the ages of large numbers of beneficial and deleterious alleles in a human population genomic data set. Deleterious alleles were found to be young, on average, given their allele frequency. However, beneficial alleles were older on average than non-coding, non-regulatory alleles of the same frequency. This finding is not consistent with directional selection and instead indicates some type of balancing selection. Among derived beneficial alleles, those fixed in the population show higher local recombination rates than those still segregating, consistent with a model in which new beneficial alleles experience an initial period of balancing selection due to linkage disequilibrium with deleterious recessive alleles. Alleles that ultimately fix following a period of balancing selection will leave a modest 'soft' sweep impact on the local variation, consistent with the overall paucity of species-wide 'hard' sweeps in human genomes.
Collapse
Affiliation(s)
| | - Alexander Platt
- Temple University, Department of Biology, Philadelphia PA 19122, USA
- University of Pennsylvania, Department of Genetics, Philadelphia PA 19104, USA
| | - Ravi Patel
- Temple University, Department of Biology, Philadelphia PA 19122, USA
- Institute for Genomics and Evolutionary Medicine, Temple University, PA 19122, USA
| | - Sudhir Kumar
- Temple University, Department of Biology, Philadelphia PA 19122, USA
- Institute for Genomics and Evolutionary Medicine, Temple University, PA 19122, USA
| | - Jody Hey
- Temple University, Department of Biology, Philadelphia PA 19122, USA
| |
Collapse
|
11
|
Zhao S, Chi L, Chen H. CEGA: a method for inferring natural selection by comparative population genomic analysis across species. Genome Biol 2023; 24:219. [PMID: 37789379 PMCID: PMC10548728 DOI: 10.1186/s13059-023-03068-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 09/20/2023] [Indexed: 10/05/2023] Open
Abstract
We developed maximum likelihood method for detecting positive selection or balancing selection using multilocus or genomic polymorphism and divergence data from two species. The method is especially useful for investigating natural selection in noncoding regions. Simulations demonstrate that the method outperforms existing methods in detecting both positive and balancing selection. We apply the method to population genomic data from human and chimpanzee. The list of genes identified under selection in the noncoding regions is prominently enriched in pathways related to the brain and nervous system. Therefore, our method will serve as a useful tool for comparative population genomic analysis.
Collapse
Affiliation(s)
- Shilei Zhao
- CAS Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101, China
- China National Center for Bioinformation, Beijing, 100101, China
- School of Future Technology, College of Life Sciences and Sino-Danish College, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Lianjiang Chi
- CAS Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101, China
- China National Center for Bioinformation, Beijing, 100101, China
| | - Hua Chen
- CAS Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101, China.
- China National Center for Bioinformation, Beijing, 100101, China.
- School of Future Technology, College of Life Sciences and Sino-Danish College, University of Chinese Academy of Sciences, Beijing, 100049, China.
- CAS Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, 650223, China.
| |
Collapse
|
12
|
Gao L, Kantar MB, Moxley D, Ortiz-Barrientos D, Rieseberg LH. Crop adaptation to climate change: An evolutionary perspective. MOLECULAR PLANT 2023; 16:1518-1546. [PMID: 37515323 DOI: 10.1016/j.molp.2023.07.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 06/20/2023] [Accepted: 07/26/2023] [Indexed: 07/30/2023]
Abstract
The disciplines of evolutionary biology and plant and animal breeding have been intertwined throughout their development, with responses to artificial selection yielding insights into the action of natural selection and evolutionary biology providing statistical and conceptual guidance for modern breeding. Here we offer an evolutionary perspective on a grand challenge of the 21st century: feeding humanity in the face of climate change. We first highlight promising strategies currently under way to adapt crops to current and future climate change. These include methods to match crop varieties with current and predicted environments and to optimize breeding goals, management practices, and crop microbiomes to enhance yield and sustainable production. We also describe the promise of crop wild relatives and recent technological innovations such as speed breeding, genomic selection, and genome editing for improving environmental resilience of existing crop varieties or for developing new crops. Next, we discuss how methods and theory from evolutionary biology can enhance these existing strategies and suggest novel approaches. We focus initially on methods for reconstructing the evolutionary history of crops and their pests and symbionts, because such historical information provides an overall framework for crop-improvement efforts. We then describe how evolutionary approaches can be used to detect and mitigate the accumulation of deleterious mutations in crop genomes, identify alleles and mutations that underlie adaptation (and maladaptation) to agricultural environments, mitigate evolutionary trade-offs, and improve critical proteins. Continuing feedback between the evolution and crop biology communities will ensure optimal design of strategies for adapting crops to climate change.
Collapse
Affiliation(s)
- Lexuan Gao
- CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai 200032, China
| | - Michael B Kantar
- Department of Tropical Plant & Soil Sciences, University of Hawaii at Manoa, Honolulu, HI, USA
| | - Dylan Moxley
- Department of Botany and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada
| | - Daniel Ortiz-Barrientos
- School of Biological Sciences and Australian Research Council Centre of Excellence for Plant Success in Nature and Agriculture, The University of Queensland, Brisbane, QLD, Australia
| | - Loren H Rieseberg
- Department of Botany and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada.
| |
Collapse
|
13
|
Tong X, Chen D, Hu J, Lin S, Ling Z, Ai H, Zhang Z, Huang L. Accurate haplotype construction and detection of selection signatures enabled by high quality pig genome sequences. Nat Commun 2023; 14:5126. [PMID: 37612277 PMCID: PMC10447580 DOI: 10.1038/s41467-023-40434-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 07/27/2023] [Indexed: 08/25/2023] Open
Abstract
High-quality whole-genome resequencing in large-scale pig populations with pedigree structure and multiple breeds would enable accurate construction of haplotype and robust selection-signature detection. Here, we sequence 740 pigs, combine with 149 of our previously published resequencing data, retrieve 207 resequencing datasets, and form a panel of worldwide distributed wild boars, aboriginal and highly selected pigs with pedigree structures, amounting to 1096 genomes from 43 breeds. Combining with their haplotype-informative reads and pedigree structure, we accurately construct a panel of 1874 haploid genomes with 41,964,356 genetic variants. We further demonstrate its valuable applications in GWAS by identifying five novel loci for intramuscular fat content, and in genomic selection by increasing the accuracy of estimated breeding value by 36.7%. In evolutionary selection, we detect MUC13 gene under a long-term balancing selection, as well as NPR3 gene under positive selection for pig stature. Our study provides abundant genomic variations for robust selection-signature detection and accurate haplotypes for deciphering complex traits in pigs.
Collapse
Affiliation(s)
- Xinkai Tong
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China
- College of Life Sciences, Jiangxi Normal University, NanChang, Jiangxi Province, PR China
| | - Dong Chen
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China
| | - Jianchao Hu
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China
| | - Shiyao Lin
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China
| | - Ziqi Ling
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China
| | - Huashui Ai
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China
| | - Zhiyan Zhang
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China.
| | - Lusheng Huang
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China.
| |
Collapse
|
14
|
Teterina AA, Willis JH, Lukac M, Jovelin R, Cutter AD, Phillips PC. Genomic diversity landscapes in outcrossing and selfing Caenorhabditis nematodes. PLoS Genet 2023; 19:e1010879. [PMID: 37585484 PMCID: PMC10461856 DOI: 10.1371/journal.pgen.1010879] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 08/28/2023] [Accepted: 07/21/2023] [Indexed: 08/18/2023] Open
Abstract
Caenorhabditis nematodes form an excellent model for studying how the mode of reproduction affects genetic diversity, as some species reproduce via outcrossing whereas others can self-fertilize. Currently, chromosome-level patterns of diversity and recombination are only available for self-reproducing Caenorhabditis, making the generality of genomic patterns across the genus unclear given the profound potential influence of reproductive mode. Here we present a whole-genome diversity landscape, coupled with a new genetic map, for the outcrossing nematode C. remanei. We demonstrate that the genomic distribution of recombination in C. remanei, like the model nematode C. elegans, shows high recombination rates on chromosome arms and low rates toward the central regions. Patterns of genetic variation across the genome are also similar between these species, but differ dramatically in scale, being tenfold greater for C. remanei. Historical reconstructions of variation in effective population size over the past million generations echo this difference in polymorphism. Evolutionary simulations demonstrate how selection, recombination, mutation, and selfing shape variation along the genome, and that multiple drivers can produce patterns similar to those observed in natural populations. The results illustrate how genome organization and selection play a crucial role in shaping the genomic pattern of diversity whereas demographic processes scale the level of diversity across the genome as a whole.
Collapse
Affiliation(s)
- Anastasia A. Teterina
- Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, United States of America
- Center of Parasitology, Severtsov Institute of Ecology and Evolution RAS, Moscow, Russia
| | - John H. Willis
- Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, United States of America
| | - Matt Lukac
- Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, United States of America
| | - Richard Jovelin
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
| | - Asher D. Cutter
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
| | - Patrick C. Phillips
- Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, United States of America
| |
Collapse
|
15
|
Nandakumar M, Lundberg M, Carlsson F, Råberg L. Balancing selection on the complement system of a wild rodent. BMC Ecol Evol 2023; 23:21. [PMID: 37231383 DOI: 10.1186/s12862-023-02122-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 05/10/2023] [Indexed: 05/27/2023] Open
Abstract
BACKGROUND Selection pressure exerted by pathogens can influence patterns of genetic diversity in the host. In the immune system especially, numerous genes encode proteins involved in antagonistic interactions with pathogens, paving the way for coevolution that results in increased genetic diversity as a consequence of balancing selection. The complement system is a key component of innate immunity. Many complement proteins interact directly with pathogens, either by recognising pathogen molecules for complement activation, or by serving as targets of pathogen immune evasion mechanisms. Complement genes can therefore be expected to be important targets of pathogen-mediated balancing selection, but analyses of such selection on this part of the immune system have been limited. RESULTS Using a population sample of whole-genome resequencing data from wild bank voles (n = 31), we estimated the extent of genetic diversity and tested for signatures of balancing selection in multiple complement genes (n = 44). Complement genes showed higher values of standardised β (a statistic expected to be high under balancing selection) than the genome-wide average of protein coding genes. One complement gene, FCNA, a pattern recognition molecule that interacts directly with pathogens, was found to have a signature of balancing selection, as indicated by the Hudson-Kreitman-Aguadé test (HKA) test. Scans for localised signatures of balancing selection in this gene indicated that the target of balancing selection was found in exonic regions involved in ligand binding. CONCLUSION The present study adds to the growing evidence that balancing selection may be an important evolutionary force on components of the innate immune system. The identified target in the complement system typifies the expectation that balancing selection acts on genes encoding proteins involved in direct interactions with pathogens.
Collapse
Affiliation(s)
| | - Max Lundberg
- Department of Biology, Lund University, Lund, Sweden
| | | | - Lars Råberg
- Department of Biology, Lund University, Lund, Sweden
| |
Collapse
|
16
|
Huang G, Wu W, Chen Y, Zhi X, Zou P, Ning Z, Fan Q, Liu Y, Deng S, Zeng K, Zhou R. Balancing selection on an MYB transcription factor maintains the twig trichome color variation in Melastoma normale. BMC Biol 2023; 21:122. [PMID: 37226197 DOI: 10.1186/s12915-023-01611-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 05/03/2023] [Indexed: 05/26/2023] Open
Abstract
BACKGROUND The factors that maintain phenotypic and genetic variation within a population have received long-term attention in evolutionary biology. Here the genetic basis and evolution of the geographically widespread variation in twig trichome color (from red to white) in a shrub Melastoma normale was investigated using Pool-seq and evolutionary analyses. RESULTS The results show that the twig trichome coloration is under selection in different light environments and that a 6-kb region containing an R2R3 MYB transcription factor gene is the major region of divergence between the extreme red and white morphs. This gene has two highly divergent groups of alleles, one of which likely originated from introgression from another species in this genus and has risen to high frequency (> 0.6) within each of the three populations under investigation. In contrast, polymorphisms in other regions of the genome show no sign of differentiation between the two morphs, suggesting that genomic patterns of diversity have been shaped by homogenizing gene flow. Population genetics analysis reveals signals of balancing selection acting on this gene, and it is suggested that spatially varying selection is the most likely mechanism of balancing selection in this case. CONCLUSIONS This study demonstrate that polymorphisms on a single transcription factor gene largely confer the twig trichome color variation in M. normale, while also explaining how adaptive divergence can occur and be maintained in the face of gene flow.
Collapse
Affiliation(s)
- Guilian Huang
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China
| | - Wei Wu
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China
| | - Yongmei Chen
- College of Chemical Engineering, Sichuan University of Science & Engineering, Zigong, Sichuan, 643000, China
| | - Xueke Zhi
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China
| | - Peishan Zou
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China
| | - Zulin Ning
- Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
| | - Qiang Fan
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China
| | - Ying Liu
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China
| | - Shulin Deng
- Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
| | - Kai Zeng
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, UK.
| | - Renchao Zhou
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China.
| |
Collapse
|
17
|
Tunström K, Woronik A, Hanly JJ, Rastas P, Chichvarkhin A, Warren AD, Kawahara AY, Schoville SD, Ficarrotta V, Porter AH, Watt WB, Martin A, Wheat CW. Evidence for a single, ancient origin of a genus-wide alternative life history strategy. SCIENCE ADVANCES 2023; 9:eabq3713. [PMID: 36947619 PMCID: PMC10032607 DOI: 10.1126/sciadv.abq3713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Accepted: 02/21/2023] [Indexed: 06/18/2023]
Abstract
Understanding the evolutionary origins and factors maintaining alternative life history strategies (ALHS) within species is a major goal of evolutionary research. While alternative alleles causing discrete ALHS are expected to purge or fix over time, one-third of the ~90 species of Colias butterflies are polymorphic for a female-limited ALHS called Alba. Whether Alba arose once, evolved in parallel, or has been exchanged among taxa is currently unknown. Using comparative genome-wide association study (GWAS) and population genomic analyses, we placed the genetic basis of Alba in time-calibrated phylogenomic framework, revealing that Alba evolved once near the base of the genus and has been subsequently maintained via introgression and balancing selection. CRISPR-Cas9 mutagenesis was then used to verify a putative cis-regulatory region of Alba, which we identified using phylogenetic foot printing. We hypothesize that this cis-regulatory region acts as a modular enhancer for the induction of the Alba ALHS, which has likely facilitated its long evolutionary persistence.
Collapse
Affiliation(s)
- Kalle Tunström
- Department of Zoology, Stockholm University, Stockholm, Sweden
| | - Alyssa Woronik
- Department of Zoology, Stockholm University, Stockholm, Sweden
- Department of Biology, Sacred Heart University, Fairfield, CT, USA
| | - Joseph J. Hanly
- Department of Biological Sciences, The George Washington University, Washington, DC, USA
| | - Pasi Rastas
- Institute of Biotechnology, University of Helsinki, 00014 Helsinki, Finland
| | - Anton Chichvarkhin
- National Scientific Center of Marine Biology, Far Eastern Branch of Russian Academy of Sciences, Palchevskogo 17, Vladivostok 690022, Russia
| | - Andrew D. Warren
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural History, University of Florida, Gainesville, FL 32611, USA
| | - Akito Y. Kawahara
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural History, University of Florida, Gainesville, FL 32611, USA
| | - Sean D. Schoville
- Department of Entomology, University of Wisconsin-Madison, Madison, WI, USA
| | - Vincent Ficarrotta
- Department of Biological Sciences, The George Washington University, Washington, DC, USA
| | - Adam H. Porter
- Department of Biology, University of Massachusetts Amherst, Amherst, MA 01003, USA
| | - Ward B. Watt
- Department of Biology, University of South Carolina, Columbia, SC 29208, USA
- Rocky Mountain Biological Laboratory, Crested Butte, CO 81224, USA
| | - Arnaud Martin
- Department of Biological Sciences, The George Washington University, Washington, DC, USA
| | | |
Collapse
|
18
|
Wang PY, Yang Y, Shi XQ, Chen Y, Liu SD, Wang HY, Peng T, Shi Q, Zhang W, Sun C. Distilling functional variations for human UGT2B4 upstream region based on selection signals and implications for phenotypes of Neanderthal and Denisovan. Sci Rep 2023; 13:3134. [PMID: 36823244 PMCID: PMC9950360 DOI: 10.1038/s41598-023-29682-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Accepted: 02/08/2023] [Indexed: 02/25/2023] Open
Abstract
Our previous work identified one region upstream human UGT2B4 (UDP glucuronosyltransferase family 2 member B4) which is associated with breast cancer and under balancing selection. However, the distribution, functional variation and molecular mechanism underlying breast cancer and balancing selection remain unclear. In current study, the two haplotypes with deep divergence are described by analyzing 1000 genomes project data and observed to be with high frequencies in all human populations. Through population genetics analysis and genome annotation, the potential functional region is identified and verified by reporter gene assay. Further mutagenesis indicates that the functional mutations are rs66862535 and rs68096061. Both SNPs can alter the interaction efficiency of transcription factor POU2F1 (POU class 2 homeobox 1). Through chromosome conformation capture, it is identified that the enhancer containing these two SNPs can interact with UGT2B4 promoter. Expression quantitative trait loci analysis indicates that UGT2B4 expression is dependent on the genotype of this locus. The common haplotype in human is lost in four genomes of archaic hominins, which suggests that Neanderthal and Denisovan should present relatively lower UGT2B4 expression and further higher steroid hormone level. This study provides new insight into the contribution of ancient population structure to human phenotypes.
Collapse
Affiliation(s)
- Pin-Yi Wang
- grid.412498.20000 0004 1759 8395College of Life Sciences, Shaanxi Normal University, Xi’an, 710119 Shaanxi People’s Republic of China ,grid.440773.30000 0000 9342 2456State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, 650091 Yunnan People’s Republic of China
| | - Yuan Yang
- grid.412498.20000 0004 1759 8395College of Life Sciences, Shaanxi Normal University, Xi’an, 710119 Shaanxi People’s Republic of China
| | - Xiao-Qian Shi
- grid.412498.20000 0004 1759 8395College of Life Sciences, Shaanxi Normal University, Xi’an, 710119 Shaanxi People’s Republic of China
| | - Ying Chen
- grid.412498.20000 0004 1759 8395College of Life Sciences, Shaanxi Normal University, Xi’an, 710119 Shaanxi People’s Republic of China
| | - Shao-Dong Liu
- grid.412498.20000 0004 1759 8395College of Life Sciences, Shaanxi Normal University, Xi’an, 710119 Shaanxi People’s Republic of China
| | - Hong-Yan Wang
- grid.412498.20000 0004 1759 8395College of Life Sciences, Shaanxi Normal University, Xi’an, 710119 Shaanxi People’s Republic of China
| | - Tao Peng
- grid.440773.30000 0000 9342 2456State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, 650091 Yunnan People’s Republic of China
| | - Qiang Shi
- grid.412498.20000 0004 1759 8395College of Life Sciences, Shaanxi Normal University, Xi’an, 710119 Shaanxi People’s Republic of China
| | - Wei Zhang
- grid.16753.360000 0001 2299 3507Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL 60611 USA ,grid.449428.70000 0004 1797 7280Institute of Precision Medicine, Jining Medical University, Jining, 272067 Shandong People’s Republic of China
| | - Chang Sun
- College of Life Sciences, Shaanxi Normal University, Xi'an, 710119, Shaanxi, People's Republic of China.
| |
Collapse
|
19
|
Aqil A, Speidel L, Pavlidis P, Gokcumen O. Balancing selection on genomic deletion polymorphisms in humans. eLife 2023; 12:79111. [PMID: 36625544 PMCID: PMC9943071 DOI: 10.7554/elife.79111] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 01/05/2023] [Indexed: 01/11/2023] Open
Abstract
A key question in biology is why genomic variation persists in a population for extended periods. Recent studies have identified examples of genomic deletions that have remained polymorphic in the human lineage for hundreds of millennia, ostensibly owing to balancing selection. Nevertheless, genome-wide investigation of ancient and possibly adaptive deletions remains an imperative exercise. Here, we demonstrate an excess of polymorphisms in present-day humans that predate the modern human-Neanderthal split (ancient polymorphisms), which cannot be explained solely by selectively neutral scenarios. We analyze the adaptive mechanisms that underlie this excess in deletion polymorphisms. Using a previously published measure of balancing selection, we show that this excess of ancient deletions is largely owing to balancing selection. Based on the absence of signatures of overdominance, we conclude that it is a rare mode of balancing selection among ancient deletions. Instead, more complex scenarios involving spatially and temporally variable selective pressures are likely more common mechanisms. Our results suggest that balancing selection resulted in ancient deletions harboring disproportionately more exonic variants with GWAS (genome-wide association studies) associations. We further found that ancient deletions are significantly enriched for traits related to metabolism and immunity. As a by-product of our analysis, we show that deletions are, on average, more deleterious than single nucleotide variants. We can now argue that not only is a vast majority of common variants shared among human populations, but a considerable portion of biologically relevant variants has been segregating among our ancestors for hundreds of thousands, if not millions, of years.
Collapse
Affiliation(s)
- Alber Aqil
- Department of Biological Sciences, University at BuffaloBuffaloUnited States
| | - Leo Speidel
- University College London, Genetics InstituteLondonUnited Kingdom
- The Francis Crick InstituteLondonUnited Kingdom
| | - Pavlos Pavlidis
- Institute of Computer Science (ICS), Foundation of Research and Technology-HellasHeraklionGreece
| | - Omer Gokcumen
- Department of Biological Sciences, University at BuffaloBuffaloUnited States
| |
Collapse
|
20
|
Abraham A, LaBella AL, Capra JA, Rokas A. Mosaic patterns of selection in genomic regions associated with diverse human traits. PLoS Genet 2022; 18:e1010494. [PMID: 36342969 PMCID: PMC9671423 DOI: 10.1371/journal.pgen.1010494] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 11/17/2022] [Accepted: 10/21/2022] [Indexed: 11/09/2022] Open
Abstract
Natural selection shapes the genetic architecture of many human traits. However, the prevalence of different modes of selection on genomic regions associated with variation in traits remains poorly understood. To address this, we developed an efficient computational framework to calculate positive and negative enrichment of different evolutionary measures among regions associated with complex traits. We applied the framework to summary statistics from >900 genome-wide association studies (GWASs) and 11 evolutionary measures of sequence constraint, population differentiation, and allele age while accounting for linkage disequilibrium, allele frequency, and other potential confounders. We demonstrate that this framework yields consistent results across GWASs with variable sample sizes, numbers of trait-associated SNPs, and analytical approaches. The resulting evolutionary atlas maps diverse signatures of selection on genomic regions associated with complex human traits on an unprecedented scale. We detected positive enrichment for sequence conservation among trait-associated regions for the majority of traits (>77% of 290 high power GWASs), which included reproductive traits. Many traits also exhibited substantial positive enrichment for population differentiation, especially among hair, skin, and pigmentation traits. In contrast, we detected widespread negative enrichment for signatures of balancing selection (51% of GWASs) and absence of enrichment for evolutionary signals in regions associated with late-onset Alzheimer's disease. These results support a pervasive role for negative selection on regions of the human genome that contribute to variation in complex traits, but also demonstrate that diverse modes of evolution are likely to have shaped trait-associated loci. This atlas of evolutionary signatures across the diversity of available GWASs will enable exploration of the relationship between the genetic architecture and evolutionary processes in the human genome.
Collapse
Affiliation(s)
- Abin Abraham
- Vanderbilt University Medical Center, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Abigail L. LaBella
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, North Carolina, United States of America
- North Carolina Research Center, Kannapolis, North Carolina, United States of America
| | - John A. Capra
- Bakar Computational Health Sciences Institute, University of California, San Francisco, California, United States of America
- Department of Epidemiology and Biostatistics, University of California, San Francisco, California, United States of America
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, Tennessee, United States of America
| |
Collapse
|
21
|
Schield DR, Perry BW, Adams RH, Holding ML, Nikolakis ZL, Gopalan SS, Smith CF, Parker JM, Meik JM, DeGiorgio M, Mackessy SP, Castoe TA. The roles of balancing selection and recombination in the evolution of rattlesnake venom. Nat Ecol Evol 2022; 6:1367-1380. [PMID: 35851850 PMCID: PMC9888523 DOI: 10.1038/s41559-022-01829-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Accepted: 06/15/2022] [Indexed: 02/02/2023]
Abstract
The origin of snake venom involved duplication and recruitment of non-venom genes into venom systems. Several studies have predicted that directional positive selection has governed this process. Venom composition varies substantially across snake species and venom phenotypes are locally adapted to prey, leading to coevolutionary interactions between predator and prey. Venom origins and contemporary snake venom evolution may therefore be driven by fundamentally different selection regimes, yet investigations of population-level patterns of selection have been limited. Here, we use whole-genome data from 68 rattlesnakes to test hypotheses about the factors that drive genomic diversity and differentiation in major venom gene regions. We show that selection has resulted in long-term maintenance of genetic diversity within and between species in multiple venom gene families. Our findings are inconsistent with a dominant role of directional positive selection and instead support a role of long-term balancing selection in shaping venom evolution. We also detect rapid decay of linkage disequilibrium due to high recombination rates in venom regions, suggesting that venom genes have reduced selective interference with nearby loci, including other venom paralogues. Our results provide an example of long-term balancing selection that drives trans-species polymorphism and help to explain how snake venom keeps pace with prey resistance.
Collapse
Affiliation(s)
- Drew R Schield
- Department of Biology, University of Texas at Arlington, Arlington, TX, USA.
- Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, CO, USA.
| | - Blair W Perry
- Department of Biology, University of Texas at Arlington, Arlington, TX, USA
- School of Biological Sciences, Washington State University, Pullman, WA, USA
| | - Richard H Adams
- Department of Biological and Environmental Sciences, Georgia College and State University, Milledgeville, GA, USA
| | | | | | | | - Cara F Smith
- School of Biological Sciences, University of Northern Colorado, Greeley, CO, USA
| | - Joshua M Parker
- Life Science Department, Fresno City College, Fresno, CA, USA
| | - Jesse M Meik
- Department of Biological Sciences, Tarleton State University, Stephenville, TX, USA
| | - Michael DeGiorgio
- Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL, USA
| | - Stephen P Mackessy
- School of Biological Sciences, University of Northern Colorado, Greeley, CO, USA
| | - Todd A Castoe
- Department of Biology, University of Texas at Arlington, Arlington, TX, USA.
| |
Collapse
|
22
|
Jeong H, Baran NM, Sun D, Chatterjee P, Layman TS, Balakrishnan CN, Maney DL, Yi SV. Dynamic molecular evolution of a supergene with suppressed recombination in white-throated sparrows. eLife 2022; 11:e79387. [PMID: 36040313 PMCID: PMC9427109 DOI: 10.7554/elife.79387] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2022] [Accepted: 08/17/2022] [Indexed: 12/11/2022] Open
Abstract
In white-throated sparrows, two alternative morphs differing in plumage and behavior segregate with a large chromosomal rearrangement. As with sex chromosomes such as the mammalian Y, the rearranged version of chromosome two (ZAL2m) is in a near-constant state of heterozygosity, offering opportunities to investigate both degenerative and selective processes during the early evolutionary stages of 'supergenes.' Here, we generated, synthesized, and analyzed extensive genome-scale data to better understand the forces shaping the evolution of the ZAL2 and ZAL2m chromosomes in this species. We found that features of ZAL2m are consistent with substantially reduced recombination and low levels of degeneration. We also found evidence that selective sweeps took place both on ZAL2m and its standard counterpart, ZAL2, after the rearrangement event. Signatures of positive selection were associated with allelic bias in gene expression, suggesting that antagonistic selection has operated on gene regulation. Finally, we discovered a region exhibiting long-range haplotypes inside the rearrangement on ZAL2m. These haplotypes appear to have been maintained by balancing selection, retaining genetic diversity within the supergene. Together, our analyses illuminate mechanisms contributing to the evolution of a young chromosomal polymorphism, revealing complex selective processes acting concurrently with genetic degeneration to drive the evolution of supergenes.
Collapse
Affiliation(s)
- Hyeonsoo Jeong
- School of Biological Sciences, Georgia Institute of TechnologyAtlantaUnited States
| | - Nicole M Baran
- School of Biological Sciences, Georgia Institute of TechnologyAtlantaUnited States
- Department of Psychology, Emory UniversityAtlantaUnited States
- Department of Ecology, Evolution, Marine Biology, University of California, Santa BarbaraSanta BarbaraUnited States
| | - Dan Sun
- School of Biological Sciences, Georgia Institute of TechnologyAtlantaUnited States
- Department of Medicine Huddinge, Karolinska InstitutetStockholmSweden
| | - Paramita Chatterjee
- School of Biological Sciences, Georgia Institute of TechnologyAtlantaUnited States
| | - Thomas S Layman
- School of Biological Sciences, Georgia Institute of TechnologyAtlantaUnited States
| | | | - Donna L Maney
- Department of Psychology, Emory UniversityAtlantaUnited States
| | - Soojin V Yi
- School of Biological Sciences, Georgia Institute of TechnologyAtlantaUnited States
- Department of Ecology, Evolution, Marine Biology, University of California, Santa BarbaraSanta BarbaraUnited States
| |
Collapse
|
23
|
Epstein B, Burghardt LT, Heath KD, Grillo MA, Kostanecki A, Hämälä T, Young ND, Tiffin P. Combining GWAS and population genomic analyses to characterize coevolution in a legume-rhizobia symbiosis. Mol Ecol 2022. [PMID: 35793264 DOI: 10.1111/mec.16602] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 06/03/2022] [Accepted: 07/04/2022] [Indexed: 11/28/2022]
Abstract
The mutualism between legumes and rhizobia is clearly the product of past coevolution. However, the nature of ongoing evolution between these partners is less clear. To characterize the nature of recent coevolution between legumes and rhizobia, we used population genomic analysis to characterize selection on functionally annotated symbiosis genes as well as on symbiosis gene candidates identified through a two-species association analysis. For the association analysis, we inoculated each of 202 accessions of the legume host Medicago truncatula with a community of 88 Sinorhizobia (Ensifer) meliloti strains. Multistrain inoculation, which better reflects the ecological reality of rhizobial selection in nature than single-strain inoculation, allows strains to compete for nodulation opportunities and host resources and for hosts to preferentially form nodules and provide resources to some strains. We found extensive host by symbiont, that is, genotype-by-genotype, effects on rhizobial fitness and some annotated rhizobial genes bear signatures of recent positive selection. However, neither genes responsible for this variation nor annotated host symbiosis genes are enriched for signatures of either positive or balancing selection. This result suggests that stabilizing selection dominates selection acting on symbiotic traits and that variation in these traits is under mutation-selection balance. Consistent with the lack of positive selection acting on host genes, we found that among-host variation in growth was similar whether plants were grown with rhizobia or N-fertilizer, suggesting that the symbiosis may not be a major driver of variation in plant growth in multistrain contexts.
Collapse
Affiliation(s)
- Brendan Epstein
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota, USA
| | - Liana T Burghardt
- Department of Plant Sciences, The University of Pennsylvania, University Park, Pennsylvania, USA
| | - Katy D Heath
- Department of Plant Biology, University of Illinois, Urbana, Illinois, USA.,Carl R. Woese Institute for Genomic Biology, University of Illinois, Urbana, Illinois, USA
| | - Michael A Grillo
- Department of Biology, Loyola University Chicago, Chicago, Illinois, USA
| | - Adam Kostanecki
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota, USA
| | - Tuomas Hämälä
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota, USA.,School of Life Sciences, University of Nottingham, Nottingham, UK
| | - Nevin D Young
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota, USA.,Department of Plant Pathology, University of Minnesota, St. Paul, Minnesota, USA
| | - Peter Tiffin
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota, USA
| |
Collapse
|
24
|
Velazquez-Arcelay K, Benton ML, Capra JA. Diverse functions associate with non-coding polymorphisms shared between humans and chimpanzees. BMC Ecol Evol 2022; 22:68. [PMID: 35606693 PMCID: PMC9125839 DOI: 10.1186/s12862-022-02020-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2022] [Accepted: 05/09/2022] [Indexed: 11/24/2022] Open
Abstract
Background Long-term balancing selection (LTBS) can maintain allelic variation at a locus over millions of years and through speciation events. Variants shared between species in the state of identity-by-descent, hereafter “trans-species polymorphisms”, can result from LTBS, often due to host–pathogen interactions. For instance, the major histocompatibility complex (MHC) locus contains TSPs present across primates. Several hundred candidate LTBS regions have been identified in humans and chimpanzees; however, because many are in non-protein-coding regions of the genome, the functions and potential adaptive roles for most remain unknown. Results We integrated diverse genomic annotations to explore the functions of 60 previously identified regions with multiple shared polymorphisms (SPs) between humans and chimpanzees, including 19 with strong evidence of LTBS. We analyzed genome-wide functional assays, expression quantitative trait loci (eQTL), genome-wide association studies (GWAS), and phenome-wide association studies (PheWAS) for all the regions. We identify functional annotations for 59 regions, including 58 with evidence of gene regulatory function from GTEx or functional genomics data and 19 with evidence of trait association from GWAS or PheWAS. As expected, the SPs associate in humans with many immune system phenotypes, including response to pathogens, but we also find associations with a range of other phenotypes, including body size, alcohol intake, cognitive performance, risk-taking behavior, and urate levels. Conclusions The diversity of traits associated with non-coding regions with multiple SPs support previous hypotheses that functions beyond the immune system are likely subject to LTBS. Furthermore, several of these trait associations provide support and candidate genetic loci for previous hypothesis about behavioral diversity in human and chimpanzee populations, such as the importance of variation in risk sensitivity. Supplementary Information The online version contains supplementary material available at 10.1186/s12862-022-02020-x.
Collapse
|
25
|
Li G, Tang J, Huang J, Jiang Y, Fan Y, Wang X, Ren J. Genome-Wide Estimates of Runs of Homozygosity, Heterozygosity, and Genetic Load in Two Chinese Indigenous Goat Breeds. Front Genet 2022; 13:774196. [PMID: 35559012 PMCID: PMC9086400 DOI: 10.3389/fgene.2022.774196] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2021] [Accepted: 03/14/2022] [Indexed: 11/13/2022] Open
Abstract
Runs of homozygosity (ROH) and heterozygosity (ROHet) are windows into population demographic history and adaptive evolution. Numerous studies have shown that deleterious mutations are enriched in the ROH of humans, pigs, cattle, and chickens. However, the relationship of deleterious variants to ROH and the pattern of ROHet in goats have been largely understudied. Here, 240 Guangfeng and Ganxi goats from Jiangxi Province, China, were genotyped using the Illumina GoatSNP50 BeadChip and genome-wide ROH, ROHet, and genetic load analyses were performed in the context of 32 global goat breeds. The classes with the highest percentage of ROH and ROHet were 0.5–2 Mb and 0.5–1 Mb, respectively. The results of inbreeding coefficients (based on SNP and ROH) and ROHet measurements showed that Guangfeng goats had higher genetic variability than most Chinese goats, while Ganxi goats had a high degree of inbreeding, even exceeding that of commercial goat breeds. Next, the predicted damaging homozygotes were more enriched in long ROHs, especially in Guangfeng goats. Therefore, we suggest that information on damaging alleles should also be incorporated into the design of breeding and conservation programs. A list of genes related to fecundity, growth, and environmental adaptation were identified in the ROH hotspots of two Jiangxi goats. A sense-related ROH hotspot (chromosome 12: 50.55–50.81 Mb) was shared across global goat breeds and may have undergone selection prior to goat domestication. Furthermore, an identical ROHet hotspot (chromosome 1: 132.21–132.54 Mb) containing two genes associated with embryonic development (STAG1 and PCCB) was detected in domestic goat breeds worldwide. Tajima’s D and BetaScan2 statistics indicated that this region may be caused by long-term balancing selection. These findings not only provide guidance for the design of conservation strategies for Jiangxi goat breeds but also enrich our understanding of the adaptive evolution of goats.
Collapse
Affiliation(s)
- Guixin Li
- Guangdong Laboratory for Lingnan Modern Agriculture, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Jianhong Tang
- Guangdong Laboratory for Lingnan Modern Agriculture, College of Animal Science, South China Agricultural University, Guangzhou, China.,Laboratory Animal Engineering Research Center of Ganzhou, Gannan Medical University, Ganzhou, China
| | - Jinyan Huang
- Guangdong Laboratory for Lingnan Modern Agriculture, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Yongchuang Jiang
- Guangdong Laboratory for Lingnan Modern Agriculture, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Yin Fan
- Department of Animal Science, Jiangxi Biotech Vocational College, Nanchang, China
| | - Xiaopeng Wang
- Guangdong Laboratory for Lingnan Modern Agriculture, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Jun Ren
- Guangdong Laboratory for Lingnan Modern Agriculture, College of Animal Science, South China Agricultural University, Guangzhou, China
| |
Collapse
|
26
|
Allingham J, Floriano WB. Genetic diversity in the IZUMO1-JUNO protein-receptor pair involved in human reproduction. PLoS One 2021; 16:e0260692. [PMID: 34879103 PMCID: PMC8654184 DOI: 10.1371/journal.pone.0260692] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Accepted: 11/15/2021] [Indexed: 11/19/2022] Open
Abstract
Fertilization in mammals begins with the union of egg and sperm, an event that starts a cascade of cellular processes. The molecular-level understanding of these processes can guide the development of new strategies for controlling and/or promoting fertilization, and inform researchers and medical professional on the best choice of interventions. The proteins encoded by the IZUMO1 and JUNO genes form a ligand-receptor protein pair involved in the recognition of sperm and egg. Due to their role in the fertilization process, these proteins are potential targets for the development of novel anti-contraceptive, as well as infertility treatments. Here we present a comprehensive analysis of these gene sequences, with the objective of identifying evolutionary patterns that may support their relevance as targets for preventing or improving fertility among humans. JUNO and IZUMO1 gene sequences were identified within the genomes of over 2,000 humans sequenced in the 1000 Genomes Project. The human sequences were subjected to analyses of nucleotide diversity, deviation from neutrality of genetic variation, population-based differentiation (FST), haplotype inference, and whole chromosome scanning for signals of positive or of balancing selection. Derived alleles were determined by comparison to archaic hominin and other primate genomes. The potential effect of common non-synonymous variants on protein-protein interaction was also assessed. IZUMO1 displays higher variability among human individuals than JUNO. Genetic differentiation between continental population pairs was within whole-genome estimates for all but the JUNO gene in the African population group with respect to the other 4 population groups (American, East Asian, South Asian, and European). Tajima’s D values demonstrated deviation from neutrality for both genes in comparison to a group of genes identified in the literature as under balancing or positive selection. Tajima’s D for IZUMO1 aligns with values calculated for genes presumed to be under balancing selection, whereas JUNO’s value aligned with genes presumed to be under positive selection. These inferences on selection are both supported by SNP density, nucleotide diversity and haplotype analysis. A JUNO haplotype carrying 3 derived alleles out of 5, one of which is a missense mutation implicated in polyspermy, was found to be significant in a population of African ancestry. Polyspermy has a disadvantageous impact on fertility and its presence in approximately 30% of the population of African ancestry may be associated to a potentially beneficial role of this haplotype. This role has not been established and may be related to a non-reproductive role of JUNO. The high degree of conservation of the JUNO sequence combined with a dominant haplotype across multiple population groups supports JUNO as a potential target for the development of contraceptive treatments. In addition to providing a detailed account of human genetic diversity across these 2 important and related genes, this study also provides a framework for large population-based studies investigating protein-protein interactions at the genome level.
Collapse
Affiliation(s)
- Jessica Allingham
- Department of Chemistry, Lakehead University, Thunder Bay, Ontario, Canada
| | - Wely B. Floriano
- Department of Chemistry, Lakehead University, Thunder Bay, Ontario, Canada
- * E-mail:
| |
Collapse
|
27
|
Grace CA, Forrester S, Silva VC, Carvalho KSS, Kilford H, Chew YP, James S, Costa DL, Mottram JC, Costa CCHN, Jeffares DC. Candidates for Balancing Selection in Leishmania donovani Complex Parasites. Genome Biol Evol 2021; 13:6448231. [PMID: 34865011 PMCID: PMC8717319 DOI: 10.1093/gbe/evab265] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/22/2021] [Indexed: 12/19/2022] Open
Abstract
The Leishmania donovani species complex is the causative agent of visceral leishmaniasis, which cause 20–40,000 fatalities a year. Here, we conduct a screen for balancing selection in this species complex. We used 384 publicly available L. donovani and L. infantum genomes, and sequence 93 isolates of L. infantum from Brazil to describe the global diversity of this species complex. We identify five genetically distinct populations that are sufficiently represented by genomic data to search for signatures of selection. We find that signals of balancing selection are generally not shared between populations, consistent with transient adaptive events, rather than long-term balancing selection. We then apply multiple diversity metrics to identify candidate genes with robust signatures of balancing selection, identifying a curated set of 24 genes with robust signatures. These include zeta toxin, nodulin-like, and flagellum attachment proteins. This study highlights the extent of genetic divergence between L. donovani complex parasites and provides genes for further study.
Collapse
Affiliation(s)
- Cooper Alastair Grace
- Department of Biology, York Biomedical Research Institute, University of York, York, United Kingdom
| | - Sarah Forrester
- Department of Biology, York Biomedical Research Institute, University of York, York, United Kingdom
| | - Vladimir Costa Silva
- Instituto de Doenças do Sertão, Instituto de Doenças Tropicais Natan Portella, Centro de Ciências da Saúde da Universidade Federal do Piauí, Teresina-PI, Brazil
| | - Kátia Silene Sousa Carvalho
- Instituto de Doenças do Sertão, Instituto de Doenças Tropicais Natan Portella, Centro de Ciências da Saúde da Universidade Federal do Piauí, Teresina-PI, Brazil
| | - Hannah Kilford
- Department of Biology, York Biomedical Research Institute, University of York, York, United Kingdom
| | - Yen Peng Chew
- Department of Biology, York Biomedical Research Institute, University of York, York, United Kingdom.,Institute of Molecular Plant Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Sally James
- Department of Biology, York Biomedical Research Institute, University of York, York, United Kingdom
| | - Dorcas L Costa
- Instituto de Doenças do Sertão, Instituto de Doenças Tropicais Natan Portella, Centro de Ciências da Saúde da Universidade Federal do Piauí, Teresina-PI, Brazil
| | - Jeremy C Mottram
- Department of Biology, York Biomedical Research Institute, University of York, York, United Kingdom
| | - Carlos C H N Costa
- Instituto de Doenças do Sertão, Instituto de Doenças Tropicais Natan Portella, Centro de Ciências da Saúde da Universidade Federal do Piauí, Teresina-PI, Brazil
| | - Daniel C Jeffares
- Department of Biology, York Biomedical Research Institute, University of York, York, United Kingdom
| |
Collapse
|
28
|
Cheng X, DeGiorgio M. BalLeRMix +: mixture model approaches for robust joint identification of both positive selection and long-term balancing selection. Bioinformatics 2021; 38:861-863. [PMID: 34664624 PMCID: PMC8756184 DOI: 10.1093/bioinformatics/btab720] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 09/13/2021] [Accepted: 10/13/2021] [Indexed: 02/03/2023] Open
Abstract
SUMMARY The growing availability of genomewide polymorphism data has fueled interest in detecting diverse selective processes affecting population diversity. However, no model-based approaches exist to jointly detect and distinguish the two complementary processes of balancing and positive selection. We extend the BalLeRMix B-statistic framework described in Cheng and DeGiorgio (2020) for detecting balancing selection and present BalLeRMix+, which implements five B statistic extensions based on mixture models to robustly identify both types of selection. BalLeRMix+ is implemented in Python and computes the composite likelihood ratios and associated model parameters for each genomic test position. AVAILABILITY AND IMPLEMENTATION BalLeRMix+ is freely available at https://github.com/bioXiaoheng/BallerMixPlus. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
29
|
Lee D, Zdraljevic S, Stevens L, Wang Y, Tanny RE, Crombie TA, Cook DE, Webster AK, Chirakar R, Baugh LR, Sterken MG, Braendle C, Félix MA, Rockman MV, Andersen EC. Balancing selection maintains hyper-divergent haplotypes in Caenorhabditis elegans. Nat Ecol Evol 2021; 5:794-807. [PMID: 33820969 PMCID: PMC8202730 DOI: 10.1038/s41559-021-01435-x] [Citation(s) in RCA: 73] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Accepted: 02/26/2021] [Indexed: 12/16/2022]
Abstract
Across diverse taxa, selfing species have evolved independently from outcrossing species thousands of times. The transition from outcrossing to selfing decreases the effective population size, effective recombination rate and heterozygosity within a species. These changes lead to a reduction in genetic diversity, and therefore adaptive potential, by intensifying the effects of random genetic drift and linked selection. Within the nematode genus Caenorhabditis, selfing has evolved at least three times, and all three species, including the model organism Caenorhabditis elegans, show substantially reduced genetic diversity relative to outcrossing species. Selfing and outcrossing Caenorhabditis species are often found in the same niches, but we still do not know how selfing species with limited genetic diversity can adapt to these environments. Here, we examine the whole-genome sequences from 609 wild C. elegans strains isolated worldwide and show that genetic variation is concentrated in punctuated hyper-divergent regions that cover 20% of the C. elegans reference genome. These regions are enriched in environmental response genes that mediate sensory perception, pathogen response and xenobiotic stress response. Population genomic evidence suggests that genetic diversity in these regions has been maintained by long-term balancing selection. Using long-read genome assemblies for 15 wild strains, we show that hyper-divergent haplotypes contain unique sets of genes and show levels of divergence comparable to levels found between Caenorhabditis species that diverged millions of years ago. These results provide an example of how species can avoid the evolutionary dead end associated with selfing.
Collapse
Affiliation(s)
- Daehan Lee
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
- Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland
| | - Stefan Zdraljevic
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
- Interdisciplinary Biological Sciences Program, Northwestern University, Evanston, IL, USA
- Department of Human Genetics, University of California, Los Angeles, CA, USA
- Howard Hughes Medical Institute, University of California, Los Angeles, CA, USA
| | - Lewis Stevens
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
| | - Ye Wang
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
- Sichuan Key Laboratory of Conservation Biology on Endangered Wildlife, Chengdu Research Base of Giant Panda Breeding, Chengdu, People's Republic of China
| | - Robyn E Tanny
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
| | - Timothy A Crombie
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
| | - Daniel E Cook
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
| | - Amy K Webster
- Department of Biology, Duke University, Durham, NC, USA
- University Program in Genetics and Genomics, Duke University, Durham, NC, USA
| | | | - L Ryan Baugh
- Department of Biology, Duke University, Durham, NC, USA
- Center for Genomic and Computational Biology, Duke University, Durham, NC, USA
| | - Mark G Sterken
- Laboratory of Nematology, Wageningen University and Research, Wageningen, the Netherlands
| | | | - Marie-Anne Félix
- Institut de Biologie de l'Ecole Normale Supérieure, Centre National de la Recherche Scientifique, INSERM, École Normale Supérieure, Paris Sciences et Lettres, Paris, France
| | - Matthew V Rockman
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA
| | - Erik C Andersen
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA.
| |
Collapse
|
30
|
North HL, McGaughran A, Jiggins CD. Insights into invasive species from whole-genome resequencing. Mol Ecol 2021; 30:6289-6308. [PMID: 34041794 DOI: 10.1111/mec.15999] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 03/12/2021] [Accepted: 04/30/2021] [Indexed: 12/12/2022]
Abstract
Studies of invasive species can simultaneously inform management strategies and quantify rapid evolution in the wild. The role of genomics in invasion science is increasingly recognised, and the growing availability of reference genomes for invasive species is paving the way for whole-genome resequencing studies in a wide range of systems. Here, we survey the literature to assess the application of whole-genome resequencing data in invasion biology. For some applications, such as the reconstruction of invasion routes in time and space, sequencing the whole genome of many individuals can increase the accuracy of existing methods. In other cases, population genomic approaches such as haplotype analysis can permit entirely new questions to be addressed and new technologies applied. To date whole-genome resequencing has only been used in a handful of invasive systems, but these studies have confirmed the importance of processes such as balancing selection and hybridization in allowing invasive species to reuse existing adaptations and rapidly overcome the challenges of a foreign ecosystem. The use of genomic data does not constitute a paradigm shift per se, but by leveraging new theory, tools, and technologies, population genomics can provide unprecedented insight into basic and applied aspects of invasion science.
Collapse
Affiliation(s)
- Henry L North
- Department of Zoology, University of Cambridge, Cambridge, UK
| | - Angela McGaughran
- Te Aka Mātuatua/School of Science, University of Waikato, Hamilton, New Zealand
| | - Chris D Jiggins
- Department of Zoology, University of Cambridge, Cambridge, UK
| |
Collapse
|
31
|
Bourgeois YXC, Warren BH. An overview of current population genomics methods for the analysis of whole-genome resequencing data in eukaryotes. Mol Ecol 2021; 30:6036-6071. [PMID: 34009688 DOI: 10.1111/mec.15989] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 04/26/2021] [Accepted: 05/11/2021] [Indexed: 01/01/2023]
Abstract
Characterizing the population history of a species and identifying loci underlying local adaptation is crucial in functional ecology, evolutionary biology, conservation and agronomy. The constant improvement of high-throughput sequencing techniques has facilitated the production of whole genome data in a wide range of species. Population genomics now provides tools to better integrate selection into a historical framework, and take into account selection when reconstructing demographic history. However, this improvement has come with a profusion of analytical tools that can confuse and discourage users. Such confusion limits the amount of information effectively retrieved from complex genomic data sets, and impairs the diffusion of the most recent analytical tools into fields such as conservation biology. It may also lead to redundancy among methods. To address these isssues, we propose an overview of more than 100 state-of-the-art methods that can deal with whole genome data. We summarize the strategies they use to infer demographic history and selection, and discuss some of their limitations. A website listing these methods is available at www.methodspopgen.com.
Collapse
Affiliation(s)
| | - Ben H Warren
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Muséum National d'Histoire Naturelle, CNRS, Sorbonne Université, EPHE, UA, CP 51, Paris, France
| |
Collapse
|
32
|
Zeng K, Charlesworth B, Hobolth A. Studying models of balancing selection using phase-type theory. Genetics 2021; 218:6237896. [PMID: 33871627 DOI: 10.1093/genetics/iyab055] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 03/25/2021] [Indexed: 11/15/2022] Open
Abstract
Balancing selection (BLS) is the evolutionary force that maintains high levels of genetic variability in many important genes. To further our understanding of its evolutionary significance, we analyze models with BLS acting on a biallelic locus: an equilibrium model with long-term BLS, a model with long-term BLS and recent changes in population size, and a model of recent BLS. Using phase-type theory, a mathematical tool for analyzing continuous time Markov chains with an absorbing state, we examine how BLS affects polymorphism patterns in linked neutral regions, as summarized by nucleotide diversity, the expected number of segregating sites, the site frequency spectrum, and the level of linkage disequilibrium (LD). Long-term BLS affects polymorphism patterns in a relatively small genomic neighborhood, and such selection targets are easier to detect when the equilibrium frequencies of the selected variants are close to 50%, or when there has been a population size reduction. For a new mutation subject to BLS, its initial increase in frequency in the population causes linked neutral regions to have reduced diversity, an excess of both high and low frequency derived variants, and elevated LD with the selected locus. These patterns are similar to those produced by selective sweeps, but the effects of recent BLS are weaker. Nonetheless, compared to selective sweeps, nonequilibrium polymorphism and LD patterns persist for a much longer period under recent BLS, which may increase the chance of detecting such selection targets. An R package for analyzing these models, among others (e.g., isolation with migration), is available.
Collapse
Affiliation(s)
- Kai Zeng
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield S10 2TN, UK
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Asger Hobolth
- Department of Mathematics, Aarhus University, Aarhus DK-8000, Denmark
| |
Collapse
|
33
|
Göktay M, Fulgione A, Hancock AM. A New Catalog of Structural Variants in 1,301 A. thaliana Lines from Africa, Eurasia, and North America Reveals a Signature of Balancing Selection at Defense Response Genes. Mol Biol Evol 2021; 38:1498-1511. [PMID: 33247723 PMCID: PMC8042739 DOI: 10.1093/molbev/msaa309] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Genomic variation in the model plant Arabidopsis thaliana has been extensively used to understand evolutionary processes in natural populations, mainly focusing on single-nucleotide polymorphisms. Conversely, structural variation has been largely ignored in spite of its potential to dramatically affect phenotype. Here, we identify 155,440 indels and structural variants ranging in size from 1 bp to 10 kb, including presence/absence variants (PAVs), inversions, and tandem duplications in 1,301 A. thaliana natural accessions from Morocco, Madeira, Europe, Asia, and North America. We show evidence for strong purifying selection on PAVs in genes, in particular for housekeeping genes and homeobox genes, and we find that PAVs are concentrated in defense-related genes (R-genes, secondary metabolites) and F-box genes. This implies the presence of a "core" genome underlying basic cellular processes and a "flexible" genome that includes genes that may be important in spatially or temporally varying selection. Further, we find an excess of intermediate frequency PAVs in defense response genes in nearly all populations studied, consistent with a history of balancing selection on this class of genes. Finally, we find that PAVs in genes involved in the cold requirement for flowering (vernalization) and drought response are strongly associated with temperature at the sites of origin.
Collapse
Affiliation(s)
- Mehmet Göktay
- Max Planck Institute for Plant Breeding Research, Cologne, Germany
| | - Andrea Fulgione
- Max Planck Institute for Plant Breeding Research, Cologne, Germany
| | - Angela M Hancock
- Max Planck Institute for Plant Breeding Research, Cologne, Germany
| |
Collapse
|
34
|
Huang Y, Li Y, Wang X, Yu J, Cai Y, Zheng Z, Li R, Zhang S, Chen N, Asadollahpour Nanaei H, Hanif Q, Chen Q, Fu W, Li C, Cao X, Zhou G, Liu S, He S, Li W, Chen Y, Chen H, Lei C, Liu M, Jiang Y. An atlas of CNV maps in cattle, goat and sheep. SCIENCE CHINA-LIFE SCIENCES 2021; 64:1747-1764. [PMID: 33486588 DOI: 10.1007/s11427-020-1850-x] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2020] [Accepted: 11/16/2020] [Indexed: 11/26/2022]
Abstract
Copy number variation (CNV) is the most prevalent type of genetic structural variation that has been recognized as an important source of phenotypic variation in humans, animals and plants. However, the mechanisms underlying the evolution of CNVs and their function in natural or artificial selection remain unknown. Here, we generated CNV region (CNVR) datasets which were diverged or shared among cattle, goat, and sheep, including 886 individuals from 171 diverse populations. Using 9 environmental factors for genome-wide association study (GWAS), we identified a series of candidate CNVRs, including genes relating to immunity, tick resistance, multi-drug resistance, and muscle development. The number of CNVRs shared between species is significantly higher than expected (P<0.00001), and these CNVRs may be more persist than the single nucleotide polymorphisms (SNPs) shared between species. We also identified genomic regions under long-term balancing selection and uncovered the potential diversity of the selected CNVRs close to the important functional genes. This study provides the evidence that balancing selection might be more common in mammals than previously considered, and might play an important role in the daily activities of these ruminant species.
Collapse
Affiliation(s)
- Yongzhen Huang
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Yunjia Li
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Xihong Wang
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Jiantao Yu
- College of Information Engineering, Northwest A&F University, Yangling, 712100, China
| | - Yudong Cai
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Zhuqing Zheng
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Ran Li
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Shunjin Zhang
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Ningbo Chen
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | | | - Quratulain Hanif
- National Institute for Biotechnology and Genetic Engineering (NIBGE), Faisalabad, Punjab, 577, Pakistan
- Pakistan Institute of Engineering & Applied Sciences (PIEAS), Nilore, 45650, Islamabad, Pakistan
| | - Qiuming Chen
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Weiwei Fu
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Chao Li
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Xiukai Cao
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Guangxian Zhou
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Shudong Liu
- College of Information Engineering, Northwest A&F University, Yangling, 712100, China
| | - Sangang He
- Key Laboratory of Genetics Breeding and Reproduction of Grass feeding Livestock, Ministry of Agriculture, Biotechnology Research Institute, Xinjiang Academy of Animal Sciences, Urumqi, 830026, China
| | - Wenrong Li
- Key Laboratory of Genetics Breeding and Reproduction of Grass feeding Livestock, Ministry of Agriculture, Biotechnology Research Institute, Xinjiang Academy of Animal Sciences, Urumqi, 830026, China
| | - Yulin Chen
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Hong Chen
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Chuzhao Lei
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Mingjun Liu
- Key Laboratory of Genetics Breeding and Reproduction of Grass feeding Livestock, Ministry of Agriculture, Biotechnology Research Institute, Xinjiang Academy of Animal Sciences, Urumqi, 830026, China
| | - Yu Jiang
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China.
| |
Collapse
|
35
|
Cheng X, DeGiorgio M. Flexible Mixture Model Approaches That Accommodate Footprint Size Variability for Robust Detection of Balancing Selection. Mol Biol Evol 2020; 37:3267-3291. [PMID: 32462188 PMCID: PMC7820363 DOI: 10.1093/molbev/msaa134] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Long-term balancing selection typically leaves narrow footprints of increased genetic diversity, and therefore most detection approaches only achieve optimal performances when sufficiently small genomic regions (i.e., windows) are examined. Such methods are sensitive to window sizes and suffer substantial losses in power when windows are large. Here, we employ mixture models to construct a set of five composite likelihood ratio test statistics, which we collectively term B statistics. These statistics are agnostic to window sizes and can operate on diverse forms of input data. Through simulations, we show that they exhibit comparable power to the best-performing current methods, and retain substantially high power regardless of window sizes. They also display considerable robustness to high mutation rates and uneven recombination landscapes, as well as an array of other common confounding scenarios. Moreover, we applied a specific version of the B statistics, termed B2, to a human population-genomic data set and recovered many top candidates from prior studies, including the then-uncharacterized STPG2 and CCDC169-SOHLH2, both of which are related to gamete functions. We further applied B2 on a bonobo population-genomic data set. In addition to the MHC-DQ genes, we uncovered several novel candidate genes, such as KLRD1, involved in viral defense, and SCN9A, associated with pain perception. Finally, we show that our methods can be extended to account for multiallelic balancing selection and integrated the set of statistics into open-source software named BalLeRMix for future applications by the scientific community.
Collapse
Affiliation(s)
- Xiaoheng Cheng
- Huck Institutes of Life Sciences, Pennsylvania State University, University Park, PA
- Department of Biology, Pennsylvania State University, University Park, PA
| | - Michael DeGiorgio
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL
| |
Collapse
|
36
|
Mughal MR, Koch H, Huang J, Chiaromonte F, DeGiorgio M. Learning the properties of adaptive regions with functional data analysis. PLoS Genet 2020; 16:e1008896. [PMID: 32853200 PMCID: PMC7480868 DOI: 10.1371/journal.pgen.1008896] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Revised: 09/09/2020] [Accepted: 05/29/2020] [Indexed: 12/12/2022] Open
Abstract
Identifying regions of positive selection in genomic data remains a challenge in population genetics. Most current approaches rely on comparing values of summary statistics calculated in windows. We present an approach termed SURFDAWave, which translates measures of genetic diversity calculated in genomic windows to functional data. By transforming our discrete data points to be outputs of continuous functions defined over genomic space, we are able to learn the features of these functions that signify selection. This enables us to confidently identify complex modes of natural selection, including adaptive introgression. We are also able to predict important selection parameters that are responsible for shaping the inferred selection events. By applying our model to human population-genomic data, we recapitulate previously identified regions of selective sweeps, such as OCA2 in Europeans, and predict that its beneficial mutation reached a frequency of 0.02 before it swept 1,802 generations ago, a time when humans were relatively new to Europe. In addition, we identify BNC2 in Europeans as a target of adaptive introgression, and predict that it harbors a beneficial mutation that arose in an archaic human population that split from modern humans within the hypothesized modern human-Neanderthal divergence range.
Collapse
Affiliation(s)
- Mehreen R. Mughal
- Bioinformatics and Genomics at the Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Hillary Koch
- Department of Statistics, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Jinguo Huang
- Bioinformatics and Genomics at the Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Francesca Chiaromonte
- Department of Statistics, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Michael DeGiorgio
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, Florida, United States of America
| |
Collapse
|
37
|
Evolutionary origins of genomic adaptations in an invasive copepod. Nat Ecol Evol 2020; 4:1084-1094. [DOI: 10.1038/s41559-020-1201-y] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Accepted: 04/14/2020] [Indexed: 12/18/2022]
|
38
|
Footprints of natural selection at the mannose-6-phosphate isomerase locus in barnacles. Proc Natl Acad Sci U S A 2020; 117:5376-5385. [PMID: 32098846 PMCID: PMC7071928 DOI: 10.1073/pnas.1918232117] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
The rocky intertidal is a natural laboratory to study how natural selection acts on the genes and proteins responsible for organismal survival and reproduction. Alternative forms of enzymes that differ across the intertidal have been known for decades and have provided examples of selection, but the genetic basis of such enzyme variation is known in only a few cases. In this paper, we present molecular evidence of natural selection at the Mpi gene, a key enzyme in energy metabolism that alters survival of barnacles living across the stress gradient imposed by the intertidal. Our study demonstrates how natural selection can facilitate survival in highly heterogeneous environments through the maintenance of multiple molecular solutions to ecological stresses. The mannose-6-phosphate isomerase (Mpi) locus in Semibalanus balanoides has been studied as a candidate gene for balancing selection for more than two decades. Previous work has shown that Mpi allozyme genotypes (fast and slow) have different frequencies across Atlantic intertidal zones due to selection on postsettlement survival (i.e., allele zonation). We present the complete gene sequence of the Mpi locus and quantify nucleotide polymorphism in S. balanoides, as well as divergence to its sister taxon Semibalanus cariosus. We show that the slow allozyme contains a derived charge-altering amino acid polymorphism, and both allozyme classes correspond to two haplogroups with multiple internal haplotypes. The locus shows several footprints of balancing selection around the fast/slow site: an enrichment of positive Tajima’s D for nonsynonymous mutations, an excess of polymorphism, and a spike in the levels of silent polymorphism relative to silent divergence, as well as a site frequency spectrum enriched for midfrequency mutations. We observe other departures from neutrality across the locus in both coding and noncoding regions. These include a nonsynonymous trans-species polymorphism and a recent mutation under selection within the fast haplogroup. The latter suggests ongoing allelic replacement of functionally relevant amino acid variants. Moreover, predicted models of Mpi protein structure provide insight into the functional significance of the putatively selected amino acid polymorphisms. While footprints of selection are widespread across the range of S. balanoides, our data show that intertidal zonation patterns are variable across both spatial and temporal scales. These data provide further evidence for heterogeneous selection on Mpi.
Collapse
|