1
|
Soni V, Terbot JW, Versoza CJ, Pfeifer SP, Jensen JD. A whole-genome scan for evidence of recent positive and balancing selection in aye-ayes ( Daubentonia madagascariensis) utilizing a well-fit evolutionary baseline model. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.11.08.622667. [PMID: 39605496 PMCID: PMC11601216 DOI: 10.1101/2024.11.08.622667] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]
Abstract
The aye-aye (Daubentonia madagascariensis) is one of the 25 most endangered primate species in the world, maintaining amongst the lowest genetic diversity of any primate measured to date. Characterizing patterns of genetic variation within aye-aye populations, and the relative influences of neutral and selective processes in shaping that variation, is thus important for future conservation efforts. In this study, we performed the first whole-genome scans for recent positive and balancing selection in the species, utilizing high-coverage population genomic data from newly sequenced individuals. We generated null thresholds for our genomic scans by creating an evolutionarily appropriate baseline model that incorporates the demographic history of this aye-aye population, and identified a small number of candidate genes. Most notably, a suite of genes involved in olfaction - a key trait in these nocturnal primates - were identified as experiencing long-term balancing selection. We also conducted analyses to quantify the expected statistical power to detect positive and balancing selection in this population using site frequency spectrum-based inference methods, once accounting for the potentially confounding contributions of population history, recombination and mutation rate variation, and purifying and background selection. This work, presenting the first high-quality, genome-wide polymorphism data across the functional regions of the aye-aye genome, thus provides important insights into the landscape of episodic selective forces in this highly endangered species.
Collapse
Affiliation(s)
- Vivak Soni
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - John W. Terbot
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Cyril J. Versoza
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Susanne P. Pfeifer
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Jeffrey D. Jensen
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
2
|
Singh PP, Reeves GA, Contrepois K, Papsdorf K, Miklas JW, Ellenberger M, Hu CK, Snyder MP, Brunet A. Evolution of diapause in the African turquoise killifish by remodeling the ancient gene regulatory landscape. Cell 2024; 187:3338-3356.e30. [PMID: 38810644 PMCID: PMC11970524 DOI: 10.1016/j.cell.2024.04.048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Revised: 11/30/2023] [Accepted: 04/30/2024] [Indexed: 05/31/2024]
Abstract
Suspended animation states allow organisms to survive extreme environments. The African turquoise killifish has evolved diapause as a form of suspended development to survive a complete drought. However, the mechanisms underlying the evolution of extreme survival states are unknown. To understand diapause evolution, we performed integrative multi-omics (gene expression, chromatin accessibility, and lipidomics) in the embryos of multiple killifish species. We find that diapause evolved by a recent remodeling of regulatory elements at very ancient gene duplicates (paralogs) present in all vertebrates. CRISPR-Cas9-based perturbations identify the transcription factors REST/NRSF and FOXOs as critical for the diapause gene expression program, including genes involved in lipid metabolism. Indeed, diapause shows a distinct lipid profile, with an increase in triglycerides with very-long-chain fatty acids. Our work suggests a mechanism for the evolution of complex adaptations and offers strategies to promote long-term survival by activating suspended animation programs in other species.
Collapse
Affiliation(s)
| | - G Adam Reeves
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Kévin Contrepois
- Department of Genetics, Stanford University, Stanford, CA, USA; Stanford Cardiovascular Institute, Stanford University, Stanford, CA, USA
| | | | - Jason W Miklas
- Department of Genetics, Stanford University, Stanford, CA, USA
| | | | - Chi-Kuo Hu
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Michael P Snyder
- Department of Genetics, Stanford University, Stanford, CA, USA; Stanford Cardiovascular Institute, Stanford University, Stanford, CA, USA; Stanford Diabetes Research Center, Stanford University, Stanford, CA, USA
| | - Anne Brunet
- Department of Genetics, Stanford University, Stanford, CA, USA; Glenn Center for the Biology of Aging, Stanford University, Stanford, CA, USA; Wu Tsai Neurosciences Institute, Stanford University, Stanford, CA, USA; Chan Zuckerberg Biohub, San Francisco, San Francisco, CA, USA.
| |
Collapse
|
3
|
Jiang D, Zhang J. Ascertainment Bias in the Genomic Test of Positive Selection on Regulatory Sequences. Mol Biol Evol 2024; 41:msad284. [PMID: 38149460 PMCID: PMC10766478 DOI: 10.1093/molbev/msad284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2023] [Revised: 11/12/2023] [Accepted: 12/22/2023] [Indexed: 12/28/2023] Open
Abstract
Evolution of gene expression mediated by cis-regulatory changes is thought to be an important contributor to organismal adaptation, but identifying adaptive cis-regulatory changes is challenging due to the difficulty in knowing the expectation under no positive selection. A new approach for detecting positive selection on transcription factor binding sites (TFBSs) was recently developed, thanks to the application of machine learning in predicting transcription factor (TF) binding affinities of DNA sequences. Given a TFBS sequence from a focal species and the corresponding inferred ancestral sequence that differs from the former at n sites, one can predict the TF-binding affinities of many n-step mutational neighbors of the ancestral sequence and obtain a null distribution of the derived binding affinity, which allows testing whether the binding affinity of the real derived sequence deviates significantly from the null distribution. Applying this test genomically to all experimentally identified binding sites of 3 TFs in humans, a recent study reported positive selection for elevated binding affinities of TFBSs. Here, we show that this genomic test suffers from an ascertainment bias because, even in the absence of positive selection for strengthened binding, the binding affinities of known human TFBSs are more likely to have increased than decreased in evolution. We demonstrate by computer simulation that this bias inflates the false positive rate of the selection test. We propose several methods to mitigate the ascertainment bias and show that almost all previously reported positive selection signals disappear when these methods are applied.
Collapse
Affiliation(s)
- Daohan Jiang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA
- Present address: Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
4
|
Jiang D, Zhang J. Ascertainment bias in the genomic test of positive selection on regulatory sequences. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.20.554030. [PMID: 37662307 PMCID: PMC10473660 DOI: 10.1101/2023.08.20.554030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2023]
Abstract
Evolution of gene expression mediated by cis-regulatory changes is thought to be an important contributor to organismal adaptation, but identifying adaptive cis-regulatory changes is challenging due to the difficulty in knowing the expectation under no positive selection. A new approach for detecting positive selection on transcription factor binding sites (TFBSs) was recently developed, thanks to the application of machine learning in predicting transcription factor (TF) binding affinities of DNA sequences. Given a TFBS sequence from a focal species and the corresponding inferred ancestral sequence that differs from the former at n sites, one can predict the TF binding affinities of many n-step mutational neighbors of the ancestral sequence and obtain a null distribution of the derived binding affinity, which allows testing whether the binding affinity of the real derived sequence deviates significantly from the null distribution. Applying this test genomically to all experimentally identified binding sites of three TFs in humans, a recent study reported positive selection for elevated binding affinities of TFBSs. Here we show that this genomic test suffers from an ascertainment bias because, even in the absence of positive selection for strengthened binding, the binding affinities of known human TFBSs are more likely to have increased than decreased in evolution. We demonstrate by computer simulation that this bias inflates the false positive rate of the selection test. We propose several methods to mitigate the ascertainment bias and show that almost all previously reported positive selection signals disappear when these methods are applied.
Collapse
Affiliation(s)
- Daohan Jiang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
| |
Collapse
|
5
|
Zhang X, Fang B, Huang YF. Transcription factor binding sites are frequently under accelerated evolution in primates. Nat Commun 2023; 14:783. [PMID: 36774380 PMCID: PMC9922303 DOI: 10.1038/s41467-023-36421-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Accepted: 01/31/2023] [Indexed: 02/13/2023] Open
Abstract
Recent comparative genomic studies have identified many human accelerated elements (HARs) with elevated substitution rates in the human lineage. However, it remains unknown to what extent transcription factor binding sites (TFBSs) are under accelerated evolution in humans and other primates. Here, we introduce two pooling-based phylogenetic methods with dramatically enhanced sensitivity to examine accelerated evolution in TFBSs. Using these new methods, we show that more than 6000 TFBSs annotated in the human genome have experienced accelerated evolution in Hominini, apes, and Old World monkeys. Although these TFBSs individually show relatively weak signals of accelerated evolution, they collectively are more abundant than HARs. Also, we show that accelerated evolution in Pol III binding sites may be driven by lineage-specific positive selection, whereas accelerated evolution in other TFBSs might be driven by nonadaptive evolutionary forces. Finally, the accelerated TFBSs are enriched around developmental genes, suggesting that accelerated evolution in TFBSs may drive the divergence of developmental processes between primates.
Collapse
Affiliation(s)
- Xinru Zhang
- Department of Biology, Pennsylvania State University, University Park, PA, 16802, USA. .,Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA, 16802, USA. .,Bioinformatics and Genomics Graduate Program, Pennsylvania State University, University Park, PA, 16802, USA.
| | - Bohao Fang
- Department of Organismic and Evolutionary Biology and the Museum of Comparative Zoology, Harvard University, Boston, MA, 02135, USA
| | - Yi-Fei Huang
- Department of Biology, Pennsylvania State University, University Park, PA, 16802, USA. .,Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA, 16802, USA.
| |
Collapse
|
6
|
Zug R, Uller T. Evolution and dysfunction of human cognitive and social traits: A transcriptional regulation perspective. EVOLUTIONARY HUMAN SCIENCES 2022; 4:e43. [PMID: 37588924 PMCID: PMC10426018 DOI: 10.1017/ehs.2022.42] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 08/11/2022] [Accepted: 09/11/2022] [Indexed: 11/07/2022] Open
Abstract
Evolutionary changes in brain and craniofacial development have endowed humans with unique cognitive and social skills, but also predisposed us to debilitating disorders in which these traits are disrupted. What are the developmental genetic underpinnings that connect the adaptive evolution of our cognition and sociality with the persistence of mental disorders with severe negative fitness effects? We argue that loss of function of genes involved in transcriptional regulation represents a crucial link between the evolution and dysfunction of human cognitive and social traits. The argument is based on the haploinsufficiency of many transcriptional regulator genes, which makes them particularly sensitive to loss-of-function mutations. We discuss how human brain and craniofacial traits evolved through partial loss of function (i.e. reduced expression) of these genes, a perspective compatible with the idea of human self-domestication. Moreover, we explain why selection against loss-of-function variants supports the view that mutation-selection-drift, rather than balancing selection, underlies the persistence of psychiatric disorders. Finally, we discuss testable predictions.
Collapse
Affiliation(s)
- Roman Zug
- Department of Biology, Lund University, Lund, Sweden
| | - Tobias Uller
- Department of Biology, Lund University, Lund, Sweden
| |
Collapse
|
7
|
Dubois‐Mignon T, Monget P. Gene essentiality and variability: What is the link? A within‐ and between‐species perspective. Bioessays 2022; 44:e2200132. [DOI: 10.1002/bies.202200132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Revised: 08/17/2022] [Accepted: 08/30/2022] [Indexed: 11/07/2022]
Affiliation(s)
- Tania Dubois‐Mignon
- Institut de Biologie de l’École Normale Supérieure Université PSL 46 rue d'Ulm Paris 75005 France
| | - Philippe Monget
- Physiologie de la Reproduction et des Comportements, Centre Val de Loire – UMR INRAE, CNRS, IFCE Université de Tours Nouzilly France
| |
Collapse
|
8
|
Yu H, Chen M, Hu Y, Ou S, Yu X, Liang S, Li N, Yang M, Kong X, Sun C, Jia S, Zhang Q, Liu L, Hurst LD, Li R, Wang W, Wang J. Dynamic reprogramming of H3K9me3 at hominoid-specific retrotransposons during human preimplantation development. Cell Stem Cell 2022; 29:1031-1050.e12. [PMID: 35803225 DOI: 10.1016/j.stem.2022.06.006] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2021] [Revised: 04/06/2022] [Accepted: 06/08/2022] [Indexed: 12/13/2022]
Abstract
Reprogramming of H3K9me3-dependent heterochromatin is required for early development. How H3K9me3 is involved in early human development remains, however, largely unclear. Here, we resolve the temporal landscape of H3K9me3 during human preimplantation development and its regulation for diverse hominoid-specific retrotransposons. At the 8-cell stage, H3K9me3 reprogramming at hominoid-specific retrotransposons termed SINE-VNTR-Alu (SVA) facilitates interaction between certain promoters and SVA-derived enhancers, promoting the zygotic genome activation. In trophectoderm, de novo H3K9me3 domains prevent pluripotent transcription factors from binding to hominoid-specific retrotransposons-derived regulatory elements for inner cell mass (ICM)-specific genes. H3K9me3 re-establishment at SVA elements in the ICM is associated with higher transcription of DNA repair genes, when compared with naive human pluripotent stem cells. Our data demonstrate that species-specific reorganization of H3K9me3-dependent heterochromatin at hominoid-specific retrotransposons plays important roles during early human development, shedding light on how the epigenetic regulation for early development has evolved in mammals.
Collapse
Affiliation(s)
- Hanwen Yu
- Department of Histology and Embryology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China; Key Laboratory for Stem Cells and Tissue Engineering (Sun Yat-sen University), Ministry of Education, Guangzhou 510080, China
| | - Manqi Chen
- Department of Histology and Embryology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China; Key Laboratory for Stem Cells and Tissue Engineering (Sun Yat-sen University), Ministry of Education, Guangzhou 510080, China
| | - Yuanlang Hu
- Department of Histology and Embryology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China; Key Laboratory for Stem Cells and Tissue Engineering (Sun Yat-sen University), Ministry of Education, Guangzhou 510080, China; The First Affiliated Hospital, Jinan University, Guangzhou 510630, China
| | - Songbang Ou
- Department of Obstetrics and Gynaecology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou 510120, China
| | - Xiu Yu
- Department of Histology and Embryology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China; Key Laboratory for Stem Cells and Tissue Engineering (Sun Yat-sen University), Ministry of Education, Guangzhou 510080, China
| | - Shiqi Liang
- Department of Histology and Embryology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China; Key Laboratory for Stem Cells and Tissue Engineering (Sun Yat-sen University), Ministry of Education, Guangzhou 510080, China
| | - Niannian Li
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin 300071, China; Department of Cell Biology and Genetics, College of Life Sciences, Nankai University, Tianjin 300071, China
| | - Mingzhu Yang
- Department of Histology and Embryology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China; Key Laboratory for Stem Cells and Tissue Engineering (Sun Yat-sen University), Ministry of Education, Guangzhou 510080, China
| | - Xuhui Kong
- Department of Histology and Embryology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China; Key Laboratory for Stem Cells and Tissue Engineering (Sun Yat-sen University), Ministry of Education, Guangzhou 510080, China
| | - Chuanbo Sun
- Laboratory of Medical Systems Biology, Guangzhou Women and Children's Medical Center, Guangzhou Medical University, Guangzhou 510623, China
| | - Shiqi Jia
- The First Affiliated Hospital, Jinan University, Guangzhou 510630, China
| | - Qingxue Zhang
- Department of Obstetrics and Gynaecology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou 510120, China
| | - Lin Liu
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin 300071, China; Department of Cell Biology and Genetics, College of Life Sciences, Nankai University, Tianjin 300071, China
| | - Laurence D Hurst
- Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, UK
| | - Ruiqi Li
- Department of Obstetrics and Gynaecology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou 510120, China.
| | - Wenjun Wang
- Department of Obstetrics and Gynaecology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou 510120, China.
| | - Jichang Wang
- Department of Histology and Embryology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China; Key Laboratory for Stem Cells and Tissue Engineering (Sun Yat-sen University), Ministry of Education, Guangzhou 510080, China.
| |
Collapse
|
9
|
The evolution, evolvability and engineering of gene regulatory DNA. Nature 2022; 603:455-463. [PMID: 35264797 DOI: 10.1038/s41586-022-04506-6] [Citation(s) in RCA: 122] [Impact Index Per Article: 40.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2021] [Accepted: 02/02/2022] [Indexed: 11/08/2022]
Abstract
Mutations in non-coding regulatory DNA sequences can alter gene expression, organismal phenotype and fitness1-3. Constructing complete fitness landscapes, in which DNA sequences are mapped to fitness, is a long-standing goal in biology, but has remained elusive because it is challenging to generalize reliably to vast sequence spaces4-6. Here we build sequence-to-expression models that capture fitness landscapes and use them to decipher principles of regulatory evolution. Using millions of randomly sampled promoter DNA sequences and their measured expression levels in the yeast Saccharomyces cerevisiae, we learn deep neural network models that generalize with excellent prediction performance, and enable sequence design for expression engineering. Using our models, we study expression divergence under genetic drift and strong-selection weak-mutation regimes to find that regulatory evolution is rapid and subject to diminishing returns epistasis; that conflicting expression objectives in different environments constrain expression adaptation; and that stabilizing selection on gene expression leads to the moderation of regulatory complexity. We present an approach for using such models to detect signatures of selection on expression from natural variation in regulatory sequences and use it to discover an instance of convergent regulatory evolution. We assess mutational robustness, finding that regulatory mutation effect sizes follow a power law, characterize regulatory evolvability, visualize promoter fitness landscapes, discover evolvability archetypes and illustrate the mutational robustness of natural regulatory sequence populations. Our work provides a general framework for designing regulatory sequences and addressing fundamental questions in regulatory evolution.
Collapse
|
10
|
Quiver MH, Lachance J. Adaptive eQTLs reveal the evolutionary impacts of pleiotropy and tissue-specificity while contributing to health and disease. HGG ADVANCES 2022; 3:100083. [PMID: 35047867 PMCID: PMC8756519 DOI: 10.1016/j.xhgg.2021.100083] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Accepted: 12/21/2021] [Indexed: 11/24/2022] Open
Abstract
Large numbers of expression quantitative trait loci (eQTLs) have recently been identified in humans, and many of these regulatory variants have large allele frequency differences between populations. Here, we conducted genome-wide scans of selection to identify adaptive eQTLs (i.e., eQTLs with large population branch statistics). We then tested if tissue pleiotropy affects whether eQTLs are more or less likely to be adaptive and identified tissues that have been key targets of positive selection during the last 100,000 years. Top adaptive eQTL outliers include rs1043809, rs66899053, and rs2814778 (a SNP that is associated with malaria resistance). We found that effect sizes of eQTLs were negatively correlated with population branch statistics and that adaptive eQTLs affect two-thirds as many tissues as do non-adaptive eQTLs. Because the tissue breadth of an eQTL can be viewed as a measure of pleiotropy, these results imply that pleiotropy inhibits adaptation. The proportion of eQTLs that are adaptive varies by tissue, and we found that eQTLs that regulate expression in testis, thyroid, blood, or sun-exposed skin are enriched for signatures of positive selection. By contrast, eQTLs that regulate expression in the cerebrum or female-specific tissues have a relative lack of adaptive outliers. Scans of selections also reveal that many adaptive eQTLs are closely linked to disease-associated loci. Taken together, our results indicate that eQTLs have played an important role in recent human evolution.
Collapse
Affiliation(s)
- Melanie H Quiver
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Joseph Lachance
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332, USA
| |
Collapse
|
11
|
Schweizer G, Wagner A. Both Binding Strength and Evolutionary Accessibility Affect the Population Frequency of Transcription Factor Binding Sequences in Arabidopsis thaliana. Genome Biol Evol 2021; 13:6459646. [PMID: 34894231 PMCID: PMC8712246 DOI: 10.1093/gbe/evab273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/06/2021] [Indexed: 11/22/2022] Open
Abstract
Mutations in DNA sequences that bind transcription factors and thus modulate gene expression are a source of adaptive variation in gene expression. To understand how transcription factor binding sequences evolve in natural populations of the thale cress Arabidopsis thaliana, we integrated genomic polymorphism data for loci bound by transcription factors with in vitro data on binding affinity for these transcription factors. Specifically, we studied 19 different transcription factors, and the allele frequencies of 8,333 genomic loci bound in vivo by these transcription factors in 1,135 A. thaliana accessions. We find that transcription factor binding sequences show very low genetic diversity, suggesting that they are subject to purifying selection. High frequency alleles of such binding sequences tend to bind transcription factors strongly. Conversely, alleles that are absent from the population tend to bind them weakly. In addition, alleles with high frequencies also tend to be the endpoints of many accessible evolutionary paths leading to these alleles. We show that both high affinity and high evolutionary accessibility contribute to high allele frequency for at least some transcription factors. Although binding sequences with stronger affinity are more frequent, we did not find them to be associated with higher gene expression levels. Epistatic interactions among individual mutations that alter binding affinity are pervasive and can help explain variation in accessibility among binding sequences. In summary, combining in vitro binding affinity data with in vivo binding sequence data can help understand the forces that affect the evolution of transcription factor binding sequences in natural populations.
Collapse
Affiliation(s)
- Gabriel Schweizer
- Department of Evolutionary Biology and Environmental Studies, University of Zürich, Switzerland.,Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, Lausanne, Switzerland
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental Studies, University of Zürich, Switzerland.,Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, Lausanne, Switzerland.,Santa Fe Institute, Santa Fe, New Mexico, USA.,Stellenbosch Institute for Advanced Study (STIAS), Wallenberg Research Centre at Stellenbosch University, South Africa
| |
Collapse
|
12
|
Liu J, Viales RR, Khoueiry P, Reddington JP, Girardot C, Furlong E, Robinson-Rechavi M. The hourglass model of evolutionary conservation during embryogenesis extends to developmental enhancers with signatures of positive selection. Genome Res 2021; 31:1573-1581. [PMID: 34266978 PMCID: PMC8415374 DOI: 10.1101/gr.275212.121] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Accepted: 06/02/2021] [Indexed: 11/24/2022]
Abstract
Inter-species comparisons of both morphology and gene expression within a phylum have revealed a period in the middle of embryogenesis with more similarity between species compared to earlier and later time-points. This "developmental hourglass" pattern has been observed in many phyla, yet the evolutionary constraints on gene expression, and underlying mechanisms of how this is regulated, remains elusive. Moreover, the role of positive selection on gene regulation in the more diverged earlier and later stages of embryogenesis remains unknown. Here, using DNase-seq to identify regulatory regions in two distant Drosophila species (D. melanogaster and D. virilis), we assessed the evolutionary conservation and adaptive evolution of enhancers throughout multiple stages of embryogenesis. This revealed a higher proportion of conserved enhancers at the phylotypic period, providing a regulatory basis for the hourglass expression pattern. Using an in silico mutagenesis approach, we detect signatures of positive selection on developmental enhancers at early and late stages of embryogenesis, with a depletion at the phylotypic period, suggesting positive selection as one evolutionary mechanism underlying the hourglass pattern of animal evolution.
Collapse
|
13
|
Joshi M, Kapopoulou A, Laurent S. Impact of Genetic Variation in Gene Regulatory Sequences: A Population Genomics Perspective. Front Genet 2021; 12:660899. [PMID: 34276769 PMCID: PMC8282999 DOI: 10.3389/fgene.2021.660899] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Accepted: 05/31/2021] [Indexed: 01/06/2023] Open
Abstract
The unprecedented rise of high-throughput sequencing and assay technologies has provided a detailed insight into the non-coding sequences and their potential role as gene expression regulators. These regulatory non-coding sequences are also referred to as cis-regulatory elements (CREs). Genetic variants occurring within CREs have been shown to be associated with altered gene expression and phenotypic changes. Such variants are known to occur spontaneously and ultimately get fixed, due to selection and genetic drift, in natural populations and, in some cases, pave the way for speciation. Hence, the study of genetic variation at CREs has improved our overall understanding of the processes of local adaptation and evolution. Recent advances in high-throughput sequencing and better annotations of CREs have enabled the evaluation of the impact of such variation on gene expression, phenotypic alteration and fitness. Here, we review recent research on the evolution of CREs and concentrate on studies that have investigated genetic variation occurring in these regulatory sequences within the context of population genetics.
Collapse
Affiliation(s)
- Manas Joshi
- Department of Comparative Development and Genetics, Max Planck Institute for Plant Breeding Research, Cologne, Germany
| | | | - Stefan Laurent
- Department of Comparative Development and Genetics, Max Planck Institute for Plant Breeding Research, Cologne, Germany
| |
Collapse
|
14
|
Giudicelli F, Roest Crollius H. On the importance of evolutionary constraint for regulatory sequence identification. Brief Funct Genomics 2021:elab015. [PMID: 33754633 DOI: 10.1093/bfgp/elab015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2020] [Revised: 01/15/2021] [Accepted: 02/19/2021] [Indexed: 11/13/2022] Open
Abstract
Regulation of gene expression relies on the activity of specialized genomic elements, enhancers or silencers, distributed over sometimes large distance from their target gene promoters. A significant part of vertebrate genomes consists in such regulatory elements, but their identification and that of their target genes remains challenging, due to the lack of clear signature at the nucleotide level. For many years the main hallmark used for identifying functional elements has been their sequence conservation between genomes of distant species, indicative of purifying selection. More recently, genome-wide biochemical assays have opened new avenues for detecting regulatory regions, shifting attention away from evolutionary constraints. Here, we review the respective contributions of comparative genomics and biochemical assays for the definition of regulatory elements and their targets and advocate that both sequence conservation and preserved synteny, taken as signature of functional constraint, remain essential tools in this task.
Collapse
|
15
|
Tu SM. The Cancer Genome: Paradigm or Paradox? Cancers (Basel) 2021; 13:cancers13040674. [PMID: 33567511 PMCID: PMC7915243 DOI: 10.3390/cancers13040674] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Revised: 02/03/2021] [Accepted: 02/05/2021] [Indexed: 12/16/2022] Open
Abstract
Simple Summary The observation that genetic mutations often do not cause cancer or disease in the phenomena of mosaicism, clonal hematopoiesis of indeterminate potential (CHIP), and heteroplasmy provides us with important clues about the origin and nature of cancer. We should be wary that the cancer genome may lead us astray to the wrong destination on a bad expedition unless we adopt the right cancer theory to elucidate it, and adhere to the proper scientific method to investigate it. Abstract Nowadays, many professionals are sequencing the DNA and studying the cancer genome. However, if the genetic theory of cancer is flawed, our faith in the cancer genome will falter. If gene sequencing is only a tool, we should question what we are making or creating with this tool. When we do not have the right cancer theory at our disposal, we cannot be sure that what we create from the cancer genome is meaningful or useful. In this article, we illustrate that mosaicism, CHIP, and heteroplasmy dispute our traditional perspectives about a genetic origin of cancer and challenge our current narratives about the cancer genome. We caution that when we have the wrong cancer theory, big data can provide poor evidence. Precision medicine may become rather imprecise. Targeted therapy either does not work or work for the wrong reasons. The cancer genome thus becomes a paradox rather than a paradigm.
Collapse
Affiliation(s)
- Shi-Ming Tu
- Department of Genitourinary Medical Oncology, Unit 1374, The University of Texas MD Anderson Cancer Center, 1155 Pressler Street, Houston, TX 77030-3721, USA
| |
Collapse
|