1
|
Mohanty V, Shakhnovich EI. Biophysical fitness landscape design traps viral evolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.30.646233. [PMID: 40236159 PMCID: PMC11996392 DOI: 10.1101/2025.03.30.646233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/17/2025]
Abstract
We introduce foundational principles for designing customizable fitness landscapes for proteins. We focus on crafting antibody ensembles to create evolutionary traps which restrict viral fitness enhancement. By deriving a fundamental relationship between a mutant protein's fitness and its binding affinities to host receptors and antibodies, we show that the fitnesses of different protein sequences are designable, meaning they can be independently tuned by careful choice of antibodies. Given a user-defined target fitness landscape, stochastic optimization can be performed to obtain such an ensemble of antibodies which force the protein to evolve according to the designed target fitness landscape. We conduct in silico serial dilution experiments using microscopic chemical reaction dynamics to simulate viral evolution and validate the fitness landscape design. We then apply the design protocol to control the relative fitnesses of two SARS-CoV-2 neutral genotype networks while ensuring absolute fitness reduction. Finally, we introduce an iterative design protocol which consistently discovers better vaccination target sequences, generating antibodies that restrict the post-vaccination fitness growth of escape variants while simultaneously suppressing wildtype fitness. Biophysical fitness landscape design thus opens the door to prescient vaccine, antibody, and peptide design, thinking several steps ahead of pathogen evolution.
Collapse
|
2
|
Ranjan G, Scaria V, Sivasubbu S. Syntenic lncRNA locus exhibits DNA regulatory functions with sequence evolution. Gene 2025; 933:148988. [PMID: 39378975 DOI: 10.1016/j.gene.2024.148988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Revised: 07/12/2024] [Accepted: 10/04/2024] [Indexed: 10/10/2024]
Abstract
Syntenic long non-coding RNAs (lncRNAs) often show limited sequence conservation across species, prompting concern in the field. This study delves into functional signatures of syntenic lncRNAs between humans and zebrafish. Syntenic lncRNAs are highly expressed in zebrafish, with ∼90 % located near protein-coding genes, either in sense or antisense orientation. During early zebrafish development and in human embryonic stem cells (H1-hESC), syntenic lncRNA loci are enriched with cis-regulatory repressor signatures, influencing the expression of development-associated genes. In later zebrafish developmental stages and specific human cell lines, these syntenic lncRNA loci function as enhancers or transcription start sites (TSS) for protein-coding genes. Analysis of transposable elements (TEs) in syntenic lncRNA sequences revealed intriguing patterns: human lncRNAs are enriched in simple repeat elements, while their zebrafish counterparts show enrichment in LTR elements. This sequence evolution likely arises from post-rearrangement mutations that enhance DNA elements or cis-regulatory functions. It may also contribute to vertebrate innovation by creating novel transcription factor binding sites within the locus. This study highlights the conserved functionality of syntenic lncRNA loci through DNA elements, emphasizing their conserved roles across species despite sequence divergence.
Collapse
Affiliation(s)
- Gyan Ranjan
- CSIR Institute of Genomics and Integrative Biology, Mathura Road, Delhi 110024, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Vinod Scaria
- CSIR Institute of Genomics and Integrative Biology, Mathura Road, Delhi 110024, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India; Vishwanath Cancer Care Foundation, Mumbai, India.; Dr. D. Y Patil Medical College, Hospital and Research Centre, Pune, India.
| | - Sridhar Sivasubbu
- CSIR Institute of Genomics and Integrative Biology, Mathura Road, Delhi 110024, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India; Vishwanath Cancer Care Foundation, Mumbai, India.; Dr. D. Y Patil Medical College, Hospital and Research Centre, Pune, India.
| |
Collapse
|
3
|
Westmann CA, Goldbach L, Wagner A. The highly rugged yet navigable regulatory landscape of the bacterial transcription factor TetR. Nat Commun 2024; 15:10745. [PMID: 39737967 PMCID: PMC11686294 DOI: 10.1038/s41467-024-54723-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Accepted: 11/19/2024] [Indexed: 01/01/2025] Open
Abstract
Transcription factor binding sites (TFBSs) are important sources of evolutionary innovations. Understanding how evolution navigates the sequence space of such sites can be achieved by mapping TFBS adaptive landscapes. In such a landscape, an individual location corresponds to a TFBS bound by a transcription factor. The elevation at that location corresponds to the strength of transcriptional regulation conveyed by the sequence. Here, we develop an in vivo massively parallel reporter assay to map the landscape of bacterial TFBSs. We apply this assay to the TetR repressor, for which few TFBSs are known. We quantify the strength of transcriptional repression for 17,765 TFBSs and show that the resulting landscape is highly rugged, with 2092 peaks. Only a few peaks convey stronger repression than the wild type. Non-additive (epistatic) interactions between mutations are frequent. Despite these hallmarks of ruggedness, most high peaks are evolutionarily accessible. They have large basins of attraction and are reached by around 20% of populations evolving on the landscape. Which high peak is reached during evolution is unpredictable and contingent on the mutational path taken. This in-depth analysis of a prokaryotic gene regulator reveals a landscape that is navigable but much more rugged than the landscapes of eukaryotic regulators.
Collapse
Affiliation(s)
- Cauã Antunes Westmann
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, Zurich, CH-8057, Switzerland
- Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, 1015, Lausanne, Switzerland
| | - Leander Goldbach
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, Zurich, CH-8057, Switzerland
- Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, 1015, Lausanne, Switzerland
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, Zurich, CH-8057, Switzerland.
- Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, 1015, Lausanne, Switzerland.
- The Santa Fe Institute, Santa Fe, NM, 87501, USA.
| |
Collapse
|
4
|
Yuan J, Dong K, Wu H, Zeng X, Liu X, Liu Y, Dai J, Yin J, Chen Y, Guo Y, Luo W, Liu N, Sun Y, Zhang S, Su B. Single-nucleus multi-omics analyses reveal cellular and molecular innovations in the anterior cingulate cortex during primate evolution. CELL GENOMICS 2024; 4:100703. [PMID: 39631404 DOI: 10.1016/j.xgen.2024.100703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2024] [Revised: 08/17/2024] [Accepted: 11/07/2024] [Indexed: 12/07/2024]
Abstract
The anterior cingulate cortex (ACC) of the human brain is involved in higher-level cognitive functions such as emotion and self-awareness. We generated profiles of human and macaque ACC gene expression and chromatin accessibility at single-nucleus resolution. We characterized the conserved patterns of gene expression, chromatin accessibility, and transcription factor binding in different cell types. Combining the published mouse data, we discovered the molecular identities and cell-lineage origin of the primate von Economo neurons (VENs). Our in vitro and in vivo experiments identified a group of primate-shared and human-specific VEN marker genes, such as PCSK6, ADAMTSL3, and CDHR3, potentially contributing to VEN morphogenesis. We demonstrated that the human-specific sequence changes account for the cellular and functional innovations in the ACC during primate evolution and human origin. These findings provide new insights into understanding the cellular composition and molecular regulation of ACC and its evolutionary role in shaping human-owned higher cognitive skills.
Collapse
Affiliation(s)
- Jiamiao Yuan
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, P.R. China; Yunnan Key Laboratory of Integrative Anthropology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650107, China; National Key Laboratory of Genetic Evolution and Animal Model, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China; National Resource Center for Non-Human Primates, Kunming Primate Research Center, and National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650107, China
| | - Kangning Dong
- School of Mathematics, Renmin University of China, Beijing 100872, China; NCMIS, CEMS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China; School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Haixu Wu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, P.R. China; Yunnan Key Laboratory of Integrative Anthropology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650107, China; National Key Laboratory of Genetic Evolution and Animal Model, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China; National Resource Center for Non-Human Primates, Kunming Primate Research Center, and National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650107, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Beijing 100101, P.R. China
| | - Xuerui Zeng
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, P.R. China; Yunnan Key Laboratory of Integrative Anthropology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650107, China; National Key Laboratory of Genetic Evolution and Animal Model, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China; National Resource Center for Non-Human Primates, Kunming Primate Research Center, and National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650107, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Beijing 100101, P.R. China
| | - Xingyan Liu
- NCMIS, CEMS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China; School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yan Liu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, P.R. China; Yunnan Key Laboratory of Integrative Anthropology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650107, China; National Key Laboratory of Genetic Evolution and Animal Model, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China; National Resource Center for Non-Human Primates, Kunming Primate Research Center, and National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650107, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Beijing 100101, P.R. China
| | - Jiapei Dai
- Wuhan Institute for Neuroscience and Neuroengineering, South-Central Minzu University, Wuhan 430074, China; Chinese Brain Bank Center, South-Central Minzu University, Wuhan 430074, China
| | - Jichao Yin
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, P.R. China; Yunnan Key Laboratory of Integrative Anthropology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650107, China; National Key Laboratory of Genetic Evolution and Animal Model, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China; National Resource Center for Non-Human Primates, Kunming Primate Research Center, and National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650107, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Beijing 100101, P.R. China
| | - Yongjie Chen
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, P.R. China; Yunnan Key Laboratory of Integrative Anthropology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650107, China; National Key Laboratory of Genetic Evolution and Animal Model, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China; National Resource Center for Non-Human Primates, Kunming Primate Research Center, and National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650107, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Beijing 100101, P.R. China
| | - Yongbo Guo
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, P.R. China; National Key Laboratory of Genetic Evolution and Animal Model, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
| | - Wenhao Luo
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, P.R. China; National Key Laboratory of Genetic Evolution and Animal Model, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
| | - Na Liu
- Wuhan Institute for Neuroscience and Neuroengineering, South-Central Minzu University, Wuhan 430074, China; Chinese Brain Bank Center, South-Central Minzu University, Wuhan 430074, China
| | - Yan Sun
- Wuhan Institute for Neuroscience and Neuroengineering, South-Central Minzu University, Wuhan 430074, China; Chinese Brain Bank Center, South-Central Minzu University, Wuhan 430074, China
| | - Shihua Zhang
- NCMIS, CEMS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China; School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China; Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310024, China.
| | - Bing Su
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, P.R. China; Yunnan Key Laboratory of Integrative Anthropology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650107, China; National Key Laboratory of Genetic Evolution and Animal Model, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China; National Resource Center for Non-Human Primates, Kunming Primate Research Center, and National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650107, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China.
| |
Collapse
|
5
|
Haseltine WA, Patarca R. The RNA Revolution in the Central Molecular Biology Dogma Evolution. Int J Mol Sci 2024; 25:12695. [PMID: 39684407 DOI: 10.3390/ijms252312695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2024] [Revised: 11/24/2024] [Accepted: 11/25/2024] [Indexed: 12/18/2024] Open
Abstract
Human genome projects in the 1990s identified about 20,000 protein-coding sequences. We are now in the RNA revolution, propelled by the realization that genes determine phenotype beyond the foundational central molecular biology dogma, stating that inherited linear pieces of DNA are transcribed to RNAs and translated into proteins. Crucially, over 95% of the genome, initially considered junk DNA between protein-coding genes, encodes essential, functionally diverse non-protein-coding RNAs, raising the gene count by at least one order of magnitude. Most inherited phenotype-determining changes in DNA are in regulatory areas that control RNA and regulatory sequences. RNAs can directly or indirectly determine phenotypes by regulating protein and RNA function, transferring information within and between organisms, and generating DNA. RNAs also exhibit high structural, functional, and biomolecular interaction plasticity and are modified via editing, methylation, glycosylation, and other mechanisms, which bestow them with diverse intra- and extracellular functions without altering the underlying DNA. RNA is, therefore, currently considered the primary determinant of cellular to populational functional diversity, disease-linked and biomolecular structural variations, and cell function regulation. As demonstrated by RNA-based coronavirus vaccines' success, RNA technology is transforming medicine, agriculture, and industry, as did the advent of recombinant DNA technology in the 1980s.
Collapse
Affiliation(s)
- William A Haseltine
- Access Health International, 384 West Lane, Ridgefield, CT 06877, USA
- Feinstein Institutes for Medical Research, 350 Community Dr, Manhasset, NY 11030, USA
| | - Roberto Patarca
- Access Health International, 384 West Lane, Ridgefield, CT 06877, USA
- Feinstein Institutes for Medical Research, 350 Community Dr, Manhasset, NY 11030, USA
| |
Collapse
|
6
|
Metzger BPH, Park Y, Starr TN, Thornton JW. Epistasis facilitates functional evolution in an ancient transcription factor. eLife 2024; 12:RP88737. [PMID: 38767330 PMCID: PMC11105156 DOI: 10.7554/elife.88737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/22/2024] Open
Abstract
A protein's genetic architecture - the set of causal rules by which its sequence produces its functions - also determines its possible evolutionary trajectories. Prior research has proposed that the genetic architecture of proteins is very complex, with pervasive epistatic interactions that constrain evolution and make function difficult to predict from sequence. Most of this work has analyzed only the direct paths between two proteins of interest - excluding the vast majority of possible genotypes and evolutionary trajectories - and has considered only a single protein function, leaving unaddressed the genetic architecture of functional specificity and its impact on the evolution of new functions. Here, we develop a new method based on ordinal logistic regression to directly characterize the global genetic determinants of multiple protein functions from 20-state combinatorial deep mutational scanning (DMS) experiments. We use it to dissect the genetic architecture and evolution of a transcription factor's specificity for DNA, using data from a combinatorial DMS of an ancient steroid hormone receptor's capacity to activate transcription from two biologically relevant DNA elements. We show that the genetic architecture of DNA recognition consists of a dense set of main and pairwise effects that involve virtually every possible amino acid state in the protein-DNA interface, but higher-order epistasis plays only a tiny role. Pairwise interactions enlarge the set of functional sequences and are the primary determinants of specificity for different DNA elements. They also massively expand the number of opportunities for single-residue mutations to switch specificity from one DNA target to another. By bringing variants with different functions close together in sequence space, pairwise epistasis therefore facilitates rather than constrains the evolution of new functions.
Collapse
Affiliation(s)
- Brian PH Metzger
- Department of Ecology and Evolution, University of ChicagoChicagoUnited States
| | - Yeonwoo Park
- Program in Genetics, Genomics, and Systems Biology, University of ChicagoChicagoUnited States
| | - Tyler N Starr
- Department of Biochemistry and Molecular Biophysics, University of ChicagoChicagoUnited States
| | - Joseph W Thornton
- Department of Ecology and Evolution, University of ChicagoChicagoUnited States
- Department of Human Genetics, University of ChicagoChicagoUnited States
| |
Collapse
|
7
|
Subramanian S, Zhang W, Nimkar S, Kamel M, O’Donnell M, Kuriyan J. Adaptive Capacity of a DNA Polymerase Clamp-loader ATPase Complex. Mol Biol Evol 2024; 41:msae013. [PMID: 38298175 PMCID: PMC10924251 DOI: 10.1093/molbev/msae013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Revised: 12/19/2023] [Accepted: 01/16/2024] [Indexed: 02/02/2024] Open
Abstract
The ability of mutations to facilitate adaptation is central to evolution. To understand how mutations can lead to functional adaptation in a complex molecular machine, we created a defective version of the T4 clamp-loader complex, which is essential for DNA replication. This variant, which is ∼5,000-fold less active than the wild type, was made by replacing the catalytic domains with those from another phage. A directed-evolution experiment revealed that multiple substitutions to a single negatively charged residue in the chimeric clamp loader-Asp 86-restore fitness to within ∼20-fold of wild type. These mutations remove an adventitious electrostatic repulsive interaction between Asp 86 and the sliding clamp. Thus, the fitness decrease of the chimeric clamp loader is caused by a reduction in affinity between the clamp loader and the clamp. Deep mutagenesis shows that the reduced fitness of the chimeric clamp loader is also compensated for by lysine and arginine substitutions of several DNA-proximal residues in the clamp loader or the sliding clamp. Our results demonstrate that there is a latent capacity for increasing the affinity of the clamp loader for DNA and the sliding clamp, such that even single-point mutations can readily compensate for the loss of function due to suboptimal interactions elsewhere.
Collapse
Affiliation(s)
- Subu Subramanian
- Department of Biochemistry, School of Medicine, Vanderbilt University, Nashville, TN, USA
| | - Weilin Zhang
- Department of Chemistry, University of California, Berkeley, Berkeley, CA, USA
| | - Siddharth Nimkar
- Department of Biochemistry, School of Medicine, Vanderbilt University, Nashville, TN, USA
| | - Mazzin Kamel
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Michael O’Donnell
- Howard Hughes Medical Institute, The Rockefeller University, New York, NY, USA
| | - John Kuriyan
- Department of Biochemistry, School of Medicine, Vanderbilt University, Nashville, TN, USA
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| |
Collapse
|
8
|
Kang CK, Kim AR. Deep molecular learning of transcriptional control of a synthetic CRE enhancer and its variants. iScience 2024; 27:108747. [PMID: 38222110 PMCID: PMC10784702 DOI: 10.1016/j.isci.2023.108747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 08/29/2023] [Accepted: 12/12/2023] [Indexed: 01/16/2024] Open
Abstract
Massively parallel reporter assay measures transcriptional activities of various cis-regulatory modules (CRMs) in a single experiment. We developed a thermodynamic computational model framework that calculates quantitative levels of gene expression directly from regulatory DNA sequences. Using the framework, we investigated the molecular mechanisms of cis-regulatory mutations of a synthetic enhancer that cause abnormal gene expression. We found that, in a human cell line, competitive binding between family transcription factors (TFs) with slightly different binding preferences significantly increases the accuracy of recapitulating the transcriptional effects of thousands of single- or multi-mutations. We also discovered that even if various harmful mutations occurred in an activator binding site, CRM could stably maintain or even increase gene expression through a certain form of competitive binding between family TFs. These findings enhance understanding the effect of SNPs and indels on CRMs and would help building robust custom-designed CRMs for biologics production and gene therapy.
Collapse
Affiliation(s)
- Chan-Koo Kang
- School of Life Science, Handong Global University, Pohang, Gyeong-Buk 37554, South Korea
- Department of Advanced Convergence, Handong Global University, Pohang, Gyeong-Buk 37554, South Korea
| | - Ah-Ram Kim
- School of Life Science, Handong Global University, Pohang, Gyeong-Buk 37554, South Korea
- Department of Advanced Convergence, Handong Global University, Pohang, Gyeong-Buk 37554, South Korea
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- School of Applied Artificial Intelligence, Handong Global University, Pohang, Gyeong-Buk 37554, South Korea
| |
Collapse
|
9
|
Shepherd MJ, Reynolds M, Pierce AP, Rice AM, Taylor TB. Transcription factor expression levels and environmental signals constrain transcription factor innovation. MICROBIOLOGY (READING, ENGLAND) 2023; 169:001378. [PMID: 37584667 PMCID: PMC10482368 DOI: 10.1099/mic.0.001378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Accepted: 07/26/2023] [Indexed: 08/17/2023]
Abstract
Evolutionary innovation of transcription factors frequently drives phenotypic diversification and adaptation to environmental change. Transcription factors can gain or lose connections to target genes, resulting in novel regulatory responses and phenotypes. However the frequency of functional adaptation varies between different regulators, even when they are closely related. To identify factors influencing propensity for innovation, we utilise a Pseudomonas fluorescens SBW25 strain rendered incapable of flagellar mediated motility in soft-agar plates via deletion of the flagellar master regulator (fleQ ). This bacterium can evolve to rescue flagellar motility via gene regulatory network rewiring of an alternative transcription factor to rescue activity of FleQ. Previously, we have identified two members (out of 22) of the RpoN-dependent enhancer binding protein (RpoN-EBP) family of transcription factors (NtrC and PFLU1132) that are capable of innovating in this way. These two transcription factors rescue motility repeatably and reliably in a strict hierarchy – with NtrC the only route in a ∆fleQ background, and PFLU1132 the only route in a ∆fleQ ∆ntrC background. However, why other members in the same transcription factor family have not been observed to rescue flagellar activity is unclear. Previous work shows that protein homology cannot explain this pattern within the protein family (RpoN-EBPs), and mutations in strains that rescued motility suggested high levels of transcription factor expression and activation drive innovation. We predict that mutations that increase expression of the transcription factor are vital to unlock evolutionary potential for innovation. Here, we construct titratable expression mutant lines for 11 of the RpoN-EBPs in P. fluorescens . We show that in five additional RpoN-EBPs (FleR, HbcR, GcsR, DctD, AauR and PFLU2209), high expression levels result in different mutations conferring motility rescue, suggesting alternative rewiring pathways. Our results indicate that expression levels (and not protein homology) of RpoN-EBPs are a key constraining factor in determining evolutionary potential for innovation. This suggests that transcription factors that can achieve high expression through few mutational changes, or transcription factors that are active in the selective environment, are more likely to innovate and contribute to adaptive gene regulatory network evolution.
Collapse
Affiliation(s)
- Matthew J. Shepherd
- Milner Centre for Evolution, Department of Life Sciences, University of Bath, Claverton Down, Bath BA2 7AY, UK
- Division of Evolution and Genomic Sciences, School of Biological Sciences, University of Manchester, Manchester, UK
| | - Mitchell Reynolds
- Milner Centre for Evolution, Department of Life Sciences, University of Bath, Claverton Down, Bath BA2 7AY, UK
| | - Aidan P. Pierce
- Milner Centre for Evolution, Department of Life Sciences, University of Bath, Claverton Down, Bath BA2 7AY, UK
| | - Alan M. Rice
- Milner Centre for Evolution, Department of Life Sciences, University of Bath, Claverton Down, Bath BA2 7AY, UK
| | - Tiffany B. Taylor
- Milner Centre for Evolution, Department of Life Sciences, University of Bath, Claverton Down, Bath BA2 7AY, UK
| |
Collapse
|
10
|
Smith GD, Ching WH, Cornejo-Páramo P, Wong ES. Decoding enhancer complexity with machine learning and high-throughput discovery. Genome Biol 2023; 24:116. [PMID: 37173718 PMCID: PMC10176946 DOI: 10.1186/s13059-023-02955-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Accepted: 04/28/2023] [Indexed: 05/15/2023] Open
Abstract
Enhancers are genomic DNA elements controlling spatiotemporal gene expression. Their flexible organization and functional redundancies make deciphering their sequence-function relationships challenging. This article provides an overview of the current understanding of enhancer organization and evolution, with an emphasis on factors that influence these relationships. Technological advancements, particularly in machine learning and synthetic biology, are discussed in light of how they provide new ways to understand this complexity. Exciting opportunities lie ahead as we continue to unravel the intricacies of enhancer function.
Collapse
Affiliation(s)
- Gabrielle D Smith
- Victor Chang Cardiac Research Institute, 405 Liverpool Street, Darlinghurst, NSW, Australia
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Kensington, NSW, Australia
| | - Wan Hern Ching
- Victor Chang Cardiac Research Institute, 405 Liverpool Street, Darlinghurst, NSW, Australia
| | - Paola Cornejo-Páramo
- Victor Chang Cardiac Research Institute, 405 Liverpool Street, Darlinghurst, NSW, Australia
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Kensington, NSW, Australia
| | - Emily S Wong
- Victor Chang Cardiac Research Institute, 405 Liverpool Street, Darlinghurst, NSW, Australia.
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Kensington, NSW, Australia.
| |
Collapse
|
11
|
Santos-Moreno J, Tasiudi E, Kusumawardhani H, Stelling J, Schaerli Y. Robustness and innovation in synthetic genotype networks. Nat Commun 2023; 14:2454. [PMID: 37117168 PMCID: PMC10147661 DOI: 10.1038/s41467-023-38033-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 04/13/2023] [Indexed: 04/30/2023] Open
Abstract
Genotype networks are sets of genotypes connected by small mutational changes that share the same phenotype. They facilitate evolutionary innovation by enabling the exploration of different neighborhoods in genotype space. Genotype networks, first suggested by theoretical models, have been empirically confirmed for proteins and RNAs. Comparative studies also support their existence for gene regulatory networks (GRNs), but direct experimental evidence is lacking. Here, we report the construction of three interconnected genotype networks of synthetic GRNs producing three distinct phenotypes in Escherichia coli. Our synthetic GRNs contain three nodes regulating each other by CRISPR interference and governing the expression of fluorescent reporters. The genotype networks, composed of over twenty different synthetic GRNs, provide robustness in face of mutations while enabling transitions to innovative phenotypes. Through realistic mathematical modeling, we quantify robustness and evolvability for the complete genotype-phenotype map and link these features mechanistically to GRN motifs. Our work thereby exemplifies how GRN evolution along genotype networks might be driving evolutionary innovation.
Collapse
Affiliation(s)
- Javier Santos-Moreno
- Department of Fundamental Microbiology, University of Lausanne, Biophore Building, 1015, Lausanne, Switzerland
- Department of Medicine and Life Sciences, Pompeu Fabra University, 00803, Barcelona, Spain
| | - Eve Tasiudi
- Department of Biosystems Science and Engineering, ETH Zurich and SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Hadiastri Kusumawardhani
- Department of Fundamental Microbiology, University of Lausanne, Biophore Building, 1015, Lausanne, Switzerland
| | - Joerg Stelling
- Department of Biosystems Science and Engineering, ETH Zurich and SIB Swiss Institute of Bioinformatics, Basel, Switzerland.
| | - Yolanda Schaerli
- Department of Fundamental Microbiology, University of Lausanne, Biophore Building, 1015, Lausanne, Switzerland.
| |
Collapse
|
12
|
Arcuschin CD, Pinkasz M, Schor IE. Mechanisms of robustness in gene regulatory networks involved in neural development. Front Mol Neurosci 2023; 16:1114015. [PMID: 36814969 PMCID: PMC9940843 DOI: 10.3389/fnmol.2023.1114015] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 01/16/2023] [Indexed: 02/08/2023] Open
Abstract
The functions of living organisms are affected by different kinds of perturbation, both internal and external, which in many cases have functional effects and phenotypic impact. The effects of these perturbations become particularly relevant for multicellular organisms with complex body patterns and cell type heterogeneity, where transcriptional programs controlled by gene regulatory networks determine, for example, the cell fate during embryonic development. Therefore, an essential aspect of development in these organisms is the ability to maintain the functionality of their genetic developmental programs even in the presence of genetic variation, changing environmental conditions and biochemical noise, a property commonly termed robustness. We discuss the implication of different molecular mechanisms of robustness involved in neurodevelopment, which is characterized by the interplay of many developmental programs at a molecular, cellular and systemic level. We specifically focus on processes affecting the function of gene regulatory networks, encompassing transcriptional regulatory elements and post-transcriptional processes such as miRNA-based regulation, but also higher order regulatory organization, such as gene network topology. We also present cases where impairment of robustness mechanisms can be associated with neurodevelopmental disorders, as well as reasons why understanding these mechanisms should represent an important part of the study of gene regulatory networks driving neural development.
Collapse
Affiliation(s)
- Camila D. Arcuschin
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Universidad de Buenos Aires—Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
- Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Marina Pinkasz
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Universidad de Buenos Aires—Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Ignacio E. Schor
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Universidad de Buenos Aires—Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
- Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
| |
Collapse
|
13
|
Fong SL, Capra JA. Function and Constraint in Enhancer Sequences with Multiple Evolutionary Origins. Genome Biol Evol 2022; 14:evac159. [PMID: 36314566 PMCID: PMC9673499 DOI: 10.1093/gbe/evac159] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/22/2022] [Indexed: 11/04/2022] Open
Abstract
Thousands of human gene regulatory enhancers are composed of sequences with multiple evolutionary origins. These evolutionarily "complex" enhancers consist of older "core" sequences and younger "derived" sequences. However, the functional relationship between the sequences of different evolutionary origins within complex enhancers is poorly understood. We evaluated the function, selective pressures, and sequence variation across core and derived components of human complex enhancers. We find that both components are older than expected from the genomic background, and complex enhancers are enriched for core and derived sequences of similar evolutionary ages. Both components show strong evidence of biochemical activity in massively parallel report assays. However, core and derived sequences have distinct transcription factor (TF)-binding preferences that are largely similar across evolutionary origins. As expected, given these signatures of function, both core and derived sequences have substantial evidence of purifying selection. Nonetheless, derived sequences exhibit weaker purifying selection than adjacent cores. Derived sequences also tolerate more common genetic variation and are enriched compared with cores for expression quantitative trait loci associated with gene expression variability in human populations. In conclusion, both core and derived sequences have strong evidence of gene regulatory function, but derived sequences have distinct constraint profiles, TF-binding preferences, and tolerance to variation compared with cores. We propose that the step-wise integration of younger derived with older core sequences has generated regulatory substrates with robust activity and the potential for functional variation. Our analyses demonstrate that synthesizing study of enhancer evolution and function can aid interpretation of regulatory sequence activity and functional variation across human populations.
Collapse
Affiliation(s)
- Sarah L Fong
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, Tennessee
| | - John A Capra
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee
- Bakar Computational Health Sciences Institute and Department of Epidemiology and Biostatistics, University of California, San Francisco
| |
Collapse
|
14
|
Wagner A. Adaptive evolvability through direct selection instead of indirect, second-order selection. JOURNAL OF EXPERIMENTAL ZOOLOGY. PART B, MOLECULAR AND DEVELOPMENTAL EVOLUTION 2022; 338:395-404. [PMID: 34254439 PMCID: PMC9786751 DOI: 10.1002/jez.b.23071] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 05/11/2021] [Accepted: 06/04/2021] [Indexed: 12/30/2022]
Abstract
Can evolvability itself be the product of adaptive evolution? To answer this question is challenging, because any DNA mutation that alters only evolvability is subject to indirect, "second order" selection on the future effects of this mutation. Such indirect selection is weaker than "first-order" selection on mutations that alter fitness, in the sense that it can operate only under restrictive conditions. Here I discuss a route to adaptive evolvability that overcomes this challenge. Specifically, a recent evolution experiment showed that some mutations can enhance both fitness and evolvability through a combination of direct and indirect selection. Unrelated evidence from gene duplication and the evolution of gene regulation suggests that mutations with such dual effects may not be rare. Through such mutations, evolvability may increase at least in part because it provides an adaptive advantage. These observations suggest a research program on the adaptive evolution of evolvability, which aims to identify such mutations and to disentangle their direct fitness effects from their indirect effects on evolvability. If evolvability is itself adaptive, Darwinian evolution may have created more than life's diversity. It may also have helped create the very conditions that made the success of Darwinian evolution possible.
Collapse
Affiliation(s)
- Andreas Wagner
- Department of Evolutionary Biology and Environmental StudiesUniversity of ZurichZurichSwitzerland,Swiss Institute of BioinformaticsQuartier Sorge‐Batiment GenopodeLausanneSwitzerland,The Santa Fe InstituteSanta FeNew MexicoUSA,Stellenbosch Institute for Advanced Study, Wallenberg Research Centre at Stellenbosch UniversityStellenboschSouth Africa
| |
Collapse
|
15
|
Wisniewska A, Wons E, Potrykus K, Hinrichs R, Gucwa K, Graumann PL, Mruk I. Molecular basis for lethal cross-talk between two unrelated bacterial transcription factors - the regulatory protein of a restriction-modification system and the repressor of a defective prophage. Nucleic Acids Res 2022; 50:10964-10980. [PMID: 36271797 DOI: 10.1093/nar/gkac914] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 09/29/2022] [Accepted: 10/06/2022] [Indexed: 11/13/2022] Open
Abstract
Bacterial gene expression depends on the efficient functioning of global transcriptional networks, however their interconnectivity and orchestration rely mainly on the action of individual DNA binding proteins called transcription factors (TFs). TFs interact not only with their specific target sites, but also with secondary (off-target) sites, and vary in their promiscuity. It is not clear yet what mechanisms govern the interactions with secondary sites, and how such rewiring affects the overall regulatory network, but this could clearly constrain horizontal gene transfer. Here, we show the molecular mechanism of one such off-target interaction between two unrelated TFs in Escherichia coli: the C regulatory protein of a Type II restriction-modification system, and the RacR repressor of a defective prophage. We reveal that the C protein interferes with RacR repressor expression, resulting in derepression of the toxic YdaT protein. These results also provide novel insights into regulation of the racR-ydaST operon. We mapped the C regulator interaction to a specific off-target site, and also visualized C protein dynamics, revealing intriguing differences in single molecule dynamics in different genetic contexts. Our results demonstrate an apparent example of horizontal gene transfer leading to adventitious TF cross-talk with negative effects on the recipient's viability. More broadly, this study represents an experimentally-accessible model of a regulatory constraint on horizontal gene transfer.
Collapse
Affiliation(s)
- Aleksandra Wisniewska
- Department of Microbiology, Faculty of Biology, University of Gdansk, Wita Stwosza 59, Gdansk 80-308, Poland
| | - Ewa Wons
- Department of Microbiology, Faculty of Biology, University of Gdansk, Wita Stwosza 59, Gdansk 80-308, Poland
| | - Katarzyna Potrykus
- Department of Bacterial Molecular Genetics, Faculty of Biology, University of Gdansk, Wita Stwosza 59, Gdansk 80-308, Poland
| | - Rebecca Hinrichs
- SYNMIKRO, LOEWE Center for Synthetic Microbiology, Philipps Universität Marburg, Germany.,Department of Chemistry, Philipps Universität Marburg, Hans-Meerwein-Strasse 6, 35032 Marburg, Germany
| | - Katarzyna Gucwa
- Department of Microbiology, Faculty of Biology, University of Gdansk, Wita Stwosza 59, Gdansk 80-308, Poland
| | - Peter L Graumann
- SYNMIKRO, LOEWE Center for Synthetic Microbiology, Philipps Universität Marburg, Germany.,Department of Chemistry, Philipps Universität Marburg, Hans-Meerwein-Strasse 6, 35032 Marburg, Germany
| | - Iwona Mruk
- Department of Microbiology, Faculty of Biology, University of Gdansk, Wita Stwosza 59, Gdansk 80-308, Poland
| |
Collapse
|
16
|
Srivastava M, Payne JL. On the incongruence of genotype-phenotype and fitness landscapes. PLoS Comput Biol 2022; 18:e1010524. [PMID: 36121840 PMCID: PMC9521842 DOI: 10.1371/journal.pcbi.1010524] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Revised: 09/29/2022] [Accepted: 08/30/2022] [Indexed: 11/22/2022] Open
Abstract
The mapping from genotype to phenotype to fitness typically involves multiple nonlinearities that can transform the effects of mutations. For example, mutations may contribute additively to a phenotype, but their effects on fitness may combine non-additively because selection favors a low or intermediate value of that phenotype. This can cause incongruence between the topographical properties of a fitness landscape and its underlying genotype-phenotype landscape. Yet, genotype-phenotype landscapes are often used as a proxy for fitness landscapes to study the dynamics and predictability of evolution. Here, we use theoretical models and empirical data on transcription factor-DNA interactions to systematically study the incongruence of genotype-phenotype and fitness landscapes when selection favors a low or intermediate phenotypic value. Using the theoretical models, we prove a number of fundamental results. For example, selection for low or intermediate phenotypic values does not change simple sign epistasis into reciprocal sign epistasis, implying that genotype-phenotype landscapes with only simple sign epistasis motifs will always give rise to single-peaked fitness landscapes under such selection. More broadly, we show that such selection tends to create fitness landscapes that are more rugged than the underlying genotype-phenotype landscape, but this increased ruggedness typically does not frustrate adaptive evolution because the local adaptive peaks in the fitness landscape tend to be nearly as tall as the global peak. Many of these results carry forward to the empirical genotype-phenotype landscapes, which may help to explain why low- and intermediate-affinity transcription factor-DNA interactions are so prevalent in eukaryotic gene regulation.
Collapse
Affiliation(s)
- Malvika Srivastava
- Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Joshua L. Payne
- Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
17
|
Manrubia S. The simple emergence of complex molecular function. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2022; 380:20200422. [PMID: 35599566 DOI: 10.1098/rsta.2020.0422] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
At odds with a traditional view of molecular evolution that seeks a descent-with-modification relationship between functional sequences, new functions can emerge de novo with relative ease. At early times of molecular evolution, random polymers could have sufficed for the appearance of incipient chemical activity, while the cellular environment harbours a myriad of proto-functional molecules. The emergence of function is facilitated by several mechanisms intrinsic to molecular organization, such as redundant mapping of sequences into structures, phenotypic plasticity, modularity or cooperative associations between genomic sequences. It is the availability of niches in the molecular ecology that filters new potentially functional proposals. New phenotypes and subsequent levels of molecular complexity could be attained through combinatorial explorations of currently available molecular variants. Natural selection does the rest. This article is part of the theme issue 'Emergent phenomena in complex physical and socio-technical systems: from cells to societies'.
Collapse
Affiliation(s)
- Susanna Manrubia
- Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain
- Systems Biology Department, National Biotechnology Centre (CSIC), c/Darwin 3, 28049 Madrid, Spain
| |
Collapse
|
18
|
Jayaraman V, Toledo‐Patiño S, Noda‐García L, Laurino P. Mechanisms of protein evolution. Protein Sci 2022; 31:e4362. [PMID: 35762715 PMCID: PMC9214755 DOI: 10.1002/pro.4362] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 05/11/2022] [Accepted: 05/14/2022] [Indexed: 11/06/2022]
Abstract
How do proteins evolve? How do changes in sequence mediate changes in protein structure, and in turn in function? This question has multiple angles, ranging from biochemistry and biophysics to evolutionary biology. This review provides a brief integrated view of some key mechanistic aspects of protein evolution. First, we explain how protein evolution is primarily driven by randomly acquired genetic mutations and selection for function, and how these mutations can even give rise to completely new folds. Then, we also comment on how phenotypic protein variability, including promiscuity, transcriptional and translational errors, may also accelerate this process, possibly via "plasticity-first" mechanisms. Finally, we highlight open questions in the field of protein evolution, with respect to the emergence of more sophisticated protein systems such as protein complexes, pathways, and the emergence of pre-LUCA enzymes.
Collapse
Affiliation(s)
- Vijay Jayaraman
- Department of Molecular Cell BiologyWeizmann Institute of ScienceRehovotIsrael
| | - Saacnicteh Toledo‐Patiño
- Protein Engineering and Evolution UnitOkinawa Institute of Science and Technology Graduate UniversityOkinawaJapan
| | - Lianet Noda‐García
- Department of Plant Pathology and Microbiology, Institute of Environmental Sciences, Robert H. Smith Faculty of Agriculture, Food and EnvironmentHebrew University of JerusalemRehovotIsrael
| | - Paola Laurino
- Protein Engineering and Evolution UnitOkinawa Institute of Science and Technology Graduate UniversityOkinawaJapan
| |
Collapse
|
19
|
Schweizer G, Wagner A. Both Binding Strength and Evolutionary Accessibility Affect the Population Frequency of Transcription Factor Binding Sequences in Arabidopsis thaliana. Genome Biol Evol 2021; 13:6459646. [PMID: 34894231 PMCID: PMC8712246 DOI: 10.1093/gbe/evab273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/06/2021] [Indexed: 11/22/2022] Open
Abstract
Mutations in DNA sequences that bind transcription factors and thus modulate gene expression are a source of adaptive variation in gene expression. To understand how transcription factor binding sequences evolve in natural populations of the thale cress Arabidopsis thaliana, we integrated genomic polymorphism data for loci bound by transcription factors with in vitro data on binding affinity for these transcription factors. Specifically, we studied 19 different transcription factors, and the allele frequencies of 8,333 genomic loci bound in vivo by these transcription factors in 1,135 A. thaliana accessions. We find that transcription factor binding sequences show very low genetic diversity, suggesting that they are subject to purifying selection. High frequency alleles of such binding sequences tend to bind transcription factors strongly. Conversely, alleles that are absent from the population tend to bind them weakly. In addition, alleles with high frequencies also tend to be the endpoints of many accessible evolutionary paths leading to these alleles. We show that both high affinity and high evolutionary accessibility contribute to high allele frequency for at least some transcription factors. Although binding sequences with stronger affinity are more frequent, we did not find them to be associated with higher gene expression levels. Epistatic interactions among individual mutations that alter binding affinity are pervasive and can help explain variation in accessibility among binding sequences. In summary, combining in vitro binding affinity data with in vivo binding sequence data can help understand the forces that affect the evolution of transcription factor binding sequences in natural populations.
Collapse
Affiliation(s)
- Gabriel Schweizer
- Department of Evolutionary Biology and Environmental Studies, University of Zürich, Switzerland.,Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, Lausanne, Switzerland
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental Studies, University of Zürich, Switzerland.,Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, Lausanne, Switzerland.,Santa Fe Institute, Santa Fe, New Mexico, USA.,Stellenbosch Institute for Advanced Study (STIAS), Wallenberg Research Centre at Stellenbosch University, South Africa
| |
Collapse
|
20
|
Pu J, Wang Z, Cong H, Chin JSR, Justen J, Finet C, Yew JY, Chung H. Repression precedes independent evolutionary gains of a highly specific gene expression pattern. Cell Rep 2021; 37:109896. [PMID: 34706247 PMCID: PMC8578697 DOI: 10.1016/j.celrep.2021.109896] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 08/24/2021] [Accepted: 10/06/2021] [Indexed: 12/12/2022] Open
Abstract
Highly specific expression patterns can be caused by the overlapping activities of activator and repressor sequences in enhancers. However, few studies illuminate how these sequences evolve in the origin of new enhancers. Here, we show that expression of the bond gene in the semicircular wall epithelium (swe) of the Drosophila melanogaster male ejaculatory bulb (EB) is controlled by an enhancer consisting of an activator region that requires Abdominal-B driving expression in the entire EB and a repressor region that restricts this expression to the EB swe. Although this expression pattern is independently gained in the distantly related Scaptodrosophila lebanonensis and does not require Abdominal-B, we show that functionally similar repressor sequences are present in Scaptodrosophila and also in species that do not express bond in the EB. We suggest that during enhancer evolution, repressor sequences can precede the evolution of activator sequences and may lead to similar but independently evolved expression patterns. Pu et al. show that the independent gain of a highly specific expression pattern across distantly related species may be because of the preexistence of repressor sequences that precedes the diversification of these species. This may reflect a general mechanism underlying the evolution of highly specific enhancers.
Collapse
Affiliation(s)
- Jian Pu
- Department of Entomology, Michigan State University, East Lansing, MI 48824, USA.
| | - Zinan Wang
- Department of Entomology, Michigan State University, East Lansing, MI 48824, USA; Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, MI 48824, USA
| | - Haosu Cong
- Department of Entomology, Michigan State University, East Lansing, MI 48824, USA
| | - Jacqueline S R Chin
- Singapore Institute for Clinical Sciences, Agency for Science, Technology and Research (A(∗)STAR), Brenner Centre for Molecular Medicine, Singapore 117609, Singapore
| | - Jessa Justen
- Laboratory of Cellular and Molecular Biology, University of Wisconsin, Madison, WI 53706, USA
| | - Cédric Finet
- Yale-NUS College, 16 College Avenue West, Singapore 138527, Singapore
| | - Joanne Y Yew
- Pacific Biosciences Research Center, University of Hawai'i at Mānoa, Honolulu, HI 96822, USA
| | - Henry Chung
- Department of Entomology, Michigan State University, East Lansing, MI 48824, USA; Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, MI 48824, USA.
| |
Collapse
|
21
|
Abstract
Because gene expression is important for evolutionary adaptation, its misregulation is an important cause of maladaptation. A misregulated gene can be incorrectly silent ("off") when a transcription factor (TF) that is required for its activation does not binds its regulatory region. Conversely, a misregulated gene can be incorrectly active ("on") when a TF not normally involved in its activation binds its regulatory region, a phenomenon also known as regulatory crosstalk. DNA mutations that destroy or create TF binding sites on DNA are an important source of misregulation and crosstalk. Although misregulation reduces fitness in an environment to which an organism is well-adapted, it may become adaptive in a new environment. Here, I derive simple yet general mathematical expressions that delimit the conditions under which misregulation can be adaptive. These expressions depend on the strength of selection against misregulation, on the fraction of DNA sequence space filled with TF binding sites, and on the fraction of genes that must be expressed for optimal adaptation. I then use empirical data from RNA sequencing, protein-binding microarrays, and genome evolution, together with population genetic simulations to ask when these conditions are likely to be met. I show that they can be met under realistic circumstances, but these circumstances may vary among organisms and environments. My analysis provides a framework in which improved theory and data collection can help us demonstrate the role of misregulation in adaptation. It also shows that misregulation, like DNA mutation, is one of life's many imperfections that can help propel Darwinian evolution.
Collapse
Affiliation(s)
- Andreas Wagner
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, CH-8057, Switzerland.,The Santa Fe Institute, Santa Fe, NM 87501, USA.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
22
|
Manrubia S, Cuesta JA, Aguirre J, Ahnert SE, Altenberg L, Cano AV, Catalán P, Diaz-Uriarte R, Elena SF, García-Martín JA, Hogeweg P, Khatri BS, Krug J, Louis AA, Martin NS, Payne JL, Tarnowski MJ, Weiß M. From genotypes to organisms: State-of-the-art and perspectives of a cornerstone in evolutionary dynamics. Phys Life Rev 2021; 38:55-106. [PMID: 34088608 DOI: 10.1016/j.plrev.2021.03.004] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Accepted: 03/01/2021] [Indexed: 12/21/2022]
Abstract
Understanding how genotypes map onto phenotypes, fitness, and eventually organisms is arguably the next major missing piece in a fully predictive theory of evolution. We refer to this generally as the problem of the genotype-phenotype map. Though we are still far from achieving a complete picture of these relationships, our current understanding of simpler questions, such as the structure induced in the space of genotypes by sequences mapped to molecular structures, has revealed important facts that deeply affect the dynamical description of evolutionary processes. Empirical evidence supporting the fundamental relevance of features such as phenotypic bias is mounting as well, while the synthesis of conceptual and experimental progress leads to questioning current assumptions on the nature of evolutionary dynamics-cancer progression models or synthetic biology approaches being notable examples. This work delves with a critical and constructive attitude into our current knowledge of how genotypes map onto molecular phenotypes and organismal functions, and discusses theoretical and empirical avenues to broaden and improve this comprehension. As a final goal, this community should aim at deriving an updated picture of evolutionary processes soundly relying on the structural properties of genotype spaces, as revealed by modern techniques of molecular and functional analysis.
Collapse
Affiliation(s)
- Susanna Manrubia
- Department of Systems Biology, Centro Nacional de Biotecnología (CSIC), Madrid, Spain; Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain.
| | - José A Cuesta
- Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain; Departamento de Matemáticas, Universidad Carlos III de Madrid, Leganés, Spain; Instituto de Biocomputación y Física de Sistemas Complejos (BiFi), Universidad de Zaragoza, Spain; UC3M-Santander Big Data Institute (IBiDat), Getafe, Madrid, Spain
| | - Jacobo Aguirre
- Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain; Centro de Astrobiología, CSIC-INTA, ctra. de Ajalvir km 4, 28850 Torrejón de Ardoz, Madrid, Spain
| | - Sebastian E Ahnert
- Department of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, UK; The Alan Turing Institute, British Library, 96 Euston Road, London NW1 2DB, UK
| | | | - Alejandro V Cano
- Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Pablo Catalán
- Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain; Departamento de Matemáticas, Universidad Carlos III de Madrid, Leganés, Spain
| | - Ramon Diaz-Uriarte
- Department of Biochemistry, Universidad Autónoma de Madrid, Madrid, Spain; Instituto de Investigaciones Biomédicas "Alberto Sols" (UAM-CSIC), Madrid, Spain
| | - Santiago F Elena
- Instituto de Biología Integrativa de Sistemas, I(2)SysBio (CSIC-UV), València, Spain; The Santa Fe Institute, Santa Fe, NM, USA
| | | | - Paulien Hogeweg
- Theoretical Biology and Bioinformatics Group, Utrecht University, the Netherlands
| | - Bhavin S Khatri
- The Francis Crick Institute, London, UK; Department of Life Sciences, Imperial College London, London, UK
| | - Joachim Krug
- Institute for Biological Physics, University of Cologne, Köln, Germany
| | - Ard A Louis
- Rudolf Peierls Centre for Theoretical Physics, University of Oxford, Oxford, UK
| | - Nora S Martin
- Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, Cambridge, UK; Sainsbury Laboratory, University of Cambridge, Cambridge, UK
| | - Joshua L Payne
- Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | | | - Marcel Weiß
- Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, Cambridge, UK; Sainsbury Laboratory, University of Cambridge, Cambridge, UK
| |
Collapse
|
23
|
Wagner A. Information Theory Can Help Quantify the Potential of New Phenotypes to Originate as Exaptations. Front Ecol Evol 2020. [DOI: 10.3389/fevo.2020.564071] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Exaptations are adaptive traits that do not originate de novo but from other adaptive traits. They include complex macroscopic traits, such as the middle ear bones of mammals, which originated from reptile jaw bones, but also molecular traits, such as new binding sites of transcriptional regulators. What determines whether a trait originates de novo or as an exaptation is unknown. I here use simple information theoretic concepts to quantify a molecular phenotype’s potential to give rise to new phenotypes. These quantities rely on the amount of genetic information needed to encode a phenotype. I use these quantities to estimate the propensity of new transcription factor binding phenotypes to emerge de novo or exaptively, and do so for 187 mouse transcription factors. I also use them to quantify whether an organism’s viability in one of 10 different chemical environment is likely to arise exaptively. I show that informationally expensive traits are more likely to originate exaptively. Exaptive evolution is only sometimes favored for new transcription factor binding, but it is always favored for the informationally complex metabolic phenotypes I consider. As our ability to genotype evolving populations increases, so will our ability to understand how phenotypes of ever-increasing informational complexity originate in evolution.
Collapse
|
24
|
Wong ES, Zheng D, Tan SZ, Bower NL, Garside V, Vanwalleghem G, Gaiti F, Scott E, Hogan BM, Kikuchi K, McGlinn E, Francois M, Degnan BM. Deep conservation of the enhancer regulatory code in animals. Science 2020; 370:370/6517/eaax8137. [PMID: 33154111 DOI: 10.1126/science.aax8137] [Citation(s) in RCA: 81] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2019] [Revised: 04/29/2020] [Accepted: 09/30/2020] [Indexed: 12/15/2022]
Abstract
Interactions of transcription factors (TFs) with DNA regulatory sequences, known as enhancers, specify cell identity during animal development. Unlike TFs, the origin and evolution of enhancers has been difficult to trace. We drove zebrafish and mouse developmental transcription using enhancers from an evolutionarily distant marine sponge. Some of these sponge enhancers are located in highly conserved microsyntenic regions, including an Islet enhancer in the Islet-Scaper region. We found that Islet enhancers in humans and mice share a suite of TF binding motifs with sponges, and that they drive gene expression patterns similar to those of sponge and endogenous Islet enhancers in zebrafish. Our results suggest the existence of an ancient and conserved, yet flexible, genomic regulatory syntax that has been repeatedly co-opted into cell type-specific gene regulatory networks across the animal kingdom.
Collapse
Affiliation(s)
- Emily S Wong
- School of Biological Sciences, University of Queensland, Brisbane, Australia. .,Victor Chang Cardiac Research Institute, Sydney, Australia.,School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, Australia
| | - Dawei Zheng
- Victor Chang Cardiac Research Institute, Sydney, Australia
| | - Siew Z Tan
- Institute for Molecular Biosciences, University of Queensland, Brisbane, Australia
| | - Neil L Bower
- Institute for Molecular Biosciences, University of Queensland, Brisbane, Australia
| | - Victoria Garside
- Australian Regenerative Medicine Institute, Monash University, Melbourne, Australia
| | | | - Federico Gaiti
- School of Biological Sciences, University of Queensland, Brisbane, Australia
| | - Ethan Scott
- Queensland Brain Institute, University of Queensland, Brisbane, Australia
| | - Benjamin M Hogan
- Institute for Molecular Biosciences, University of Queensland, Brisbane, Australia.,Department of Anatomy and Neuroscience and Sir Peter MacCallum Department of Oncology, University of Melbourne, Melbourne, Australia
| | - Kazu Kikuchi
- Victor Chang Cardiac Research Institute, Sydney, Australia
| | - Edwina McGlinn
- Australian Regenerative Medicine Institute, Monash University, Melbourne, Australia
| | - Mathias Francois
- Institute for Molecular Biosciences, University of Queensland, Brisbane, Australia. .,Centenary Institute, David Richmond Program for Cardio-Vascular Research: Gene Regulation and Editing, School of Life and Environmental Sciences, University of Sydney, Sydney, Australia
| | - Bernard M Degnan
- School of Biological Sciences, University of Queensland, Brisbane, Australia.
| |
Collapse
|
25
|
Santiago-Alarcon D, Tapia-McClung H, Lerma-Hernández S, Venegas-Andraca SE. Quantum aspects of evolution: a contribution towards evolutionary explorations of genotype networks via quantum walks. J R Soc Interface 2020; 17:20200567. [PMID: 33171071 PMCID: PMC7729038 DOI: 10.1098/rsif.2020.0567] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Accepted: 10/20/2020] [Indexed: 12/14/2022] Open
Abstract
Quantum biology seeks to explain biological phenomena via quantum mechanisms, such as enzyme reaction rates via tunnelling and photosynthesis energy efficiency via coherent superposition of states. However, less effort has been devoted to study the role of quantum mechanisms in biological evolution. In this paper, we used transcription factor networks with two and four different phenotypes, and used classical random walks (CRW) and quantum walks (QW) to compare network search behaviour and efficiency at finding novel phenotypes between CRW and QW. In the network with two phenotypes, at temporal scales comparable to decoherence time TD, QW are as efficient as CRW at finding new phenotypes. In the case of the network with four phenotypes, the QW had a higher probability of mutating to a novel phenotype than the CRW, regardless of the number of mutational steps (i.e. 1, 2 or 3) away from the new phenotype. Before quantum decoherence, the QW probabilities become higher turning the QW effectively more efficient than CRW at finding novel phenotypes under different starting conditions. Thus, our results warrant further exploration of the QW under more realistic network scenarios (i.e. larger genotype networks) in both closed and open systems (e.g. by considering Lindblad terms).
Collapse
Affiliation(s)
- Diego Santiago-Alarcon
- Red de Biología y Conservación de Vertebrados, Instituto de Ecología, A.C. Carr. Antigua a Coatepec 351, Col. El Haya, C.P. 91070, Xalapa, Veracruz, Mexico
| | - Horacio Tapia-McClung
- Centro de Investigación en Inteligencia Artificial, Universidad Veracruzana, Sebastián Camacho 5, Centro, Xalapa-Enríquez, Veracruz, Mexico
| | - Sergio Lerma-Hernández
- Facultad de Física, Universidad Veracruzana, Circuito Aguirre Beltrán s/n, Xalapa, Veracruz 91000, Mexico
| | - Salvador E. Venegas-Andraca
- Tecnologico de Monterrey, Escuela de Ingenieria y Ciencias, Avenue Eugenio Garza Sada 2501, Monterrey 64849, Nuevo Leon, Mexico
| |
Collapse
|
26
|
Cano AV, Payne JL. Mutation bias interacts with composition bias to influence adaptive evolution. PLoS Comput Biol 2020; 16:e1008296. [PMID: 32986712 PMCID: PMC7571706 DOI: 10.1371/journal.pcbi.1008296] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Revised: 10/19/2020] [Accepted: 08/30/2020] [Indexed: 11/19/2022] Open
Abstract
Mutation is a biased stochastic process, with some types of mutations occurring more frequently than others. Previous work has used synthetic genotype-phenotype landscapes to study how such mutation bias affects adaptive evolution. Here, we consider 746 empirical genotype-phenotype landscapes, each of which describes the binding affinity of target DNA sequences to a transcription factor, to study the influence of mutation bias on adaptive evolution of increased binding affinity. By using empirical genotype-phenotype landscapes, we need to make only few assumptions about landscape topography and about the DNA sequences that each landscape contains. The latter is particularly important because the set of sequences that a landscape contains determines the types of mutations that can occur along a mutational path to an adaptive peak. That is, landscapes can exhibit a composition bias—a statistical enrichment of a particular type of mutation relative to a null expectation, throughout an entire landscape or along particular mutational paths—that is independent of any bias in the mutation process. Our results reveal the way in which composition bias interacts with biases in the mutation process under different population genetic conditions, and how such interaction impacts fundamental properties of adaptive evolution, such as its predictability, as well as the evolution of genetic diversity and mutational robustness. Mutation is often depicted as a random process due its unpredictable nature. However, such randomness does not imply uniformly distributed outcomes, because some DNA sequence changes happen more frequently than others. Mutation bias can be an orienting factor in adaptive evolution, influencing the mutational trajectories populations follow toward higher-fitness genotypes. Because these trajectories are typically just a small subset of all possible mutational trajectories, they can exhibit composition bias—an enrichment of a particular kind of DNA sequence change, such as transition or transversion mutations. Here, we use empirical data from eukaryotic transcriptional regulation to study how mutation bias and composition bias interact to influence adaptive evolution.
Collapse
Affiliation(s)
- Alejandro V. Cano
- Institute of Integrative Biology, ETH, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Joshua L. Payne
- Institute of Integrative Biology, ETH, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- * E-mail:
| |
Collapse
|
27
|
Genotype networks of 80 quantitative Arabidopsis thaliana phenotypes reveal phenotypic evolvability despite pervasive epistasis. PLoS Comput Biol 2020; 16:e1008082. [PMID: 32790763 PMCID: PMC7447023 DOI: 10.1371/journal.pcbi.1008082] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Revised: 08/25/2020] [Accepted: 06/22/2020] [Indexed: 12/23/2022] Open
Abstract
We study the genotype-phenotype maps of 80 quantitative phenotypes in the model plant Arabidopsis thaliana, by representing the genotypes affecting each phenotype as a genotype network. In such a network, each vertex or node corresponds to an individual's genotype at all those genomic loci that affect a given phenotype. Two vertices are connected by an edge if the associated genotypes differ in exactly one nucleotide. The 80 genotype networks we analyze are based on data from genome-wide association studies of 199 A. thaliana accessions. They form connected graphs whose topography differs substantially among phenotypes. We focus our analysis on the incidence of epistasis (non-additive interactions among mutations) because a high incidence of epistasis can reduce the accessibility of evolutionary paths towards high or low phenotypic values. We find epistatic interactions in 67 phenotypes, and in 51 phenotypes every pairwise mutant interaction is epistatic. Moreover, we find phenotype-specific differences in the fraction of accessible mutational paths to maximum phenotypic values. However, even though epistasis affects the accessibility of maximum phenotypic values, the relationships between genotypic and phenotypic change of our analyzed phenotypes are sufficiently smooth that some evolutionary paths remain accessible for most phenotypes, even where epistasis is pervasive. The genotype network representation we use can complement existing approaches to understand the genetic architecture of polygenic traits in many different organisms.
Collapse
|
28
|
Weiß M, Ahnert SE. Using small samples to estimate neutral component size and robustness in the genotype-phenotype map of RNA secondary structure. J R Soc Interface 2020; 17:20190784. [PMID: 32429824 DOI: 10.1098/rsif.2019.0784] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
In genotype-phenotype (GP) maps, the genotypes that map to the same phenotype are usually not randomly distributed across the space of genotypes, but instead are predominantly connected through one-point mutations, forming network components that are commonly referred to as neutral components (NCs). Because of their impact on evolutionary processes, the characteristics of these NCs, like their size or robustness, have been studied extensively. Here, we introduce a framework that allows the estimation of NC size and robustness in the GP map of RNA secondary structure. The advantage of this framework is that it only requires small samples of genotypes and their local environment, which also allows experimental realizations. We verify our framework by applying it to the exhaustively analysable GP map of RNA sequence length L = 15, and benchmark it against an existing method by applying it to longer, naturally occurring functional non-coding RNA sequences. Although it is specific to the RNA secondary structure GP map in the first place, our framework can probably be transferred and adapted to other sequence-to-structure GP maps.
Collapse
Affiliation(s)
- Marcel Weiß
- Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, JJ Thomson Avenue, Cambridge CB3 0HE, UK.,Sainsbury Laboratory, University of Cambridge, Bateman Street, Cambridge CB2 1LR, UK
| | - Sebastian E Ahnert
- Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, JJ Thomson Avenue, Cambridge CB3 0HE, UK.,Sainsbury Laboratory, University of Cambridge, Bateman Street, Cambridge CB2 1LR, UK
| |
Collapse
|
29
|
Kuo ST, Jahn RL, Cheng YJ, Chen YL, Lee YJ, Hollfelder F, Wen JD, Chou HHD. Global fitness landscapes of the Shine-Dalgarno sequence. Genome Res 2020; 30:711-723. [PMID: 32424071 PMCID: PMC7263185 DOI: 10.1101/gr.260182.119] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Accepted: 04/21/2020] [Indexed: 01/06/2023]
Abstract
Shine-Dalgarno sequences (SD) in prokaryotic mRNA facilitate protein translation by pairing with rRNA in ribosomes. Although conventionally defined as AG-rich motifs, recent genomic surveys reveal great sequence diversity, questioning how SD functions. Here, we determined the molecular fitness (i.e., translation efficiency) of 49 synthetic 9-nt SD genotypes in three distinct mRNA contexts in Escherichia coli. We uncovered generic principles governing the SD fitness landscapes: (1) Guanine contents, rather than canonical SD motifs, best predict the fitness of both synthetic and endogenous SD; (2) the genotype-fitness correlation of SD promotes its evolvability by steadily supplying beneficial mutations across fitness landscapes; and (3) the frequency and magnitude of deleterious mutations increase with background fitness, and adjacent nucleotides in SD show stronger epistasis. Epistasis results from disruption of the continuous base pairing between SD and rRNA. This “chain-breaking” epistasis creates sinkholes in SD fitness landscapes and may profoundly impact the evolution and function of prokaryotic translation initiation and other RNA-mediated processes. Collectively, our work yields functional insights into the SD sequence variation in prokaryotic genomes, identifies a simple design principle to guide bioengineering and bioinformatic analysis of SD, and illuminates the fundamentals of fitness landscapes and molecular evolution.
Collapse
Affiliation(s)
- Syue-Ting Kuo
- Department of Life Science, National Taiwan University, Taipei 10617, Taiwan
| | - Ruey-Lin Jahn
- Department of Electrical Engineering, National Taiwan University, Taipei 10617, Taiwan
| | - Yuan-Ju Cheng
- Department of Life Science, National Taiwan University, Taipei 10617, Taiwan
| | - Yi-Lan Chen
- Genome and Systems Biology Degree Program, Academia Sinica and National Taiwan University, Taipei 10617, Taiwan
| | - Yun-Ju Lee
- Department of Life Science, National Taiwan University, Taipei 10617, Taiwan
| | - Florian Hollfelder
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, United Kingdom
| | - Jin-Der Wen
- Genome and Systems Biology Degree Program, Academia Sinica and National Taiwan University, Taipei 10617, Taiwan.,Institute of Molecular and Cellular Biology, National Taiwan University, Taipei 10617, Taiwan
| | - Hsin-Hung David Chou
- Department of Life Science, National Taiwan University, Taipei 10617, Taiwan.,Genome and Systems Biology Degree Program, Academia Sinica and National Taiwan University, Taipei 10617, Taiwan
| |
Collapse
|
30
|
Gysi DM, Nowick K. Construction, comparison and evolution of networks in life sciences and other disciplines. J R Soc Interface 2020; 17:20190610. [PMID: 32370689 PMCID: PMC7276545 DOI: 10.1098/rsif.2019.0610] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2019] [Accepted: 04/09/2020] [Indexed: 12/12/2022] Open
Abstract
Network approaches have become pervasive in many research fields. They allow for a more comprehensive understanding of complex relationships between entities as well as their group-level properties and dynamics. Many networks change over time, be it within seconds or millions of years, depending on the nature of the network. Our focus will be on comparative network analyses in life sciences, where deciphering temporal network changes is a core interest of molecular, ecological, neuropsychological and evolutionary biologists. Further, we will take a journey through different disciplines, such as social sciences, finance and computational gastronomy, to present commonalities and differences in how networks change and can be analysed. Finally, we envision how borrowing ideas from these disciplines could enrich the future of life science research.
Collapse
Affiliation(s)
- Deisy Morselli Gysi
- Department of Computer Science, Interdisciplinary Center of Bioinformatics, University of Leipzig, 04109 Leipzig, Germany
- Swarm Intelligence and Complex Systems Group, Faculty of Mathematics and Computer Science, University of Leipzig, 04109 Leipzig, Germany
- Center for Complex Networks Research, Northeastern University, 177 Huntington Avenue, Boston, MA 02115, USA
| | - Katja Nowick
- Human Biology Group, Institute for Biology, Faculty of Biology, Chemistry, Pharmacy, Freie Universität Berlin, Königin-Luise-Straβe 1-3, 14195 Berlin, Germany
| |
Collapse
|
31
|
Zhou J, McCandlish DM. Minimum epistasis interpolation for sequence-function relationships. Nat Commun 2020; 11:1782. [PMID: 32286265 PMCID: PMC7156698 DOI: 10.1038/s41467-020-15512-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2019] [Accepted: 03/12/2020] [Indexed: 12/17/2022] Open
Abstract
Massively parallel phenotyping assays have provided unprecedented insight into how multiple mutations combine to determine biological function. While such assays can measure phenotypes for thousands to millions of genotypes in a single experiment, in practice these measurements are not exhaustive, so that there is a need for techniques to impute values for genotypes whose phenotypes have not been directly assayed. Here, we present an imputation method based on inferring the least epistatic possible sequence-function relationship compatible with the data. In particular, we infer the reconstruction where mutational effects change as little as possible across adjacent genetic backgrounds. The resulting models can capture complex higher-order genetic interactions near the data, but approach additivity where data is sparse or absent. We apply the method to high-throughput transcription factor binding assays and use it to explore a fitness landscape for protein G.
Collapse
Affiliation(s)
- Juannan Zhou
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - David M McCandlish
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA.
| |
Collapse
|
32
|
The relation between crosstalk and gene regulation form revisited. PLoS Comput Biol 2020; 16:e1007642. [PMID: 32097416 PMCID: PMC7059967 DOI: 10.1371/journal.pcbi.1007642] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2019] [Revised: 03/06/2020] [Accepted: 01/08/2020] [Indexed: 01/11/2023] Open
Abstract
Genes differ in the frequency at which they are expressed and in the form of regulation used to control their activity. In particular, positive or negative regulation can lead to activation of a gene in response to an external signal. Previous works proposed that the form of regulation of a gene correlates with its frequency of usage: positive regulation when the gene is frequently expressed and negative regulation when infrequently expressed. Such network design means that, in the absence of their regulators, the genes are found in their least required activity state, hence regulatory intervention is often necessary. Due to the multitude of genes and regulators, spurious binding and unbinding events, called “crosstalk”, could occur. To determine how the form of regulation affects the global crosstalk in the network, we used a mathematical model that includes multiple regulators and multiple target genes. We found that crosstalk depends non-monotonically on the availability of regulators. Our analysis showed that excess use of regulation entailed by the formerly suggested network design caused high crosstalk levels in a large part of the parameter space. We therefore considered the opposite ‘idle’ design, where the default unregulated state of genes is their frequently required activity state. We found, that ‘idle’ design minimized the use of regulation and thus minimized crosstalk. In addition, we estimated global crosstalk of S. cerevisiae using transcription factors binding data. We demonstrated that even partial network data could suffice to estimate its global crosstalk, suggesting its applicability to additional organisms. We found that S. cerevisiae estimated crosstalk is lower than that of a random network, suggesting that natural selection reduces crosstalk. In summary, our study highlights a new type of protein production cost which is typically overlooked: that of regulatory interference caused by the presence of excess regulators in the cell. It demonstrates the importance of whole-network descriptions, which could show effects missed by single-gene models. Genes differ in the frequency at which they are expressed and in the form of regulation used to control their activity. The basic level of regulation is mediated by different types of DNA-binding proteins, where each type regulates particular gene(s). We distinguish between two basic forms of regulation: positive—if a gene is activated by the binding of its regulatory protein, and negative—if it is active unless bound by its regulatory protein. Due to the multitude of genes and regulators, spurious binding and unbinding events, called “crosstalk”, could occur. How does the form of regulation, positive or negative, affect the extent of regulatory crosstalk? To address this question, we used a mathematical model integrating many genes and many regulators. As intuition suggests, we found that in most of the parameter space, crosstalk increased with the availability of regulators. We propose, that crosstalk is usually reduced when networks are designed such that minimal regulation is needed, which we call the ‘idle’ design. In other words: a frequently needed gene will use negative regulation and conversely, a scarcely needed gene will employ positive regulation. In both cases, the requirement for the regulators is minimized. In addition, we demonstrate how crosstalk can be calculated from available datasets and discuss the technical challenges in such calculation, specifically data incompleteness.
Collapse
|
33
|
Bourgeat L, Serghei A, Lesieur C. Experimental Protein Molecular Dynamics: Broadband Dielectric Spectroscopy coupled with nanoconfinement. Sci Rep 2019; 9:17988. [PMID: 31784681 PMCID: PMC6884508 DOI: 10.1038/s41598-019-54562-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2019] [Accepted: 11/05/2019] [Indexed: 11/09/2022] Open
Abstract
Protein dynamics covers multiple spatiotemporal scale processes, among which slow motions, not much understood even though they are underlying protein folding and protein functions. Protein slow motions are associated with structural heterogeneity, short-lived and poorly populated conformations, hard to detect individually. In addition, they involve collective motions of many atoms, not easily tracked by simulation and experimental devices. Here we propose a biophysical approach, coupling geometrical nanoconfinement and broadband dielectric spectroscopy (BDS), which distinguishes protein conformations by their respective molecular dynamics. In particular, protein-unfolding intermediates, usually poorly populated in macroscopic solutions are detected. The protein dynamics is observed under unusual conditions (sample nanoconfinement and dehydration) highlighting the robustness of protein structure and protein dynamics to a variety of conditions consistent with protein sustainability. The protein dielectric signals evolve with the temperature of thermal treatments indicating sensitivity to atomic and molecular interaction changes triggered by the protein thermal unfolding. As dipole fluctuations depend on both collective large-scale motions and local motions, the approach offers a prospect to track in-depth unfolding events.
Collapse
Affiliation(s)
- Laëtitia Bourgeat
- AMPERE, CNRS, Univ. Lyon, 69622, Lyon, France
- IMP, CNRS, Univ. Lyon, 69622, Lyon, France
| | | | - Claire Lesieur
- AMPERE, CNRS, Univ. Lyon, 69622, Lyon, France.
- Institut Rhônalpin des systèmes complexes, IXXI-ENS-Lyon, 69007, Lyon, France.
| |
Collapse
|
34
|
Nichol D, Robertson-Tessi M, Anderson ARA, Jeavons P. Model genotype-phenotype mappings and the algorithmic structure of evolution. J R Soc Interface 2019; 16:20190332. [PMID: 31690233 PMCID: PMC6893500 DOI: 10.1098/rsif.2019.0332] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2019] [Accepted: 10/04/2019] [Indexed: 12/13/2022] Open
Abstract
Cancers are complex dynamic systems that undergo evolution and selection. Personalized medicine approaches in the clinic increasingly rely on predictions of tumour response to one or more therapies; these predictions are complicated by the inevitable evolution of the tumour. Despite enormous amounts of data on the mutational status of cancers and numerous therapies developed in recent decades to target these mutations, many of these treatments fail after a time due to the development of resistance in the tumour. The emergence of these resistant phenotypes is not easily predicted from genomic data, since the relationship between genotypes and phenotypes, termed the genotype-phenotype (GP) mapping, is neither injective nor functional. We present a review of models of this mapping within a generalized evolutionary framework that takes into account the relation between genotype, phenotype, environment and fitness. Different modelling approaches are described and compared, and many evolutionary results are shown to be conserved across studies despite using different underlying model systems. In addition, several areas for future work that remain understudied are identified, including plasticity and bet-hedging. The GP-mapping provides a pathway for understanding the potential routes of evolution taken by cancers, which will be necessary knowledge for improving personalized therapies.
Collapse
Affiliation(s)
- Daniel Nichol
- Department of Computer Science, University of Oxford, Oxford, UK
- Department of Integrated Mathematical Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, USA
| | - Mark Robertson-Tessi
- Department of Integrated Mathematical Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, USA
| | - Alexander R. A. Anderson
- Department of Integrated Mathematical Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, USA
| | - Peter Jeavons
- Department of Computer Science, University of Oxford, Oxford, UK
| |
Collapse
|
35
|
Abstract
Evolvability is the ability of a biological system to produce phenotypic variation that is both heritable and adaptive. It has long been the subject of anecdotal observations and theoretical work. In recent years, however, the molecular causes of evolvability have been an increasing focus of experimental work. Here, we review recent experimental progress in areas as different as the evolution of drug resistance in cancer cells and the rewiring of transcriptional regulation circuits in vertebrates. This research reveals the importance of three major themes: multiple genetic and non-genetic mechanisms to generate phenotypic diversity, robustness in genetic systems, and adaptive landscape topography. We also discuss the mounting evidence that evolvability can evolve and the question of whether it evolves adaptively.
Collapse
|
36
|
Weiß M, Ahnert SE. Phenotypes can be robust and evolvable if mutations have non-local effects on sequence constraints. J R Soc Interface 2019; 15:rsif.2017.0618. [PMID: 29321270 DOI: 10.1098/rsif.2017.0618] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Accepted: 12/07/2017] [Indexed: 11/12/2022] Open
Abstract
The mapping between biological genotypes and phenotypes plays an important role in evolution, and understanding the properties of this mapping is crucial to determine the outcome of evolutionary processes. One of the most striking properties observed in several genotype-phenotype (GP) maps is the positive correlation between the robustness and evolvability of phenotypes. This implies that a phenotype can be strongly robust against mutations and at the same time evolvable to a diverse range of alternative phenotypes. Here, we examine the causes for this positive correlation by introducing two analytically tractable GP map models that follow the principles of real biological GP maps. The first model is based on gene-like GP maps, reflecting the way in which genetic sequences are organized into protein-coding genes, and the second one is based on the GP map of RNA secondary structure. For both models, we find that a positive correlation between phenotype robustness and evolvability only emerges if mutations at one sequence position can have non-local effects on the sequence constraints at another position. This highlights that non-local effects of mutations are closely related to the coexistence of robustness and evolvability in phenotypes, and are likely to be an important feature of many biological GP maps.
Collapse
Affiliation(s)
- Marcel Weiß
- Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, JJ Thomson Avenue, Cambridge CB3 0HE, UK .,Sainsbury Laboratory, University of Cambridge, Bateman Street, Cambridge CB2 1LR, UK
| | - Sebastian E Ahnert
- Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, JJ Thomson Avenue, Cambridge CB3 0HE, UK.,Sainsbury Laboratory, University of Cambridge, Bateman Street, Cambridge CB2 1LR, UK
| |
Collapse
|
37
|
Ghosh B, Sarma U, Sourjik V, Legewie S. Sharing of Phosphatases Promotes Response Plasticity in Phosphorylation Cascades. Biophys J 2019; 114:223-236. [PMID: 29320690 DOI: 10.1016/j.bpj.2017.10.037] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2017] [Revised: 10/06/2017] [Accepted: 10/17/2017] [Indexed: 01/06/2023] Open
Abstract
Sharing of positive or negative regulators between multiple targets is frequently observed in cellular signaling cascades. For instance, phosphatase sharing between multiple kinases is ubiquitous within the MAPK pathway. Here we investigate how such phosphatase sharing could shape robustness and evolvability of the phosphorylation cascade. Through modeling and evolutionary simulations, we demonstrate that 1) phosphatase sharing dramatically increases robustness of a bistable MAPK response, and 2) phosphatase-sharing cascades evolve faster than nonsharing cascades. This faster evolution is particularly pronounced when evolving from a monostable toward a bistable phenotype, whereas the transition speed of a population from a bistable to monostable response is not affected by phosphatase sharing. This property may enable the phosphatase-sharing design to adapt better in a changing environment. Analysis of the respective mutational landscapes reveal that phosphatase sharing reduces the number of limiting mutations required for transition from monostable to bistable responses, hence facilitating a faster transition to such response types. Taken together, using MAPK cascade as an example, our study offers a general theoretical framework to explore robustness and evolutionary plasticity of signal transduction cascades.
Collapse
Affiliation(s)
- Bhaswar Ghosh
- Department of Systems and Synthetic Microbiology, Max Planck Institute for Terrestrial Microbiology, Marburg, Germany; LOEWE Research Center for Synthetic Microbiology (SYNMIKRO), Marburg, Germany.
| | - Uddipan Sarma
- Modelling of Biological Networks Group, Institute of Molecular Biology (IMB), Mainz, Germany.
| | - Victor Sourjik
- Department of Systems and Synthetic Microbiology, Max Planck Institute for Terrestrial Microbiology, Marburg, Germany; LOEWE Research Center for Synthetic Microbiology (SYNMIKRO), Marburg, Germany.
| | - Stefan Legewie
- Modelling of Biological Networks Group, Institute of Molecular Biology (IMB), Mainz, Germany.
| |
Collapse
|
38
|
Bendixsen DP, Collet J, Østman B, Hayden EJ. Genotype network intersections promote evolutionary innovation. PLoS Biol 2019; 17:e3000300. [PMID: 31136568 PMCID: PMC6555535 DOI: 10.1371/journal.pbio.3000300] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2018] [Revised: 06/07/2019] [Accepted: 05/15/2019] [Indexed: 12/27/2022] Open
Abstract
Evolutionary innovations are qualitatively novel traits that emerge through evolution and increase biodiversity. The genetic mechanisms of innovation remain poorly understood. A systems view of innovation requires the analysis of genotype networks—the vast networks of genetic variants that produce the same phenotype. Innovations can occur at the intersection of two different genotype networks. However, the experimental characterization of genotype networks has been hindered by the vast number of genetic variants that need to be functionally analyzed. Here, we use high-throughput sequencing to study the fitness landscape at the intersection of the genotype networks of two catalytic RNA molecules (ribozymes). We determined the ability of numerous neighboring RNA sequences to catalyze two different chemical reactions, and we use these data as a proxy for a genotype to fitness map where two functions come in close proximity. We find extensive functional overlap, and numerous genotypes can catalyze both functions. We demonstrate through evolutionary simulations that these numerous points of intersection facilitate the discovery of a new function. However, the rate of adaptation of the new function depends upon the local ruggedness around the starting location in the genotype network. As a consequence, one direction of adaptation is more rapid than the other. We find that periods of neutral evolution increase rates of adaptation to the new function by allowing populations to spread out in their genotype network. Our study reveals the properties of a fitness landscape where genotype networks intersect and the consequences for evolutionary innovations. Our results suggest that historic innovations in natural systems may have been facilitated by overlapping genotype networks. The determination of the empirical fitness landscape at the genotypic intersection between two different catalytic RNA (ribozyme) functions reveals details about how novel traits can emerge through evolutionary innovation.
Collapse
Affiliation(s)
- Devin P. Bendixsen
- Biomolecular Sciences Graduate Programs, Boise State University, Boise, Idaho, United States of America
- * E-mail: (DPB); (EJH)
| | - James Collet
- Department of Biological Science, Boise State University, Boise, Idaho, United States of America
| | - Bjørn Østman
- Keck Graduate Institute, Claremont, California, United States of America
| | - Eric J. Hayden
- Biomolecular Sciences Graduate Programs, Boise State University, Boise, Idaho, United States of America
- Department of Biological Science, Boise State University, Boise, Idaho, United States of America
- * E-mail: (DPB); (EJH)
| |
Collapse
|
39
|
Miho E, Roškar R, Greiff V, Reddy ST. Large-scale network analysis reveals the sequence space architecture of antibody repertoires. Nat Commun 2019; 10:1321. [PMID: 30899025 PMCID: PMC6428871 DOI: 10.1038/s41467-019-09278-8] [Citation(s) in RCA: 81] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2017] [Accepted: 03/01/2019] [Indexed: 12/23/2022] Open
Abstract
The architecture of mouse and human antibody repertoires is defined by the sequence similarity networks of the clones that compose them. The major principles that define the architecture of antibody repertoires have remained largely unknown. Here, we establish a high-performance computing platform to construct large-scale networks from comprehensive human and murine antibody repertoire sequencing datasets (>100,000 unique sequences). Leveraging a network-based statistical framework, we identify three fundamental principles of antibody repertoire architecture: reproducibility, robustness and redundancy. Antibody repertoire networks are highly reproducible across individuals despite high antibody sequence dissimilarity. The architecture of antibody repertoires is robust to the removal of up to 50-90% of randomly selected clones, but fragile to the removal of public clones shared among individuals. Finally, repertoire architecture is intrinsically redundant. Our analysis provides guidelines for the large-scale network analysis of immune repertoires and may be used in the future to define disease-associated and synthetic repertoires.
Collapse
Affiliation(s)
- Enkelejda Miho
- Department of Biosystems Science and Engineering, ETH Zurich, 4058, Basel, Switzerland.,Institute of Medical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, 4132, Muttenz, Switzerland.,aiNET GmbH, c/o Switzerland Innovation Park Basel Area AG, Hochbergstrasse 60C, 4057, Basel, Switzerland
| | - Rok Roškar
- Research Informatics, Scientific IT Services, ETH Zürich, 8001, Zürich, Switzerland
| | - Victor Greiff
- Department of Immunology, University of Oslo, 0372, Oslo, Norway.
| | - Sai T Reddy
- Department of Biosystems Science and Engineering, ETH Zurich, 4058, Basel, Switzerland.
| |
Collapse
|
40
|
Hu G, Wendel JF. Cis-trans controls and regulatory novelty accompanying allopolyploidization. THE NEW PHYTOLOGIST 2019; 221:1691-1700. [PMID: 30290011 DOI: 10.1111/nph.15515] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2018] [Accepted: 09/30/2018] [Indexed: 05/20/2023]
Abstract
Allopolyploidy is a prevalent process in plants, having important physiological, ecological and evolutionary consequences. Transcriptomic responses to genomic merger and doubling have been demonstrated in many allopolyploid systems, encompassing a diversity of phenomena including homoeolog expression bias, genome dominance, expression-level dominance and revamping of co-expression networks. Notwithstanding the foregoing, there remains a need to develop a conceptual framework that will stimulate a deeper understanding of these diverse phenomena and their mechanistic interrelationships. Here we introduce considerations relevant to this framework with a focus on cis-trans interactions among duplicated genes and alleles in hybrids and allopolyploids. By extending classic allele-specific expression analysis to the allopolyploid level, we distinguish the distinct effects of progenitor regulatory interactions from the novel intergenomic interactions that arise from genome merger and allopolyploidization. This perspective informs experiments designed to reveal the molecular genetic basis of gene regulatory control, and will facilitate the disentangling of genetic from epigenetic and higher-order effects that impact gene expression. Finally, we suggest that the extended cis-trans model may help conceptually unify several presently disparate hallmarks of allopolyploid evolution, including genome-wide expression dominance and biased fractionation, and lead to a new level of understanding of phenotypic novelty accompanying polyploidy.
Collapse
Affiliation(s)
- Guanjing Hu
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA
| | - Jonathan F Wendel
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA
| |
Collapse
|
41
|
Layers of Cryptic Genetic Variation Underlie a Yeast Complex Trait. Genetics 2019; 211:1469-1482. [PMID: 30787041 PMCID: PMC6456305 DOI: 10.1534/genetics.119.301907] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2019] [Accepted: 02/14/2019] [Indexed: 01/13/2023] Open
Abstract
To better understand cryptic genetic variation, Lee et al. comprehensively map the genetic basis of a trait that is typically suppressed in a yeast cross. By determining how three different genetic perturbations give rise... Cryptic genetic variation may be an important contributor to heritable traits, but its extent and regulation are not fully understood. Here, we investigate the cryptic genetic variation underlying a Saccharomyces cerevisiae colony phenotype that is typically suppressed in a cross of the laboratory strain BY4716 (BY) and a derivative of the clinical isolate 322134S (3S). To do this, we comprehensively dissect the trait’s genetic basis in the BYx3S cross in the presence of three different genetic perturbations that enable its expression. This allows us to detect and compare the specific loci that interact with each perturbation to produce the trait. In total, we identify 21 loci, all but one of which interact with just a subset of the perturbations. Beyond impacting which loci contribute to the trait, the genetic perturbations also alter the extent of additivity, epistasis, and genotype–environment interaction among the detected loci. Additionally, we show that the single locus interacting with all three perturbations corresponds to the coding region of the cell surface gene FLO11. While nearly all of the other remaining loci influence FLO11 transcription in cis or trans, the perturbations tend to interact with loci in different pathways and subpathways. Our work shows how layers of cryptic genetic variation can influence complex traits. Here, these layers mainly represent different regulatory inputs into the transcription of a single key gene.
Collapse
|
42
|
Kurafeiski JD, Pinto P, Bornberg-Bauer E. Evolutionary Potential of Cis-Regulatory Mutations to Cause Rapid Changes in Transcription Factor Binding. Genome Biol Evol 2019; 11:406-414. [PMID: 30597011 PMCID: PMC6370388 DOI: 10.1093/gbe/evy269] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/11/2018] [Indexed: 01/25/2023] Open
Abstract
Transcriptional regulation is crucial for all biological processes and well investigated at the molecular level for a wide range of organisms. However, it is quite unclear how innovations, such as the activity of a novel regulatory element, evolve. In the case of transcription factor (TF) binding, both a novel TF and a novel-binding site would need to evolve concertedly. Since promiscuous functions have recently been identified as important intermediate steps in creating novel specific functions in many areas such as enzyme evolution and protein-protein interactions, we ask here how promiscuous binding of TFs to TF-binding sites (TFBSs) affects the robustness and evolvability of this tightly regulated system. Specifically, we investigate the binding behavior of several hundred TFs from different species at unprecedented breadth. Our results illustrate multiple aspects of TF-binding interactions, ranging from correlations between the strength of the interaction bond and specificity, to preferences regarding TFBS nucleotide composition in relation to both domains and binding specificity. We identified a subset of high A/T binding motifs. Motifs in this subset had many functionally neutral one-error mutants, and were bound by multiple different binding domains. Our results indicate that, especially for some TF-TFBS associations, low binding specificity confers high degrees of evolvability, that is that few mutations facilitate rapid changes in transcriptional regulation, in particular for large and old TF families. In this study we identify binding motifs exhibiting behavior indicating high evolutionary potential for innovations in transcriptional regulation.
Collapse
Affiliation(s)
| | - Paulo Pinto
- Molecular Evolution and Bioinformatics, University of Muenster, Germany
| | | |
Collapse
|
43
|
del Olmo Toledo V, Puccinelli R, Fordyce PM, Pérez JC. Diversification of DNA binding specificities enabled SREBP transcription regulators to expand the repertoire of cellular functions that they govern in fungi. PLoS Genet 2018; 14:e1007884. [PMID: 30596634 PMCID: PMC6329520 DOI: 10.1371/journal.pgen.1007884] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2018] [Revised: 01/11/2019] [Accepted: 12/08/2018] [Indexed: 01/08/2023] Open
Abstract
The Sterol Regulatory Element Binding Proteins (SREBPs) are basic-helix-loop-helix transcription regulators that control the expression of sterol biosynthesis genes in higher eukaryotes and some fungi. Surprisingly, SREBPs do not regulate sterol biosynthesis in the ascomycete yeasts (Saccharomycotina) as this role was handed off to an unrelated transcription regulator in this clade. The SREBPs, nonetheless, expanded in fungi such as the ascomycete yeasts Candida spp., raising questions about their role and evolution in these organisms. Here we report that the fungal SREBPs diversified their DNA binding preferences concomitantly with an expansion in function. We establish that several branches of fungal SREBPs preferentially bind non-palindromic DNA sequences, in contrast to the palindromic DNA motifs recognized by most basic-helix-loop-helix proteins (including SREBPs) in higher eukaryotes. Reconstruction and biochemical characterization of the likely ancestor protein suggest that an intrinsic DNA binding promiscuity in the family was resolved by alternative mechanisms in different branches of fungal SREBPs. Furthermore, we show that two SREBPs in the human commensal yeast Candida albicans drive a transcriptional cascade that inhibits a morphological switch under anaerobic conditions. Preventing this morphological transition enhances C. albicans colonization of the mammalian intestine, the fungus' natural niche. Thus, our results illustrate how diversification in DNA binding preferences enabled the functional expansion of a family of eukaryotic transcription regulators.
Collapse
Affiliation(s)
- Valentina del Olmo Toledo
- Interdisciplinary Center for Clinical Research, University Hospital Würzburg, Würzburg, Germany
- Institute for Molecular Infection Biology, University Würzburg, Würzburg, Germany
| | - Robert Puccinelli
- Department of Genetics, Stanford University, Stanford, California, United States of America
- Chan Zuckerberg Biohub, San Francisco, California, United States of America
| | - Polly M. Fordyce
- Department of Genetics, Stanford University, Stanford, California, United States of America
- Chan Zuckerberg Biohub, San Francisco, California, United States of America
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
- Stanford CheM-H Institute, Stanford University, Stanford, California, United States of America
| | - J. Christian Pérez
- Interdisciplinary Center for Clinical Research, University Hospital Würzburg, Würzburg, Germany
- Institute for Molecular Infection Biology, University Würzburg, Würzburg, Germany
- * E-mail:
| |
Collapse
|
44
|
Huang JH, Kwan RSY, Tsai ZTY, Lin TC, Tsai HK. Borders of Cis-Regulatory DNA Sequences Preferentially Harbor the Divergent Transcription Factor Binding Motifs in the Human Genome. Front Genet 2018; 9:571. [PMID: 30524473 PMCID: PMC6261980 DOI: 10.3389/fgene.2018.00571] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2018] [Accepted: 11/06/2018] [Indexed: 11/17/2022] Open
Abstract
Changes in cis-regulatory DNA sequences and transcription factor (TF) repertoires provide major sources of phenotypic diversity that shape the evolution of gene regulation in eukaryotes. The DNA-binding specificities of TFs may be diversified or produce new variants in different eukaryotic species. However, it is currently unclear how various levels of divergence in TF DNA-binding specificities or motifs became introduced into the cis-regulatory DNA regions of the genome over evolutionary time. Here, we first estimated the evolutionary divergence levels of TF binding motifs and quantified their occurrence at DNase I-hypersensitive sites. Results from our in silico motif scan and experimentally derived chromatin immunoprecipitation (TF-ChIP) show that the divergent motifs tend to be introduced in the edges of cis-regulatory regions, which is probably accompanied by the expansion of the accessible core of promoter-associated regulatory elements during evolution. We also find that the genes neighboring the expanded cis-regulatory regions with the most divergent motifs are associated with functions like development and morphogenesis. Accordingly, we propose that the accumulation of divergent motifs in the edges of cis-regulatory regions provides a functional mechanism for the evolution of divergent regulatory circuits.
Collapse
Affiliation(s)
- Jia-Hsin Huang
- Institute of Information Science, Academia Sinica, Nankang, Taipei, Taiwan
| | | | - Zing Tsung-Yeh Tsai
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, United States
| | - Tzu-Chieh Lin
- Institute of Information Science, Academia Sinica, Nankang, Taipei, Taiwan
| | - Huai-Kuang Tsai
- Institute of Information Science, Academia Sinica, Nankang, Taipei, Taiwan
| |
Collapse
|
45
|
Ligon RA, Diaz CD, Morano JL, Troscianko J, Stevens M, Moskeland A, Laman TG, Scholes E. Evolution of correlated complexity in the radically different courtship signals of birds-of-paradise. PLoS Biol 2018; 16:e2006962. [PMID: 30457985 PMCID: PMC6245505 DOI: 10.1371/journal.pbio.2006962] [Citation(s) in RCA: 58] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Accepted: 10/17/2018] [Indexed: 12/25/2022] Open
Abstract
Ornaments used in courtship often vary wildly among species, reflecting the evolutionary interplay between mate preference functions and the constraints imposed by natural selection. Consequently, understanding the evolutionary dynamics responsible for ornament diversification has been a longstanding challenge in evolutionary biology. However, comparing radically different ornaments across species, as well as different classes of ornaments within species, is a profound challenge to understanding diversification of sexual signals. Using novel methods and a unique natural history dataset, we explore evolutionary patterns of ornament evolution in a group—the birds-of-paradise—exhibiting dramatic phenotypic diversification widely assumed to be driven by sexual selection. Rather than the tradeoff between ornament types originally envisioned by Darwin and Wallace, we found positive correlations among cross-modal (visual/acoustic) signals indicating functional integration of ornamental traits into a composite unit—the “courtship phenotype.” Furthermore, given the broad theoretical and empirical support for the idea that systemic robustness—functional overlap and interdependency—promotes evolutionary innovation, we posit that birds-of-paradise have radiated extensively through ornamental phenotype space as a consequence of the robustness in the courtship phenotype that we document at a phylogenetic scale. We suggest that the degree of robustness in courtship phenotypes among taxa can provide new insights into the relative influence of sexual and natural selection on phenotypic radiations. Animals frequently vary widely in ornamentation, even among closely related species. Understanding the patterns that underlie this variation is a significant challenge, requiring comparisons among drastically different traits—like comparing apples to oranges. Here, we use novel analytical approaches to quantify variation in ornamental diversity and richness across the wildly divergent birds-of-paradise, a textbook example of how sexual selection can profoundly shape organismal phenotypes. We find that color and acoustic complexity, along with behavior and acoustic complexity, are positively correlated across evolutionary timescales. Positive links among ornament classes suggests that selection is acting on correlated suites of traits—a composite courtship phenotype—and this integration may be partially responsible for the extreme variation in signal form that we see in birds-of-paradise.
Collapse
Affiliation(s)
- Russell A. Ligon
- Cornell Lab of Ornithology, Cornell University, Ithaca, New York, United States of America
- Department of Neurobiology and Behavior, Cornell University, Ithaca, New York, United States of America
- * E-mail:
| | - Christopher D. Diaz
- Cornell Lab of Ornithology, Cornell University, Ithaca, New York, United States of America
| | - Janelle L. Morano
- Cornell Lab of Ornithology, Cornell University, Ithaca, New York, United States of America
| | - Jolyon Troscianko
- Centre for Ecology and Conservation, College of Life and Environmental Science, University of Exeter, Penryn, United Kingdom
| | - Martin Stevens
- Centre for Ecology and Conservation, College of Life and Environmental Science, University of Exeter, Penryn, United Kingdom
| | - Annalyse Moskeland
- Cornell Lab of Ornithology, Cornell University, Ithaca, New York, United States of America
| | - Timothy G. Laman
- Museum of Comparative Zoology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Edwin Scholes
- Cornell Lab of Ornithology, Cornell University, Ithaca, New York, United States of America
| |
Collapse
|
46
|
Igler C, Lagator M, Tkačik G, Bollback JP, Guet CC. Evolutionary potential of transcription factors for gene regulatory rewiring. Nat Ecol Evol 2018; 2:1633-1643. [PMID: 30201966 DOI: 10.1038/s41559-018-0651-y] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2018] [Accepted: 07/27/2018] [Indexed: 11/09/2022]
Abstract
Gene regulatory networks evolve through rewiring of individual components-that is, through changes in regulatory connections. However, the mechanistic basis of regulatory rewiring is poorly understood. Using a canonical gene regulatory system, we quantify the properties of transcription factors that determine the evolutionary potential for rewiring of regulatory connections: robustness, tunability and evolvability. In vivo repression measurements of two repressors at mutated operator sites reveal their contrasting evolutionary potential: while robustness and evolvability were positively correlated, both were in trade-off with tunability. Epistatic interactions between adjacent operators alleviated this trade-off. A thermodynamic model explains how the differences in robustness, tunability and evolvability arise from biophysical characteristics of repressor-DNA binding. The model also uncovers that the energy matrix, which describes how mutations affect repressor-DNA binding, encodes crucial information about the evolutionary potential of a repressor. The biophysical determinants of evolutionary potential for regulatory rewiring constitute a mechanistic framework for understanding network evolution.
Collapse
Affiliation(s)
| | - Mato Lagator
- IST Austria, Am Campus 1, Klosterneuburg, Austria
| | | | - Jonathan P Bollback
- IST Austria, Am Campus 1, Klosterneuburg, Austria.,Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| | - Călin C Guet
- IST Austria, Am Campus 1, Klosterneuburg, Austria.
| |
Collapse
|
47
|
Schaerli Y, Jiménez A, Duarte JM, Mihajlovic L, Renggli J, Isalan M, Sharpe J, Wagner A. Synthetic circuits reveal how mechanisms of gene regulatory networks constrain evolution. Mol Syst Biol 2018; 14:e8102. [PMID: 30201776 PMCID: PMC6129954 DOI: 10.15252/msb.20178102] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2017] [Revised: 08/15/2018] [Accepted: 08/15/2018] [Indexed: 12/22/2022] Open
Abstract
Phenotypic variation is the raw material of adaptive Darwinian evolution. The phenotypic variation found in organismal development is biased towards certain phenotypes, but the molecular mechanisms behind such biases are still poorly understood. Gene regulatory networks have been proposed as one cause of constrained phenotypic variation. However, most pertinent evidence is theoretical rather than experimental. Here, we study evolutionary biases in two synthetic gene regulatory circuits expressed in Escherichia coli that produce a gene expression stripe-a pivotal pattern in embryonic development. The two parental circuits produce the same phenotype, but create it through different regulatory mechanisms. We show that mutations cause distinct novel phenotypes in the two networks and use a combination of experimental measurements, mathematical modelling and DNA sequencing to understand why mutations bring forth only some but not other novel gene expression phenotypes. Our results reveal that the regulatory mechanisms of networks restrict the possible phenotypic variation upon mutation. Consequently, seemingly equivalent networks can indeed be distinct in how they constrain the outcome of further evolution.
Collapse
Affiliation(s)
- Yolanda Schaerli
- Department of Fundamental Microbiology, University of Lausanne, Lausanne, Switzerland
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
| | - Alba Jiménez
- Systems Biology Program, Centre for Genomic Regulation (CRG), Universitat Pompeu Fabra, Barcelona, Spain
| | - José M Duarte
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
| | - Ljiljana Mihajlovic
- Department of Fundamental Microbiology, University of Lausanne, Lausanne, Switzerland
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
| | | | - Mark Isalan
- Department of Life Sciences, Imperial College London, London, UK
- Imperial College Centre for Synthetic Biology, Imperial College London, London, UK
| | - James Sharpe
- Systems Biology Program, Centre for Genomic Regulation (CRG), Universitat Pompeu Fabra, Barcelona, Spain
- Institucio Catalana de Recerca i Estudis Avancats (ICREA), Barcelona, Spain
- EMBL Barcelona European Molecular Biology Laboratory, Barcelona, Spain
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
- The Swiss Institute of Bioinformatics, Lausanne, Switzerland
- The Santa Fe Institute, Santa Fe, NM, USA
| |
Collapse
|
48
|
Aguirre J, Catalán P, Cuesta JA, Manrubia S. On the networked architecture of genotype spaces and its critical effects on molecular evolution. Open Biol 2018; 8:180069. [PMID: 29973397 PMCID: PMC6070719 DOI: 10.1098/rsob.180069] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2018] [Accepted: 06/12/2018] [Indexed: 12/26/2022] Open
Abstract
Evolutionary dynamics is often viewed as a subtle process of change accumulation that causes a divergence among organisms and their genomes. However, this interpretation is an inheritance of a gradualistic view that has been challenged at the macroevolutionary, ecological and molecular level. Actually, when the complex architecture of genotype spaces is taken into account, the evolutionary dynamics of molecular populations becomes intrinsically non-uniform, sharing deep qualitative and quantitative similarities with slowly driven physical systems: nonlinear responses analogous to critical transitions, sudden state changes or hysteresis, among others. Furthermore, the phenotypic plasticity inherent to genotypes transforms classical fitness landscapes into multiscapes where adaptation in response to an environmental change may be very fast. The quantitative nature of adaptive molecular processes is deeply dependent on a network-of-networks multilayered structure of the map from genotype to function that we begin to unveil.
Collapse
Affiliation(s)
- Jacobo Aguirre
- Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain
- Programa de Biología de Sistemas, Centro Nacional de Biotecnología (CSIC), Madrid, Spain
| | - Pablo Catalán
- Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain
- Departamento de Matemáticas, Universidad Carlos III de Madrid, Leganés, Madrid, Spain
| | - José A Cuesta
- Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain
- Departamento de Matemáticas, Universidad Carlos III de Madrid, Leganés, Madrid, Spain
- Instituto de Biocomputación y Física de Sistemas Complejos (BIFI), Universidad de Zaragoza, Zaragoza, Spain
- UC3M-BS Institute of Financial Big Data (IFiBiD), Universidad Carlos III de Madrid, Getafe, Madrid, Spain
| | - Susanna Manrubia
- Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain
- Programa de Biología de Sistemas, Centro Nacional de Biotecnología (CSIC), Madrid, Spain
| |
Collapse
|
49
|
Aguilar‐Rodríguez J, Peel L, Stella M, Wagner A, Payne JL. The architecture of an empirical genotype-phenotype map. Evolution 2018; 72:1242-1260. [PMID: 29676774 PMCID: PMC6055911 DOI: 10.1111/evo.13487] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2017] [Accepted: 04/03/2018] [Indexed: 12/15/2022]
Abstract
Recent advances in high-throughput technologies are bringing the study of empirical genotype-phenotype (GP) maps to the fore. Here, we use data from protein-binding microarrays to study an empirical GP map of transcription factor (TF) -binding preferences. In this map, each genotype is a DNA sequence. The phenotype of this DNA sequence is its ability to bind one or more TFs. We study this GP map using genotype networks, in which nodes represent genotypes with the same phenotype, and edges connect nodes if their genotypes differ by a single small mutation. We describe the structure and arrangement of genotype networks within the space of all possible binding sites for 525 TFs from three eukaryotic species encompassing three kingdoms of life (animal, plant, and fungi). We thus provide a high-resolution depiction of the architecture of an empirical GP map. Among a number of findings, we show that these genotype networks are "small-world" and assortative, and that they ubiquitously overlap and interface with one another. We also use polymorphism data from Arabidopsis thaliana to show how genotype network structure influences the evolution of TF-binding sites in vivo. We discuss our findings in the context of regulatory evolution.
Collapse
Affiliation(s)
- José Aguilar‐Rodríguez
- Department of Evolutionary Biology and Environmental StudiesUniversity of ZurichZurichSwitzerland
- Swiss Institute of BioinformaticsLausanneSwitzerland
- Current Address: Department of Biology, Stanford University, StanfordCA, USA; Department of Chemical and Systems Biology, Stanford UniversityStanfordCAUSA
| | - Leto Peel
- Institute of Information and Communication Technologies, Electronics and Applied MathematicsUniversité Catholique de LouvainLouvain‐la‐NeuveBelgium
- Namur Center for Complex SystemsUniversity of NamurNamurBelgium
| | - Massimo Stella
- Institute for Complex Systems Simulation, Department of Electronics and Computer ScienceUniversity of SouthamptonSouthamptonUnited Kingdom
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental StudiesUniversity of ZurichZurichSwitzerland
- Swiss Institute of BioinformaticsLausanneSwitzerland
- The Santa Fe InstituteSanta FeNew MexicoUSA
| | - Joshua L. Payne
- Swiss Institute of BioinformaticsLausanneSwitzerland
- Institute for Integrative Biology, ETHZurichSwitzerland
| |
Collapse
|
50
|
Cvekl A, Zhao Y, McGreal R, Xie Q, Gu X, Zheng D. Evolutionary Origins of Pax6 Control of Crystallin Genes. Genome Biol Evol 2018; 9:2075-2092. [PMID: 28903537 PMCID: PMC5737492 DOI: 10.1093/gbe/evx153] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/14/2017] [Indexed: 12/19/2022] Open
Abstract
The birth of novel genes, including their cell-specific transcriptional control, is a major source of evolutionary innovation. The lens-preferred proteins, crystallins (vertebrates: α- and β/γ-crystallins), provide a gateway to study eye evolution. Diversity of crystallins was thought to originate from convergent evolution through multiple, independent formation of Pax6/PaxB-binding sites within the promoters of genes able to act as crystallins. Here, we propose that αB-crystallin arose from a duplication of small heat shock protein (Hspb1-like) gene accompanied by Pax6-site and heat shock element (HSE) formation, followed by another duplication to generate the αA-crystallin gene in which HSE was converted into another Pax6-binding site. The founding β/γ-crystallin gene arose from the ancestral Hspb1-like gene promoter inserted into a Ca2+-binding protein coding region, early in the cephalochordate/tunicate lineage. Likewise, an ancestral aldehyde dehydrogenase (Aldh) gene, through multiple gene duplications, expanded into a multigene family, with specific genes expressed in invertebrate lenses (Ω-crystallin/Aldh1a9) and both vertebrate lenses (η-crystallin/Aldh1a7 and Aldh3a1) and corneas (Aldh3a1). Collectively, the present data reconstruct the evolution of diverse crystallin gene families.
Collapse
Affiliation(s)
- Ales Cvekl
- Department of Ophthalmology and Visual Sciences, Albert Einstein College of Medicine, Bronx, New York.,Department of Genetics, Albert Einstein College of Medicine, Bronx, New York
| | - Yilin Zhao
- Department of Genetics, Albert Einstein College of Medicine, Bronx, New York
| | - Rebecca McGreal
- Department of Ophthalmology and Visual Sciences, Albert Einstein College of Medicine, Bronx, New York.,Department of Genetics, Albert Einstein College of Medicine, Bronx, New York
| | - Qing Xie
- Department of Ophthalmology and Visual Sciences, Albert Einstein College of Medicine, Bronx, New York.,Department of Genetics, Albert Einstein College of Medicine, Bronx, New York
| | - Xun Gu
- Program in Bioinformatics and Computational Biology, Department of Genetics, Development, and Cell Biology, Iowa State University
| | - Deyou Zheng
- Department of Genetics, Albert Einstein College of Medicine, Bronx, New York.,Department of Neurology, Albert Einstein College of Medicine, Bronx, New York.,Department of Neuroscience, Albert Einstein College of Medicine, Bronx, New York
| |
Collapse
|