1
|
O’Brien NLV, Holland B, Engelstädter J, Ortiz-Barrientos D. The distribution of fitness effects during adaptive walks using a simple genetic network. PLoS Genet 2024; 20:e1011289. [PMID: 38787919 PMCID: PMC11156440 DOI: 10.1371/journal.pgen.1011289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 06/06/2024] [Accepted: 05/04/2024] [Indexed: 05/26/2024] Open
Abstract
The tempo and mode of adaptation depends on the availability of beneficial alleles. Genetic interactions arising from gene networks can restrict this availability. However, the extent to which networks affect adaptation remains largely unknown. Current models of evolution consider additive genotype-phenotype relationships while often ignoring the contribution of gene interactions to phenotypic variance. In this study, we model a quantitative trait as the product of a simple gene regulatory network, the negative autoregulation motif. Using forward-time genetic simulations, we measure adaptive walks towards a phenotypic optimum in both additive and network models. A key expectation from adaptive walk theory is that the distribution of fitness effects of new beneficial mutations is exponential. We found that both models instead harbored distributions with fewer large-effect beneficial alleles than expected. The network model also had a complex and bimodal distribution of fitness effects among all mutations, with a considerable density at deleterious selection coefficients. This behavior is reminiscent of the cost of complexity, where correlations among traits constrain adaptation. Our results suggest that the interactions emerging from genetic networks can generate complex and multimodal distributions of fitness effects.
Collapse
Affiliation(s)
- Nicholas L. V. O’Brien
- School of the Environment, The University of Queensland, Brisbane, Queensland, Australia
- ARC Centre of Excellence for Plant Success in Nature and Agriculture, The University of Queensland, Brisbane, QLD, Australia
| | - Barbara Holland
- School of Natural Sciences, University of Tasmania, Hobart, Tasmania, Australia
- ARC Centre of Excellence for Plant Success in Nature and Agriculture, University of Tasmania, Hobart, Tasmania, Australia
| | - Jan Engelstädter
- School of the Environment, The University of Queensland, Brisbane, Queensland, Australia
- ARC Centre of Excellence for Plant Success in Nature and Agriculture, The University of Queensland, Brisbane, QLD, Australia
| | - Daniel Ortiz-Barrientos
- School of the Environment, The University of Queensland, Brisbane, Queensland, Australia
- ARC Centre of Excellence for Plant Success in Nature and Agriculture, The University of Queensland, Brisbane, QLD, Australia
| |
Collapse
|
2
|
Roberts JM, Beck JD, Pollock TB, Bendixsen DP, Hayden EJ. RNA sequence to structure analysis from comprehensive pairwise mutagenesis of multiple self-cleaving ribozymes. eLife 2023; 12:80360. [PMID: 36655987 PMCID: PMC9901934 DOI: 10.7554/elife.80360] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 12/28/2022] [Indexed: 01/20/2023] Open
Abstract
Self-cleaving ribozymes are RNA molecules that catalyze the cleavage of their own phosphodiester backbones. These ribozymes are found in all domains of life and are also a tool for biotechnical and synthetic biology applications. Self-cleaving ribozymes are also an important model of sequence-to-function relationships for RNA because their small size simplifies synthesis of genetic variants and self-cleaving activity is an accessible readout of the functional consequence of the mutation. Here, we used a high-throughput experimental approach to determine the relative activity for every possible single and double mutant of five self-cleaving ribozymes. From this data, we comprehensively identified non-additive effects between pairs of mutations (epistasis) for all five ribozymes. We analyzed how changes in activity and trends in epistasis map to the ribozyme structures. The variety of structures studied provided opportunities to observe several examples of common structural elements, and the data was collected under identical experimental conditions to enable direct comparison. Heatmap-based visualization of the data revealed patterns indicating structural features of the ribozymes including paired regions, unpaired loops, non-canonical structures, and tertiary structural contacts. The data also revealed signatures of functionally critical nucleotides involved in catalysis. The results demonstrate that the data sets provide structural information similar to chemical or enzymatic probing experiments, but with additional quantitative functional information. The large-scale data sets can be used for models predicting structure and function and for efforts to engineer self-cleaving ribozymes.
Collapse
Affiliation(s)
- Jessica M Roberts
- Biomolecular Sciences Graduate Programs, Boise State UniversityBoiseUnited States
| | - James D Beck
- Computing PhD Program, Boise State UniversityBoiseUnited States
| | - Tanner B Pollock
- Department of Biological Science, Boise State UniversityBoiseUnited States
| | - Devin P Bendixsen
- Biomolecular Sciences Graduate Programs, Boise State UniversityBoiseUnited States
| | - Eric J Hayden
- Biomolecular Sciences Graduate Programs, Boise State UniversityBoiseUnited States
- Computing PhD Program, Boise State UniversityBoiseUnited States
- Department of Biological Science, Boise State UniversityBoiseUnited States
| |
Collapse
|
3
|
Rotrattanadumrong R, Yokobayashi Y. Experimental exploration of a ribozyme neutral network using evolutionary algorithm and deep learning. Nat Commun 2022; 13:4847. [PMID: 35977956 PMCID: PMC9385714 DOI: 10.1038/s41467-022-32538-z] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Accepted: 08/03/2022] [Indexed: 11/18/2022] Open
Abstract
A neutral network connects all genotypes with equivalent phenotypes in a fitness landscape and plays an important role in the mutational robustness and evolvability of biomolecules. In contrast to earlier theoretical works, evidence of large neutral networks has been lacking in recent experimental studies of fitness landscapes. This suggests that evolution could be constrained globally. Here, we demonstrate that a deep learning-guided evolutionary algorithm can efficiently identify neutral genotypes within the sequence space of an RNA ligase ribozyme. Furthermore, we measure the activities of all 216 variants connecting two active ribozymes that differ by 16 mutations and analyze mutational interactions (epistasis) up to the 16th order. We discover an extensive network of neutral paths linking the two genotypes and reveal that these paths might be predicted using only information from lower-order interactions. Our experimental evaluation of over 120,000 ribozyme sequences provides important empirical evidence that neutral networks can increase the accessibility and predictability of the fitness landscape. Neutral networks, which are sets of genotypes connected via single mutations that share the same phenotype, are important for evolvability. Here, the authors provide experimental evidence of a neutral network in an RNA enzyme using a high-throughput assay and deep learning.
Collapse
Affiliation(s)
- Rachapun Rotrattanadumrong
- Nucleic Acid Chemistry and Engineering Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, 9040495, Japan
| | - Yohei Yokobayashi
- Nucleic Acid Chemistry and Engineering Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, 9040495, Japan.
| |
Collapse
|
4
|
Beck JD, Roberts JM, Kitzhaber JM, Trapp A, Serra E, Spezzano F, Hayden EJ. Predicting higher-order mutational effects in an RNA enzyme by machine learning of high-throughput experimental data. Front Mol Biosci 2022; 9:893864. [PMID: 36046603 PMCID: PMC9421044 DOI: 10.3389/fmolb.2022.893864] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Accepted: 06/28/2022] [Indexed: 11/13/2022] Open
Abstract
Ribozymes are RNA molecules that catalyze biochemical reactions. Self-cleaving ribozymes are a common naturally occurring class of ribozymes that catalyze site-specific cleavage of their own phosphodiester backbone. In addition to their natural functions, self-cleaving ribozymes have been used to engineer control of gene expression because they can be designed to alter RNA processing and stability. However, the rational design of ribozyme activity remains challenging, and many ribozyme-based systems are engineered or improved by random mutagenesis and selection (in vitro evolution). Improving a ribozyme-based system often requires several mutations to achieve the desired function, but extensive pairwise and higher-order epistasis prevent a simple prediction of the effect of multiple mutations that is needed for rational design. Recently, high-throughput sequencing-based approaches have produced data sets on the effects of numerous mutations in different ribozymes (RNA fitness landscapes). Here we used such high-throughput experimental data from variants of the CPEB3 self-cleaving ribozyme to train a predictive model through machine learning approaches. We trained models using either a random forest or long short-term memory (LSTM) recurrent neural network approach. We found that models trained on a comprehensive set of pairwise mutant data could predict active sequences at higher mutational distances, but the correlation between predicted and experimentally observed self-cleavage activity decreased with increasing mutational distance. Adding sequences with increasingly higher numbers of mutations to the training data improved the correlation at increasing mutational distances. Systematically reducing the size of the training data set suggests that a wide distribution of ribozyme activity may be the key to accurate predictions. Because the model predictions are based only on sequence and activity data, the results demonstrate that this machine learning approach allows readily obtainable experimental data to be used for RNA design efforts even for RNA molecules with unknown structures. The accurate prediction of RNA functions will enable a more comprehensive understanding of RNA fitness landscapes for studying evolution and for guiding RNA-based engineering efforts.
Collapse
Affiliation(s)
| | - Jessica M. Roberts
- Biomolecular Sciences Graduate Programs, Boise State University, Boise, ID, United States
| | - Joey M. Kitzhaber
- Department of Computer Science, Boise State University, Boise, ID, United States
| | - Ashlyn Trapp
- Department of Biological Sciences, Boise State University, Boise, ID, United States
| | | | | | - Eric J. Hayden
- Biomolecular Sciences Graduate Programs, Boise State University, Boise, ID, United States
- Department of Computer Science, Boise State University, Boise, ID, United States
- *Correspondence: Eric J. Hayden,
| |
Collapse
|
5
|
Yang CH, Scarpino SV. A Family of Fitness Landscapes Modeled through Gene Regulatory Networks. ENTROPY (BASEL, SWITZERLAND) 2022; 24:622. [PMID: 35626507 PMCID: PMC9141513 DOI: 10.3390/e24050622] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Revised: 04/11/2022] [Accepted: 04/26/2022] [Indexed: 02/01/2023]
Abstract
Fitness landscapes are a powerful metaphor for understanding the evolution of biological systems. These landscapes describe how genotypes are connected to each other through mutation and related through fitness. Empirical studies of fitness landscapes have increasingly revealed conserved topographical features across diverse taxa, e.g., the accessibility of genotypes and "ruggedness". As a result, theoretical studies are needed to investigate how evolution proceeds on fitness landscapes with such conserved features. Here, we develop and study a model of evolution on fitness landscapes using the lens of Gene Regulatory Networks (GRNs), where the regulatory products are computed from multiple genes and collectively treated as phenotypes. With the assumption that regulation is a binary process, we prove the existence of empirically observed, topographical features such as accessibility and connectivity. We further show that these results hold across arbitrary fitness functions and that a trade-off between accessibility and ruggedness need not exist. Then, using graph theory and a coarse-graining approach, we deduce a mesoscopic structure underlying GRN fitness landscapes where the information necessary to predict a population's evolutionary trajectory is retained with minimal complexity. Using this coarse-graining, we develop a bottom-up algorithm to construct such mesoscopic backbones, which does not require computing the genotype network and is therefore far more efficient than brute-force approaches. Altogether, this work provides mathematical results of high-dimensional fitness landscapes and a path toward connecting theory to empirical studies.
Collapse
Affiliation(s)
- Chia-Hung Yang
- Network Science Institute, Northeastern University, Boston, MA 02115, USA
| | - Samuel V. Scarpino
- Network Science Institute, Northeastern University, Boston, MA 02115, USA
- Physics Department, Northeastern University, Boston, MA 02115, USA
- Roux Institute, Northeastern University, Boston, MA 02115, USA
- Institute for Experiential AI, Northeastern University, Boston, MA 02115, USA
- Santa Fe Institute, Santa Fe, NM 87501, USA
- Vermont Complex Systems Center, University of Vermont, Burlington, VT 05405, USA
| |
Collapse
|
6
|
Peri G, Gibard C, Shults NH, Crossin K, Hayden EJ. Dynamic RNA fitness landscapes of a group I ribozyme during changes to the experimental environment. Mol Biol Evol 2022; 39:6502289. [PMID: 35020916 PMCID: PMC8890501 DOI: 10.1093/molbev/msab373] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Fitness landscapes of protein and RNA molecules can be studied experimentally using high-throughput techniques to measure the functional effects of numerous combinations of mutations. The rugged topography of these molecular fitness landscapes is important for understanding and predicting natural and experimental evolution. Mutational effects are also dependent upon environmental conditions, but the effects of environmental changes on fitness landscapes remains poorly understood. Here, we investigate the changes to the fitness landscape of a catalytic RNA molecule while changing a single environmental variable that is critical for RNA structure and function. Using high-throughput sequencing of in vitro selections, we mapped a fitness landscape of the Azoarcus group I ribozyme under eight different concentrations of magnesium ions (1–48 mM MgCl2). The data revealed the magnesium dependence of 16,384 mutational neighbors, and from this, we investigated the magnesium induced changes to the topography of the fitness landscape. The results showed that increasing magnesium concentration improved the relative fitness of sequences at higher mutational distances while also reducing the ruggedness of the mutational trajectories on the landscape. As a result, as magnesium concentration was increased, simulated populations evolved toward higher fitness faster. Curve-fitting of the magnesium dependence of individual ribozymes demonstrated that deep sequencing of in vitro reactions can be used to evaluate the structural stability of thousands of sequences in parallel. Overall, the results highlight how environmental changes that stabilize structures can also alter the ruggedness of fitness landscapes and alter evolutionary processes.
Collapse
Affiliation(s)
- Gianluca Peri
- Biomolecular Sciences Graduate Programs, Boise State University, Boise, ID, USA
| | - Clémentine Gibard
- Department of Biological Science, Boise State University, Boise, ID, USA
| | - Nicholas H Shults
- Department of Biological Science, Boise State University, Boise, ID, USA
| | - Kent Crossin
- Department of Biological Science, Boise State University, Boise, ID, USA
| | - Eric J Hayden
- Biomolecular Sciences Graduate Programs, Boise State University, Boise, ID, USA.,Department of Biological Science, Boise State University, Boise, ID, USA
| |
Collapse
|
7
|
Lai YC, Liu Z, Chen IA. Encapsulation of ribozymes inside model protocells leads to faster evolutionary adaptation. Proc Natl Acad Sci U S A 2021; 118:e2025054118. [PMID: 34001592 PMCID: PMC8166191 DOI: 10.1073/pnas.2025054118] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Functional biomolecules, such as RNA, encapsulated inside a protocellular membrane are believed to have comprised a very early, critical stage in the evolution of life, since membrane vesicles allow selective permeability and create a unit of selection enabling cooperative phenotypes. The biophysical environment inside a protocell would differ fundamentally from bulk solution due to the microscopic confinement. However, the effect of the encapsulated environment on ribozyme evolution has not been previously studied experimentally. Here, we examine the effect of encapsulation inside model protocells on the self-aminoacylation activity of tens of thousands of RNA sequences using a high-throughput sequencing assay. We find that encapsulation of these ribozymes generally increases their activity, giving encapsulated sequences an advantage over nonencapsulated sequences in an amphiphile-rich environment. In addition, highly active ribozymes benefit disproportionately more from encapsulation. The asymmetry in fitness gain broadens the distribution of fitness in the system. Consistent with Fisher's fundamental theorem of natural selection, encapsulation therefore leads to faster adaptation when the RNAs are encapsulated inside a protocell during in vitro selection. Thus, protocells would not only provide a compartmentalization function but also promote activity and evolutionary adaptation during the origin of life.
Collapse
Affiliation(s)
- Yei-Chen Lai
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, CA 90095
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095
| | - Ziwei Liu
- Medical Research Council Laboratory of Molecular Biology, Cambridge Biomedical Campus, Cambridge CB2 0QH, United Kingdom
| | - Irene A Chen
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, CA 90095;
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095
| |
Collapse
|
8
|
Lozano-Huntelman NA, Zhou A, Tekin E, Cruz-Loya M, Østman B, Boyd S, Savage VM, Yeh P. Hidden suppressive interactions are common in higher-order drug combinations. iScience 2021; 24:102355. [PMID: 33870144 PMCID: PMC8044428 DOI: 10.1016/j.isci.2021.102355] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Revised: 01/26/2021] [Accepted: 03/22/2021] [Indexed: 11/25/2022] Open
Abstract
The rapid increase of multi-drug resistant bacteria has led to a greater emphasis on multi-drug combination treatments. However, some combinations can be suppressive—that is, bacteria grow faster in some drug combinations than when treated with a single drug. Typically, when studying interactions, the overall effect of the combination is only compared with the single-drug effects. However, doing so could miss “hidden” cases of suppression, which occur when the highest order is suppressive compared with a lower-order combination but not to a single drug. We examined an extensive dataset of 5-drug combinations and all lower-order—single, 2-, 3-, and 4-drug—combinations. We found that a majority of all combinations—54%—contain hidden suppression. Examining hidden interactions is critical to understanding the architecture of higher-order interactions and can substantially affect our understanding and predictions of the evolution of antibiotic resistance under multi-drug treatments. Most instances of suppressive interactions are missed by standard methods A majority (54%) of all antibiotic combinations tested contain hidden suppression Identifying hidden suppression can affect what combinations should be used
Collapse
Affiliation(s)
| | - April Zhou
- Ecology and Evolutionary Biology, University of California, Los Angeles, 90095, USA.,Computational and Systems Biology, University of California, Los Angeles, 90095, USA
| | - Elif Tekin
- Ecology and Evolutionary Biology, University of California, Los Angeles, 90095, USA
| | - Mauricio Cruz-Loya
- Computational and Systems Biology, University of California, Los Angeles, 90095, USA
| | - Bjørn Østman
- Ecology and Evolutionary Biology, University of California, Los Angeles, 90095, USA
| | - Sada Boyd
- Ecology and Evolutionary Biology, University of California, Los Angeles, 90095, USA
| | - Van M Savage
- Ecology and Evolutionary Biology, University of California, Los Angeles, 90095, USA.,Computational and Systems Biology, University of California, Los Angeles, 90095, USA.,Santa Fe Institute, Santa Fe, NM 87501, USA
| | - Pamela Yeh
- Ecology and Evolutionary Biology, University of California, Los Angeles, 90095, USA.,Santa Fe Institute, Santa Fe, NM 87501, USA
| |
Collapse
|
9
|
Routh S, Acharyya A, Dhar R. A two-step PCR assembly for construction of gene variants across large mutational distances. Biol Methods Protoc 2021; 6:bpab007. [PMID: 33928191 PMCID: PMC8062255 DOI: 10.1093/biomethods/bpab007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Revised: 03/09/2021] [Accepted: 04/01/2021] [Indexed: 11/14/2022] Open
Abstract
Construction of empirical fitness landscapes has transformed our understanding of genotype-phenotype relationships across genes. However, most empirical fitness landscapes have been constrained to the local genotype neighbourhood of a gene primarily due to our limited ability to systematically construct genotypes that differ by a large number of mutations. Although a few methods have been proposed in the literature, these techniques are complex owing to several steps of construction or contain a large number of amplification cycles that increase chances of non-specific mutations. A few other described methods require amplification of the whole vector, thereby increasing the chances of vector backbone mutations that can have unintended consequences for study of fitness landscapes. Thus, this has substantially constrained us from traversing large mutational distances in the genotype network, thereby limiting our understanding of the interactions between multiple mutations and the role these interactions play in evolution of novel phenotypes. In the current work, we present a simple but powerful approach that allows us to systematically and accurately construct gene variants at large mutational distances. Our approach relies on building-up small fragments containing targeted mutations in the first step followed by assembly of these fragments into the complete gene fragment by polymerase chain reaction (PCR). We demonstrate the utility of our approach by constructing variants that differ by up to 11 mutations in a model gene. Our work thus provides an accurate method for construction of multi-mutant variants of genes and therefore will transform the studies of empirical fitness landscapes by enabling exploration of genotypes that are far away from a starting genotype.
Collapse
Affiliation(s)
- Shreya Routh
- Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India
| | - Anamika Acharyya
- Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India
| | - Riddhiman Dhar
- Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India
| |
Collapse
|
10
|
Pervasive cooperative mutational effects on multiple catalytic enzyme traits emerge via long-range conformational dynamics. Nat Commun 2021; 12:1621. [PMID: 33712579 PMCID: PMC7955134 DOI: 10.1038/s41467-021-21833-w] [Citation(s) in RCA: 72] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2020] [Accepted: 01/29/2021] [Indexed: 12/11/2022] Open
Abstract
Multidimensional fitness landscapes provide insights into the molecular basis of laboratory and natural evolution. To date, such efforts usually focus on limited protein families and a single enzyme trait, with little concern about the relationship between protein epistasis and conformational dynamics. Here, we report a multiparametric fitness landscape for a cytochrome P450 monooxygenase that was engineered for the regio- and stereoselective hydroxylation of a steroid. We develop a computational program to automatically quantify non-additive effects among all possible mutational pathways, finding pervasive cooperative signs and magnitude epistasis on multiple catalytic traits. By using quantum mechanics and molecular dynamics simulations, we show that these effects are modulated by long-range interactions in loops, helices and β-strands that gate the substrate access channel allowing for optimal catalysis. Our work highlights the importance of conformational dynamics on epistasis in an enzyme involved in secondary metabolism and offers insights for engineering P450s. Connecting conformational dynamics and epistasis has so far been limited to a few proteins and a single fitness trait. Here, the authors provide evidence of positive epistasis on multiple catalytic traits in the evolution and dynamics of engineered cytochrome P450 monooxygenase, offering insights for in silico protein design.
Collapse
|
11
|
Soo VWC, Swadling JB, Faure AJ, Warnecke T. Fitness landscape of a dynamic RNA structure. PLoS Genet 2021; 17:e1009353. [PMID: 33524037 PMCID: PMC7877785 DOI: 10.1371/journal.pgen.1009353] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Revised: 02/11/2021] [Accepted: 01/12/2021] [Indexed: 11/24/2022] Open
Abstract
RNA structures are dynamic. As a consequence, mutational effects can be hard to rationalize with reference to a single static native structure. We reasoned that deep mutational scanning experiments, which couple molecular function to fitness, should capture mutational effects across multiple conformational states simultaneously. Here, we provide a proof-of-principle that this is indeed the case, using the self-splicing group I intron from Tetrahymena thermophila as a model system. We comprehensively mutagenized two 4-bp segments of the intron. These segments first come together to form the P1 extension (P1ex) helix at the 5' splice site. Following cleavage at the 5' splice site, the two halves of the helix dissociate to allow formation of an alternative helix (P10) at the 3' splice site. Using an in vivo reporter system that couples splicing activity to fitness in E. coli, we demonstrate that fitness is driven jointly by constraints on P1ex and P10 formation. We further show that patterns of epistasis can be used to infer the presence of intramolecular pleiotropy. Using a machine learning approach that allows quantification of mutational effects in a genotype-specific manner, we demonstrate that the fitness landscape can be deconvoluted to implicate P1ex or P10 as the effective genetic background in which molecular fitness is compromised or enhanced. Our results highlight deep mutational scanning as a tool to study alternative conformational states, with the capacity to provide critical insights into the structure, evolution and evolvability of RNAs as dynamic ensembles. Our findings also suggest that, in the future, deep mutational scanning approaches might help reverse-engineer multiple alternative or successive conformations from a single fitness landscape.
Collapse
Affiliation(s)
- Valerie W. C. Soo
- Medical Research Council London Institute of Medical Sciences, London, United Kingdom
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Jacob B. Swadling
- Medical Research Council London Institute of Medical Sciences, London, United Kingdom
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Andre J. Faure
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Tobias Warnecke
- Medical Research Council London Institute of Medical Sciences, London, United Kingdom
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London, United Kingdom
| |
Collapse
|
12
|
Yokobayashi Y. High-Throughput Analysis and Engineering of Ribozymes and Deoxyribozymes by Sequencing. Acc Chem Res 2020; 53:2903-2912. [PMID: 33164502 DOI: 10.1021/acs.accounts.0c00546] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Ribozymes and deoxyribozymes are catalytic RNA and DNA, respectively, that catalyze chemical reactions such as self-cleavage or ligation reactions. While some ribozymes are found in nature, a larger variety of ribozymes and deoxyribozymes have been discovered by in vitro selection from random sequences. These catalytic nucleic acids, especially ribozymes, are of fundamental interest because they are crucial for the RNA world hypothesis, which suggests that RNA played a central role in both the propagation of genetic information and catalyzing metabolic reactions in primordial life prior to the emergence of proteins and DNA. On the practical side, catalytic nucleic acids have been extensively engineered for various applications, such as biosensors and genetic devices for synthetic biology. Therefore, it is important to gain a deeper understanding of the sequence-function relationships of ribozymes and deoxyribozymes.Mutational analysis, or measurements of activities of catalytic nucleic acid mutants, is one of the most fundamental approaches for that purpose. Mutations that abolish, reduce, retain, or even increase activity provide useful information about nucleic acid catalysts for engineering and other purposes. However, methods for mutational analysis of ribozymes and deoxyribozymes have not evolved much for decades, requiring tedious and low-throughput assays (e.g., gel electrophoresis) of individually prepared mutants. This has prevented researchers from performing quantitative mutational analysis of ribozymes and deoxyribozymes on a large scale.To address this limitation, we developed a massively parallel ribozyme and deoxyribozyme assay strategy that allows >104 assays using high-throughput sequencing (HTS). We used HTS to literally count the number of cleaved (or ligated) and uncleaved (or unligated) ribozyme (or deoxyribozyme) sequences and calculated the activities of each mutant in a reaction mixture. This simple yet powerful strategy was applied to analyze the mutational effects of various natural and synthetic ribozymes and deoxyribozymes at scales impossible for conventional mutational analysis. These large-scale sequence-function data sets were used to better understand the functional consequences of mutations and to engineer ribozymes for practical applications. Furthermore, these newly available data are motivating researchers to employ more rigorous computational methods to extract additional insights such as structural information and nonlinear effects of multiple mutations. The new HTS-based assay strategy is distinct from and complementary to a related strategy that uses HTS to analyze ribozyme and deoxyribozyme populations subjected to in vitro selection. Postselection sequencing can cover a larger sequence space, although it does not directly quantify the activities of ribozyme and deoxyribozyme mutants. With further advances in DNA sequencing technologies and computational methods, there should be more opportunities to harness the power of HTS to deepen our understanding of catalytic nucleic acids and enhance our ability to engineer them for even more applications.
Collapse
Affiliation(s)
- Yohei Yokobayashi
- Nucleic Acid Chemistry and Engineering Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa 904-0495, Japan
| |
Collapse
|
13
|
Liberles DA, Chang B, Geiler-Samerotte K, Goldman A, Hey J, Kaçar B, Meyer M, Murphy W, Posada D, Storfer A. Emerging Frontiers in the Study of Molecular Evolution. J Mol Evol 2020; 88:211-226. [PMID: 32060574 PMCID: PMC7386396 DOI: 10.1007/s00239-020-09932-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
A collection of the editors of Journal of Molecular Evolution have gotten together to pose a set of key challenges and future directions for the field of molecular evolution. Topics include challenges and new directions in prebiotic chemistry and the RNA world, reconstruction of early cellular genomes and proteins, macromolecular and functional evolution, evolutionary cell biology, genome evolution, molecular evolutionary ecology, viral phylodynamics, theoretical population genomics, somatic cell molecular evolution, and directed evolution. While our effort is not meant to be exhaustive, it reflects research questions and problems in the field of molecular evolution that are exciting to our editors.
Collapse
Affiliation(s)
- David A Liberles
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, 19122, USA.
| | - Belinda Chang
- Department of Ecology and Evolutionary Biology and Department of Cell and Systems Biology, University of Toronto, 25 Harbord Street, Toronto, ON, M5S 3G5, Canada
| | - Kerry Geiler-Samerotte
- Center for Mechanisms of Evolution, School of Life Sciences, Arizona State University, Tempe, AZ, 85287, USA
| | - Aaron Goldman
- Department of Biology, Oberlin College and Conservatory, K123 Science Center, 119 Woodland Street, Oberlin, OH, 44074, USA
| | - Jody Hey
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, 19122, USA
| | - Betül Kaçar
- Department of Molecular and Cell Biology, University of Arizona, Tucson, AZ, 85721, USA
| | - Michelle Meyer
- Department of Biology, Boston College, Chestnut Hill, MA, 02467, USA
| | - William Murphy
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, 77843, USA
| | - David Posada
- Biomedical Research Center (CINBIO), University of Vigo, Vigo, Spain
| | - Andrew Storfer
- School of Biological Sciences, Washington State University, Pullman, WA, 99164, USA
| |
Collapse
|
14
|
Blanco C, Verbanic S, Seelig B, Chen IA. High throughput sequencing of in vitro selections of mRNA-displayed peptides: data analysis and applications. Phys Chem Chem Phys 2020; 22:6492-6506. [PMID: 31967131 PMCID: PMC8219182 DOI: 10.1039/c9cp05912a] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
In vitro selection using mRNA display is currently a widely used method to isolate functional peptides with desired properties. The analysis of high throughput sequencing (HTS) data from in vitro evolution experiments has proven to be a powerful technique but only recently has it been applied to mRNA display selections. In this Perspective, we introduce aspects of mRNA display and HTS that may be of interest to physical chemists. We highlight the potential of HTS to analyze in vitro selections of peptides and review recent advances in the application of HTS analysis to mRNA display experiments. We discuss some possible issues involved with HTS analysis and summarize some strategies to alleviate them. Finally, the potential for future impact of advancing HTS analysis on mRNA display experiments is discussed.
Collapse
Affiliation(s)
- Celia Blanco
- Department of Chemistry and Biochemistry, University of California, Santa Barbara, CA 93106, USA.
| | | | | | | |
Collapse
|
15
|
Kemble H, Nghe P, Tenaillon O. Recent insights into the genotype-phenotype relationship from massively parallel genetic assays. Evol Appl 2019; 12:1721-1742. [PMID: 31548853 PMCID: PMC6752143 DOI: 10.1111/eva.12846] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Revised: 06/21/2019] [Accepted: 07/02/2019] [Indexed: 12/20/2022] Open
Abstract
With the molecular revolution in Biology, a mechanistic understanding of the genotype-phenotype relationship became possible. Recently, advances in DNA synthesis and sequencing have enabled the development of deep mutational scanning assays, capable of scoring comprehensive libraries of genotypes for fitness and a variety of phenotypes in massively parallel fashion. The resulting empirical genotype-fitness maps pave the way to predictive models, potentially accelerating our ability to anticipate the behaviour of pathogen and cancerous cell populations from sequencing data. Besides from cellular fitness, phenotypes of direct application in industry (e.g. enzyme activity) and medicine (e.g. antibody binding) can be quantified and even selected directly by these assays. This review discusses the technological basis of and recent developments in massively parallel genetics, along with the trends it is uncovering in the genotype-phenotype relationship (distribution of mutation effects, epistasis), their possible mechanistic bases and future directions for advancing towards the goal of predictive genetics.
Collapse
Affiliation(s)
- Harry Kemble
- Infection, Antimicrobials, Modelling, Evolution, INSERM, Unité Mixte de Recherche 1137Université Paris Diderot, Université Paris NordParisFrance
- École Supérieure de Physique et de Chimie Industrielles de la Ville de Paris (ESPCI Paris), UMR CNRS‐ESPCI CBI 8231PSL Research UniversityParis Cedex 05France
| | - Philippe Nghe
- École Supérieure de Physique et de Chimie Industrielles de la Ville de Paris (ESPCI Paris), UMR CNRS‐ESPCI CBI 8231PSL Research UniversityParis Cedex 05France
| | - Olivier Tenaillon
- Infection, Antimicrobials, Modelling, Evolution, INSERM, Unité Mixte de Recherche 1137Université Paris Diderot, Université Paris NordParisFrance
| |
Collapse
|
16
|
Plebanek A, Larnerd C, Popović M, Wei C, Pohorille A, Ditzler MA. Big on Change, Small on Innovation: Evolutionary Consequences of RNA Sequence Duplication. J Mol Evol 2019; 87:240-253. [PMID: 31435687 PMCID: PMC6711949 DOI: 10.1007/s00239-019-09906-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Accepted: 08/06/2019] [Indexed: 01/11/2023]
Abstract
The potential for biopolymers to evolve new structures has important consequences for their ability to optimize function and our attempts to reconstruct their evolutionary histories. Prior work with in vitro systems suggests that structural remodeling of RNA is difficult to achieve through the accumulation of point mutations or through recombination events. Sequence duplication may represent an alternative mechanism that can more readily lead to the evolution of new structures. Structural and sequence elements in many RNAs and proteins appear to be the products of duplication events, indicating that this mechanism plays a major role in molecular evolution. Despite the potential significance of this mechanism, little experimental data is available concerning the structural and evolutionary consequences of duplicating biopolymer sequences. To assess the structural consequences of sequence duplication on the evolution of RNA, we mutagenized an RNA sequence containing two copies of an ATP aptamer and subjected the resulting population to multiple in vitro evolution experiments. We identified multiple routes by which duplication, followed by the accumulation of functional point mutations, allowed our populations to sample two entirely different secondary structures. The two structures have no base pairs in common, but both structures contain two copies of the same ATP-binding motif. We do not observe the emergence of any other functional secondary structures beyond these two. Although this result suggests a limited capacity for duplication to support short-term functional innovation, major changes in secondary structure, like the one observed here, should be given careful consideration as they are likely to frustrate attempts to infer deep evolutionary histories of functional RNAs.
Collapse
Affiliation(s)
- Andrew Plebanek
- Exobiology Branch, Space Science and Astrobiology Division, NASA Ames Research Center, Bldg N239 Mail Stop 239-4, Moffett Field, CA, 94035, USA.,Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, 94143, USA
| | - Caleb Larnerd
- NASA Internship Program, NASA Ames Research Center, Moffett Field, CA, 94035, USA
| | - Milena Popović
- Exobiology Branch, Space Science and Astrobiology Division, NASA Ames Research Center, Bldg N239 Mail Stop 239-4, Moffett Field, CA, 94035, USA.,Center for the Emergence of Life, NASA Ames Research Center, Moffett Field, CA, 94035, USA.,Blue Marble Space Institute of Science, Seattle, WA, 98145, USA
| | - Chenyu Wei
- Exobiology Branch, Space Science and Astrobiology Division, NASA Ames Research Center, Bldg N239 Mail Stop 239-4, Moffett Field, CA, 94035, USA.,Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, 94143, USA.,Center for the Emergence of Life, NASA Ames Research Center, Moffett Field, CA, 94035, USA
| | - Andrew Pohorille
- Exobiology Branch, Space Science and Astrobiology Division, NASA Ames Research Center, Bldg N239 Mail Stop 239-4, Moffett Field, CA, 94035, USA.,Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, 94143, USA.,Center for the Emergence of Life, NASA Ames Research Center, Moffett Field, CA, 94035, USA
| | - Mark A Ditzler
- Exobiology Branch, Space Science and Astrobiology Division, NASA Ames Research Center, Bldg N239 Mail Stop 239-4, Moffett Field, CA, 94035, USA. .,Center for the Emergence of Life, NASA Ames Research Center, Moffett Field, CA, 94035, USA.
| |
Collapse
|
17
|
Bendixsen DP, Collet J, Østman B, Hayden EJ. Genotype network intersections promote evolutionary innovation. PLoS Biol 2019; 17:e3000300. [PMID: 31136568 PMCID: PMC6555535 DOI: 10.1371/journal.pbio.3000300] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2018] [Revised: 06/07/2019] [Accepted: 05/15/2019] [Indexed: 12/27/2022] Open
Abstract
Evolutionary innovations are qualitatively novel traits that emerge through evolution and increase biodiversity. The genetic mechanisms of innovation remain poorly understood. A systems view of innovation requires the analysis of genotype networks—the vast networks of genetic variants that produce the same phenotype. Innovations can occur at the intersection of two different genotype networks. However, the experimental characterization of genotype networks has been hindered by the vast number of genetic variants that need to be functionally analyzed. Here, we use high-throughput sequencing to study the fitness landscape at the intersection of the genotype networks of two catalytic RNA molecules (ribozymes). We determined the ability of numerous neighboring RNA sequences to catalyze two different chemical reactions, and we use these data as a proxy for a genotype to fitness map where two functions come in close proximity. We find extensive functional overlap, and numerous genotypes can catalyze both functions. We demonstrate through evolutionary simulations that these numerous points of intersection facilitate the discovery of a new function. However, the rate of adaptation of the new function depends upon the local ruggedness around the starting location in the genotype network. As a consequence, one direction of adaptation is more rapid than the other. We find that periods of neutral evolution increase rates of adaptation to the new function by allowing populations to spread out in their genotype network. Our study reveals the properties of a fitness landscape where genotype networks intersect and the consequences for evolutionary innovations. Our results suggest that historic innovations in natural systems may have been facilitated by overlapping genotype networks. The determination of the empirical fitness landscape at the genotypic intersection between two different catalytic RNA (ribozyme) functions reveals details about how novel traits can emerge through evolutionary innovation.
Collapse
Affiliation(s)
- Devin P. Bendixsen
- Biomolecular Sciences Graduate Programs, Boise State University, Boise, Idaho, United States of America
- * E-mail: (DPB); (EJH)
| | - James Collet
- Department of Biological Science, Boise State University, Boise, Idaho, United States of America
| | - Bjørn Østman
- Keck Graduate Institute, Claremont, California, United States of America
| | - Eric J. Hayden
- Biomolecular Sciences Graduate Programs, Boise State University, Boise, Idaho, United States of America
- Department of Biological Science, Boise State University, Boise, Idaho, United States of America
- * E-mail: (DPB); (EJH)
| |
Collapse
|
18
|
Pressman AD, Liu Z, Janzen E, Blanco C, Müller UF, Joyce GF, Pascal R, Chen IA. Mapping a Systematic Ribozyme Fitness Landscape Reveals a Frustrated Evolutionary Network for Self-Aminoacylating RNA. J Am Chem Soc 2019; 141:6213-6223. [PMID: 30912655 PMCID: PMC6548421 DOI: 10.1021/jacs.8b13298] [Citation(s) in RCA: 60] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
![]()
Molecular
evolution can be conceptualized as a walk over a “fitness
landscape”, or the function of fitness (e.g., catalytic activity)
over the space of all possible sequences. Understanding evolution
requires knowing the structure of the fitness landscape and identifying
the viable evolutionary pathways through the landscape. However, the
fitness landscape for any catalytic biomolecule is largely unknown.
The evolution of catalytic RNA is of special interest because RNA
is believed to have been foundational to early life. In particular,
an essential activity leading to the genetic code would be the reaction
of ribozymes with activated amino acids, such as 5(4H)-oxazolones, to form aminoacyl-RNA. Here we combine in vitro selection
with a massively parallel kinetic assay to map a fitness landscape
for self-aminoacylating RNA, with nearly complete coverage of sequence
space in a central 21-nucleotide region. The method (SCAPE: sequencing
to measure catalytic activity paired with in vitro evolution) shows
that the landscape contains three major ribozyme families (landscape
peaks). An analysis of evolutionary pathways shows that, while local
optimization within a ribozyme family would be possible, optimization
of activity over the entire landscape would be frustrated by large
valleys of low activity. The sequence motifs associated with each
peak represent different solutions to the problem of catalysis, so
the inability to traverse the landscape globally corresponds to an
inability to restructure the ribozyme without losing activity. The
frustrated nature of the evolutionary network suggests that chance
emergence of a ribozyme motif would be more important than optimization
by natural selection.
Collapse
Affiliation(s)
- Abe D Pressman
- Department of Chemistry and Biochemistry 9510 , University of California , Santa Barbara , California 93106 , United States.,Program in Chemical Engineering , University of California , Santa Barbara , California 93106 , United States
| | - Ziwei Liu
- MRC Laboratory of Molecular Biology , Cambridge Biomedical Campus , Cambridge CB2 0QH , U.K.,IBMM, CNRS, University of Montpellier, ENSCM , 34090 Montpellier , France
| | - Evan Janzen
- Department of Chemistry and Biochemistry 9510 , University of California , Santa Barbara , California 93106 , United States.,Program in Biomolecular Sciences and Engineering , University of California , Santa Barbara , California 93106 , United States
| | - Celia Blanco
- Department of Chemistry and Biochemistry 9510 , University of California , Santa Barbara , California 93106 , United States
| | - Ulrich F Müller
- Department of Chemistry and Biochemistry , University of California , San Diego , California 92093 , United States
| | - Gerald F Joyce
- Salk Institute for Biological Studies , La Jolla , California 92037 , United States
| | - Robert Pascal
- IBMM, CNRS, University of Montpellier, ENSCM , 34090 Montpellier , France
| | - Irene A Chen
- Department of Chemistry and Biochemistry 9510 , University of California , Santa Barbara , California 93106 , United States.,Program in Biomolecular Sciences and Engineering , University of California , Santa Barbara , California 93106 , United States
| |
Collapse
|
19
|
Gonzalez CE, Ostermeier M. Pervasive Pairwise Intragenic Epistasis among Sequential Mutations in TEM-1 β-Lactamase. J Mol Biol 2019; 431:1981-1992. [PMID: 30922874 DOI: 10.1016/j.jmb.2019.03.020] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2018] [Revised: 02/25/2019] [Accepted: 03/13/2019] [Indexed: 12/25/2022]
Abstract
Interactions between mutations play a central role in shaping the fitness landscape, but a clear picture of intragenic epistasis has yet to emerge. To further reveal the prevalence and patterns of intragenic epistasis, we present a survey of epistatic interactions between sequential mutations in TEM-1 β-lactamase. We measured the fitness effect of ~12,000 pairs of consecutive amino acid substitutions and used our previous study of the fitness effects of single amino acid substitutions to calculate epistasis for over 8000 mutation pairs. Since sequential mutations are prone to physically interact, we postulated that our study would be surveying specific epistasis instead of nonspecific epistasis. We found widespread negative epistasis, especially in beta-strands, and a high frequency of negative sign epistasis among individually beneficial mutations. Negative epistasis (52%) occurred 7.6 times as frequently as positive epistasis (6.8%). Buried residues experienced more negative epistasis that surface-exposed residues. However, TEM-1 exhibited a couple of hotspots for positive epistasis, most notably L221/ R222 at which many combinations of mutations positively interacted. This study is the first to systematically examine pairwise epistasis throughout an entire protein performing its native function in its native host.
Collapse
Affiliation(s)
- Courtney E Gonzalez
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, USA
| | - Marc Ostermeier
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, USA.
| |
Collapse
|
20
|
Yokobayashi Y. Applications of high-throughput sequencing to analyze and engineer ribozymes. Methods 2019; 161:41-45. [PMID: 30738128 DOI: 10.1016/j.ymeth.2019.02.001] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Revised: 01/04/2019] [Accepted: 02/03/2019] [Indexed: 01/22/2023] Open
Abstract
A large number of catalytic RNAs, or ribozymes, have been identified in the genomes of various organisms and viruses. Ribozymes are involved in biological processes such as regulation of gene expression and viral replication, but biological roles of many ribozymes still remain unknown. Ribozymes have also inspired researchers to engineer synthetic ribozymes that function as sensors or gene switches. To gain deeper understanding of the sequence-function relationship of ribozymes and to efficiently engineer synthetic ribozymes, a large number of ribozyme variants need to be examined which was limited to hundreds of sequences by Sanger sequencing. The advent of high-throughput sequencing technologies, however, has allowed us to sequence millions of ribozyme sequences at low cost. This review focuses on the recent applications of high-throughput sequencing to both characterize and engineer ribozymes, to highlight how the large-scale sequence data can advance ribozyme research and engineering.
Collapse
Affiliation(s)
- Yohei Yokobayashi
- Nucleic Acid Chemistry and Engineering Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa 904 0495, Japan.
| |
Collapse
|
21
|
Fragata I, Blanckaert A, Dias Louro MA, Liberles DA, Bank C. Evolution in the light of fitness landscape theory. Trends Ecol Evol 2019; 34:69-82. [DOI: 10.1016/j.tree.2018.10.009] [Citation(s) in RCA: 84] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Revised: 10/16/2018] [Accepted: 10/17/2018] [Indexed: 01/28/2023]
|
22
|
Riesselman AJ, Ingraham JB, Marks DS. Deep generative models of genetic variation capture the effects of mutations. Nat Methods 2018; 15:816-822. [PMID: 30250057 DOI: 10.1038/s41592-018-0138-4] [Citation(s) in RCA: 320] [Impact Index Per Article: 45.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2018] [Accepted: 07/29/2018] [Indexed: 01/05/2023]
Abstract
The functions of proteins and RNAs are defined by the collective interactions of many residues, and yet most statistical models of biological sequences consider sites nearly independently. Recent approaches have demonstrated benefits of including interactions to capture pairwise covariation, but leave higher-order dependencies out of reach. Here we show how it is possible to capture higher-order, context-dependent constraints in biological sequences via latent variable models with nonlinear dependencies. We found that DeepSequence ( https://github.com/debbiemarkslab/DeepSequence ), a probabilistic model for sequence families, predicted the effects of mutations across a variety of deep mutational scanning experiments substantially better than existing methods based on the same evolutionary data. The model, learned in an unsupervised manner solely on the basis of sequence information, is grounded with biologically motivated priors, reveals the latent organization of sequence families, and can be used to explore new parts of sequence space.
Collapse
Affiliation(s)
- Adam J Riesselman
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA.,Program in Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - John B Ingraham
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA.,Program in Systems Biology, Harvard University, Cambridge, MA, USA
| | - Debora S Marks
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|