1
|
Ding N, Jiang Y, Lee S, Cheng Z, Ran X, Ding Y, Ge R, Zhang Y, Yang ZJ. Enzyme miniaturization: Revolutionizing future biocatalysts. Biotechnol Adv 2025; 82:108598. [PMID: 40354901 DOI: 10.1016/j.biotechadv.2025.108598] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2025] [Revised: 04/05/2025] [Accepted: 05/09/2025] [Indexed: 05/14/2025]
Abstract
Enzyme miniaturization offers a transformative approach to overcome limitations posed by the large size of conventional enzymes in industrial, therapeutic, and diagnostic applications. However, the evolutionary optimization of enzymes for activity has not inherently favored compact structures, creating challenges for modern applications requiring smaller catalysts. In this review, we surveyed the advantages of miniature enzymes, including enhanced expressivity, folding efficiency, thermostability, and resistance to proteolysis. We described the applications of miniature enzymes as industrial catalysts, therapeutic agents, and diagnostic elements. We highlighted strategies such as genome mining, rational design, random deletion, and de novo design for achieving enzyme miniaturization, integrating both computational and experimental techniques. By investigating these approaches, we aim to provide a framework for advancing enzyme engineering, emphasizing the unique potential of miniature enzymes to revolutionize biocatalysis, gene therapy, and biosensing technologies.
Collapse
Affiliation(s)
- Ning Ding
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, United States; Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, United States.
| | - Yaoyukun Jiang
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, United States; Department of Chemistry and California Institute for Quantitative Biosciences, University of California-Berkeley, Berkeley, CA 94720, United States
| | - Sangsin Lee
- Department of Genetics, Stanford University, Stanford, CA 94305, United States
| | - Zihao Cheng
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, United States
| | - Xinchun Ran
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, United States
| | - Yujing Ding
- State Key Laboratory of Chemical Resource Engineering, Beijing University of Chemical Technology, Beijing 100029, China; Beijing Advanced Innovation Center for Soft Matter Science and Engineering, Beijing University of Chemical Technology, Beijing 100029, China
| | - Robbie Ge
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, United States
| | - Yifei Zhang
- State Key Laboratory of Chemical Resource Engineering, Beijing University of Chemical Technology, Beijing 100029, China; Beijing Advanced Innovation Center for Soft Matter Science and Engineering, Beijing University of Chemical Technology, Beijing 100029, China.
| | - Zhongyue J Yang
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, United States; Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, United States.
| |
Collapse
|
2
|
Meng L, Wang X. TAWFN: a deep learning framework for protein function prediction. Bioinformatics 2024; 40:btae571. [PMID: 39312678 PMCID: PMC11639667 DOI: 10.1093/bioinformatics/btae571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Revised: 08/27/2024] [Accepted: 09/19/2024] [Indexed: 09/25/2024] Open
Abstract
MOTIVATION Proteins play pivotal roles in biological systems, and precise prediction of their functions is indispensable for practical applications. Despite the surge in protein sequence data facilitated by high-throughput techniques, unraveling the exact functionalities of proteins still demands considerable time and resources. Currently, numerous methods rely on protein sequences for prediction, while methods targeting protein structures are scarce, often employing convolutional neural networks (CNN) or graph convolutional networks (GCNs) individually. RESULTS To address these challenges, our approach starts from protein structures and proposes a method that combines CNN and GCN into a unified framework called the two-model adaptive weight fusion network (TAWFN) for protein function prediction. First, amino acid contact maps and sequences are extracted from the protein structure. Then, the sequence is used to generate one-hot encoded features and deep semantic features. These features, along with the constructed graph, are fed into the adaptive graph convolutional networks (AGCN) module and the multi-layer convolutional neural network (MCNN) module as needed, resulting in preliminary classification outcomes. Finally, the preliminary classification results are inputted into the adaptive weight computation network, where adaptive weights are calculated to fuse the initial predictions from both networks, yielding the final prediction result. To evaluate the effectiveness of our method, experiments were conducted on the PDBset and AFset datasets. For molecular function, biological process, and cellular component tasks, TAWFN achieved area under the precision-recall curve (AUPR) values of 0.718, 0.385, and 0.488 respectively, with corresponding Fmax scores of 0.762, 0.628, and 0.693, and Smin scores of 0.326, 0.483, and 0.454. The experimental results demonstrate that TAWFN exhibits promising performance, outperforming existing methods. AVAILABILITY AND IMPLEMENTATION The TAWFN source code can be found at: https://github.com/ss0830/TAWFN.
Collapse
Affiliation(s)
- Lu Meng
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning, 110000, China
| | - Xiaoran Wang
- College of Information Science and Engineering, Northeastern University, Shenyang, Liaoning, 110000, China
| |
Collapse
|
3
|
Yang Y, Braga MV, Dean MD. Insertion-Deletion Events Are Depleted in Protein Regions with Predicted Secondary Structure. Genome Biol Evol 2024; 16:evae093. [PMID: 38735759 PMCID: PMC11102076 DOI: 10.1093/gbe/evae093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Revised: 04/16/2024] [Accepted: 04/21/2024] [Indexed: 05/14/2024] Open
Abstract
A fundamental goal in evolutionary biology and population genetics is to understand how selection shapes the fate of new mutations. Here, we test the null hypothesis that insertion-deletion (indel) events in protein-coding regions occur randomly with respect to secondary structures. We identified indels across 11,444 sequence alignments in mouse, rat, human, chimp, and dog genomes and then quantified their overlap with four different types of secondary structure-alpha helices, beta strands, protein bends, and protein turns-predicted by deep-learning methods of AlphaFold2. Indels overlapped secondary structures 54% as much as expected and were especially underrepresented over beta strands, which tend to form internal, stable regions of proteins. In contrast, indels were enriched by 155% over regions without any predicted secondary structures. These skews were stronger in the rodent lineages compared to the primate lineages, consistent with population genetic theory predicting that natural selection will be more efficient in species with larger effective population sizes. Nonsynonymous substitutions were also less common in regions of protein secondary structure, although not as strongly reduced as in indels. In a complementary analysis of thousands of human genomes, we showed that indels overlapping secondary structure segregated at significantly lower frequency than indels outside of secondary structure. Taken together, our study shows that indels are selected against if they overlap secondary structure, presumably because they disrupt the tertiary structure and function of a protein.
Collapse
Affiliation(s)
- Yi Yang
- Molecular and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Matthew V Braga
- Molecular and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Matthew D Dean
- Molecular and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| |
Collapse
|
4
|
Sellés Vidal L, Isalan M, Heap JT, Ledesma-Amaro R. A primer to directed evolution: current methodologies and future directions. RSC Chem Biol 2023; 4:271-291. [PMID: 37034405 PMCID: PMC10074555 DOI: 10.1039/d2cb00231k] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 01/18/2023] [Indexed: 01/30/2023] Open
Abstract
Directed evolution is one of the most powerful tools for protein engineering and functions by harnessing natural evolution, but on a shorter timescale. It enables the rapid selection of variants of biomolecules with properties that make them more suitable for specific applications. Since the first in vitro evolution experiments performed by Sol Spiegelman in 1967, a wide range of techniques have been developed to tackle the main two steps of directed evolution: genetic diversification (library generation), and isolation of the variants of interest. This review covers the main modern methodologies, discussing the advantages and drawbacks of each, and hence the considerations for designing directed evolution experiments. Furthermore, the most recent developments are discussed, showing how advances in the handling of ever larger library sizes are enabling new research questions to be tackled.
Collapse
Affiliation(s)
- Lara Sellés Vidal
- Imperial College Centre for Synthetic Biology, Imperial College London London SW7 2AZ UK
- Department of Bioengineering, Imperial College London London SW7 2AZ UK
| | - Mark Isalan
- Imperial College Centre for Synthetic Biology, Imperial College London London SW7 2AZ UK
- Department of Life Sciences, Imperial College London London SW7 2AZ UK
| | - John T Heap
- Imperial College Centre for Synthetic Biology, Imperial College London London SW7 2AZ UK
- Department of Life Sciences, Imperial College London London SW7 2AZ UK
- School of Life Sciences, The University of Nottingham, University Park Nottingham NG7 2RD UK
| | - Rodrigo Ledesma-Amaro
- Imperial College Centre for Synthetic Biology, Imperial College London London SW7 2AZ UK
- Department of Bioengineering, Imperial College London London SW7 2AZ UK
| |
Collapse
|
5
|
Macdonald CB, Nedrud D, Grimes PR, Trinidad D, Fraser JS, Coyote-Maestas W. DIMPLE: deep insertion, deletion, and missense mutation libraries for exploring protein variation in evolution, disease, and biology. Genome Biol 2023; 24:36. [PMID: 36829241 PMCID: PMC9951526 DOI: 10.1186/s13059-023-02880-6] [Citation(s) in RCA: 35] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Accepted: 02/16/2023] [Indexed: 02/26/2023] Open
Abstract
Insertions and deletions (indels) enable evolution and cause disease. Due to technical challenges, indels are left out of most mutational scans, limiting our understanding of them in disease, biology, and evolution. We develop a low cost and bias method, DIMPLE, for systematically generating deletions, insertions, and missense mutations in genes, which we test on a range of targets, including Kir2.1. We use DIMPLE to study how indels impact potassium channel structure, disease, and evolution. We find deletions are most disruptive overall, beta sheets are most sensitive to indels, and flexible loops are sensitive to deletions yet tolerate insertions.
Collapse
Affiliation(s)
- Christian B Macdonald
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, USA
| | | | | | - Donovan Trinidad
- Department of Medicine, Division of Infectious Disease, University of California, San Francisco, USA
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, USA.,Quantitative Biosciences Institute, University of California, San Francisco, USA
| | - Willow Coyote-Maestas
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, USA. .,Quantitative Biosciences Institute, University of California, San Francisco, USA.
| |
Collapse
|
6
|
Miton CM, Tokuriki N. Insertions and Deletions (Indels): A Missing Piece of the Protein Engineering Jigsaw. Biochemistry 2023; 62:148-157. [PMID: 35830609 DOI: 10.1021/acs.biochem.2c00188] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Over the years, protein engineers have studied nature and borrowed its tricks to accelerate protein evolution in the test tube. While there have been considerable advances, our ability to generate new proteins in the laboratory is seemingly limited. One explanation for these shortcomings may be that insertions and deletions (indels), which frequently arise in nature, are largely overlooked during protein engineering campaigns. The profound effect of indels on protein structures, by way of drastic backbone alterations, could be perceived as "saltation" events that bring about significant phenotypic changes in a single mutational step. Should we leverage these effects to accelerate protein engineering and gain access to unexplored regions of adaptive landscapes? In this Perspective, we describe the role played by indels in the functional diversification of proteins in nature and discuss their untapped potential for protein engineering, despite their often-destabilizing nature. We hope to spark a renewed interest in indels, emphasizing that their wider study and use may prove insightful and shape the future of protein engineering by unlocking unique functional changes that substitutions alone could never achieve.
Collapse
Affiliation(s)
- Charlotte M Miton
- Michael Smith Laboratories, University of British Columbia, Vancouver, V6T 1Z4 BC, Canada
| | - Nobuhiko Tokuriki
- Michael Smith Laboratories, University of British Columbia, Vancouver, V6T 1Z4 BC, Canada
| |
Collapse
|
7
|
Iqbal Z, Sadaf S. A patent-based consideration of latest platforms in the art of directed evolution: a decade long untold story. Biotechnol Genet Eng Rev 2022; 38:133-246. [PMID: 35200115 DOI: 10.1080/02648725.2021.2017638] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Directed (or in vitro) evolution of proteins and metabolic pathways requires tools for creating genetic diversity and identifying protein variants with new or improved functional properties. Besides simplicity, reliability, speed, versatility, universal applicability and economy of the technique, the new science of synthetic biology requires improved means for construction of smart and high-quality mutant libraries to better navigate the sequence diversity. In vitro CRISPR/Cas9-mediated mutagenic (ICM) system and machine-learning (ML)-assisted approaches to directed evolution are now in the field to achieve the goal. This review describes the gene diversification strategies, screening and selection methods, in silico (computer-aided), Cas9-mediated and ML-based approaches to mutagenesis, developed especially in the last decade, and their patent position. The objective behind is to emphasize researchers the need for noting which mutagenesis, screening or selection method is patented and then selecting a suitable restriction-free approach to sequence diversity. Techniques and evolved products subject to patent rights need commercial license if their use is for purposes other than private or experimental research.
Collapse
Affiliation(s)
- Zarina Iqbal
- IP Litigation Department, PakPat World Intellectual Property Protection Services, Lahore, Pakistan
| | - Saima Sadaf
- School of Biochemistry and Biotechnology, University of the Punjab, Lahore, Pakistan
| |
Collapse
|
8
|
Savino S, Desmet T, Franceus J. Insertions and deletions in protein evolution and engineering. Biotechnol Adv 2022; 60:108010. [PMID: 35738511 DOI: 10.1016/j.biotechadv.2022.108010] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Revised: 06/15/2022] [Accepted: 06/16/2022] [Indexed: 11/17/2022]
Abstract
Protein evolution or engineering studies are traditionally focused on amino acid substitutions and the way these contribute to fitness. Meanwhile, the insertion and deletion of amino acids is often overlooked, despite being one of the most common sources of genetic variation. Recent methodological advances and successful engineering stories have demonstrated that the time is ripe for greater emphasis on these mutations and their understudied effects. This review highlights the evolutionary importance and biotechnological relevance of insertions and deletions (indels). We provide a comprehensive overview of approaches that can be employed to include indels in random, (semi)-rational or computational protein engineering pipelines. Furthermore, we discuss the tolerance to indels at the structural level, address how domain indels can link the function of unrelated proteins, and feature studies that illustrate the surprising and intriguing potential of frameshift mutations.
Collapse
Affiliation(s)
- Simone Savino
- Centre for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, Coupure Links 653, 9000 Ghent, Belgium
| | - Tom Desmet
- Centre for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, Coupure Links 653, 9000 Ghent, Belgium
| | - Jorick Franceus
- Centre for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, Coupure Links 653, 9000 Ghent, Belgium..
| |
Collapse
|
9
|
Alejaldre L, Pelletier JN, Quaglia D. Methods for enzyme library creation: Which one will you choose?: A guide for novices and experts to introduce genetic diversity. Bioessays 2021; 43:e2100052. [PMID: 34263468 DOI: 10.1002/bies.202100052] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Revised: 05/28/2021] [Accepted: 06/02/2021] [Indexed: 12/15/2022]
Abstract
Enzyme engineering allows to explore sequence diversity in search for new properties. The scientific literature is populated with methods to create enzyme libraries for engineering purposes, however, choosing a suitable method for the creation of mutant libraries can be daunting, in particular for the novices. Here, we address both novices and experts: how can one enter the arena of enzyme library design and what guidelines can advanced users apply to select strategies best suited to their purpose? Section I is dedicated to the novices and presents an overview of established and standard methods for library creation, as well as available commercial solutions. The expert will discover an up-to-date tool to freshen up their repertoire (Section I) and learn of the newest methods that are likely to become a mainstay (Section II). We focus primarily on in vitro methods, presenting the advantages of each method. Our ultimate aim is to offer a selection of methods/strategies that we believe to be most useful to the enzyme engineer, whether a first-timer or a seasoned user.
Collapse
Affiliation(s)
- Lorea Alejaldre
- Département de biochimie and Center for Green Chemistry and Catalysis (CGCC), Université de Montréal, Montréal, Quebec, Canada.,PROTEO, The Québec Network for Research on Protein Function, Engineering and Applications, Québec, Quebec, Canada
| | - Joelle N Pelletier
- Département de biochimie and Center for Green Chemistry and Catalysis (CGCC), Université de Montréal, Montréal, Quebec, Canada.,PROTEO, The Québec Network for Research on Protein Function, Engineering and Applications, Québec, Quebec, Canada.,Département de chimie, Université de Montréal, Montréal, Quebec, Canada
| | - Daniela Quaglia
- Département de chimie, Université de Montréal, Montréal, Quebec, Canada.,School of Chemistry, University of Nottingham, Nottingham, UK
| |
Collapse
|
10
|
Tizei PAG, Harris E, Withanage S, Renders M, Pinheiro VB. A novel framework for engineering protein loops exploring length and compositional variation. Sci Rep 2021; 11:9134. [PMID: 33911147 PMCID: PMC8080606 DOI: 10.1038/s41598-021-88708-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Accepted: 04/12/2021] [Indexed: 02/02/2023] Open
Abstract
Insertions and deletions (indels) are known to affect function, biophysical properties and substrate specificity of enzymes, and they play a central role in evolution. Despite such clear significance, this class of mutation remains an underexploited tool in protein engineering with few available platforms capable of systematically generating and analysing libraries of varying sequence composition and length. We present a novel DNA assembly platform (InDel assembly), based on cycles of endonuclease restriction digestion and ligation of standardised dsDNA building blocks, that can generate libraries exploring both composition and sequence length variation. In addition, we developed a framework to analyse the output of selection from InDel-generated libraries, combining next generation sequencing and alignment-free strategies for sequence analysis. We demonstrate the approach by engineering the well-characterized TEM-1 β-lactamase Ω-loop, involved in substrate specificity, identifying multiple novel extended spectrum β-lactamases with loops of modified length and composition-areas of the sequence space not previously explored. Together, the InDel assembly and analysis platforms provide an efficient route to engineer protein loops or linkers where sequence length and composition are both essential functional parameters.
Collapse
Affiliation(s)
- Pedro A. G. Tizei
- grid.83440.3b0000000121901201Department of Structural and Molecular Biology, University College London, Gower Street, London, WC1E 6BT UK
| | - Emma Harris
- grid.4464.20000 0001 2161 2573Department of Biological Sciences, University of London, Malet Street, Birkbeck, WC1E 7HX UK
| | - Shamal Withanage
- grid.415751.3KU Leuven, Rega Institute for Medical Research, Medicinal Chemistry, Herestraat 49, Box 1041, 3000 Leuven, Belgium
| | - Marleen Renders
- grid.415751.3KU Leuven, Rega Institute for Medical Research, Medicinal Chemistry, Herestraat 49, Box 1041, 3000 Leuven, Belgium
| | - Vitor B. Pinheiro
- grid.83440.3b0000000121901201Department of Structural and Molecular Biology, University College London, Gower Street, London, WC1E 6BT UK ,grid.4464.20000 0001 2161 2573Department of Biological Sciences, University of London, Malet Street, Birkbeck, WC1E 7HX UK ,grid.415751.3KU Leuven, Rega Institute for Medical Research, Medicinal Chemistry, Herestraat 49, Box 1041, 3000 Leuven, Belgium
| |
Collapse
|
11
|
Emond S, Petek M, Kay EJ, Heames B, Devenish SRA, Tokuriki N, Hollfelder F. Accessing unexplored regions of sequence space in directed enzyme evolution via insertion/deletion mutagenesis. Nat Commun 2020; 11:3469. [PMID: 32651386 PMCID: PMC7351745 DOI: 10.1038/s41467-020-17061-3] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Accepted: 06/01/2020] [Indexed: 11/22/2022] Open
Abstract
Insertions and deletions (InDels) are frequently observed in natural protein evolution, yet their potential remains untapped in laboratory evolution. Here we introduce a transposon-based mutagenesis approach (TRIAD) to generate libraries of random variants with short in-frame InDels, and screen TRIAD libraries to evolve a promiscuous arylesterase activity in a phosphotriesterase. The evolution exhibits features that differ from previous point mutagenesis campaigns: while the average activity of TRIAD variants is more compromised, a larger proportion has successfully adapted for the activity. Different functional profiles emerge: (i) both strong and weak trade-off between activities are observed; (ii) trade-off is more severe (20- to 35-fold increased kcat/KM in arylesterase with 60-400-fold decreases in phosphotriesterase activity) and (iii) improvements are present in kcat rather than just in KM, suggesting adaptive solutions. These distinct features make TRIAD an alternative to widely used point mutagenesis, accessing functional innovations and traversing unexplored fitness landscape regions.
Collapse
Affiliation(s)
- Stephane Emond
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK.
- Evonetix Ltd, Coldhams Business Park, Norman Way, Cambridge, CB1 3LH, UK.
| | - Maya Petek
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK
| | - Emily J Kay
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK
- Cancer Research UK Beatson Institute, Glasgow, G61 1BD, UK
| | - Brennen Heames
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK
- Institute for Evolution and Biodiversity, Westfälische Wilhelms-Universität, Hüfferstrasse 1, 48149, Münster, Germany
| | - Sean R A Devenish
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK
- Fluidic Analytics, The Paddocks Business Centre, Cherry Hinton Road, Cambridge, CB1 8DH, UK
| | - Nobuhiko Tokuriki
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Florian Hollfelder
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK.
| |
Collapse
|
12
|
Bratulic S, Badran AH. Modern methods for laboratory diversification of biomolecules. Curr Opin Chem Biol 2017; 41:50-60. [PMID: 29096324 PMCID: PMC6062405 DOI: 10.1016/j.cbpa.2017.10.010] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2017] [Revised: 10/03/2017] [Accepted: 10/08/2017] [Indexed: 11/29/2022]
Abstract
Genetic variation fuels Darwinian evolution, yet spontaneous mutation rates are maintained at low levels to ensure cellular viability. Low mutation rates preclude the exhaustive exploration of sequence space for protein evolution and genome engineering applications, prompting scientists to develop methods for efficient and targeted diversification of nucleic acid sequences. Directed evolution of biomolecules relies upon the generation of unbiased genetic diversity to discover variants with desirable properties, whereas genome-engineering applications require selective modifications on a genomic scale with minimal off-targets. Here, we review the current toolkit of mutagenesis strategies employed in directed evolution and genome engineering. These state-of-the-art methods enable facile modifications and improvements of single genes, multicomponent pathways, and whole genomes for basic and applied research, while simultaneously paving the way for genome editing therapeutic interventions.
Collapse
Affiliation(s)
- Sinisa Bratulic
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Ahmed H Badran
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
| |
Collapse
|
13
|
Morelli A, Cabezas Y, Mills LJ, Seelig B. Extensive libraries of gene truncation variants generated by in vitro transposition. Nucleic Acids Res 2017; 45:e78. [PMID: 28130425 PMCID: PMC5449547 DOI: 10.1093/nar/gkx030] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2016] [Accepted: 01/20/2017] [Indexed: 11/14/2022] Open
Abstract
The detailed analysis of the impact of deletions on proteins or nucleic acids can reveal important functional regions and lead to variants with improved macromolecular properties. We present a method to generate large libraries of mutants with deletions of varying length that are randomly distributed throughout a given gene. This technique facilitates the identification of crucial sequence regions in nucleic acids or proteins. The approach utilizes in vitro transposition to generate 5΄ and 3΄ fragment sub-libraries of a given gene, which are then randomly recombined to yield a final library comprising both terminal and internal deletions. The method is easy to implement and can generate libraries in three to four days. We used this approach to produce a library of >9000 random deletion mutants of an artificial RNA ligase enzyme representing 32% of all possible deletions. The quality of the library was assessed by next-generation sequencing and detailed bioinformatics analysis. Finally, we subjected this library to in vitro selection and obtained fully functional variants with deletions of up to 18 amino acids of the parental enzyme that had been 95 amino acids in length.
Collapse
Affiliation(s)
- Aleardo Morelli
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA.,BioTechnology Institute, University of Minnesota, St. Paul, MN 55108, USA
| | - Yari Cabezas
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA.,BioTechnology Institute, University of Minnesota, St. Paul, MN 55108, USA
| | - Lauren J Mills
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN 55455, USA
| | - Burckhard Seelig
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA.,BioTechnology Institute, University of Minnesota, St. Paul, MN 55108, USA
| |
Collapse
|