1
|
Pinto GP, Corbella M, Demkiv AO, Kamerlin SCL. Exploiting enzyme evolution for computational protein design. Trends Biochem Sci 2021; 47:375-389. [PMID: 34544655 DOI: 10.1016/j.tibs.2021.08.008] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 08/18/2021] [Accepted: 08/24/2021] [Indexed: 11/15/2022]
Abstract
Recent years have seen an explosion of interest in understanding the physicochemical parameters that shape enzyme evolution, as well as substantial advances in computational enzyme design. This review discusses three areas where evolutionary information can be used as part of the design process: (i) using ancestral sequence reconstruction (ASR) to generate new starting points for enzyme design efforts; (ii) learning from how nature uses conformational dynamics in enzyme evolution to mimic this process in silico; and (iii) modular design of enzymes from smaller fragments, again mimicking the process by which nature appears to create new protein folds. Using showcase examples, we highlight the importance of incorporating evolutionary information to continue to push forward the boundaries of enzyme design studies.
Collapse
Affiliation(s)
- Gaspar P Pinto
- Department of Chemistry - BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden
| | - Marina Corbella
- Department of Chemistry - BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden
| | - Andrey O Demkiv
- Department of Chemistry - BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden
| | | |
Collapse
|
2
|
Abstract
The evolution of natural modular proteins and domain swapping by protein engineers have shown the disruptive potential of non-homologous recombination to create proteins with novel functions or traits. Bacteriophage endolysins, cellulosomes and polyketide synthases are 3 examples of natural modular proteins with each module having a dedicated function. These modular architectures have been created by extensive duplication, shuffling of domains and insertion/deletion of new domains. Protein engineers mimic these natural processes in vitro to create chimeras with altered properties or novel functions by swapping modules between different parental genes. Most domain swapping efforts are realized with traditional restriction and ligation techniques, which become particularly restrictive when either a large number of variants, or variants of proteins with multiple domains have to be constructed. Recent advances in homology-independent shuffling techniques increasingly address this need, but to realize the full potential of the synthetic biology of modular proteins a complete homology-independent method for both rational and random shuffling of modules from an unlimited number of parental genes is still needed.
Collapse
Affiliation(s)
- Veerle E T Maervoet
- a Laboratory of Applied Biotechnology, Department of Applied Biosciences , Ghent University , Ghent , Belgium
| | - Yves Briers
- a Laboratory of Applied Biotechnology, Department of Applied Biosciences , Ghent University , Ghent , Belgium
| |
Collapse
|
3
|
Villiers BRM, Stein V, Hollfelder F. USER friendly DNA recombination (USERec): a simple and flexible near homology-independent method for gene library construction. Protein Eng Des Sel 2010; 23:1-8. [PMID: 19897542 DOI: 10.1093/protein/gzp063] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
USER friendly DNA recombination (USERec) is introduced as a near homology-independent method that allows the simultaneous recombination of an unprecedented number of 10 DNA fragments (approximately 40-400 bp) within a day. The large number of fragments and their ease of preparation enables the creation of libraries of much larger genetic diversity (potentially approximately 10(10)-10(11) sequences) than current alternative methods based on DNA truncation (ITCHY, SCRATCHY and SHIPREC) or type IIb restriction enzymes (SISDC). At the same time, the frequency of frameshifts in the recombined library is low (90% of the recombined sequences are in frame). Compared to overlap extension PCR, USERec also requires much reduced crossover sequence constraints (only a 5'-AN(4-8)T-3' motif) and fewer experimental steps. Based on its simplicity and flexibility, and the accessibility of large and high quality recombined DNA libraries, USERec is established as a convenient alternative for the combinatorial assembly of gene fragments (e.g. exon or domain shuffling) and for a number of applications in gene library construction, such as loop grafting and multi-site-directed or random mutagenesis.
Collapse
Affiliation(s)
- B R M Villiers
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, UK
| | | | | |
Collapse
|
4
|
Li H, Fast W, Benkovic SJ. Structural and functional modularity of proteins in the de novo purine biosynthetic pathway. Protein Sci 2009; 18:881-92. [PMID: 19384989 PMCID: PMC2771292 DOI: 10.1002/pro.95] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2008] [Revised: 02/03/2009] [Accepted: 02/05/2009] [Indexed: 11/08/2022]
Abstract
It is generally accepted that naturally existing functional domains can serve as building blocks for complex protein structures, and that novel functions can arise from assembly of different combinations of these functional domains. To inform our understanding of protein evolution and explore the modular nature of protein structure, two model enzymes were chosen for study, purT-encoded glycinamide ribonucleotide formyltransferase (PurT) and purK-encoded N(5)-carboxylaminoimidazole ribonucleotide synthetase (PurK). Both enzymes are found in the de novo purine biosynthetic pathway of Escherichia coli. In spite of their low sequence identity, PurT and PurK share significant similarity in terms of tertiary structure, active site organization, and reaction mechanism. Their characteristic three domain structures categorize both PurT and PurK as members of the ATP-grasp protein superfamily. In this study, we investigate the exchangeability of individual protein domains between these two enzymes and the in vivo and in vitro functional properties of the resulting hybrids. Six domain-swapped hybrids were unable to catalyze full wild-type reactions, but each hybrid protein could catalyze partial reactions. Notably, an additional loop replacement in one of the domain-swapped hybrid proteins was able to restore near wild-type PurK activity. Therefore, in this model system, domain-swapped proteins retained the ability to catalyze partial reactions, but further modifications were required to efficiently couple the reaction intermediates and achieve catalysis of the full reaction. Implications for understanding the role of domain swapping in protein evolution are discussed.
Collapse
Affiliation(s)
| | | | - Stephen J Benkovic
- Department of Chemistry, The Pennsylvania State University, University ParkPennsylvania 16802
| |
Collapse
|
5
|
Muñoz E, Park JM, Deem MW. Quasispecies theory for horizontal gene transfer and recombination. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 78:061921. [PMID: 19256882 PMCID: PMC4478466 DOI: 10.1103/physreve.78.061921] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/03/2008] [Revised: 10/03/2008] [Indexed: 05/27/2023]
Abstract
We introduce a generalization of the parallel, or Crow-Kimura, and Eigen models of molecular evolution to represent the exchange of genetic information between individuals in a population. We study the effect of different schemes of genetic recombination on the steady-state mean fitness and distribution of individuals in the population, through an analytic field theoretic mapping. We investigate both horizontal gene transfer from a population and recombination between pairs of individuals. Somewhat surprisingly, these nonlinear generalizations of quasispecies theory to modern biology are analytically solvable. For two-parent recombination, we find two selected phases, one of which is spectrally rigid. We present exact analytical formulas for the equilibrium mean fitness of the population, in terms of a maximum principle, which are generally applicable to any permutation invariant replication rate function. For smooth fitness landscapes, we show that when positive epistatic interactions are present, recombination or horizontal gene transfer introduces a mild load against selection. Conversely, if the fitness landscape exhibits negative epistasis, horizontal gene transfer or recombination introduces an advantage by enhancing selection towards the fittest genotypes. These results prove that the mutational deterministic hypothesis holds for quasispecies models. For the discontinuous single sharp peak fitness landscape, we show that horizontal gene transfer has no effect on the fitness, while recombination decreases the fitness, for both the parallel and the Eigen models. We present numerical and analytical results as well as phase diagrams for the different cases.
Collapse
Affiliation(s)
- Enrique Muñoz
- Department of Physics and Astronomy, Rice University, Houston, Texas 77005-1892, USA
| | | | | |
Collapse
|
6
|
Park JM, Deem MW. Phase diagrams of quasispecies theory with recombination and horizontal gene transfer. PHYSICAL REVIEW LETTERS 2007; 98:058101. [PMID: 17358903 DOI: 10.1103/physrevlett.98.058101] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2006] [Indexed: 05/14/2023]
Abstract
We consider how transfer of genetic information between individuals influences the phase diagram and mean fitness of both the Eigen and the parallel, or Crow-Kimura, models of evolution. In the absence of genetic transfer, these physical models of evolution consider the replication and point mutation of the genomes of independent individuals in a large population. A phase transition occurs, such that below a critical mutation rate an identifiable quasispecies forms. We show how transfer of genetic information changes the phase diagram and mean fitness and introduces metastability in quasispecies theory, via an analytic field theoretic mapping.
Collapse
Affiliation(s)
- J-M Park
- Department of Physics & Astronomy and Department of Bioengineering, Rice University, Houston, Texas 77005-1892, USA
| | | |
Collapse
|
7
|
Tan T, Bogarad LD, Deem MW. Modulation of base-specific mutation and recombination rates enables functional adaptation within the context of the genetic code. J Mol Evol 2005; 59:385-99. [PMID: 15553092 DOI: 10.1007/s00239-004-2633-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
The persistence of life requires populations to adapt at a rate commensurate with the dynamics of their environment. Successful populations that inhabit highly variable environments have evolved mechanisms to increase the likelihood of successful adaptation. We introduce a 64 x 64 matrix to quantify base-specific mutation potential, analyzing four different replicative systems, error-prone PCR, mouse antibodies, a nematode, and Drosophila. Mutational tendencies are correlated with the structural evolution of proteins. In systems under strong selective pressure, mutational biases are shown to favor the adaptive search of space, either by base mutation or by recombination. Such adaptability is discussed within the context of the genetic code at the levels of replication and codon usage.
Collapse
Affiliation(s)
- Taison Tan
- Department of Bioengineering and Department of Physics & Astronomy, Rice University, Houston, TX 77005-1892, USA
| | | | | |
Collapse
|
8
|
Geddie ML, Rowe LA, Alexander OB, Matsumura I. High throughput microplate screens for directed protein evolution. Methods Enzymol 2004; 388:134-45. [PMID: 15289067 DOI: 10.1016/s0076-6879(04)88012-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/30/2023]
Affiliation(s)
- Melissa L Geddie
- Department of Biochemistry, Emory University School of Medicine, Rollins Research Center, Atlanta, Georgia, USA
| | | | | | | |
Collapse
|
9
|
Jestin JL, Kaminski PA. Directed enzyme evolution and selections for catalysis based on product formation. J Biotechnol 2004; 113:85-103. [PMID: 15380650 DOI: 10.1016/j.jbiotec.2004.03.032] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2003] [Accepted: 03/03/2004] [Indexed: 10/26/2022]
Abstract
Enzyme engineering by molecular modelling and site-directed mutagenesis can be remarkably efficient. Directed enzyme evolution appears as a more general strategy for the isolation of catalysts as it can be applied to most chemical reactions in aqueous solutions. Selections, as opposed to screening, allow the simultaneous analysis of protein properties for sets of up to about 10(14) different proteins. These approaches for the parallel processing of molecular information 'Is the protein a catalyst?' are reviewed here in the case of selections based on the formation of a specific reaction product. Several questions are addressed about in vivo and in vitro selections for catalysis reported in the literature. Can the selection system be extended to other types of enzymes? Does the selection control regio- and stereo-selectivity? Does the selection allow the isolation of enzymes with an efficient turnover? How should substrates be substituted or mimicked for the design of efficient selections while minimising the number of chemical synthesis steps? Engineering sections provide also some clues to design selections or to circumvent selection biases. A special emphasis is put on the comparison of in vivo and in vitro selections for catalysis.
Collapse
Affiliation(s)
- Jean-Luc Jestin
- Département de Biologie Structurale et Chimie, Unité de Chimie Organique URA 2128 CNRS, Institut Pasteur, 28 rue du Dr. Roux, 75724 Paris 15, France.
| | | |
Collapse
|
10
|
Abstract
Concomitant with the evolution of biological diversity must have been the evolution of mechanisms that facilitate evolution, because of the essentially infinite complexity of protein sequence space. We describe how evolvability can be an object of Darwinian selection, emphasizing the collective nature of the process. We quantify our theory with computer simulations of protein evolution. These simulations demonstrate that rapid or dramatic environmental change leads to selection for greater evolvability. The selective pressure for large-scale genetic moves such as DNA exchange becomes increasingly strong as the environmental conditions become more uncertain. Our results demonstrate that evolvability is a selectable trait and allow for the explanation of a large body of experimental results.
Collapse
Affiliation(s)
- David J Earl
- Department of Bioengineering, Rice University, Houston, TX 77005-1892, USA
| | | |
Collapse
|
11
|
Horswill AR, Naumann TA, Benkovic SJ. Using Incremental Truncation to Create Libraries of Hybrid Enzymes. Methods Enzymol 2004; 388:50-60. [PMID: 15289061 DOI: 10.1016/s0076-6879(04)88006-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Affiliation(s)
- Alexander R Horswill
- Department of Chemistry, Pennsylvania State University, University Park 16802, USA
| | | | | |
Collapse
|
12
|
Kawarasaki Y, Griswold KE, Stevenson JD, Selzer T, Benkovic SJ, Iverson BL, Georgiou G. Enhanced crossover SCRATCHY: construction and high-throughput screening of a combinatorial library containing multiple non-homologous crossovers. Nucleic Acids Res 2003; 31:e126. [PMID: 14576326 PMCID: PMC275483 DOI: 10.1093/nar/gng126] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
SCRATCHY is a methodology for the construction of libraries of chimeras between genes that display low sequence homology. We have developed a strategy for library creation termed enhanced crossover SCRATCHY, that significantly increases the number of clones containing multiple crossovers. Complementary chimeric gene libraries generated by incremental truncation (ITCHY) of two distinct parental sequences are created, and are then divided into arbitrarily defined sections. The respective sections are amplified by skewed sets of primers (i.e. a combination of gene A specific forward primer and gene B specific reverse primer, etc.) allowing DNA fragments containing non-homologous crossover points to be amplified. The amplified chimeric sections are then subjected to a DNA shuffling process generating an enhanced crossover SCRATCHY library. We have constructed such a library using the rat theta 2 glutathione transferase (rGSTT2) and the human theta 1 glutathione transferase (hGSTT1) genes (63% DNA sequence identity). DNA sequencing analysis of unselected library members revealed a greater diversity than that obtained by canonical family shuffling or with conventional SCRATCHY. Expression and high-throughput flow cytometric screening of the chimeric GST library identified several chimeric progeny that retained rat-like parental substrate specificity.
Collapse
Affiliation(s)
- Yasuaki Kawarasaki
- Department of Chemical Engineering, University of Texas, Austin, TX 78712, USA
| | | | | | | | | | | | | |
Collapse
|
13
|
Giuliani A, Benigni R, Zbilut JP, Webber CL, Sirabella P, Colosimo A. Nonlinear signal analysis methods in the elucidation of protein sequence-structure relationships. Chem Rev 2002; 102:1471-92. [PMID: 11996541 DOI: 10.1021/cr0101499] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Alessandro Giuliani
- Istituto Superiore di Sanità, TCE Laboratory, Viale Regina Elena 299, 00161 Rome, Italy.
| | | | | | | | | | | |
Collapse
|
14
|
Kurtzman AL, Govindarajan S, Vahle K, Jones JT, Heinrichs V, Patten PA. Advances in directed protein evolution by recursive genetic recombination: applications to therapeutic proteins. Curr Opin Biotechnol 2001; 12:361-70. [PMID: 11551464 DOI: 10.1016/s0958-1669(00)00228-7] [Citation(s) in RCA: 85] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Recent developments in directed evolution technologies combined with innovations in robotics and screening methods have revolutionized protein engineering. These methods are being applied broadly to many fields of biotechnology, including chemical engineering, agriculture and human therapeutics. More specifically, DNA shuffling and other methods of genetic recombination and mutation have resulted in the improvement of proteins of therapeutic interest. Optimizing genetic diversity and fitness through iterative directed evolution will accelerate improvements in engineered protein therapeutics.
Collapse
Affiliation(s)
- A L Kurtzman
- Maxygen Inc., 515 Galveston Drive, Redwood City, CA 94063, USA.
| | | | | | | | | | | |
Collapse
|
15
|
Abstract
The selection of mutant enzymes with novel properties from libraries is emerging as a very powerful strategy for enzyme engineering. The past year has witnessed significant progress on several fronts: new and improved methods have been developed for the creation of libraries and advances have been made in screening and selection techniques. The results achieved demonstrate the enormous potential of the methods and leave questions open for further studies.
Collapse
Affiliation(s)
- P Soumillion
- Laboratoire de Biochimie Physique et des Biopolymères, Institut des Sciences de la Vie, Université Catholique de Louvain, 1 Place Louis Pasteur, B 1348 Louvain la-Neuve, Belgium
| | | |
Collapse
|