1
|
Middendorf L, Ravi Iyengar B, Eicholt LA. Sequence, Structure, and Functional Space of Drosophila De Novo Proteins. Genome Biol Evol 2024; 16:evae176. [PMID: 39212966 PMCID: PMC11363682 DOI: 10.1093/gbe/evae176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/29/2024] [Indexed: 09/04/2024] Open
Abstract
During de novo emergence, new protein coding genes emerge from previously nongenic sequences. The de novo proteins they encode are dissimilar in composition and predicted biochemical properties to conserved proteins. However, functional de novo proteins indeed exist. Both identification of functional de novo proteins and their structural characterization are experimentally laborious. To identify functional and structured de novo proteins in silico, we applied recently developed machine learning based tools and found that most de novo proteins are indeed different from conserved proteins both in their structure and sequence. However, some de novo proteins are predicted to adopt known protein folds, participate in cellular reactions, and to form biomolecular condensates. Apart from broadening our understanding of de novo protein evolution, our study also provides a large set of testable hypotheses for focused experimental studies on structure and function of de novo proteins in Drosophila.
Collapse
Affiliation(s)
- Lasse Middendorf
- Institute for Evolution and Biodiversity, University of Muenster, Huefferstrasse 1, 48149 Muenster, Germany
| | - Bharat Ravi Iyengar
- Institute for Evolution and Biodiversity, University of Muenster, Huefferstrasse 1, 48149 Muenster, Germany
| | - Lars A Eicholt
- Institute for Evolution and Biodiversity, University of Muenster, Huefferstrasse 1, 48149 Muenster, Germany
| |
Collapse
|
2
|
Agmon I. Three Biopolymers and Origin of Life Scenarios. Life (Basel) 2024; 14:277. [PMID: 38398786 PMCID: PMC10890401 DOI: 10.3390/life14020277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Revised: 02/08/2024] [Accepted: 02/12/2024] [Indexed: 02/25/2024] Open
Abstract
To track down the possible roots of life, various models for the initial living system composed of different combinations of the three extant biopolymers, RNA, DNA, and proteins, are presented. The suitability of each molecular set is assessed according to its ability to emerge autonomously, sustain, and evolve continuously towards life as we know it. The analysis incorporates current biological knowledge gained from high-resolution structural data and large sequence datasets, together with experimental results concerned with RNA replication and with the activity demonstrated by standalone constructs of the ribosomal Peptidyl Transferase Center region. The scrutiny excludes the DNA-protein combination and assigns negligible likelihood to the existence of an RNA-DNA world, as well as to an RNA world that contained a replicase made of RNA. It points to the precedence of an RNA-protein system, whose model of emergence suggests specific processes whereby a coded proto-ribosome ribozyme, specifically aminoacylated proto-tRNAs and a proto-polymerase enzyme, could have autonomously emerged, cross-catalyzing the formation of each other. This molecular set constitutes a feasible starting point for a continuous evolutionary path, proceeding via natural processes from the inanimate matter towards life as we know it.
Collapse
Affiliation(s)
- Ilana Agmon
- Institute for Advanced Studies in Theoretical Chemistry, Schulich Faculty of Chemistry, Technion—Israel Institute of Technology, Haifa 3200003, Israel;
- Fritz Haber Research Center for Molecular Dynamics, Hebrew University, Jerusalem 9190401, Israel
| |
Collapse
|
3
|
Heames B, Buchel F, Aubel M, Tretyachenko V, Loginov D, Novák P, Lange A, Bornberg-Bauer E, Hlouchová K. Experimental characterization of de novo proteins and their unevolved random-sequence counterparts. Nat Ecol Evol 2023; 7:570-580. [PMID: 37024625 PMCID: PMC10089919 DOI: 10.1038/s41559-023-02010-2] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 02/10/2023] [Indexed: 04/08/2023]
Abstract
De novo gene emergence provides a route for new proteins to be formed from previously non-coding DNA. Proteins born in this way are considered random sequences and typically assumed to lack defined structure. While it remains unclear how likely a de novo protein is to assume a soluble and stable tertiary structure, intersecting evidence from random sequence and de novo-designed proteins suggests that native-like biophysical properties are abundant in sequence space. Taking putative de novo proteins identified in human and fly, we experimentally characterize a library of these sequences to assess their solubility and structure propensity. We compare this library to a set of synthetic random proteins with no evolutionary history. Bioinformatic prediction suggests that de novo proteins may have remarkably similar distributions of biophysical properties to unevolved random sequences of a given length and amino acid composition. However, upon expression in vitro, de novo proteins exhibit moderately higher solubility which is further induced by the DnaK chaperone system. We suggest that while synthetic random sequences are a useful proxy for de novo proteins in terms of structure propensity, de novo proteins may be better integrated in the cellular system than random expectation, given their higher solubility.
Collapse
Affiliation(s)
- Brennen Heames
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Filip Buchel
- Department of Cell Biology, Charles University, BIOCEV, Prague, Czech Republic
- Department of Biochemistry, Charles University, Prague, Czech Republic
| | - Margaux Aubel
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | | | - Dmitry Loginov
- Institute of Microbiology, Czech Academy of Sciences, Prague, Czech Republic
| | - Petr Novák
- Institute of Microbiology, Czech Academy of Sciences, Prague, Czech Republic
| | - Andreas Lange
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany.
- Department of Protein Evolution, MPI for Developmental Biology, Tübingen, Germany.
| | - Klára Hlouchová
- Department of Cell Biology, Charles University, BIOCEV, Prague, Czech Republic.
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Prague, Czech Republic.
| |
Collapse
|
4
|
Dramé-Maigné A, Zadorin AS, Golovkova I, Rondelez Y. Quantifying the Performance of Micro-Compartmentalized Directed Evolution Protocols. Life (Basel) 2020; 10:life10020017. [PMID: 32069848 PMCID: PMC7175308 DOI: 10.3390/life10020017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Revised: 01/21/2020] [Accepted: 01/25/2020] [Indexed: 01/14/2023] Open
Abstract
High-throughput, in vitro approaches for the evolution of enzymes rely on a random micro-encapsulation to link phenotypes to genotypes, followed by screening or selection steps. In order to optimise these approaches, or compare one to another, one needs a measure of their performance at extracting the best variants of a library. Here, we introduce a new metric, the Selection Quality Index (SQI), which can be computed from a simple mock experiment, performed with a known initial fraction of active variants. In contrast to previous approaches, our index integrates the effect of random co-encapsulation, and comes with a straightforward experimental interpretation. We further show how this new metric can be used to extract general protocol efficiency trends or reveal hidden selection mechanisms such as a counterintuitive form of beneficial poisoning in the compartmentalized self-replication protocol.
Collapse
Affiliation(s)
- Adèle Dramé-Maigné
- Laboratoire Gulliver, ESPCI Paris, PSL Research University, 10 rue Vauquelin, 75005 Paris, France
| | - Anton S. Zadorin
- Laboratoire Gulliver, ESPCI Paris, PSL Research University, 10 rue Vauquelin, 75005 Paris, France
- Laboratoire CBI, ESPCI Paris, PSL Research University, 10 rue Vauquelin, 75005 Paris, France
| | - Iaroslava Golovkova
- Laboratoire Gulliver, ESPCI Paris, PSL Research University, 10 rue Vauquelin, 75005 Paris, France
| | - Yannick Rondelez
- Laboratoire Gulliver, ESPCI Paris, PSL Research University, 10 rue Vauquelin, 75005 Paris, France
- Correspondence: ; Tel.: +33-140795105; Fax: +33-140795105
| |
Collapse
|
5
|
The complete mitochondrial genome of Vanessa indica and phylogenetic analyses of the family Nymphalidae. Genes Genomics 2018; 40:1011-1022. [PMID: 29949077 DOI: 10.1007/s13258-018-0709-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2018] [Accepted: 05/29/2018] [Indexed: 02/02/2023]
Abstract
Vanessa indica is a small butterfly lacking historical molecular and biological research. Vanessa indica belongs to the family Nymphalidae (Lepidoptera: Papilionoidea), which is the largest group of butterflies and are nearly ubiquitous. However, after more than a century of taxonomic and molecular studies, there is no consensus for family classification, and the phylogenetic relationships within Nymphalidae are controversial. The first objective was to sequence and characterize the complete mitochondrial genome of V. indica. The most important objective was to completely reconstruct the phylogenetic relationships for family members within Nymphalidae. The mitochondrial genomic DNA (mtDNA) of V. indica was extracted and amplified by polymerase chain reaction. The complete mitochondrial sequence was annotated and characterized by analyzing sequences with SeqMan program. The phylogenetic analyses were conducted on thirteen protein coding genes (PCGs) in 95 mtDNA of Nymphalidae downloaded from GenBank for reference using the maximum likelihood method and Bayesian inference to ensure the validity of the results. The complete mitogenome was a circular molecule with 15,191 bp consisting of 13 protein coding genes, two ribosomal RNA genes (16S rRNA and 12S rRNA), 22 transfer RNA (tRNA) genes, and an A + T-rich region (D-loop). The nucleotide composition of the genome was highly biased for A + T content, which accounts for 80.0% of the nucleotides. All the tRNAs have putative secondary structures that are characteristic of mitochondrial tRNAs, except tRNASer(AGN). All the PCGs started with ATN codons, except cytochrome c oxidase subunit 1 (COX1), which was found to start with an unusual CGA codon. Four genes were observed to have unusual codons: COX1 terminated with atypical TT and the other three genes terminated with a single T. The A + T rich region of 327 bp consisted of repetitive sequences, including a ATAGA motif, a 19-bp poly-T stretch, and two microsatellite-like regions (TA)8. The phylogenetic analyses consistently placed Biblidinae as a sister cluster to Heliconiinae and Calinaginae as a sister clade to Satyrinae. Moreover, the phylogenetic tree identified Libytheinae as a monophyletic group within Nymphalidae. The complete mitogenome of V. indica was 15,191 bp with mitochondrial characterizations common for lepidopteran species, which enriched the mitochondria data of Nymphalid species. And the phylogenetic analysis revealed different classifications and relationships than those previously described. Our results are significant because they would be useful in further understanding of the evolutionary biology of Nymphalidae.
Collapse
|
6
|
Ichihashi N, Yomo T. Constructive Approaches for Understanding the Origin of Self-Replication and Evolution. Life (Basel) 2016; 6:life6030026. [PMID: 27420098 PMCID: PMC5041002 DOI: 10.3390/life6030026] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2016] [Revised: 07/07/2016] [Accepted: 07/07/2016] [Indexed: 11/16/2022] Open
Abstract
The mystery of the origin of life can be divided into two parts. The first part is the origin of biomolecules: under what physicochemical conditions did biomolecules such as amino acids, nucleotides, and their polymers arise? The second part of the mystery is the origin of life-specific functions such as the replication of genetic information, the reproduction of cellular structures, metabolism, and evolution. These functions require the coordination of many different kinds of biological molecules. A direct strategy to approach the second part of the mystery is the constructive approach, in which life-specific functions are recreated in a test tube from specific biological molecules. Using this approach, we are able to employ design principles to reproduce life-specific functions, and the knowledge gained through the reproduction process provides clues as to their origins. In this mini-review, we introduce recent insights gained using this approach, and propose important future directions for advancing our understanding of the origins of life.
Collapse
Affiliation(s)
- Norikazu Ichihashi
- Department of Bioinformatics Engineering, Graduate School of Information Science and Technology, Osaka University, 1-5 Yamadaoka, Suita, Osaka 565-0871, Japan.
| | - Tetsuya Yomo
- Department of Bioinformatics Engineering, Graduate School of Information Science and Technology, Osaka University, 1-5 Yamadaoka, Suita, Osaka 565-0871, Japan.
- Graduate School of Frontier Biosciences, Osaka University, 1-5 Yamadaoka, Suita, Osaka 565-0871, Japan.
| |
Collapse
|
7
|
Vanhercke T, Ampe C, Tirry L, Denolf P. Rescue and In Situ Selection and Evaluation (RISE): A Method for High-Throughput Panning of Phage Display Libraries. ACTA ACUST UNITED AC 2016; 10:108-17. [PMID: 15799954 DOI: 10.1177/1087057104271956] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Phage display has proven to be an invaluable instrument in the search for proteins and peptides with optimized or novel functions. The amplification and selection of phage libraries typically involve several operations and handling large bacterial cultures during each round. Purification of the assembled phage particles after rescue adds to the labor and time demand. The authors therefore devised a method, termed rescue and in situ selection and evaluation (RISE), which combines all steps from rescue to binding in a single microwell. To test this concept, wells were precoated with different antibodies, which allowed newly formed phage particles to be captured directly in situ during overnight rescue. Following 6 washing steps, the retained phages could be easily detected in an enzyme-linked immunosorbent assay (ELISA), thus eliminating the need for purification or concentration of the viral particles. As a consequence, RISE enables a rapid characterization of phage-displayed proteins. In addition, this method allowed for the selective enrichment of phages displaying a hemagglutinin (HA) epitope tag, spiked in a 104-fold excess of wild-type background. Because the combination of phage rescue, selection, or evaluation in a single microwell is amenable to automation, RISE may boost the high-throughput screening of smaller sized phage display libraries.
Collapse
Affiliation(s)
- Thomas Vanhercke
- Department of Crop Protection, Faculty of Agricultural and Applied Biological Sciences, Ghent University, Ghent, Belgium, Bayer BioScience N.V., Ghent, Belgium.
| | | | | | | |
Collapse
|
8
|
Moroz YS, Dunston TT, Makhlynets OV, Moroz OV, Wu Y, Yoon JH, Olsen AB, McLaughlin JM, Mack KL, Gosavi PM, van Nuland NAJ, Korendovych IV. New Tricks for Old Proteins: Single Mutations in a Nonenzymatic Protein Give Rise to Various Enzymatic Activities. J Am Chem Soc 2015; 137:14905-11. [PMID: 26555770 DOI: 10.1021/jacs.5b07812] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Design of a new catalytic function in proteins, apart from its inherent practical value, is important for fundamental understanding of enzymatic activity. Using a computationally inexpensive, minimalistic approach that focuses on introducing a single highly reactive residue into proteins to achieve catalysis we converted a 74-residue-long C-terminal domain of calmodulin into an efficient esterase. The catalytic efficiency of the resulting stereoselective, allosterically regulated catalyst, nicknamed AlleyCatE, is higher than that of any previously reported de novo designed esterases. The simplicity of our design protocol should complement and expand the capabilities of current state-of-art approaches to protein design. These results show that even a small nonenzymatic protein can efficiently attain catalytic activities in various reactions (Kemp elimination, ester hydrolysis, retroaldol reaction) as a result of a single mutation. In other words, proteins can be just one mutation away from becoming entry points for subsequent evolution.
Collapse
Affiliation(s)
- Yurii S Moroz
- Department of Chemistry, Syracuse University , 111 College Place, Syracuse, New York 13244, United States
| | - Tiffany T Dunston
- Department of Chemistry, Syracuse University , 111 College Place, Syracuse, New York 13244, United States
| | - Olga V Makhlynets
- Department of Chemistry, Syracuse University , 111 College Place, Syracuse, New York 13244, United States
| | - Olesia V Moroz
- Department of Chemistry, Syracuse University , 111 College Place, Syracuse, New York 13244, United States
| | - Yibing Wu
- Department of Pharmaceutical Chemistry, University of California-San Francisco , 555 Mission Bay Boulevard South, San Francisco, California 94158, United States
| | - Jennifer H Yoon
- Department of Chemistry, Syracuse University , 111 College Place, Syracuse, New York 13244, United States
| | - Alissa B Olsen
- Department of Chemistry, Syracuse University , 111 College Place, Syracuse, New York 13244, United States
| | - Jaclyn M McLaughlin
- Department of Chemistry, Syracuse University , 111 College Place, Syracuse, New York 13244, United States
| | - Korrie L Mack
- Department of Chemistry, Syracuse University , 111 College Place, Syracuse, New York 13244, United States
| | - Pallavi M Gosavi
- Department of Chemistry, Syracuse University , 111 College Place, Syracuse, New York 13244, United States
| | - Nico A J van Nuland
- Jean Jeener NMR Centre, Structural Biology Brussels, Vrije Universiteit Brussel , Pleinlaan 2, 1050 Brussels, Belgium
| | - Ivan V Korendovych
- Department of Chemistry, Syracuse University , 111 College Place, Syracuse, New York 13244, United States
| |
Collapse
|
9
|
Currin A, Swainston N, Day PJ, Kell DB. Synthetic biology for the directed evolution of protein biocatalysts: navigating sequence space intelligently. Chem Soc Rev 2015; 44:1172-239. [PMID: 25503938 PMCID: PMC4349129 DOI: 10.1039/c4cs00351a] [Citation(s) in RCA: 258] [Impact Index Per Article: 25.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2014] [Indexed: 12/21/2022]
Abstract
The amino acid sequence of a protein affects both its structure and its function. Thus, the ability to modify the sequence, and hence the structure and activity, of individual proteins in a systematic way, opens up many opportunities, both scientifically and (as we focus on here) for exploitation in biocatalysis. Modern methods of synthetic biology, whereby increasingly large sequences of DNA can be synthesised de novo, allow an unprecedented ability to engineer proteins with novel functions. However, the number of possible proteins is far too large to test individually, so we need means for navigating the 'search space' of possible protein sequences efficiently and reliably in order to find desirable activities and other properties. Enzymologists distinguish binding (Kd) and catalytic (kcat) steps. In a similar way, judicious strategies have blended design (for binding, specificity and active site modelling) with the more empirical methods of classical directed evolution (DE) for improving kcat (where natural evolution rarely seeks the highest values), especially with regard to residues distant from the active site and where the functional linkages underpinning enzyme dynamics are both unknown and hard to predict. Epistasis (where the 'best' amino acid at one site depends on that or those at others) is a notable feature of directed evolution. The aim of this review is to highlight some of the approaches that are being developed to allow us to use directed evolution to improve enzyme properties, often dramatically. We note that directed evolution differs in a number of ways from natural evolution, including in particular the available mechanisms and the likely selection pressures. Thus, we stress the opportunities afforded by techniques that enable one to map sequence to (structure and) activity in silico, as an effective means of modelling and exploring protein landscapes. Because known landscapes may be assessed and reasoned about as a whole, simultaneously, this offers opportunities for protein improvement not readily available to natural evolution on rapid timescales. Intelligent landscape navigation, informed by sequence-activity relationships and coupled to the emerging methods of synthetic biology, offers scope for the development of novel biocatalysts that are both highly active and robust.
Collapse
Affiliation(s)
- Andrew Currin
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- School of Chemistry , The University of Manchester , Manchester M13 9PL , UK
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
| | - Neil Swainston
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
- School of Computer Science , The University of Manchester , Manchester M13 9PL , UK
| | - Philip J. Day
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
- Faculty of Medical and Human Sciences , The University of Manchester , Manchester M13 9PT , UK
| | - Douglas B. Kell
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- School of Chemistry , The University of Manchester , Manchester M13 9PL , UK
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
| |
Collapse
|
10
|
Abstract
Although both the most popular form of synthetic biology (SB) and chemical synthetic biology (CSB) share the biotechnologically useful aim of making new forms of life, SB does so by using genetic manipulation of extant microorganism, while CSB utilises classic chemical procedures in order to obtain biological structures which are non-existent in nature. The main query concerning CSB is the philosophical question: why did nature do this, and not that? The idea then is to synthesise alternative structures in order to understand why nature operated in such a particular way. We briefly present here some various examples of CSB, including those cases of nucleic acids synthesised with pyranose instead of ribose, and proteins with a reduced alphabet of amino acids; also we report the developing research on the "never born proteins" (NBP) and "never born RNA" (NBRNA), up to the minimal cell project, where the issue is the preparation of semi-synthetic cells that can perform the basic functions of biological cells.
Collapse
Affiliation(s)
| | - Pier Luigi Luisi
- Department of Materials, Swiss Federal Institute of Technology Zurich (ETHZ), University of Roma Tre, Italy
| |
Collapse
|
11
|
Abstract
All cellular proteins are derived from preexisting ones by natural selection. Because of the random nature of this process, many potentially useful protein structures never arose or were discarded during evolution. Here, we used a single round of genetic selection in mouse cells to isolate chemically simple, biologically active transmembrane proteins that do not contain any amino acid sequences from preexisting proteins. We screened a retroviral library expressing hundreds of thousands of proteins consisting of hydrophobic amino acids in random order to isolate four 29-aa proteins that induced focus formation in mouse and human fibroblasts and tumors in mice. These proteins share no amino acid sequences with known cellular or viral proteins, and the simplest of them contains only seven different amino acids. They transformed cells by forming a stable complex with the platelet-derived growth factor β receptor transmembrane domain and causing ligand-independent receptor activation. We term this approach de novo selection and suggest that it can be used to generate structures and activities not observed in nature, create prototypes for novel research reagents and therapeutics, and provide insight into cell biology, transmembrane protein-protein interactions, and possibly virus evolution and the origin of life.
Collapse
|
12
|
Chiarabelli C, Stano P, Luisi PL. Chemical synthetic biology: a mini-review. Front Microbiol 2013; 4:285. [PMID: 24065964 PMCID: PMC3779815 DOI: 10.3389/fmicb.2013.00285] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2013] [Accepted: 09/04/2013] [Indexed: 01/08/2023] Open
Abstract
Chemical synthetic biology (CSB) is a branch of synthetic biology (SB) oriented toward the synthesis of chemical structures alternative to those present in nature. Whereas SB combines biology and engineering with the aim of synthesizing biological structures or life forms that do not exist in nature – often based on genome manipulation, CSB uses and assembles biological parts, synthetic or not, to create new and alternative structures. A short epistemological note will introduce the theoretical concepts related to these fields, whereas the text will be largely devoted to introduce and comment two main projects of CSB, carried out in our laboratory in the recent years. The “Never Born Biopolymers” project deals with the construction and the screening of RNA and peptide sequences that are not present in nature, whereas the “Minimal Cell” project focuses on the construction of semi-synthetic compartments (usually liposomes) containing the minimal and sufficient number of components to perform the basic function of a biological cell. These two topics are extremely important for both the general understanding of biology in terms of function, organization, and development, and for applied biotechnology.
Collapse
|
13
|
Artificial proteins from combinatorial approaches. Trends Biotechnol 2012; 30:512-20. [DOI: 10.1016/j.tibtech.2012.06.001] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2012] [Revised: 06/01/2012] [Accepted: 06/06/2012] [Indexed: 11/21/2022]
|
14
|
Abstract
Background Our current understanding of evolution is so tightly linked to template-dependent replication of DNA and RNA molecules that the old idea from Oparin of a self-reproducing 'garbage bag' ('coacervate') of chemicals that predated fully-fledged cell-like entities seems to be farfetched to most scientists today. However, this is exactly the kind of scheme we propose for how Darwinian evolution could have occurred prior to template replication. Results We cannot confirm previous claims that autocatalytic sets of organic polymer molecules could undergo evolution in any interesting sense by themselves. While we and others have previously imagined inhibition would result in selectability, we found that it produced multiple attractors in an autocatalytic set that cannot be selected for. Instead, we discovered that if general conditions are satisfied, the accumulation of adaptations in chemical reaction networks can occur. These conditions are the existence of rare reactions producing viable cores (analogous to a genotype), that sustains a molecular periphery (analogous to a phenotype). Conclusions We conclude that only when a chemical reaction network consists of many such viable cores, can it be evolvable. When many cores are enclosed in a compartment there is competition between cores within the same compartment, and when there are many compartments, there is between-compartment competition due to the phenotypic effects of cores and their periphery at the compartment level. Acquisition of cores by rare chemical events, and loss of cores at division, allows macromutation, limited heredity and selectability, thus explaining how a poor man's natural selection could have operated prior to genetic templates. This is the only demonstration to date of a mechanism by which pre-template accumulation of adaptation could occur. Reviewers This article was reviewed by William Martin and Eugene Koonin.
Collapse
|
15
|
Kauffman SA. Approaches to the origin of life on Earth. Life (Basel) 2011; 1:34-48. [PMID: 25382055 PMCID: PMC4187126 DOI: 10.3390/life1010034] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2011] [Revised: 11/02/2011] [Accepted: 11/09/2011] [Indexed: 11/21/2022] Open
Abstract
I discuss briefly the history of the origin of life field, focusing on the "Miller" era of prebiotic synthesis, through the "Orgel" era seeking enzyme free template replication of single stranded RNA or similar polynucleotides, to the RNA world era with one of its foci on a ribozyme with the capacity to act as a polymerase able to copy itself. I give the history of the independent invention in 1971 by T. Ganti, M. Eigen and myself of three alternative theories of the origin of molecular replication: the Chemotron, the Hypercycle, and Collectively Autocatalytic Sets, CAS, respectively. To date, only collectively autocatalytic DNA, RNA, and peptide sets have achieved molecular reproduction of polymers. Theoretical work and experimental work on CAS both support their plausibility as models of openly evolvable protocells, if housed in dividing compartments such as dividing liposomes. My own further hypothesis beyond that of CAS in themselves, of their formation as a phase transition in complex chemical reaction systems of substrates, reactions and products, where the molecules in the system are candidates to catalyze the very same reactions, now firmly established as theorems, awaits experimental proof using combinatorial chemistry to make libraries of stochastic DNA, RNA and/or polypeptides, or other classes of molecules to test the hypothesis that molecular polymer reproduction has emerged as a true phase transition in complex chemical reaction systems. I remark that my colleague Marc Ballivet of the University of Geneva and I, may have issued the first publications discussing what became combinatorial chemistry, in published issued patents in 1987, 1989 and later, in this field.
Collapse
|
16
|
Hordijk W, Kauffman SA, Steel M. Required levels of catalysis for emergence of autocatalytic sets in models of chemical reaction systems. Int J Mol Sci 2011; 12:3085-101. [PMID: 21686171 PMCID: PMC3116177 DOI: 10.3390/ijms12053085] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2011] [Revised: 04/18/2011] [Accepted: 05/05/2011] [Indexed: 11/16/2022] Open
Abstract
The formation of a self-sustaining autocatalytic chemical network is a necessary but not sufficient condition for the origin of life. The question of whether such a network could form "by chance" within a sufficiently complex suite of molecules and reactions is one that we have investigated for a simple chemical reaction model based on polymer ligation and cleavage. In this paper, we extend this work in several further directions. In particular, we investigate in more detail the levels of catalysis required for a self-sustaining autocatalytic network to form. We study the size of chemical networks within which we might expect to find such an autocatalytic subset, and we extend the theoretical and computational analyses to models in which catalysis requires template matching.
Collapse
Affiliation(s)
- Wim Hordijk
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland
- Author to whom correspondence should be addressed; E-Mail: ; Tel.: +41-21-692-4268
| | - Stuart A. Kauffman
- Tampere University of Technology, Finland; E-Mail:
- University of Vermont, 85 South Prospect Street Burlington, VT 05405, USA
- Santa Fe Institute, 1399 Hyde Park Road Santa Fe, NM 87501, USA
| | - Mike Steel
- Department of Mathematics and Statistics, University of Canterbury, Private Bag 4800, Christchurch, New Zealand; E-Mail:
| |
Collapse
|
17
|
Chiarabelli C, Stano P, Luisi PL. Chemical approaches to synthetic biology. Curr Opin Biotechnol 2009; 20:492-7. [DOI: 10.1016/j.copbio.2009.08.004] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2009] [Revised: 08/07/2009] [Accepted: 08/10/2009] [Indexed: 11/17/2022]
|
18
|
Patel SC, Bradley LH, Jinadasa SP, Hecht MH. Cofactor binding and enzymatic activity in an unevolved superfamily of de novo designed 4-helix bundle proteins. Protein Sci 2009; 18:1388-400. [PMID: 19544578 PMCID: PMC2775209 DOI: 10.1002/pro.147] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2009] [Revised: 04/12/2009] [Accepted: 04/13/2009] [Indexed: 11/09/2022]
Abstract
To probe the potential for enzymatic activity in unevolved amino acid sequence space, we created a combinatorial library of de novo 4-helix bundle proteins. This collection of novel proteins can be considered an "artificial superfamily" of helical bundles. The superfamily of 102-residue proteins was designed using binary patterning of polar and nonpolar residues, and expressed in Escherichia coli from a library of synthetic genes. Sequences from the library were screened for a range of biological functions including heme binding and peroxidase, esterase, and lipase activities. Proteins exhibiting these functions were purified and characterized biochemically. The majority of de novo proteins from this superfamily bound the heme cofactor, and a sizable fraction of the proteins showed activity significantly above background for at least one of the tested enzymatic activities. Moreover, several of the designed 4-helix bundles proteins showed activity in all of the assays, thereby demonstrating the functional promiscuity of unevolved proteins. These studies reveal that de novo proteins-which have neither been designed for function, nor subjected to evolutionary pressure (either in vivo or in vitro)-can provide rudimentary activities and serve as a "feedstock" for evolution.
Collapse
Affiliation(s)
- Shona C Patel
- Department of Chemical Engineering, Princeton UniversityPrinceton, New Jersey 08544
| | - Luke H Bradley
- Department of Chemistry, Princeton UniversityPrinceton, New Jersey 08544
| | - Sayuri P Jinadasa
- Department of Chemistry, Princeton UniversityPrinceton, New Jersey 08544
| | - Michael H Hecht
- Department of Chemistry, Princeton UniversityPrinceton, New Jersey 08544
| |
Collapse
|
19
|
Toyota H, Hosokawa M, Urabe I, Yomo T. Emergence of polyproline II-like structure at early stages of experimental evolution from random polypeptides. Mol Biol Evol 2008; 25:1113-9. [PMID: 18353796 DOI: 10.1093/molbev/msn063] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
To examine whether a primordial functional protein at the early stages of evolution has structural features, we carried out experimental evolution consisting of 25 cycles (generations) of mutation and selection toward DNA-binding function using a random-sequence polypeptide of 139 amino acid residues with no secondary structure as the initial sequence. In each generation, 16 clones were sampled arbitrarily for sequence analysis, and a phylogenetic tree was constructed. Polypeptide evolution proceeded from the initial point on branch I in 2 main directions of branches II and III. The initial and 2 evolved polypeptides (one at the 24th generation on branch III and the other at the 25th generation on branch II) were purified to examine their functional and structural properties. Although binding of the initial polypeptide to the target DNA was not detected by surface plasmon resonance measurements, the 2 evolved polypeptides bound to the DNA with dissociation constants of 1.6 and 1.0 microM, respectively, indicating an increase in affinity during the experimental evolution. Circular dichroism spectra of the evolved polypeptides, but not of the initial polypeptide, showed features characteristic of the polyproline II (PPII)-like structure, a left-handed helical structure commonly found in natural proteins, suggesting that the structure emerged through the experimental evolution. The same structural feature was found in another experimental evolution toward catalytic activity. These results demonstrate that the PPII-like structure is one of the common features that could have appeared in the early evolutionary stages of primordial functional protein.
Collapse
Affiliation(s)
- Hitoshi Toyota
- Department of Biotechnology, Graduate School of Engineering, Osaka University, Suita, Osaka, Japan
| | | | | | | |
Collapse
|
20
|
Discovery and characterization of 91 novel transcripts expressed in cattle placenta. BMC Genomics 2007; 8:113. [PMID: 17488528 PMCID: PMC1884150 DOI: 10.1186/1471-2164-8-113] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2006] [Accepted: 05/09/2007] [Indexed: 01/22/2023] Open
Abstract
Background Among the eutherian mammals, placental architecture varies to a greater extent than any other tissue. The diversity of placental types, even within a single mammalian order suggests that genes expressed in placenta are under strong Darwinian selection. Thus, the ruminant placenta may be a rich source of genes to explore adaptive evolutionary responses in mammals. The aim of our study was to identify novel transcripts expressed in ruminant placenta, and to characterize them with respect to their expression patterns, organization of coding sequences in the genome, and potential functions. Results A combination of bioinformatics, comparative genomics and transcript profiling was used to identify and characterize 91 novel transcripts (NTs) represented in a cattle placenta cDNA library. These NTs have no significant similarity to any non-ferungulate DNA or RNA sequence. Proteins longer than 100 aa were predicted for 29 NTs, and 21 are candidate non-coding RNAs. Eighty-six NTs were found to be expressed in one or more of 18 different tissues, with 39 (42%) showing tissue-preference, including six that were expressed exclusively in placentome. The authenticity of the NTs was confirmed by their alignment to cattle genome sequence, 42 of which showed evidence of mRNA splicing. Analysis of the genomic context where NT genes reside revealed 61 to be in intergenic regions, whereas 30 are within introns of known genes. The genes encoding the NTs were found to be significantly associated with subtelomeric regions. Conclusion The 91 lineage-specific transcripts are a useful resource for studying adaptive evolutionary responses of the ruminant placenta. The presence of so many genes encoding NTs in cattle but not primates or rodents suggests that gene loss and gain are important mechanisms of genome evolution in mammals. Furthermore, the clustering of NT genes within subtelomeric regions suggests that such regions are highly dynamic and may foster the birth of novel genes. The sequencing of additional vertebrate genomes with defined phylogenetic relationships will permit the search for lineage-specific genes to take on a more evolutionary context that is required to understand their origins and functions.
Collapse
|
21
|
Nakashima T, Toyota H, Urabe I, Yomo T. Effective selection system for experimental evolution of random polypeptides towards DNA-binding protein. J Biosci Bioeng 2007; 103:155-60. [PMID: 17368398 DOI: 10.1263/jbb.103.155] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2006] [Accepted: 11/21/2006] [Indexed: 11/17/2022]
Abstract
An experimental evolution with selection based on binding affinity to DNA was carried out on a library of phage-displayed random polypeptides of about 140 amino acid residues. First, we constructed a system to artificially evolve phage-displayed random polypeptides toward binding to a target DNA containing a restriction enzyme site, in which random polypeptides capable of binding the DNA were recovered as complexes with the target DNA by digestion with the restriction enzyme. The experimental evolution cycle, including the above selection system and random mutagenesis for generating the next mutant library, was repeated until the fourth generation. The ability to bind to the DNA was enhanced per generation. In the fourth generation, convergence of the selected clones to a dominant sequence was observed. These results indicate that the newly constructed selection system is effective for exploring the evolvability of random polypeptides towards DNA-binding proteins.
Collapse
Affiliation(s)
- Toshihiro Nakashima
- Department of Biotechnology, Graduate School of Engineering, Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan
| | | | | | | |
Collapse
|
22
|
Aita T, Hayashi Y, Toyota H, Husimi Y, Urabe I, Yomo T. Extracting characteristic properties of fitness landscape from in vitro molecular evolution: a case study on infectivity of fd phage to E.coli. J Theor Biol 2007; 246:538-50. [PMID: 17316695 DOI: 10.1016/j.jtbi.2006.12.037] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2006] [Accepted: 12/20/2006] [Indexed: 11/20/2022]
Abstract
We have developed a methodology for extracting characteristic properties of a fitness landscape of interest by analyzing fitness data on an in vitro molecular evolution. The in vitro evolution is required to be conducted as the following "adaptive walk": a single parent sequence generates N mutant sequences as its offsprings, and the fittest individual among the N offsprings will become a new parent in the next generation. N is the library size of mutants to be screened in a single generation. Our theory of the adaptive walk on the "NK landscape" suggests the following: the adaptive walker starting from a random sequence climbs the landscape easily in an early stage, and then reaches a stationary phase in which the mutation-selection-random drift balance sets in. The stationary fitness value is nearly proportional to square root of ln N. Our analysis is performed from the following points: (1) stationary fitness values, (2) time series of fitness in the transitional state, (3) mutant's fitness distribution, and (4) the strength of selection pressure. Applying our methodology, we analyzed experimental data on the in vitro evolution of a random polypeptide (139 amino acids) toward acquiring infectivity (= ability to infect) of fd phage. As a result, we estimated that k is about 27 in this system, indicating that an arbitrary residue in a sequence is affected from other 23% residues. In this article, we demonstrated that the experimental data is consistent with our theoretical equations quantitatively, and that our methodology for extracting characteristic properties of a fitness landscape may be effective.
Collapse
Affiliation(s)
- Takuyo Aita
- Rational Evolutionary Design of Advanced Biomolecules(REDS) Group/JST, Saitama Small Enterprise Promotion Corporation SKIP City, Kawaguchi 333-0844, Japan.
| | | | | | | | | | | |
Collapse
|
23
|
Hayashi Y, Aita T, Toyota H, Husimi Y, Urabe I, Yomo T. Experimental rugged fitness landscape in protein sequence space. PLoS One 2006; 1:e96. [PMID: 17183728 PMCID: PMC1762315 DOI: 10.1371/journal.pone.0000096] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2006] [Accepted: 10/01/2006] [Indexed: 11/30/2022] Open
Abstract
The fitness landscape in sequence space determines the process of biomolecular evolution. To plot the fitness landscape of protein function, we carried out in vitro molecular evolution beginning with a defective fd phage carrying a random polypeptide of 139 amino acids in place of the g3p minor coat protein D2 domain, which is essential for phage infection. After 20 cycles of random substitution at sites 12–130 of the initial random polypeptide and selection for infectivity, the selected phage showed a 1.7×104-fold increase in infectivity, defined as the number of infected cells per ml of phage suspension. Fitness was defined as the logarithm of infectivity, and we analyzed (1) the dependence of stationary fitness on library size, which increased gradually, and (2) the time course of changes in fitness in transitional phases, based on an original theory regarding the evolutionary dynamics in Kauffman's n-k fitness landscape model. In the landscape model, single mutations at single sites among n sites affect the contribution of k other sites to fitness. Based on the results of these analyses, k was estimated to be 18–24. According to the estimated parameters, the landscape was plotted as a smooth surface up to a relative fitness of 0.4 of the global peak, whereas the landscape had a highly rugged surface with many local peaks above this relative fitness value. Based on the landscapes of these two different surfaces, it appears possible for adaptive walks with only random substitutions to climb with relative ease up to the middle region of the fitness landscape from any primordial or random sequence, whereas an enormous range of sequence diversity is required to climb further up the rugged surface above the middle region.
Collapse
Affiliation(s)
- Yuuki Hayashi
- Department of Bioinformatic Engineering, Osaka University, Suita, Osaka, Japan
| | - Takuyo Aita
- Rational Evolutionary Design of Advanced Biomolecules (REDS) Group/JST, Saitama Small Enterprise Promotion Corporation SKIP City, Kawaguchi, Saitama, Japan
- Department of Functional Materials Science, Saitama University, Saitama, Japan
| | - Hitoshi Toyota
- Department of Biotechnology, Osaka University, Suita, Osaka, Japan
| | - Yuzuru Husimi
- Department of Functional Materials Science, Saitama University, Saitama, Japan
| | - Itaru Urabe
- Department of Biotechnology, Osaka University, Suita, Osaka, Japan
| | - Tetsuya Yomo
- Department of Bioinformatic Engineering, Osaka University, Suita, Osaka, Japan
- Graduate School of Frontier Biosciences, Osaka University, Suita, Osaka, Japan
- Expoloratory Research for Advanced Technology (ERATO), Japan Science and Technology Agency (JST), Suita, Osaka, Japan
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
24
|
Matsuura T, Yomo T. In vitro evolution of proteins. J Biosci Bioeng 2006; 101:449-56. [PMID: 16935245 DOI: 10.1263/jbb.101.449] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2006] [Accepted: 05/08/2006] [Indexed: 11/17/2022]
Abstract
Consecutive rounds of diversification and selection of the fittest is believed to be the main driving force for the evolution of life. For the evolution of life to proceed, all living cells are surrounded by a lipid bilayer that separates their own genes from the external environment and from those of other organisms. In this way, the genetic information of an individual is replicated on the basis of their phenotype; thus the enrichment of the fittest will occur. Hence, evolution is based on linkage between genotype and phenotype owing to the surrounding of the genetic material with a barrier. The linkage between genotype and phenotype is also known to be essential for the directed evolution of proteins. Indeed, systems for molecular evolution, including phage display, ribosome display, and in vitro compartmentalization, all satisfy this requirement in different ways. These systems have been shown to be powerful tools for high-throughput screening for the functions of proteins, screening as many as <10(12) molecules in 1 d. These selection systems in combination with various gene libraries yield proteins with improved or altered biophysical properties, and may even allow the generation of proteins with novel functions.
Collapse
Affiliation(s)
- Tomoaki Matsuura
- Department of Biotechnology, Graduate School of Engineering, Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan
| | | |
Collapse
|
25
|
Nagao C, Terada TP, Yomo T, Sasai M. Correlation between evolutionary structural development and protein folding. Proc Natl Acad Sci U S A 2005; 102:18950-5. [PMID: 16365314 PMCID: PMC1323182 DOI: 10.1073/pnas.0509163102] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Evolution should have played important roles in determining folding mechanisms and structures of proteins. In this article we discuss how the folding mechanisms had been affected by the early stage of evolution through which the uniqueness of structure had developed. Although the process of such early-time evolution has remained a mystery, a plausible scenario is that the evolution of proteins toward the ordered structures was guided by functional selection pressure as demonstrated in vitro and in silico. We examine the in silico functional selection of sequences and show that there is a significant correlation between two different processes toward the unique 3D structure, the evolutionary development of structure through sequence selection, and the folding process of the resultant sequence. This finding could be rephrased as protein folding recapitulates the emergence of topology in the molecular evolution. The correlation suggests a guideline for engineering foldable proteins.
Collapse
Affiliation(s)
- Chioko Nagao
- Department of Complex Systems Science, Graduate School of Information Science, and Institute for Advanced Research, Nagoya University, Nagoya 464-8601, Japan
| | | | | | | |
Collapse
|
26
|
Abstract
Combinatorial libraries of de novo amino acid sequences can provide a rich source of diversity for the discovery of novel proteins with interesting and important activities. Randomly generated sequences, however, rarely fold into well-ordered proteinlike structures. To enhance the quality of a library, features of rational design must be used to focus sequence diversity into those regions of sequence space that are most likely to yield folded structures. This review describes how focused libraries can be constructed by designing the binary pattern of polar and nonpolar amino acids to favor proteins that contain abundant secondary structure, while simultaneously burying hydrophobic side chains and exposing hydrophilic side chains to solvent. The "binary code" for protein design was used to construct several libraries of de novo proteins, including both alpha-helical and beta-sheet structures. The recently determined solution structure of a binary patterned four-helix bundle is well ordered, thereby demonstrating that sequences that have neither been selected by evolution (in vivo or in vitro) nor designed by computer can form nativelike proteins. Examples are presented demonstrating how binary patterned libraries have successfully produced well-ordered structures, cofactor binding, catalytic activity, self-assembled monolayers, amyloid-like nanofibrils, and protein-based biomaterials.
Collapse
Affiliation(s)
- Michael H Hecht
- Department of Chemistry, Princeton University, Princeton, NJ 08544, USA.
| | | | | | | | | |
Collapse
|
27
|
Ito Y, Kawama T, Urabe I, Yomo T. Evolution of an arbitrary sequence in solubility. J Mol Evol 2004; 58:196-202. [PMID: 15042340 DOI: 10.1007/s00239-003-2542-2] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2003] [Accepted: 08/21/2003] [Indexed: 10/26/2022]
Abstract
We have investigated the evolvability of an insoluble random polypeptide, RP3-34, to a soluble form through iterative mutation and selection with the aid of the green fluorescent protein (GFP) folding reporter. To assess the solubility of the polypeptides in the selected clones of each generation, the polypeptide genes were detached from the GFP fusions and expressed with a His(6) tag. The solubility of the variant random polypeptides increased in each generation within the scope of the evolutionary process, and the polypeptides assumed a soluble form from the fourth generation. Analysis of the synonymous and nonsynonymous mutations found in the deduced amino acid sequence of the selected polypeptides revealed that selection had accelerated the evolutionary rate. The solubility and hydrophobicity of the polypeptides and the 25 arbitrarily chosen random polypeptides found in a previously prepared library were determined, analyzed, and interpreted from the landscape on the protein sequence space. This study showed the evolvability of an insoluble arbitrary sequence toward a soluble one, hence, it provides a new perspective on the field of artificial evolution.
Collapse
Affiliation(s)
- Yoichiro Ito
- Department of Biotechnology, Graduate School of Engineering, Osaka University, 2-1 Yamada-oka, Suita City, Osaka 565-0871, Japan
| | | | | | | |
Collapse
|
28
|
Affiliation(s)
- Tomoaki Matsuura
- Department of Bioinformatics Science, Graduate School of Information and Science Technology, Osaka University, Japan
| | | | | | | |
Collapse
|
29
|
Axe DD. Estimating the Prevalence of Protein Sequences Adopting Functional Enzyme Folds. J Mol Biol 2004; 341:1295-315. [PMID: 15321723 DOI: 10.1016/j.jmb.2004.06.058] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2003] [Revised: 05/02/2004] [Accepted: 06/18/2004] [Indexed: 11/25/2022]
Abstract
Proteins employ a wide variety of folds to perform their biological functions. How are these folds first acquired? An important step toward answering this is to obtain an estimate of the overall prevalence of sequences adopting functional folds. Since tertiary structure is needed for a typical enzyme active site to form, one way to obtain this estimate is to measure the prevalence of sequences supporting a working active site. Although the immense number of sequence combinations makes wholly random sampling unfeasible, two key simplifications may provide a solution. First, given the importance of hydrophobic interactions to protein folding, it seems likely that the sample space can be restricted to sequences carrying the hydropathic signature of a known fold. Second, because folds are stabilized by the cooperative action of many local interactions distributed throughout the structure, the overall problem of fold stabilization may be viewed reasonably as a collection of coupled local problems. This enables the difficulty of the whole problem to be assessed by assessing the difficulty of several smaller problems. Using these simplifications, the difficulty of specifying a working beta-lactamase domain is assessed here. An alignment of homologous domain sequences is used to deduce the pattern of hydropathic constraints along chains that form the domain fold. Starting with a weakly functional sequence carrying this signature, clusters of ten side-chains within the fold are replaced randomly, within the boundaries of the signature, and tested for function. The prevalence of low-level function in four such experiments indicates that roughly one in 10(64) signature-consistent sequences forms a working domain. Combined with the estimated prevalence of plausible hydropathic patterns (for any fold) and of relevant folds for particular functions, this implies the overall prevalence of sequences performing a specific function by any domain-sized fold may be as low as 1 in 10(77), adding to the body of evidence that functional folds require highly extraordinary sequences.
Collapse
Affiliation(s)
- Douglas D Axe
- The Babraham Institute, Structural Biology Unit, Babraham Research Campus, Cambridge CB2 4AT, UK.
| |
Collapse
|
30
|
Matsumoto T, Ishikawa S, Fukuda H, Kondo A. Construction of ethanol-tolerant yeast strains with combinatorial library-selected peptides. ACTA ACUST UNITED AC 2004. [DOI: 10.1016/j.molcatb.2003.12.023] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
31
|
Aita T, Husimi Y. Thermodynamical interpretation of an adaptive walk on a Mt. Fuji-type fitness landscape: Einstein relation-like formula holds in a stochastic evolution. J Theor Biol 2004; 225:215-28. [PMID: 14575655 DOI: 10.1016/s0022-5193(03)00240-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
We have theoretically studied the statistical properties of adaptive walks (or hill-climbing) on a Mt. Fuji-type fitness landscape in the multi-dimensional sequence space through mathematical analysis and computer simulation. The adaptive walk is characterized by the "mutation distance" d as the step-width of the walker and the "population size" N as the number of randomly generated d-fold point mutants to be screened. In addition to the fitness W, we introduced the following quantities analogous to thermodynamical concepts: "free fitness" G(W) is identical with W+T x S(W), where T is the "evolutionary temperature" T infinity square root of d/lnN and S(W) is the entropy as a function of W, and the "evolutionary force" X is identical with d(G(W)/T)/dW, that is caused by the mutation and selection pressure. It is known that a single adaptive walker rapidly climbs on the fitness landscape up to the stationary state where a "mutation-selection-random drift balance" is kept. In our interpretation, the walker tends to the maximal free fitness state, driven by the evolutionary force X. Our major findings are as follows: First, near the stationary point W*, the "climbing rate" J as the expected fitness change per generation is described by J approximately L x X with L approximately V/2, where V is the variance of fitness distribution on a local landscape. This simple relationship is analogous to the well-known Einstein relation in Brownian motion. Second, the "biological information gain" (DeltaG/T) through adaptive walk can be described by combining the Shannon's information gain (DeltaS) and the "fitness information gain" (DeltaW/T).
Collapse
Affiliation(s)
- Takuyo Aita
- Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology, 2-43 Aomi, Koto-ku, Tokyo 135-0064, Japan
| | | |
Collapse
|
32
|
Suzuki H, Ono N, Yuta K. Several necessary conditions for the evolution of complex forms of life in an artificial environment. ARTIFICIAL LIFE 2003; 9:153-174. [PMID: 12906727 DOI: 10.1162/106454603322221504] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
In order for an artificial life (Alife) system to evolve complex creatures, an artificial environment prepared by a designer has to satisfy several conditions. To clarify this requirement, we first assume that an artificial environment implemented in the computational medium is composed of an information space in which elementary symbols move around and react with each other according to human-prepared elementary rules. As fundamental properties of these factors (space, symbols, transportation, and reaction), we present ten criteria from a comparison with the biochemical reaction space in the real world. Then, in the latter half of the article, we take several computational Alife systems one by one, and assess them in terms of the proposed criteria. The assessment can be used not only for improving previous Alife systems but also for devising new Alife models in which complex forms of artificial creatures can be expected to evolve.
Collapse
Affiliation(s)
- Hideaki Suzuki
- ATR Human Information Science Laboratories, 2-2-2 Hikaridai, Seika-cho, Soraku-gun, Kyoto 619-0288, Japan.
| | | | | |
Collapse
|
33
|
Abstract
Directed evolution has now been used for over two decades as an alternative to rational design for protein engineering. Protein function, however, is complex, and modifying enzyme activity is a tall order. We can now improve existing enzyme activity, change enzyme selectivity and evolve function de novo using directed evolution. Although directed evolution is now used routinely to improve existing enzyme activity, there are still only a handful of examples where substrate selectivity has been modified sufficiently for practical application, and the de novo evolution of function largely eludes us.
Collapse
Affiliation(s)
- Haiyan Tao
- Department of Chemistry, Columbia University, New York, NY 10027, USA.
| | | |
Collapse
|