51
Wagner A. The low cost of recombination in creating novel phenotypes: Recombination can create new phenotypes while disrupting well-adapted phenotypes much less than mutation. Bioessays 2011; 33:636-46. [PMID: 21633964 DOI: 10.1002/bies.201100027] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Indexed: 11/09/2022]
Abstract
Recombination is often considered a disruptive force for well-adapted phenotypes, but recent evidence suggests that this cost of recombination can be small. A key benefit of recombination is that it can help create proteins and regulatory circuits with novel and useful phenotypes more efficiently than point mutation. Its effectiveness stems from the large-scale reorganization of genotypes that it causes, which can help explore far-flung regions in genotype space. Recent work on complex phenotypes in model gene regulatory circuits and proteins shows that the disruptive effects of recombination can be very mild compared to the effects of mutation. Recombination thus can have great benefits at a modest cost, but we do not understand the reasons well. A better understanding might shed light on the evolution of recombination and help improve evolutionary strategies in biochemical engineering.
Affiliation(s)
- Andreas Wagner
- Institute of Evolutionary Biology and Environmental Sciences, University of Zurich, Zurich, Switzerland.
52
Abstract
The understanding of the molecular mechanisms of allostery in rabbit muscle pyruvate kinase (RMPK) is still in its infancy. Although little is known about the ground rules by which its functions are regulated, RMPK is an ideal system for addressing basic questions about the fundamental chemical principles governing the regulatory mechanisms of this enzyme, which has a TIM (α/β)8 barrel structural motif [Copley, R. R., and Bork, P. (2000). Homology among (βα)8 barrels: Implications for the evolution of metabolic pathways. J. Mol. Biol. 303, 627-640; Farber, G. K., and Petsko, G. A. (1990). The evolution of α/β barrel enzymes. Trends Biochem. Sci. 15, 228-234; Gerlt, J. A., and Babbitt, P. C. (2001). Divergent evolution of enzymatic function: Mechanistically diverse superfamilies and functionally distinct superfamilies. Annu. Rev. Biochem. 70, 209-246; Hegyi, H., and Gerstein, M. (1999). The relationship between protein structure and function: A comprehensive survey with application to the yeast genome. J. Mol. Biol. 288, 147-164; Wierenga, R. K. (2001). The TIM-barrel fold: A versatile framework for efficient enzymes. FEBS Lett. 492, 193-198]. RMPK is a homotetramer. Each subunit consists of 530 amino acids and multiple domains. The active site resides between the A and B domains. Besides the basic TIM-barrel motif, RMPK also exhibits looped-out regions in the α/β barrel of each monomer, forming the B- and C-domains. The two isozymes of PK, namely the kidney and muscle isozymes, exhibit very different allosteric behaviors under the same experimental conditions. The only amino acid sequence differences between the mammalian kidney and muscle PK isozymes are located in the C-domain and are involved in intersubunit interactions. Thus, embedded in these two isozymes of PK are the rules involved in engineering the popular TIM (α/β)8 motif to modulate its allosteric properties. The PK system exhibits many of the properties that will allow mining of the ground rules governing the correlative linkages among sequence, fold, and function. In this chapter, we review the approaches to acquire the fundamental functional and structural energetics that establish the linkages among this intricate network of linked multiequilibria. Results from these diverse approaches are integrated to establish a working model to represent the complex network of multiple linked reactions, which ultimately leads to the observation of allosteric regulation of PK.
53
Ullrich A, Rohrschneider M, Scheuermann G, Stadler PF, Flamm C. In silico evolution of early metabolism. Artificial Life 2011; 17:87-108. [PMID: 21370961 DOI: 10.1162/artl_a_00021] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Indexed: 05/30/2023]
Abstract
We developed a simulation tool for investigating the evolution of early metabolism, allowing us to speculate on the formation of metabolic pathways from catalyzed chemical reactions and on the development of their characteristic properties. Our model consists of a protocellular entity with a simple RNA-based genetic system and an evolving metabolism of catalytically active ribozymes that manipulate a rich underlying chemistry. Ensuring an almost open-ended and fairly realistic simulation is crucial for understanding the first steps in metabolic evolution. We show here how our simulation tool can be helpful in arguing for or against hypotheses on the evolution of metabolic pathways. We demonstrate that seemingly mutually exclusive hypotheses may well be compatible when we take into account that different processes dominate different phases in the evolution of a metabolic system. Our results suggest that forward evolution shapes the metabolic network in the very early steps of evolution. In later and more complex stages, enzyme recruitment supersedes forward evolution, keeping a core set of pathways from the early phase.
Affiliation(s)
- Alexander Ullrich
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University of Leipzig, Germany.
54
Doud EH, Perlstein DL, Wolpert M, Cane DE, Walker S. Two distinct mechanisms for TIM barrel prenyltransferases in bacteria. J Am Chem Soc 2011; 133:1270-3. [PMID: 21214173 PMCID: PMC3033458 DOI: 10.1021/ja109578b] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Indexed: 11/30/2022]
Abstract
The reactions of two bacterial TIM barrel prenyltransferases (PTs), MoeO5 and PcrB, were explored. MoeO5, the enzyme responsible for the first step in moenomycin biosynthesis, catalyzes the transfer of farnesyl to 3-phosphoglyceric acid (3PG) to give a product containing a cis-allylic double bond. We show that this reaction involves isomerization to a nerolidyl pyrophosphate intermediate followed by bond rotation prior to attack by the nucleophile. This mechanism is unprecedented for a prenyltransferase that catalyzes an intermolecular coupling. We also show that PcrB transfers geranyl and geranylgeranyl groups to glycerol-1-phosphate (G1P), making it the first known bacterial enzyme to use G1P as a substrate. Unlike MoeO5, PcrB catalyzes prenyl transfer without isomerization to give products that retain the trans-allylic bond of the prenyl donors. The TIM barrel family of PTs is unique in including enzymes that catalyze prenyl transfer by distinctly different reaction mechanisms.
55
Bernhardsson S, Gerlee P, Lizana L. Structural correlations in bacterial metabolic networks. BMC Evol Biol 2011; 11:20. [PMID: 21251250 PMCID: PMC3033826 DOI: 10.1186/1471-2148-11-20] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Received: 10/28/2010] [Accepted: 01/20/2011] [Indexed: 11/22/2022] Open
Abstract
Background: Evolution of metabolism occurs through the acquisition and loss of genes whose products act as enzymes in metabolic reactions, and from a presumably simple primordial metabolism the organisms living today have evolved complex and highly variable metabolisms. We have studied this phenomenon by comparing the metabolic networks of 134 bacterial species with known phylogenetic relationships, and by studying a neutral model of metabolic network evolution. Results: We consider the 'union-network' of 134 bacterial metabolisms, and also the union of two smaller subsets of closely related species. Each reaction-node is tagged with the number of organisms it belongs to, which we denote organism degree (OD), a key concept in our study. Network analysis shows that common reactions are found at the centre of the network and that the average OD decreases as we move to the periphery. Nodes of the same OD are also more likely to be connected to each other compared to a random OD relabelling based on their occurrence in the real data. This trend persists up to a distance of around five reactions. A simple growth model of metabolic networks is used to investigate the biochemical constraints put on metabolic-network evolution. Despite this seemingly drastic simplification, a 'union-network' of a collection of unrelated model networks, free of any selective pressure, still exhibits structural features similar to those of its bacterial counterpart. Conclusions: The OD distribution quantifies topological properties of the evolutionary history of bacterial metabolic networks, and lends additional support to the importance of horizontal gene transfer during bacterial metabolic evolution, where new reactions are attached at the periphery of the network. The neutral model of metabolic network growth can reproduce the main features of real networks, but we observe that the real networks contain a smaller common core, while they are more similar at the periphery of the network. This suggests that natural selection and biochemical correlations can act both to diversify and to narrow down metabolic evolution.
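The organism degree described in this abstract can be illustrated with a minimal sketch. It assumes each organism's metabolic network is given as a list of reaction-to-reaction links; the function names, the input format, and the toy data are hypothetical and are not taken from the paper. The sketch only counts how many organisms contain each reaction and what fraction of union-network links join reactions of equal OD; it does not reproduce the random-relabelling comparison the authors describe.

```python
from collections import Counter


def organism_degree(networks):
    """Build the union network and tag each reaction with its organism degree.

    networks: dict mapping organism name -> list of (reaction_a, reaction_b) links.
    Returns (od, union_edges), where od maps reaction -> number of organisms
    containing it and union_edges is the set of undirected links seen anywhere.
    """
    od = Counter()
    union_edges = set()
    for edges in networks.values():
        reactions = set()
        for a, b in edges:
            reactions.update((a, b))
            # Store links with sorted endpoints so (a, b) and (b, a) coincide.
            union_edges.add(tuple(sorted((a, b))))
        # Each organism contributes at most 1 to the OD of a reaction.
        od.update(reactions)
    return dict(od), union_edges


def same_od_fraction(od, union_edges):
    """Fraction of union-network links whose two reactions share the same OD."""
    same = sum(1 for a, b in union_edges if od[a] == od[b])
    return same / len(union_edges)


# Hypothetical toy data: three organisms sharing a common core link r1-r2.
toy_networks = {
    "org1": [("r1", "r2"), ("r2", "r3")],
    "org2": [("r1", "r2"), ("r2", "r4")],
    "org3": [("r1", "r2")],
}

od, union_edges = organism_degree(toy_networks)
fraction = same_od_fraction(od, union_edges)
print(od)
print(fraction)
```

In the toy data the shared reactions r1 and r2 receive the highest OD while r3 and r4, each present in a single organism, sit at the periphery, which mirrors the core-periphery picture in the abstract on a very small scale.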
Affiliation(s)
- Sebastian Bernhardsson
- Center for Models of Life, Niels Bohr Institute, Blegdamsvej 17 DK-2100 Copenhagen Ø, Denmark.
56
Wierenga RK, Kapetaniou EG, Venkatesan R. Triosephosphate isomerase: a highly evolved biocatalyst. Cell Mol Life Sci 2010; 67:3961-82. [PMID: 20694739 PMCID: PMC11115733 DOI: 10.1007/s00018-010-0473-9] [Citation(s) in RCA: 165] [Impact Index Per Article: 11.0] [Received: 06/24/2010] [Revised: 07/15/2010] [Accepted: 07/16/2010] [Indexed: 02/04/2023]
Abstract
Triosephosphate isomerase (TIM) is a perfectly evolved enzyme that very rapidly interconverts dihydroxyacetone phosphate and D-glyceraldehyde-3-phosphate. Its catalytic site is at the dimer interface, but the four catalytic residues, Asn11, Lys13, His95 and Glu167, are from the same subunit. Glu167 is the catalytic base. An important feature of the TIM active site is the concerted closure of loop-6 and loop-7 on ligand binding, shielding the catalytic site from bulk solvent. The buried active site stabilises the enediolate intermediate. The catalytic residue Glu167 is at the beginning of loop-6. On closure of loop-6, the Glu167 carboxylate moiety moves approximately 2 Å to the substrate. The dynamic properties of the Glu167 side chain in the enzyme substrate complex are a key feature of the proton shuttling mechanism. Two proton shuttling mechanisms, the classical and the criss-cross mechanism, are responsible for the interconversion of the substrates of this enolising enzyme.
Affiliation(s)
- R K Wierenga
- Biocenter Oulu and Department of Biochemistry, University of Oulu, P.O. Box 3000, 90014 Oulu, Finland.
57
Hung CL, Lee C, Lin CY, Chang CH, Chung YC, Yi Tang C. Feature amplified voting algorithm for functional analysis of protein superfamily. BMC Genomics 2010; 11 Suppl 3:S14. [PMID: 21143781 PMCID: PMC2999344 DOI: 10.1186/1471-2164-11-s3-s14] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND: Identifying the regions associated with protein function is a singularly important task in the post-genomic era. Biological studies often identify functional enzyme residues by amino acid sequences, particularly when related structural information is unavailable. In some cases of protein superfamilies, functional residues are difficult to detect by current alignment tools or evolutionary strategies when phylogenetic relationships do not parallel their protein functions. The solution proposed in this study is the Feature Amplified Voting Algorithm with Three-profile alignment (FAVAT). The core concept of FAVAT is to reveal the desired features of a target enzyme or protein by voting on three different property groups aligned by a three-profile alignment method. Functional residues of a target protein can then be retrieved by FAVAT analysis. In this study, the amidohydrolase superfamily was an interesting case for verifying the proposed approach because it contains divergent enzymes and proteins. RESULTS: FAVAT was used to identify critical residues of mammalian imidase, a member of the amidohydrolase superfamily. Members of this superfamily were first classified by their functional properties and source organisms. After FAVAT analysis, candidate residues were identified and compared to a bacterial hydantoinase whose crystal structure (1GKQ) has been fully elucidated. One modified lysine, three histidines and one aspartate were found to participate in the coordination of metal ions in the active site. The FAVAT analysis also redressed the misrecognition of metal coordinator Asp57 by the multiple sequence alignment (MSA) method. Several other amino acid residues known to be related to the function or structure of mammalian imidase were also identified. CONCLUSIONS: FAVAT is shown to predict functionally important amino acids in the amidohydrolase superfamily. This strategy effectively identifies functionally important residues by analyzing the discrepancy between the sequence and functional properties of related proteins in a superfamily, and it should be applicable to other protein families.
Affiliation(s)
- Che-Lun Hung
- Department of Computer Science, National Tsing Hua University, 101, Section 2 Kuang Fu Road, Hsinchu, Taiwan
58
Lee M, Gräwert T, Quitterer F, Rohdich F, Eppinger J, Eisenreich W, Bacher A, Groll M. Biosynthesis of isoprenoids: crystal structure of the [4Fe-4S] cluster protein IspG. J Mol Biol 2010; 404:600-10. [PMID: 20932974 DOI: 10.1016/j.jmb.2010.09.050] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.7] [Received: 07/31/2010] [Revised: 09/17/2010] [Accepted: 09/21/2010] [Indexed: 11/24/2022]
Abstract
IspG protein serves as the penultimate enzyme of the recently discovered non-mevalonate pathway for the biosynthesis of the universal isoprenoid precursors, isopentenyl diphosphate and dimethylallyl diphosphate. The enzyme catalyzes the reductive ring opening of 2C-methyl-D-erythritol 2,4-cyclodiphosphate, which affords 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate. The protein was crystallized under anaerobic conditions, and its three-dimensional structure was determined to a resolution of 2.7 Å. Each subunit of the C2-symmetric homodimer folds into two domains connected by a short linker sequence. The N-terminal domain (N domain) is an eight-stranded β barrel that belongs to the large TIM-barrel superfamily. The C-terminal domain (C domain) consists of a β sheet that is flanked on both sides by helices. One glutamate and three cysteine residues of the C domain coordinate a [4Fe-4S] cluster. Homodimer formation involves an extended contact area (about 1100 Å²) between helices 8 and 9 of each respective β barrel. Moreover, each C domain contacts the N domain of the partner subunit, but the interface regions are small (about 430 Å²). We propose that the enzyme substrate binds to the positively charged surface area at the C-terminal pole of the β barrel. The C domain carrying the iron-sulfur cluster could then move over to form a closed conformation where the substrate is sandwiched between the N domain and the C domain. This article completes the set of three-dimensional structures of the non-mevalonate pathway enzymes, which are of specific interest as potential targets for tuberculostatic and antimalarial drugs.
Affiliation(s)
- Matthias Lee
- Lehrstuhl für Biochemie, Center for Integrated Protein Science Munich, Department Chemie, Technische Universität München, Lichtenbergstrasse 4, D-85747 Garching, Germany
59
Wang M, Jiang YY, Kim KM, Qu G, Ji HF, Mittenthal JE, Zhang HY, Caetano-Anollés G. A universal molecular clock of protein folds and its power in tracing the early history of aerobic metabolism and planet oxygenation. Mol Biol Evol 2010; 28:567-82. [PMID: 20805191 DOI: 10.1093/molbev/msq232] [Citation(s) in RCA: 104] [Impact Index Per Article: 6.9] [Indexed: 01/02/2023] Open
Abstract
The standard molecular clock describes a constant rate of molecular evolution and provides a powerful framework for evolutionary timescales. Here, we describe the existence and implications of a molecular clock of folds, a universal recurrence in the discovery of new structures in the world of proteins. Using a phylogenomic structural census in hundreds of proteomes, we build phylogenies and time lines of domains at fold and fold superfamily levels of structural complexity. These time lines correlate approximately linearly with geological timescales and were here used to date two crucial events in life history, planet oxygenation and organism diversification. We first dissected the structures and functions of enzymes in simulated metabolic networks. The placement of anaerobic and aerobic enzymes in the time line revealed that aerobic metabolism emerged about 2.9 billion years (giga-annum; Ga) ago and expanded during a period of about 400 My, reaching what is known as the Great Oxidation Event. During this period, enzymes recruited old and new folds for oxygen-mediated enzymatic activities. Remarkably, the first fold lost by a superkingdom disappeared in Archaea 2.6 Ga ago, within the span of oxygen rise, suggesting that oxygen also triggered diversification of life. The implications of a molecular clock of folds are many and important for the neutral theory of molecular evolution and for understanding the growth and diversity of the protein world. The clock also extends the standard concept that was specific to molecules and their timescales and turns it into a universal timescale-generating tool.
Affiliation(s)
- Minglei Wang
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana-Champaign, USA
60
Glasner ME, Gerlt JA, Babbitt PC. Mechanisms of protein evolution and their application to protein engineering. Advances in Enzymology and Related Areas of Molecular Biology 2010; 75:193-239, xii-xiii. [PMID: 17124868 DOI: 10.1002/9780471224464.ch3] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Indexed: 12/14/2022]
Abstract
Protein engineering holds great promise for the development of new biosensors, diagnostics, therapeutics, and agents for bioremediation. Despite some remarkable successes in experimental and computational protein design, engineered proteins rarely achieve the efficiency or specificity of natural enzymes. Current protein design methods utilize evolutionary concepts, including mutation, recombination, and selection, but the inability to fully recapitulate the success of natural evolution suggests that some evolutionary principles have not been fully exploited. One aspect of protein engineering that has received little attention is how to select the most promising proteins to serve as templates, or scaffolds, for engineering. Two evolutionary concepts that could provide a rational basis for template selection are the conservation of catalytic mechanisms and functional promiscuity. Knowledge of the catalytic motifs responsible for conserved aspects of catalysis in mechanistically diverse superfamilies could be used to identify promising templates for protein engineering. Second, protein evolution often proceeds through promiscuous intermediates, suggesting that templates which are naturally promiscuous for a target reaction could enhance protein engineering strategies. This review explores these ideas and alternative hypotheses concerning protein evolution and engineering. Future research will determine if application of these principles will lead to a protein engineering methodology governed by predictable rules for designing efficient, novel catalysts.
Affiliation(s)
- Margaret E Glasner
- Department of Biopharmaceutical Sciences, University of California-San Francisco, San Francisco, CA 94143, USA
61
Abstract
Many protein classification systems capture homologous relationships by grouping domains into families and superfamilies on the basis of sequence similarity. Superfamilies with similar 3D structures are further grouped into folds. In the absence of discernable sequence similarity, these structural similarities were long thought to have originated independently, by convergent evolution. However, the growth of databases and advances in sequence comparison methods have led to the discovery of many distant evolutionary relationships that transcend the boundaries of superfamilies and folds. To investigate the contributions of convergent versus divergent evolution in the origin of protein folds, we clustered representative domains of known structure by their sequence similarity, treating them as point masses in a virtual 2D space which attract or repel each other depending on their pairwise sequence similarities. As expected, families in the same superfamily form tight clusters. But often, superfamilies of the same fold are linked with each other, suggesting that the entire fold evolved from an ancient prototype. Strikingly, some links connect superfamilies with different folds. They arise from modular peptide fragments of between 20 and 40 residues that co-occur in the connected folds in disparate structural contexts. These may be descendants of an ancestral pool of peptide modules that evolved as cofactors in the RNA world and from which the first folded proteins arose by amplification and recombination. Our galaxy of folds summarizes, in a single image, most known and many yet undescribed homologous relationships between protein superfamilies, providing new insights into the evolution of protein domains.
Affiliation(s)
- Vikram Alva
- Department of Protein Evolution, Max-Planck-Institute for Developmental Biology, Tübingen 72076, Germany
62
Opperman DJ, Sewell BT, Litthauer D, Isupov MN, Littlechild JA, van Heerden E. Crystal structure of a thermostable old yellow enzyme from Thermus scotoductus SA-01. Biochem Biophys Res Commun 2010; 393:426-31. [PMID: 20138824 DOI: 10.1016/j.bbrc.2010.02.011] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.4] [Received: 01/27/2010] [Accepted: 02/03/2010] [Indexed: 11/20/2022]
Abstract
Recent characterization of the chromate reductase (CrS) from the thermophile Thermus scotoductus SA-01 revealed this enzyme to be related to the Old Yellow Enzyme (OYE) family. Here, we report the structure of a thermostable OYE homolog in its holoform at 2.2 Å as well as its complex with p-hydroxybenzaldehyde (pHBA). The enzyme crystallized as octamers, with the monomers showing a classical TIM barrel fold which upon dimerization yields the biologically active form of the protein. A sulfate ion is bound above the si-side of the non-covalently bound FMN cofactor in the oxidized solved structure but is displaced upon pHBA binding. The active-site architecture is highly conserved, as in other members of this enzyme family. The pHBA in the CrS complex is positioned by hydrogen bonding to the two conserved catalytic-site histidines. The most prominent structural difference between CrS and other OYE homologs is the size of the "capping domain". Thermostabilization of the enzyme is achieved in part through increased proline content within loops and turns as well as increased intersubunit interactions through hydrogen bonding and complex salt bridge networks. CrS is able to reduce the C=C bonds of α,β-unsaturated carbonyl compounds with a preference towards cyclic substrates; however, no activity was observed towards β-substituted substrates. Mutational studies have confirmed the role of Tyr177 as the proposed proton donor, although reduction could still occur at a reduced rate when this residue was mutated to phenylalanine.
Affiliation(s)
- Diederik J Opperman
- Department of Microbial, Biochemical and Food Biotechnology, BioPAD Metagenomics Platform, University of the Free State, Bloemfontein 9300, South Africa
63
Evolution of biomolecular networks: lessons from metabolic and protein interactions. Nat Rev Mol Cell Biol 2009; 10:791-803. [PMID: 19851337 DOI: 10.1038/nrm2787] [Citation(s) in RCA: 148] [Impact Index Per Article: 9.3] [Indexed: 12/14/2022]
Abstract
Despite only becoming popular at the beginning of this decade, biomolecular networks are now frameworks that facilitate many discoveries in molecular biology. The nodes of these networks are usually proteins (specifically enzymes in metabolic networks), whereas the links (or edges) are their interactions with other molecules. These networks are made up of protein-protein interactions or enzyme-enzyme interactions through shared metabolites in the case of metabolic networks. Evolutionary analysis has revealed that changes in the nodes and links in protein-protein interaction and metabolic networks are subject to different selection pressures owing to distinct topological features. However, many evolutionary constraints can be uncovered only if temporal and spatial aspects are included in the network analysis.
64
Herman P, Lee JC. Functional energetic landscape in the allosteric regulation of muscle pyruvate kinase. 1. Calorimetric study. Biochemistry 2009; 48:9448-55. [PMID: 19719244 PMCID: PMC2759577 DOI: 10.1021/bi900279x] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Indexed: 11/28/2022]
Abstract
Rabbit muscle pyruvate kinase (RMPK) is an important allosteric enzyme of the glycolytic pathway catalyzing a transfer of the phosphate from phosphoenolpyruvate (PEP) to ADP. The energetic landscape of the allosteric regulatory mechanism of RMPK was characterized by isothermal titration calorimetry (ITC) in the temperature range from 4 to 45 °C. ITC data for RMPK binding to substrates PEP and ADP, for the allosteric inhibitor Phe, and for combination of ADP and Phe were globally analyzed. The thermodynamic parameters characterizing the linked-multiple-equilibrium system were extracted. Four novel insights were uncovered. (1) The binding preference of ADP for either the T or R state is temperature-dependent, namely, more favorable to the T and R states at high and low temperatures, respectively. This crossover of affinity toward R and T states implies that ADP plays a complex role in modulating the allosteric behavior of RMPK. Depending on the temperature, binding of ADP can regulate RMPK activity by favoring the enzyme to either the R or T state. (2) The binding of Phe is negatively coupled to that of ADP; i.e., Phe and ADP prefer not to bind to the same subunit of RMPK. (3) The release or absorption of protons linked to the various equilibria is specific to the particular reaction. As a consequence, pH will exert a complex effect on these linked equilibria, resulting in the proton being an allosteric regulatory ligand of RMPK. (4) The R ⇌ T equilibrium is accompanied by a significant ΔCp, rendering RMPK most sensitive to temperature under physiological conditions. During muscle activity, both pH and temperature fluctuations are known to happen; thus, results of this study are physiologically relevant.
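Why a significant heat-capacity change makes an equilibrium temperature-sensitive can be seen from the standard thermodynamic relations below. They are textbook expressions, not equations quoted from the paper, and they assume that ΔCp is independent of temperature over the range considered; the reference temperature T_0 and the equilibrium constant symbol K are notation introduced here for illustration.

```latex
\begin{aligned}
\Delta H(T) &= \Delta H(T_0) + \Delta C_p\,(T - T_0) \\
\Delta S(T) &= \Delta S(T_0) + \Delta C_p \ln\frac{T}{T_0} \\
\Delta G(T) &= \Delta H(T) - T\,\Delta S(T) \\
K(T) &= \exp\!\left(-\frac{\Delta G(T)}{RT}\right)
\end{aligned}
```

With a nonzero ΔCp, both ΔH and ΔS vary with temperature, so ΔG(T) is curved rather than linear and the equilibrium constant K(T) can shift markedly over a few degrees.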
Affiliation(s)
- Petr Herman
- Faculty of Mathematics and Physics, Institute of Physics, Charles University, Ke Karlovu 5, 121 16 Prague, Czech Republic
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX 77555-1055, USA
- J. Ching Lee
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX 77555-1055, USA
65
Kim BH, Cheng H, Grishin NV. HorA web server to infer homology between proteins using sequence and structural similarity. Nucleic Acids Res 2009; 37:W532-8. [PMID: 19417074 PMCID: PMC2703895 DOI: 10.1093/nar/gkp328] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Indexed: 01/28/2023] Open
Abstract
The biological properties of proteins are often gleaned through comparative analysis of evolutionary relatives. Although protein structure similarity search methods detect more distant homologs than purely sequence-based methods, structural resemblance can result from either homology (common ancestry) or analogy (similarity without common ancestry). While many existing web servers detect structural neighbors, they do not explicitly address the question of homology versus analogy. Here, we present a web server named HorA (Homology or Analogy) that identifies likely homologs for a query protein structure. Unlike other servers, HorA combines sequence information from state-of-the-art profile methods with structure information from spatial similarity measures using an advanced computational technique. HorA aims to identify biologically meaningful connections rather than purely 3D-geometric similarities. The HorA method finds approximately 90% of remote homologs defined in the manually curated database SCOP. HorA will be especially useful for finding remote homologs that might be overlooked by other sequence or structural similarity search servers. The HorA server is available at http://prodata.swmed.edu/horaserver.
Affiliation(s)
- Bong-Hyun Kim
- Department of Biochemistry, University of Texas, Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA
66
Masters M, Blakely G, Coulson A, McLennan N, Yerko V, Acord J. Protein folding in Escherichia coli: the chaperonin GroE and its substrates. Res Microbiol 2009; 160:267-77. [PMID: 19393741 DOI: 10.1016/j.resmic.2009.04.002] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Received: 01/14/2009] [Revised: 04/02/2009] [Accepted: 04/10/2009] [Indexed: 10/20/2022]
Abstract
A brief summary of the role of DnaK and GroE chaperones in protein folding precedes a discussion of the role of GroE in Escherichia coli. We consider its obligate substrates, the 8 that are both obligate and essential, and the prospects for constructing a mutant that could survive without it. Structural features of GroE-dependent polypeptides are also considered.
Affiliation(s)
- Millicent Masters
- Institute of Cell Biology, University of Edinburgh, Kings Buildings, Edinburgh EH93JR, Scotland, United Kingdom.
|
67
|
Fani R, Fondi M. Origin and evolution of metabolic pathways. Phys Life Rev 2009; 6:23-52. [PMID: 20416849 DOI: 10.1016/j.plrev.2008.12.003] [Citation(s) in RCA: 86] [Impact Index Per Article: 5.4] [Received: 05/01/2008] [Revised: 11/27/2008] [Accepted: 12/01/2008] [Indexed: 10/21/2022]
Abstract
The emergence and evolution of metabolic pathways represented a crucial step in molecular and cellular evolution. In fact, the exhaustion of the prebiotic supply of amino acids and other compounds that were likely present in the ancestral environment imposed an important selective pressure, favoring those primordial heterotrophic cells that became capable of synthesizing those molecules. Thus, the emergence of metabolic pathways allowed primitive organisms to become increasingly less dependent on exogenous sources of organic compounds. Comparative analyses of genes and genomes from organisms belonging to Archaea, Bacteria and Eukarya revealed that, during evolution, different forces and molecular mechanisms might have driven the shaping of genomes and the emergence of new metabolic abilities. Among these, gene elongation and gene and operon duplications undoubtedly played a major role, since they can lead to the (immediate) appearance of new genetic material that, in turn, might undergo evolutionary divergence, giving rise to new genes coding for new metabolic abilities. Gene duplication has been invoked in the different schemes proposed to explain why and how the extant metabolic pathways have arisen and been shaped. Both the analysis of completely sequenced genomes and directed evolution experiments strongly support one of them, i.e. the patchwork hypothesis, according to which metabolic pathways have been assembled through the recruitment of primitive enzymes that could react with a wide range of chemically related substrates. However, the analysis of the structure and organization of genes belonging to ancient metabolic pathways, such as histidine biosynthesis and nitrogen fixation, suggested that other hypotheses, i.e. the retrograde hypothesis or the semi-enzymatic theory, may account for the emergence of some metabolic routes.
Affiliation(s)
- Renato Fani
- Laboratory of Microbial and Molecular Evolution, Department of Evolutionary Biology, Via Romana 17-19, University of Florence, Italy
|
68
|
An enzymatic atavist revealed in dual pathways for water activation. PLoS Biol 2008; 6:e206. [PMID: 18752347 PMCID: PMC2525682 DOI: 10.1371/journal.pbio.0060206] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Received: 10/30/2007] [Accepted: 07/15/2008] [Indexed: 11/21/2022] Open
Abstract
Inosine monophosphate dehydrogenase (IMPDH) catalyzes an essential step in the biosynthesis of guanine nucleotides. This reaction involves two different chemical transformations, an NAD-linked redox reaction and a hydrolase reaction, that utilize mutually exclusive protein conformations with distinct catalytic residues. How did Nature construct such a complicated catalyst? Here we employ a “Wang-Landau” metadynamics algorithm in hybrid quantum mechanical/molecular mechanical (QM/MM) simulations to investigate the mechanism of the hydrolase reaction. These simulations show that the lowest energy pathway utilizes Arg418 as the base that activates water, in remarkable agreement with previous experiments. Surprisingly, the simulations also reveal a second pathway for water activation involving a proton relay from Thr321 to Glu431. The energy barrier for the Thr321 pathway is similar to the barrier observed experimentally when Arg418 is removed by mutation. The Thr321 pathway dominates at low pH when Arg418 is protonated, which predicts that the substitution of Glu431 with Gln will shift the pH-rate profile to the right. This prediction is confirmed in subsequent experiments. Phylogenetic analysis suggests that the Thr321 pathway was present in the ancestral enzyme, but was lost when the eukaryotic lineage diverged. We propose that the primordial IMPDH utilized the Thr321 pathway exclusively, and that this mechanism became obsolete when the more sophisticated catalytic machinery of the Arg418 pathway was installed. Thus, our simulations provide an unanticipated window into the evolution of a complex enzyme.
Many enzymes have the remarkable ability to catalyze several different chemical transformations. For example, IMP dehydrogenase catalyzes both an NAD-linked redox reaction and a hydrolase reaction. These reactions utilize distinct catalytic residues and protein conformations. How did Nature construct such a complicated catalyst? While using computational methods to investigate the mechanism of the hydrolase reaction, we have discovered that IMP dehydrogenase contains two sets of catalytic residues to activate water. Importantly, the simulations are in good agreement with previous experimental observations and are further validated by subsequent experiments. Phylogenetic analysis suggests that the simpler, less efficient catalytic machinery was present in the ancestral enzyme, but was lost when the eukaryotic lineage diverged. We propose that the primordial IMP dehydrogenase utilized the less efficient machinery exclusively, and that this mechanism became obsolete when the more sophisticated catalytic machinery evolved. The presence of the less efficient machinery could facilitate adaptation, making the evolutionary challenge of the IMPDH reaction much less formidable. Thus our simulations provide an unanticipated window into the evolution of a complex enzyme.
How does nature construct complex catalysts? Molecular simulations revealed two sets of catalytic residues in the enzyme IMPDH, one of which seems to represent a primitive catalytic machinery that may be a vestige of evolution.
|
69
|
Experimental Evidence for the Existence of a Stable Half-Barrel Subdomain in the (β/α)8-Barrel Fold. J Mol Biol 2008; 382:458-66. [DOI: 10.1016/j.jmb.2008.07.040] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Received: 03/28/2008] [Revised: 07/14/2008] [Accepted: 07/16/2008] [Indexed: 11/16/2022]
|
70
|
Caetano-Anollés G, Yafremava LS, Gee H, Caetano-Anollés D, Kim HS, Mittenthal JE. The origin and evolution of modern metabolism. Int J Biochem Cell Biol 2008; 41:285-97. [PMID: 18790074 DOI: 10.1016/j.biocel.2008.08.022] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.7] [Received: 06/02/2008] [Revised: 08/09/2008] [Accepted: 08/11/2008] [Indexed: 10/21/2022]
Abstract
One fundamental goal of current research is to understand how complex biomolecular networks took the form that we observe today. Cellular metabolism is probably one of the most ancient biological networks and constitutes a good model system for the study of network evolution. While many evolutionary models have been proposed, a substantial body of work suggests metabolic pathways evolve fundamentally by recruitment, in which enzymes are drawn from close or distant regions of the network to perform novel chemistries or use different substrates. Here we review how structural and functional genomics has impacted our knowledge of evolution of modern metabolism and describe some approaches that merge evolutionary and structural genomics with advances in bioinformatics. These include mining the data on structure and function of enzymes for salient patterns of enzyme recruitment. Initial studies suggest modern metabolism originated in enzymes of nucleotide metabolism harboring the P-loop hydrolase fold, probably in pathways linked to the purine metabolic subnetwork. This gateway of recruitment gave rise to pathways related to the synthesis of nucleotides and cofactors for an ancient RNA world. Once the TIM beta/alpha-barrel fold architecture was discovered, it appears metabolic activities were recruited explosively giving rise to subnetworks related to carbohydrate and then amino acid metabolism. Remarkably, recruitment occurred in a layered system reminiscent of Morowitz's prebiotic shells, supporting the notion that modern metabolism represents a palimpsest of ancient metabolic chemistries.
|
71
|
Abstract
Since the introduction of the concepts of allostery about four decades ago, much progress has been made in elucidating the structure-function correlation in allostery. However, a number of issues remain unresolved. In this review we used mammalian pyruvate kinase (PK) as a model system to understand the role of protein dynamics in modulating cooperativity. PK has a triosephosphate isomerase (TIM) (alpha/beta)(8) barrel structural motif. PK is an ideal system for addressing basic questions about the regulatory mechanisms of this common (alpha/beta)(8) structural motif. The simplest model accounting for all of the solution thermodynamic and kinetic data on ligand-enzyme interactions involves two conformational states, inactive E(T) and active E(R). These conformational states are represented by domain movements. Further studies provide the first evidence for a differential effect of ligand binding on the dynamics of the structural elements, not major secondary structural changes. These data are consistent with our model that allosteric regulation of PK is the consequence of perturbation of the distribution of an ensemble of states in which the inactive E(T) and active E(R) represent the two extreme end states. Sequence differences and ligands can modulate the distribution of states, leading to alterations of function. Future work includes: defining the network of functionally connected residues; elucidating the chemical principles governing the sequence differences that affect function; and probing the effect of mutations on the stability of the secondary structural elements, which in turn modulate allostery.
Affiliation(s)
- J Ching Lee
- Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch at Galveston, Galveston, Texas 77555-1055, USA.
|
72
|
Abstract
β-Propellers are toroidal folds, in which repeated, four-stranded β-meanders are arranged in a circular and slightly tilted fashion, like the blades of a propeller. They are found in all domains of life, with a strong preponderance among eukaryotes. Propellers show considerable sequence diversity and are classified into six separate structural groups by the SCOP and CATH databases. Despite this diversity, they often show similarities across groups, not only in structure but also in sequence, raising the possibility of a common origin. In agreement with this hypothesis, most propellers group together in a cluster map of all-β folds generated by sequence similarity, because of numerous pairwise matches, many of which are individually nonsignificant. In total, 45 of 60 propellers in the SCOP25 database, covering four SCOP folds, are clustered in this group, and analysis with sensitive sequence comparison methods shows that they are similar at a level indicative of homology. Two mechanisms appear to contribute to the evolution of β-propellers: amplification from single blades and subsequent functional differentiation. The observation of propellers with nearly identical blades in genomic sequences shows that these mechanisms are still operating today.
Affiliation(s)
- Indronil Chaudhuri
- Department for Protein Evolution, Max Planck Institute for Developmental Biology, 72076 Tuebingen, Germany
|
73
|
Cheng H, Kim BH, Grishin NV. Discrimination between distant homologs and structural analogs: lessons from manually constructed, reliable data sets. J Mol Biol 2008; 377:1265-78. [PMID: 18313074 PMCID: PMC4494761 DOI: 10.1016/j.jmb.2007.12.076] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Received: 11/07/2007] [Accepted: 12/20/2007] [Indexed: 10/22/2022]
Abstract
A natural way to study protein sequence, structure, and function is to put them in the context of evolution. Homologs inherit similarities from their common ancestor, while analogs converge to similar structures due to a limited number of energetically favorable ways to pack secondary structural elements. Using novel strategies, we previously assembled two reliable databases of homologs and analogs. In this study, we compare these two data sets and develop a support vector machine (SVM)-based classifier to discriminate between homologs and analogs. The classifier uses a number of well-known similarity scores. We observe that although both structure scores and sequence scores contribute to SVM performance, profile sequence scores computed based on structural alignments are the best discriminators between remote homologs and structural analogs. We apply our classifier to a representative set from the expert-constructed database, Structural Classification of Proteins (SCOP). The SVM classifier recovers 76% of the remote homologs defined as domains in the same SCOP superfamily but from different families. More importantly, we also detect and discuss interesting homologous relationships between SCOP domains from different superfamilies, folds, and even classes.
Affiliation(s)
- Hua Cheng
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX 75390-9050, USA.
|
74
|
Abstract
MALISAM (manual alignments for structurally analogous motifs) represents the first database containing pairs of structural analogs and their alignments. To find reliable analogs, we developed an approach based on three ideas. First, an insertion together with a part of the evolutionary core of one domain family (a hybrid motif) is analogous to a similar motif contained within the core of another domain family. Second, a motif at an interface, formed by secondary structural elements (SSEs) contributed by two or more domains or subunits contacting along that interface, is analogous to a similar motif present in the core of a single domain. Third, an artificial protein obtained through selection from random peptides or in sequence design experiments not biased by sequences of a particular homologous family, is analogous to a structurally similar natural protein. Each analogous pair is superimposed and aligned manually, as well as by several commonly used programs. Applications of this database may range from protein evolution studies, e.g. development of remote homology inference tools and discriminators between homologs and analogs, to protein-folding research, since in the absence of evolutionary reasons, similarity between proteins is caused by structural and folding constraints. The database is publicly available at http://prodata.swmed.edu/malisam.
Affiliation(s)
- Hua Cheng
- Howard Hughes Medical Institute and Department of Biochemistry, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75390-9050, USA
- Bong-Hyun Kim
- Howard Hughes Medical Institute and Department of Biochemistry, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75390-9050, USA
- Nick V. Grishin
- Howard Hughes Medical Institute and Department of Biochemistry, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75390-9050, USA
- *To whom correspondence should be addressed. +214 645 5952, +214 645 5948
|
75
|
Caetano-Anollés G, Kim HS, Mittenthal JE. The origin of modern metabolic networks inferred from phylogenomic analysis of protein architecture. Proc Natl Acad Sci U S A 2007; 104:9358-63. [PMID: 17517598 PMCID: PMC1890499 DOI: 10.1073/pnas.0701214104] [Citation(s) in RCA: 118] [Impact Index Per Article: 6.6] [Indexed: 11/18/2022] Open
Abstract
Metabolism represents a complex collection of enzymatic reactions and transport processes that convert metabolites into molecules capable of supporting cellular life. Here we explore the origins and evolution of modern metabolism. Using phylogenomic information linked to the structure of metabolic enzymes, we sort out recruitment processes and discover that most enzymatic activities were associated with the nine most ancient and widely distributed protein fold architectures. An analysis of newly discovered functions showed enzymatic diversification occurred early, during the onset of the modern protein world. Most importantly, phylogenetic reconstruction exercises and other evidence suggest strongly that metabolism originated in enzymes with the P-loop hydrolase fold in nucleotide metabolism, probably in pathways linked to the purine metabolic subnetwork. Consequently, the first enzymatic takeover of an ancient biochemistry or prebiotic chemistry was related to the synthesis of nucleotides for the RNA world.
Affiliation(s)
- Gustavo Caetano-Anollés
- Departments of Crop Sciences and Cell and Developmental Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.
|
76
|
Madej T, Panchenko AR, Chen J, Bryant SH. Protein homologous cores and loops: important clues to evolutionary relationships between structurally similar proteins. BMC Struct Biol 2007; 7:23. [PMID: 17425794 PMCID: PMC1852803 DOI: 10.1186/1472-6807-7-23] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Received: 10/13/2006] [Accepted: 04/10/2007] [Indexed: 11/11/2022]
Abstract
Background: To discover remote evolutionary relationships and functional similarities between proteins, biologists rely on comparative sequence analysis, and when structures are available, on structural alignments and various measures of structural similarity. The measures/scores that have most commonly been used for this purpose include: alignment length, percent sequence identity, superposition RMSD and their different combinations. More recently, we have introduced the "Homologous core structure overlap score" (HCS) and the "Loop Hausdorff Measure" (LHM). Along with these we also consider the "gapped structural alignment score" (GSAS), which was introduced earlier by other researchers.
Results: We analyze the performance of these and other conventional measures at the task of ranking structure neighbors by homology, and we show that the HCS, LHM, and GSAS scores display considerably improved performance over the conventional measures of sequence or structural similarity.
Conclusion: The HCS, LHM, and GSAS scores are easily computable quantities that allow users of structure-neighbor databases to more easily identify interesting structural similarities between proteins.
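As a rough illustration of the kind of quantity a Hausdorff-based loop measure builds on, the sketch below computes the standard symmetric Hausdorff distance between two sets of 3D coordinates. This is the textbook definition only; the abstract does not spell out how the LHM itself is defined, so the function names and the toy coordinates here are hypothetical and should not be read as the authors' formula.

```python
import math

def directed_hausdorff(a, b):
    """Largest distance from any point in `a` to its nearest point in `b`."""
    return max(min(math.dist(p, q) for q in b) for p in a)

def hausdorff_distance(a, b):
    """Symmetric Hausdorff distance: the larger of the two directed distances."""
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))

# Hypothetical loop coordinates (e.g. C-alpha positions), for illustration only.
loop_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
loop_b = [(0.0, 1.0, 0.0), (1.0, 1.0, 0.0), (2.0, 4.0, 0.0)]

print(hausdorff_distance(loop_a, loop_b))
```

A single outlying point dominates the result, which is why a measure of this type is sensitive to loops that deviate strongly even when the rest of the structure superimposes well.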
Affiliation(s)
- Thomas Madej
- Computational Biology Branch, National Center for Biotechnology Information, Building 38A, National Institutes of Health, Bethesda, Maryland 20894, USA
- Anna R Panchenko
- Computational Biology Branch, National Center for Biotechnology Information, Building 38A, National Institutes of Health, Bethesda, Maryland 20894, USA
- Jie Chen
- Computational Biology Branch, National Center for Biotechnology Information, Building 38A, National Institutes of Health, Bethesda, Maryland 20894, USA
- Stephen H Bryant
- Computational Biology Branch, National Center for Biotechnology Information, Building 38A, National Institutes of Health, Bethesda, Maryland 20894, USA
|
77
|
Söding J, Remmert M, Biegert A. HHrep: de novo protein repeat detection and the origin of TIM barrels. Nucleic Acids Res 2006; 34:W137-42. [PMID: 16844977 PMCID: PMC1538828 DOI: 10.1093/nar/gkl130] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.1] [Indexed: 11/14/2022] Open
Abstract
HHrep is a web server for the de novo identification of repeats in protein sequences, which is based on the pairwise comparison of profile hidden Markov models (HMMs). Its main strength is its sensitivity, allowing it to detect highly divergent repeat units in protein sequences whose repeats could as yet only be detected from their structures. Examples include sequences with β-propeller fold, ferredoxin-like fold, double psi barrels or (βα)8 (TIM) barrels. We illustrate this with proteins from four superfamilies of TIM barrels by revealing a clear 4- and 8-fold symmetry, which we detect solely from their sequences. This symmetry might be the trace of an ancient origin through duplication of a βαβα or βα unit. HHrep can be accessed at .
Affiliation(s)
- Johannes Söding
- Department of Protein Evolution, Max-Planck-Institute for Developmental Biology, Spemannstrasse 35, 72076 Tübingen, Germany.
|
78
|
Kim HS, Mittenthal JE, Caetano-Anollés G. MANET: tracing evolution of protein architecture in metabolic networks. BMC Bioinformatics 2006; 7:351. [PMID: 16854231 PMCID: PMC1559654 DOI: 10.1186/1471-2105-7-351] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Received: 04/13/2006] [Accepted: 07/19/2006] [Indexed: 11/13/2022] Open
Abstract
Background: Cellular metabolism can be characterized by networks of enzymatic reactions and transport processes capable of supporting cellular life. Our aim is to find evolutionary patterns and processes embedded in the architecture and function of modern metabolism, using information derived from structural genomics.
Description: The Molecular Ancestry Network (MANET) project traces evolution of protein architecture in biomolecular networks. We describe metabolic MANET, a database that links information in the Structural Classification of Proteins (SCOP), the Kyoto Encyclopedia of Genes and Genomes (KEGG), and phylogenetic reconstructions depicting the evolution of protein fold architecture. Metabolic MANET literally 'paints' the ancestries of enzymes derived from rooted phylogenomic trees directly onto over one hundred metabolic subnetworks, enabling the study of evolutionary patterns at global and local levels. An initial analysis of painted subnetworks reveals widespread enzymatic recruitment and an early origin of amino acid metabolism.
Conclusion: MANET maps evolutionary relationships directly and globally onto biological networks, and can generate and test hypotheses related to evolution of metabolism. We anticipate its use in the study of other networks, such as signaling and other protein-protein interaction networks.
Affiliation(s)
- Hee Shin Kim
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Jay E Mittenthal
- Department of Cell and Developmental Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Gustavo Caetano-Anollés
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
|
79
|
Panchenko AR, Wolf YI, Panchenko LA, Madej T. Evolutionary plasticity of protein families: coupling between sequence and structure variation. Proteins 2006; 61:535-44. [PMID: 16184609 PMCID: PMC1941674 DOI: 10.1002/prot.20644] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Indexed: 11/07/2022]
Abstract
In this work we examine how protein structural changes are coupled with sequence variation in the course of evolution of a family of homologs. The sequence-structure correlation analysis performed on 81 homologous protein families shows that the majority of them exhibit statistically significant linear correlation between the measures of sequence and structural similarity. We observed, however, that there are cases where structural variability cannot be mainly explained by sequence variation, such as protein families with a number of disulfide bonds. To understand whether structures from different families and/or folds evolve in the same manner, we compared the degrees of structural change per unit of sequence change ("the evolutionary plasticity of structure") between those families with a significant linear correlation. Using rigorous statistical procedures we find that, with a few exceptions, evolutionary plasticity does not show a statistically significant difference between protein families. Similar sequence-structure analysis performed for protein loop regions shows that evolutionary plasticity of loop regions is greater than for the protein core.
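The idea of "structural change per unit of sequence change" can be sketched as the slope of a least-squares line fitted to paired sequence and structure distances within one family. The example below is a minimal illustration under that assumption; the abstract does not state which similarity measures or statistical procedures were used, and the function name `fit_line` and the numbers are hypothetical.

```python
import math

def fit_line(x, y):
    """Ordinary least-squares fit y = slope * x + intercept, plus Pearson r."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, r

# Hypothetical pairwise distances within one protein family.
sequence_distance = [0.1, 0.2, 0.3, 0.4]    # e.g. fraction of differing residues
structure_distance = [0.5, 0.9, 1.3, 1.7]   # e.g. RMSD in angstroms

slope, intercept, r = fit_line(sequence_distance, structure_distance)
print(f"plasticity (slope) = {slope:.2f}, intercept = {intercept:.2f}, r = {r:.2f}")
```

Under this reading, comparing families amounts to comparing fitted slopes, restricted to families where the correlation is significant.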
Affiliation(s)
- Anna R Panchenko
- Computational Biology Branch, National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland 20894, USA.
|
80
|
Stengl B, Reuter K, Klebe G. Mechanism and substrate specificity of tRNA-guanine transglycosylases (TGTs): tRNA-modifying enzymes from the three different kingdoms of life share a common catalytic mechanism. Chembiochem 2006; 6:1926-39. [PMID: 16206323 DOI: 10.1002/cbic.200500063] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.2] [Indexed: 11/09/2022]
Abstract
Transfer RNA-guanine transglycosylases (TGTs) are evolutionarily ancient enzymes, present in all kingdoms of life, that catalyze the exchange of guanine in their cognate tRNAs for modified 7-deazaguanine bases. Although distinct bases are incorporated into tRNA at different positions in a kingdom-specific manner, the catalytic subunits of TGTs are structurally well conserved. This review provides insight into the sequential steps along the reaction pathway, substrate specificity, and conformational adaptations of the binding pockets by comparison of TGT crystal structures in complex with RNA substrates of a eubacterial and an archaebacterial species. Substrate-binding modes indicate an evolutionarily conserved base-exchange mechanism with a conserved aspartate serving as a nucleophile through covalent binding to C1' of the guanosine ribose moiety in an intermediate state. A second conserved aspartate seems to control the spatial rearrangement of the ribose ring along the reaction pathway and supposedly operates as a general acid/base. Water molecules inside the binding pocket accommodating interaction sites subsequently occupied by polar atoms of substrates help to elucidate substrate-recognition and substrate-specificity features. This emphasizes the role of water molecules as general probes to map binding-site properties for structure-based drug design. Additionally, substrate-bound crystal structures allow the extraction of valuable information about the classification of the TGT superfamily into a subdivision of presumably homologous superfamilies adopting the triose-phosphate isomerase type barrel fold with a standard phosphate-binding motif.
Affiliation(s)
- Bernhard Stengl
- Institut für Pharmazeutische Chemie, Philipps-Universität Marburg, Marbacher Weg 6, 35032 Marburg, Germany
|
81
|
Gold ND, Jackson RM. Fold Independent Structural Comparisons of Protein–Ligand Binding Sites for Exploring Functional Relationships. J Mol Biol 2006; 355:1112-24. [PMID: 16359705 DOI: 10.1016/j.jmb.2005.11.044] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.2] [Received: 05/20/2005] [Revised: 11/11/2005] [Accepted: 11/15/2005] [Indexed: 11/23/2022]
Abstract
The rapid growth in protein structural data and the emergence of structural genomics projects have increased the need for automatic structure analysis and tools for function prediction. Small molecule recognition is critical to the function of many proteins; therefore, determination of ligand binding site similarity is important for understanding ligand interactions and may allow their functional classification. Here, we present a binding sites database (SitesBase) that, given a known protein-ligand binding site, allows rapid retrieval of other binding sites with similar structure independent of overall sequence or fold similarity. However, each match is also annotated with sequence similarity and fold information to aid interpretation of structure and functional similarity. Similarity in ligand binding sites can indicate common binding modes and recognition of similar molecules, allowing potential inference of function for an uncharacterised protein or providing additional evidence of common function where sequence or fold similarity is already known. Alternatively, the resource can provide valuable information for detailed studies of molecular recognition, including structure-based ligand design, and for understanding ligand cross-reactivity. Here, we show examples of atomic similarity between superfamily or more distant fold relatives as well as between seemingly unrelated proteins. Assignment of unclassified proteins to structural superfamilies is also undertaken and in most cases substantiates assignments made using sequence similarity. Correct assignment is also possible where sequence similarity fails to find significant matches, illustrating the potential use of binding site comparisons for newly determined proteins.
Affiliation(s)
- Nicola D Gold
- Institute of Molecular and Cellular Biology, University of Leeds, Leeds LS2 9JT, UK
|
82
|
Vesterstrøm J, Taylor WR. Flexible secondary structure based protein structure comparison applied to the detection of circular permutation. J Comput Biol 2006; 13:43-63. [PMID: 16472021 DOI: 10.1089/cmb.2006.13.43] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Indexed: 11/13/2022] Open
Abstract
We present a novel method for structural comparison of protein structures. The approach consists of two main phases: 1) an initial search phase where, starting from aligned pairs of secondary structure elements, the space of 3D transformations is searched for similarities and 2) a subsequent refinement phase where interim solutions are subjected to parallel, local, iterative dynamic programming in the areas of possible improvement. The proposed method combines dynamic programming for finding alignments but does not restrict solutions to be sequential. In addition, to deal with the problem of nonuniqueness of optimal similarities, we introduce a consensus scoring method in selecting the preferred similarity and provide a list of top-ranked solutions. The method, called FASE (flexible alignment of secondary structure elements), was tested on well-known data and various standard problems from the literature. The results show that FASE is able to find remote and weak similarities consistently using a reasonable run time. The method was tested (using the SCOP database) on its ability to discriminate interfold pairs from intrafold pairs at the level of the best existing methods. The method was then applied to the problem of finding circular permutations in proteins.
Affiliation(s)
- Jakob Vesterstrøm
- BiRC-Bioinformatics Research Center, University of Aarhus, DK-8000 Aarhus C, Denmark
|
83
|
Sandhya S, Chakrabarti S, Abhinandan KR, Sowdhamini R, Srinivasan N. Assessment of a rigorous transitive profile based search method to detect remotely similar proteins. J Biomol Struct Dyn 2005; 23:283-98. [PMID: 16218755 DOI: 10.1080/07391102.2005.10507066] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Indexed: 10/28/2022]
Abstract
Profile-based sequence search procedures are commonly employed to detect remote relationships between proteins. We provide an assessment of a Cascade PSI-BLAST protocol that rigorously employs intermediate sequences in detecting remote relationships between proteins. In this approach we detect using PSI-BLAST, which involves multiple rounds of iteration, an initial set of homologues for a protein in a 'first generation' search by querying a database. We propagate a 'second generation' search in the database, involving multiple runs of PSI-BLAST using each of the homologues identified in the previous generation as queries to recognize homologues not detected earlier. This non-directed search process can be viewed as an iteration of iterations that is continued to detect further homologues until no new hits are detectable. We present an assessment of the coverage of this 'cascaded' intermediate sequence search on diverse folds and find that searches for up to three generations detect most known homologues of a query. Our assessments show that this approach appears to perform better than the traditional use of PSI-BLAST by detecting 15% more relationships within a family and 35% more relationships within a superfamily. We show that such searches can be performed on generalized sequence databases and non-trivial relationships between proteins can be detected effectively. Such a propagation of searches maximizes the chances of detecting distant homologies by effectively scanning protein "fold space".
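The "iteration of iterations" described in this abstract can be read as a breadth-first expansion over search hits. The following is only a minimal sketch of that idea, not the authors' protocol: `cascade_search`, `find_homologues`, and the toy hit table are hypothetical names introduced for illustration, and the callable stands in for a full multi-round PSI-BLAST run on one query.

```python
from typing import Callable, Dict, Iterable, List, Set


def cascade_search(
    query: str,
    find_homologues: Callable[[str], Iterable[str]],
    max_generations: int = 3,
) -> Set[str]:
    """Transitive search: hits of one generation become queries of the next.

    `find_homologues` is a hypothetical stand-in for one complete profile
    search (for example, a multi-round PSI-BLAST run) on a single query.
    """
    found: Set[str] = {query}
    frontier: List[str] = [query]
    for _ in range(max_generations):
        next_frontier: List[str] = []
        for current in frontier:
            for hit in find_homologues(current):
                if hit not in found:  # only previously undetected hits propagate
                    found.add(hit)
                    next_frontier.append(hit)
        if not next_frontier:  # stop when a generation yields nothing new
            break
        frontier = next_frontier
    found.discard(query)
    return found


# Toy hit table (invented data) in place of a real sequence database.
TOY_HITS: Dict[str, List[str]] = {
    "A": ["B"],
    "B": ["A", "C"],
    "C": ["D"],
    "D": ["E"],
    "E": [],
}


def toy_search(seq_id: str) -> List[str]:
    return TOY_HITS.get(seq_id, [])


three_generations = cascade_search("A", toy_search, max_generations=3)
```

In this toy table, protein D is reachable from A only through the intermediates B and C, which is the kind of relationship an intermediate-sequence search is meant to recover.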
Affiliation(s)
- S Sandhya
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560 012, India
|
84
|
Saab-Rincón G, Mancera E, Montero-Morán G, Sánchez F, Soberón X. Generation of variability by in vivo recombination of halves of a (beta/alpha)8 barrel protein. Biomol Eng 2005; 22:113-20. [PMID: 16125117 DOI: 10.1016/j.bioeng.2005.01.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 10/26/2004] [Revised: 12/25/2004] [Accepted: 01/18/2005] [Indexed: 11/26/2022]
Abstract
Similar to what has been achieved with nucleic acids, directed evolution of proteins would be greatly facilitated by the availability of large libraries and efficient selection methods. So far, host cell transformation efficiency has been a bottleneck, practically limiting libraries to sizes less than 10^9. One way to circumvent this problem has been implemented with antibody systems, where contribution to the binding site is provided by two different polypeptides (light and heavy chains). The central concept is the construction of binary systems in which the genes for the two chains are separated by a cre-lox recombinase recognition site, packaged in a phage, and subsequently introduced, by multiple infection, into a recombinase-expressing cell [Sblattero D, Bradbury A. Nat Biotechnol 2000;18(1):75-80]. Here, we describe the development of a system which applies the same concept to a single-domain enzyme, the cytoplasmic (beta/alpha)8 barrel protein phosphoribosyl anthranilate isomerase (PRAI) from E. coli. For that purpose, we identified the site at which a loop containing the recognition sequence for cre-lox recombinase could be inserted, yielding a functional enzyme. We evaluated the effect of this insertion on the capability of the engineered gene to complement a trpF- E. coli strain and the efficiency of the system to recover the original sequence from an abundance of non-functional mutant genes.
Affiliation(s)
- Gloria Saab-Rincón
- Instituto de Biotecnología, UNAM, Apartado Postal 510-3, Cuernavaca, Morelos 62271, México
|
85
|
Sterner R, Höcker B. Catalytic Versatility, Stability, and Evolution of the (βα)8-Barrel Enzyme Fold. Chem Rev 2005; 105:4038-55. [PMID: 16277370 DOI: 10.1021/cr030191z] [Citation(s) in RCA: 166] [Impact Index Per Article: 8.3] [Indexed: 11/28/2022]
Affiliation(s)
- Reinhard Sterner
- Institut für Biophysik und physikalische Biochemie, Universität Regensburg, Universitätsstrasse 31, D-93053 Regensburg, Germany.
|
86
|
Analysis of protein homology by assessing the (dis)similarity in protein loop regions. Proteins 2005; 57:539-47. [PMID: 15382231 DOI: 10.1002/prot.20237] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.2] [Indexed: 11/07/2022]
Abstract
Two proteins are considered to have a similar fold if sufficiently many of their secondary structure elements are positioned similarly in space and are connected in the same order. Such a common structural scaffold may arise due to either divergent or convergent evolution. The intervening unaligned regions ("loops") between the superimposable helices and strands can exhibit a wide range of similarity and may offer clues to the structural evolution of folds. One might argue that more closely related proteins differ less in their nonconserved loop regions than distantly related proteins and, at the same time, the degree of variability in the loop regions in structurally similar but unrelated proteins is higher than in homologs. Here we introduce a new measure for structural (dis)similarity in loop regions that is based on the concept of the Hausdorff metric. This measure is used to gauge protein relatedness and is tested on a benchmark of homologous and analogous protein structures. It has been shown that the new measure can distinguish homologous from analogous proteins with the same or higher accuracy than the conventional measures that are based on comparing proteins in structurally aligned regions. We argue that this result can be attributed to the higher sensitivity of the Hausdorff (dis)similarity measure in detecting particularly evident dissimilarities in structures and draw some conclusions about evolutionary relatedness of proteins in the most populated protein folds.
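For readers unfamiliar with the Hausdorff metric mentioned in this abstract, the classical symmetric Hausdorff distance between two point sets can be sketched as follows. This is an illustration of the textbook definition only; the abstract does not specify how the authors' loop (dis)similarity measure is derived from it, and the function names and coordinates below are assumptions made for the example.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float, float]


def directed_hausdorff(a: Sequence[Point], b: Sequence[Point]) -> float:
    """Largest distance from any point in `a` to its nearest point in `b`."""
    return max(min(math.dist(p, q) for q in b) for p in a)


def hausdorff(a: Sequence[Point], b: Sequence[Point]) -> float:
    """Symmetric Hausdorff distance: the larger of the two directed distances."""
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))


# Invented coordinates standing in for loop atoms after superposition.
loop_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
loop_b = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (4.0, 0.0, 0.0)]

distance = hausdorff(loop_a, loop_b)
```

Because the distance is a maximum over nearest-neighbour distances, a single outlying point (here the third point of `loop_b`) dominates the result, which matches the abstract's remark that the measure is sensitive to particularly evident dissimilarities.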
|
87
|
Kozbial PZ, Mushegian AR. Natural history of S-adenosylmethionine-binding proteins. BMC Struct Biol 2005; 5:19. [PMID: 16225687 PMCID: PMC1282579 DOI: 10.1186/1472-6807-5-19] [Citation(s) in RCA: 218] [Impact Index Per Article: 10.9] [Received: 07/21/2005] [Accepted: 10/14/2005] [Indexed: 11/10/2022]
Abstract
BACKGROUND: S-adenosylmethionine is a source of diverse chemical groups used in biosynthesis and modification of virtually every class of biomolecules. The most notable reaction requiring S-adenosylmethionine, transfer of methyl group, is performed by a large class of enzymes, S-adenosylmethionine-dependent methyltransferases, which have been the focus of considerable structure-function studies. Evolutionary trajectories of these enzymes, and especially of other classes of S-adenosylmethionine-binding proteins, nevertheless, remain poorly understood. We addressed this issue by computational comparison of sequences and structures of various S-adenosylmethionine-binding proteins.
RESULTS: Two widespread folds, Rossmann fold and TIM barrel, have been repeatedly used in evolution for diverse types of S-adenosylmethionine conversion. There were also cases of recruitment of other relatively common folds for S-adenosylmethionine binding. Several classes of proteins have unique unrelated folds, specialized for just one type of chemistry and unified by the theme of internal domain duplications. In several cases, functional divergence is evident, when evolutionarily related enzymes have changed the mode of binding and the type of chemical transformation of S-adenosylmethionine. There are also instances of functional convergence, when biochemically similar processes are performed by drastically different classes of S-adenosylmethionine-binding proteins. Comparison of remote sequence similarities and analysis of phyletic patterns suggests that the last universal common ancestor of cellular life had between 10 and 20 S-adenosylmethionine-binding proteins from at least 5 fold classes, providing for S-adenosylmethionine formation, polyamine biosynthesis, and methylation of several substrates, including nucleic acids and peptide chain release factor.
CONCLUSION: We have observed several novel relationships between families that were not known to be related before, and defined 15 large superfamilies of SAM-binding proteins, at least 5 of which may have been represented in the last common ancestor.
Affiliation(s)
- Piotr Z Kozbial
- Stowers Institute for Medical Research, 1000 E. 50th St., Kansas City, MO 64110, USA
- Arcady R Mushegian
- Stowers Institute for Medical Research, 1000 E. 50th St., Kansas City, MO 64110, USA
- Department of Microbiology, Molecular Genetics, and Immunology, University of Kansas Medical Center, Kansas City, Kansas 66160, USA
|
88
|
Livesay DR, La D. The evolutionary origins and catalytic importance of conserved electrostatic networks within TIM-barrel proteins. Protein Sci 2005; 14:1158-70. [PMID: 15840824 PMCID: PMC2253277 DOI: 10.1110/ps.041221105] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Indexed: 10/25/2022]
Abstract
Conservation of function is the basic tenet of protein evolution. Conservation of key electrostatic properties is a frequently employed mechanism that leads to conserved function. In a previous report, we identified several conserved electrostatic properties in four protein families and one functionally diverse enzyme superfamily. In this report, we demonstrate the evolutionary and catalytic importance of electrostatic networks in three ubiquitous metabolic enzymes: triosephosphate isomerase, enolase, and transaldolase. Evolutionary importance is demonstrated using phylogenetic motifs (sequence fragments that parallel the overall familial phylogeny). Phylogenetic motifs frequently correspond to both catalytic residues and conserved interactions that fine-tune catalytic residue pKa values. Further, in the case of triosephosphate isomerase, quantitative differences in the catalytic Glu169 pKa values parallel subfamily differentiation. Finally, phylogenetic motifs are shown to structurally cluster around the active sites of eight different TIM-barrel families. Depending upon the mechanistic requisites of each reaction catalyzed, interruptions to the canonical fold may or may not be identified as phylogenetic motifs.
Affiliation(s)
- Dennis R Livesay
- Department of Chemistry, California State Polytechnic University, Pomona, 3801 W. Temple Avenue, Pomona, CA 91768, USA.
|
89
|
Pfeiffer T, Soyer OS, Bonhoeffer S. The evolution of connectivity in metabolic networks. PLoS Biol 2005; 3:e228. [PMID: 16000019 PMCID: PMC1157096 DOI: 10.1371/journal.pbio.0030228] [Citation(s) in RCA: 98] [Impact Index Per Article: 4.9] [Received: 10/06/2004] [Accepted: 04/26/2005] [Indexed: 11/24/2022] Open
Abstract
Processes in living cells are the result of interactions between biochemical compounds in highly complex biochemical networks. It is a major challenge in biology to understand causes and consequences of the specific design of these networks. A characteristic design feature of metabolic networks is the presence of hub metabolites such as ATP or NADH that are involved in a high number of reactions. To study the emergence of hub metabolites, we implemented computer simulations of a widely accepted scenario for the evolution of metabolic networks. Our simulations indicate that metabolic networks with a large number of highly specialized enzymes may evolve from a few multifunctional enzymes. During this process, enzymes duplicate and specialize, leading to a loss of biochemical reactions and intermediary metabolites. Complex features of metabolic networks such as the presence of hubs may result from selection of growth rate if essential biochemical mechanisms are considered. Specifically, our simulations indicate that group transfer reactions are essential for the emergence of hubs. Computer simulations show how the complex organization of metabolic networks can arise from selection for a simple trait such as growth rate.
|
90
|
Höcker B. Directed evolution of (βα)8-barrel enzymes. Biomol Eng 2005; 22:31-8. [PMID: 15857781 DOI: 10.1016/j.bioeng.2004.09.005] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Received: 07/31/2004] [Revised: 09/23/2004] [Accepted: 09/24/2004] [Indexed: 10/25/2022]
Abstract
Natural molecular evolution supplies us with manifold examples of protein engineering. The imitation of these natural processes in the design of new enzymes has led to surprising and insightful results. Well-suited for design by evolutionary methods are enzymes with the common and versatile (βα)8-barrel fold. Studies of enzyme stability, folding and design as well as the evolution of (βα)8-barrel enzymes are discussed.
Affiliation(s)
- Birte Höcker
- Duke University Medical Center, Department of Biochemistry, Box 3711, Durham, NC 27710, USA.
|
91
|
Ronimus RS, Morgan HW. Distribution and phylogenies of enzymes of the Embden-Meyerhof-Parnas pathway from archaea and hyperthermophilic bacteria support a gluconeogenic origin of metabolism. Archaea 2005; 1:199-221. [PMID: 15803666 PMCID: PMC2685568 DOI: 10.1155/2003/162593] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.2] [Indexed: 11/17/2022]
Abstract
Enzymes of the gluconeogenic/glycolytic pathway (the Embden-Meyerhof-Parnas (EMP) pathway), the reductive tricarboxylic acid cycle, the reductive pentose phosphate cycle and the Entner-Doudoroff pathway are widely distributed and are often considered to be central to the origins of metabolism. In particular, several enzymes of the lower portion of the EMP pathway (the so-called trunk pathway), including triosephosphate isomerase (TPI; EC 5.3.1.1), glyceraldehyde-3-phosphate dehydrogenase (GAPDH; EC 1.2.1.12/13), phosphoglycerate kinase (PGK; EC 2.7.2.3) and enolase (EC 4.2.1.11), are extremely well conserved and universally distributed among the three domains of life. In this paper, the distribution of enzymes of gluconeogenesis/glycolysis in hyperthermophiles--microorganisms that many believe represent the least evolved organisms on the planet--is reviewed. In addition, the phylogenies of the trunk pathway enzymes (TPIs, GAPDHs, PGKs and enolases) are examined. The enzymes catalyzing each of the six-carbon transformations in the upper portion of the EMP pathway, with the possible exception of aldolase, are all derived from multiple gene sequence families. In contrast, single sequence families can account for the archaeal and hyperthermophilic bacterial enzyme activities of the lower portion of the EMP pathway. The universal distribution of the trunk pathway enzymes, in combination with their phylogenies, supports the notion that the EMP pathway evolved in the direction of gluconeogenesis, i.e., from the bottom up.
Affiliation(s)
- Ron S Ronimus
- Thermophile Research Unit, Department of Biological Sciences, University of Waikato, Private Bag 3105, Hamilton, New Zealand.
|
92
|
Caetano-Anollés G, Caetano-Anollés D. Universal Sharing Patterns in Proteomes and Evolution of Protein Fold Architecture and Life. J Mol Evol 2005; 60:484-98. [PMID: 15883883 DOI: 10.1007/s00239-004-0221-6] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.0] [Received: 06/16/2004] [Accepted: 10/11/2004] [Indexed: 11/30/2022]
Abstract
Protein evolution is imprinted in both the sequence and the structure of evolutionary building blocks known as protein domains. These domains share a common ancestry and can be unified into a comparatively small set of folding architectures, the protein folds. We have traced the distribution of protein folds between and within proteomes belonging to Eukarya, Archaea, and Bacteria along the branches of a universal phylogeny of protein architecture. This tree was reconstructed from global fold-usage statistics derived from a structural census of proteomes. We found that folds shared by the three organismal domains were placed almost exclusively at the base of the rooted tree and that there were marked heterogeneities in fold distribution and clear evolutionary patterns related to protein architecture and organismal diversification. These include a relative timing for the emergence of prokaryotes, congruent episodes of architectural loss and diversification in Archaea and Bacteria, and a late and quite massive rise of architectural novelties in Eukarya perhaps linked to multicellularity.
Affiliation(s)
- Gustavo Caetano-Anollés
- Department of Crop Sciences, University of Illinois, 332 NSRC, 1101 West Peabody Drive, Urbana, IL, 61801, USA.
|
93
|
Van Lanen SG, Reader JS, Swairjo MA, de Crécy-Lagard V, Lee B, Iwata-Reuyl D. From cyclohydrolase to oxidoreductase: discovery of nitrile reductase activity in a common fold. Proc Natl Acad Sci U S A 2005; 102:4264-9. [PMID: 15767583 PMCID: PMC555470 DOI: 10.1073/pnas.0408056102] [Citation(s) in RCA: 92] [Impact Index Per Article: 4.6] [Received: 10/29/2004] [Indexed: 11/18/2022] Open
Abstract
The enzyme YkvM from Bacillus subtilis was identified previously along with three other enzymes (YkvJKL) in a bioinformatics search for enzymes involved in the biosynthesis of queuosine, a 7-deazaguanine modified nucleoside found in tRNA(GUN) of Bacteria and Eukarya. Genetic analysis of ykvJKLM mutants in Acinetobacter confirmed that each was essential for queuosine biosynthesis, and the genes were renamed queCDEF. QueF exhibits significant homology to the type I GTP cyclohydrolases characterized by FolE. Given that GTP is the precursor to queuosine and that a cyclohydrolase-like reaction was postulated as the initial step in queuosine biosynthesis, QueF was proposed to be the putative cyclohydrolase-like enzyme responsible for this reaction. We have cloned the queF genes from B. subtilis and Escherichia coli and characterized the recombinant enzymes. Contrary to the predictions based on sequence analysis, we discovered that the enzymes, in fact, catalyze a mechanistically unrelated reaction, the NADPH-dependent reduction of 7-cyano-7-deazaguanine to 7-aminomethyl-7-deazaguanine, a late step in the biosynthesis of queuosine. We report here in vitro and in vivo studies that demonstrate this catalytic activity, as well as preliminary biochemical and bioinformatics analyses that provide insight into the structure of this family of enzymes.
Affiliation(s)
- Steven G Van Lanen
- Department of Chemistry, Portland State University, P.O. Box 751, Portland, OR 97207, USA
|
94
|
Marland E, Prachumwat A, Maltsev N, Gu Z, Li WH. Higher gene duplicabilities for metabolic proteins than for nonmetabolic proteins in yeast and E. coli. J Mol Evol 2005; 59:806-14. [PMID: 15599512 DOI: 10.1007/s00239-004-0068-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Received: 04/02/2004] [Accepted: 06/29/2004] [Indexed: 10/26/2022]
Abstract
Although the evolutionary significance of gene duplication has long been appreciated, it remains unclear what factors determine gene duplicability. In this study we investigated whether metabolism is an important determinant of gene duplicability because cellular metabolism is crucial for the survival and reproduction of an organism. Using genomic data and metabolic pathway data from the yeast (Saccharomyces cerevisiae) and Escherichia coli, we found that metabolic proteins indeed tend to have higher gene duplicability than nonmetabolic proteins. Moreover, a detailed analysis of metabolic pathways in these two organisms revealed that genes in the central metabolic pathways and the catabolic pathways have, on average, higher gene duplicability than do other genes and that most genes in anabolic pathways are single-copy genes.
Affiliation(s)
- Elizabeth Marland
- Mathematics & Computer Science Division, Argonne National Laboratory, 9700 South Cass Avenue, Argonne, IL 60439, USA
|
95
|
Huynen MA, Gabaldón T, Snel B. Variation and evolution of biomolecular systems: Searching for functional relevance. FEBS Lett 2005; 579:1839-45. [PMID: 15763561 DOI: 10.1016/j.febslet.2005.02.004] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Received: 01/18/2005] [Revised: 01/18/2005] [Accepted: 02/01/2005] [Indexed: 11/29/2022]
Abstract
The availability of genome sequences and functional genomics data from multiple species enables us to compare the composition of biomolecular systems like biochemical pathways and protein complexes between species. Here, we review small- and large-scale, "genomics-based" approaches to biomolecular systems variation. In general, caution is required when comparing the results of bioinformatics analyses of genomes or of functional genomics data between species. Limitations to the sensitivity of sequence analysis tools and the noisy nature of genomics data tend to lead to systematic overestimates of the amount of variation. Nevertheless, the results from detailed manual analyses, and of large-scale analyses that filter out systematic biases, point to a large amount of variation in the composition of biomolecular systems. Such observations challenge our understanding of the function of the systems and their individual components and can potentially facilitate the identification and functional characterization of sub-systems within a system. Mapping the inter-species variation of complex biomolecular systems on a phylogenetic species tree allows one to reconstruct their evolution.
Affiliation(s)
- Martijn A Huynen
- Center for Molecular and Biomolecular Informatics, Nijmegen Center for Molecular Life Sciences, Radboud University Nijmegen Medical Center, P.O. Box 9010, 6500 GL Nijmegen, The Netherlands.
|
96
|
Zientz E, Dandekar T, Gross R. Metabolic interdependence of obligate intracellular bacteria and their insect hosts. Microbiol Mol Biol Rev 2005; 68:745-70. [PMID: 15590782 PMCID: PMC539007 DOI: 10.1128/mmbr.68.4.745-770.2004] [Citation(s) in RCA: 215] [Impact Index Per Article: 10.8] [Indexed: 01/17/2023] Open
Abstract
Mutualistic associations of obligate intracellular bacteria and insects have attracted much interest in the past few years due to the evolutionary consequences for their genome structure. However, much less attention has been paid to the metabolic ramifications for these endosymbiotic microorganisms, which have to compete with but also to adapt to another metabolism--that of the host cell. This review attempts to provide insights into the complex physiological interactions and the evolution of metabolic pathways of several mutualistic bacteria of aphids, ants, and tsetse flies and their insect hosts.
Affiliation(s)
- Evelyn Zientz
- Lehrstuhl für Mikrobiologie, Biozentrum der Universität Würzburg, Theodor-Boveri-Institut, Am Hubland, D-97074 Würzburg, Germany
|
97
|
Stevens FJ. Efficient recognition of protein fold at low sequence identity by conservative application of Psi-BLAST: validation. J Mol Recognit 2005; 18:139-49. [PMID: 15558595 DOI: 10.1002/jmr.721] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Indexed: 11/07/2022]
Abstract
A substantial fraction of protein sequences derived from genomic analyses is currently classified as representing 'hypothetical proteins of unknown function'. In part, this reflects the limitations of methods for comparison of sequences with very low identity. We evaluated the effectiveness of a Psi-BLAST search strategy to identify proteins of similar fold at low sequence identity. Psi-BLAST searches for structurally characterized low-sequence-identity matches were carried out on a set of over 300 proteins of known structure. Searches were conducted in NCBI's non-redundant database and were limited to three rounds. Some 614 potential homologs with 25% or lower sequence identity to 166 members of the search set were obtained. Disregarding the expect value, level of sequence identity and span of alignment, correspondence of fold between the target and potential homolog was found in more than 95% of the Psi-BLAST matches. Restrictions on expect value or span of alignment improved the false positive rate at the expense of eliminating many true homologs. Approximately three-quarters of the putative homologs obtained by three rounds of Psi-BLAST revealed no significant sequence similarity to the target protein upon direct sequence comparison by BLAST, and therefore could not be found by a conventional search. Although three rounds of Psi-BLAST identified many more homologs than a standard BLAST search, most homologs were undetected. It appears that more than 80% of all homologs to a target protein may be characterized by a lack of significant sequence similarity. We suggest that conservative use of Psi-BLAST has the potential to propose experimentally testable functions for the majority of proteins currently annotated as 'hypothetical proteins of unknown function'.
Affiliation(s)
- F J Stevens
- Biosciences Division, Argonne National Laboratory, Argonne, IL 60439, USA.
|
98
|
Heine A, Luz JG, Wong CH, Wilson IA. Analysis of the class I aldolase binding site architecture based on the crystal structure of 2-deoxyribose-5-phosphate aldolase at 0.99 Å resolution. J Mol Biol 2004; 343:1019-34. [PMID: 15476818 DOI: 10.1016/j.jmb.2004.08.066] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.1] [Received: 06/14/2004] [Revised: 08/18/2004] [Accepted: 08/20/2004] [Indexed: 11/17/2022]
Abstract
The crystal structure of the bacterial (Escherichia coli) class I 2-deoxyribose-5-phosphate aldolase (DERA) has been determined by Se-Met multiple anomalous dispersion (MAD) methods at 0.99 Å resolution. This structure represents the highest-resolution X-ray structure of an aldolase determined to date and enables a true atomic view of the enzyme. The crystal structure shows the ubiquitous TIM alpha/beta barrel fold. The enzyme contains two lysine residues in the active site. Lys167 forms the Schiff base intermediate, whereas Lys201, which is in close vicinity to the reactive lysine residue, is responsible for the perturbed pKa of Lys167 and is, hence, also a key residue in the reaction mechanism. DERA is the only known aldolase that is able to use aldehydes as both aldol donor and acceptor molecules in the aldol reaction and is, therefore, of particular interest as a biocatalyst in synthetic organic chemistry. The uncomplexed DERA structure enables a detailed comparison with the substrate complexes and highlights a conformational change in the phosphate-binding site. Knowledge of the enzyme active-site environment has been the basis for exploration of catalysis of non-natural substrates and of mutagenesis of the phosphate-binding site to expand substrate specificity. Detailed comparison with other class I aldolase enzymes and DERA enzymes from different organisms reveals a similar geometric arrangement of key residues and implies a potential role for water as a general base in the catalytic mechanism.
Affiliation(s)
- Andreas Heine
- Department of Molecular Biology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
|
99
|
Abstract
MOTIVATION: In this paper, we shall examine the evolution of domain architectures across 62 genomes of known phylogeny including all kingdoms of life. We look in particular at the possibility of convergent evolution, with a view to determining the extent to which the architectures observed in the genomes are due to functional necessity or evolutionary descent. We used domains of known structure, because from this and other information we know their evolutionary relationships. We use a range of methods including phylogenetic grouping, sequence similarity/alignment, mutation rates and comparative genomics to approach this difficult problem from several angles.
RESULTS: Although we do not claim an exhaustive analysis, we conclude that between 0.4 and 4% of sequences are involved in convergent evolution of domain architectures, and expect the actual number to be close to the lower bound. We also made two incidental observations, albeit on a small sample: the events leading to convergent evolution appear to be random with no functional or structural preferences, and changes in the number of tandem repeat domains occur more readily than changes which alter the domain composition.
CONCLUSION: The principal conclusion is that the observed domain architectures of the sequences in the genomes are driven by evolutionary descent rather than functional necessity.
CONTACT: gough@supfam.org.
Affiliation(s)
- Julian Gough
- RIKEN Genomic Sciences Centre, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan.
|
100
|
Höcker B, Claren J, Sterner R. Mimicking enzyme evolution by generating new (betaalpha)8-barrels from (betaalpha)4-half-barrels. Proc Natl Acad Sci U S A 2004; 101:16448-53. [PMID: 15539462 PMCID: PMC534502 DOI: 10.1073/pnas.0405832101] [Citation(s) in RCA: 86] [Impact Index Per Article: 4.1] [Indexed: 11/18/2022] Open
Abstract
Gene duplication and fusion events that multiply and link functional protein domains are crucial mechanisms of enzyme evolution. The analysis of amino acid sequences and three-dimensional structures suggested that the (betaalpha)8-barrel, which is the most frequent fold among enzymes, has evolved by the duplication, fusion, and mixing of (betaalpha)4-half-barrel domains. Here, we mimicked this evolutionary strategy by generating in vitro (betaalpha)8-barrels from (betaalpha)4-half-barrels that were deduced from the enzymes imidazole glycerol phosphate synthase (HisF) and N'[(5'-phosphoribosyl)formimino]-5-aminoimidazole-4-carboxamide-ribonucleotide isomerase (HisA). To this end, the gene for the C-terminal (betaalpha)4-half-barrel (HisF-C) of HisF was duplicated and fused in tandem to yield HisF-CC, which is more stable than HisF-C. In the next step, by optimizing side-chain interactions within the center of the beta-barrel of HisF-CC, the monomeric and compact (betaalpha)8-barrel protein HisF-C*C was generated. Moreover, the genes for the N- and C-terminal (betaalpha)4-half-barrels of HisF and HisA were fused crosswise to yield the chimeric proteins HisFA and HisAF. Whereas HisFA contains native secondary structure elements but adopts ill-defined association states, the (betaalpha)8-barrel HisAF is a stable and compact monomer that reversibly unfolds with high cooperativity. The results obtained suggest a previously undescribed dimension for the diversification of enzymatic activities: new (betaalpha)8-barrels with novel functions might have evolved by the exchange of (betaalpha)4-half-barrel domains with distinct functional properties.
Affiliation(s)
- Birte Höcker
- Institut für Biochemie, Universität zu Köln, Otto-Fischer-Strasse 12-14, D-50674 Köln, Germany
|