1
|
Jácome R. Structural and Evolutionary Analysis of Proteins Endowed with a Nucleotidyltransferase, or Non-canonical Palm, Catalytic Domain. J Mol Evol 2024:10.1007/s00239-024-10207-7. [PMID: 39297932 DOI: 10.1007/s00239-024-10207-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Accepted: 09/09/2024] [Indexed: 09/21/2024]
Abstract
Many polymerases and other proteins are endowed with a catalytic domain belonging to the nucleotidyltransferase fold, which has also been deemed the non-canonical palm domain, in which three conserved acidic residues coordinate two divalent metal ions. Tertiary structure-based evolutionary analyses provide valuable information when the phylogenetic signal contained in the primary structure is blurry or has been lost, as is the case with these proteins. Pairwise structural comparisons of proteins with a nucleotidyltransferase fold were performed in the PDBefold web server: the RMSD, the number of superimposed residues, and the Qscore were obtained. The structural alignment score (RMSD × 100/number of superimposed residues) and the 1-Qscore were calculated, and distance matrices were constructed, from which a dendogram and a phylogenetic network were drawn for each score. The dendograms and the phylogenetic networks display well-defined clades, reflecting high levels of structural conservation within each clade, not mirrored by primary sequence. The conserved structural core between all these proteins consists of the catalytic nucleotidyltransferase fold, which is surrounded by different functional domains. Hence, many of the clades include proteins that bind different substrates or partake in non-related functions. Enzymes endowed with a nucleotidyltransferase fold are present in all domains of life, and participate in essential cellular and viral functions, which suggests that this domain is very ancient. Despite the loss of evolutionary traces in their primary structure, tertiary structure-based analyses allow us to delve into the evolution and functional diversification of the NT fold.
Collapse
Affiliation(s)
- Rodrigo Jácome
- Facultad de Ciencias, Universidad Nacional Autónoma de México, Mexico City, México.
| |
Collapse
|
2
|
Aberuagba A, Joel EB, Bello AJ, Igunnu A, Malomo SO, Olorunniji FJ. Thermophilic PHP Protein Tyrosine Phosphatases (Cap8C and Wzb) from Mesophilic Bacteria. Int J Mol Sci 2024; 25:1262. [PMID: 38279261 PMCID: PMC10816263 DOI: 10.3390/ijms25021262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2023] [Revised: 01/14/2024] [Accepted: 01/17/2024] [Indexed: 01/28/2024] Open
Abstract
Protein tyrosine phosphatases (PTPs) of the polymerase and histidinol phosphatase (PHP) superfamily with characteristic phosphatase activity dependent on divalent metal ions are found in many Gram-positive bacteria. Although members of this family are co-purified with metal ions, they still require the exogenous supply of metal ions for full activation. However, the specific roles these metal ions play during catalysis are yet to be well understood. Here, we report the metal ion requirement for phosphatase activities of S. aureus Cap8C and L. rhamnosus Wzb. AlphaFold-predicted structures of the two PTPs suggest that they are members of the PHP family. Like other PHP phosphatases, the two enzymes have a catalytic preference for Mn2+, Co2+ and Ni2+ ions. Cap8C and Wzb show an unusual thermophilic property with optimum activities over 75 °C. Consistent with this model, the activity-temperature profiles of the two enzymes are dependent on the divalent metal ion activating the enzyme.
Collapse
Affiliation(s)
- Adepeju Aberuagba
- School of Pharmacy & Biomolecular Sciences, Liverpool John Moores University, Byrom Street, Liverpool L3 3AF, UK; (A.A.); (E.B.J.); (A.J.B.)
| | - Enoch B. Joel
- School of Pharmacy & Biomolecular Sciences, Liverpool John Moores University, Byrom Street, Liverpool L3 3AF, UK; (A.A.); (E.B.J.); (A.J.B.)
- Department of Biochemistry, Faculty of Basic Medical Sciences, University of Jos, Jos 930003, Nigeria
| | - Adebayo J. Bello
- School of Pharmacy & Biomolecular Sciences, Liverpool John Moores University, Byrom Street, Liverpool L3 3AF, UK; (A.A.); (E.B.J.); (A.J.B.)
| | - Adedoyin Igunnu
- Department of Biochemistry, Faculty of Life Sciences, University of Ilorin, Ilorin 234031, Nigeria; (A.I.); (S.O.M.)
| | - Sylvia O. Malomo
- Department of Biochemistry, Faculty of Life Sciences, University of Ilorin, Ilorin 234031, Nigeria; (A.I.); (S.O.M.)
| | - Femi J. Olorunniji
- School of Pharmacy & Biomolecular Sciences, Liverpool John Moores University, Byrom Street, Liverpool L3 3AF, UK; (A.A.); (E.B.J.); (A.J.B.)
| |
Collapse
|
3
|
The PHP domain of PolX from Staphylococcus aureus aids high fidelity DNA synthesis through the removal of misincorporated deoxyribo-, ribo- and oxidized nucleotides. Sci Rep 2021; 11:4178. [PMID: 33603016 PMCID: PMC7893174 DOI: 10.1038/s41598-021-83498-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Accepted: 01/27/2021] [Indexed: 01/31/2023] Open
Abstract
The X family is one of the eight families of DNA polymerases (dPols) and members of this family are known to participate in the later stages of Base Excision Repair. Many prokaryotic members of this family possess a Polymerase and Histidinol Phosphatase (PHP) domain at their C-termini. The PHP domain has been shown to possess 3'-5' exonuclease activity and may represent the proofreading function in these dPols. PolX from Staphylococcus aureus also possesses the PHP domain at the C-terminus, and we show that this domain has an intrinsic Mn2+ dependent 3'-5' exonuclease capable of removing misincorporated dNMPs from the primer. The misincorporation of oxidized nucleotides such as 8oxodGTP and rNTPs are known to be pro-mutagenic and can lead to genomic instability. Here, we show that the PHP domain aids DNA replication by the removal of misincorporated oxidized nucleotides and rNMPs. Overall, our study shows that the proofreading activity of the PHP domain plays a critical role in maintaining genomic integrity and stability. The exonuclease activity of this enzyme can, therefore, be the target of therapeutic intervention to combat infection by methicillin-resistant-Staphylococcus-aureus.
Collapse
|
4
|
D'Alessandro B, Pérez Escanda V, Balestrazzi L, Grattarola F, Iriarte A, Pickard D, Yim L, Chabalgoity JA, Betancor L. Comparative genomics of Salmonella enterica serovar Enteritidis ST-11 isolated in Uruguay reveals lineages associated with particular epidemiological traits. Sci Rep 2020; 10:3638. [PMID: 32109937 PMCID: PMC7046640 DOI: 10.1038/s41598-020-60502-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2019] [Accepted: 02/07/2020] [Indexed: 11/09/2022] Open
Abstract
Salmonella enterica serovar Enteritidis is a major cause of foodborne disease in Uruguay since 1995. We used a genomic approach to study a set of isolates from different sources and years. Whole genome phylogeny showed that most of the strains are distributed in two major lineages (E1 and E2), both belonging to MLST sequence type 11 the major ST among serovar Enteritidis. Strikingly, E2 isolates are over-represented in periods of outbreak abundance in Uruguay, while E1 span all epidemic periods. Both lineages circulate in neighbor countries at the same timescale as in Uruguay, and are present in minor numbers in distant countries. We identified allelic variants associated with each lineage. Three genes, ycdX, pduD and hsdM, have distinctive variants in E1 that may result in defective products. Another four genes (ybiO, yiaN, aas, aceA) present variants specific for the E2 lineage. Overall this work shows that S. enterica serovar Enteritidis strains circulating in Uruguay have the same phylogenetic profile than strains circulating in the region, as well as in more distant countries. Based on these results we hypothesize that the E2 lineage, which is more prevalent during epidemics, exhibits a combination of allelic variants that could be associated with its epidemic ability.
Collapse
Affiliation(s)
- Bruno D'Alessandro
- Departamento de Desarrollo Biotecnológico, Instituto de Higiene, Facultad de Medicina, Universidad de la República, Av. Alfredo Navarro 3051, CP, 11600, Montevideo, Uruguay
| | - Victoria Pérez Escanda
- Departamento de Desarrollo Biotecnológico, Instituto de Higiene, Facultad de Medicina, Universidad de la República, Av. Alfredo Navarro 3051, CP, 11600, Montevideo, Uruguay
| | - Lucía Balestrazzi
- Departamento de Desarrollo Biotecnológico, Instituto de Higiene, Facultad de Medicina, Universidad de la República, Av. Alfredo Navarro 3051, CP, 11600, Montevideo, Uruguay
| | - Florencia Grattarola
- Departamento de Bacteriología y Virología, Instituto de Higiene, Facultad de Medicina, Universidad de la República, Av. Alfredo Navarro 3051, CP, 11600, Montevideo, Uruguay
| | - Andrés Iriarte
- Departamento de Desarrollo Biotecnológico, Instituto de Higiene, Facultad de Medicina, Universidad de la República, Av. Alfredo Navarro 3051, CP, 11600, Montevideo, Uruguay
| | - Derek Pickard
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | - Lucía Yim
- Departamento de Desarrollo Biotecnológico, Instituto de Higiene, Facultad de Medicina, Universidad de la República, Av. Alfredo Navarro 3051, CP, 11600, Montevideo, Uruguay
| | - José Alejandro Chabalgoity
- Departamento de Desarrollo Biotecnológico, Instituto de Higiene, Facultad de Medicina, Universidad de la República, Av. Alfredo Navarro 3051, CP, 11600, Montevideo, Uruguay
| | - Laura Betancor
- Departamento de Desarrollo Biotecnológico, Instituto de Higiene, Facultad de Medicina, Universidad de la República, Av. Alfredo Navarro 3051, CP, 11600, Montevideo, Uruguay. .,Departamento de Bacteriología y Virología, Instituto de Higiene, Facultad de Medicina, Universidad de la República, Av. Alfredo Navarro 3051, CP, 11600, Montevideo, Uruguay.
| |
Collapse
|
5
|
An array of basic residues is essential for the nucleolytic activity of the PHP domain of bacterial/archaeal PolX DNA polymerases. Sci Rep 2019; 9:9947. [PMID: 31289311 PMCID: PMC6616362 DOI: 10.1038/s41598-019-46349-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2019] [Accepted: 06/26/2019] [Indexed: 12/15/2022] Open
Abstract
Bacterial/archaeal family X DNA polymerases (PolXs) have a C-terminal PHP domain with an active site formed by nine histidines and aspartates that catalyzes 3′-5′ exonuclease, AP-endonuclease, 3′-phosphodiesterase and 3′-phosphatase activities. Multiple sequence alignments have allowed us to identify additional highly conserved residues along the PHP domain of bacterial/archaeal PolXs that form an electropositive path to the catalytic site and whose potential role in the nucleolytic activities had not been established. Here, site directed mutagenesis at the corresponding Bacillus subtilis PolX (PolXBs) residues, Arg469, Arg474, Asn498, Arg503 and Lys545, as well as to the highly conserved residue Phe440 gave rise to enzymes severely affected in all the nucleolytic activities of the enzyme while conserving a wild-type gap-filling activity, indicating a function of those residues in DNA binding at the PHP domain. Altogether, the results obtained with the mutant proteins, the spatial arrangement of those DNA binding residues, the intermolecular transference of the 3′-terminus between the PHP and polymerization active sites, and the available 3D structures of bacterial PolXs led us to propose the requirement to a great degree of a functional/structural flexibility to coordinate the synthetic and degradative activities in these enzymes.
Collapse
|
6
|
Ghodge SV, Raushel FM. Structure, Mechanism, and Substrate Profiles of the Trinuclear Metallophosphatases from the Amidohydrolase Superfamily. Methods Enzymol 2018; 607:187-216. [PMID: 30149858 DOI: 10.1016/bs.mie.2018.04.019] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The rate of reliable protein function annotation has not kept pace with the rapid advances in genome sequencing technology. This has created a gap between the number of available protein sequences, and an accurate determination of the respective physiological functions. This investigation has attempted to bridge the gap within the confines of members of the polymerase and histidinol phosphatase family of proteins in cog1387 and cog0613, which is related to the amidohydrolase superfamily. The adopted approach relies on using the mechanistic knowledge of a known enzymatic reaction, and discovering functions of closely related homologs using various tools including bioinformatics and rational library screening. The initial enzymatic reaction was that of L-histidinol phosphate phosphatase. Extensive structural, biochemical, and bioinformatic analysis of enzymes capable of hydrolyzing L-histidinol phosphate provided useful insights in predicting substrates and mechanistic studies of related enzymes. This led to the discovery of unprecedented catalytic functions such as a cyclic phosphate dihydrolase that specifically hydrolyzed a cyclic phosphodiester to inorganic phosphate and a vicinal diol; a phosphoesterase that hydrolyzes the 3'-phosphate of 3',5'-adenosine bisphosphate and similar nucleotides; and the first reported 5'-3' exonuclease for 5'-phosphorylated oligonucleotides from Escherichia coli and related organisms. This work provides a template for developing sequence-structure-function correlations within a family of enzymes that helps expedite new enzyme function discovery and more accurate annotations in protein databases.
Collapse
Affiliation(s)
- Swapnil V Ghodge
- Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Frank M Raushel
- Department of Chemistry, Texas A & M University, College Station, TX, United States.
| |
Collapse
|
7
|
High-fidelity DNA replication in Mycobacterium tuberculosis relies on a trinuclear zinc center. Nat Commun 2017; 8:855. [PMID: 29021523 PMCID: PMC5636811 DOI: 10.1038/s41467-017-00886-w] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2016] [Accepted: 08/02/2017] [Indexed: 01/07/2023] Open
Abstract
High-fidelity DNA replication depends on a proofreading 3′–5′ exonuclease that is associated with the replicative DNA polymerase. The replicative DNA polymerase DnaE1 from the major pathogen Mycobacterium tuberculosis (Mtb) uses its intrinsic PHP-exonuclease that is distinct from the canonical DEDD exonucleases found in the Escherichia coli and eukaryotic replisomes. The mechanism of the PHP-exonuclease is not known. Here, we present the crystal structure of the Mtb DnaE1 polymerase. The PHP-exonuclease has a trinuclear zinc center, coordinated by nine conserved residues. Cryo-EM analysis reveals the entry path of the primer strand in the PHP-exonuclease active site. Furthermore, the PHP-exonuclease shows a striking similarity to E. coli endonuclease IV, which provides clues regarding the mechanism of action. Altogether, this work provides important insights into the PHP-exonuclease and reveals unique properties that make it an attractive target for novel anti-mycobacterial drugs. The polymerase and histidinol phosphatase (PHP) domain in the DNA polymerase DnaE1 is essential for mycobacterial high-fidelity DNA replication. Here, the authors determine the DnaE1 crystal structure, which reveals the PHP-exonuclease mechanism that can be exploited for antibiotic development.
Collapse
|
8
|
An atypical phosphodiesterase capable of degrading haloalkyl phosphate diesters from Sphingobium sp. strain TCM1. Sci Rep 2017; 7:2842. [PMID: 28588250 PMCID: PMC5460133 DOI: 10.1038/s41598-017-03142-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2017] [Accepted: 04/20/2017] [Indexed: 11/17/2022] Open
Abstract
Sphingobium sp. strain TCM1 can degrade tris(2-chloroethyl) phosphate (TCEP) to inorganic phosphate and 2-chloroethanol. A phosphotriesterase (PTE), phosphodiesterase (PDE) and phosphomonoesterase (PME) are believed to be involved in the degradation of TCEP. The PTE and PME that respectively catalyze the first and third steps of TCEP degradation in TCM1 have been identified. However, no information has been reported on a PDE catalyzing the second step. In this study, we identified, purified, and characterized a PDE capable of hydrolyzing haloalkyl phosphate diesters. The final preparation of the enzyme had a specific activity of 29 µmol min−1 mg−1 with bis(p-nitrophenyl) phosphate (BpNPP) as the substrate. It also possessed low PME activity with p-nitrophenyl phosphate (pNPP) as substrate. The catalytic efficiency (kcat/Km) with BpNPP was significantly higher than that with pNPP, indicating that the enzyme prefers the organophosphorus diester to the monoester. The enzyme degraded bis(2,3-dibromopropyl) phosphate, bis(1,3-dichloro-2-propyl) phosphate and bis(2-chloroethyl) phosphate, suggesting that it is involved in the metabolism of haloalkyl organophosphorus triesters. The primary structure of the PDE from TCM1 is distinct from those of typical PDE family members and the enzyme belongs to the polymerase and histidinol phosphatase superfamily.
Collapse
|
9
|
Abstract
This chapter focuses on transition metals. All transition metal cations are toxic-those that are essential for Escherichia coli and belong to the first transition period of the periodic system of the element and also the "toxic-only" metals with higher atomic numbers. Common themes are visible in the metabolism of these ions. First, there is transport. High-rate but low-affinity uptake systems provide a variety of cations and anions to the cells. Control of the respective systems seems to be mainly through regulation of transport activity (flux control), with control of gene expression playing only a minor role. If these systems do not provide sufficient amounts of a needed ion to the cell, genes for ATP-hydrolyzing high-affinity but low-rate uptake systems are induced, e.g., ABC transport systems or P-type ATPases. On the other hand, if the amount of an ion is in surplus, genes for efflux systems are induced. By combining different kinds of uptake and efflux systems with regulation at the levels of gene expression and transport activity, the concentration of a single ion in the cytoplasm and the composition of the cellular ion "bouquet" can be rapidly adjusted and carefully controlled. The toxicity threshold of an ion is defined by its ability to produce radicals (copper, iron, chromate), to bind to sulfide and thiol groups (copper, zinc, all cations of the second and third transition period), or to interfere with the metabolism of other ions. Iron poses an exceptional metabolic problem due its metabolic importance and the low solubility of Fe(III) compounds, combined with the ability to cause dangerous Fenton reactions. This dilemma for the cells led to the evolution of sophisticated multi-channel iron uptake and storage pathways to prevent the occurrence of unbound iron in the cytoplasm. Toxic metals like Cd2+ bind to thiols and sulfide, preventing assembly of iron complexes and releasing the metal from iron-sulfur clusters. In the unique case of mercury, the cation can be reduced to the volatile metallic form. Interference of nickel and cobalt with iron is prevented by the low abundance of these metals in the cytoplasm and their sequestration by metal chaperones, in the case of nickel, or by B12 and its derivatives, in the case of cobalt. The most dangerous metal, copper, catalyzes Fenton-like reactions, binds to thiol groups, and interferes with iron metabolism. E. coli solves this problem probably by preventing copper uptake, combined with rapid efflux if the metal happens to enter the cytoplasm.
Collapse
|
10
|
Radical SAM enzyme QueE defines a new minimal core fold and metal-dependent mechanism. Nat Chem Biol 2014; 10:106-12. [PMID: 24362703 PMCID: PMC3939041 DOI: 10.1038/nchembio.1426] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2013] [Accepted: 11/21/2013] [Indexed: 01/07/2023]
Abstract
7-carboxy-7-deazaguanine synthase (QueE) catalyzes a key S-adenosyl-L-methionine (AdoMet)- and Mg(2+)-dependent radical-mediated ring contraction step, which is common to the biosynthetic pathways of all deazapurine-containing compounds. QueE is a member of the AdoMet radical superfamily, which employs the 5'-deoxyadenosyl radical from reductive cleavage of AdoMet to initiate chemistry. To provide a mechanistic rationale for this elaborate transformation, we present the crystal structure of a QueE along with structures of pre- and post-turnover states. We find that substrate binds perpendicular to the [4Fe-4S]-bound AdoMet, exposing its C6 hydrogen atom for abstraction and generating the binding site for Mg(2+), which coordinates directly to the substrate. The Burkholderia multivorans structure reported here varies from all other previously characterized members of the AdoMet radical superfamily in that it contains a hypermodified (β6/α3) protein core and an expanded cluster-binding motif, CX14CX2C.
Collapse
|
11
|
Cummings JA, Vetting M, Ghodge SV, Xu C, Hillerich B, Seidel RD, Almo SC, Raushel FM. Prospecting for unannotated enzymes: discovery of a 3',5'-nucleotide bisphosphate phosphatase within the amidohydrolase superfamily. Biochemistry 2014; 53:591-600. [PMID: 24401123 PMCID: PMC3985815 DOI: 10.1021/bi401640r] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
![]()
In
bacteria, 3′,5′-adenosine bisphosphate (pAp) is
generated from 3′-phosphoadenosine 5′-phosphosulfate
in the sulfate assimilation pathway, and from coenzyme A by the transfer
of the phosphopantetheine group to the acyl-carrier protein. pAp is
subsequently hydrolyzed to 5′-AMP and orthophosphate, and this
reaction has been shown to be important for superoxide stress tolerance.
Herein, we report the discovery of the first instance of an enzyme
from the amidohydrolase superfamily that is capable of hydrolyzing
pAp. Crystal structures of Cv1693 from Chromobacterium violaceum have been determined to a resolution of 1.9 Å with AMP and
orthophosphate bound in the active site. The enzyme has a trinuclear
metal center in the active site with three Mn2+ ions. This
enzyme (Cv1693) belongs to the Cluster of Orthologous Groups cog0613
from the polymerase and histidinol phosphatase family of enzymes.
The values of kcat and kcat/Km for the hydrolysis
of pAp are 22 s–1 and 1.4 × 106 M–1 s–1, respectively. The enzyme is
promiscuous and is able to hydrolyze other 3′,5′-bisphosphonucleotides
(pGp, pCp, pUp, and pIp) and 2′-deoxynucleotides with comparable
catalytic efficiency. The enzyme is capable of hydrolyzing short oligonucleotides
(pdA)5, albeit at rates much lower than that of pAp. Enzymes
from two other enzyme families have previously been found to hydrolyze
pAp at physiologically significant rates. These enzymes include CysQ
from Escherichia coli (cog1218) and YtqI/NrnA from Bacillus subtilis (cog0618). Identification of the functional
homologues to the experimentally verified pAp phosphatases from cog0613,
cog1218, and cog0618 suggests that there is relatively little overlap
of enzymes with this function in sequenced bacterial genomes.
Collapse
Affiliation(s)
- Jennifer A Cummings
- Department of Chemistry, Texas A&M University , College Station, Texas 77843, United States
| | | | | | | | | | | | | | | |
Collapse
|
12
|
Pluchon PF, Fouqueau T, Crezé C, Laurent S, Briffotaux J, Hogrel G, Palud A, Henneke G, Godfroy A, Hausner W, Thomm M, Nicolas J, Flament D. An extended network of genomic maintenance in the archaeon Pyrococcus abyssi highlights unexpected associations between eucaryotic homologs. PLoS One 2013; 8:e79707. [PMID: 24244547 PMCID: PMC3820547 DOI: 10.1371/journal.pone.0079707] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2013] [Accepted: 09/24/2013] [Indexed: 11/18/2022] Open
Abstract
In Archaea, the proteins involved in the genetic information processing pathways, including DNA replication, transcription, and translation, share strong similarities with those of eukaryotes. Characterizations of components of the eukaryotic-type replication machinery complex provided many interesting insights into DNA replication in both domains. In contrast, DNA repair processes of hyperthermophilic archaea are less well understood and very little is known about the intertwining between DNA synthesis, repair and recombination pathways. The development of genetic system in hyperthermophilic archaea is still at a modest stage hampering the use of complementary approaches of reverse genetics and biochemistry to elucidate the function of new candidate DNA repair gene. To gain insights into genomic maintenance processes in hyperthermophilic archaea, a protein-interaction network centred on informational processes of Pyrococcus abyssi was generated by affinity purification coupled with mass spectrometry. The network consists of 132 interactions linking 87 proteins. These interactions give insights into the connections of DNA replication with recombination and repair, leading to the discovery of new archaeal components and of associations between eucaryotic homologs. Although this approach did not allow us to clearly delineate new DNA pathways, it provided numerous clues towards the function of new molecular complexes with the potential to better understand genomic maintenance processes in hyperthermophilic archaea. Among others, we found new potential partners of the replication clamp and demonstrated that the single strand DNA binding protein, Replication Protein A, enhances the transcription rate, in vitro, of RNA polymerase. This interaction map provides a valuable tool to explore new aspects of genome integrity in Archaea and also potentially in Eucaryotes.
Collapse
Affiliation(s)
- Pierre-François Pluchon
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Thomas Fouqueau
- Lehrstuhl für Mikrobiologie, Universität Regensburg, Regensburg, Germany
| | - Christophe Crezé
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Sébastien Laurent
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Julien Briffotaux
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Gaëlle Hogrel
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Adeline Palud
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Ghislaine Henneke
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Anne Godfroy
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Winfried Hausner
- Lehrstuhl für Mikrobiologie, Universität Regensburg, Regensburg, Germany
| | - Michael Thomm
- Lehrstuhl für Mikrobiologie, Universität Regensburg, Regensburg, Germany
| | - Jacques Nicolas
- IRISA-INRIA, Campus de Beaulieu, Rennes, France
- * E-mail: (DF); (JN)
| | - Didier Flament
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- * E-mail: (DF); (JN)
| |
Collapse
|
13
|
Barros T, Guenther J, Kelch B, Anaya J, Prabhakar A, O'Donnell M, Kuriyan J, Lamers MH. A structural role for the PHP domain in E. coli DNA polymerase III. BMC STRUCTURAL BIOLOGY 2013; 13:8. [PMID: 23672456 PMCID: PMC3666897 DOI: 10.1186/1472-6807-13-8] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/05/2013] [Accepted: 05/07/2013] [Indexed: 12/05/2022]
Abstract
Background In addition to the core catalytic machinery, bacterial replicative DNA polymerases contain a Polymerase and Histidinol Phosphatase (PHP) domain whose function is not entirely understood. The PHP domains of some bacterial replicases are active metal-dependent nucleases that may play a role in proofreading. In E. coli DNA polymerase III, however, the PHP domain has lost several metal-coordinating residues and is likely to be catalytically inactive. Results Genomic searches show that the loss of metal-coordinating residues in polymerase PHP domains is likely to have coevolved with the presence of a separate proofreading exonuclease that works with the polymerase. Although the E. coli Pol III PHP domain has lost metal-coordinating residues, the structure of the domain has been conserved to a remarkable degree when compared to that of metal-binding PHP domains. This is demonstrated by our ability to restore metal binding with only three point mutations, as confirmed by the metal-bound crystal structure of this mutant determined at 2.9 Å resolution. We also show that Pol III, a large multi-domain protein, unfolds cooperatively and that mutations in the degenerate metal-binding site of the PHP domain decrease the overall stability of Pol III and reduce its activity. Conclusions While the presence of a PHP domain in replicative bacterial polymerases is strictly conserved, its ability to coordinate metals and to perform proofreading exonuclease activity is not, suggesting additional non-enzymatic roles for the domain. Our results show that the PHP domain is a major structural element in Pol III and its integrity modulates both the stability and activity of the polymerase.
Collapse
Affiliation(s)
- Tiago Barros
- Howard Hughes Medical Institute, Department of Molecular and Cell Biology, University of California, Berkeley, CA 94720, USA
| | | | | | | | | | | | | | | |
Collapse
|
14
|
Lee D, Lee J, Seok C. What stabilizes close arginine pairing in proteins? Phys Chem Chem Phys 2013; 15:5844-53. [PMID: 23486862 DOI: 10.1039/c3cp00160a] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Close stacking of arginine residues are often observed in protein structures despite the highly repulsive nature of the close like-charged groups. Physical factors stabilizing the close guanidinium ions of arginine side-chains have been previously studied in water and in protein-like environments, and the hydration free energy has been emphasized to be an important factor. However, how close arginine pairs are stabilized in real proteins has not been fully understood yet. In this paper, we show that arginine pairs are more frequently found in the protein interior than expected from the frequency of unpaired arginines buried inside protein through a statistical analysis of the protein structure database. We then confirm that 4 selected arginine pairs buried in the protein are indeed positively charged rather than neutralized, by molecular dynamics simulations and pKa estimation with molecular mechanics-Poisson-Boltzmann calculations. Further energy decomposition analysis shows that the hydration free energy may not be strong enough to overcome the repulsive Coulomb interaction between the positively charged arginine residues buried inside the protein. Instead, a highly polar interaction network is identified around each buried arginine pair, and the electrostatic interactions within such network are strong enough to stabilize the repulsive interaction of the buried arginine pair for the 4 selected cases. The polar interaction network is highly conserved evolutionarily in some proteins, implicating their roles in protein stabilization or biochemical function.
Collapse
Affiliation(s)
- Dongseon Lee
- Department of Chemistry, Seoul National University, Seoul 151-747, Republic of Korea
| | | | | |
Collapse
|
15
|
Standish AJ, Salim AA, Capon RJ, Morona R. Dual inhibition of DNA polymerase PolC and protein tyrosine phosphatase CpsB uncovers a novel antibiotic target. Biochem Biophys Res Commun 2012. [PMID: 23194664 DOI: 10.1016/j.bbrc.2012.11.049] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Increasing antibiotic resistance is making the identification of novel antimicrobial targets critical. Recently, we discovered an inhibitor of protein tyrosine phosphatase CpsB, fascioquinol E (FQE), which unexpectedly inhibited the growth of Gram-positive pathogens. CpsB is a member of the polymerase and histidinol phosphate phosphatase (PHP) domain family. Another member of this family found in a variety of Gram-positive pathogens is DNA polymerase PolC. We purified the PHP domain from PolC (PolC(PHP)), and showed that this competes away FQE inhibition of CpsB phosphatase activity. Furthermore, we showed that this domain hydrolyses the 5'-p-nitrophenyl ester of thymidine-5'-monophosphate (pNP-TMP), which has been used as a measure of exonuclease activity. Finally, we showed that FQE not only inhibits the phosphatase activity of CpsB, but also ability of PolC(PHP) to catalyse the hydrolysis of pNP-TMP. This suggests that PolC may be the essential target of FQE, and that the PHP domain may represent an as yet untapped target for the development of novel antibiotics.
Collapse
Affiliation(s)
- Alistair J Standish
- School of Molecular and Biomedical Science, University of Adelaide, Adelaide, SA 5005, Australia.
| | | | | | | |
Collapse
|
16
|
Giannozzi P, Jansen K, Penna GL, Minicozzi V, Morante S, Rossi G, Stellato F. Zn induced structural aggregation patterns of β-amyloid peptides by first-principle simulations and XAS measurements. Metallomics 2012; 4:156-65. [DOI: 10.1039/c2mt00148a] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
17
|
|
18
|
Abstract
Bacterial replicases are complex, tripartite replicative machines. They contain a polymerase, polymerase III (Pol III), a β₂ processivity factor, and a DnaX complex ATPase that loads β₂ onto DNA and chaperones Pol III onto the newly loaded β₂. Bacterial replicases are highly processive, yet cycle rapidly during Okazaki fragment synthesis in a regulated way. Many bacteria encode both a full-length τ and a shorter γ form of DnaX by a variety of mechanisms. γ appears to be uniquely placed in a single position relative to two τ protomers in a pentameric ring. The polymerase catalytic subunit of Pol III, α, contains a PHP domain that not only binds to a prototypical ε Mg²⁺-dependent exonuclease, but also contains a second Zn²⁺-dependent proofreading exonuclease, at least in some bacteria. This review focuses on a critical evaluation of recent literature and concepts pertaining to the above issues and suggests specific areas that require further investigation.
Collapse
Affiliation(s)
- Charles S McHenry
- Department of Chemistry and Biochemistry, University of Colorado, Boulder, Colorado 80309, USA.
| |
Collapse
|
19
|
YcdY protein of Escherichia coli, an atypical member of the TorD chaperone family. J Bacteriol 2011; 193:6512-6. [PMID: 21965574 DOI: 10.1128/jb.05927-11] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The TorD family of specific chaperones is divided into four subfamilies dedicated to molybdoenzyme biogenesis and a fifth one, exemplified by YcdY of Escherichia coli, for which no defined partner has been identified so far. We propose that YcdY is the chaperone of YcdX, a zinc protein involved in the swarming motility process of E. coli, since YcdY interacts with YcdX and increases its activity in vitro.
Collapse
|
20
|
Han GW, Ko J, Farr CL, Deller MC, Xu Q, Chiu HJ, Miller MD, Sefcikova J, Somarowthu S, Beuning PJ, Elsliger MA, Deacon AM, Godzik A, Lesley SA, Wilson IA, Ondrechen MJ. Crystal structure of a metal-dependent phosphoesterase (YP_910028.1) from Bifidobacterium adolescentis: Computational prediction and experimental validation of phosphoesterase activity. Proteins 2011; 79:2146-60. [PMID: 21538547 DOI: 10.1002/prot.23035] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2010] [Revised: 03/07/2011] [Accepted: 03/15/2011] [Indexed: 11/09/2022]
Abstract
The crystal structures of an unliganded and adenosine 5'-monophosphate (AMP) bound, metal-dependent phosphoesterase (YP_910028.1) from Bifidobacterium adolescentis are reported at 2.4 and 1.94 Å, respectively. Functional characterization of this enzyme was guided by computational analysis and then confirmed by experiment. The structure consists of a polymerase and histidinol phosphatase (PHP, Pfam: PF02811) domain with a second domain (residues 105-178) inserted in the middle of the PHP sequence. The insert domain functions in binding AMP, but the precise function and substrate specificity of this domain are unknown. Initial bioinformatics analyses yielded multiple potential functional leads, with most of them suggesting DNA polymerase or DNA replication activity. Phylogenetic analysis indicated a potential DNA polymerase function that was somewhat supported by global structural comparisons identifying the closest structural match to the alpha subunit of DNA polymerase III. However, several other functional predictions, including phosphoesterase, could not be excluded. Theoretical microscopic anomalous titration curve shapes, a computational method for the prediction of active sites from protein 3D structures, identified potential reactive residues in YP_910028.1. Further analysis of the predicted active site and local comparison with its closest structure matches strongly suggested phosphoesterase activity, which was confirmed experimentally. Primer extension assays on both normal and mismatched DNA show neither extension nor degradation and provide evidence that YP_910028.1 has neither DNA polymerase activity nor DNA-proofreading activity. These results suggest that many of the sequence neighbors previously annotated as having DNA polymerase activity may actually be misannotated.
Collapse
Affiliation(s)
- Gye Won Han
- Joint Center for Structural Genomics, Scripps Research Institute, La Jolla, California 92037, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
Shi W, Punta M, Bohon J, Sauder JM, D'Mello R, Sullivan M, Toomey J, Abel D, Lippi M, Passerini A, Frasconi P, Burley SK, Rost B, Chance MR. Characterization of metalloproteins by high-throughput X-ray absorption spectroscopy. Genome Res 2011; 21:898-907. [PMID: 21482623 DOI: 10.1101/gr.115097.110] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
High-throughput X-ray absorption spectroscopy was used to measure transition metal content based on quantitative detection of X-ray fluorescence signals for 3879 purified proteins from several hundred different protein families generated by the New York SGX Research Center for Structural Genomics. Approximately 9% of the proteins analyzed showed the presence of transition metal atoms (Zn, Cu, Ni, Co, Fe, or Mn) in stoichiometric amounts. The method is highly automated and highly reliable based on comparison of the results to crystal structure data derived from the same protein set. To leverage the experimental metalloprotein annotations, we used a sequence-based de novo prediction method, MetalDetector, to identify Cys and His residues that bind to transition metals for the redundancy reduced subset of 2411 sequences sharing <70% sequence identity and having at least one His or Cys. As the HT-XAS identifies metal type and protein binding, while the bioinformatics analysis identifies metal- binding residues, the results were combined to identify putative metal-binding sites in the proteins and their associated families. We explored the combination of this data with homology models to generate detailed structure models of metal-binding sites for representative proteins. Finally, we used extended X-ray absorption fine structure data from two of the purified Zn metalloproteins to validate predicted metalloprotein binding site structures. This combination of experimental and bioinformatics approaches provides comprehensive active site analysis on the genome scale for metalloproteins as a class, revealing new insights into metalloprotein structure and function.
Collapse
Affiliation(s)
- Wuxian Shi
- New York SGX Research Center for Structural Genomics (NYSGXRC), Case Western Reserve University, Center for Proteomics and Bioinformatics, Case Center for Synchrotron Biosciences, Upton, New York 11973, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Intrinsic apurinic/apyrimidinic (AP) endonuclease activity enables Bacillus subtilis DNA polymerase X to recognize, incise, and further repair abasic sites. Proc Natl Acad Sci U S A 2010; 107:19219-24. [PMID: 20974932 DOI: 10.1073/pnas.1013603107] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
The N-glycosidic bond can be hydrolyzed spontaneously or by glycosylases during removal of damaged bases by the base excision repair pathway, leading to the formation of highly mutagenic apurinic/apyrimidinic (AP) sites. Organisms encode for evolutionarily conserved repair machinery, including specific AP endonucleases that cleave the DNA backbone 5' to the AP site to prime further DNA repair synthesis. We report on the DNA polymerase X from the bacterium Bacillus subtilis (PolX(Bs)) that, along with polymerization and 3'-5'-exonuclease activities, possesses an intrinsic AP-endonuclease activity. Both, AP-endonuclease and 3'-5'-exonuclease activities are genetically linked and governed by the same metal ligands located at the C-terminal polymerase and histidinol phosphatase domain of the polymerase. The different catalytic functions of PolX(Bs) enable it to perform recognition and incision at an AP site and further restoration (repair) of the original nucleotide in a standalone AP-endonuclease-independent way.
Collapse
|
23
|
Bandyopadhyay D, Huan J, Prins J, Snoeyink J, Wang W, Tropsha A. Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: II. Case studies and applications. J Comput Aided Mol Des 2009; 23:785-97. [PMID: 19548090 DOI: 10.1007/s10822-009-9277-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2008] [Accepted: 04/22/2009] [Indexed: 11/25/2022]
Abstract
This paper describes several case studies concerning protein function inference from its structure using our novel approach described in the accompanying paper. This approach employs family-specific motifs, i.e. three-dimensional amino acid packing patterns that are statistically prevalent within a protein family. For our case studies we have selected families from the SCOP and EC classifications and analyzed the discriminating power of the motifs in depth. We have devised several benchmarks to compare motifs mined from unweighted topological graph representations of protein structures with those from distance-labeled (weighted) representations, demonstrating the superiority of the latter for function inference in most families. We have tested the robustness of our motif library by inferring the function of new members added to SCOP families, and discriminating between several families that are structurally similar but functionally divergent. Furthermore we have applied our method to predict function for several proteins characterized in structural genomics projects, including orphan structures, and we discuss several selected predictions in depth. Some of our predictions have been corroborated by other computational methods, and some have been validated by independent experimental studies, validating our approach for protein function inference from structure.
Collapse
|
24
|
Kalinina OV, Gelfand MS, Russell RB. Combining specificity determining and conserved residues improves functional site prediction. BMC Bioinformatics 2009; 10:174. [PMID: 19508719 PMCID: PMC2709924 DOI: 10.1186/1471-2105-10-174] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2009] [Accepted: 06/09/2009] [Indexed: 11/16/2022] Open
Abstract
Background Predicting the location of functionally important sites from protein sequence and/or structure is a long-standing problem in computational biology. Most current approaches make use of sequence conservation, assuming that amino acid residues conserved within a protein family are most likely to be functionally important. Most often these approaches do not consider many residues that act to define specific sub-functions within a family, or they make no distinction between residues important for function and those more relevant for maintaining structure (e.g. in the hydrophobic core). Many protein families bind and/or act on a variety of ligands, meaning that conserved residues often only bind a common ligand sub-structure or perform general catalytic activities. Results Here we present a novel method for functional site prediction based on identification of conserved positions, as well as those responsible for determining ligand specificity. We define Specificity-Determining Positions (SDPs), as those occupied by conserved residues within sub-groups of proteins in a family having a common specificity, but differ between groups, and are thus likely to account for specific recognition events. We benchmark the approach on enzyme families of known 3D structure with bound substrates, and find that in nearly all families residues predicted by SDPsite are in contact with the bound substrate, and that the addition of SDPs significantly improves functional site prediction accuracy. We apply SDPsite to various families of proteins containing known three-dimensional structures, but lacking clear functional annotations, and discusse several illustrative examples. Conclusion The results suggest a better means to predict functional details for the thousands of protein structures determined prior to a clear understanding of molecular function.
Collapse
|
25
|
Sadreyev RI, Kim BH, Grishin NV. Discrete-continuous duality of protein structure space. Curr Opin Struct Biol 2009; 19:321-8. [PMID: 19482467 PMCID: PMC3688466 DOI: 10.1016/j.sbi.2009.04.009] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2009] [Revised: 04/29/2009] [Accepted: 04/29/2009] [Indexed: 11/30/2022]
Abstract
Recently, the nature of protein structure space has been widely discussed in the literature. The traditional discrete view of protein universe as a set of separate folds has been criticized in the light of growing evidence that almost any arrangement of secondary structures is possible and the whole protein space can be traversed through a path of similar structures. Here we argue that the discrete and continuous descriptions are not mutually exclusive, but complementary: the space is largely discrete in evolutionary sense, but continuous geometrically when purely structural similarities are quantified. Evolutionary connections are mainly confined to separate structural prototypes corresponding to folds as islands of structural stability, with few remaining traceable links between the islands. However, for a geometric similarity measure, it is usually possible to find a reasonable cutoff that yields paths connecting any two structures through intermediates.
Collapse
Affiliation(s)
- Ruslan I. Sadreyev
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA
| | - Bong-Hyun Kim
- Department of Biochemistry, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA
| | - Nick V. Grishin
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA
- Department of Biochemistry, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA
| |
Collapse
|
26
|
Investigation of atomic level patterns in protein--small ligand interactions. PLoS One 2009; 4:e4473. [PMID: 19221587 PMCID: PMC2637420 DOI: 10.1371/journal.pone.0004473] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2008] [Accepted: 01/07/2009] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Shape complementarity and non-covalent interactions are believed to drive protein-ligand interaction. To date protein-protein, protein-DNA, and protein-RNA interactions were systematically investigated, which is in contrast to interactions with small ligands. We investigate the role of covalent and non-covalent bonds in protein-small ligand interactions using a comprehensive dataset of 2,320 complexes. METHODOLOGY AND PRINCIPAL FINDINGS We show that protein-ligand interactions are governed by different forces for different ligand types, i.e., protein-organic compound interactions are governed by hydrogen bonds, van der Waals contacts, and covalent bonds; protein-metal ion interactions are dominated by electrostatic force and coordination bonds; protein-anion interactions are established with electrostatic force, hydrogen bonds, and van der Waals contacts; and protein-inorganic cluster interactions are driven by coordination bonds. We extracted several frequently occurring atomic-level patterns concerning these interactions. For instance, 73% of investigated covalent bonds were summarized with just three patterns in which bonds are formed between thiol of Cys and carbon or sulfur atoms of ligands, and nitrogen of Lys and carbon of ligands. Similar patterns were found for the coordination bonds. Hydrogen bonds occur in 67% of protein-organic compound complexes and 66% of them are formed between NH- group of protein residues and oxygen atom of ligands. We quantify relative abundance of specific interaction types and discuss their characteristic features. The extracted protein-organic compound patterns are shown to complement and improve a geometric approach for prediction of binding sites. CONCLUSIONS AND SIGNIFICANCE We show that for a given type (group) of ligands and type of the interaction force, majority of protein-ligand interactions are repetitive and could be summarized with several simple atomic-level patterns. We summarize and analyze 10 frequently occurring interaction patterns that cover 56% of all considered complexes and we show a practical application for the patterns that concerns interactions with organic compounds.
Collapse
|
27
|
Nakane S, Nakagawa N, Kuramitsu S, Masui R. Characterization of DNA polymerase X from Thermus thermophilus HB8 reveals the POLXc and PHP domains are both required for 3'-5' exonuclease activity. Nucleic Acids Res 2009; 37:2037-52. [PMID: 19211662 PMCID: PMC2665239 DOI: 10.1093/nar/gkp064] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
The X-family DNA polymerases (PolXs) comprise a highly conserved DNA polymerase family found in all kingdoms. Mammalian PolXs are known to be involved in several DNA-processing pathways including repair, but the cellular functions of bacterial PolXs are less known. Many bacterial PolXs have a polymerase and histidinol phosphatase (PHP) domain at their C-termini in addition to a PolX core (POLXc) domain, and possess 3'-5' exonuclease activity. Although both domains are highly conserved in bacteria, their molecular functions, especially for a PHP domain, are unknown. We found Thermus thermophilus HB8 PolX (ttPolX) has Mg(2+)/Mn(2+)-dependent DNA/RNA polymerase, Mn(2+)-dependent 3'-5' exonuclease and DNA-binding activities. We identified the domains of ttPolX by limited proteolysis and characterized their biochemical activities. The POLXc domain was responsible for the polymerase and DNA-binding activities but exonuclease activity was not detected for either domain. However, the POLXc and PHP domains interacted with each other and a mixture of the two domains had Mn(2+)-dependent 3'-5' exonuclease activity. Moreover, site-directed mutagenesis revealed catalytically important residues in the PHP domain for the 3'-5' exonuclease activity. Our findings provide a molecular insight into the functional domain organization of bacterial PolXs, especially the requirement of the PHP domain for 3'-5' exonuclease activity.
Collapse
Affiliation(s)
- Shuhei Nakane
- Department of Biological Sciences, Graduate School of Science, Osaka University, Toyonaka, Osaka 560-0043, Japan
| | | | | | | |
Collapse
|
28
|
Baños B, Lázaro JM, Villar L, Salas M, de Vega M. Editing of misaligned 3'-termini by an intrinsic 3'-5' exonuclease activity residing in the PHP domain of a family X DNA polymerase. Nucleic Acids Res 2008; 36:5736-49. [PMID: 18776221 PMCID: PMC2566882 DOI: 10.1093/nar/gkn526] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Bacillus subtilis gene yshC encodes a family X DNA polymerase (PolXBs), whose biochemical features suggest that it plays a role during DNA repair processes. Here, we show that, in addition to the polymerization activity, PolXBs possesses an intrinsic 3′–5′ exonuclease activity specialized in resecting unannealed 3′-termini in a gapped DNA substrate. Biochemical analysis of a PolXBs deletion mutant lacking the C-terminal polymerase histidinol phosphatase (PHP) domain, present in most of the bacterial/archaeal PolXs, as well as of this separately expressed protein region, allow us to state that the 3′–5′ exonuclease activity of PolXBs resides in its PHP domain. Furthermore, site-directed mutagenesis of PolXBs His339 and His341 residues, evolutionary conserved in the PHP superfamily members, demonstrated that the predicted metal binding site is directly involved in catalysis of the exonucleolytic reaction. The implications of the unannealed 3′-termini resection by the 3′–5′ exonuclease activity of PolXBs in the DNA repair context are discussed.
Collapse
Affiliation(s)
- Benito Baños
- Instituto de Biología Molecular Eladio Viñuela (CSIC), Centro de Biología Molecular Severo Ochoa (CSIC-UAM), C/Nicolás Cabrera 1, Cantoblanco, 28049 Madrid, Spain
| | | | | | | | | |
Collapse
|
29
|
Schlicker A, Rahnenführer J, Albrecht M, Lengauer T, Domingues FS. GOTax: investigating biological processes and biochemical activities along the taxonomic tree. Genome Biol 2007; 8:R33. [PMID: 17346342 PMCID: PMC1868936 DOI: 10.1186/gb-2007-8-3-r33] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2006] [Revised: 01/18/2007] [Accepted: 03/08/2007] [Indexed: 11/10/2022] Open
Abstract
We describe GOTax, a comparative genomics platform that integrates protein annotation with protein family classification and taxonomy. User-defined sets of proteins, protein families, annotation terms or taxonomic groups can be selected and compared, allowing for the analysis of distribution of biological processes and molecular activities over different taxonomic groups. In particular, a measure of functional similarity is available for comparing proteins and protein families, establishing functional relationships independent of evolution.
Collapse
Affiliation(s)
- Andreas Schlicker
- Department of Computational Biology and Applied Algorithmics, Max-Planck-Institute for Informatics, Stuhlsatzenhausweg, 66123 Saarbrücken, Germany
| | - Jörg Rahnenführer
- Department of Computational Biology and Applied Algorithmics, Max-Planck-Institute for Informatics, Stuhlsatzenhausweg, 66123 Saarbrücken, Germany
| | - Mario Albrecht
- Department of Computational Biology and Applied Algorithmics, Max-Planck-Institute for Informatics, Stuhlsatzenhausweg, 66123 Saarbrücken, Germany
| | - Thomas Lengauer
- Department of Computational Biology and Applied Algorithmics, Max-Planck-Institute for Informatics, Stuhlsatzenhausweg, 66123 Saarbrücken, Germany
| | - Francisco S Domingues
- Department of Computational Biology and Applied Algorithmics, Max-Planck-Institute for Informatics, Stuhlsatzenhausweg, 66123 Saarbrücken, Germany
| |
Collapse
|
30
|
Bailey S, Wing RA, Steitz TA. The Structure of T. aquaticus DNA Polymerase III Is Distinct from Eukaryotic Replicative DNA Polymerases. Cell 2006; 126:893-904. [PMID: 16959569 DOI: 10.1016/j.cell.2006.07.027] [Citation(s) in RCA: 103] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2006] [Revised: 07/05/2006] [Accepted: 07/20/2006] [Indexed: 10/24/2022]
Abstract
The crystal structure of Thermus aquaticus DNA polymerase III alpha subunit reveals that the structure of the catalytic domain of the eubacterial replicative polymerase is unrelated to that of the eukaryotic replicative polymerase but rather belongs to the Polbeta-like nucleotidyltransferase superfamily. A model of the polymerase complexed with both DNA and beta-sliding clamp interacting with a reoriented binding domain and internal beta binding site was constructed that is consistent with existing biochemical data. Within the crystal, two C-terminal domains are interacting through a surface that is larger than many dimer interfaces. Since replicative polymerases of eubacteria and eukaryotes/archaea are not homologous, the nature of the replicative polymerase in the last common ancestor is unknown. Although other possibilities have been proposed, the plausibility of a ribozyme DNA polymerase should be considered.
Collapse
Affiliation(s)
- Scott Bailey
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | | | | |
Collapse
|
31
|
Lamers MH, Georgescu RE, Lee SG, O'Donnell M, Kuriyan J. Crystal Structure of the Catalytic α Subunit of E. coli Replicative DNA Polymerase III. Cell 2006; 126:881-92. [PMID: 16959568 DOI: 10.1016/j.cell.2006.07.028] [Citation(s) in RCA: 114] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2006] [Revised: 06/29/2006] [Accepted: 07/29/2006] [Indexed: 10/24/2022]
Abstract
Bacterial replicative DNA polymerases such as Polymerase III (Pol III) share no sequence similarity with other polymerases. The crystal structure, determined at 2.3 A resolution, of a large fragment of Pol III (residues 1-917), reveals a unique chain fold with localized similarity in the catalytic domain to DNA polymerase beta and related nucleotidyltransferases. The structure of Pol III is strikingly different from those of members of the canonical DNA polymerase families, which include eukaryotic replicative polymerases, suggesting that the DNA replication machinery in bacteria arose independently. A structural element near the active site in Pol III that is not present in nucleotidyltransferases but which resembles an element at the active sites of some canonical DNA polymerases suggests that, at a more distant level, all DNA polymerases may share a common ancestor. The structure also suggests a model for interaction of Pol III with the sliding clamp and DNA.
Collapse
Affiliation(s)
- Meindert H Lamers
- Howard Hughes Medical Institute, Department of Molecular and Cell Biology and Department of Chemistry, University of California, Berkeley, CA 94720, USA
| | | | | | | | | |
Collapse
|
32
|
Bandyopadhyay D, Huan J, Liu J, Prins J, Snoeyink J, Wang W, Tropsha A. Structure-based function inference using protein family-specific fingerprints. Protein Sci 2006; 15:1537-43. [PMID: 16731985 PMCID: PMC2265098 DOI: 10.1110/ps.062189906] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
We describe a method to assign a protein structure to a functional family using family-specific fingerprints. Fingerprints represent amino acid packing patterns that occur in most members of a family but are rare in the background, a nonredundant subset of PDB; their information is additional to sequence alignments, sequence patterns, structural superposition, and active-site templates. Fingerprints were derived for 120 families in SCOP using Frequent Subgraph Mining. For a new structure, all occurrences of these family-specific fingerprints may be found by a fast algorithm for subgraph isomorphism; the structure can then be assigned to a family with a confidence value derived from the number of fingerprints found and their distribution in background proteins. In validation experiments, we infer the function of new members added to SCOP families and we discriminate between structurally similar, but functionally divergent TIM barrel families. We then apply our method to predict function for several structural genomics proteins, including orphan structures. Some predictions have been corroborated by other computational methods and some validated by subsequent functional characterization.
Collapse
Affiliation(s)
- Deepak Bandyopadhyay
- Department of Computer Science, University of North Carolina at Chapel Hill, North Carolina 27599, USA
| | | | | | | | | | | | | |
Collapse
|
33
|
Burroughs AM, Allen KN, Dunaway-Mariano D, Aravind L. Evolutionary genomics of the HAD superfamily: understanding the structural adaptations and catalytic diversity in a superfamily of phosphoesterases and allied enzymes. J Mol Biol 2006; 361:1003-34. [PMID: 16889794 DOI: 10.1016/j.jmb.2006.06.049] [Citation(s) in RCA: 321] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2005] [Revised: 06/16/2006] [Accepted: 06/20/2006] [Indexed: 11/21/2022]
Abstract
The HAD (haloacid dehalogenase) superfamily includes phosphoesterases, ATPases, phosphonatases, dehalogenases, and sugar phosphomutases acting on a remarkably diverse set of substrates. The availability of numerous crystal structures of representatives belonging to diverse branches of the HAD superfamily provides us with a unique opportunity to reconstruct their evolutionary history and uncover the principal determinants that led to their diversification of structure and function. To this end we present a comprehensive analysis of the HAD superfamily that identifies their unique structural features and provides a detailed classification of the entire superfamily. We show that at the highest level the HAD superfamily is unified with several other superfamilies, namely the DHH, receiver (CheY-like), von Willebrand A, TOPRIM, classical histone deacetylases and PIN/FLAP nuclease domains, all of which contain a specific form of the Rossmannoid fold. These Rossmannoid folds are distinguished from others by the presence of equivalently placed acidic catalytic residues, including one at the end of the first core beta-strand of the central sheet. The HAD domain is distinguished from these related Rossmannoid folds by two key structural signatures, a "squiggle" (a single helical turn) and a "flap" (a beta hairpin motif) located immediately downstream of the first beta-strand of their core Rossmanoid fold. The squiggle and the flap motifs are predicted to provide the necessary mobility to these enzymes for them to alternate between the "open" and "closed" conformations. In addition, most members of the HAD superfamily contains inserts, termed caps, occurring at either of two positions in the core Rossmannoid fold. We show that the cap modules have been independently inserted into these two stereotypic positions on multiple occasions in evolution and display extensive evolutionary diversification independent of the core catalytic domain. The first group of caps, the C1 caps, is directly inserted into the flap motif and regulates access of reactants to the active site. The second group, the C2 caps, forms a roof over the active site, and access to their internal cavities might be in part regulated by the movement of the flap. The diversification of the cap module was a major factor in the exploration of a vast substrate space in the course of the evolution of this superfamily. We show that the HAD superfamily contains 33 major families distributed across the three superkingdoms of life. Analysis of the phyletic patterns suggests that at least five distinct HAD proteins are traceable to the last universal common ancestor (LUCA) of all extant organisms. While these prototypes diverged prior to the emergence of the LUCA, the major diversification in terms of both substrate specificity and reaction types occurred after the radiation of the three superkingdoms of life, primarily in bacteria. Most major diversification events appear to correlate with the acquisition of new metabolic capabilities, especially related to the elaboration of carbohydrate metabolism in the bacteria. The newly identified relationships and functional predictions provided here are likely to aid the future exploration of the numerous poorly understood members of this large superfamily of enzymes.
Collapse
Affiliation(s)
- A Maxwell Burroughs
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
34
|
Stano NM, Chen J, McHenry CS. A coproofreading Zn(2+)-dependent exonuclease within a bacterial replicase. Nat Struct Mol Biol 2006; 13:458-9. [PMID: 16604084 DOI: 10.1038/nsmb1078] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2006] [Accepted: 03/02/2006] [Indexed: 11/09/2022]
Abstract
The proofreading exonucleases of all DNA replicases contain acidic residues that chelate two Mg(2+) ions that participate in catalysis. DNA polymerase III holoenzymes contain their proofreading activity in a separate subunit, epsilon, which binds the polymerase subunit, alpha, through alpha's N-terminal php domain. Here we demonstrate that the alpha php domain contains a novel Zn(2+)-dependent 3' --> 5' exonuclease that preferentially removes mispaired nucleotides, providing the first example of a coediting nuclease.
Collapse
Affiliation(s)
- Natalie M Stano
- Department of Biochemistry and Molecular Genetics, University of Colorado Health Sciences Center, Denver, Colorado 80262, USA
| | | | | |
Collapse
|
35
|
Damm KL, Carlson HA. Gaussian-weighted RMSD superposition of proteins: a structural comparison for flexible proteins and predicted protein structures. Biophys J 2006; 90:4558-73. [PMID: 16565070 PMCID: PMC1471868 DOI: 10.1529/biophysj.105.066654] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Many proteins contain flexible structures such as loops and hinged domains. A simple root mean square deviation (RMSD) alignment of two different conformations of the same protein can be skewed by the difference between the mobile regions. To overcome this problem, we have developed a novel method to overlay two protein conformations by their atomic coordinates using a Gaussian-weighted RMSD (wRMSD) fit. The algorithm is based on the Kabsch least-squares method and determines an optimal transformation between two molecules by calculating the minimal weighted deviation between the two coordinate sets. Unlike other techniques that choose subsets of residues to overlay, all atoms are included in the wRMSD overlay. Atoms that barely move between the two conformations will have a greater weighting than those that have a large displacement. Our superposition tool has produced successful alignments when applied to proteins for which two conformations are known. The transformation calculation is heavily weighted by the coordinates of the static region of the two conformations, highlighting the range of flexibility in the overlaid structures. Lastly, we show how wRMSD fits can be used to evaluate predicted protein structures. Comparing a predicted fold to its experimentally determined target structure is another case of comparing two protein conformations of the same sequence, and the degree of alignment directly reflects the quality of the prediction.
Collapse
Affiliation(s)
- Kelly L Damm
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, Michigan 48109-1065, USA
| | | |
Collapse
|
36
|
Wieczorek A, McHenry CS. The NH2-terminal php domain of the alpha subunit of the Escherichia coli replicase binds the epsilon proofreading subunit. J Biol Chem 2006; 281:12561-7. [PMID: 16517598 DOI: 10.1074/jbc.m513844200] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The alpha subunit of the replicase of all bacteria contains a php domain, initially identified by its similarity to histidinol phosphatase but of otherwise unknown function (Aravind, L., and Koonin, E. V. (1998) Nucleic Acids Res. 26, 3746-3752). Deletion of 60 residues from the NH2 terminus of the alpha php domain destroys epsilon binding. The minimal 255-residue php domain, estimated by sequence alignment with homolog YcdX, is insufficient for epsilon binding. However, a 320-residue segment including sequences that immediately precede the polymerase domain binds epsilon with the same affinity as the 1160-residue full-length alpha subunit. A subset of mutations of a conserved acidic residue (Asp43 in Escherichia coli alpha) present in the php domain of all bacterial replicases resulted in defects in epsilon binding. Using sequence alignments, we show that the prototypical gram+ Pol C, which contains the polymerase and proofreading activities within the same polypeptide chain, has an epsilon-like sequence inserted in a surface loop near the center of the homologous YcdX protein. These findings suggest that the php domain serves as a platform to enable coordination of proofreading and polymerase activities during chromosomal replication.
Collapse
Affiliation(s)
- Anna Wieczorek
- Department of Biochemistry and Molecular Genetics, University of Colorado Health Sciences Center, Denver, Colorado 80262, USA
| | | |
Collapse
|
37
|
Todd AE, Marsden RL, Thornton JM, Orengo CA. Progress of Structural Genomics Initiatives: An Analysis of Solved Target Structures. J Mol Biol 2005; 348:1235-60. [PMID: 15854658 DOI: 10.1016/j.jmb.2005.03.037] [Citation(s) in RCA: 103] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2004] [Revised: 02/28/2005] [Accepted: 03/15/2005] [Indexed: 11/27/2022]
Abstract
The explosion in gene sequence data and technological breakthroughs in protein structure determination inspired the launch of structural genomics (SG) initiatives. An often stated goal of structural genomics is the high-throughput structural characterisation of all protein sequence families, with the long-term hope of significantly impacting on the life sciences, biotechnology and drug discovery. Here, we present a comprehensive analysis of solved SG targets to assess progress of these initiatives. Eleven consortia have contributed 316 non-redundant entries and 323 protein chains to the Protein Data Bank (PDB), and 459 and 393 domains to the CATH and SCOP structure classifications, respectively. The quality and size of these proteins are comparable to those solved in traditional structural biology and, despite huge scope for duplicated efforts, only 14% of targets have a close homologue (>/=30% sequence identity) solved by another consortium. Analysis of CATH and SCOP revealed the significant contribution that structural genomics is making to the coverage of superfamilies and folds. A total of 67% of SG domains in CATH are unique, lacking an already characterised close homologue in the PDB, whereas only 21% of non-SG domains are unique. For 29% of domains, structure determination revealed a remote evolutionary relationship not apparent from sequence, and 19% and 11% contributed new superfamilies and folds. The secondary structure class, fold and superfamily distributions of this dataset reflect those of the genomes. The domains fall into 172 different folds and 259 superfamilies in CATH but the distribution is highly skewed. The most populous of these are those that recur most frequently in the genomes. Whilst 11% of superfamilies are bacteria-specific, most are common to all three superkingdoms of life and together the 316 PDB entries have provided new and reliable homology models for 9287 non-redundant gene sequences in 206 completely sequenced genomes. From the perspective of this analysis, it appears that structural genomics is on track to be a success, and it is hoped that this work will inform future directions of the field.
Collapse
Affiliation(s)
- Annabel E Todd
- Department of Biochemistry and Molecular Biology, University College London, Gower Street, London, WC1E 6BT, UK.
| | | | | | | |
Collapse
|
38
|
Dlakić M. 3D models of yeast RNase P/MRP proteins Rpp1p and Pop3p. RNA (NEW YORK, N.Y.) 2005; 11:123-127. [PMID: 15613537 PMCID: PMC1370701 DOI: 10.1261/rna.7128905] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2004] [Accepted: 11/08/2004] [Indexed: 05/24/2023]
Abstract
Sensitive profile searches and fold recognition were used to predict the structures of two yeast RNase P/MRP proteins. Rpp1p, which is one of the subunits common to eukaryotes and archaea, is predicted to adopt the seven-stranded TIM-barrel fold found in PHP phosphoesterases. Pop3p, initially thought to be one of the RNase P/MRP subunits unique to yeast, has been assigned the L7Ae/L30e fold. This RNA-binding fold is also present in human RNase P subunit Rpp38, raising the possibility that Pop3p and Rpp38 are functional homologs.
Collapse
|
39
|
Yakunin AF, Yee AA, Savchenko A, Edwards AM, Arrowsmith CH. Structural proteomics: a tool for genome annotation. Curr Opin Chem Biol 2004; 8:42-8. [PMID: 15036155 DOI: 10.1016/j.cbpa.2003.12.003] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
In any newly sequenced genome, 30% to 50% of genes encode proteins with unknown molecular or cellular function. Fortunately, structural genomics is emerging as a powerful approach of functional annotation. Because of recent developments in high-throughput technologies, ongoing structural genomics projects are generating new structures at an unprecedented rate. In the past year, structural studies have identified many new structural motifs involved in enzymatic catalysis or in binding ligands or other macromolecules (DNA, RNA, protein). The efficiency by which function is deduced from structure can be further improved by the integration of structure with bioinformatics and other experimental approaches, such as screening for enzymatic activity or ligand binding.
Collapse
Affiliation(s)
- Alexander F Yakunin
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario M5G 1L6, Canada
| | | | | | | | | |
Collapse
|