1601
|
Knutson BA. Insights into the domain and repeat architecture of target of rapamycin. J Struct Biol 2010; 170:354-63. [PMID: 20060908 DOI: 10.1016/j.jsb.2010.01.002] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2009] [Revised: 12/21/2009] [Accepted: 01/04/2010] [Indexed: 01/09/2023]
Abstract
A simple and efficient protein sequence analysis strategy was developed to predict the number and location of structural repeats in the TOR protein. This strategy uses multiple HHpred alignments against proteins of known 3D structure to enable protein repeats referenced from the 3D structure to be traced back to the query protein sequence by using user-directed repeat assignments. The HHpred strategy performed with high sensitivity by predicting 100% of the repeat units within a test set of HEAT- and TPR-repeat-containing proteins of known three-dimensional structure. The HHpred strategy predicts that TOR contains 32 tandem HEAT repeats extending from the N-terminus to the FAT domain, which is itself comprised of 16 tandem TPR repeats. These findings were used to assemble a 3D atomic model for the TOR protein.
Collapse
Affiliation(s)
- Bruce A Knutson
- Fred Hutchinson Cancer Research Center, Basic Sciences Division, 1100 Fairview Ave N, P.O. Box 19024, Mailstop A1-162, Seattle, WA 98109, USA.
| |
Collapse
|
1602
|
Abstract
While the prediction of a native protein structure from sequence continues to remain a challenging problem, over the past decades computational methods have become quite successful in exploiting the mechanisms behind secondary structure formation. The great effort expended in this area has resulted in the development of a vast number of secondary structure prediction methods. Especially the combination of well-optimized/sensitive machine-learning algorithms and inclusion of homologous sequence information has led to increased prediction accuracies of up to 80%. In this chapter, we will first introduce some basic notions and provide a brief history of secondary structure prediction advances. Then a comprehensive overview of state-of-the-art prediction methods will be given. Finally, we will discuss open questions and challenges in this field and provide some practical recommendations for the user.
Collapse
Affiliation(s)
- Walter Pirovano
- Centre for Integrative Bioinformatics VU, VU University, Amsterdam, The Netherlands
| | | |
Collapse
|
1603
|
Finn RD, Mistry J, Tate J, Coggill P, Heger A, Pollington JE, Gavin OL, Gunasekaran P, Ceric G, Forslund K, Holm L, Sonnhammer ELL, Eddy SR, Bateman A. The Pfam protein families database. Nucleic Acids Res 2010; 38:D211-22. [PMID: 19920124 PMCID: PMC2808889 DOI: 10.1093/nar/gkp985] [Citation(s) in RCA: 2359] [Impact Index Per Article: 157.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2009] [Accepted: 10/15/2009] [Indexed: 11/13/2022] Open
Abstract
Pfam is a widely used database of protein families and domains. This article describes a set of major updates that we have implemented in the latest release (version 24.0). The most important change is that we now use HMMER3, the latest version of the popular profile hidden Markov model package. This software is approximately 100 times faster than HMMER2 and is more sensitive due to the routine use of the forward algorithm. The move to HMMER3 has necessitated numerous changes to Pfam that are described in detail. Pfam release 24.0 contains 11,912 families, of which a large number have been significantly updated during the past two years. Pfam is available via servers in the UK (http://pfam.sanger.ac.uk/), the USA (http://pfam.janelia.org/) and Sweden (http://pfam.sbc.su.se/).
Collapse
Affiliation(s)
- Robert D Finn
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1604
|
Brylinski M, Skolnick J. Comparison of structure-based and threading-based approaches to protein functional annotation. Proteins 2010; 78:118-34. [PMID: 19731377 PMCID: PMC2804779 DOI: 10.1002/prot.22566] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
To exploit the vast amount of sequence information provided by the Genomic revolution, the biological function of these sequences must be identified. As a practical matter, this is often accomplished by functional inference. Purely sequence-based approaches, particularly in the "twilight zone" of low sequence similarity levels, are complicated by many factors. For proteins, structure-based techniques aim to overcome these problems; however, most require high-quality crystal structures and suffer from complex and equivocal relations between protein fold and function. In this study, in extensive benchmarking, we consider a number of aspects of structure-based functional annotation: binding pocket detection, molecular function assignment and ligand-based virtual screening. We demonstrate that protein threading driven by a strong sequence profile component greatly improves the quality of purely structure-based functional annotation in the "twilight zone." By detecting evolutionarily related proteins, it considerably reduces the high false positive rate of function inference derived on the basis of global structure similarity alone. Combined evolution/structure-based function assignment emerges as a powerful technique that can make a significant contribution to comprehensive proteome annotation.
Collapse
Affiliation(s)
- Michal Brylinski
- Center for the Study of Systems Biology School of Biology, Georgia Institute of Technology, 250 14th Street NW, Atlanta, GA 30318
| | - Jeffrey Skolnick
- Center for the Study of Systems Biology School of Biology, Georgia Institute of Technology, 250 14th Street NW, Atlanta, GA 30318
| |
Collapse
|
1605
|
Zhou T, Shu N, Hovmöller S. A novel method for accurate one-dimensional protein structure prediction based on fragment matching. Bioinformatics 2009; 26:470-7. [DOI: 10.1093/bioinformatics/btp679] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
1606
|
Considering scores between unrelated proteins in the search database improves profile comparison. BMC Bioinformatics 2009; 10:399. [PMID: 19961610 PMCID: PMC3087343 DOI: 10.1186/1471-2105-10-399] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2009] [Accepted: 12/04/2009] [Indexed: 12/02/2022] Open
Abstract
Background Profile-based comparison of multiple sequence alignments is a powerful methodology for the detection remote protein sequence similarity, which is essential for the inference and analysis of protein structure, function, and evolution. Accurate estimation of statistical significance of detected profile similarities is essential for further development of this methodology. Here we analyze a novel approach to estimate the statistical significance of profile similarity: the explicit consideration of background score distributions for each database template (subject). Results Using a simple scheme to combine and analytically approximate query- and subject-based distributions, we show that (i) inclusion of background distributions for the subjects increases the quality of homology detection; (ii) this increase is higher when the distributions are based on the scores to all known non-homologs of the subject rather than a small calibration subset of the database representatives; and (iii) these all known non-homolog distributions of scores for the subject make the dominant contribution to the improved performance: adding the calibration distribution of the query has a negligible additional effect. Conclusion The construction of distributions based on the complete sets of non-homologs for each subject is particularly relevant in the setting of structure prediction where the database consists of proteins with solved 3D structure (PDB, SCOP, CATH, etc.) and therefore structural relationships between proteins are known. These results point to a potential new direction in the development of more powerful methods for remote homology detection.
Collapse
|
1607
|
Tsoy O, Ravcheev D, Mushegian A. Comparative genomics of ethanolamine utilization. J Bacteriol 2009; 191:7157-64. [PMID: 19783625 PMCID: PMC2786565 DOI: 10.1128/jb.00838-09] [Citation(s) in RCA: 87] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2009] [Accepted: 09/18/2009] [Indexed: 11/20/2022] Open
Abstract
Ethanolamine can be used as a source of carbon and nitrogen by phylogenetically diverse bacteria. Ethanolamine-ammonia lyase, the enzyme that breaks ethanolamine into acetaldehyde and ammonia, is encoded by the gene tandem eutBC. Despite extensive studies of ethanolamine utilization in Salmonella enterica serovar Typhimurium, much remains to be learned about EutBC structure and catalytic mechanism, about the evolutionary origin of ethanolamine utilization, and about regulatory links between the metabolism of ethanolamine itself and the ethanolamine-ammonia lyase cofactor adenosylcobalamin. We used computational analysis of sequences, structures, genome contexts, and phylogenies of ethanolamine-ammonia lyases to address these questions and to evaluate recent data-mining studies that have suggested an association between bacterial food poisoning and the diol utilization pathways. We found that EutBC evolution included recruitment of a TIM barrel and a Rossmann fold domain and their fusion to N-terminal alpha-helical domains to give EutB and EutC, respectively. This fusion was followed by recruitment and occasional loss of auxiliary ethanolamine utilization genes in Firmicutes and by several horizontal transfers, most notably from the firmicute stem to the Enterobacteriaceae and from Alphaproteobacteria to Actinobacteria. We identified a conserved DNA motif that likely represents the EutR-binding site and is shared by the ethanolamine and cobalamin operons in several enterobacterial species, suggesting a mechanism for coupling the biosyntheses of apoenzyme and cofactor in these species. Finally, we found that the food poisoning phenotype is associated with the structural components of metabolosome more strongly than with ethanolamine utilization genes or with paralogous propanediol utilization genes per se.
Collapse
Affiliation(s)
- Olga Tsoy
- Department of Bioengineering and Bioinformatics, Moscow State University, Vorob'evy gory 1-73, Moscow 119992, Russia, Institute for Information Transmission Problems, RAS, Bolshoi Karetny Pereulok 19, Moscow 127994, Russia, Stowers Institute for Medical Research, 1000 E. 50th St., Kansas City, Missouri 64110, Department of Microbiology, Molecular Genetics, and Immunology, University of Kansas Medical Center, Kansas City, Kansas 66160
| | - Dmitry Ravcheev
- Department of Bioengineering and Bioinformatics, Moscow State University, Vorob'evy gory 1-73, Moscow 119992, Russia, Institute for Information Transmission Problems, RAS, Bolshoi Karetny Pereulok 19, Moscow 127994, Russia, Stowers Institute for Medical Research, 1000 E. 50th St., Kansas City, Missouri 64110, Department of Microbiology, Molecular Genetics, and Immunology, University of Kansas Medical Center, Kansas City, Kansas 66160
| | - Arcady Mushegian
- Department of Bioengineering and Bioinformatics, Moscow State University, Vorob'evy gory 1-73, Moscow 119992, Russia, Institute for Information Transmission Problems, RAS, Bolshoi Karetny Pereulok 19, Moscow 127994, Russia, Stowers Institute for Medical Research, 1000 E. 50th St., Kansas City, Missouri 64110, Department of Microbiology, Molecular Genetics, and Immunology, University of Kansas Medical Center, Kansas City, Kansas 66160
| |
Collapse
|
1608
|
Huynen MA, de Hollander M, Szklarczyk R. Mitochondrial proteome evolution and genetic disease. Biochim Biophys Acta Mol Basis Dis 2009; 1792:1122-9. [DOI: 10.1016/j.bbadis.2009.03.005] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2008] [Revised: 03/04/2009] [Accepted: 03/20/2009] [Indexed: 11/16/2022]
|
1609
|
Alva V, Dunin-Horkawicz S, Habeck M, Coles M, Lupas AN. The GD box: a widespread noncontiguous supersecondary structural element. Protein Sci 2009; 18:1961-6. [PMID: 19609930 DOI: 10.1002/pro.207] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Identification and characterization of recurrent supersecondary structural elements is central to understanding the rules governing protein tertiary structure. Here, we describe the GD box, a widespread noncontiguous supersecondary element, which we initially found in a group of topologically distinct but homologous beta-barrels--the cradle-loop barrels. The GD box is similar both in sequence and structure and comprises two short unpaired beta-strands connected by an orthogonal type-II beta-turn and a noncontiguous beta-strand forming hydrogen bonds with the beta-turn. Using structure-based analysis, we have detected 518 instances of the GD box in a nonredundant subset of the SCOP database comprising 3771 domains. Apart from the cradle-loop barrels, this motif is also found in a diverse set of nonhomologous folds including other topologically related beta-barrels. Since nonlocal interactions are fundamental in the formation of protein structure, systematic identification and characterization of other noncontiguous supersecondary structural elements is likely to prove valuable to protein structure modeling, validation, and prediction.
Collapse
Affiliation(s)
- Vikram Alva
- Department of Protein Evolution, Max-Planck-Institute for Developmental Biology, 72076 Tübingen, Germany
| | | | | | | | | |
Collapse
|
1610
|
Clifford M, Twigg J, Upton C. Evidence for a novel gene associated with human influenza A viruses. Virol J 2009; 6:198. [PMID: 19917120 PMCID: PMC2780412 DOI: 10.1186/1743-422x-6-198] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2009] [Accepted: 11/16/2009] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Influenza A virus genomes are comprised of 8 negative strand single-stranded RNA segments and are thought to encode 11 proteins, which are all translated from mRNAs complementary to the genomic strands. Although human, swine and avian influenza A viruses are very similar, cross-species infections are usually limited. However, antigenic differences are considerable and when viruses become established in a different host or if novel viruses are created by re-assortment devastating pandemics may arise. RESULTS Examination of influenza A virus genomes from the early 20th Century revealed the association of a 167 codon ORF encoded by the genomic strand of segment 8 with human isolates. Close to the timing of the 1948 pseudopandemic, a mutation occurred that resulted in the extension of this ORF to 216 codons. Since 1948, this ORF has been almost totally maintained in human influenza A viruses suggesting a selectable biological function. The discovery of cytotoxic T cells responding to an epitope encoded by this ORF suggests that it is translated into protein. Evidence of several other non-traditionally translated polypeptides in influenza A virus support the translation of this genomic strand ORF. The gene product is predicted to have a signal sequence and two transmembrane domains. CONCLUSION We hypothesize that the genomic strand of segment 8 of encodes a novel influenza A virus protein. The persistence and conservation of this genomic strand ORF for almost a century in human influenza A viruses provides strong evidence that it is translated into a polypeptide that enhances viral fitness in the human host. This has important consequences for the interpretation of experiments that utilize mutations in the NS1 and NEP genes of segment 8 and also for the consideration of events that may alter the spread and/or pathogenesis of swine and avian influenza A viruses in the human population.
Collapse
Affiliation(s)
- Monica Clifford
- Department of Biochemistry and Microbiology, University of Victoria, Victoria, BC, V8W 3P6, Canada
| | - James Twigg
- Department of Biochemistry and Microbiology, University of Victoria, Victoria, BC, V8W 3P6, Canada
| | - Chris Upton
- Department of Biochemistry and Microbiology, University of Victoria, Victoria, BC, V8W 3P6, Canada
| |
Collapse
|
1611
|
Abstract
The observation that similar protein sequences fold into similar three-dimensional structures provides a basis for the methods which predict structural features of a novel protein based on the similarity between its sequence and sequences of known protein structures. Similarity over entire sequence or large sequence fragment(s) enables prediction and modeling of entire structural domains while statistics derived from distributions of local features of known protein structures make it possible to predict such features in proteins with unknown structures. The accuracy of models of protein structures is sufficient for many practical purposes such as analysis of point mutation effects, enzymatic reactions, interaction interfaces of protein complexes, and active sites. Protein models are also used for phasing of crystallographic data and, in some cases, for drug design. By using models one can avoid the costly and time-consuming process of experimental structure determination. The purpose of this chapter is to give a practical review of the most popular protein structure prediction methods based on sequence similarity and to outline a practical approach to protein structure prediction. While the main focus of this chapter is on template-based protein structure prediction, it also provides references to other methods and programs which play an important role in protein structure prediction.
Collapse
|
1612
|
de Graaf RM, van Alen TA, Dutilh BE, Kuiper JWP, van Zoggel HJAA, Huynh MB, Görtz HD, Huynen MA, Hackstein JHP. The mitochondrial genomes of the ciliates Euplotes minuta and Euplotes crassus. BMC Genomics 2009; 10:514. [PMID: 19895685 PMCID: PMC2779199 DOI: 10.1186/1471-2164-10-514] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2009] [Accepted: 11/06/2009] [Indexed: 11/23/2022] Open
Abstract
Background There are thousands of very diverse ciliate species from which only a handful mitochondrial genomes have been studied so far. These genomes are rather similar because the ciliates analysed (Tetrahymena spp. and Paramecium aurelia) are closely related. Here we study the mitochondrial genomes of the hypotrichous ciliates Euplotes minuta and Euplotes crassus. These ciliates are only distantly related to Tetrahymena spp. and Paramecium aurelia, but more closely related to Nyctotherus ovalis, which possesses a hydrogenosomal (mitochondrial) genome. Results The linear mitochondrial genomes of the hypotrichous ciliates Euplotes minuta and Euplotes crassus were sequenced and compared with the mitochondrial genomes of several Tetrahymena species, Paramecium aurelia and the partially sequenced mitochondrial genome of the anaerobic ciliate Nyctotherus ovalis. This study reports new features such as long 5'gene extensions of several mitochondrial genes, extremely long cox1 and cox2 open reading frames and a large repeat in the middle of the linear mitochondrial genome. The repeat separates the open reading frames into two blocks, each having a single direction of transcription, from the repeat towards the ends of the chromosome. Although the Euplotes mitochondrial gene content is almost identical to that of Paramecium and Tetrahymena, the order of the genes is completely different. In contrast, the 33273 bp (excluding the repeat region) piece of the mitochondrial genome that has been sequenced in both Euplotes species exhibits no difference in gene order. Unexpectedly, many of the mitochondrial genes of E. minuta encoding ribosomal proteins possess N-terminal extensions that are similar to mitochondrial targeting signals. Conclusion The mitochondrial genomes of the hypotrichous ciliates Euplotes minuta and Euplotes crassus are rather different from the previously studied genomes. Many genes are extended in size compared to mitochondrial genes from other sources.
Collapse
Affiliation(s)
- Rob M de Graaf
- Department of Evolutionary Microbiology, IWWR, Radboud University Nijmegen, Nijmegen, The Netherlands.
| | | | | | | | | | | | | | | | | |
Collapse
|
1613
|
Hsu S, Kim Y, Li S, Durrant ES, Pace RM, Woods VL, Gentry MS. Structural insights into glucan phosphatase dynamics using amide hydrogen-deuterium exchange mass spectrometry. Biochemistry 2009; 48:9891-902. [PMID: 19754155 DOI: 10.1021/bi9008853] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Laforin and starch excess 4 (SEX4) are founding members of a class of phosphatases that dephosphorylate phosphoglucans. Each protein contains a carbohydrate binding module (CBM) and a dual-specificity phosphatase (DSP) domain. The gene encoding laforin is mutated in a fatal neurodegenerative disease called Lafora disease (LD). In the absence of laforin function, insoluble glucans that are hyperphosphorylated and exhibit sparse branching accumulate. It is hypothesized that these accumulations trigger the neurodegeneration and premature death of LD patients. We recently demonstrated that laforin removes phosphate from phosphoglucans and hypothesized that this function inhibits insoluble glucan accumulation. Loss of SEX4 function in plants yields a similar cellular phenotype; an excess amount of insoluble, hyperphosphorylated glucans accumulates in cells. While multiple groups have shown that these phosphatases dephosphorylate phosphoglucans, there is no structure of a glucan phosphatase and little is known about the mechanism whereby they perform this action. We utilized hydrogen-deuterium exchange mass spectrometry (DXMS) and structural modeling to probe the conformational and structural dynamics of the glucan phosphatase SEX4. We found that the enzyme does not undergo a global conformational change upon glucan binding but instead undergoes minimal rearrangement upon binding. The CBM has improved protection from deuteration when bound to glucans, confirming its role in glucan binding. More interestingly, we identified structural components of the DSP that also have improved protection from deuteration upon glucan addition. To determine the position of these regions, we generated a homology model of the SEX4 DSP. The homology model shows that all of these regions are adjacent to the DSP active site. Therefore, our results suggest that these regions of the DSP participate in the presentation of the phosphoglucan to the active site and provide the first structural analysis and mode of action of this unique class of phosphatases.
Collapse
Affiliation(s)
- Simon Hsu
- Department of Medicine, University of California at San Diego, La Jolla, California 92093-0601, USA
| | | | | | | | | | | | | |
Collapse
|
1614
|
Pei J, Grishin NV. The Rho GTPase inactivation domain in Vibrio cholerae MARTX toxin has a circularly permuted papain-like thiol protease fold. Proteins 2009; 77:413-9. [PMID: 19434753 PMCID: PMC3688474 DOI: 10.1002/prot.22447] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
A Rho GTPase inactivation domain (RID) has been discovered in the multifunctional, autoprocessing RTX toxin RtxA from Vibrio cholerae. The RID domain causes actin depolymerization and rounding of host cells through inactivation of the small Rho GTPases Rho, Rac, and Cdc42. With only a few toxin proteins containing RID domains in the current sequence database, the structure and molecular mechanisms of this domain are unknown. Using comparative sequence and structural analyses, we report homology inference, fold recognition, and active site prediction for RID domains. Remote homologs of RID domains were identified in two other experimentally characterized bacterial virulence factors: IcsB of Shigella flexneri and BopA of Burkholderia pseudomallei, as well as in a group of uncharacterized bacterial membrane proteins. IcsB plays an important role in helping Shigella to evade the host autophagy defense system. RID domain homologs share a conserved diad of cysteine and histidine residues, and are predicted to adopt a circularly permuted papain-like thiol protease fold. RID domains from MARTX toxins and virulence factors IcsB and BopA thus could function as proteases or acyltransferases acting on host molecules. Our results provide structural and mechanistic insights into several important proteins functioning in bacterial pathogenesis.
Collapse
Affiliation(s)
- Jimin Pei
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas 75390-9050, USA.
| | | |
Collapse
|
1615
|
Bazan JF, de Sauvage FJ. Structural ties between cholesterol transport and morphogen signaling. Cell 2009; 138:1055-6. [PMID: 19766557 DOI: 10.1016/j.cell.2009.09.006] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
|
1616
|
Jaszczur M, Rudzka J, Kraszewska J, Flis K, Polaczek P, Campbell JL, Fijalkowska IJ, Jonczyk P. Defective interaction between Pol2p and Dpb2p, subunits of DNA polymerase epsilon, contributes to a mutator phenotype in Saccharomyces cerevisiae. Mutat Res 2009; 669:27-35. [PMID: 19463834 PMCID: PMC2771212 DOI: 10.1016/j.mrfmmm.2009.04.012] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2008] [Revised: 03/12/2009] [Accepted: 04/17/2009] [Indexed: 10/20/2022]
Abstract
Most of the prokaryotic and eukaryotic replicative polymerases are multi-subunit complexes. There are several examples indicating that noncatalytic subunits of DNA polymerases may function as fidelity factors during replication process. In this work, we have further investigated the role of Dpb2p, a noncatalytic subunit of DNA polymerase epsilon holoenzyme from Saccharomyces cerevisiae in controlling the level of spontaneous mutagenesis. The data presented indicate that impaired interaction between catalytic Pol2p subunit and Dpb2p is responsible for the observed mutator phenotype in S. cerevisiae strains carrying different mutated alleles of the DPB2 gene. We observed a significant correlation between the decreased level of interaction between different mutated forms of Dpb2p towards a wild-type form of Pol2p and the strength of mutator phenotype that they confer. We propose that structural integrity of the Pol epsilon holoenzyme is essential for genetic stability in S. cerevisiae cells.
Collapse
Affiliation(s)
- Malgorzata Jaszczur
- Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Pawinskiego 5A, 02-106 Warsaw, Poland
| | - Justyna Rudzka
- Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Pawinskiego 5A, 02-106 Warsaw, Poland
| | - Joanna Kraszewska
- Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Pawinskiego 5A, 02-106 Warsaw, Poland
| | - Krzysztof Flis
- Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Pawinskiego 5A, 02-106 Warsaw, Poland
| | - Piotr Polaczek
- Braun Laboratories, California Institute of Technology, Pasadena, California 91125
| | - Judith L. Campbell
- Braun Laboratories, California Institute of Technology, Pasadena, California 91125
| | - Iwona J. Fijalkowska
- Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Pawinskiego 5A, 02-106 Warsaw, Poland
| | - Piotr Jonczyk
- Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Pawinskiego 5A, 02-106 Warsaw, Poland
| |
Collapse
|
1617
|
Rastgou M, Habibi MK, Izadpanah K, Masenga V, Milne RG, Wolf YI, Koonin EV, Turina M. Molecular characterization of the plant virus genus Ourmiavirus and evidence of inter-kingdom reassortment of viral genome segments as its possible route of origin. J Gen Virol 2009; 90:2525-2535. [PMID: 19535502 PMCID: PMC4091139 DOI: 10.1099/vir.0.013086-0] [Citation(s) in RCA: 88] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2009] [Accepted: 06/15/2009] [Indexed: 01/12/2023] Open
Abstract
Ourmia melon virus (OuMV), Epirus cherry virus (EpCV) and Cassava virus C (CsVC) are three species placed in the genus Ourmiavirus. We cloned and sequenced their RNA genomes. The sizes of the three genomic RNAs of OuMV, the type member of the genus, were 2814, 1064 and 974 nt and each had one open reading frame. RNA1 potentially encoded a 97.5 kDa protein carrying the GDD motif typical of RNA-dependent RNA polymerases (RdRps). The putative RdRps of ourmiaviruses are distantly related to known viral RdRps, with the closest similarity and phylogenetic affinity observed with fungal viruses of the genus Narnaviridae. RNA2 encoded a 31.6 kDa protein which, expressed in bacteria as a His-tag fusion protein and in plants through agroinfiltration, reacted specifically with antibodies made against tubular structures found in the cytoplasm. The ORF2 product is significantly similar to movement proteins of the genus Tombusviridae, and phylogenetic analysis supported this evolutionary relationship. The product of OuMV ORF3 is a 23.8 kDa protein. This protein was also expressed in bacteria and plants, and reacted specifically with antisera against the OuMV coat protein. The sequence of the ORF3 protein showed limited but significant similarity to capsid proteins of several plant and animal viruses, although phylogenetic analysis failed to reveal its most likely origin. Taken together, these results indicate that ourmiaviruses comprise a unique group of plant viruses that might have evolved by reassortment of genomic segments of RNA viruses infecting hosts belonging to different eukaryotic kingdoms, in particular, fungi and plants.
Collapse
Affiliation(s)
- M Rastgou
- Department of Plant Protection, College of Agriculture, Urmia University, Urmia, Iran
- Plant Protection Department, Faculty of Horticultural Science & Plant Protection, College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - M K Habibi
- Plant Protection Department, Faculty of Horticultural Science & Plant Protection, College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - K Izadpanah
- Plant Virology Research Center, Shiraz University, Shiraz, Iran
| | - V Masenga
- Istituto di Virologia Vegetale, CNR, Strada delle Cacce 73, 10135 Torino, Italy
| | - R G Milne
- Istituto di Virologia Vegetale, CNR, Strada delle Cacce 73, 10135 Torino, Italy
| | - Y I Wolf
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - E V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - M Turina
- Istituto di Virologia Vegetale, CNR, Strada delle Cacce 73, 10135 Torino, Italy
| |
Collapse
|
1618
|
Jaroszewski L, Li Z, Krishna SS, Bakolitsa C, Wooley J, Deacon AM, Wilson IA, Godzik A. Exploration of uncharted regions of the protein universe. PLoS Biol 2009; 7:e1000205. [PMID: 19787035 PMCID: PMC2744874 DOI: 10.1371/journal.pbio.1000205] [Citation(s) in RCA: 97] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2009] [Accepted: 08/19/2009] [Indexed: 12/02/2022] Open
Abstract
Determination of first protein structures, from hundreds of families of unknown function, have shown that divergence, rather than novelty, is the dominant force that shapes the evolution of the protein universe. The genome projects have unearthed an enormous diversity of genes of unknown function that are still awaiting biological and biochemical characterization. These genes, as most others, can be grouped into families based on sequence similarity. The PFAM database currently contains over 2,200 such families, referred to as domains of unknown function (DUF). In a coordinated effort, the four large-scale centers of the NIH Protein Structure Initiative have determined the first three-dimensional structures for more than 250 of these DUF families. Analysis of the first 248 reveals that about two thirds of the DUF families likely represent very divergent branches of already known and well-characterized families, which allows hypotheses to be formulated about their biological function. The remainder can be formally categorized as new folds, although about one third of these show significant substructure similarity to previously characterized folds. These results infer that, despite the enormous increase in the number and the diversity of new genes being uncovered, the fold space of the proteins they encode is gradually becoming saturated. The previously unexplored sectors of the protein universe appear to be primarily shaped by extreme diversification of known protein families, which then enables organisms to evolve new functions and adapt to particular niches and habitats. Notwithstanding, these DUF families still constitute the richest source for discovery of the remaining protein folds and topologies. More than 40% of known proteins lack any annotation within public databases and are usually referred to as hypothetical proteins despite most of them being real and many being evolutionarily conserved and thus expected to play important biological roles. Determination of the three-dimensional structures of representatives of more than 240 families of protein domains of unknown function by the Protein Structure Initiative has provided a unique sample of regions of the protein universe that, until this systematic effort, were completely uncharacterized. Analysis of these structures reveals that most of the 240 families can be considered as remote homologs of already known protein families. Such distant evolutionary links can sometimes be predicted by current state-of-the-art sequence comparison tools, but structural analysis has led to the first hypotheses about biological functions for many of these uncharacterized proteins, and serves as a starting point for experimental studies. The rapid pace of discovery of such relationships appears to suggest that the protein universe is made up of a relatively small and stable number of ‘extended neighborhoods’ that bring together distantly related protein families. Thus, the vast uncharacterized part of protein universe, called by some “the dark matter of protein space”, may consist mainly of highly divergent homologs. Continued structural characterization of these previously under-investigated regions of the protein universe should further help unravel the patterns and rules that led to such divergence in the evolution of protein structure and function.
Collapse
Affiliation(s)
- Lukasz Jaroszewski
- Joint Center for Structural Genomics, Bioinformatics Core, Burnham Institute for Medical Research, La Jolla, California, United States of America
| | - Zhanwen Li
- Joint Center for Molecular Modeling, Burnham Institute for Medical Research, La Jolla, California, United States of America
| | - S. Sri Krishna
- Joint Center for Structural Genomics, Bioinformatics Core, Burnham Institute for Medical Research, La Jolla, California, United States of America
| | - Constantina Bakolitsa
- Joint Center for Structural Genomics, Bioinformatics Core, Burnham Institute for Medical Research, La Jolla, California, United States of America
| | - John Wooley
- Joint Center for Structural Genomics, Bioinformatics Core, Center for Research in Biological Systems, University of California San Diego, La Jolla, California, United States of America
| | - Ashley M. Deacon
- Joint Center for Structural Genomics, Structure Determination Core, Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, California, United States of America
| | - Ian A. Wilson
- Joint Center for Structural Genomics, The Scripps Research Institute, La Jolla, California, United States of America
| | - Adam Godzik
- Joint Center for Structural Genomics, Bioinformatics Core, Burnham Institute for Medical Research, La Jolla, California, United States of America
- Joint Center for Molecular Modeling, Burnham Institute for Medical Research, La Jolla, California, United States of America
- Joint Center for Structural Genomics, Bioinformatics Core, Center for Research in Biological Systems, University of California San Diego, La Jolla, California, United States of America
- * E-mail:
| |
Collapse
|
1619
|
Dlakić M. HHsvm: fast and accurate classification of profile-profile matches identified by HHsearch. ACTA ACUST UNITED AC 2009; 25:3071-6. [PMID: 19773335 DOI: 10.1093/bioinformatics/btp555] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Recently developed profile-profile methods rival structural comparisons in their ability to detect homology between distantly related proteins. Despite this tremendous progress, many genuine relationships between protein families cannot be recognized as comparisons of their profiles result in scores that are statistically insignificant. RESULTS Using known evolutionary relationships among protein superfamilies in SCOP database, support vector machines were trained on four sets of discriminatory features derived from the output of HHsearch. Upon validation, it was shown that the automatic classification of all profile-profile matches was superior to fixed threshold-based annotation in terms of sensitivity and specificity. The effectiveness of this approach was demonstrated by annotating several domains of unknown function from the Pfam database. AVAILABILITY Programs and scripts implementing the methods described in this manuscript are freely available from http://hhsvm.dlakiclab.org/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mensur Dlakić
- Department of Microbiology, Montana State University, Bozeman, MT 59717-3520, USA.
| |
Collapse
|
1620
|
Identification and classification of bcl genes and proteins of Bacillus cereus group organisms and their application in Bacillus anthracis detection and fingerprinting. Appl Environ Microbiol 2009; 75:7163-72. [PMID: 19767469 DOI: 10.1128/aem.01069-09] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
The Bacillus cereus group includes three closely related species, B. anthracis, B. cereus, and B. thuringiensis, which form a highly homogeneous subdivision of the genus Bacillus. One of these species, B. anthracis, has been identified as one of the most probable bacterial biowarfare agents. Here, we evaluate the sequence and length polymorphisms of the Bacillus collagen-like protein bcl genes as a basis for B. anthracis detection and fingerprinting. Five genes, designated bclA to bclE, are present in B. anthracis strains. Examination of bclABCDE sequences identified polymorphisms in bclB alleles of the B. cereus group organisms. These sequence polymorphisms allowed specific detection of B. anthracis strains by PCR using both genomic DNA and purified Bacillus spores in reactions. By exploiting the length variation of the bcl alleles it was demonstrated that the combined bclABCDE PCR products generate markedly different fingerprints for the B. anthracis Ames and Sterne strains. Moreover, we predict that bclABCDE length polymorphism creates unique signatures for B. anthracis strains, which facilitates identification of strains with specificity and confidence. Thus, we present a new diagnostic concept for B. anthracis detection and fingerprinting, which can be used alone or in combination with previously established typing platforms.
Collapse
|
1621
|
Bröer S, Schneider HP, Bröer A, Deitmer JW. Mutation of asparagine 76 in the center of glutamine transporter SNAT3 modulates substrate-induced conductances and Na+ binding. J Biol Chem 2009; 284:25823-31. [PMID: 19596860 PMCID: PMC2757984 DOI: 10.1074/jbc.m109.031013] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2009] [Revised: 07/08/2009] [Indexed: 11/06/2022] Open
Abstract
The glutamine transporter SLC38A3 (SNAT3) plays an important role in the release of glutamine from brain astrocytes and the uptake of glutamine into hepatocytes. It is related to the vesicular GABA (gamma-aminobutyric acid) transporter and the SLC36 family of proton-amino acid cotransporters. The transporter carries out electroneutral Na+-glutamine cotransport-H+ antiport. In addition, substrate-induced uncoupled cation currents are observed. Mutation of asparagine 76 to glutamine or histidine in predicted transmembrane helix 1 abolished all substrate-induced currents. Mutation of asparagine 76 to aspartate rendered the transporter Na+-independent and resulted in a gain of a large substrate-induced chloride conductance in the absence of Na+. Thus, a single residue is critical for coupled and uncoupled ion flows in the glutamine transporter SNAT3. Homology modeling of SNAT3 along the structure of the related benzyl-hydantoin permease from Microbacterium liquefaciens reveals that Asn-76 is likely to be located in the center of the membrane close to the translocation pore and forms part of the predicted Na+ -binding site.
Collapse
Affiliation(s)
- Stefan Bröer
- From the Research School of Biology, The Australian National University, Canberra, ACT 0200, Australia and
| | - Hans-Peter Schneider
- the Abteilung für Allgemeine Zoologie, Fachbereich Biologie, Technische Universität Kaiserslautern, Erwin-Schrödinger Strasse, D-67653 Kaiserslautern, Germany
| | - Angelika Bröer
- From the Research School of Biology, The Australian National University, Canberra, ACT 0200, Australia and
| | - Joachim W. Deitmer
- the Abteilung für Allgemeine Zoologie, Fachbereich Biologie, Technische Universität Kaiserslautern, Erwin-Schrödinger Strasse, D-67653 Kaiserslautern, Germany
| |
Collapse
|
1622
|
Sanchez M, Drechsler M, Stark H, Lipps G. DNA translocation activity of the multifunctional replication protein ORF904 from the archaeal plasmid pRN1. Nucleic Acids Res 2009; 37:6831-48. [PMID: 19762479 PMCID: PMC2777425 DOI: 10.1093/nar/gkp742] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The replication protein ORF904 from the plasmid pRN1 is a multifunctional enzyme with ATPase-, primase- and DNA polymerase activity. Sequence analysis suggests the presence of at least two conserved domains: an N-terminal prim/pol domain with primase and DNA polymerase activities and a C-terminal superfamily 3 helicase domain with a strong double-stranded DNA dependant ATPase activity. The exact molecular function of the helicase domain in the process of plasmid replication remains unclear. Potentially this motor protein is involved in duplex remodelling and/or origin opening at the plasmid replication origin. In support of this we found that the monomeric replication protein ORF904 forms a hexameric ring in the presence of DNA. It is able to translocate along single-stranded DNA in 3′–5′ direction as well as on double-stranded DNA. Critical residues important for ATPase activity and DNA translocation activity were identified and are in agreement with a homology model of the helicase domain. In addition we propose that a winged helix DNA-binding domain at the C-terminus of the helicase domain could assist the binding of the replication protein specifically to the replication origin.
Collapse
Affiliation(s)
- Martin Sanchez
- Department of Biochemistry, University of Bayreuth, Universitätsstrasse 30, 95447 Bayreuth, Switzerland
| | | | | | | |
Collapse
|
1623
|
Zhang Z, Albers T, Fiumera HL, Gameiro A, Grewer C. A conserved Na(+) binding site of the sodium-coupled neutral amino acid transporter 2 (SNAT2). J Biol Chem 2009; 284:25314-23. [PMID: 19589777 PMCID: PMC2757233 DOI: 10.1074/jbc.m109.038422] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2009] [Revised: 07/08/2009] [Indexed: 11/06/2022] Open
Abstract
The SLC38 family of solute transporters mediates the coupled transport of amino acids and Na(+) into or out of cells. The structural basis for this coupled transport process is not known. Here, a profile-based sequence analysis approach was used, predicting a distant relationship with the SLC5/6 transporter families. Homology models using the LeuT(Aa) and Mhp1 transporters of known structure as templates were established, predicting the location of a conserved Na(+) binding site in the center of membrane helices 1 and 8. This homology model was tested experimentally in the SLC38 member SNAT2 by analyzing the effect of a mutation to Thr-384, which is predicted to be part of this Na(+) binding site. The results show that the T384A mutation not only inhibits the anion leak current, which requires Na(+) binding to SNAT2, but also dramatically lowers the Na(+) affinity of the transporter. This result is consistent with a previous analysis of the N82A mutant transporter, which has a similar effect on anion leak current and Na(+) binding and which is also expected to form part of the Na(+) binding site. In contrast, random mutations to other sites in the transporter had little or no effect on Na(+) affinity. Our results are consistent with a cation binding site formed by transmembrane helices 1 and 8 that is conserved among the SLC38 transporters as well as among many other bacterial and plant transporter families of unknown structure, which are homologous to SLC38.
Collapse
Affiliation(s)
- Zhou Zhang
- From the College of Life and Environment Sciences, Shanghai Normal University, 100 Guilin Road, Shanghai 200234, China and
| | | | | | | | | |
Collapse
|
1624
|
Horst J, Samudrala R. Diversity of protein structures and difficulties in fold recognition: the curious case of protein G. F1000 BIOLOGY REPORTS 2009; 1:69. [PMID: 20209018 PMCID: PMC2832337 DOI: 10.3410/b1-69] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
We examine the ability of current state-of-the-art methods in protein structure prediction to discriminate topologically distant folds encoded by highly similar (>90% sequence identity) designed proteins in blind protein structure prediction experiments. We detail the corresponding prognosis for the protein fold recognition field and highlight the features of the methodologies that successfully deciphered this folding riddle.
Collapse
Affiliation(s)
- Jeremy Horst
- Department of Oral Biology, University of Washington School of Dentistry1959 NE Pacific Street, Seattle, WA 98195-7132USA
- Department of Microbiology, University of Washington School of Medicine1959 NE Pacific Street, Seattle, WA 98195-7132USA
| | - Ram Samudrala
- Department of Oral Biology, University of Washington School of Dentistry1959 NE Pacific Street, Seattle, WA 98195-7132USA
- Department of Microbiology, University of Washington School of Medicine1959 NE Pacific Street, Seattle, WA 98195-7132USA
| |
Collapse
|
1625
|
Percudani R, Peracchi A. The B6 database: a tool for the description and classification of vitamin B6-dependent enzymatic activities and of the corresponding protein families. BMC Bioinformatics 2009; 10:273. [PMID: 19723314 PMCID: PMC2748086 DOI: 10.1186/1471-2105-10-273] [Citation(s) in RCA: 233] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2009] [Accepted: 09/01/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Enzymes that depend on vitamin B6 (and in particular on its metabolically active form, pyridoxal 5'-phosphate, PLP) are of great relevance to biology and medicine, as they catalyze a wide variety of biochemical reactions mainly involving amino acid substrates. Although PLP-dependent enzymes belong to a small number of independent evolutionary lineages, they encompass more than 160 distinct catalytic functions, thus representing a striking example of divergent evolution. The importance and remarkable versatility of these enzymes, as well as the difficulties in their functional classification, create a need for an integrated source of information about them. DESCRIPTION The B6 database http://bioinformatics.unipr.it/B6db contains documented B6-dependent activities and the relevant protein families, defined as monophyletic groups of sequences possessing the same enzymatic function. One or more families were associated to each of 121 PLP-dependent activities with known sequences. Hidden Markov models (HMMs) were built from family alignments and incorporated in the database. These HMMs can be used for the functional classification of PLP-dependent enzymes in genomic sets of predicted protein sequences. An example of such analyses (a census of human genes coding for PLP-dependent enzymes) is provided here, whereas many more are accessible through the database itself. CONCLUSION The B6 database is a curated repository of biochemical and molecular information about an important group of enzymes. This information is logically organized and available for computational analyses, providing a key resource for the identification, classification and comparative analysis of B6-dependent enzymes.
Collapse
Affiliation(s)
- Riccardo Percudani
- Department of Biochemistry and Molecular Biology, University of Parma, 43100 Parma, Italy.
| | | |
Collapse
|
1626
|
Guex N, Peitsch MC, Schwede T. Automated comparative protein structure modeling with SWISS-MODEL and Swiss-PdbViewer: a historical perspective. Electrophoresis 2009; 30 Suppl 1:S162-73. [PMID: 19517507 DOI: 10.1002/elps.200900140] [Citation(s) in RCA: 1392] [Impact Index Per Article: 87.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
SWISS-MODEL pioneered the field of automated modeling as the first protein modeling service on the Internet. In combination with the visualization tool Swiss-PdbViewer, the Internet-based Workspace and the SWISS-MODEL Repository, it provides a fully integrated sequence to structure analysis and modeling platform. This computational environment is made freely available to the scientific community with the aim to hide the computational complexity of structural bioinformatics and encourage bench scientists to make use of the ever-increasing structural information available. Indeed, over the last decade, the availability of structural information has significantly increased for many organisms as a direct consequence of the complementary nature of comparative protein modeling and experimental structure determination. This has a very positive and enabling impact on many different applications in biomedical research as described in this paper.
Collapse
Affiliation(s)
- Nicolas Guex
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | | | | |
Collapse
|
1627
|
Pei J, Ma C, Rizo J, Grishin NV. Remote homology between Munc13 MUN domain and vesicle tethering complexes. J Mol Biol 2009; 391:509-17. [PMID: 19563813 PMCID: PMC3158588 DOI: 10.1016/j.jmb.2009.06.054] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2009] [Accepted: 06/16/2009] [Indexed: 10/20/2022]
Abstract
Most core components of the neurotransmitter release machinery have homologues in other types of intracellular membrane traffic, likely underlying a universal mechanism of intracellular membrane fusion. However, no clear similarity between Munc13s and protein families generally involved in membrane traffic has been reported, despite the essential nature of Munc13s for neurotransmitter release. This crucial function was ascribed to a minimal Munc13 region called the MUN domain, which likely participates in soluble N-ethylmaleimide sensitive factor attachment protein receptor complex (SNARE) assembly and is also found in Ca(2+)-dependent activator protein for secretion. We have now used comparative sequence and structural analyses to study the structure and evolutionary origin of the MUN domain. We found weak yet significant sequence similarities between the MUN domain and a set of protein subunits from several related vesicle tethering complexes, such as Sec6 from the exocyst complex and Vps53 from the Golgi-associated retrograde protein complex. Such an evolutionary relationship allows structure prediction of the MUN domain and suggests functional similarities between MUN domain-containing proteins and multisubunit tethering complexes such as exocyst, conserved oligomeric Golgi complex, Golgi-associated retrograde protein complex, and Dsl1p. These findings further unify the mechanism of neurotransmitter release with those of other types of intracellular membrane traffic and, in turn, support a role for tethering complexes in soluble N-ethylmaleimide sensitive factor attachment protein receptor complex assembly.
Collapse
Affiliation(s)
- Jimin Pei
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA.
| | | | | | | |
Collapse
|
1628
|
Garza JA, Ilangovan U, Hinck AP, Barnes LD. Kinetic, dynamic, ligand binding properties, and structural models of a dual-substrate specific nudix hydrolase from Schizosaccharomyces pombe. Biochemistry 2009; 48:6224-39. [PMID: 19462967 DOI: 10.1021/bi802266g] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Schizosaccharomyces pombe Aps1 is a nudix hydrolase that catalyzes the hydrolysis of both diadenosine 5',5'''-P(1),P(n)-oligophosphates and diphosphoinositol polyphosphates in vitro. Nudix hydrolases act upon a wide variety of substrates, despite having a common 23 amino acid catalytic motif; hence, the residues responsible for substrate specificity are considered to reside outside the common catalytic nudix motif. The specific residues involved in binding each substrate of S. pombe Aps1 are unknown. In this study, we have conducted mutational and kinetic studies in combination with structural homology modeling and NMR spectroscopic analyses to identify potential residues involved in binding each class of substrates. This study demonstrates several major findings with regard to Aps1. First, the determination of the kinetic parameters of K(m) and k(cat) indicated that the initial 31 residues of Aps1 are not involved in substrate binding or catalysis with respect to Ap(6)A. Second, NMR spectroscopic analyses revealed the secondary structure and three dynamic backbone regions, one of which corresponds to a large insert in Aps1 as compared to other putative fungal orthologues. Third, two structural models of Aps1Delta2-19, based on the crystal structures of human DIPP1 and T. thermophilus Ndx1, were generated using homology modeling. The structural models were in excellent agreement with the NMR-derived secondary structure of Aps1Delta2-19. Fourth, NMR chemical shift mapping in conjunction with structural homology models indicated several residues outside the catalytic nudix motif that are involved in specific binding of diphosphoinositol polyphosphate or diadenosine oligophosphate ligands.
Collapse
Affiliation(s)
- John A Garza
- Department of Biochemistry, The University of Texas Health Science Center at San Antonio, 7703 Floyd Curl Drive, San Antonio, Texas 78229-3900, USA
| | | | | | | |
Collapse
|
1629
|
Generation and analysis of expressed sequence tags from leaf and root of Withania somnifera (Ashwgandha). Mol Biol Rep 2009; 37:893-902. [PMID: 19669665 DOI: 10.1007/s11033-009-9696-y] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2009] [Accepted: 07/28/2009] [Indexed: 10/20/2022]
Abstract
The first transcriptomes, expressed sequence tags (ESTs) in a leaf and root from Withania somnifera plant referenced in this report are the first of its kind. A cDNA library was constructed from samples of the 2-months-old, in vitro cultured leaves and roots, which generated 1,047 leaf cDNA and 1,034 root cDNA clones representing 48.5% and 61.5% unique sequences. The ESTs from leaf and root grouped into 239 and 230 clusters representing 22.8% and 22.2% of total sequences. Of these, about 70% encoded proteins found similar (E-value > or =10(-14)) to characterized or annotated proteins from the NCBI non-redundant database and diverse molecular functions and biological processes based on gene ontology (GO) classification. We identified genes with potential role in photosynthesis (cytochrome p-450), pathogenesis (arginine decarboxylase, chitinase) and withanolide biosynthesis (squalene epoxidase, CDP-ME kinase). Highly expressed transcripts, with a particularly high abundance of cytochrome p-450 (0.85% in leaf) were noticed. Pfam analysis revealed the presence of functional domains in selected sequences. W. somnifera is a source of multifarious and beneficial alkaloids referred as withanolides. High levels of withanolides accumulate in mature leaves and roots. Since, the knowledge for synthesis and presence of some of these important biochemical constituent is limited, identification of the genes involved in two different pathways of secondary metabolite synthesis (MVA and MEP), in different tissue will be requisite for articulation of withanolide biosynthesis. This investigation aimed at elucidating the differential gene expression in two vital sites where withanolides essentially found and leaf and root transcriptomes were comparatively analyzed. The comparative analysis of the sequences provides a framework for future research in proteomics and evolutionary genomics in the withanolide biosynthesis.
Collapse
|
1630
|
Vangelatos I, Vlachakis D, Sophianopoulou V, Diallinas G. Modelling and mutational evidence identify the substrate binding site and functional elements in APC amino acid transporters. Mol Membr Biol 2009; 26:356-70. [PMID: 19670073 DOI: 10.1080/09687680903170546] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
The Amino acid-Polyamine-Organocation (APC) superfamily is the main family of amino acid transporters found in all domains of life and one of the largest families of secondary transporters. Here, using a sensitive homology threading approach and modelling we show that the predicted structure of APC members is extremely similar to the crystal structures of several prokaryotic transporters belonging to evolutionary distinct protein families with different substrate specificities. All of these proteins, despite having no primary amino acid sequence similarity, share a similar structural core, consisting of two V-shaped domains of five transmembrane domains each, intertwined in an antiparallel topology. Based on this model, we reviewed available data on functional mutations in bacterial, fungal and mammalian APCs and obtained novel mutational data, which provide compelling evidence that the amino acid binding pocket is located in the vicinity of the unwound part of two broken helices, in a nearly identical position to the structures of similar transporters. Our analysis is fully supported by the evolutionary conservation and specific amino acid substitutions in the proposed substrate binding domains. Furthermore, it allows predictions concerning residues that might be crucial in determining the specificity profile of APC members. Finally, we show that two cytoplasmic loops constitute important functional elements in APCs. Our work along with different kinetic and specificity profiles of APC members in easily manipulated bacterial and fungal model systems could form a unique framework for combining genetic, in-silico and structural studies, for understanding the function of one of the most important transporter families.
Collapse
Affiliation(s)
- Ioannis Vangelatos
- Institute of Biology, National Center for Scientific Research Demokritos, Aghia Paraskevi 153 10, Athens, Greece
| | | | | | | |
Collapse
|
1631
|
Thevelein JM, Voordeckers K. Functioning and evolutionary significance of nutrient transceptors. Mol Biol Evol 2009; 26:2407-14. [PMID: 19651853 DOI: 10.1093/molbev/msp168] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
The discovery of nutrient transceptors, transporter-like proteins with a receptor function, suggests that receptors for chemical signals may have been derived in evolution from nutrient transporters. Several examples are now available of nutrient transporters with an additional nutrient signaling function, nutrient receptors with a transporter-like sequence and structure but without transport capacity, and G protein-coupled receptors (GPCRs) that have nutrients as ligands. Recent results have revealed that transceptor signaling requires a specific ligand-induced conformational change, which indicates that transceptors function in a similar way as regular receptors. Advanced bioinformatic analysis for detection of homology in distantly related proteins identifies the nontransporting glucose transceptor Rgt2 as the closest homologue of the glucose-sensing GPCR Gpr1 in yeast. This supports an intermediate position for nutrient transceptors in evolution, between nutrient transporters and classical receptors for chemical signals.
Collapse
Affiliation(s)
- Johan M Thevelein
- Laboratory of Molecular Cell Biology, Institute of Botany and Microbiology, Katholieke Universiteit Leuven, Leuven-Heverlee, Flanders, Belgium.
| | | |
Collapse
|
1632
|
Maciąg M, Płochocka D, Adamowicz-Salach A, Burzyńska B. Novel beta-spectrin mutations in hereditary spherocytosis associated with decreased levels of mRNA. Br J Haematol 2009; 146:326-32. [DOI: 10.1111/j.1365-2141.2009.07759.x] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
1633
|
Abstract
MOTIVATION Homologous protein families share highly conserved sequence and structure regions that are frequent targets for comparative analysis of related proteins and families. Many protein families, such as the curated domain families in the Conserved Domain Database (CDD), exhibit similar structural cores. To improve accuracy in aligning such protein families, we propose a profile-profile method CORAL that aligns individual core regions as gap-free units. RESULTS CORAL computes optimal local alignment of two profiles with heuristics to preserve continuity within core regions. We benchmarked its performance on curated domains in CDD, which have pre-defined core regions, against COMPASS, HHalign and PSI-BLAST, using structure superpositions and comprehensive curator-optimized alignments as standards of truth. CORAL improves alignment accuracy on core regions over general profile methods, returning a balanced score of 0.57 for over 80% of all domain families in CDD, compared with the highest balanced score of 0.45 from other methods. Further, CORAL provides E-values to aid in detecting homologous protein families and, by respecting block boundaries, produces alignments with improved 'readability' that facilitate manual refinement. AVAILABILITY CORAL will be included in future versions of the NCBI Cn3D/CDTree software, which can be downloaded at http://www.ncbi.nlm.nih.gov/Structure/cdtree/cdtree.shtml. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jessica H Fong
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | | |
Collapse
|
1634
|
Morita M, Saito S, Ikeda K, Ohno K, Sugawara K, Suzuki T, Togawa T, Sakuraba H. Structural bases of GM1 gangliosidosis and Morquio B disease. J Hum Genet 2009; 54:510-5. [PMID: 19644515 DOI: 10.1038/jhg.2009.70] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Allelic mutations of the lysosomal beta-galactosidase gene cause heterogeneous clinical phenotypes, such as GM1 gangliosidosis and Morquio B disease, the former being further classified into three variants, namely infantile, juvenile and adult forms; and heterogeneous biochemical phenotypes were shown in these forms. We tried to elucidate the bases of these diseases from a structural viewpoint. We first constructed a three-dimensional structural model of human beta-galactosidase by means of homology modeling. The human beta-galactosidase consists of three domains, such as, a TIM barrel fold domain, which functions as a catalytic domain, and two galactose-binding domain-like fold domains. We then constructed structural models of representative mutant beta-galactosidase proteins (G123R, R201C, I51T and Y83H) and predicted the structural change associated with each phenotype by calculating the number of affected atoms, determining the root-mean-square deviation and the solvent-accessible surface area, and by color imaging. The results show that there is a good correlation between the structural changes caused by amino-acid substitutions in the beta-galactosidase molecule, as well as biochemical and clinical phenotypes in these representative cases. Protein structural study is useful for elucidating the bases of these diseases.
Collapse
Affiliation(s)
- Mizuki Morita
- Agricultural Bioinformatics Research Unit, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | | | | | | | | | | | | | | |
Collapse
|
1635
|
Deschavanne P, Tufféry P. Enhanced protein fold recognition using a structural alphabet. Proteins 2009; 76:129-37. [DOI: 10.1002/prot.22324] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
1636
|
Amino acid changes in elongation factor Tu of Mycoplasma pneumoniae and Mycoplasma genitalium influence fibronectin binding. Infect Immun 2009; 77:3533-41. [PMID: 19546194 DOI: 10.1128/iai.00081-09] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Mycoplasma pneumoniae and Mycoplasma genitalium are closely related organisms that cause distinct clinical manifestations and possess different tissue predilections despite their high degree of genome homology. We reported earlier that surface-localized M. pneumoniae elongation factor Tu (EF-Tu(Mp)) mediates binding to the extracellular matrix component fibronectin (Fn) through the carboxyl region of EF-Tu. In this study, we demonstrate that surface-associated M. genitalium EF-Tu (EF-Tu(Mg)), in spite of sharing 96% identity with EF-Tu(Mp), does not bind Fn. We utilized this finding to identify the essential amino acids of EF-Tu(Mp) that mediate Fn interactions by generating modified recombinant EF-Tu proteins with amino acid changes corresponding to those of EF-Tu(Mg). Amino acid changes in serine 343, proline 345, and threonine 357 were sufficient to significantly reduce the Fn binding of EF-Tu(Mp). Synthetic peptides corresponding to this region of EF-Tu(Mp) (EF-Tu(Mp) 340-358) blocked both recombinant EF-Tu(Mp) and radiolabeled M. pneumoniae cell binding to Fn. In contrast, EF-Tu(Mg) 340-358 peptides exhibited minimal blocking activity, reinforcing the specificity of EF-Tu-Fn interactions as mediators of microbial colonization and tissue tropism.
Collapse
|
1637
|
Schindler C, Rodriguez F, Poon PP, Singer RA, Johnston GC, Spang A. The GAP domain and the SNARE, coatomer and cargo interaction region of the ArfGAP2/3 Glo3 are sufficient for Glo3 function. Traffic 2009; 10:1362-75. [PMID: 19602196 DOI: 10.1111/j.1600-0854.2009.00952.x] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The ArfGAP Glo3 is required for coat protein I vesicle generation in the Golgi-endoplasmic reticulum (ER) shuttle. The best-understood role of Glo3 is the stimulation of the GTPase activity of Arf1. In this study, we characterized functional domains of the ArfGAP Glo3 and identified an interaction interface for coatomer, SNAREs and cargo in the central region of Glo3 (BoCCS region). The GAP domain together with the BoCCS region is necessary and sufficient for all vital Glo3 functions. Expression of a truncated Glo3 lacking the GAP domain results in a dominant negative growth phenotype in glo3Delta cells at 37 degrees C. This phenotype was alleviated by mutating either the BoCCS region or the Glo3 regulatory motif (GRM), or by overexpression of ER-Golgi SNAREs or the ArfGAP Gcs1. The GRM is not essential for Glo3 function; it may act as an intrinsic sensor coupling GAP activity to SNARE binding to avoid dead-end complex formation at the Golgi membrane. Our data suggest that membrane-interaction modules and cargo-sensing regions have evolved independently in ArfGAP1s versus ArfGAP2/3s.
Collapse
Affiliation(s)
- Christina Schindler
- Biozentrum, University of Basel, Klingelbergstrasse 70, CH-4056 Basel, Switzerland
| | | | | | | | | | | |
Collapse
|
1638
|
Abstract
UNLABELLED Sensitive and accurate detection of distant protein homology is essential for the studies of protein structure, function and evolution. We recently developed PROCAIN, a method that is based on sequence profile comparison and involves the analysis of four signals--similarities of residue content at the profile positions combined with three types of assisting information: sequence motifs, residue conservation and predicted secondary structure. Here we present the PROCAIN web server that allows the user to submit a query sequence or multiple sequence alignment and perform the search in a profile database of choice. The output is structured similar to that of BLAST, with the list of detected homologs sorted by E-value and followed by profile-profile alignments. The front page allows the user to adjust multiple options of input processing and output formatting, as well as search settings, including the relative weights assigned to the three types of assisting information. AVAILABILITY http://prodata.swmed.edu/procain/.
Collapse
Affiliation(s)
- Yong Wang
- Biomedical Engineering Program, University of Texas Southwestern Medical Center, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA
| | | | | |
Collapse
|
1639
|
Sadreyev RI, Kim BH, Grishin NV. Discrete-continuous duality of protein structure space. Curr Opin Struct Biol 2009; 19:321-8. [PMID: 19482467 PMCID: PMC3688466 DOI: 10.1016/j.sbi.2009.04.009] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2009] [Revised: 04/29/2009] [Accepted: 04/29/2009] [Indexed: 11/30/2022]
Abstract
Recently, the nature of protein structure space has been widely discussed in the literature. The traditional discrete view of protein universe as a set of separate folds has been criticized in the light of growing evidence that almost any arrangement of secondary structures is possible and the whole protein space can be traversed through a path of similar structures. Here we argue that the discrete and continuous descriptions are not mutually exclusive, but complementary: the space is largely discrete in evolutionary sense, but continuous geometrically when purely structural similarities are quantified. Evolutionary connections are mainly confined to separate structural prototypes corresponding to folds as islands of structural stability, with few remaining traceable links between the islands. However, for a geometric similarity measure, it is usually possible to find a reasonable cutoff that yields paths connecting any two structures through intermediates.
Collapse
Affiliation(s)
- Ruslan I. Sadreyev
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA
| | - Bong-Hyun Kim
- Department of Biochemistry, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA
| | - Nick V. Grishin
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA
- Department of Biochemistry, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA
| |
Collapse
|
1640
|
Zhou H, Skolnick J. Protein structure prediction by pro-Sp3-TASSER. Biophys J 2009; 96:2119-27. [PMID: 19289038 DOI: 10.1016/j.bpj.2008.12.3898] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2008] [Revised: 11/12/2008] [Accepted: 12/03/2008] [Indexed: 12/29/2022] Open
Abstract
An automated protein structure prediction algorithm, pro-sp3-Threading/ASSEmbly/Refinement (TASSER), is described and benchmarked. Structural templates are identified using five different scoring functions derived from the previously developed threading methods PROSPECTOR_3 and SP(3). Top templates identified by each scoring function are combined to derive contact and distant restraints for subsequent model refinement by short TASSER simulations. For Medium/Hard targets (those with moderate to poor quality templates and/or alignments), alternative template alignments are also generated by parametric alignment and the top models selected by TASSER-QA are included in the contact and distance restraint derivation. Then, multiple short TASSER simulations are used to generate an ensemble of full-length models. Subsequently, the top models are selected from the ensemble by TASSER-QA and used to derive TASSER contacts and distant restraints for another round of full TASSER refinement. The final models are selected from both rounds of TASSER simulations by TASSER-QA. We compare pro-sp3-TASSER with our previously developed MetaTASSER method (enhanced with chunk-TASSER for Medium/Hard targets) on a representative test data set of 723 proteins <250 residues in length. For the 348 proteins classified as easy targets (those templates with good alignments and global structure similarity to the target), the cumulative TM-score of the best of top five models by pro-sp3-TASSER shows a 2.1% improvement over MetaTASSER. For the 155/220 medium/hard targets, the improvements in TM-score are 2.8% and 2.2%, respectively. All improvements are statistically significant. More importantly, the number of foldable targets (those having models whose TM-score to native >0.4 in the top five clusters) increases from 472 to 497 for all targets, and the relative increases for medium and hard targets are 10% and 15%, respectively. A server that implements the above algorithm is available at http://cssb.biology.gatech.edu/skolnick/webservice/pro-sp3-TASSER/. The source code is also available upon request.
Collapse
Affiliation(s)
- Hongyi Zhou
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, Atlanta, Georgia, USA
| | | |
Collapse
|
1641
|
de Groot R, Bardhan A, Ramroop N, Lane DA, Crawley JTB. Essential role of the disintegrin-like domain in ADAMTS13 function. Blood 2009; 113:5609-16. [PMID: 19234142 DOI: 10.1182/blood-2008-11-187914] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
ADAMTS13 is a highly specific multidomain plasma metalloprotease that regulates the multimeric size and function of von Willebrand factor (VWF) through cleavage at a single site in the VWF A2 domain. The precise role that the ADAMTS13 disintegrin-like domain plays in its function remains uncertain. Truncated ADAMTS13 variants suggested the importance of the disintegrin-like domain for both enzyme activity and specificity. Targeted mutagenesis of nonconserved regions (among ADAMTS family members) in the disintegrin-like domain identified 3 of 8 ADAMTS13 mutants (R349A, L350G, V352G) with reduced proteolytic activity. Kinetic analyses revealed a 5- to 20-fold reduction in catalytic efficiency of VWF115 (VWF residues 1554-1668) proteolysis by these mutants. These residues form a predicted exposed exosite on the surface of the disintegrin-like domain that lies approximately 26 A from the active site. Kinetic analysis of VWF115 carrying the D1614A mutation suggested that Arg349 in the ADAMTS13 disintegrin-like domain interacts directly with Asp1614 in VWF A2. We hypothesize that this interaction assists in positioning the scissile bond within the active site of ADAMTS13 and therefore plays a major role in determining cleavage parameters (K(m) and k(cat)), as opposed to binding affinity (K(d)) of ADAMTS13 for VWF, the latter being primarily determined by the spacer domain.
Collapse
Affiliation(s)
- Rens de Groot
- Department of Haematology, Imperial College London, London, United Kingdom
| | | | | | | | | |
Collapse
|
1642
|
Liefhebber JMP, Brandt BW, Broer R, Spaan WJM, van Leeuwen HC. Hepatitis C virus NS4B carboxy terminal domain is a membrane binding domain. Virol J 2009; 6:62. [PMID: 19467155 PMCID: PMC2698844 DOI: 10.1186/1743-422x-6-62] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2009] [Accepted: 05/25/2009] [Indexed: 12/14/2022] Open
Abstract
Background Hepatitis C virus (HCV) induces membrane rearrangements during replication. All HCV proteins are associated to membranes, pointing out the importance of membranes for HCV. Non structural protein 4B (NS4B) has been reported to induce cellular membrane alterations like the membranous web. Four transmembrane segments in the middle of the protein anchor NS4B to membranes. An amphipatic helix at the amino-terminus attaches to membranes as well. The carboxy-terminal domain (CTD) of NS4B is highly conserved in Hepaciviruses, though its function remains unknown. Results A cytosolic localization is predicted for the NS4B-CTD. However, using membrane floatation assays and immunofluorescence, we now show targeting of the NS4B-CTD to membranes. Furthermore, a profile-profile search, with an HCV NS4B-CTD multiple sequence alignment, indicates sequence similarity to the membrane binding domain of prokaryotic D-lactate dehydrogenase (d-LDH). The crystal structure of E. coli d-LDH suggests that the region similar to NS4B-CTD is located in the membrane binding domain (MBD) of d-LDH, implying analogy in membrane association. Targeting of d-LDH to membranes occurs via electrostatic interactions of positive residues on the outside of the protein with negative head groups of lipids. To verify that anchorage of d-LDH MBD and NS4B-CTD is analogous, NS4B-CTD mutants were designed to disrupt these electrostatic interactions. Membrane association was confirmed by swopping the membrane contacting helix of d-LDH with the corresponding domain of the 4B-CTD. Furthermore, the functionality of these residues was tested in the HCV replicon system. Conclusion Together these data show that NS4B-CTD is associated to membranes, similar to the prokaryotic d-LDH MBD, and is important for replication.
Collapse
Affiliation(s)
- Jolanda M P Liefhebber
- Department of Medical Microbiology, Center of Infectious Diseases, Leiden University Medical Center, 2300 RC Leiden, The Netherlands.
| | | | | | | | | |
Collapse
|
1643
|
Pons JL, Labesse G. @TOME-2: a new pipeline for comparative modeling of protein-ligand complexes. Nucleic Acids Res 2009; 37:W485-91. [PMID: 19443448 PMCID: PMC2703933 DOI: 10.1093/nar/gkp368] [Citation(s) in RCA: 109] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
@TOME 2.0 is new web pipeline dedicated to protein structure modeling and small ligand docking based on comparative analyses. @TOME 2.0 allows fold recognition, template selection, structural alignment editing, structure comparisons, 3D-model building and evaluation. These tasks are routinely used in sequence analyses for structure prediction. In our pipeline the necessary software is efficiently interconnected in an original manner to accelerate all the processes. Furthermore, we have also connected comparative docking of small ligands that is performed using protein-protein superposition. The input is a simple protein sequence in one-letter code with no comment. The resulting 3D model, protein-ligand complexes and structural alignments can be visualized through dedicated Web interfaces or can be downloaded for further studies. These original features will aid in the functional annotation of proteins and the selection of templates for molecular modeling and virtual screening. Several examples are described to highlight some of the new functionalities provided by this pipeline. The server and its documentation are freely available at http://abcis.cbs.cnrs.fr/AT2/
Collapse
Affiliation(s)
- Jean-Luc Pons
- A.B.C.I.S., CNRS UMR5048-Universités Montpellier 1 & Montpellier II and INSERM U554, 29, Rue de Navacelles, 34090 Montpellier Cedex, France
| | | |
Collapse
|
1644
|
Sadreyev RI, Tang M, Kim BH, Grishin NV. COMPASS server for homology detection: improved statistical accuracy, speed and functionality. Nucleic Acids Res 2009; 37:W90-4. [PMID: 19435884 PMCID: PMC2703893 DOI: 10.1093/nar/gkp360] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
COMPASS is a profile-based method for the detection of remote sequence similarity and the prediction of protein structure. Here we describe a recently improved public web server of COMPASS, http://prodata.swmed.edu/compass. The server features three major developments: (i) improved statistical accuracy; (ii) increased speed from parallel implementation; and (iii) new functional features facilitating structure prediction. These features include visualization tools that allow the user to quickly and effectively analyze specific local structural region predictions suggested by COMPASS alignments. As an application example, we describe the structural, evolutionary and functional analysis of a protein with unknown function that served as a target in the recent CASP8 (Critical Assessment of Techniques for Protein Structure Prediction round 8). URL: http://prodata.swmed.edu/compass
Collapse
Affiliation(s)
- Ruslan I Sadreyev
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA.
| | | | | | | |
Collapse
|
1645
|
Remmert M, Linke D, Lupas AN, Söding J. HHomp--prediction and classification of outer membrane proteins. Nucleic Acids Res 2009; 37:W446-51. [PMID: 19429691 PMCID: PMC2703889 DOI: 10.1093/nar/gkp325] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Outer membrane proteins (OMPs) are the transmembrane proteins found in the outer membranes of Gram-negative bacteria, mitochondria and plastids. Most prediction methods have focused on analogous features, such as alternating hydrophobicity patterns. Here, we start from the observation that almost all beta-barrel OMPs are related by common ancestry. We identify proteins as OMPs by detecting their homologous relationships to known OMPs using sequence similarity. Given an input sequence, HHomp builds a profile hidden Markov model (HMM) and compares it with an OMP database by pairwise HMM comparison, integrating OMP predictions by PROFtmb. A crucial ingredient is the OMP database, which contains profile HMMs for over 20,000 putative OMP sequences. These were collected with the exhaustive, transitive homology detection method HHsenser, starting from 23 representative OMPs in the PDB database. In a benchmark on TransportDB, HHomp detects 63.5% of the true positives before including the first false positive. This is 70% more than PROFtmb, four times more than BOMP and 10 times more than TMB-Hunt. In Escherichia coli, HHomp identifies 57 out of 59 known OMPs and correctly assigns them to their functional subgroups. HHomp can be accessed at http://toolkit.tuebingen.mpg.de/hhomp.
Collapse
Affiliation(s)
- Michael Remmert
- Department of Protein Evolution, Max-Planck-Institute for Developmental Biology, Spemannstrasse 35, 72076 Tübingen, Germany
| | | | | | | |
Collapse
|
1646
|
Da Silva M, Upton C. Vaccinia virus G8R protein: a structural ortholog of proliferating cell nuclear antigen (PCNA). PLoS One 2009; 4:e5479. [PMID: 19421403 PMCID: PMC2674943 DOI: 10.1371/journal.pone.0005479] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2009] [Accepted: 04/15/2009] [Indexed: 11/30/2022] Open
Abstract
Background Eukaryotic DNA replication involves the synthesis of both a DNA leading and lagging strand, the latter requiring several additional proteins including flap endonuclease (FEN-1) and proliferating cell nuclear antigen (PCNA) in order to remove RNA primers used in the synthesis of Okazaki fragments. Poxviruses are complex viruses (dsDNA genomes) that infect eukaryotes, but surprisingly little is known about the process of DNA replication. Given our previous results that the vaccinia virus (VACV) G5R protein may be structurally similar to a FEN-1-like protein and a recent finding that poxviruses encode a primase function, we undertook a series of in silico analyses to identify whether VACV also encodes a PCNA-like protein. Results An InterProScan of all VACV proteins using the JIPS software package was used to identify any PCNA-like proteins. The VACV G8R protein was identified as the only vaccinia protein that contained a PCNA-like sliding clamp motif. The VACV G8R protein plays a role in poxvirus late transcription and is known to interact with several other poxvirus proteins including itself. The secondary and tertiary structure of the VACV G8R protein was predicted and compared to the secondary and tertiary structure of both human and yeast PCNA proteins, and a high degree of similarity between all three proteins was noted. Conclusions The structure of the VACV G8R protein is predicted to closely resemble the eukaryotic PCNA protein; it possesses several other features including a conserved ubiquitylation and SUMOylation site that suggest that, like its counterpart in T4 bacteriophage (gp45), it may function as a sliding clamp ushering transcription factors to RNA polymerase during late transcription.
Collapse
Affiliation(s)
- Melissa Da Silva
- Department of Biochemistry and Microbiology, University of Victoria, Victoria, British Columbia, Canada
| | - Chris Upton
- Department of Biochemistry and Microbiology, University of Victoria, Victoria, British Columbia, Canada
- * E-mail:
| |
Collapse
|
1647
|
Lobley A, Sadowski MI, Jones DT. pGenTHREADER and pDomTHREADER: new methods for improved protein fold recognition and superfamily discrimination. ACTA ACUST UNITED AC 2009; 25:1761-7. [PMID: 19429599 DOI: 10.1093/bioinformatics/btp302] [Citation(s) in RCA: 220] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Generation of structural models and recognition of homologous relationships for unannotated protein sequences are fundamental problems in bioinformatics. Improving the sensitivity and selectivity of methods designed for these two tasks therefore has downstream benefits for many other bioinformatics applications. RESULTS We describe the latest implementation of the GenTHREADER method for structure prediction on a genomic scale. The method combines profile-profile alignments with secondary-structure specific gap-penalties, classic pair- and solvation potentials using a linear combination optimized with a regression SVM model. We find this combination significantly improves both detection of useful templates and accuracy of sequence-structure alignments relative to other competitive approaches. We further present a second implementation of the protocol designed for the task of discriminating superfamilies from one another. This method, pDomTHREADER, is the first to incorporate both sequence and structural data directly in this task and improves sensitivity and selectivity over the standard version of pGenTHREADER and three other standard methods for remote homology detection.
Collapse
Affiliation(s)
- Anna Lobley
- Department of Computer Science, University College London, UK
| | | | | |
Collapse
|
1648
|
Gao X, Bu D, Xu J, Li M. Improving consensus contact prediction via server correlation reduction. BMC STRUCTURAL BIOLOGY 2009; 9:28. [PMID: 19419562 PMCID: PMC2689239 DOI: 10.1186/1472-6807-9-28] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/25/2008] [Accepted: 05/06/2009] [Indexed: 11/10/2022]
Abstract
Background Protein inter-residue contacts play a crucial role in the determination and prediction of protein structures. Previous studies on contact prediction indicate that although template-based consensus methods outperform sequence-based methods on targets with typical templates, such consensus methods perform poorly on new fold targets. However, we find out that even for new fold targets, the models generated by threading programs can contain many true contacts. The challenge is how to identify them. Results In this paper, we develop an integer linear programming model for consensus contact prediction. In contrast to the simple majority voting method assuming that all the individual servers are equally important and independent, the newly developed method evaluates their correlation by using maximum likelihood estimation and extracts independent latent servers from them by using principal component analysis. An integer linear programming method is then applied to assign a weight to each latent server to maximize the difference between true contacts and false ones. The proposed method is tested on the CASP7 data set. If the top L/5 predicted contacts are evaluated where L is the protein size, the average accuracy is 73%, which is much higher than that of any previously reported study. Moreover, if only the 15 new fold CASP7 targets are considered, our method achieves an average accuracy of 37%, which is much better than that of the majority voting method, SVM-LOMETS, SVM-SEQ, and SAM-T06. These methods demonstrate an average accuracy of 13.0%, 10.8%, 25.8% and 21.2%, respectively. Conclusion Reducing server correlation and optimally combining independent latent servers show a significant improvement over the traditional consensus methods. This approach can hopefully provide a powerful tool for protein structure refinement and prediction use.
Collapse
Affiliation(s)
- Xin Gao
- David R, Cheriton School of Computer Science, University of Waterloo, N2L3G1, Canada.
| | | | | | | |
Collapse
|
1649
|
Brandt BW, Heringa J. webPRC: the Profile Comparer for alignment-based searching of public domain databases. Nucleic Acids Res 2009; 37:W48-52. [PMID: 19420063 PMCID: PMC2703954 DOI: 10.1093/nar/gkp279] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Profile–profile methods are well suited to detect remote evolutionary relationships between protein families. Profile Comparer (PRC) is an existing stand-alone program for scoring and aligning hidden Markov models (HMMs), which are based on multiple sequence alignments. Since PRC compares profile HMMs instead of sequences, it can be used to find distant homologues. For this purpose, PRC is used by, for example, the CATH and Pfam-domain databases. As PRC is a profile comparer, it only reports profile HMM alignments and does not produce multiple sequence alignments. We have developed webPRC server, which makes it straightforward to search for distant homologues or similar alignments in a number of domain databases. In addition, it provides the results both as multiple sequence alignments and aligned HMMs. Furthermore, the user can view the domain annotation, evaluate the PRC hits with the Jalview multiple alignment editor and generate logos from the aligned HMMs or the aligned multiple alignments. Thus, this server assists in detecting distant homologues with PRC as well as in evaluating and using the results. The webPRC interface is available at http://www.ibi.vu.nl/programs/prcwww/.
Collapse
Affiliation(s)
- Bernd W Brandt
- Centre for Integrative Bioinformatics (IBIVU), VU University Amsterdam, The Netherlands.
| | | |
Collapse
|
1650
|
Kim BH, Cheng H, Grishin NV. HorA web server to infer homology between proteins using sequence and structural similarity. Nucleic Acids Res 2009; 37:W532-8. [PMID: 19417074 PMCID: PMC2703895 DOI: 10.1093/nar/gkp328] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
The biological properties of proteins are often gleaned through comparative analysis of evolutionary relatives. Although protein structure similarity search methods detect more distant homologs than purely sequence-based methods, structural resemblance can result from either homology (common ancestry) or analogy (similarity without common ancestry). While many existing web servers detect structural neighbors, they do not explicitly address the question of homology versus analogy. Here, we present a web server named HorA (Homology or Analogy) that identifies likely homologs for a query protein structure. Unlike other servers, HorA combines sequence information from state-of-the-art profile methods with structure information from spatial similarity measures using an advanced computational technique. HorA aims to identify biologically meaningful connections rather than purely 3D-geometric similarities. The HorA method finds approximately 90% of remote homologs defined in the manually curated database SCOP. HorA will be especially useful for finding remote homologs that might be overlooked by other sequence or structural similarity search servers. The HorA server is available at http://prodata.swmed.edu/horaserver.
Collapse
Affiliation(s)
- Bong-Hyun Kim
- Department of Biochemistry, University of Texas, Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA
| | | | | |
Collapse
|