Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Fischer D, Barret C, Bryson K, Elofsson A, Godzik A, Jones D, Karplus KJ, Kelley LA, MacCallum RM, Pawowski K, Rost B, Rychlewski L, Sternberg M. CAFASP-1: critical assessment of fully automated structure prediction methods. Proteins 1999;Suppl 3:209-17. [PMID: 10526371 DOI: 10.1002/(sici)1097-0134(1999)37:3+<209::aid-prot27>3.3.co;2-p] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

For:	Fischer D, Barret C, Bryson K, Elofsson A, Godzik A, Jones D, Karplus KJ, Kelley LA, MacCallum RM, Pawowski K, Rost B, Rychlewski L, Sternberg M. CAFASP-1: critical assessment of fully automated structure prediction methods. Proteins 1999;Suppl 3:209-17. [PMID: 10526371 DOI: 10.1002/(sici)1097-0134(1999)37:3+<209::aid-prot27>3.3.co;2-p] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Number

Cited by Other Article(s)

Gadzała M, Kalinowska B, Banach M, Konieczny L, Roterman I. Determining protein similarity by comparing hydrophobic core structure. Heliyon 2017;3:e00235. [PMID: 28217749 PMCID: PMC5300504 DOI: 10.1016/j.heliyon.2017.e00235] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2016] [Revised: 12/06/2016] [Accepted: 01/19/2017] [Indexed: 12/19/2022] Open

Shatnawi M, Zaki N, Yoo PD. Protein inter-domain linker prediction using Random Forest and amino acid physiochemical properties. BMC Bioinformatics 2014;15 Suppl 16:S8. [PMID: 25521329 PMCID: PMC4290662 DOI: 10.1186/1471-2105-15-s16-s8] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Zimic M, Gutiérrez AH, Gilman RH, López C, Quiliano M, Evangelista W, Gonzales A, García HH, Sheen P. Immunoinformatics prediction of linear epitopes from Taenia solium TSOL18. Bioinformation 2011;6:271-4. [PMID: 21738328 PMCID: PMC3124692 DOI: 10.6026/97320630006271] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2011] [Accepted: 06/03/2011] [Indexed: 11/23/2022] Open

Yan RX, Si JN, Wang C, Zhang Z. DescFold: a web server for protein fold recognition. BMC Bioinformatics 2009;10:416. [PMID: 20003426 PMCID: PMC2803855 DOI: 10.1186/1471-2105-10-416] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2009] [Accepted: 12/14/2009] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Machine learning-based methods have been proven to be powerful in developing new fold recognition tools. In our previous work [Zhang, Kochhar and Grigorov (2005) Protein Science, 14: 431-444], a machine learning-based method called DescFold was established by using Support Vector Machines (SVMs) to combine the following four descriptors: a profile-sequence-alignment-based descriptor using Psi-blast e-values and bit scores, a sequence-profile-alignment-based descriptor using Rps-blast e-values and bit scores, a descriptor based on secondary structure element alignment (SSEA), and a descriptor based on the occurrence of PROSITE functional motifs. In this work, we focus on the improvement of DescFold by incorporating more powerful descriptors and setting up a user-friendly web server.

RESULTS

In seeking more powerful descriptors, the profile-profile alignment score generated from the COMPASS algorithm was first considered as a new descriptor (i.e., PPA). When considering a profile-profile alignment between two proteins in the context of fold recognition, one protein is regarded as a template (i.e., its 3D structure is known). Instead of a sequence profile derived from a Psi-blast search, a structure-seeded profile for the template protein was generated by searching its structural neighbors with the assistance of the TM-align structural alignment algorithm. Moreover, the COMPASS algorithm was used again to derive a profile-structural-profile-alignment-based descriptor (i.e., PSPA). We trained and tested the new DescFold in a total of 1,835 highly diverse proteins extracted from the SCOP 1.73 version. When the PPA and PSPA descriptors were introduced, the new DescFold boosts the performance of fold recognition substantially. Using the SCOP_1.73_40% dataset as the fold library, the DescFold web server based on the trained SVM models was further constructed. To provide a large-scale test for the new DescFold, a stringent test set of 1,866 proteins were selected from the SCOP 1.75 version. At a less than 5% false positive rate control, the new DescFold is able to correctly recognize structural homologs at the fold level for nearly 46% test proteins. Additionally, we also benchmarked the DescFold method against several well-established fold recognition algorithms through the LiveBench targets and Lindahl dataset.

CONCLUSIONS

The new DescFold method was intensively benchmarked to have very competitive performance compared with some well-established fold recognition methods, suggesting that it can serve as a useful tool to assist in template-based protein structure prediction. The DescFold server is freely accessible at http://202.112.170.199/DescFold/index.html.

Collapse

Karplus K. SAM-T08, HMM-based protein structure prediction. Nucleic Acids Res 2009;37:W492-7. [PMID: 19483096 PMCID: PMC2703928 DOI: 10.1093/nar/gkp403] [Citation(s) in RCA: 103] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open

Doxey AC, Lynch MDJ, Müller KM, Meiering EM, McConkey BJ. Insights into the evolutionary origins of clostridial neurotoxins from analysis of the Clostridium botulinum strain A neurotoxin gene cluster. BMC Evol Biol 2008;8:316. [PMID: 19014598 PMCID: PMC2605760 DOI: 10.1186/1471-2148-8-316] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2008] [Accepted: 11/14/2008] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Clostridial neurotoxins (CNTs) are the most deadly toxins known and causal agents of botulism and tetanus neuroparalytic diseases. Despite considerable progress in understanding CNT structure and function, the evolutionary origins of CNTs remain a mystery as they are unique to Clostridium and possess a sequence and structural architecture distinct from other protein families. Uncovering the origins of CNTs would be a significant contribution to our understanding of how pathogens evolve and generate novel toxin families.

RESULTS

The C. botulinum strain A genome was examined for potential homologues of CNTs. A key link was identified between the neurotoxin and the flagellin gene (CBO0798) located immediately upstream of the BoNT/A neurotoxin gene cluster. This flagellin sequence displayed the strongest sequence similarity to the neurotoxin and NTNH homologue out of all proteins encoded within C. botulinum strain A. The CBO0798 gene contains a unique hypervariable region, which in closely related flagellins encodes a collagenase-like domain. Remarkably, these collagenase-containing flagellins were found to possess the characteristic HEXXH zinc-protease motif responsible for the neurotoxin's endopeptidase activity. Additional links to collagenase-related sequences and functions were detected by further analysis of CNTs and surrounding genes, including sequence similarities to collagen-adhesion domains and collagenases. Furthermore, the neurotoxin's HCRn domain was found to exhibit both structural and sequence similarity to eukaryotic collagen jelly-roll domains.

CONCLUSION

Multiple lines of evidence suggest that the neurotoxin and adjacent genes evolved from an ancestral collagenase-like gene cluster, linking CNTs to another major family of clostridial proteolytic toxins. Duplication, reshuffling and assembly of neighboring genes within the BoNT/A neurotoxin gene cluster may have lead to the neurotoxin's unique architecture. This work provides new insights into the evolution of C. botulinum neurotoxins and the evolutionary mechanisms underlying the origins of virulent genes.

Collapse

Altman RB, Bergman CM, Blake J, Blaschke C, Cohen A, Gannon F, Grivell L, Hahn U, Hersh W, Hirschman L, Jensen LJ, Krallinger M, Mons B, O'Donoghue SI, Peitsch MC, Rebholz-Schuhmann D, Shatkay H, Valencia A. Text mining for biology--the way forward: opinions from leading scientists. Genome Biol 2008;9 Suppl 2:S7. [PMID: 18834498 PMCID: PMC2559991 DOI: 10.1186/gb-2008-9-s2-s7] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

Cheng J, Baldi P. Improved residue contact prediction using support vector machines and a large feature set. BMC Bioinformatics 2007;8:113. [PMID: 17407573 PMCID: PMC1852326 DOI: 10.1186/1471-2105-8-113] [Citation(s) in RCA: 174] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2006] [Accepted: 04/02/2007] [Indexed: 11/12/2022] Open

Barberis M, De Gioia L, Ruzzene M, Sarno S, Coccetti P, Fantucci P, Vanoni M, Alberghina L. The yeast cyclin-dependent kinase inhibitor Sic1 and mammalian p27Kip1 are functional homologues with a structurally conserved inhibitory domain. Biochem J 2006;387:639-47. [PMID: 15649124 PMCID: PMC1134993 DOI: 10.1042/bj20041299] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]

Wallner B, Elofsson A. Identification of correct regions in protein models using structural, alignment, and consensus information. Protein Sci 2006;15:900-13. [PMID: 16522791 PMCID: PMC2242478 DOI: 10.1110/ps.051799606] [Citation(s) in RCA: 122] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]

Challis RJ, Goodacre SL, Hewitt GM. Evolution of spider silks: conservation and diversification of the C-terminus. INSECT MOLECULAR BIOLOGY 2006;15:45-56. [PMID: 16469067 DOI: 10.1111/j.1365-2583.2005.00606.x] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]

Draker R, Roper RL, Petric M, Tellier R. The complete sequence of the bovine torovirus genome. Virus Res 2005;115:56-68. [PMID: 16137782 PMCID: PMC7114287 DOI: 10.1016/j.virusres.2005.07.005] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2005] [Revised: 07/05/2005] [Accepted: 07/12/2005] [Indexed: 12/15/2022]

Jaroszewski L, Rychlewski L, Li Z, Li W, Godzik A. FFAS03: a server for profile--profile sequence alignments. Nucleic Acids Res 2005;33:W284-8. [PMID: 15980471 PMCID: PMC1160179 DOI: 10.1093/nar/gki418] [Citation(s) in RCA: 456] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

Thelin WR, Hodson CA, Milgram SL. Beyond the brush border: NHERF4 blazes new NHERF turf. J Physiol 2005;567:13-9. [PMID: 15994182 PMCID: PMC1474171 DOI: 10.1113/jphysiol.2005.091041] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open

Ginalski K, Grishin NV, Godzik A, Rychlewski L. Practical lessons from protein structure prediction. Nucleic Acids Res 2005;33:1874-91. [PMID: 15805122 PMCID: PMC1074308 DOI: 10.1093/nar/gki327] [Citation(s) in RCA: 99] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open

Goulielmos GN, Eliopoulos E, Loukas M, Tsakas S. Functional constraints of 6-phosphogluconate dehydrogenase (6-PGD) based on sequence and structural information. J Mol Evol 2005;59:358-71. [PMID: 15553090 DOI: 10.1007/s00239-004-2630-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]

Hashimoto Y, Lawrence P. Comparative analysis of selected genes from Diachasmimorpha longicaudata entomopoxvirus and other poxviruses. JOURNAL OF INSECT PHYSIOLOGY 2005;51:207-20. [PMID: 15749105 PMCID: PMC7094658 DOI: 10.1016/j.jinsphys.2004.10.010] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2004] [Accepted: 10/22/2004] [Indexed: 05/16/2023]

Eliopoulos E, Goulielmos GN, Loukas M. Functional constraints of alcohol dehydrogenase (ADH) of tephritidae and relationships with other Dipteran species. J Mol Evol 2004;58:493-505. [PMID: 15170253 DOI: 10.1007/s00239-003-2568-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2003] [Accepted: 11/04/2003] [Indexed: 10/26/2022]

Abstract

Alcohol dehydrogenase is considered a very important enzyme in insect metabolism because it is involved (in its homodimeric form) in the catalysis of the reversible conversion of various alcohols in larval feeding sites to their corresponding aldehydes and ketones, thus contributing to detoxification and metabolic purposes. Using 14 amino acid ADH sequences recently determined in our laboratory, we constructed a three-dimensional (3D) model of olive fruit fly Bactrocera oleae ADH1 and ADH2, based on the known homologous Drosophila lebanonensis ADH structure, and the amino acid residues that have been proposed as being responsible for catalysis were located on it. Moreover, in a comparative study of the ADH sequences, the residues occupying characteristic positions in the ADH of species of the Bactrocera and Ceratitis genera (called genus-specific) as well as residues appearing only in ADH1 or ADH2 (called isozymic-specific) were defined and localized on the 3D model. All regions important for catalytic activity, such as those forming the substrate- and coenzyme-binding sites, are highly conserved in all tephritid species examined. Genus-specific amino acids are located on the outside of the protein, on loops and regions predicted to be antigenic. The higher percentage of genus-specific amino acid variation seems to be centered in the NAD adenine-binding site, located near the surface of the protein molecule. Nine of 12 isozymic-specific positions are lined along an "arc" on the surface of the protein, thus linking the two "monomer bases" of the dimer via the C-terminal interacting loops. Furthermore, the distribution of isozymic- and genus-specific amino acids on the monomer-monomer interface may have some evolutionary significance. Most amino acids predicted to be antigenic are positioned in peripheral regions of nonfunctional importance, but surprisingly, an additional antigenic region is contained within the (highly conserved in tephritids) C-terminal tail.

Collapse

Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 2004;32:1792-7. [PMID: 15034147 PMCID: PMC390337 DOI: 10.1093/nar/gkh340] [Citation(s) in RCA: 27920] [Impact Index Per Article: 1396.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Kopp J, Schwede T. The SWISS-MODEL Repository of annotated three-dimensional protein structure homology models. Nucleic Acids Res 2004;32:D230-4. [PMID: 14681401 PMCID: PMC308743 DOI: 10.1093/nar/gkh008] [Citation(s) in RCA: 243] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Eyrich VA, Rost B. META-PP: single interface to crucial prediction servers. Nucleic Acids Res 2003;31:3308-10. [PMID: 12824314 PMCID: PMC168978 DOI: 10.1093/nar/gkg572] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2003] [Revised: 04/08/2003] [Accepted: 04/08/2003] [Indexed: 11/14/2022] Open

Wallner B, Elofsson A. Can correct protein models be identified? Protein Sci 2003;12:1073-86. [PMID: 12717029 PMCID: PMC2323877 DOI: 10.1110/ps.0236803] [Citation(s) in RCA: 529] [Impact Index Per Article: 25.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

González B, Campillo N, Garrido F, Gasset M, Sanz-Aparicio J, Pajares MA. Active-site-mutagenesis study of rat liver betaine-homocysteine S-methyltransferase. Biochem J 2003;370:945-52. [PMID: 12487625 PMCID: PMC1223237 DOI: 10.1042/bj20021510] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2002] [Revised: 12/09/2002] [Accepted: 12/17/2002] [Indexed: 11/17/2022]

Ko DC, Binkley J, Sidow A, Scott MP. The integrity of a cholesterol-binding pocket in Niemann-Pick C2 protein is necessary to control lysosome cholesterol levels. Proc Natl Acad Sci U S A 2003;100:2518-25. [PMID: 12591949 PMCID: PMC151373 DOI: 10.1073/pnas.0530027100] [Citation(s) in RCA: 157] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/03/2003] [Indexed: 11/18/2022] Open

Beebe K, Ribas de Pouplana L, Schimmel P. Elucidation of tRNA-dependent editing by a class II tRNA synthetase and significance for cell viability. EMBO J 2003;22:668-75. [PMID: 12554667 PMCID: PMC140749 DOI: 10.1093/emboj/cdg065] [Citation(s) in RCA: 136] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2002] [Revised: 12/03/2002] [Accepted: 12/05/2002] [Indexed: 11/14/2022] Open

Swalla BM, Gumport RI, Gardner JF. Conservation of structure and function among tyrosine recombinases: homology-based modeling of the lambda integrase core-binding domain. Nucleic Acids Res 2003;31:805-18. [PMID: 12560475 PMCID: PMC149183 DOI: 10.1093/nar/gkg142] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Grundhoff A, Ganem D. The latency-associated nuclear antigen of Kaposi's sarcoma-associated herpesvirus permits replication of terminal repeat-containing plasmids. J Virol 2003;77:2779-83. [PMID: 12552022 PMCID: PMC141125 DOI: 10.1128/jvi.77.4.2779-2783.2003] [Citation(s) in RCA: 122] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Rigden DJ, Setlow P, Setlow B, Bagyan I, Stein RA, Jedrzejas MJ. PrfA protein of Bacillus species: prediction and demonstration of endonuclease activity on DNA. Protein Sci 2002;11:2370-81. [PMID: 12237459 PMCID: PMC2373696 DOI: 10.1110/ps.0216802] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Samudrala R, Levitt M. A comprehensive analysis of 40 blind protein structure predictions. BMC STRUCTURAL BIOLOGY 2002;2:3. [PMID: 12150712 PMCID: PMC122083 DOI: 10.1186/1472-6807-2-3] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/09/2002] [Accepted: 08/01/2002] [Indexed: 11/21/2022]

Harton JA, O'Connor W, Conti BJ, Linhoff MW, Ting JPY. Leucine-rich repeats of the class II transactivator control its rate of nuclear accumulation. Hum Immunol 2002;63:588-601. [PMID: 12072194 DOI: 10.1016/s0198-8859(02)00400-7] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Cristobal S, Zemla A, Fischer D, Rychlewski L, Elofsson A. A study of quality measures for protein threading models. BMC Bioinformatics 2001;2:5. [PMID: 11545673 PMCID: PMC55330 DOI: 10.1186/1471-2105-2-5] [Citation(s) in RCA: 148] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2001] [Accepted: 08/01/2001] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Prediction of protein structures is one of the fundamental challenges in biology today. To fully understand how well different prediction methods perform, it is necessary to use measures that evaluate their performance. Every two years, starting in 1994, the CASP (Critical Assessment of protein Structure Prediction) process has been organized to evaluate the ability of different predictors to blindly predict the structure of proteins. To capture different features of the models, several measures have been developed during the CASP processes. However, these measures have not been examined in detail before. In an attempt to develop fully automatic measures that can be used in CASP, as well as in other type of benchmarking experiments, we have compared twenty-one measures. These measures include the measures used in CASP3 and CASP2 as well as have measures introduced later. We have studied their ability to distinguish between the better and worse models submitted to CASP3 and the correlation between them.

RESULTS

Using a small set of 1340 models for 23 different targets we show that most methods correlate with each other. Most pairs of measures show a correlation coefficient of about 0.5. The correlation is slightly higher for measures of similar types. We found that a significant problem when developing automatic measures is how to deal with proteins of different length. Also the comparisons between different measures is complicated as many measures are dependent on the size of the target. We show that the manual assessment can be reproduced to about 70% using automatic measures. Alignment independent measures, detects slightly more of the models with the correct fold, while alignment dependent measures agree better when selecting the best models for each target. Finally we show that using automatic measures would, to a large extent, reproduce the assessors ranking of the predictors at CASP3.

CONCLUSIONS

We show that given a sufficient number of targets the manual and automatic measures would have given almost identical results at CASP3. If the intent is to reproduce the type of scoring done by the manual assessor in in CASP3, the best approach might be to use a combination of alignment independent and alignment dependent measures, as used in several recent studies.

Collapse

Rodrigues-Lima F, Deloménie C, Goodfellow GH, Grant DM, Dupret JM. Homology modelling and structural analysis of human arylamine N-acetyltransferase NAT1: evidence for the conservation of a cysteine protease catalytic domain and an active-site loop. Biochem J 2001;356:327-34. [PMID: 11368758 PMCID: PMC1221842 DOI: 10.1042/0264-6021:3560327] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Bujnicki JM, Elofsson A, Fischer D, Rychlewski L. LiveBench-1: continuous benchmarking of protein structure prediction servers. Protein Sci 2001;10:352-61. [PMID: 11266621 PMCID: PMC2373940 DOI: 10.1110/ps.40501] [Citation(s) in RCA: 101] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]

Abstract

We present a novel, continuous approach aimed at the large-scale assessment of the performance of available fold-recognition servers. Six popular servers were investigated: PDB-Blast, FFAS, T98-lib, GenTHREADER, 3D-PSSM, and INBGU. The assessment was conducted using as prediction targets a large number of selected protein structures released from October 1999 to April 2000. A target was selected if its sequence showed no significant similarity to any of the proteins previously available in the structural database. Overall, the servers were able to produce structurally similar models for one-half of the targets, but significantly accurate sequence-structure alignments were produced for only one-third of the targets. We further classified the targets into two sets: easy and hard. We found that all servers were able to find the correct answer for the vast majority of the easy targets if a structurally similar fold was present in the server's fold libraries. However, among the hard targets--where standard methods such as PSI-BLAST fail--the most sensitive fold-recognition servers were able to produce similar models for only 40% of the cases, half of which had a significantly accurate sequence-structure alignment. Among the hard targets, the presence of updated libraries appeared to be less critical for the ranking. An "ideally combined consensus" prediction, where the results of all servers are considered, would increase the percentage of correct assignments by 50%. Each server had a number of cases with a correct assignment, where the assignments of all the other servers were wrong. This emphasizes the benefits of considering more than one server in difficult prediction tasks. The LiveBench program (http://BioInfo.PL/LiveBench) is being continued, and all interested developers are cordially invited to join.

Collapse

Iwadate M, Ebisawa K, Umeyama H. Comparative Modeling of CAFASP2 Competition. CHEM-BIO INFORMATICS JOURNAL 2001. [DOI: 10.1273/cbij.1.136] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

David R, Korenberg MJ, Hunter IW. 3D-1D threading methods for protein fold recognition. Pharmacogenomics 2000;1:445-55. [PMID: 11257928 DOI: 10.1517/14622416.1.4.445] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open