Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Kim DE, Chivian D, Malmström L, Baker D. Automated prediction of domain boundaries in CASP6 targets using Ginzu and RosettaDOM. Proteins 2006;61 Suppl 7:193-200. [PMID: 16187362 DOI: 10.1002/prot.20737] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Number

Cited by Other Article(s)

Yu ZZ, Peng CX, Liu J, Zhang B, Zhou XG, Zhang GJ. DomBpred: Protein Domain Boundary Prediction Based on Domain-Residue Clustering Using Inter-Residue Distance. IEEE/ACM Trans Comput Biol Bioinform 2023;20:912-922. [PMID: 35594218 DOI: 10.1109/tcbb.2022.3175905] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Abstract

Domain boundary prediction is one of the most important problems in the study of protein structure and function, especially for large proteins. At present, most domain boundary prediction methods have low accuracy and limitations in dealing with multi-domain proteins. In this study, we develop a sequence-based protein domain boundary prediction, named DomBpred. In DomBpred, the input sequence is first classified as either a single-domain protein or a multi-domain protein through a designed effective sequence metric based on a constructed single-domain sequence library. For the multi-domain protein, a domain-residue clustering algorithm inspired by Ising model is proposed to cluster the spatially close residues according inter-residue distance. The unclassified residues and the residues at the edge of the cluster are then tuned by the secondary structure to form potential cut points. Finally, a domain boundary scoring function is proposed to recursively evaluate the potential cut points to generate the domain boundary. DomBpred is tested on a large-scale test set of FUpred comprising 2549 proteins. Experimental results show that DomBpred better performs than the state-of-the-art methods in classifying whether protein sequences are composed by single or multiple domains, and the Matthew's correlation coefficient is 0.882. Moreover, on 849 multi-domain proteins, the domain boundary distance and normalised domain overlap scores of DomBpred are 0.523 and 0.824, respectively, which are 5.0% and 4.2% higher than those of the best comparison method, respectively. Comparison with other methods on the given test set shows that DomBpred outperforms most state-of-the-art sequence-based methods and even achieves better results than the top-level template-based method. The executable program is freely available at https://github.com/iobio-zjut/DomBpred and the online server at http://zhanglab-bioinf.com/DomBpred/.

Collapse

Sánchez BJ, Mubaid S, Busque S, de los Santos Y, Ashour K, Sadek J, Lian X, Khattak S, Di Marco S, Gallouzi IE. The formation of HuR/YB1 complex is required for the stabilization of target mRNA to promote myogenesis. Nucleic Acids Res 2023;51:1375-1392. [PMID: 36629268 PMCID: PMC9943665 DOI: 10.1093/nar/gkac1245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2022] [Accepted: 12/14/2022] [Indexed: 01/12/2023] Open

Affiliation(s)

Brenda Janice Sánchez KAUST Smart-Health Initiative King Abdullah University of Science and Technology (KAUST), Jeddah, Saudi Arabia,KAUST Biological Environmental Science and Engineering (BESE) Division, King Abdullah University of Science and Technology (KAUST), Jeddah, Saudi Arabia,Dept. of Biochemistry, McGill University, 3655 Promenade Sir William Osler, Montreal, QC H3G1Y6, Canada,Rosalind & Morris Goodman Cancer Institute, McGill University, 1160 Pine Avenue, Montreal, QC H3A1A3, Canada
Souad Mubaid Dept. of Biochemistry, McGill University, 3655 Promenade Sir William Osler, Montreal, QC H3G1Y6, Canada,Rosalind & Morris Goodman Cancer Institute, McGill University, 1160 Pine Avenue, Montreal, QC H3A1A3, Canada
Sandrine Busque Dept. of Biochemistry, McGill University, 3655 Promenade Sir William Osler, Montreal, QC H3G1Y6, Canada,Rosalind & Morris Goodman Cancer Institute, McGill University, 1160 Pine Avenue, Montreal, QC H3A1A3, Canada
Yossef Lopez de los Santos KAUST Biological Environmental Science and Engineering (BESE) Division, King Abdullah University of Science and Technology (KAUST), Jeddah, Saudi Arabia
Kholoud Ashour Dept. of Biochemistry, McGill University, 3655 Promenade Sir William Osler, Montreal, QC H3G1Y6, Canada,Rosalind & Morris Goodman Cancer Institute, McGill University, 1160 Pine Avenue, Montreal, QC H3A1A3, Canada
Jason Sadek Dept. of Biochemistry, McGill University, 3655 Promenade Sir William Osler, Montreal, QC H3G1Y6, Canada,Rosalind & Morris Goodman Cancer Institute, McGill University, 1160 Pine Avenue, Montreal, QC H3A1A3, Canada
Xian Jin Lian Dept. of Biochemistry, McGill University, 3655 Promenade Sir William Osler, Montreal, QC H3G1Y6, Canada,Rosalind & Morris Goodman Cancer Institute, McGill University, 1160 Pine Avenue, Montreal, QC H3A1A3, Canada
Shahryar Khattak KAUST Smart-Health Initiative King Abdullah University of Science and Technology (KAUST), Jeddah, Saudi Arabia,KAUST Biological Environmental Science and Engineering (BESE) Division, King Abdullah University of Science and Technology (KAUST), Jeddah, Saudi Arabia
Sergio Di Marco KAUST Smart-Health Initiative King Abdullah University of Science and Technology (KAUST), Jeddah, Saudi Arabia,KAUST Biological Environmental Science and Engineering (BESE) Division, King Abdullah University of Science and Technology (KAUST), Jeddah, Saudi Arabia,Dept. of Biochemistry, McGill University, 3655 Promenade Sir William Osler, Montreal, QC H3G1Y6, Canada,Rosalind & Morris Goodman Cancer Institute, McGill University, 1160 Pine Avenue, Montreal, QC H3A1A3, Canada
Imed-Eddine Gallouzi To whom correspondence should be addressed. Tel: +966 12 808 2354;

Collapse

Wang L, Zhong H, Xue Z, Wang Y. Res-Dom: predicting protein domain boundary from sequence using deep residual network and Bi-LSTM. Bioinform Adv 2022;2:vbac060. [PMID: 36699417 PMCID: PMC9710680 DOI: 10.1093/bioadv/vbac060] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/22/2022] [Revised: 07/01/2022] [Accepted: 08/30/2022] [Indexed: 01/28/2023]

Grinkevich VV, Vema A, Fawkner K, Issaeva N, Andreotti V, Dickinson ER, Hedström E, Spinnler C, Inga A, Larsson LG, Karlén A, Wilhelm M, Barran PE, Okorokov AL, Selivanova G, Zawacka-Pankau JE. Novel Allosteric Mechanism of Dual p53/MDM2 and p53/MDM4 Inhibition by a Small Molecule. Front Mol Biosci 2022;9:823195. [PMID: 35720128 PMCID: PMC9198586 DOI: 10.3389/fmolb.2022.823195] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Accepted: 04/26/2022] [Indexed: 01/26/2023] Open

Affiliation(s)

Vera V. Grinkevich Department of Microbiology, Tumor and Cell Biology, Karolinska Institute, Stockholm, Sweden
Aparna Vema Division of Organic Pharmaceutical Chemistry, Department of Medicinal Chemistry, Uppsala University, Uppsala, Sweden
Karin Fawkner Department of Microbiology, Tumor and Cell Biology, Karolinska Institute, Stockholm, Sweden
Natalia Issaeva Department of Otolaryngology/Head and Neck Surgery, UNC-Chapel Hill, Chapel Hill, NC, United States
Virginia Andreotti IRCCS Ospedale Policlinico San Martino, Genetics of Rare Cancers, Genoa, Italy
Eleanor R. Dickinson Manchester Institute of Biotechnology, The School of Chemistry, The University of Manchester, Manchester, United Kingdom
Elisabeth Hedström Department of Microbiology, Tumor and Cell Biology, Karolinska Institute, Stockholm, Sweden
Clemens Spinnler Department of Microbiology, Tumor and Cell Biology, Karolinska Institute, Stockholm, Sweden
Alberto Inga Department CIBIO, University of Trento, Trento, Italy
Lars-Gunnar Larsson Department of Microbiology, Tumor and Cell Biology, Karolinska Institute, Stockholm, Sweden
Anders Karlén Division of Organic Pharmaceutical Chemistry, Department of Medicinal Chemistry, Uppsala University, Uppsala, Sweden
Margareta Wilhelm Department of Microbiology, Tumor and Cell Biology, Karolinska Institute, Stockholm, Sweden
Perdita E. Barran Manchester Institute of Biotechnology, The School of Chemistry, The University of Manchester, Manchester, United Kingdom
Andrei L. Okorokov Wolfson Institute for Biomedical Research, University College London, London, United Kingdom
Galina Selivanova Department of Microbiology, Tumor and Cell Biology, Karolinska Institute, Stockholm, Sweden,*Correspondence: Galina Selivanova, ; Joanna E. Zawacka-Pankau,
Joanna E. Zawacka-Pankau Department of Medicine, Huddinge, Center for Hematology and Regenerative Medicine, Karolinska Institute, Stockholm, Sweden,*Correspondence: Galina Selivanova, ; Joanna E. Zawacka-Pankau,

Collapse

Cretin G, Galochkina T, Vander Meersche Y, de Brevern AG, Postic G, Gelly JC. SWORD2: hierarchical analysis of protein 3D structures. Nucleic Acids Res 2022;50:W732-W738. [PMID: 35580056 PMCID: PMC9252838 DOI: 10.1093/nar/gkac370] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 04/19/2022] [Accepted: 04/29/2022] [Indexed: 11/27/2022] Open

Mulnaes D, Golchin P, Koenig F, Gohlke H. TopDomain: Exhaustive Protein Domain Boundary Metaprediction Combining Multisource Information and Deep Learning. J Chem Theory Comput 2021;17:4599-4613. [PMID: 34161735 DOI: 10.1021/acs.jctc.1c00129] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Li G, Zhou X, Li Z, Liu Y, Liu D, Miao Y, Wan Q, Zhang R. Significantly improving the thermostability of a hyperthermophilic GH10 family xylanase XynAF1 by semi-rational design. Appl Microbiol Biotechnol 2021;105:4561-4576. [PMID: 34014347 DOI: 10.1007/s00253-021-11340-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2021] [Revised: 04/16/2021] [Accepted: 05/09/2021] [Indexed: 11/28/2022]

Abstract

Xylanases have a broad range of applications in industrial biotechnologies, which require the enzymes to resist the high-temperature environments. The majority of xylanases have maximum activity at moderate temperatures, which limited their potential applications in industries. In this study, a thermophilic GH10 family xylanase XynAF1 from the high-temperature composting strain Aspergillus fumigatus Z5 was characterized and engineered to further improve its thermostability. XynAF1 has the optimal reaction temperature of 90 °C. The crystal structure of XynAF1 was obtained by X-ray diffraction after heterologous expression, purification, and crystallization. The high-resolution X-ray crystallographic structure of the protein-product complex was obtained by soaking the apo-state crystal with xylotetraose. Structure analysis indicated that XynAF1 has a rigid skeleton, which helps to maintain the hyperthermophilic characteristic. The homologous structure analysis and the catalytic center mutant construction of XynAF1 indicated the conserved catalytic center contributed to the high optimum catalytic temperature. The amino acids in the surface of xylanase XynAF1 which might influence the enzyme thermostability were identified by the structure analysis. Combining the rational design with the saturation mutation at the high B-value regions, the integrative mutant XynAF1-AC with a 6-fold increase of thermostability was finally obtained. This study efficiently improved the thermostability of a GH10 family xylanase by semi-rational design, which provided a new biocatalyst for high-temperature biotechnological applications. KEY POINTS: • Obtained the crystal structure of GH10 family hyperthermophilic xylanase XynAF1. • Shed light on the understanding of the GH10 family xylanase thermophilic mechanism. • Constructed a 6-fold increased thermostability recombinant xylanase.

Collapse

Affiliation(s)

Guangqi Li Key Laboratory of Microbial Resources Collection and Preservation, Ministry of Agriculture, Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing, 100081, People's Republic of China.,Jiangsu Provincial Key Lab for Organic Solid Waste Utilization, National Engineering Research Center for Organic-based Fertilizers, Jiangsu Collaborative Innovation Center for Solid Organic Waste Resource Utilization, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China
Xuan Zhou National Agricultural Technology Extension and Service Center, Beijing, 100125, People's Republic of China
Zhihong Li College of Science, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China
Yunpeng Liu Key Laboratory of Microbial Resources Collection and Preservation, Ministry of Agriculture, Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing, 100081, People's Republic of China
Dongyang Liu Jiangsu Provincial Key Lab for Organic Solid Waste Utilization, National Engineering Research Center for Organic-based Fertilizers, Jiangsu Collaborative Innovation Center for Solid Organic Waste Resource Utilization, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China
Youzhi Miao Jiangsu Provincial Key Lab for Organic Solid Waste Utilization, National Engineering Research Center for Organic-based Fertilizers, Jiangsu Collaborative Innovation Center for Solid Organic Waste Resource Utilization, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China
Qun Wan College of Science, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China. .,The Key Laboratory of Plant Immunity, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China.
Ruifu Zhang Key Laboratory of Microbial Resources Collection and Preservation, Ministry of Agriculture, Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing, 100081, People's Republic of China. .,Jiangsu Provincial Key Lab for Organic Solid Waste Utilization, National Engineering Research Center for Organic-based Fertilizers, Jiangsu Collaborative Innovation Center for Solid Organic Waste Resource Utilization, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China.

Collapse

Wang Y, Zhang H, Zhong H, Xue Z. Protein domain identification methods and online resources. Comput Struct Biotechnol J 2021;19:1145-1153. [PMID: 33680357 PMCID: PMC7895673 DOI: 10.1016/j.csbj.2021.01.041] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2020] [Revised: 01/25/2021] [Accepted: 01/26/2021] [Indexed: 01/03/2023] Open

Zheng W, Zhou X, Wuyun Q, Pearce R, Li Y, Zhang Y. FUpred: detecting protein domains through deep-learning-based contact map prediction. Bioinformatics 2020;36:3749-3757. [PMID: 32227201 DOI: 10.1093/bioinformatics/btaa217] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Revised: 02/27/2020] [Accepted: 03/25/2020] [Indexed: 11/12/2022] Open

Shi Q, Chen W, Huang S, Jin F, Dong Y, Wang Y, Xue Z. DNN-Dom: predicting protein domain boundary from sequence alone by deep neural network. Bioinformatics 2020;35:5128-5136. [PMID: 31197306 DOI: 10.1093/bioinformatics/btz464] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Revised: 05/07/2019] [Accepted: 06/05/2019] [Indexed: 11/13/2022] Open

Joseph AM, Pohl AE, Ball TJ, Abram TG, Johnson DK, Geisbrecht BV, Shames SR. The Legionella pneumophila Metaeffector Lpg2505 (MesI) Regulates SidI-Mediated Translation Inhibition and Novel Glycosyl Hydrolase Activity. Infect Immun 2020;88:e00853-19. [PMID: 32122942 DOI: 10.1128/IAI.00853-19] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Accepted: 02/27/2020] [Indexed: 12/19/2022] Open

Greener JG, Kandathil SM, Jones DT. Deep learning extends de novo protein modelling coverage of genomes using iteratively predicted structural constraints. Nat Commun 2019;10:3977. [PMID: 31484923 PMCID: PMC6726615 DOI: 10.1038/s41467-019-11994-0] [Citation(s) in RCA: 108] [Impact Index Per Article: 21.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Accepted: 08/14/2019] [Indexed: 01/30/2023] Open

Dulcey CE, López de Los Santos Y, Létourneau M, Déziel E, Doucet N. Semi-rational evolution of the 3-(3-hydroxyalkanoyloxy)alkanoate (HAA) synthase RhlA to improve rhamnolipid production in Pseudomonas aeruginosa and Burkholderia glumae. FEBS J 2019;286:4036-4059. [PMID: 31177633 DOI: 10.1111/febs.14954] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2018] [Revised: 04/12/2019] [Accepted: 06/06/2019] [Indexed: 12/15/2022]

Wang Y, Wang J, Li R, Shi Q, Xue Z, Zhang Y. ThreaDomEx: a unified platform for predicting continuous and discontinuous protein domains by multiple-threading and segment assembly. Nucleic Acids Res 2019;45:W400-W407. [PMID: 28498994 PMCID: PMC5793814 DOI: 10.1093/nar/gkx410] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2017] [Accepted: 04/28/2017] [Indexed: 12/21/2022] Open

García-Mauriño SM, Díaz-Quintana A, Rivero-Rodríguez F, Cruz-Gallardo I, Grüttner C, Hernández-Vellisca M, Díaz-Moreno I. A putative RNA binding protein from Plasmodium vivax apicoplast. FEBS Open Bio 2017;8:177-188. [PMID: 29435408 PMCID: PMC5794462 DOI: 10.1002/2211-5463.12351] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2017] [Revised: 11/03/2017] [Accepted: 11/14/2017] [Indexed: 01/30/2023] Open

Gamage DG, Varma Y, Meitzler JL, Morissette R, Ness TJ, Hendrickson TL. The soluble domains of Gpi8 and Gaa1, two subunits of glycosylphosphatidylinositol transamidase (GPI-T), assemble into a complex. Arch Biochem Biophys 2017;633:58-67. [DOI: 10.1016/j.abb.2017.09.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2017] [Revised: 09/06/2017] [Accepted: 09/07/2017] [Indexed: 11/23/2022]

Sanders K, Lin CL, Smith AJ, Cronin N, Fisher G, Eftychidis V, McGlynn P, Savery NJ, Wigley DB, Dillingham MS. The structure and function of an RNA polymerase interaction domain in the PcrA/UvrD helicase. Nucleic Acids Res 2017;45:3875-3887. [PMID: 28160601 PMCID: PMC5397179 DOI: 10.1093/nar/gkx074] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2016] [Accepted: 01/25/2017] [Indexed: 11/14/2022] Open

Sheu MJ, Hsieh MJ, Chou YE, Wang PH, Yeh CB, Yang SF, Lee HL, Liu YF. Effects of ADAMTS14 genetic polymorphism and cigarette smoking on the clinicopathologic development of hepatocellular carcinoma. PLoS One 2017;12:e0172506. [PMID: 28231306 PMCID: PMC5322915 DOI: 10.1371/journal.pone.0172506] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2016] [Accepted: 02/05/2017] [Indexed: 01/12/2023] Open

Ovchinnikov S, Kim DE, Wang RYR, Liu Y, DiMaio F, Baker D. Improved de novo structure prediction in CASP11 by incorporating coevolution information into Rosetta. Proteins 2016;84 Suppl 1:67-75. [PMID: 26677056 PMCID: PMC5490371 DOI: 10.1002/prot.24974] [Citation(s) in RCA: 83] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2015] [Revised: 11/27/2015] [Accepted: 12/12/2015] [Indexed: 12/19/2022]

Xue Z, Jang R, Govindarajoo B, Huang Y, Wang Y. Extending Protein Domain Boundary Predictors to Detect Discontinuous Domains. PLoS One 2015;10:e0141541. [PMID: 26502173 PMCID: PMC4621036 DOI: 10.1371/journal.pone.0141541] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2015] [Accepted: 10/10/2015] [Indexed: 11/18/2022] Open

Morissette R, Chen W, Perritt AF, Dreiling JL, Arai AE, Sachdev V, Hannoush H, Mallappa A, Xu Z, McDonnell NB, Quezado M, Merke DP. Broadening the Spectrum of Ehlers Danlos Syndrome in Patients With Congenital Adrenal Hyperplasia. J Clin Endocrinol Metab 2015;100:E1143-52. [PMID: 26075496 PMCID: PMC4525000 DOI: 10.1210/jc.2015-2232] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]

Abstract

CONTEXT

The contiguous gene deletion syndrome (CAH-X) was described in a subset (7%) of congenital adrenal hyperplasia (CAH) patients with a TNXA/TNXB chimera, resulting in deletions of CYP21A2, encoding 21-hydroxylase necessary for cortisol biosynthesis, and TNXB, encoding the extracellular matrix glycoprotein tenascin-X (TNX). This TNXA/TNXB chimera is characterized by a 120-bp deletion in exon 35 and results in TNXB haploinsufficiency, disrupted TGF-β signaling, and an Ehlers Danlos syndrome phenotype.

OBJECTIVE

The objective of the study was to determine the genetic status of TNXB and resulting protein defects in CAH patients with a CAH-X phenotype but not the previously described TNXA/TNXB chimera. Design, Settings, Participants, and Intervention: A total of 246 unrelated CAH patients were screened for TNXB defects. Genetic defects were investigated by Southern blotting, multiplex ligation-dependent probe amplification, Sanger, and next-generation sequencing. Dermal fibroblasts and tissue were used for immunoblotting, immunohistochemical, and coimmunoprecipitation experiments.

MAIN OUTCOME MEASURES

The genetic and protein status of tenascin-X in phenotypic CAH-X patients was measured.

RESULTS

Seven families harbor a novel TNXB missense variant c.12174C>G (p.C4058W) and a clinical phenotype consistent with hypermobility-type Ehlers Danlos syndrome. Fourteen CAH probands carry previously described TNXA/TNXB chimeras, and seven unrelated patients carry the novel TNXB variant, resulting in a CAH-X prevalence of 8.5%. This highly conserved pseudogene-derived variant in the TNX fibrinogen-like domain is predicted to be deleterious and disulfide bonded, results in reduced dermal elastin and fibrillin-1 staining and altered TGF-β1 binding, and represents a novel TNXA/TNXB chimera. Tenascin-X protein expression was normal in dermal fibroblasts, suggesting a dominant-negative effect.

CONCLUSIONS

CAH-X syndrome is commonly found in CAH due to 21-hydroxylase deficiency and may result from various etiological mechanisms.

Collapse

Jing R, Sun J, Wang Y, Li M. Domain position prediction based on sequence information by using fuzzy mean operator. Proteins 2015;83:1462-9. [PMID: 26009844 DOI: 10.1002/prot.24833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2015] [Revised: 04/23/2015] [Accepted: 05/17/2015] [Indexed: 11/09/2022]

Punta M, Simon I, Dosztányi Z. Prediction and analysis of intrinsically disordered proteins. Methods Mol Biol 2015;1261:35-59. [PMID: 25502193 DOI: 10.1007/978-1-4939-2230-7_3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]

Xue Z, Xu D, Wang Y, Zhang Y. ThreaDom: extracting protein domain boundary information from multiple threading alignments. Bioinformatics 2013;29:i247-56. [PMID: 23812990 PMCID: PMC3694664 DOI: 10.1093/bioinformatics/btt209] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open

Abstract

Motivation: Protein domains are subunits that can fold and evolve independently. Identification of domain boundary locations is often the first step in protein folding and function annotations. Most of the current methods deduce domain boundaries by sequence-based analysis, which has low accuracy. There is no efficient method for predicting discontinuous domains that consist of segments from separated sequence regions. As template-based methods are most efficient for protein 3D structure modeling, combining multiple threading alignment information should increase the accuracy and reliability of computational domain predictions.

Result: We developed a new protein domain predictor, ThreaDom, which deduces domain boundary locations based on multiple threading alignments. The core of the method development is the derivation of a domain conservation score that combines information from template domain structures and terminal and internal alignment gaps. Tested on 630 non-redundant sequences, without using homologous templates, ThreaDom generates correct single- and multi-domain classifications in 81% of cases, where 78% have the domain linker assigned within ±20 residues. In a second test on 486 proteins with discontinuous domains, ThreaDom achieves an average precision 84% and recall 65% in domain boundary prediction. Finally, ThreaDom was examined on 56 targets from CASP8 and had a domain overlap rate 73, 87 and 85% with the target for Free Modeling, Hard multiple-domain and discontinuous domain proteins, respectively, which are significantly higher than most domain predictors in the CASP8. Similar results were achieved on the targets from the most recently CASP9 and CASP10 experiments.

Availability:http://zhanglab.ccmb.med.umich.edu/ThreaDom/.

Contact:zhng@umich.edu

Supplementary information:Supplementary data are available at Bioinformatics online.

Collapse

Gwynn EJ, Smith AJ, Guy CP, Savery NJ, McGlynn P, Dillingham MS. The conserved C-terminus of the PcrA/UvrD helicase interacts directly with RNA polymerase. PLoS One 2013;8:e78141. [PMID: 24147116 PMCID: PMC3797733 DOI: 10.1371/journal.pone.0078141] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2013] [Accepted: 09/13/2013] [Indexed: 12/31/2022] Open

Da Silva M, Upton C. Bioinformatics for analysis of poxvirus genomes. Methods Mol Biol 2012;890:233-58. [PMID: 22688771 DOI: 10.1007/978-1-61779-876-4_14] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/20/2023]

Kint CI, Verstraeten N, Wens I, Liebens VR, Hofkens J, Versées W, Fauvart M, Michiels J. The Escherichia coli GTPase ObgE modulates hydroxyl radical levels in response to DNA replication fork arrest. FEBS J 2012;279:3692-3704. [PMID: 22863262 DOI: 10.1111/j.1742-4658.2012.08731.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]

Affiliation(s)

Cyrielle I Kint Centre of Microbial and Plant Genetics, Katholieke Universiteit Leuven, Heverlee, Belgium Department of Chemistry, Katholieke Universiteit Leuven, Heverlee, Belgium Structural Biology Brussels, Vrije Universiteit Brussel, Belgium Department of Structural Biology, Vlaams Instituut voor Biotechnologie, Brussels, Belgium
Natalie Verstraeten Centre of Microbial and Plant Genetics, Katholieke Universiteit Leuven, Heverlee, Belgium Department of Chemistry, Katholieke Universiteit Leuven, Heverlee, Belgium Structural Biology Brussels, Vrije Universiteit Brussel, Belgium Department of Structural Biology, Vlaams Instituut voor Biotechnologie, Brussels, Belgium
Inez Wens Centre of Microbial and Plant Genetics, Katholieke Universiteit Leuven, Heverlee, Belgium Department of Chemistry, Katholieke Universiteit Leuven, Heverlee, Belgium Structural Biology Brussels, Vrije Universiteit Brussel, Belgium Department of Structural Biology, Vlaams Instituut voor Biotechnologie, Brussels, Belgium
Veerle R Liebens Centre of Microbial and Plant Genetics, Katholieke Universiteit Leuven, Heverlee, Belgium Department of Chemistry, Katholieke Universiteit Leuven, Heverlee, Belgium Structural Biology Brussels, Vrije Universiteit Brussel, Belgium Department of Structural Biology, Vlaams Instituut voor Biotechnologie, Brussels, Belgium
Johan Hofkens Centre of Microbial and Plant Genetics, Katholieke Universiteit Leuven, Heverlee, Belgium Department of Chemistry, Katholieke Universiteit Leuven, Heverlee, Belgium Structural Biology Brussels, Vrije Universiteit Brussel, Belgium Department of Structural Biology, Vlaams Instituut voor Biotechnologie, Brussels, Belgium
Wim Versées Centre of Microbial and Plant Genetics, Katholieke Universiteit Leuven, Heverlee, Belgium Department of Chemistry, Katholieke Universiteit Leuven, Heverlee, Belgium Structural Biology Brussels, Vrije Universiteit Brussel, Belgium Department of Structural Biology, Vlaams Instituut voor Biotechnologie, Brussels, Belgium
Maarten Fauvart Centre of Microbial and Plant Genetics, Katholieke Universiteit Leuven, Heverlee, Belgium Department of Chemistry, Katholieke Universiteit Leuven, Heverlee, Belgium Structural Biology Brussels, Vrije Universiteit Brussel, Belgium Department of Structural Biology, Vlaams Instituut voor Biotechnologie, Brussels, Belgium
Jan Michiels Centre of Microbial and Plant Genetics, Katholieke Universiteit Leuven, Heverlee, Belgium Department of Chemistry, Katholieke Universiteit Leuven, Heverlee, Belgium Structural Biology Brussels, Vrije Universiteit Brussel, Belgium Department of Structural Biology, Vlaams Instituut voor Biotechnologie, Brussels, Belgium

Collapse

Law YS, Gudimella R, Song BK, Ratnam W, Harikrishna JA. Molecular characterization and comparative sequence analysis of defense-related gene, Oryza rufipogon receptor-like protein kinase 1. Int J Mol Sci 2012;13:9343-9362. [PMID: 22942769 PMCID: PMC3430300 DOI: 10.3390/ijms13079343] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2012] [Revised: 07/06/2012] [Accepted: 07/06/2012] [Indexed: 11/16/2022] Open

Li BQ, Hu LL, Chen L, Feng KY, Cai YD, Chou KC. Prediction of protein domain with mRMR feature selection and analysis. PLoS One 2012;7:e39308. [PMID: 22720092 PMCID: PMC3376124 DOI: 10.1371/journal.pone.0039308] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2011] [Accepted: 05/17/2012] [Indexed: 11/30/2022] Open

Abstract

The domains are the structural and functional units of proteins. With the avalanche of protein sequences generated in the postgenomic age, it is highly desired to develop effective methods for predicting the protein domains according to the sequences information alone, so as to facilitate the structure prediction of proteins and speed up their functional annotation. However, although many efforts have been made in this regard, prediction of protein domains from the sequence information still remains a challenging and elusive problem. Here, a new method was developed by combing the techniques of RF (random forest), mRMR (maximum relevance minimum redundancy), and IFS (incremental feature selection), as well as by incorporating the features of physicochemical and biochemical properties, sequence conservation, residual disorder, secondary structure, and solvent accessibility. The overall success rate achieved by the new method on an independent dataset was around 73%, which was about 28–40% higher than those by the existing method on the same benchmark dataset. Furthermore, it was revealed by an in-depth analysis that the features of evolution, codon diversity, electrostatic charge, and disorder played more important roles than the others in predicting protein domains, quite consistent with experimental observations. It is anticipated that the new method may become a high-throughput tool in annotating protein domains, or may, at the very least, play a complementary role to the existing domain prediction methods, and that the findings about the key features with high impacts to the domain prediction might provide useful insights or clues for further experimental investigations in this area. Finally, it has not escaped our notice that the current approach can also be utilized to study protein signal peptides, B-cell epitopes, HIV protease cleavage sites, among many other important topics in protein science and biomedicine.

Collapse

Gupta AB, Wee LE, Zhou YT, Hortsch M, Low BC. Cross-species analyses identify the BNIP-2 and Cdc42GAP homology (BCH) domain as a distinct functional subclass of the CRAL_TRIO/Sec14 superfamily. PLoS One 2012;7:e33863. [PMID: 22479462 PMCID: PMC3313917 DOI: 10.1371/journal.pone.0033863] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2011] [Accepted: 02/18/2012] [Indexed: 11/19/2022] Open

Abstract

The CRAL_TRIO protein domain, which is unique to the Sec14 protein superfamily, binds to a diverse set of small lipophilic ligands. Similar domains are found in a range of different proteins including neurofibromatosis type-1, a Ras GTPase-activating Protein (RasGAP) and Rho guanine nucleotide exchange factors (RhoGEFs). Proteins containing this structural protein domain exhibit a low sequence similarity and ligand specificity while maintaining an overall characteristic three-dimensional structure. We have previously demonstrated that the BNIP-2 and Cdc42GAP Homology (BCH) protein domain, which shares a low sequence homology with the CRAL_TRIO domain, can serve as a regulatory scaffold that binds to Rho, RhoGEFs and RhoGAPs to control various cell signalling processes. In this work, we investigate 175 BCH domain-containing proteins from a wide range of different organisms. A phylogenetic analysis with ∼100 CRAL_TRIO and similar domains from eight representative species indicates a clear distinction of BCH-containing proteins as a novel subclass within the CRAL_TRIO/Sec14 superfamily. BCH-containing proteins contain a hallmark sequence motif R(R/K)h(R/K)(R/K)NL(R/K)xhhhhHPs (‘h’ is large and hydrophobic residue and ‘s’ is small and weekly polar residue) and can be further subdivided into three unique subtypes associated with BNIP-2-N, macro- and RhoGAP-type protein domains. A previously unknown group of genes encoding ‘BCH-only’ domains is also identified in plants and arthropod species. Based on an analysis of their gene-structure and their protein domain context we hypothesize that BCH domain-containing genes evolved through gene duplication, intron insertions and domain swapping events. Furthermore, we explore the point of divergence between BCH and CRAL-TRIO proteins in relation to their ability to bind small GTPases, GAPs and GEFs and lipid ligands. Our study suggests a need for a more extensive analysis of previously uncharacterized BCH, ‘BCH-like’ and CRAL_TRIO-containing proteins and their significance in regulating signaling events involving small GTPases.

Collapse

Pentony MM, Winters P, Penfold-Brown D, Drew K, Narechania A, DeSalle R, Bonneau R, Purugganan MD. The plant proteome folding project: structure and positive selection in plant protein families. Genome Biol Evol 2012;4:360-71. [PMID: 22345424 PMCID: PMC3318447 DOI: 10.1093/gbe/evs015] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Little NS, Quon T, Upton C. Prediction of a novel RNA binding domain in crocodilepox Zimbabwe Gene 157. Microb Inform Exp 2011;1:12. [PMID: 22587704 PMCID: PMC3372294 DOI: 10.1186/2042-5783-1-12] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2011] [Accepted: 11/21/2011] [Indexed: 11/30/2022]

Drew K, Winters P, Butterfoss GL, Berstis V, Uplinger K, Armstrong J, Riffle M, Schweighofer E, Bovermann B, Goodlett DR, Davis TN, Shasha D, Malmström L, Bonneau R. The Proteome Folding Project: proteome-scale prediction of structure and function. Genome Res 2011;21:1981-94. [PMID: 21824995 DOI: 10.1101/gr.121475.111] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Eickholt J, Deng X, Cheng J. DoBo: Protein domain boundary prediction by integrating evolutionary signals and machine learning. BMC Bioinformatics 2011;12:43. [PMID: 21284866 PMCID: PMC3036623 DOI: 10.1186/1471-2105-12-43] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2010] [Accepted: 02/01/2011] [Indexed: 11/17/2022] Open

Motono C, Nakata J, Koike R, Shimizu K, Shirota M, Amemiya T, Tomii K, Nagano N, Sakaya N, Misoo K, Sato M, Kidera A, Hiroaki H, Shirai T, Kinoshita K, Noguchi T, Ota M. SAHG, a comprehensive database of predicted structures of all human proteins. Nucleic Acids Res 2010;39:D487-93. [PMID: 21051360 PMCID: PMC3013665 DOI: 10.1093/nar/gkq1057] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open

Edwards TE, Phan I, Abendroth J, Dieterich SH, Masoudi A, Guo W, Hewitt SN, Kelley A, Leibly D, Brittnacher MJ, Staker BL, Miller SI, Van Voorhis WC, Myler PJ, Stewart LJ. Structure of a Burkholderia pseudomallei trimeric autotransporter adhesin head. PLoS One 2010;5. [PMID: 20862217 PMCID: PMC2942831 DOI: 10.1371/journal.pone.0012803] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2010] [Accepted: 08/18/2010] [Indexed: 02/04/2023] Open

Malmström L, Goodlett DR. Protein structure modeling. Methods Mol Biol 2010;673:63-72. [PMID: 20835793 DOI: 10.1007/978-1-60761-842-3_5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]

Lenhart TR, Akins DR. Borrelia burgdorferi locus BB0795 encodes a BamA orthologue required for growth and efficient localization of outer membrane proteins. Mol Microbiol 2009;75:692-709. [PMID: 20025662 DOI: 10.1111/j.1365-2958.2009.07015.x] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Dosztanyi Z, Meszaros B, Simon I. Bioinformatical approaches to characterize intrinsically disordered/unstructured proteins. Brief Bioinform 2009;11:225-43. [DOI: 10.1093/bib/bbp061] [Citation(s) in RCA: 93] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Walsh I, Martin AJM, Mooney C, Rubagotti E, Vullo A, Pollastri G. Ab initio and homology based prediction of protein domains by recursive neural networks. BMC Bioinformatics 2009;10:195. [PMID: 19558651 PMCID: PMC2711945 DOI: 10.1186/1471-2105-10-195] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2008] [Accepted: 06/26/2009] [Indexed: 11/10/2022] Open

Abstract

Background

Proteins, especially larger ones, are often composed of individual evolutionary units, domains, which have their own function and structural fold. Predicting domains is an important intermediate step in protein analyses, including the prediction of protein structures.

Results

We describe novel systems for the prediction of protein domain boundaries powered by Recursive Neural Networks. The systems rely on a combination of primary sequence and evolutionary information, predictions of structural features such as secondary structure, solvent accessibility and residue contact maps, and structural templates, both annotated for domains (from the SCOP dataset) and unannotated (from the PDB). We gauge the contribution of contact maps, and PDB and SCOP templates independently and for different ranges of template quality. We find that accurately predicted contact maps are informative for the prediction of domain boundaries, while the same is not true for contact maps predicted ab initio. We also find that gap information from PDB templates is informative, but, not surprisingly, less than SCOP annotations. We test both systems trained on templates of all qualities, and systems trained only on templates of marginal similarity to the query (less than 25% sequence identity). While the first batch of systems produces near perfect predictions in the presence of fair to good templates, the second batch outperforms or match ab initio predictors down to essentially any level of template quality.

We test all systems in 5-fold cross-validation on a large non-redundant set of multi-domain and single domain proteins. The final predictors are state-of-the-art, with a template-less prediction boundary recall of 50.8% (precision 38.7%) within ± 20 residues and a single domain recall of 80.3% (precision 78.1%). The SCOP-based predictors achieve a boundary recall of 74% (precision 77.1%) again within ± 20 residues, and classify single domain proteins as such in over 85% of cases, when we allow a mix of bad and good quality templates. If we only allow marginal templates (max 25% sequence identity to the query) the scores remain high, with boundary recall and precision of 59% and 66.3%, and 80% of all single domain proteins predicted correctly.

Conclusion

The systems presented here may prove useful in large-scale annotation of protein domains in proteins of unknown structure. The methods are available as public web servers at the address: and we plan on running them on a multi-genomic scale and make the results public in the near future.

Collapse

Salipante SJ, Rojas ME, Korkmaz B, Duan Z, Wechsler J, Benson KF, Person RE, Grimes HL, Horwitz MS. Contributions to neutropenia from PFAAP5 (N4BP2L2), a novel protein mediating transcriptional repressor cooperation between Gfi1 and neutrophil elastase. Mol Cell Biol 2009;29:4394-405. [PMID: 19506020 DOI: 10.1128/MCB.00596-09] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Kirillova S, Kumar S, Carugo O. Protein domain boundary predictions: a structural biology perspective. Open Biochem J 2009;3:1-8. [PMID: 19401756 PMCID: PMC2669640 DOI: 10.2174/1874091x00903010001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2008] [Revised: 11/27/2008] [Accepted: 11/29/2008] [Indexed: 11/22/2022] Open

Bondugula R, Lee MS, Wallqvist A. FIEFDom: a transparent domain boundary recognition system using a fuzzy mean operator. Nucleic Acids Res 2008;37:452-62. [PMID: 19056827 PMCID: PMC2632928 DOI: 10.1093/nar/gkn944] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Wu Y, Dousis AD, Chen M, Li J, Ma J. OPUS-Dom: applying the folding-based method VECFOLD to determine protein domain boundaries. J Mol Biol 2008;385:1314-29. [PMID: 19026662 DOI: 10.1016/j.jmb.2008.10.093] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2008] [Revised: 10/29/2008] [Accepted: 10/31/2008] [Indexed: 10/21/2022]

Trevisan S, Borsa P, Botton A, Varotto S, Malagoli M, Ruperti B, Quaggiotti S. Expression of two maize putative nitrate transporters in response to nitrate and sugar availability. Plant Biol (Stuttg) 2008;10:462-75. [PMID: 18557906 DOI: 10.1111/j.1438-8677.2008.00041.x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]

Hu X, Murata LB, Weichsel A, Brailey JL, Roberts SA, Nighorn A, Montfort WR. Allostery in recombinant soluble guanylyl cyclase from Manduca sexta. J Biol Chem 2008;283:20968-77. [PMID: 18515359 DOI: 10.1074/jbc.m801501200] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open

Ye L, Liu T, Wu Z, Zhou R. Sequence-based protein domain boundary prediction using BP neural network with various property profiles. Proteins 2008;71:300-7. [PMID: 17932915 DOI: 10.1002/prot.21745] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Kolmos E, Schoof H, Plümer M, Davis SJ. Structural insights into the function of the core-circadian factor TIMING OF CAB2 EXPRESSION 1 (TOC1). J Circadian Rhythms 2008;6:3. [PMID: 18298828 PMCID: PMC2292679 DOI: 10.1186/1740-3391-6-3] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2007] [Accepted: 02/25/2008] [Indexed: 01/24/2023] Open

Tress M, Cheng J, Baldi P, Joo K, Lee J, Seo JH, Lee J, Baker D, Chivian D, Kim D, Ezkurdia I. Assessment of predictions submitted for the CASP7 domain prediction category. Proteins 2008;69 Suppl 8:137-51. [PMID: 17680686 DOI: 10.1002/prot.21675] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Battey JND, Kopp J, Bordoli L, Read RJ, Clarke ND, Schwede T. Automated server predictions in CASP7. Proteins 2008;69 Suppl 8:68-82. [PMID: 17894354 DOI: 10.1002/prot.21761] [Citation(s) in RCA: 94] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]