Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Thornton JM. From genome to function. Science 2001;292:2095-7. [PMID: 11408660 DOI: 10.1126/science.292.5524.2095] [Citation(s) in RCA: 62] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]

Number

Cited by Other Article(s)

Grigorev V, Tinkov O, Grigoreva L, Rasdolsky A. Structural fractal analysis of the active sites of acetylcholinesterase from various organisms. J Mol Graph Model 2022;116:108265. [PMID: 35816907 DOI: 10.1016/j.jmgm.2022.108265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Revised: 06/24/2022] [Accepted: 06/29/2022] [Indexed: 12/15/2022]

Ling C, Wei X, Shen Y, Zhang H. Development and validation of multiple machine learning algorithms for the classification of G-protein-coupled receptors using molecular evolution model-based feature extraction strategy. Amino Acids 2021;53:1705-1714. [PMID: 34562175 DOI: 10.1007/s00726-021-03080-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2021] [Accepted: 09/13/2021] [Indexed: 11/25/2022]

Leinweber M, Fober T, Freisleben B. GPU-Based Point Cloud Superpositioning for Structural Comparisons of Protein Binding Sites. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018;15:740-752. [PMID: 27845672 DOI: 10.1109/tcbb.2016.2625793] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Li M, Ling C, Xu Q, Gao J. Classification of G-protein coupled receptors based on a rich generation of convolutional neural network, N-gram transformation and multiple sequence alignments. Amino Acids 2017;50:255-266. [PMID: 29151135 DOI: 10.1007/s00726-017-2512-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2017] [Accepted: 11/14/2017] [Indexed: 10/18/2022]

Abstract

Sequence classification is crucial in predicting the function of newly discovered sequences. In recent years, the prediction of the incremental large-scale and diversity of sequences has heavily relied on the involvement of machine-learning algorithms. To improve prediction accuracy, these algorithms must confront the key challenge of extracting valuable features. In this work, we propose a feature-enhanced protein classification approach, considering the rich generation of multiple sequence alignment algorithms, N-gram probabilistic language model and the deep learning technique. The essence behind the proposed method is that if each group of sequences can be represented by one feature sequence, composed of homologous sites, there should be less loss when the sequence is rebuilt, when a more relevant sequence is added to the group. On the basis of this consideration, the prediction becomes whether a query sequence belonging to a group of sequences can be transferred to calculate the probability that the new feature sequence evolves from the original one. The proposed work focuses on the hierarchical classification of G-protein Coupled Receptors (GPCRs), which begins by extracting the feature sequences from the multiple sequence alignment results of the GPCRs sub-subfamilies. The N-gram model is then applied to construct the input vectors. Finally, these vectors are imported into a convolutional neural network to make a prediction. The experimental results elucidate that the proposed method provides significant performance improvements. The classification error rate of the proposed method is reduced by at least 4.67% (family level I) and 5.75% (family Level II), in comparison with the current state-of-the-art methods. The implementation program of the proposed work is freely available at: https://github.com/alanFchina/CNN .

Collapse

Krotzky T, Grunwald C, Egerland U, Klebe G. Large-scale mining for similar protein binding pockets: with RAPMAD retrieval on the fly becomes real. J Chem Inf Model 2014;55:165-79. [PMID: 25474400 DOI: 10.1021/ci5005898] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]

Krotzky T, Fober T, Hüllermeier E, Klebe G. Extended Graph-Based Models for Enhanced Similarity Search in Cavbase. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2014;11:878-890. [PMID: 26356860 DOI: 10.1109/tcbb.2014.2325020] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Kolodny R, Pereyaslavets L, Samson AO, Levitt M. On the Universe of Protein Folds. Annu Rev Biophys 2013;42:559-82. [DOI: 10.1146/annurev-biophys-083012-130432] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Kakumani R, Devabhaktuni V, Ahmad M. A two-stage neural network based technique for protein secondary structure prediction. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2009;2008:1355-8. [PMID: 19162919 DOI: 10.1109/iembs.2008.4649416] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Tseng YY, Dundas J, Liang J. Predicting protein function and binding profile via matching of local evolutionary and geometric surface patterns. J Mol Biol 2009;387:451-64. [PMID: 19154742 PMCID: PMC2670802 DOI: 10.1016/j.jmb.2008.12.072] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2008] [Revised: 12/19/2008] [Accepted: 12/23/2008] [Indexed: 11/25/2022]

Abstract

Inferring protein functions from structures is a challenging task, as a large number of orphan protein structures from structural genomics project are now solved without their biochemical functions characterized. For proteins binding to similar substrates or ligands and carrying out similar functions, their binding surfaces are under similar physicochemical constraints, and hence the sets of allowed and forbidden residue substitutions are similar. However, it is difficult to isolate such selection pressure due to protein function from selection pressure due to protein folding, and evolutionary relationship reflected by global sequence and structure similarities between proteins is often unreliable for inferring protein function. We have developed a method, called pevoSOAR (pocket-based evolutionary search of amino acid residues), for predicting protein functions by solving the problem of uncovering amino acids residue substitution pattern due to protein function and separating it from amino acids substitution pattern due to protein folding. We incorporate evolutionary information specific to an individual binding region and match local surfaces on a large scale with millions of precomputed protein surfaces to identify those with similar functions. Our pevoSOAR method also generates a probablistic model called the computed binding a profile that characterizes protein-binding activities that may involve multiple substrates or ligands. We show that our method can be used to predict enzyme functions with accuracy. Our method can also assess enzyme binding specificity and promiscuity. In an objective large-scale test of 100 enzyme families with thousands of structures, our predictions are found to be sensitive and specific: At the stringent specificity level of 99.98%, we can correctly predict enzyme functions for 80.55% of the proteins. The overall area under the receiver operating characteristic curve measuring the performance of our prediction is 0.955, close to the perfect value of 1.00. The best Matthews coefficient is 86.6%. Our method also works well in predicting the biochemical functions of orphan proteins from structural genomics projects.

Collapse

Nimrod G, Schushan M, Steinberg DM, Ben-Tal N. Detection of functionally important regions in "hypothetical proteins" of known structure. Structure 2009;16:1755-63. [PMID: 19081051 DOI: 10.1016/j.str.2008.10.017] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2008] [Revised: 10/16/2008] [Accepted: 10/19/2008] [Indexed: 10/21/2022]

Pai RD, Zhang W, Schuwirth BS, Hirokawa G, Kaji H, Kaji A, Cate JHD. Structural Insights into ribosome recycling factor interactions with the 70S ribosome. J Mol Biol 2008;376:1334-47. [PMID: 18234219 PMCID: PMC2712656 DOI: 10.1016/j.jmb.2007.12.048] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2007] [Revised: 12/11/2007] [Accepted: 12/19/2007] [Indexed: 11/25/2022]

Singh A, Kushwaha HR, Sharma P. Molecular modelling and comparative structural account of aspartyl beta-semialdehyde dehydrogenase of Mycobacterium tuberculosis (H37Rv). J Mol Model 2008;14:249-63. [PMID: 18236087 DOI: 10.1007/s00894-008-0267-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2007] [Accepted: 01/03/2008] [Indexed: 11/29/2022]

Song J, Yuan Z, Tan H, Huber T, Burrage K. Predicting disulfide connectivity from protein sequence using multiple sequence feature vectors and secondary structure. ACTA ACUST UNITED AC 2007;23:3147-54. [PMID: 17942444 DOI: 10.1093/bioinformatics/btm505] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Abstract

MOTIVATION

Disulfide bonds are primary covalent crosslinks between two cysteine residues in proteins that play critical roles in stabilizing the protein structures and are commonly found in extracy-toplasmatic or secreted proteins. In protein folding prediction, the localization of disulfide bonds can greatly reduce the search in conformational space. Therefore, there is a great need to develop computational methods capable of accurately predicting disulfide connectivity patterns in proteins that could have potentially important applications.

RESULTS

We have developed a novel method to predict disulfide connectivity patterns from protein primary sequence, using a support vector regression (SVR) approach based on multiple sequence feature vectors and predicted secondary structure by the PSIPRED program. The results indicate that our method could achieve a prediction accuracy of 74.4% and 77.9%, respectively, when averaged on proteins with two to five disulfide bridges using 4-fold cross-validation, measured on the protein and cysteine pair on a well-defined non-homologous dataset. We assessed the effects of different sequence encoding schemes on the prediction performance of disulfide connectivity. It has been shown that the sequence encoding scheme based on multiple sequence feature vectors coupled with predicted secondary structure can significantly improve the prediction accuracy, thus enabling our method to outperform most of other currently available predictors. Our work provides a complementary approach to the current algorithms that should be useful in computationally assigning disulfide connectivity patterns and helps in the annotation of protein sequences generated by large-scale whole-genome projects.

AVAILABILITY

The prediction web server and Supplementary Material are accessible at http://foo.maths.uq.edu.au/~huber/disulfide

Collapse

Quantitative assessment of relationship between sequence similarity and function similarity. BMC Genomics 2007;8:222. [PMID: 17620139 PMCID: PMC1949826 DOI: 10.1186/1471-2164-8-222] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2006] [Accepted: 07/09/2007] [Indexed: 11/16/2022] Open

Saini HK, Fischer D. FRalanyzer: a tool for functional analysis of fold-recognition sequence-structure alignments. Nucleic Acids Res 2007;35:W499-502. [PMID: 17537819 PMCID: PMC1933221 DOI: 10.1093/nar/gkm367] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open

Nagano N, Noguchi T, Akiyama Y. Systematic comparison of catalytic mechanisms of hydrolysis and transfer reactions classified in the EzCatDB database. Proteins 2007;66:147-59. [PMID: 17039546 DOI: 10.1002/prot.21193] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Abstract

Catalytic mechanisms of 270 enzymes from 131 superfamilies, mainly hydrolases and transferases, were analyzed based on their enzyme structures. A method of systematic comparison and classification of the catalytic reactions was developed. Hydrolysis and transfer reactions closely resemble one another, displaying common mechanisms, single displacement, and double displacement. These displacement mechanisms might be further subclassified according to the type of catalytic factors and nucleophilic substitution involved. Several types of catalytic factors exist: nucleophile, acid, base, stabilizer, modulator, cofactors. Nucleophilic substitution might be categorized as S(N)1/S(N)2 (or dissociative/associative) reactions. The classification indicates that some mechanisms favor particular types of catalytic factors. In hydrolyses of amide bonds and phosphoric ester bonds, mechanisms with single displacement tend to use inorganic cofactors such as zinc and magnesium ions as important catalysts, whereas those with double displacement frequently do not use such cofactors. In contrast, hydrolyses of O-glycoside bond rarely use such cofactors, with one exception. The trypsin-like hydrolytic reaction, which is catalyzed by the classic catalytic triad comprising serine/histidine/aspartate, can be considered as a "super-reaction" because it is observed in at least three nonhomologous enzymes, whereas most reactions are singlets without any nonhomologous enzymes. By dividing complex reactions into several reactions, correlations between active site structures and catalytic functions can be suggested. This classification method is applicable to other reactions such as elimination and isomerization. Furthermore, it will facilitate annotation of enzyme functions from 3D patterns of enzyme active sites. The classification is available at http://mbs.cbrc.jp/EzCatDB/RLCP/index.html.

Collapse

Saha RP, Chakrabarti P. Molecular modeling and characterization of Vibrio cholerae transcription regulator HlyU. BMC STRUCTURAL BIOLOGY 2006;6:24. [PMID: 17116251 PMCID: PMC1665450 DOI: 10.1186/1472-6807-6-24] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/17/2006] [Accepted: 11/20/2006] [Indexed: 11/15/2022]

Glaser F, Rosenberg Y, Kessel A, Pupko T, Ben-Tal N. The ConSurf-HSSP database: the mapping of evolutionary conservation among homologs onto PDB structures. Proteins 2006;58:610-7. [PMID: 15614759 DOI: 10.1002/prot.20305] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Abstract

The HSSP (Homology-Derived Secondary Structure of Proteins) database provides multiple sequence alignments (MSAs) for proteins of known three-dimensional (3D) structure in the Protein Data Bank (PDB). The database also contains an estimate of the degree of evolutionary conservation at each amino acid position. This estimate, which is based on the relative entropy, correlates with the functional importance of the position; evolutionarily conserved positions (i.e., positions with limited variability and low entropy) are occasionally important to maintain the 3D structure and biological function(s) of the protein. We recently developed the Rate4Site algorithm for scoring amino acid conservation based on their calculated evolutionary rate. This algorithm takes into account the phylogenetic relationships between the homologs and the stochastic nature of the evolutionary process. Here we present the ConSurf-HSSP database of Rate4Site estimates of the evolutionary rates of the amino acid positions, calculated using HSSP's MSAs. The database provides precalculated evolutionary rates for nearly all of the PDB. These rates are projected, using a color code, onto the protein structure, and can be viewed online using the ConSurf server interface. To exemplify the database, we analyzed in detail the conservation pattern obtained for pyruvate kinase and compared the results with those observed using the relative entropy scores of the HSSP database. It is reassuring to know that the main functional region of the enzyme is detectable using both conservation scores. Interestingly, the ConSurf-HSSP calculations mapped additional functionally important regions, which are moderately conserved and were overlooked by the original HSSP estimate. The ConSurf-HSSP database is available online (http://consurf-hssp.tau.ac.il).

Collapse

Mika S, Rost B. Protein-protein interactions more conserved within species than across species. PLoS Comput Biol 2006;2:e79. [PMID: 16854211 PMCID: PMC1513270 DOI: 10.1371/journal.pcbi.0020079] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2005] [Indexed: 11/21/2022] Open

Abstract

Experimental high-throughput studies of protein–protein interactions are beginning to provide enough data for comprehensive computational studies. Today, about ten large data sets, each with thousands of interacting pairs, coarsely sample the interactions in fly, human, worm, and yeast. Another about 55,000 pairs of interacting proteins have been identified by more careful, detailed biochemical experiments. Most interactions are experimentally observed in prokaryotes and simple eukaryotes; very few interactions are observed in higher eukaryotes such as mammals. It is commonly assumed that pathways in mammals can be inferred through homology to model organisms, e.g. the experimental observation that two yeast proteins interact is transferred to infer that the two corresponding proteins in human also interact. Two pairs for which the interaction is conserved are often described as interologs. The goal of this investigation was a large-scale comprehensive analysis of such inferences, i.e. of the evolutionary conservation of interologs. Here, we introduced a novel score for measuring the overlap between protein–protein interaction data sets. This measure appeared to reflect the overall quality of the data and was the basis for our two surprising results from our large-scale analysis. Firstly, homology-based inferences of physical protein–protein interactions appeared far less successful than expected. In fact, such inferences were accurate only for extremely high levels of sequence similarity. Secondly, and most surprisingly, the identification of interacting partners through sequence similarity was significantly more reliable for protein pairs within the same organism than for pairs between species. Our analysis underlined that the discrepancies between different datasets are large, even when using the same type of experiment on the same organism. This reality considerably constrains the power of homology-based transfer of interactions. In particular, the experimental probing of interactions in distant model organisms has to be undertaken with some caution. More comprehensive images of protein–protein networks will require the combination of many high-throughput methods, including in silico inferences and predictions. http://www.rostlab.org/results/2006/ppi_homology/

The IntAct database contains about ten large-scale data sets of protein–protein interactions. Each set contains thousands of experimentally observed pair interactions. Most pairs were observed in yeast (Saccharomyces cerevisiae), fly (Drosophila melanogaster), and worm (Caenorhabditis elegans). These interactions are often perceived as model organisms in the sense that one can infer that two mouse proteins interact if one experimentally observes the two corresponding proteins in worm to interact. Here, the authors analyzed in detail how the sequence signals of physical protein–protein interactions are conserved. It is a common assumption that protein–protein interactions can easily be inferred through homology transfer from one model organism to another organism of interest. Here, the authors demonstrated that such homology transfers are only accurate at unexpectedly high levels of sequence identity. Even more surprisingly, homology transfers of protein–protein interactions are significantly more reliable for protein pairs from the same species than for two protein pairs from different organisms. The observation that interactions were much more conserved within than across species was valid for all levels of sequence similarity, i.e. for very similar as well as for more diverged interologs.

Collapse

Tanzer ML. Current concepts of extracellular matrix. J Orthop Sci 2006;11:326-31. [PMID: 16721539 PMCID: PMC2778692 DOI: 10.1007/s00776-006-1012-2] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/14/2005] [Indexed: 01/16/2023]

Clare A, Karwath A, Ougham H, King RD. Functional bioinformatics for Arabidopsis thaliana. Bioinformatics 2006;22:1130-6. [PMID: 16481336 DOI: 10.1093/bioinformatics/btl051] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Vries JK, Munshi R, Tobi D, Klein-Seetharaman J, Benos PV, Bahar I. A sequence alignment-independent method for protein classification. ACTA ACUST UNITED AC 2005;3:137-48. [PMID: 15693739 DOI: 10.2165/00822942-200403020-00008] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]

Abstract

Annotation of the rapidly accumulating body of sequence data relies heavily on the detection of remote homologues and functional motifs in protein families. The most popular methods rely on sequence alignment. These include programs that use a scoring matrix to compare the probability of a potential alignment with random chance and programs that use curated multiple alignments to train profile hidden Markov models (HMMs). Related approaches depend on bootstrapping multiple alignments from a single sequence. However, alignment-based programs have limitations. They make the assumption that contiguity is conserved between homologous segments, which may not be true in genetic recombination or horizontal transfer. Alignments also become ambiguous when sequence similarity drops below 40%. This has kindled interest in classification methods that do not rely on alignment. An approach to classification without alignment based on the distribution of contiguous sequences of four amino acids (4-grams) was developed. Interest in 4-grams stemmed from the observation that almost all theoretically possible 4-grams (20(4)) occur in natural sequences and the majority of 4-grams are uniformly distributed. This implies that the probability of finding identical 4-grams by random chance in unrelated sequences is low. A Bayesian probabilistic model was developed to test this hypothesis. For each protein family in Pfam-A and PIR-PSD, a feature vector called a probe was constructed from the set of 4-grams that best characterised the family. In rigorous jackknife tests, unknown sequences from Pfam-A and PIR-PSD were compared with the probes for each family. A classification result was deemed a true positive if the probe match with the highest probability was in first place in a rank-ordered list. This was achieved in 70% of cases. Analysis of false positives suggested that the precision might approach 85% if selected families were clustered into subsets. Case studies indicated that the 4-grams in common between an unknown and the best matching probe correlated with functional motifs from PRINTS. The results showed that remote homologues and functional motifs could be identified from an analysis of 4-gram patterns.

Collapse

Magliery TJ, Regan L. Sequence variation in ligand binding sites in proteins. BMC Bioinformatics 2005;6:240. [PMID: 16194281 PMCID: PMC1261162 DOI: 10.1186/1471-2105-6-240] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2005] [Accepted: 09/30/2005] [Indexed: 11/25/2022] Open

Punta M, Rost B. PROFcon: novel prediction of long-range contacts. Bioinformatics 2005;21:2960-8. [PMID: 15890748 DOI: 10.1093/bioinformatics/bti454] [Citation(s) in RCA: 95] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open

Namboori S, Mhatre N, Sujatha S, Srinivasan N, Pandit SB. Enhanced functional and structural domain assignments using remote similarity detection procedures for proteins encoded in the genome of Mycobacterium tuberculosis H37Rv. J Biosci 2005;29:245-59. [PMID: 15381846 DOI: 10.1007/bf02702607] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]

Marabotti A, D'Auria S, Rossi M, Facchiano AM. Theoretical model of the three-dimensional structure of a sugar-binding protein from Pyrococcus horikoshii: structural analysis and sugar-binding simulations. Biochem J 2004;380:677-84. [PMID: 15015939 PMCID: PMC1224218 DOI: 10.1042/bj20031876] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2003] [Revised: 03/11/2004] [Accepted: 03/12/2004] [Indexed: 11/17/2022]

Toriumi C, Imai K. An identification method for altered proteins in tissues utilizing fluorescence derivatization, liquid chromatography, tandem mass spectrometry, and a database-searching algorithm. Anal Chem 2004;75:3725-30. [PMID: 14572036 DOI: 10.1021/ac020693x] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Guo J, Chen H, Sun Z, Lin Y. A novel method for protein secondary structure prediction using dual-layer SVM and profiles. Proteins 2004;54:738-43. [PMID: 14997569 DOI: 10.1002/prot.10634] [Citation(s) in RCA: 137] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Conrad C, Vianna C, Schultz C, Thal DR, Ghebremedhin E, Lenz J, Braak H, Davies P. Molecular evolution and genetics of the Saitohin gene and tau haplotype in Alzheimer's disease and argyrophilic grain disease. J Neurochem 2004;89:179-88. [PMID: 15030402 DOI: 10.1046/j.1471-4159.2004.02320.x] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Whittaker PA. What is the relevance of bioinformatics to pharmacology? Trends Pharmacol Sci 2003;24:434-9. [PMID: 12915054 DOI: 10.1016/s0165-6147(03)00197-4] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Ofran Y, Rost B. Predicted protein-protein interaction sites from local sequence information. FEBS Lett 2003;544:236-9. [PMID: 12782323 DOI: 10.1016/s0014-5793(03)00456-3] [Citation(s) in RCA: 162] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Lan N, Montelione GT, Gerstein M. Ontologies for proteomics: towards a systematic definition of structure and function that scales to the genome level. Curr Opin Chem Biol 2003;7:44-54. [PMID: 12547426 DOI: 10.1016/s1367-5931(02)00020-0] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

McDonald JD, Andriolo M, Calì F, Mirisola M, Puglisi-Allegra S, Romano V, Sarkissian CN, Smith CB. The phenylketonuria mouse model: a meeting review. Mol Genet Metab 2002;76:256-61. [PMID: 12208130 DOI: 10.1016/s1096-7192(02)00115-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]

Hart AL, Stagg AJ, Frame M, Graffner H, Glise H, Falk P, Kamm MA. The role of the gut flora in health and disease, and its modification as therapy. Aliment Pharmacol Ther 2002;16:1383-93. [PMID: 12182739 DOI: 10.1046/j.1365-2036.2002.01310.x] [Citation(s) in RCA: 62] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]

Wang HW, Sharp TV, Koumi A, Koentges G, Boshoff C. Characterization of an anti-apoptotic glycoprotein encoded by Kaposi's sarcoma-associated herpesvirus which resembles a spliced variant of human survivin. EMBO J 2002;21:2602-15. [PMID: 12032073 PMCID: PMC126038 DOI: 10.1093/emboj/21.11.2602] [Citation(s) in RCA: 129] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open

Stahura FL, Bajorath J. Bio- and chemo-informatics beyond data management: crucial challenges and future opportunities. Drug Discov Today 2002;7:S41-7. [PMID: 12047879 DOI: 10.1016/s1359-6446(02)02271-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Dieckman L, Gu M, Stols L, Donnelly MI, Collart FR. High throughput methods for gene cloning and expression. Protein Expr Purif 2002;25:1-7. [PMID: 12071692 DOI: 10.1006/prep.2001.1602] [Citation(s) in RCA: 112] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Rost B. Did evolution leap to create the protein universe? Curr Opin Struct Biol 2002;12:409-16. [PMID: 12127462 DOI: 10.1016/s0959-440x(02)00337-8] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Ueberle B, Frank R, Herrmann R. The proteome of the bacterium Mycoplasma pneumoniae: comparing predicted open reading frames to identified gene products. Proteomics 2002;2:754-64. [PMID: 12112859 DOI: 10.1002/1615-9861(200206)2:6<754::aid-prot754>3.0.co;2-2] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Conrad C, Vianna C, Freeman M, Davies P. A polymorphic gene nested within an intron of the tau gene: implications for Alzheimer's disease. Proc Natl Acad Sci U S A 2002;99:7751-6. [PMID: 12032355 PMCID: PMC124341 DOI: 10.1073/pnas.112194599] [Citation(s) in RCA: 94] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Kiechle FL, Zhang X. The postgenomic era: implications for the clinical laboratory. Arch Pathol Lab Med 2002;126:255-62. [PMID: 11860296 DOI: 10.5858/2002-126-0255-tpe] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Abstract

OBJECTIVES

To review the advances in clinically useful molecular biological techniques and to identify their applications in clinical practice, as presented at the Tenth Annual William Beaumont Hospital DNA Symposium.

DATA SOURCES

The 11 manuscripts submitted were reviewed and their major findings were compared with literature on the same topic.

STUDY SELECTION

Manuscripts address creative thinking techniques applied to DNA discovery, extraction of DNA from clotted blood, the relationship of mitochondrial dysfunction in neurodegenerative disorders, and molecular methods to identify human lymphocyte antigen class I and class II loci. Two other manuscripts review current issues in molecular microbiology, including detection of hepatitis C virus and biological warfare. The last 5 manuscripts describe current issues in molecular cardiovascular disease, including assessing thrombotic risk, genomic analysis, gene therapy, and a device for aiding in cardiac angiogenesis.

DATA SYNTHESIS

Novel problem-solving techniques have been used in the past and will be required in the future in DNA discovery. The extraction of DNA from clotted blood demonstrates a potential cost-effective strategy. Cybrids created from mitochondrial DNA-depleted cells and mitochondrial DNA from a platelet donor have been useful in defining the role mitochondria play in neurodegeneration. Mitochondrial depletion has been reported as a genetically inherited disorder or after human immunodeficiency virus therapy. Hepatitis C viral detection by qualitative, quantitative, or genotyping techniques is useful clinically. Preparedness for potential biological warfare is a responsibility of all clinical laboratorians. Thrombotic risk in cardiovascular disorders may be assessed by coagulation screening assays and further defined by mutation analysis for specific genes for prothrombin and factor V Leiden. Gene therapy for reducing arteriosclerotic risk has been hindered primarily by complications introduced by the vectors used to introduce the therapeutic genes. Neovascularization in cardiac muscle with occluded vessels represents a promising method for recovery of viable tissue following ischemia.

CONCLUSIONS

The sequence of the human genome was reported by 2 groups in February 2001. The postgenomic era will emphasize the use of microarrays and database software for genomic and proteomic screening in the search for useful clinical assays. The number of molecular pathologic techniques and assays will expand as additional disease-associated mutations are defined. Gene therapy and tissue engineering will represent successful therapeutic adjuncts.

Collapse

Lichtarge O, Sowa ME. Evolutionary predictions of binding surfaces and interactions. Curr Opin Struct Biol 2002;12:21-7. [PMID: 11839485 DOI: 10.1016/s0959-440x(02)00284-1] [Citation(s) in RCA: 184] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

LIU LEYUAN, VO AMY, LIU GUOQIN, MCKEEHAN WALLACEL. Novel complex integrating mitochondria and the microtubular cytoskeleton with chromosome remodeling and tumor suppressor RASSF1 deduced by in silico homology analysis, interaction cloning in yeast, and colocalization in cultured cells. In Vitro Cell Dev Biol Anim 2002;38:582-94. [PMID: 12762840 PMCID: PMC3225227 DOI: 10.1290/1543-706x(2002)38<582:ncimat>2.0.co;2] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Abstract

Availability of the complete sequence of the human genome and sequence homology analysis has accelerated new protein discovery and clues to protein function. Protein-protein interaction cloning suggests multisubunit complexes and pathways. Here, we combine these molecular approaches with cultured cell colocalization analysis to suggest a novel complex and a pathway that integrate the mitochondrial location and the microtubular cytoskeleton with chromosome remodeling, apoptosis, and tumor suppression based on a novel leucine-rich pentatricopeptide repeat-motif-containing protein (LRPPRC) that copurified with the fibroblast growth factor receptor complex. One round of interaction cloning and sequence homology analysis defined a primary LRPPRC complex with novel subunits cat eye syndrome chromosome region candidate 2 (CECR2), ubiquitously expressed transcript (UXT), and chromosome 19 open reading frames 5 (C19ORF5) but still of unknown function. Immuno, deoxyribonucleic acid (DNA), and green fluorescent protein (GFP) tag colocalization analyses revealed that LRPPRC appears in both cytosol and nuclei of cultured cells, colocalizes with mitochondria and beta-tubulin rather than with alpha-actin in the cytosol of interphase cells, and exhibits phase-dependent organization around separating chromosomes in mitotic cells. GFP-tagged CECR2B was strictly nuclear and colocalized with condensed DNA in apoptotic cells. GFP-tagged UXT and GFP-tagged C19ORF5 appeared in both cytosol and nuclei and colocalized with LRPPRC and beta-tubulin. Cells exhibiting nuclear C19ORF5 were apoptotic. Screening for interactive substrates with the primary LRPPRC substrates in the human liver complementary DNA library revealed that CECR2B interacted with chromatin-associated TFIID-associated protein TAFII30 and ribonucleic acid splicing factor SRP40, UXT bridged to CBP/p300-binding factor CITED2 and kinetochore-associated factor BUB3, and C19ORF5 complexed with mitochondria-associated NADH dehydrogenase I and cytochrome c oxidase I. C19ORF5 also interacted with RASSF1, providing a bridge to apoptosis and tumor suppression.

Collapse

Knudsen B, Miyamoto MM. A likelihood ratio test for evolutionary rate shifts and functional divergence among proteins. Proc Natl Acad Sci U S A 2001;98:14512-7. [PMID: 11734650 PMCID: PMC64713 DOI: 10.1073/pnas.251526398] [Citation(s) in RCA: 88] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Bajorath J. Rational drug discovery revisited: interfacing experimental programs with bio- and chemo-informatics. Drug Discov Today 2001;6:989-995. [PMID: 11576865 DOI: 10.1016/s1359-6446(01)01961-4] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Brenner SE. A tour of structural genomics. Nat Rev Genet 2001;2:801-9. [PMID: 11584296 DOI: 10.1038/35093574] [Citation(s) in RCA: 105] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Greenbaum D, Luscombe NM, Jansen R, Qian J, Gerstein M. Interrelating different types of genomic data, from proteome to secretome: 'oming in on function. Genome Res 2001;11:1463-8. [PMID: 11544189 DOI: 10.1101/gr.207401] [Citation(s) in RCA: 114] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Stevens RC, Wilson IA. Tech.Sight. Industrializing structural biology. Science 2001;293:519-20. [PMID: 11463918 DOI: 10.1126/science.293.5529.519] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]