1
|
Shawan MMAK, Jahan N, Ahamed T, Das A, Khan MA, Hossain S, Sarker SR. <i>In silico</i> subtractive genomics approach characterizes a hypothetical protein (MG_476) from <i>microplasma genitalium</i> G37. JOURNAL OF CLINICAL AND EXPERIMENTAL INVESTIGATIONS 2022. [DOI: 10.29333/jcei/12377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
|
2
|
Detecting protein-protein interactions with a novel matrix-based protein sequence representation and support vector machines. BIOMED RESEARCH INTERNATIONAL 2015; 2015:867516. [PMID: 26000305 PMCID: PMC4426769 DOI: 10.1155/2015/867516] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/01/2014] [Revised: 01/09/2015] [Accepted: 01/09/2015] [Indexed: 11/27/2022]
Abstract
Proteins and their interactions lie at the heart of most underlying biological processes. Consequently, correct detection of protein-protein interactions (PPIs) is of fundamental importance to understand the molecular mechanisms in biological systems. Although the convenience brought by high-throughput experiment in technological advances makes it possible to detect a large amount of PPIs, the data generated through these methods is unreliable and may not be completely inclusive of all possible PPIs. Targeting at this problem, this study develops a novel computational approach to effectively detect the protein interactions. This approach is proposed based on a novel matrix-based representation of protein sequence combined with the algorithm of support vector machine (SVM), which fully considers the sequence order and dipeptide information of the protein primary sequence. When performed on yeast PPIs datasets, the proposed method can reach 90.06% prediction accuracy with 94.37% specificity at the sensitivity of 85.74%, indicating that this predictor is a useful tool to predict PPIs. Achieved results also demonstrate that our approach can be a helpful supplement for the interactions that have been detected experimentally.
Collapse
|
3
|
|
4
|
You ZH, Zhu L, Zheng CH, Yu HJ, Deng SP, Ji Z. Prediction of protein-protein interactions from amino acid sequences using a novel multi-scale continuous and discontinuous feature set. BMC Bioinformatics 2014; 15 Suppl 15:S9. [PMID: 25474679 PMCID: PMC4271571 DOI: 10.1186/1471-2105-15-s15-s9] [Citation(s) in RCA: 84] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Identifying protein-protein interactions (PPIs) is essential for elucidating protein functions and understanding the molecular mechanisms inside the cell. However, the experimental methods for detecting PPIs are both time-consuming and expensive. Therefore, computational prediction of protein interactions are becoming increasingly popular, which can provide an inexpensive way of predicting the most likely set of interactions at the entire proteome scale, and can be used to complement experimental approaches. Although much progress has already been achieved in this direction, the problem is still far from being solved and new approaches are still required to overcome the limitations of the current prediction models. RESULTS In this work, a sequence-based approach is developed by combining a novel Multi-scale Continuous and Discontinuous (MCD) feature representation and Support Vector Machine (SVM). The MCD representation gives adequate consideration to the interactions between sequentially distant but spatially close amino acid residues, thus it can sufficiently capture multiple overlapping continuous and discontinuous binding patterns within a protein sequence. An effective feature selection method mRMR was employed to construct an optimized and more discriminative feature set by excluding redundant features. Finally, a prediction model is trained and tested based on SVM algorithm to predict the interaction probability of protein pairs. CONCLUSIONS When performed on the yeast PPIs data set, the proposed approach achieved 91.36% prediction accuracy with 91.94% precision at the sensitivity of 90.67%. Extensive experiments are conducted to compare our method with the existing sequence-based method. Experimental results show that the performance of our predictor is better than several other state-of-the-art predictors, whose average prediction accuracy is 84.91%, sensitivity is 83.24%, and precision is 86.12%. Achieved results show that the proposed approach is very promising for predicting PPI, so it can be a useful supplementary tool for future proteomics studies. The source code and the datasets are freely available at http://csse.szu.edu.cn/staff/youzh/MCDPPI.zip for academic use.
Collapse
|
5
|
Hou J, Jiang Y. Dynamically searching for a domain for protein function prediction. J Bioinform Comput Biol 2013; 11:1350008. [PMID: 23859272 DOI: 10.1142/s021972001350008x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The availability of large amounts of protein-protein interaction (PPI) data makes it feasible to use computational approaches to predict protein functions. The base of existing computational approaches is to exploit the known function information of annotated proteins in the PPI data to predict functions of un-annotated proteins. However, these approaches consider the prediction domain (i.e. the set of proteins from which the functions are predicted) as unchangeable during the prediction procedure. This may lead to valuable information being overwhelmed by the unavoidable noise information in the PPI data when predicting protein functions, and in turn, the prediction results will be distorted. In this paper, we propose a novel method to dynamically predict protein functions from the PPI data. Our method regards the function prediction as a dynamic process of finding a suitable prediction domain, from which representative functions of the domain are selected to predict functions of un-annotated proteins. Our method exploits the topological structural information of a PPI network and the semantic relationship between protein functions to measure the relationship between proteins, dynamically select a suitable prediction domain and predict functions. The evaluation on real PPI datasets demonstrated the effectiveness of our proposed method, and generated better prediction results.
Collapse
Affiliation(s)
- Jingyu Hou
- School of Information Technology, Deakin University, 221 Burwood Highway, Burwood, Victoria 3125, Australia.
| | | |
Collapse
|
6
|
Jaramillo-Garzón JA, Gallardo-Chacón JJ, Castellanos-Domínguez CG, Perera-Lluna A. Predictability of gene ontology slim-terms from primary structure information in Embryophyta plant proteins. BMC Bioinformatics 2013; 14:68. [PMID: 23441934 PMCID: PMC3660269 DOI: 10.1186/1471-2105-14-68] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2012] [Accepted: 02/19/2013] [Indexed: 11/25/2022] Open
Abstract
Background Proteins are the key elements on the path from genetic information to the development of life. The roles played by the different proteins are difficult to uncover experimentally as this process involves complex procedures such as genetic modifications, injection of fluorescent proteins, gene knock-out methods and others. The knowledge learned from each protein is usually annotated in databases through different methods such as the proposed by The Gene Ontology (GO) consortium. Different methods have been proposed in order to predict GO terms from primary structure information, but very few are available for large-scale functional annotation of plants, and reported success rates are much less than the reported by other non-plant predictors. This paper explores the predictability of GO annotations on proteins belonging to the Embryophyta group from a set of features extracted solely from their primary amino acid sequence. Results High predictability of several GO terms was found for Molecular Function and Cellular Component. As expected, a lower degree of predictability was found on Biological Process ontology annotations, although a few biological processes were easily predicted. Proteins related to transport and transcription were particularly well predicted from primary structure information. The most discriminant features for prediction were those related to electric charges of the amino-acid sequence and hydropathicity derived features. Conclusions An analysis of GO-slim terms predictability in plants was carried out, in order to determine single categories or groups of functions that are most related with primary structure information. For each highly predictable GO term, the responsible features of such successfulness were identified and discussed. In addition to most published studies, focused on few categories or single ontologies, results in this paper comprise a complete landscape of GO predictability from primary structure encompassing 75 GO terms at molecular, cellular and phenotypical level. Thus, it provides a valuable guide for researchers interested on further advances in protein function prediction on Embryophyta plants.
Collapse
Affiliation(s)
- Jorge Alberto Jaramillo-Garzón
- Departamento de Ingeniería Eléctrica, Electrónica y Computación, Universidad Nacional de Colombia sede Manizales, Campus La Nubia, Km 7 Vía al Magdalena, Manizales-Caldas, Colombia.
| | | | | | | |
Collapse
|
7
|
Puelma T, Gutiérrez RA, Soto A. Discriminative local subspaces in gene expression data for effective gene function prediction. ACTA ACUST UNITED AC 2012; 28:2256-64. [PMID: 22820203 PMCID: PMC3426849 DOI: 10.1093/bioinformatics/bts455] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
MOTIVATION Massive amounts of genome-wide gene expression data have become available, motivating the development of computational approaches that leverage this information to predict gene function. Among successful approaches, supervised machine learning methods, such as Support Vector Machines (SVMs), have shown superior prediction accuracy. However, these methods lack the simple biological intuition provided by co-expression networks (CNs), limiting their practical usefulness. RESULTS In this work, we present Discriminative Local Subspaces (DLS), a novel method that combines supervised machine learning and co-expression techniques with the goal of systematically predict genes involved in specific biological processes of interest. Unlike traditional CNs, DLS uses the knowledge available in Gene Ontology (GO) to generate informative training sets that guide the discovery of expression signatures: expression patterns that are discriminative for genes involved in the biological process of interest. By linking genes co-expressed with these signatures, DLS is able to construct a discriminative CN that links both, known and previously uncharacterized genes, for the selected biological process. This article focuses on the algorithm behind DLS and shows its predictive power using an Arabidopsis thaliana dataset and a representative set of 101 GO terms from the Biological Process Ontology. Our results show that DLS has a superior average accuracy than both SVMs and CNs. Thus, DLS is able to provide the prediction accuracy of supervised learning methods while maintaining the intuitive understanding of CNs. AVAILABILITY A MATLAB® implementation of DLS is available at http://virtualplant.bio.puc.cl/cgi-bin/Lab/tools.cgi.
Collapse
Affiliation(s)
- Tomas Puelma
- Department of Molecular Genetics and Microbiology, FONDAP Center for Genome Regulation, Pontificia Universidad Catolica de Chile, Santiago, Chile.
| | | | | |
Collapse
|
8
|
Abstract
An overwhelming array of structural variants has evolved from a comparatively small number of protein structural domains; which has in turn facilitated an expanse of functional derivatives. Herein, I review the primary mechanisms which have contributed to the vastness of our existing, and expanding, protein repertoires. Protein function prediction strategies, both sequence and structure based, are also discussed and their associated strengths and weaknesses assessed.
Collapse
Affiliation(s)
- Roy D Sleator
- Department of Biological Sciences, Cork Institute of Technology, Cork, Ireland.
| |
Collapse
|
9
|
Abstract
The recent explosion in the number and diversity of novel proteins identified by the large-scale "omics" technologies poses new and important questions to the blossoming field of systems biology--what are all these proteins, how did they come about, and most importantly, what do they do? From a comparatively small number of protein structural domains a staggering array of structural variants has evolved, which has in turn facilitated an expanse of functional derivatives. This review considers the primary mechanisms that have contributed to the vastness of our existing, and expanding, protein repertoires, while also outlining the protocols available for elucidating their true biological function. The various function prediction programs available, both sequence and structure based, are discussed and their associated strengths and weaknesses outlined.
Collapse
Affiliation(s)
- Roy D Sleator
- Department of Biological Sciences, Cork Institute of Technology, Bishopstown, Cork, Ireland.
| |
Collapse
|
10
|
ZHAO XINGMING, CHEUNG YIUMING, HUANG DESHUANG. ANALYSIS OF GENE EXPRESSION DATA USING RPEM ALGORITHM IN NORMAL MIXTURE MODEL WITH DYNAMIC ADJUSTMENT OF LEARNING RATE. INT J PATTERN RECOGN 2011. [DOI: 10.1142/s0218001410008056] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Microarray technology is a useful tool for monitoring the expression levels of thousands of genes simultaneously. Recently, mixture modeling has been used to extract expression signatures from gene expression profiles. In general, two separate steps are utilized to estimate the number of classes and model parameters, respectively. However, such a method is often time-consuming and leads to suboptimal solutions. In this paper, we therefore apply a one-step approach, namely Rival Penalized Expectation-Maximization (RPEM) algorithm, to analyze the gene expression data. The RPEM algorithm is capable of estimating the parameters of normal mixture model, while determining the number of classes automatically at the same time. Furthermore, we speed up the learning procedure of RPEM by proposing a new mechanism to adjust the learning rate dynamically. The numerical results on real gene expression data demonstrate that our proposed method is indeed effective and efficient.
Collapse
Affiliation(s)
- XING-MING ZHAO
- Institute of Systems Biology, Shanghai University, Shanghai 200444, P. R. China
| | - YIU-MING CHEUNG
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, P. R. China
| | - DE-SHUANG HUANG
- Intelligent Computing Lab, Institute of Intelligent Machines, Chinese Academy of Sciences, P. O. Box 1130, Hefei, Anhui 230031, P. R. China
| |
Collapse
|
11
|
|
12
|
Sapkota A, Liu X, Zhao XM, Cao Y, Liu J, Liu ZP, Chen L. DIPOS: database of interacting proteins in Oryza sativa. MOLECULAR BIOSYSTEMS 2011; 7:2615-21. [PMID: 21713282 DOI: 10.1039/c1mb05120b] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Rice is an important crop throughout the world and is the staple food for about half the world's population. For better breeding and improved production, we need to know the function of rice molecules which facilitate their function through interactions with each other. The database of interacting proteins in Oryza sativa (DIPOS) provides comprehensive information of interacting proteins in rice, where the interactions are predicted using two computational methods, i.e., interologs and domain based methods. DIPOS contains 14 614 067 pairwise interactions among 27 746 proteins, covering about 41% of the whole Oryaza sativa proteome. Furthermore, each interaction is assigned a confidence score which further enables biologists to sort out the important proteins. Biological explanations of pathways and interactions are also provided based on the database. Public access to the DIPOS is available at and .
Collapse
Affiliation(s)
- Achyut Sapkota
- Institute of Systems Biology, Shanghai University, Shanghai 200444, China
| | | | | | | | | | | | | |
Collapse
|
13
|
Bhattacharya SK, Gomes J, Cebulla CM. Toward failure analyses in systems biology. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2011; 2:507-517. [PMID: 20836044 DOI: 10.1002/wsbm.83] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Parallels between designed and biological systems with respect to formal failure analyses have been presented. Failure analysis in designed systems depends on an identified, limited set of parameters or operation variables with high predictive value. In contrast, the biological systems pose problems in identification of operation variables and the identified variables may not be accurate predictors of failure. The difficulty in parameter identification is because of large numbers of components and the inability to envelope variables at each compartment or contour level. Contour level maps for biological systems are currently non-existent, and most failure models are based on very limited, unilateral operation variables (a mutant gene). Operation variable identification within each contour level will enhance failure analyses of complex biological systems.
Collapse
Affiliation(s)
| | - James Gomes
- Indian Institute of Technology Delhi, Hauz Khas, New Delhi, India
| | - Colleen M Cebulla
- Havener Eye Institute, Department of Ophthalmology, The Ohio State University, Columbus, OH, USA
| |
Collapse
|
14
|
Zhou YZ, Gao Y, Zheng YY. Prediction of Protein-Protein Interactions Using Local Description of Amino Acid Sequence. COMMUNICATIONS IN COMPUTER AND INFORMATION SCIENCE 2011. [DOI: 10.1007/978-3-642-22456-0_37] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
|
15
|
Janga SC, Díaz-Mejía JJ, Moreno-Hagelsieb G. Network-based function prediction and interactomics: the case for metabolic enzymes. Metab Eng 2011; 13:1-10. [PMID: 20654726 DOI: 10.1016/j.ymben.2010.07.001] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2010] [Revised: 07/15/2010] [Accepted: 07/16/2010] [Indexed: 12/19/2022]
Abstract
As sequencing technologies increase in power, determining the functions of unknown proteins encoded by the DNA sequences so produced becomes a major challenge. Functional annotation is commonly done on the basis of amino-acid sequence similarity alone. Long after sequence similarity becomes undetectable by pair-wise comparison, profile-based identification of homologs can often succeed due to the conservation of position-specific patterns, important for a protein's three dimensional folding and function. Nevertheless, prediction of protein function from homology-driven approaches is not without problems. Homologous proteins might evolve different functions and the power of homology detection has already started to reach its maximum. Computational methods for inferring protein function, which exploit the context of a protein in cellular networks, have come to be built on top of homology-based approaches. These network-based functional inference techniques provide both a first hand hint into a proteins' functional role and offer complementary insights to traditional methods for understanding the function of uncharacterized proteins. Most recent network-based approaches aim to integrate diverse kinds of functional interactions to boost both coverage and confidence level. These techniques not only promise to solve the moonlighting aspect of proteins by annotating proteins with multiple functions, but also increase our understanding on the interplay between different functional classes in a cell. In this article we review the state of the art in network-based function prediction and describe some of the underlying difficulties and successes. Given the volume of high-throughput data that is being reported the time is ripe to employ these network-based approaches, which can be used to unravel the functions of the uncharacterized proteins accumulating in the genomic databases.
Collapse
Affiliation(s)
- S C Janga
- MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB20QH, United Kingdom.
| | | | | |
Collapse
|
16
|
Chou KC. Some remarks on protein attribute prediction and pseudo amino acid composition. J Theor Biol 2010; 273:236-47. [PMID: 21168420 PMCID: PMC7125570 DOI: 10.1016/j.jtbi.2010.12.024] [Citation(s) in RCA: 971] [Impact Index Per Article: 64.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2010] [Revised: 12/08/2010] [Accepted: 12/13/2010] [Indexed: 11/29/2022]
Abstract
With the accomplishment of human genome sequencing, the number of sequence-known proteins has increased explosively. In contrast, the pace is much slower in determining their biological attributes. As a consequence, the gap between sequence-known proteins and attribute-known proteins has become increasingly large. The unbalanced situation, which has critically limited our ability to timely utilize the newly discovered proteins for basic research and drug development, has called for developing computational methods or high-throughput automated tools for fast and reliably identifying various attributes of uncharacterized proteins based on their sequence information alone. Actually, during the last two decades or so, many methods in this regard have been established in hope to bridge such a gap. In the course of developing these methods, the following things were often needed to consider: (1) benchmark dataset construction, (2) protein sample formulation, (3) operating algorithm (or engine), (4) anticipated accuracy, and (5) web-server establishment. In this review, we are to discuss each of the five procedures, with a special focus on the introduction of pseudo amino acid composition (PseAAC), its different modes and applications as well as its recent development, particularly in how to use the general formulation of PseAAC to reflect the core and essential features that are deeply hidden in complicated protein sequences.
Collapse
Affiliation(s)
- Kuo-Chen Chou
- Gordon Life Science Institute, 13784 Torrey Del Mar Drive, San Diego, CA 92130, USA.
| |
Collapse
|
17
|
Abstract
Because of the increasing number of structures of unknown function accumulated by ongoing structural genomics projects, there is an urgent need for computational methods for characterizing protein tertiary structures. As functions of many of these proteins are not easily predicted by conventional sequence database searches, a legitimate strategy is to utilize structure information in function characterization. Of particular interest is prediction of ligand binding to a protein, as ligand molecule recognition is a major part of molecular function of proteins. Predicting whether a ligand molecule binds a protein is a complex problem due to the physical nature of protein-ligand interactions and the flexibility of both binding sites and ligand molecules. However, geometric and physicochemical complementarity is observed between the ligand and its binding site in many cases. Therefore, ligand molecules which bind to a local surface site in a protein can be predicted by finding similar local pockets of known binding ligands in the structure database. Here, we present two representations of ligand binding pockets and utilize them for ligand binding prediction by pocket shape comparison. These representations are based on mapping of surface properties of binding pockets, which are compactly described either by the two-dimensional pseudo-Zernike moments or the three-dimensional Zernike descriptors. These compact representations allow a fast real-time pocket searching against a database. Thorough benchmark studies employing two different datasets show that our representations are competitive with the other existing methods. Limitations and potentials of the shape-based methods as well as possible improvements are discussed.
Collapse
Affiliation(s)
- Rayan Chikhi
- École Normale Supérieure de Cachan, Computer Science Department, 61 Avenue du President Wilson, 94235 Cachan cedex, Britanny, France
| | - Lee Sael
- Department of Computer Science, College of Science, Purdue University, West Lafayette, IN, 47907, USA
| | - Daisuke Kihara
- Department of Biological Sciences, College of Science, Purdue University, West Lafayette, IN, 47907, USA
- Department of Computer Science, College of Science, Purdue University, West Lafayette, IN, 47907, USA
- Markey Center for Structural Biology, College of Science, Purdue University, West Lafayette, IN, 47907, USA
| |
Collapse
|
18
|
Xia JF, Zhao XM, Huang DS. Predicting protein-protein interactions from protein sequences using meta predictor. Amino Acids 2010; 39:1595-9. [PMID: 20386937 DOI: 10.1007/s00726-010-0588-1] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2009] [Accepted: 03/27/2010] [Indexed: 11/24/2022]
Abstract
A novel method is proposed for predicting protein-protein interactions (PPIs) based on the meta approach, which predicts PPIs using support vector machine that combines results by six independent state-of-the-art predictors. Significant improvement in prediction performance is observed, when performed on Saccharomyces cerevisiae and Helicobacter pylori datasets. In addition, we used the final prediction model trained on the PPIs dataset of S. cerevisiae to predict interactions in other species. The results reveal that our meta model is also capable of performing cross-species predictions. The source code and the datasets are available at http://home.ustc.edu.cn/~jfxia/Meta_PPI.html.
Collapse
Affiliation(s)
- Jun-Feng Xia
- Intelligent Computing Laboratory, Hefei Institute of Intelligent Machines, Chinese Academy of Sciences, P.O. Box 1130, Hefei, Anhui, 230031, China
| | | | | |
Collapse
|
19
|
Kaleem A, Ahmad I, Walker-Nasir E, Hoessli DC, Shakoori AR. Effect on the Ras/Raf signaling pathway of post-translational modifications of neurofibromin: in silico study of protein modification responsible for regulatory pathways. J Cell Biochem 2010; 108:816-24. [PMID: 19718661 DOI: 10.1002/jcb.22301] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Mapping and chemical characterization of post-translational modifications (PTMs) in proteins are critical to understand the regulatory mechanisms involving modified proteins and their role in disease. Neurofibromatosis type 1 (NF-1) is an autosomal dominantly inherited disorder, where NF1 mutations usually result in a reduced level of the tumor suppressor protein, neurofibromin (NF). NF is a multifunctional cytoplasmic protein that regulates microtubule dynamics and participates in several signaling pathways, particularly the RAS signaling pathway. NF is a Ras GTPase-activating protein (GAP) that prevents oncogenesis by converting GTP-Ras to GDP-Ras. This function of NF is regulated by phosphorylation. Interplay of phosphorylation with O-GlcNAc modification on the same or vicinal Ser/Thr residues, the Yin Yang sites, is well known in cytoplasmic and nuclear proteins. The dynamic aspects of PTMs and their interplay being difficult to follow in vivo, we undertook this in silico work to predict and define the possible role of Yin Yang sites in NF-1. Interplay of phosphorylation and O-GlcNAc modification is proposed as a mechanism controlling the Ras signaling pathway.
Collapse
Affiliation(s)
-
- Institute of Molecular Sciences and Bioinformatics, Lahore, Pakistan.
| | | | | | | | | | | |
Collapse
|
20
|
An overview of in silico protein function prediction. Arch Microbiol 2010; 192:151-5. [DOI: 10.1007/s00203-010-0549-9] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2009] [Revised: 01/08/2010] [Accepted: 01/10/2010] [Indexed: 12/12/2022]
|
21
|
Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: I. Method development. J Comput Aided Mol Des 2009; 23:773-84. [DOI: 10.1007/s10822-009-9273-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2008] [Accepted: 04/15/2009] [Indexed: 12/12/2022]
|
22
|
Li X, Bazer FW, Gao H, Jobgen W, Johnson GA, Li P, McKnight JR, Satterfield MC, Spencer TE, Wu G. Amino acids and gaseous signaling. Amino Acids 2009; 37:65-78. [DOI: 10.1007/s00726-009-0264-5] [Citation(s) in RCA: 96] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2009] [Accepted: 02/12/2009] [Indexed: 01/08/2023]
|
23
|
Proteomic analysis reveals altered expression of proteins related to glutathione metabolism and apoptosis in the small intestine of zinc oxide-supplemented piglets. Amino Acids 2009; 37:209-18. [DOI: 10.1007/s00726-009-0242-y] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2009] [Accepted: 01/12/2009] [Indexed: 10/21/2022]
|