1
|
Kochen NN, Seaney D, Vasandani V, Murray M, Braun AR, Sachs JN. Post-translational modification sites are present in hydrophilic cavities of alpha-synuclein, tau, FUS, and TDP-43 fibrils: A molecular dynamics study. Proteins 2024; 92:854-864. [PMID: 38458997 PMCID: PMC11147710 DOI: 10.1002/prot.26679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 02/09/2024] [Accepted: 02/28/2024] [Indexed: 03/10/2024]
Abstract
Hydration plays a crucial role in the refolding of intrinsically disordered proteins into amyloid fibrils; however, the specific interactions between water and protein that may contribute to this process are still unknown. In our previous studies of alpha-synuclein (aSyn), we have shown that waters confined in fibril cavities are stabilizing features of this pathological fold; and that amino acids that hydrogen bond with these confined waters modulate primary and seeded aggregation. Here, we extend our aSyn molecular dynamics (MD) simulations with three new polymorphs and correlate MD trajectory information with known post-translational modifications (PTMs) and experimental data. We show that cavity residues are more evolutionarily conserved than non-cavity residues and are enriched with PTM sites. As expected, the confinement within hydrophilic cavities results in more stably hydrated amino acids. Interestingly, cavity PTM sites display the longest protein-water hydrogen bond lifetimes, three-fold greater than non-PTM cavity sites. Utilizing the deep mutational screen dataset by Newberry et al. and the Thioflavin T aggregation review by Pancoe et al. parsed using a fibril cavity/non-cavity definition, we show that hydrophobic changes to amino acids in cavities have a larger effect on fitness and aggregation rate than residues outside cavities, supporting our hypothesis that these sites are involved in the inhibition of aSyn toxic fibrillization. Finally, we expand our study to include analysis of fibril structures of tau, FUS, TDP-43, prion, and hnRNPA1; all of which contained hydrated cavities, with tau, FUS, and TDP-43 recapitulating our PTM results in aSyn fibril cavities.
Collapse
Affiliation(s)
- Noah Nathan Kochen
- Department of Biomedical Engineering, University of Minnesota, Minneapolis, Minnesota, USA
| | - Darren Seaney
- Department of Biomedical Engineering, University of Minnesota, Minneapolis, Minnesota, USA
| | - Vivek Vasandani
- Department of Biomedical Engineering, University of Minnesota, Minneapolis, Minnesota, USA
| | - Marguerite Murray
- Department of Biomedical Engineering, University of Minnesota, Minneapolis, Minnesota, USA
| | - Anthony R Braun
- Department of Biomedical Engineering, University of Minnesota, Minneapolis, Minnesota, USA
| | - Jonathan N Sachs
- Department of Biomedical Engineering, University of Minnesota, Minneapolis, Minnesota, USA
| |
Collapse
|
2
|
Goldtzvik Y, Sen N, Lam SD, Orengo C. Protein diversification through post-translational modifications, alternative splicing, and gene duplication. Curr Opin Struct Biol 2023; 81:102640. [PMID: 37354790 DOI: 10.1016/j.sbi.2023.102640] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Revised: 05/05/2023] [Accepted: 05/24/2023] [Indexed: 06/26/2023]
Abstract
Proteins provide the basis for cellular function. Having multiple versions of the same protein within a single organism provides a way of regulating its activity or developing novel functions. Post-translational modifications of proteins, by means of adding/removing chemical groups to amino acids, allow for a well-regulated and controlled way of generating functionally distinct protein species. Alternative splicing is another method with which organisms possibly generate new isoforms. Additionally, gene duplication events throughout evolution generate multiple paralogs of the same genes, resulting in multiple versions of the same protein within an organism. In this review, we discuss recent advancements in the study of these three methods of protein diversification and provide illustrative examples of how they affect protein structure and function.
Collapse
Affiliation(s)
- Yonathan Goldtzvik
- Department of Structural and Molecular Biology, University College London, London, United Kingdom
| | - Neeladri Sen
- Department of Structural and Molecular Biology, University College London, London, United Kingdom. https://twitter.com/@NeeladriSen
| | - Su Datt Lam
- Department of Structural and Molecular Biology, University College London, London, United Kingdom; Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, Bangi, Malaysia
| | - Christine Orengo
- Department of Structural and Molecular Biology, University College London, London, United Kingdom.
| |
Collapse
|
3
|
Chai CY, Maran S, Thew HY, Tan YC, Rahman NMANA, Cheng WH, Lai KS, Loh JY, Yap WS. Predicting Deleterious Non-Synonymous Single Nucleotide Polymorphisms (nsSNPs) of HRAS Gene and In Silico Evaluation of Their Structural and Functional Consequences towards Diagnosis and Prognosis of Cancer. BIOLOGY 2022; 11:1604. [PMID: 36358305 PMCID: PMC9688001 DOI: 10.3390/biology11111604] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 09/28/2022] [Accepted: 10/09/2022] [Indexed: 10/19/2024]
Abstract
The Harvey rat sarcoma (HRAS) proto-oncogene belongs to the RAS family and is one of the pathogenic genes that cause cancer. Deleterious nsSNPs might have adverse consequences at the protein level. This study aimed to investigate deleterious nsSNPs in the HRAS gene in predicting structural alterations associated with mutants that disrupt normal protein-protein interactions. Functional and structural analysis was employed in analyzing the HRAS nsSNPs. Putative post-translational modification sites and the changes in protein-protein interactions, which included a variety of signal cascades, were also investigated. Five different bioinformatics tools predicted 33 nsSNPs as "pathogenic" or "harmful". Stability analysis predicted rs1554885139, rs770492627, rs1589792804, rs730880460, rs104894227, rs104894227, and rs121917759 as unstable. Protein-protein interaction analysis revealed that HRAS has a hub connecting three clusters consisting of 11 proteins, and changes in HRAS might cause signal cascades to dissociate. Furthermore, Kaplan-Meier bioinformatics analyses indicated that the HRAS gene deregulation affected the overall survival rate of patients with breast cancer, leading to prognostic significance. Thus, based on these analyses, our study suggests that the reported nsSNPs of HRAS may serve as potential targets for different proteomic studies, diagnoses, and therapeutic interventions focusing on cancer.
Collapse
Affiliation(s)
- Chuan-Yu Chai
- Department of Biological Science, Faculty of Science, Universiti Tunku Abdul Rahman, Jalan Universiti, Bandar Barat, Kampar 31900, Malaysia
| | - Sathiya Maran
- School of Pharmacy, Monash University Malaysia, Jalan Lagoon Selatan, Bandar Sunway 47500, Malaysia
| | - Hin-Yee Thew
- School of Pharmacy, Monash University Malaysia, Jalan Lagoon Selatan, Bandar Sunway 47500, Malaysia
| | - Yong-Chiang Tan
- School of Postgraduate Studies, International Medical University, Jalan Jalil Perkasa 19, Bukit Jalil, Kuala Lumpur 57000, Malaysia
| | - Nik Mohd Afizan Nik Abd Rahman
- Department of Cell and Molecular Biology, Faculty of Biotechnology and Biomolecular Sciences, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia
| | - Wan-Hee Cheng
- Faculty of Health and Life Sciences, INTI International University, Persiaran Perdana BBN, Putra Nilai, Nilai 71800, Malaysia
| | - Kok-Song Lai
- Health Sciences Division, Abu Dhabi Women’s College, Higher Colleges of Technology, Abu Dhabi 41012, United Arab Emirates
| | - Jiun-Yan Loh
- Centre of Research for Advanced Aquaculture (CORAA), UCSI University, No. 1, Jalan Menara Gading UCSI Height, Cheras, Kuala Lumpur 56000, Malaysia
| | - Wai-Sum Yap
- He & Ni Academy, Office Tower B, Northpoint Mid Valley City, Kuala Lumpur 59200, Malaysia
| |
Collapse
|
4
|
Wadie B, Kleshchevnikov V, Sandaltzopoulou E, Benz C, Petsalaki E. Use of viral motif mimicry improves the proteome-wide discovery of human linear motifs. Cell Rep 2022; 39:110764. [PMID: 35508127 DOI: 10.1016/j.celrep.2022.110764] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Revised: 02/09/2022] [Accepted: 04/08/2022] [Indexed: 12/16/2022] Open
Abstract
Linear motifs have an integral role in dynamic cell functions, including cell signaling. However, due to their small size, low complexity, and frequent mutations, identifying novel functional motifs poses a challenge. Viruses rely extensively on the molecular mimicry of cellular linear motifs. In this study, we apply systematic motif prediction combined with functional filters to identify human linear motifs convergently evolved also in viral proteins. We observe an increase in the sensitivity of motif prediction and improved enrichment in known instances. We identify >7,300 non-redundant motif instances at various confidence levels, 99 of which are supported by all functional and structural filters. Overall, we provide a pipeline to improve the identification of functional linear motifs from interactomics datasets and a comprehensive catalog of putative human motifs that can contribute to our understanding of the human domain-linear motif code and the associated mechanisms of viral interference.
Collapse
Affiliation(s)
- Bishoy Wadie
- European Molecular Biology Laboratory - European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Vitalii Kleshchevnikov
- European Molecular Biology Laboratory - European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Elissavet Sandaltzopoulou
- European Molecular Biology Laboratory - European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Caroline Benz
- Department of Chemistry - BMC, Uppsala University, Uppsala, Sweden
| | - Evangelia Petsalaki
- European Molecular Biology Laboratory - European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK.
| |
Collapse
|
5
|
Pellegrina D, Bahcheli AT, Krassowski M, Reimand J. Human phospho-signaling networks of SARS-CoV-2 infection are rewired by population genetic variants. Mol Syst Biol 2022; 18:e10823. [PMID: 35579274 PMCID: PMC9112486 DOI: 10.15252/msb.202110823] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Revised: 04/26/2022] [Accepted: 04/27/2022] [Indexed: 11/23/2022] Open
Abstract
SARS-CoV-2 infection hijacks signaling pathways and induces protein-protein interactions between human and viral proteins. Human genetic variation may impact SARS-CoV-2 infection and COVID-19 pathology; however, the genetic variation in these signaling networks remains uncharacterized. Here, we studied human missense single nucleotide variants (SNVs) altering phosphorylation sites modulated by SARS-CoV-2 infection, using machine learning to identify amino acid substitutions altering kinase-bound sequence motifs. We found 2,033 infrequent phosphorylation-associated SNVs (pSNVs) that are enriched in sequence motif alterations, potentially reflecting the evolution of signaling networks regulating host defenses. Proteins with pSNVs are involved in viral life cycle and host responses, including RNA splicing, interferon response (TRIM28), and glucose homeostasis (TBC1D4) with potential associations with COVID-19 comorbidities. pSNVs disrupt CDK and MAPK substrate motifs and replace these with motifs of Tank Binding Kinase 1 (TBK1) involved in innate immune responses, indicating consistent rewiring of signaling networks. Several pSNVs associate with severe COVID-19 and hospitalization (STARD13, ARFGEF2). Our analysis highlights potential genetic factors contributing to inter-individual variation of SARS-CoV-2 infection and COVID-19 and suggests leads for mechanistic and translational studies.
Collapse
Affiliation(s)
- Diogo Pellegrina
- Computational Biology ProgramOntario Institute for Cancer ResearchTorontoONCanada
| | - Alexander T Bahcheli
- Computational Biology ProgramOntario Institute for Cancer ResearchTorontoONCanada
- Department of Molecular GeneticsUniversity of TorontoTorontoONCanada
| | - Michal Krassowski
- Medical Sciences DivisionNuffield Department of Women's and Reproductive HealthUniversity of OxfordOxfordUK
| | - Jüri Reimand
- Computational Biology ProgramOntario Institute for Cancer ResearchTorontoONCanada
- Department of Molecular GeneticsUniversity of TorontoTorontoONCanada
- Department of Medical BiophysicsUniversity of TorontoTorontoONCanada
| |
Collapse
|
6
|
Moraes BC, Ribeiro-Filho HV, Roldão AP, Toniolo EF, Carretero GPB, Sgro GG, Batista FAH, Berardi DE, Oliveira VRS, Tomasin R, Vieceli FM, Pramio DT, Cardoso AB, Figueira ACM, Farah SC, Devi LA, Dale CS, de Oliveira PSL, Schechtman D. Structural analysis of TrkA mutations in patients with congenital insensitivity to pain reveals PLCγ as an analgesic drug target. Sci Signal 2022; 15:eabm6046. [PMID: 35471943 DOI: 10.1126/scisignal.abm6046] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Chronic pain is a major health issue, and the search for new analgesics has become increasingly important because of the addictive properties and unwanted side effects of opioids. To explore potentially new drug targets, we investigated mutations in the NTRK1 gene found in individuals with congenital insensitivity to pain with anhidrosis (CIPA). NTRK1 encodes tropomyosin receptor kinase A (TrkA), the receptor for nerve growth factor (NGF) and that contributes to nociception. Molecular modeling and biochemical analysis identified mutations that decreased the interaction between TrkA and one of its substrates and signaling effectors, phospholipase Cγ (PLCγ). We developed a cell-permeable phosphopeptide derived from TrkA (TAT-pQYP) that bound the Src homology domain 2 (SH2) of PLCγ. In HEK-293T cells, TAT-pQYP inhibited the binding of heterologously expressed TrkA to PLCγ and decreased NGF-induced, TrkA-mediated PLCγ activation and signaling. In mice, intraplantar administration of TAT-pQYP decreased mechanical sensitivity in an inflammatory pain model, suggesting that targeting this interaction may be analgesic. The findings demonstrate a strategy to identify new targets for pain relief by analyzing the signaling pathways that are perturbed in CIPA.
Collapse
Affiliation(s)
- Beatriz C Moraes
- Department of Biochemistry, Chemistry Institute, University of São Paulo, SP 05508-000, Brazil
| | - Helder V Ribeiro-Filho
- Brazilian Center for Research in Energy and Materials (CNPEM), Brazilian Biosciences National Laboratory (LNBio) Campinas, SP 13083-100, Brazil
| | - Allan P Roldão
- Department of Biochemistry, Chemistry Institute, University of São Paulo, SP 05508-000, Brazil
| | - Elaine F Toniolo
- Laboratory of Neuromodulation of Experimental Pain (LaNed), Department of Anatomy, Institute of Biomedical Sciences, University of Sao Paulo, SP 05508-000, Brazil
| | - Gustavo P B Carretero
- Department of Biochemistry, Chemistry Institute, University of São Paulo, SP 05508-000, Brazil
| | - Germán G Sgro
- Department of Biochemistry, Chemistry Institute, University of São Paulo, SP 05508-000, Brazil
- Department of Biomolecular Sciences, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, SP 14040903, Brazil
| | - Fernanda A H Batista
- Brazilian Center for Research in Energy and Materials (CNPEM), Brazilian Biosciences National Laboratory (LNBio) Campinas, SP 13083-100, Brazil
| | - Damian E Berardi
- Department of Biochemistry, Chemistry Institute, University of São Paulo, SP 05508-000, Brazil
| | - Victoria R S Oliveira
- Laboratory of Neuromodulation of Experimental Pain (LaNed), Department of Anatomy, Institute of Biomedical Sciences, University of Sao Paulo, SP 05508-000, Brazil
| | - Rebeka Tomasin
- Department of Biochemistry, Chemistry Institute, University of São Paulo, SP 05508-000, Brazil
| | - Felipe M Vieceli
- Department of Biochemistry, Chemistry Institute, University of São Paulo, SP 05508-000, Brazil
| | - Dimitrius T Pramio
- Department of Biochemistry, Chemistry Institute, University of São Paulo, SP 05508-000, Brazil
| | - Alexandre B Cardoso
- Department of Biochemistry, Chemistry Institute, University of São Paulo, SP 05508-000, Brazil
| | - Ana C M Figueira
- Brazilian Center for Research in Energy and Materials (CNPEM), Brazilian Biosciences National Laboratory (LNBio) Campinas, SP 13083-100, Brazil
| | - Shaker C Farah
- Department of Biochemistry, Chemistry Institute, University of São Paulo, SP 05508-000, Brazil
| | - Lakshmi A Devi
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Camila S Dale
- Laboratory of Neuromodulation of Experimental Pain (LaNed), Department of Anatomy, Institute of Biomedical Sciences, University of Sao Paulo, SP 05508-000, Brazil
| | - Paulo S L de Oliveira
- Brazilian Center for Research in Energy and Materials (CNPEM), Brazilian Biosciences National Laboratory (LNBio) Campinas, SP 13083-100, Brazil
| | - Deborah Schechtman
- Department of Biochemistry, Chemistry Institute, University of São Paulo, SP 05508-000, Brazil
| |
Collapse
|
7
|
Spielmann M, Kircher M. Computational and experimental methods for classifying variants of unknown clinical significance. Cold Spring Harb Mol Case Stud 2022; 8:mcs.a006196. [PMID: 35483875 PMCID: PMC9059783 DOI: 10.1101/mcs.a006196] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
The increase in sequencing capacity, reduction in costs, and national and international coordinated efforts have led to the widespread introduction of next-generation sequencing (NGS) technologies in patient care. More generally, human genetics and genomic medicine are gaining importance for more and more patients. Some communities are already discussing the prospect of sequencing each individual's genome at time of birth. Together with digital health records, this shall enable individualized treatments and preventive measures, so-called precision medicine. A central step in this process is the identification of disease causal mutations or variant combinations that make us more susceptible for diseases. Although various technological advances have improved the identification of genetic alterations, the interpretation and ranking of the identified variants remains a major challenge. Based on our knowledge of molecular processes or previously identified disease variants, we can identify potentially functional genetic variants and, using different lines of evidence, we are sometimes able to demonstrate their pathogenicity directly. However, the vast majority of variants are classified as variants of uncertain clinical significance (VUSs) with not enough experimental evidence to determine their pathogenicity. In these cases, computational methods may be used to improve the prioritization and an increasing toolbox of experimental methods is emerging that can be used to assay the molecular effects of VUSs. Here, we discuss how computational and experimental methods can be used to create catalogs of variant effects for a variety of molecular and cellular phenotypes. We discuss the prospects of integrating large-scale functional data with machine learning and clinical knowledge for the development of accurate pathogenicity predictions for clinical applications.
Collapse
Affiliation(s)
- Malte Spielmann
- Institute of Human Genetics, University of Lübeck, 23562 Lübeck, Germany;,Institute of Human Genetics, Christian-Albrechts-Universität, 24105 Kiel, Germany;,Human Molecular Genomics Group, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany;,DZHK (German Centre for Cardiovascular Research), partner site Hamburg/Lübeck/Kiel, 23562 Lübeck, Germany
| | - Martin Kircher
- Institute of Human Genetics, University of Lübeck, 23562 Lübeck, Germany;,Berlin Institute of Health at Charité—Universitätsmedizin Berlin, 10117 Berlin, Germany;,DZHK (German Centre for Cardiovascular Research), partner site Berlin, 10115 Berlin, Germany
| |
Collapse
|
8
|
Glycomic and Glycoproteomic Techniques in Neurodegenerative Disorders and Neurotrauma: Towards Personalized Markers. Cells 2022; 11:cells11030581. [PMID: 35159390 PMCID: PMC8834236 DOI: 10.3390/cells11030581] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2021] [Revised: 01/22/2022] [Accepted: 02/03/2022] [Indexed: 12/16/2022] Open
Abstract
The proteome represents all the proteins expressed by a genome, a cell, a tissue, or an organism at any given time under defined physiological or pathological circumstances. Proteomic analysis has provided unparalleled opportunities for the discovery of expression patterns of proteins in a biological system, yielding precise and inclusive data about the system. Advances in the proteomics field opened the door to wider knowledge of the mechanisms underlying various post-translational modifications (PTMs) of proteins, including glycosylation. As of yet, the role of most of these PTMs remains unidentified. In this state-of-the-art review, we present a synopsis of glycosylation processes and the pathophysiological conditions that might ensue secondary to glycosylation shortcomings. The dynamics of protein glycosylation, a crucial mechanism that allows gene and pathway regulation, is described. We also explain how-at a biomolecular level-mutations in glycosylation-related genes may lead to neuropsychiatric manifestations and neurodegenerative disorders. We then analyze the shortcomings of glycoproteomic studies, putting into perspective their downfalls and the different advanced enrichment techniques that emanated to overcome some of these challenges. Furthermore, we summarize studies tackling the association between glycosylation and neuropsychiatric disorders and explore glycoproteomic changes in neurodegenerative diseases, including Alzheimer's disease, Parkinson's disease, Huntington disease, multiple sclerosis, and amyotrophic lateral sclerosis. We finally conclude with the role of glycomics in the area of traumatic brain injury (TBI) and provide perspectives on the clinical application of glycoproteomics as potential diagnostic tools and their application in personalized medicine.
Collapse
|
9
|
English N, Torres M. Enhancing the Discovery of Functional Post-Translational Modification Sites with Machine Learning Models - Development, Validation, and Interpretation. Methods Mol Biol 2022; 2499:221-260. [PMID: 35696084 DOI: 10.1007/978-1-0716-2317-6_12] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Protein posttranslational modifications (PTMs) are a rapidly expanding feature class of significant importance in cell biology. Due to a high burden of experimental proof, the number of functionals PTMs in the eukaryotic proteome is currently underestimated. Furthermore, not all PTMs are functionally equivalent. Computational approaches that can confidently recommend PTMs of probable function can improve the heuristics of PTM investigation and alleviate these problems. To address this need, we developed SAPH-ire: a multifeature heuristic neural network model that takes community wisdom into account by recommending experimental PTMs similar to those which have previously been established as having regulatory impact. Here, we describe the principle behind the SAPH-ire model, how it is developed, how we evaluate its performance, and important caveats to consider when building and interpreting such models. Finally, we discus current limitations of functional PTM prediction models and highlight potential mechanisms for their improvement.
Collapse
Affiliation(s)
- Nolan English
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
| | - Matthew Torres
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA.
| |
Collapse
|
10
|
Silva-Costa LC, Smith BJ. Post-translational Modifications in Brain Diseases: A Future for Biomarkers. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2022; 1382:129-141. [DOI: 10.1007/978-3-031-05460-0_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
11
|
Shauli T, Brandes N, Linial M. Evolutionary and functional lessons from human-specific amino acid substitution matrices. NAR Genom Bioinform 2021; 3:lqab079. [PMID: 34541526 PMCID: PMC8445205 DOI: 10.1093/nargab/lqab079] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Revised: 08/02/2021] [Accepted: 09/14/2021] [Indexed: 12/26/2022] Open
Abstract
Human genetic variation in coding regions is fundamental to the study of protein structure and function. Most methods for interpreting missense variants consider substitution measures derived from homologous proteins across different species. In this study, we introduce human-specific amino acid (AA) substitution matrices that are based on genetic variations in the modern human population. We analyzed the frequencies of >4.8M single nucleotide variants (SNVs) at codon and AA resolution and compiled human-centric substitution matrices that are fundamentally different from classic cross-species matrices (e.g. BLOSUM, PAM). Our matrices are asymmetric, with some AA replacements showing significant directional preference. Moreover, these AA matrices are only partly predicted by nucleotide substitution rates. We further test the utility of our matrices in exposing functional signals of experimentally-validated protein annotations. A significant reduction in AA transition frequencies was observed across nine post-translational modification (PTM) types and four ion-binding sites. Our results propose a purifying selection signal in the human proteome across a diverse set of functional protein annotations and provide an empirical baseline for interpreting human genetic variation in coding regions.
Collapse
Affiliation(s)
- Tair Shauli
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, 91904, Israel
| | - Nadav Brandes
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, 91904, Israel
| | - Michal Linial
- Department of Biological Chemistry, Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, 91904, Israel
| |
Collapse
|
12
|
A global map of associations between types of protein posttranslational modifications and human genetic diseases. iScience 2021; 24:102917. [PMID: 34430807 PMCID: PMC8365368 DOI: 10.1016/j.isci.2021.102917] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Revised: 06/27/2021] [Accepted: 07/27/2021] [Indexed: 12/14/2022] Open
Abstract
There are >200 types of protein posttranslational modifications (PTMs) described in eukaryotes, each with unique proteome coverage and functions. We hypothesized that some genetic diseases may be caused by the removal of a specific type of PTMs by genomic variants and the consequent deregulation of particular functions. We collected >320,000 human PTMs representing 59 types and crossed them with >4M nonsynonymous DNA variants annotated with predicted pathogenicity and disease associations. We report >1.74M PTM-variant co-occurrences that an enrichment analysis distributed into 215 pairwise associations between 18 PTM types and 148 genetic diseases. Of them, 42% were not previously described. Removal of lysine acetylation exerts the most pronounced effect, and less studied PTM types such as S-glutathionylation or S-nitrosylation show relevance. Using pathogenicity predictions, we identified PTM sites that may produce particular diseases if prevented. Our results provide evidence of a substantial impact of PTM-specific removal on the pathogenesis of genetic diseases and phenotypes. There is an enrichment of disease-associated nsSNVs preventing certain types of PTMs We report 215 pairwise associations between 18 PTM types and 148 genetic diseases The removal of lysine acetylation exerts the most pronounced effect We report a set of PTM sites that may produce particular diseases if prevented
Collapse
|
13
|
Krassowski M, Pellegrina D, Mee MW, Fradet-Turcotte A, Bhat M, Reimand J. ActiveDriverDB: Interpreting Genetic Variation in Human and Cancer Genomes Using Post-translational Modification Sites and Signaling Networks (2021 Update). Front Cell Dev Biol 2021; 9:626821. [PMID: 33834021 PMCID: PMC8021862 DOI: 10.3389/fcell.2021.626821] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 02/08/2021] [Indexed: 12/14/2022] Open
Abstract
Deciphering the functional impact of genetic variation is required to understand phenotypic diversity and the molecular mechanisms of inherited disease and cancer. While millions of genetic variants are now mapped in genome sequencing projects, distinguishing functional variants remains a major challenge. Protein-coding variation can be interpreted using post-translational modification (PTM) sites that are core components of cellular signaling networks controlling molecular processes and pathways. ActiveDriverDB is an interactive proteo-genomics database that uses more than 260,000 experimentally detected PTM sites to predict the functional impact of genetic variation in disease, cancer and the human population. Using machine learning tools, we prioritize proteins and pathways with enriched PTM-specific amino acid substitutions that potentially rewire signaling networks via induced or disrupted short linear motifs of kinase binding. We then map these effects to site-specific protein interaction networks and drug targets. In the 2021 update, we increased the PTM datasets by nearly 50%, included glycosylation, sumoylation and succinylation as new types of PTMs, and updated the workflows to interpret inherited disease mutations. We added a recent phosphoproteomics dataset reflecting the cellular response to SARS-CoV-2 to predict the impact of human genetic variation on COVID-19 infection and disease course. Overall, we estimate that 16-21% of known amino acid substitutions affect PTM sites among pathogenic disease mutations, somatic mutations in cancer genomes and germline variants in the human population. These data underline the potential of interpreting genetic variation through the lens of PTMs and signaling networks. The open-source database is freely available at www.ActiveDriverDB.org.
Collapse
Affiliation(s)
- Michal Krassowski
- Nuffield Department of Women’s and Reproductive Health, Medical Sciences Division, University of Oxford, Oxford, United Kingdom
| | - Diogo Pellegrina
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Miles W. Mee
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Amelie Fradet-Turcotte
- Department of Molecular Biology, Medical Biochemistry and Pathology, Universite Laval, Quebec, QC, Canada
- Oncology Division, Centre Hospitalier Universitaire (CHU) de Quebec-Universite Laval Research Center, Quebec, QC, Canada
| | - Mamatha Bhat
- Multiorgan Transplant Program, University Health Network, Toronto, ON, Canada
- Division of Gastroenterology & Hepatology, Department of Medicine, University of Toronto, Toronto, ON, Canada
| | - Jüri Reimand
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
14
|
Holstein E, Dittmann A, Kääriäinen A, Pesola V, Koivunen J, Pihlajaniemi T, Naba A, Izzi V. The Burden of Post-Translational Modification (PTM)-Disrupting Mutations in the Tumor Matrisome. Cancers (Basel) 2021; 13:1081. [PMID: 33802493 PMCID: PMC7959462 DOI: 10.3390/cancers13051081] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Revised: 02/25/2021] [Accepted: 02/26/2021] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND To evaluate the occurrence of mutations affecting post-translational modification (PTM) sites in matrisome genes across different tumor types, in light of their genomic and functional contexts and in comparison with the rest of the genome. METHODS This study spans 9075 tumor samples and 32 tumor types from The Cancer Genome Atlas (TCGA) Pan-Cancer cohort and identifies 151,088 non-silent mutations in the coding regions of the matrisome, of which 1811 affecting known sites of hydroxylation, phosphorylation, N- and O-glycosylation, acetylation, ubiquitylation, sumoylation and methylation PTM. RESULTS PTM-disruptive mutations (PTMmut) in the matrisome are less frequent than in the rest of the genome, seem independent of cell-of-origin patterns but show dependence on the nature of the matrisome protein affected and the background PTM types it generally harbors. Also, matrisome PTMmut are often found among structural and functional protein regions and in proteins involved in homo- and heterotypic interactions, suggesting potential disruption of matrisome functions. CONCLUSIONS Though quantitatively minoritarian in the spectrum of matrisome mutations, PTMmut show distinctive features and damaging potential which might concur to deregulated structural, functional, and signaling networks in the tumor microenvironment.
Collapse
Affiliation(s)
- Elisa Holstein
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, FI-90014 Oulu, Finland; (E.H.); (A.D.); (A.K.); (V.P.); (J.K.); (T.P.)
| | - Annalena Dittmann
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, FI-90014 Oulu, Finland; (E.H.); (A.D.); (A.K.); (V.P.); (J.K.); (T.P.)
| | - Anni Kääriäinen
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, FI-90014 Oulu, Finland; (E.H.); (A.D.); (A.K.); (V.P.); (J.K.); (T.P.)
| | - Vilma Pesola
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, FI-90014 Oulu, Finland; (E.H.); (A.D.); (A.K.); (V.P.); (J.K.); (T.P.)
| | - Jarkko Koivunen
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, FI-90014 Oulu, Finland; (E.H.); (A.D.); (A.K.); (V.P.); (J.K.); (T.P.)
| | - Taina Pihlajaniemi
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, FI-90014 Oulu, Finland; (E.H.); (A.D.); (A.K.); (V.P.); (J.K.); (T.P.)
| | - Alexandra Naba
- Department of Physiology and Biophysics, University of Illinois at Chicago, Chicago, IL 60612, USA;
- University of Illinois Cancer Center, Chicago, IL 60612, USA
| | - Valerio Izzi
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, FI-90014 Oulu, Finland; (E.H.); (A.D.); (A.K.); (V.P.); (J.K.); (T.P.)
- Faculty of Medicine, University of Oulu, FI-90014 Oulu, Finland
- Finnish Cancer Institute, 00130 Helsinki, Finland
| |
Collapse
|
15
|
Di Blasi R, Blyuss O, Timms JF, Conole D, Ceroni F, Whitwell HJ. Non-Histone Protein Methylation: Biological Significance and Bioengineering Potential. ACS Chem Biol 2021; 16:238-250. [PMID: 33411495 DOI: 10.1021/acschembio.0c00771] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Protein methylation is a key post-translational modification whose effects on gene expression have been intensively studied over the last two decades. Recently, renewed interest in non-histone protein methylation has gained momentum for its role in regulating important cellular processes and the activity of many proteins, including transcription factors, enzymes, and structural complexes. The extensive and dynamic role that protein methylation plays within the cell also highlights its potential for bioengineering applications. Indeed, while synthetic histone protein methylation has been extensively used to engineer gene expression, engineering of non-histone protein methylation has not been fully explored yet. Here, we report the latest findings, highlighting how non-histone protein methylation is fundamental for certain cellular functions and is implicated in disease, and review recent efforts in the engineering of protein methylation.
Collapse
Affiliation(s)
- Roberto Di Blasi
- Department of Chemical Engineering, Faculty of Engineering, Imperial College London, London, U.K
- Imperial College Centre for Synthetic Biology, Imperial College London, London, U.K
| | - Oleg Blyuss
- School of Physics, Astronomy and Mathematics, University of Hertfordshire, Hatfield, U.K
- Department of Paediatrics and Paediatric Infectious Diseases, Sechenov First Moscow State Medical University, Moscow, Russia
- Department of Applied Mathematics, Lobachevsky State University of Nizhny Novgorod, Nizhny Novgorod, Russia
| | - John F Timms
- Department of Women's Cancer, EGA Institute for Women's Health, University College London, London, U.K
| | - Daniel Conole
- Department of Chemistry, Molecular Sciences Research Hub, Imperial College London, London, U.K
| | - Francesca Ceroni
- Department of Chemical Engineering, Faculty of Engineering, Imperial College London, London, U.K
- Imperial College Centre for Synthetic Biology, Imperial College London, London, U.K
| | - Harry J Whitwell
- Department of Applied Mathematics, Lobachevsky State University of Nizhny Novgorod, Nizhny Novgorod, Russia
- National Phenome Centre and Imperial Clinical Phenotyping Centre, Department of Metabolism, Digestion and Reproduction, IRDB Building, Imperial College London, Hammersmith Campus, London, W12 0NN, U.K
- Division of Systems Medicine, Department of Metabolism, Digestion and Reproduction, Sir Alexander Fleming Building, Imperial College London, Hammersmith Campus, London, SW7 2AZ, U.K
| |
Collapse
|
16
|
Iqbal S, Pérez-Palma E, Jespersen JB, May P, Hoksza D, Heyne HO, Ahmed SS, Rifat ZT, Rahman MS, Lage K, Palotie A, Cottrell JR, Wagner FF, Daly MJ, Campbell AJ, Lal D. Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants. Proc Natl Acad Sci U S A 2020; 117:28201-28211. [PMID: 33106425 PMCID: PMC7668189 DOI: 10.1073/pnas.2002660117] [Citation(s) in RCA: 67] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Interpretation of the colossal number of genetic variants identified from sequencing applications is one of the major bottlenecks in clinical genetics, with the inference of the effect of amino acid-substituting missense variations on protein structure and function being especially challenging. Here we characterize the three-dimensional (3D) amino acid positions affected in pathogenic and population variants from 1,330 disease-associated genes using over 14,000 experimentally solved human protein structures. By measuring the statistical burden of variations (i.e., point mutations) from all genes on 40 3D protein features, accounting for the structural, chemical, and functional context of the variations' positions, we identify features that are generally associated with pathogenic and population missense variants. We then perform the same amino acid-level analysis individually for 24 protein functional classes, which reveals unique characteristics of the positions of the altered amino acids: We observe up to 46% divergence of the class-specific features from the general characteristics obtained by the analysis on all genes, which is consistent with the structural diversity of essential regions across different protein classes. We demonstrate that the function-specific 3D features of the variants match the readouts of mutagenesis experiments for BRCA1 and PTEN, and positively correlate with an independent set of clinically interpreted pathogenic and benign missense variants. Finally, we make our results available through a web server to foster accessibility and downstream research. Our findings represent a crucial step toward translational genetics, from highlighting the impact of mutations on protein structure to rationalizing the variants' pathogenicity in terms of the perturbed molecular mechanisms.
Collapse
Affiliation(s)
- Sumaiya Iqbal
- Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA 02142;
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, 02142
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114
| | - Eduardo Pérez-Palma
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195
| | - Jakob B Jespersen
- Department of Bio and Health Informatics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark
| | - Patrick May
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 4365 Esch-sur-Alzette, Luxembourg
| | - David Hoksza
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 4365 Esch-sur-Alzette, Luxembourg
- Department of Software Engineering, Faculty of Mathematics and Physics, Charles University, Prague 11636, Czech Republic
| | - Henrike O Heyne
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, 02142
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114
- Institute for Molecular Medicine Finland, University of Helsinki, 00100 Helsinki, Finland
| | - Shehab S Ahmed
- Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| | - Zaara T Rifat
- Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| | - M Sohel Rahman
- Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| | - Kasper Lage
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, 02142
- Department of Surgery, Massachusetts General Hospital, Boston, MA 02114
| | - Aarno Palotie
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, 02142
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142
- Institute for Molecular Medicine Finland, University of Helsinki, 00100 Helsinki, Finland
| | - Jeffrey R Cottrell
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, 02142
| | - Florence F Wagner
- Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA 02142
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, 02142
| | - Mark J Daly
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, 02142
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114
- Institute for Molecular Medicine Finland, University of Helsinki, 00100 Helsinki, Finland
| | - Arthur J Campbell
- Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA 02142;
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, 02142
| | - Dennis Lal
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, 02142;
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195
- Cologne Center for Genomics, University of Cologne, 50931 Cologne, Germany
- Epilepsy Center, Neurological Institute, Cleveland Clinic, Cleveland, OH 44195
| |
Collapse
|
17
|
Aggarwal S, Banerjee SK, Talukdar NC, Yadav AK. Post-translational Modification Crosstalk and Hotspots in Sirtuin Interactors Implicated in Cardiovascular Diseases. Front Genet 2020; 11:356. [PMID: 32425973 PMCID: PMC7204943 DOI: 10.3389/fgene.2020.00356] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 03/24/2020] [Indexed: 01/07/2023] Open
Abstract
Sirtuins are protein deacetylases that play a protective role in cardiovascular diseases (CVDs), as well as many other diseases. Absence of sirtuins can lead to hyperacetylation of both nuclear and mitochondrial proteins leading to metabolic dysregulation. The protein post-translational modifications (PTMs) are known to crosstalk among each other to bring about complex phenotypic outcomes. Various PTM types such as acetylation, ubiquitination, and phosphorylation, and so on, drive transcriptional regulation and metabolism, but such crosstalks are poorly understood. We integrated protein–protein interactions (PPI) and PTMs from several databases to integrate information on 1,251 sirtuin-interacting proteins, of which 544 are associated with cardiac diseases. Based on the ∼100,000 PTM sites obtained for sirtuin interactors, we observed that the frequency of PTM sites (83 per protein), as well as PTM types (five per protein), is higher than the global average for human proteome. We found that ∼60–70% PTM sites fall into ordered regions. Approximately 83% of the sirtuin interactors contained at least one competitive crosstalk (in situ) site, with half of the sites occurring in CVD-associated proteins. A large proportion of identified crosstalk sites were observed for acetylation and ubiquitination competition. We identified 614 proteins containing PTM hotspots (≥5 PTM sites) and 133 proteins containing crosstalk hotspots (≥3 crosstalk sites). We observed that a large proportion of disease-associated sequence variants were found in PTM motifs of CVD proteins. We identified seven proteins (TP53, LMNA, MAPT, ATP2A2, NCL, APEX1, and HIST1H3A) containing disease-associated variants in PTM and crosstalk hotspots. This is the first comprehensive bioinformatics analysis on sirtuin interactors with respect to PTMs and their crosstalks. This study forms a platform for generating interesting hypotheses that can be tested for a deeper mechanistic understanding gained or derived from big-data analytics.
Collapse
Affiliation(s)
- Suruchi Aggarwal
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad, India.,Division of Life Sciences, Institute of Advanced Study in Science and Technology, Guwahati, India.,Department of Molecular Biology and Biotechnology, Cotton University, Guwahati, India
| | - Sanjay K Banerjee
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad, India
| | - Narayan Chandra Talukdar
- Division of Life Sciences, Institute of Advanced Study in Science and Technology, Guwahati, India.,Department of Molecular Biology and Biotechnology, Cotton University, Guwahati, India
| | - Amit Kumar Yadav
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad, India
| |
Collapse
|
18
|
Schneider K, Adams CE, Elmer KR. Parallel selection on ecologically relevant gene functions in the transcriptomes of highly diversifying salmonids. BMC Genomics 2019; 20:1010. [PMID: 31870285 PMCID: PMC6929470 DOI: 10.1186/s12864-019-6361-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2019] [Accepted: 12/01/2019] [Indexed: 12/11/2022] Open
Abstract
Background Salmonid fishes are characterised by a very high level of variation in trophic, ecological, physiological, and life history adaptations. Some salmonid taxa show exceptional potential for fast, within-lake diversification into morphologically and ecologically distinct variants, often in parallel; these are the lake-resident charr and whitefish (several species in the genera Salvelinus and Coregonus). To identify selection on genes and gene categories associated with such predictable diversifications, we analysed 2702 orthogroups (4.82 Mbp total; average 4.77 genes/orthogroup; average 1783 bp/orthogroup). We did so in two charr and two whitefish species and compared to five other salmonid lineages, which do not evolve in such ecologically predictable ways, and one non-salmonid outgroup. Results All selection analyses are based on Coregonus and Salvelinus compared to non-diversifying taxa. We found more orthogroups were affected by relaxed selection than intensified selection. Of those, 122 were under significant relaxed selection, with trends of an overrepresentation of serine family amino acid metabolism and transcriptional regulation, and significant enrichment of behaviour-associated gene functions. Seventy-eight orthogroups were under significant intensified selection and were enriched for signalling process and transcriptional regulation gene ontology terms and actin filament and lipid metabolism gene sets. Ninety-two orthogroups were under diversifying/positive selection. These were enriched for signal transduction, transmembrane transport, and pyruvate metabolism gene ontology terms and often contained genes involved in transcriptional regulation and development. Several orthogroups showed signs of multiple types of selection. For example, orthogroups under relaxed and diversifying selection contained genes such as ap1m2, involved in immunity and development, and slc6a8, playing an important role in muscle and brain creatine uptake. Orthogroups under intensified and diversifying selection were also found, such as genes syn3, with a role in neural processes, and ctsk, involved in bone remodelling. Conclusions Our approach pinpointed relevant genomic targets by distinguishing among different kinds of selection. We found that relaxed, intensified, and diversifying selection affect orthogroups and gene functions of ecological relevance in salmonids. Because they were found consistently and robustly across charr and whitefish and not other salmonid lineages, we propose these genes have a potential role in the replicated ecological diversifications.
Collapse
Affiliation(s)
- Kevin Schneider
- Institute of Biodiversity, Animal Health & Comparative Medicine, College of Medical, Veterinary & Life Sciences, University of Glasgow, Glasgow, G12 8QQ, UK
| | - Colin E Adams
- Institute of Biodiversity, Animal Health & Comparative Medicine, College of Medical, Veterinary & Life Sciences, University of Glasgow, Glasgow, G12 8QQ, UK.,Scottish Centre for Ecology and the Natural Environment, University of Glasgow, Rowardennan, G63 0AW, UK
| | - Kathryn R Elmer
- Institute of Biodiversity, Animal Health & Comparative Medicine, College of Medical, Veterinary & Life Sciences, University of Glasgow, Glasgow, G12 8QQ, UK.
| |
Collapse
|
19
|
Fert-Bober J, Murray CI, Parker SJ, Van Eyk JE. Precision Profiling of the Cardiovascular Post-Translationally Modified Proteome: Where There Is a Will, There Is a Way. Circ Res 2019; 122:1221-1237. [PMID: 29700069 DOI: 10.1161/circresaha.118.310966] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
There is an exponential increase in biological complexity as initial gene transcripts are spliced, translated into amino acid sequence, and post-translationally modified. Each protein can exist as multiple chemical or sequence-specific proteoforms, and each has the potential to be a critical mediator of a physiological or pathophysiological signaling cascade. Here, we provide an overview of how different proteoforms come about in biological systems and how they are most commonly measured using mass spectrometry-based proteomics and bioinformatics. Our goal is to present this information at a level accessible to every scientist interested in mass spectrometry and its application to proteome profiling. We will specifically discuss recent data linking various protein post-translational modifications to cardiovascular disease and conclude with a discussion for enablement and democratization of proteomics across the cardiovascular and scientific community. The aim is to inform and inspire the readership to explore a larger breadth of proteoform, particularity post-translational modifications, related to their particular areas of expertise in cardiovascular physiology.
Collapse
Affiliation(s)
- Justyna Fert-Bober
- From the Advanced Clinical BioSystems Research Institute, Smidt Heart Institute, Department of Medicine, Cedars Sinai Medical Center, Los Angeles, CA
| | - Christopher I Murray
- From the Advanced Clinical BioSystems Research Institute, Smidt Heart Institute, Department of Medicine, Cedars Sinai Medical Center, Los Angeles, CA
| | - Sarah J Parker
- From the Advanced Clinical BioSystems Research Institute, Smidt Heart Institute, Department of Medicine, Cedars Sinai Medical Center, Los Angeles, CA.
| | - Jennifer E Van Eyk
- From the Advanced Clinical BioSystems Research Institute, Smidt Heart Institute, Department of Medicine, Cedars Sinai Medical Center, Los Angeles, CA
| |
Collapse
|
20
|
Krassowski M, Paczkowska M, Cullion K, Huang T, Dzneladze I, Ouellette BFF, Yamada JT, Fradet-Turcotte A, Reimand J. ActiveDriverDB: human disease mutations and genome variation in post-translational modification sites of proteins. Nucleic Acids Res 2019; 46:D901-D910. [PMID: 29126202 PMCID: PMC5753267 DOI: 10.1093/nar/gkx973] [Citation(s) in RCA: 60] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Accepted: 10/18/2017] [Indexed: 12/18/2022] Open
Abstract
Interpretation of genetic variation is needed for deciphering genotype-phenotype associations, mechanisms of inherited disease, and cancer driver mutations. Millions of single nucleotide variants (SNVs) in human genomes are known and thousands are associated with disease. An estimated 21% of disease-associated amino acid substitutions corresponding to missense SNVs are located in protein sites of post-translational modifications (PTMs), chemical modifications of amino acids that extend protein function. ActiveDriverDB is a comprehensive human proteo-genomics database that annotates disease mutations and population variants through the lens of PTMs. We integrated >385,000 published PTM sites with ∼3.6 million substitutions from The Cancer Genome Atlas (TCGA), the ClinVar database of disease genes, and human genome sequencing projects. The database includes site-specific interaction networks of proteins, upstream enzymes such as kinases, and drugs targeting these enzymes. We also predicted network-rewiring impact of mutations by analyzing gains and losses of kinase-bound sequence motifs. ActiveDriverDB provides detailed visualization, filtering, browsing and searching options for studying PTM-associated mutations. Users can upload mutation datasets interactively and use our application programming interface in pipelines. Integrative analysis of mutations and PTMs may help decipher molecular mechanisms of phenotypes and disease, as exemplified by case studies of TP53, BRCA2 and VHL. The open-source database is available at https://www.ActiveDriverDB.org.
Collapse
Affiliation(s)
- Michal Krassowski
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada.,Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, Warsaw, Poland
| | - Marta Paczkowska
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | - Kim Cullion
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | - Tina Huang
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | - Irakli Dzneladze
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - B F Francis Ouellette
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada.,Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, Canada
| | - Joseph T Yamada
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | - Amelie Fradet-Turcotte
- Department of Molecular Biology, Medical Biochemistry and Pathology, Université Laval, Québec, Québec, Canada
| | - Jüri Reimand
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
21
|
Lyon KF, Cai X, Young RJ, Mamun AA, Rajasekaran S, Schiller MR. Minimotif Miner 4: a million peptide minimotifs and counting. Nucleic Acids Res 2019; 46:D465-D470. [PMID: 29140456 PMCID: PMC5753208 DOI: 10.1093/nar/gkx1085] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2017] [Accepted: 11/09/2017] [Indexed: 12/27/2022] Open
Abstract
Minimotif Miner (MnM) is a database and web system for analyzing short functional peptide motifs, termed minimotifs. We present an update to MnM growing the database from ∼300 000 to >1 000 000 minimotif consensus sequences and instances. This growth comes largely from updating data from existing databases and annotation of articles with high-throughput approaches analyzing different types of post-translational modifications. Another update is mapping human proteins and their minimotifs to know human variants from the dbSNP, build 150. Now MnM 4 can be used to generate mechanistic hypotheses about how human genetic variation affect minimotifs and outcomes. One example of the utility of the combined minimotif/SNP tool identifies a loss of function missense SNP in a ubiquitylation minimotif encoded in the excision repair cross-complementing 2 (ERCC2) nucleotide excision repair gene. This SNP reaches genome wide significance for many types of cancer and the variant identified with MnM 4 reveals a more detailed mechanistic hypothesis concerning the role of ERCC2 in cancer. Other updates to the web system include a new architecture with migration of the web system and database to Docker containers for better performance and management. Weblinks:minimotifminer.org and mnm.engr.uconn.edu
Collapse
Affiliation(s)
- Kenneth F Lyon
- Nevada Institute of Personalized Medicine and School of Life Sciences, University of Nevada, Las Vegas, 89154 4004 NV, USA
| | - Xingyu Cai
- Department of Computer Science and Engineering, University of Connecticut, Storrs, CT 06269 2155, USA
| | - Richard J Young
- Nevada Institute of Personalized Medicine and School of Life Sciences, University of Nevada, Las Vegas, 89154 4004 NV, USA
| | - Abdullah-Al Mamun
- Department of Computer Science and Engineering, University of Connecticut, Storrs, CT 06269 2155, USA
| | - Sanguthevar Rajasekaran
- Department of Computer Science and Engineering, University of Connecticut, Storrs, CT 06269 2155, USA
| | - Martin R Schiller
- Nevada Institute of Personalized Medicine and School of Life Sciences, University of Nevada, Las Vegas, 89154 4004 NV, USA
| |
Collapse
|
22
|
Kopra K, Tong-Ochoa N, Laine M, Eskonen V, Koskinen PJ, Härmä H. Homogeneous peptide-break assay for luminescent detection of enzymatic protein post-translational modification activity utilizing charged peptides. Anal Chim Acta 2019; 1055:126-132. [PMID: 30782363 DOI: 10.1016/j.aca.2018.12.041] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2018] [Accepted: 12/18/2018] [Indexed: 11/18/2022]
Abstract
We have developed a rapid and sensitive universal peptide-based time-resolved luminescence assay for detection of enzymatic post-translational modifications (PTMs). PTMs play essential roles in intracellular signaling and cell regulation, thus providing functional protein diversity in cell. Due this, impaired PTM patterns have been linked to multiple disease states. Clear link between PTMs and pathological conditions have also driven assay development further, but still today most of the methodologies are based on single-specificity or group-specific PTM-recognition. We have previously introduced leuzine-zipper based peptide-break technology as a viable option for universal PTM detection. Here, we introduce peptide-break technology utilizing single-label homogeneous quenching resonance energy transfer (QRET) and charge-based peptide-peptide interaction. We demonstrate the functionality of the new assay concept in phosphorylation, deacetylation, and citrullination. In a comparable study between previously introduced leucine-zipper and the novel charge-based approach, we found equal PTM detection performance and sensitivity, but the peptide design for new targets is simplified with the charged peptides. The new concept allows the use of short <20 amino acid peptides without limitations rising from the leucine-zipper coiled-coil structure. Introduced methodology enables wash-free PTM detection in a 384-well plate format, using low nanomolar enzyme concentrations. Potentially, the peptide-break technique using charged peptides may be applicable for natural peptide sequences directly obtained from the target protein.
Collapse
Affiliation(s)
- Kari Kopra
- Materials Chemistry and Chemical Analysis, University of Turku, Vatselankatu 2, 20500, Turku, Finland.
| | - Natalia Tong-Ochoa
- Materials Chemistry and Chemical Analysis, University of Turku, Vatselankatu 2, 20500, Turku, Finland
| | - Mari Laine
- Materials Chemistry and Chemical Analysis, University of Turku, Vatselankatu 2, 20500, Turku, Finland
| | - Ville Eskonen
- Materials Chemistry and Chemical Analysis, University of Turku, Vatselankatu 2, 20500, Turku, Finland
| | - Päivi J Koskinen
- Section of Physiology and Genetics, Department of Biology, University of Turku, Vesilinnantie 5, Turku, Finland
| | - Harri Härmä
- Materials Chemistry and Chemical Analysis, University of Turku, Vatselankatu 2, 20500, Turku, Finland
| |
Collapse
|
23
|
Improved measures for evolutionary conservation that exploit taxonomy distances. Nat Commun 2019; 10:1556. [PMID: 30952844 PMCID: PMC6450959 DOI: 10.1038/s41467-019-09583-2] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2018] [Accepted: 03/19/2019] [Indexed: 11/30/2022] Open
Abstract
Selective pressures on protein-coding regions that provide fitness advantages can lead to the regions' fixation and conservation in genome duplications and speciation events. Consequently, conservation analyses relying on sequence similarities are exploited by a myriad of applications across all biosciences to identify functionally important protein regions. While very potent, existing conservation measures based on multiple sequence alignments are so pervasive that improvements to solutions of many problems have become incremental. We introduce a new framework for evolutionary conservation with measures that exploit taxonomy distances across species. Results show that our taxonomy-based framework comfortably outperforms existing conservation measures in identifying deleterious variants observed in the human population, including variants located in non-abundant sequence domains such as intrinsically disordered regions. The predictive power of our approach emphasizes that the phenotypic effects of sequence variants can be taxonomy-level specific and thus, conservation needs to be interpreted accordingly. Information on protein sequence variability and conservation can be leveraged to identify functionally important regions. Here, the authors develop new conservation measures that exploit taxonomy distances and LIST, a tool for predicting deleteriousness of human variants.
Collapse
|
24
|
Lambrughi M, Tiberti M, Allega MF, Sora V, Nygaard M, Toth A, Salamanca Viloria J, Bignon E, Papaleo E. Analyzing Biomolecular Ensembles. Methods Mol Biol 2019; 2022:415-451. [PMID: 31396914 DOI: 10.1007/978-1-4939-9608-7_18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Several techniques are available to generate conformational ensembles of proteins and other biomolecules either experimentally or computationally. These methods produce a large amount of data that need to be analyzed to identify structure-dynamics-function relationship. In this chapter, we will cover different tools to unveil the information hidden in conformational ensemble data and to guide toward the rationalization of the data. We included routinely used approaches such as dimensionality reduction, as well as new methods inspired by high-order statistics and graph theory.
Collapse
Affiliation(s)
- Matteo Lambrughi
- Computational Biology Laboratory, Danish Cancer Society Research Center, Copenhagen, Denmark
| | - Matteo Tiberti
- Computational Biology Laboratory, Danish Cancer Society Research Center, Copenhagen, Denmark
| | - Maria Francesca Allega
- Computational Biology Laboratory, Danish Cancer Society Research Center, Copenhagen, Denmark
| | - Valentina Sora
- Computational Biology Laboratory, Danish Cancer Society Research Center, Copenhagen, Denmark
| | - Mads Nygaard
- Computational Biology Laboratory, Danish Cancer Society Research Center, Copenhagen, Denmark
| | - Agota Toth
- Computational Biology Laboratory, Danish Cancer Society Research Center, Copenhagen, Denmark
| | - Juan Salamanca Viloria
- Computational Biology Laboratory, Danish Cancer Society Research Center, Copenhagen, Denmark
| | - Emmanuelle Bignon
- Computational Biology Laboratory, Danish Cancer Society Research Center, Copenhagen, Denmark
| | - Elena Papaleo
- Computational Biology Laboratory, Danish Cancer Society Research Center, Copenhagen, Denmark.
| |
Collapse
|
25
|
Wagih O, Galardini M, Busby BP, Memon D, Typas A, Beltrao P. A resource of variant effect predictions of single nucleotide variants in model organisms. Mol Syst Biol 2018; 14:e8430. [PMID: 30573687 PMCID: PMC6301329 DOI: 10.15252/msb.20188430] [Citation(s) in RCA: 56] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2018] [Revised: 11/19/2018] [Accepted: 11/21/2018] [Indexed: 12/18/2022] Open
Abstract
The effect of single nucleotide variants (SNVs) in coding and noncoding regions is of great interest in genetics. Although many computational methods aim to elucidate the effects of SNVs on cellular mechanisms, it is not straightforward to comprehensively cover different molecular effects. To address this, we compiled and benchmarked sequence and structure-based variant effect predictors and we computed the impact of nearly all possible amino acid and nucleotide variants in the reference genomes of Homo sapiens, Saccharomyces cerevisiae and Escherichia coli Studied mechanisms include protein stability, interaction interfaces, post-translational modifications and transcription factor binding sites. We apply this resource to the study of natural and disease coding variants. We also show how variant effects can be aggregated to generate protein complex burden scores that uncover protein complex to phenotype associations based on a set of newly generated growth profiles of 93 sequenced S. cerevisiae strains in 43 conditions. This resource is available through mutfunc (www.mutfunc.com), a tool by which users can query precomputed predictions by providing amino acid or nucleotide-level variants.
Collapse
Affiliation(s)
- Omar Wagih
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK
| | - Marco Galardini
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK
| | - Bede P Busby
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Danish Memon
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK
| | - Athanasios Typas
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Pedro Beltrao
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK
| |
Collapse
|
26
|
Xiang Y, Ye Y, Zhang Z, Han L. Maximizing the Utility of Cancer Transcriptomic Data. Trends Cancer 2018; 4:823-837. [PMID: 30470304 DOI: 10.1016/j.trecan.2018.09.009] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Revised: 09/23/2018] [Accepted: 09/24/2018] [Indexed: 12/13/2022]
Abstract
Transcriptomic profiling has been applied to large numbers of cancer samples, by large-scale consortia, including The Cancer Genome Atlas, International Cancer Genome Consortium, and Cancer Cell Line Encyclopedia. Advances in mining cancer transcriptomic data enable us to understand the endless complexity of the cancer transcriptome and thereby to discover new biomarkers and therapeutic targets. In this paper, we review computational resources for deep mining of transcriptomic data to identify, quantify, and determine the functional effects and clinical utility of transcriptomic events, including noncoding RNAs, post-transcriptional regulation, exogenous RNAs, and transcribed genetic variants. These approaches can be applied to other complex diseases, thereby greatly leveraging the impact of this work.
Collapse
Affiliation(s)
- Yu Xiang
- Department of Biochemistry and Molecular Biology, McGovern Medical School at The University of Texas Health Science Center at Houston, Houston, TX 77030, USA; These authors contributed equally
| | - Youqiong Ye
- Department of Biochemistry and Molecular Biology, McGovern Medical School at The University of Texas Health Science Center at Houston, Houston, TX 77030, USA; These authors contributed equally
| | - Zhao Zhang
- Department of Biochemistry and Molecular Biology, McGovern Medical School at The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Leng Han
- Department of Biochemistry and Molecular Biology, McGovern Medical School at The University of Texas Health Science Center at Houston, Houston, TX 77030, USA; Center for Precision Health, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA.
| |
Collapse
|
27
|
Engelmann BW, Hsiao CJ, Blischak JD, Fourne Y, Khan Z, Ford M, Gilad Y. A Methodological Assessment and Characterization of Genetically-Driven Variation in Three Human Phosphoproteomes. Sci Rep 2018; 8:12106. [PMID: 30108239 PMCID: PMC6092387 DOI: 10.1038/s41598-018-30587-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Accepted: 07/17/2018] [Indexed: 11/12/2022] Open
Abstract
Phosphorylation of proteins on serine, threonine, and tyrosine residues is a ubiquitous post-translational modification that plays a key part of essentially every cell signaling process. It is reasonable to assume that inter-individual variation in protein phosphorylation may underlie phenotypic differences, as has been observed for practically any other molecular regulatory phenotype. However, we do not know much about the extent of inter-individual variation in phosphorylation because it is quite challenging to perform a quantitative high throughput study to assess inter-individual variation in any post-translational modification. To test our ability to address this challenge with SILAC-based mass spectrometry, we quantified phosphorylation levels for three genotyped human cell lines within a nested experimental framework, and found that genetic background is the primary determinant of phosphoproteome variation. We uncovered multiple functional, biophysical, and genetic associations with germline driven phosphopeptide variation. Variants affecting protein levels or structure were among these associations, with the latter presenting, on average, a stronger effect. Interestingly, we found evidence that is consistent with a phosphopeptide variability buffering effect endowed from properties enriched within longer proteins. Because the small sample size in this 'pilot' study may limit the applicability of our genetic observations, we also undertook a thorough technical assessment of our experimental workflow to aid further efforts. Taken together, these results provide the foundation for future work to characterize inter-individual variation in post-translational modification levels and reveal novel insights into the nature of inter-individual variation in phosphorylation.
Collapse
Affiliation(s)
- Brett W Engelmann
- Department of Human Genetics, University of Chicago, Chicago, Illinois, USA.
- AbbVie, North Chicago, Illinois, USA.
| | | | - John D Blischak
- Department of Human Genetics, University of Chicago, Chicago, Illinois, USA
| | - Yannick Fourne
- Department of Human Genetics, University of Chicago, Chicago, Illinois, USA
| | - Zia Khan
- Department of Human Genetics, University of Chicago, Chicago, Illinois, USA
- Genentech, South San Francisco, California, USA
| | - Michael Ford
- MS Bioworks, LLC, 3950, Varsity Drive, Ann Arbor, Michigan, USA
| | - Yoav Gilad
- Department of Human Genetics, University of Chicago, Chicago, Illinois, USA.
- Department of Medicine, University of Chicago, Chicago, Illinois, USA.
| |
Collapse
|
28
|
Sharma S, Young RJ, Chen J, Chen X, Oh EC, Schiller MR. Minimotifs dysfunction is pervasive in neurodegenerative disorders. ALZHEIMER'S & DEMENTIA (NEW YORK, N. Y.) 2018; 4:414-432. [PMID: 30225339 PMCID: PMC6139474 DOI: 10.1016/j.trci.2018.06.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Minimotifs are modular contiguous peptide sequences in proteins that are important for posttranslational modifications, binding to other molecules, and trafficking to specific subcellular compartments. Some molecular functions of proteins in cellular pathways can be predicted from minimotif consensus sequences identified through experimentation. While a role for minimotifs in regulating signal transduction and gene regulation during disease pathogenesis (such as infectious diseases and cancer) is established, the therapeutic use of minimotif mimetic drugs is limited. In this review, we discuss a general theme identifying a pervasive role of minimotifs in the pathomechanism of neurodegenerative diseases. Beyond their longstanding history in the genetics of familial neurodegeneration, minimotifs are also major players in neurotoxic protein aggregation, aberrant protein trafficking, and epigenetic regulation. Generalizing the importance of minimotifs in neurodegenerative diseases offers a new perspective for the future study of neurodegenerative mechanisms and the investigation of new therapeutics.
Collapse
Affiliation(s)
- Surbhi Sharma
- Nevada Institute of Personalized Medicine, Las Vegas, NV, USA
- School of Life Sciences, Las Vegas, NV, USA
| | - Richard J. Young
- Nevada Institute of Personalized Medicine, Las Vegas, NV, USA
- School of Life Sciences, Las Vegas, NV, USA
| | - Jingchun Chen
- Nevada Institute of Personalized Medicine, Las Vegas, NV, USA
| | - Xiangning Chen
- Nevada Institute of Personalized Medicine, Las Vegas, NV, USA
- Department of Psychology, Las Vegas, NV, USA
| | - Edwin C. Oh
- Nevada Institute of Personalized Medicine, Las Vegas, NV, USA
- School of Medicine, Las Vegas, NV, USA
| | - Martin R. Schiller
- Nevada Institute of Personalized Medicine, Las Vegas, NV, USA
- School of Life Sciences, Las Vegas, NV, USA
- School of Medicine, Las Vegas, NV, USA
| |
Collapse
|
29
|
Shi S, Wang L, Cao M, Chen G, Yu J. Proteomic analysis and prediction of amino acid variations that influence protein posttranslational modifications. Brief Bioinform 2018; 20:1597-1606. [DOI: 10.1093/bib/bby036] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2018] [Revised: 03/07/2018] [Indexed: 12/18/2022] Open
Abstract
Abstract
Accumulative studies have indicated that amino acid variations through changing the type of residues of the target sites or key flanking residues could directly or indirectly influence protein posttranslational modifications (PTMs) and bring about a detrimental effect on protein function. Computational mutation analysis can greatly narrow down the efforts on experimental work. To increase the utilization of current computational resources, we first provide an overview of computational prediction of amino acid variations that influence protein PTMs and their functional analysis. We also discuss the challenges that are faced while developing novel in silico approaches in the future. The development of better methods for mutation analysis-related protein PTMs will help to facilitate the development of personalized precision medicine.
Collapse
Affiliation(s)
- Shaoping Shi
- Department of Mathematics and Numerical Simulation and High-Performance Computing Laboratory, School of Sciences, Nanchang University, Nanchang, Jiangxi 330031, China
| | - Lina Wang
- Department of Science, Nanchang Institute of Technology, Nanchang, Jiangxi 330031, China
| | - Man Cao
- Department of Mathematics, School of Sciences, Nanchang University, Nanchang, Jiangxi 330031, China
| | - Guodong Chen
- Department of Mathematics, School of Sciences, Nanchang University, Nanchang, Jiangxi 330031, China
| | - Jialin Yu
- Department of Mathematics, School of Sciences, Nanchang University, Nanchang, Jiangxi 330031, China
| |
Collapse
|
30
|
Rathore D, Faustino A, Schiel J, Pang E, Boyne M, Rogstad S. The role of mass spectrometry in the characterization of biologic protein products. Expert Rev Proteomics 2018; 15:431-449. [DOI: 10.1080/14789450.2018.1469982] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Affiliation(s)
- Deepali Rathore
- Division of Pharmaceutical Analysis, Office of Testing and Research, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD, USA
- Center for Biomedical Mass Spectrometry Research, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Anneliese Faustino
- Division of Pharmaceutical Analysis, Office of Testing and Research, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD, USA
| | - John Schiel
- Biomolecular Measurement Division, National Institute of Standards and Technology, Institute for Bioscience and Biotechnology Research, Rockville, MD, USA
| | - Eric Pang
- Office of Lifecycle Drug Products, Office of Pharmaceutical Quality, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD, USA
| | - Michael Boyne
- Division of Pharmaceutical Analysis, Office of Testing and Research, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD, USA
- COUR Pharmaceuticals Development Company, Northbrook, IL, USA
| | - Sarah Rogstad
- Division of Pharmaceutical Analysis, Office of Testing and Research, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD, USA
| |
Collapse
|
31
|
Taipale M. Disruption of protein function by pathogenic mutations: common and uncommon mechanisms 1. Biochem Cell Biol 2018; 97:46-57. [PMID: 29693415 DOI: 10.1139/bcb-2018-0007] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Mutations in protein-coding regions underlie almost all Mendelian disorders, drive tumorigenesis, and contribute to susceptibility to common diseases. Despite the great diversity of diseases that are caused by coding mutations, the cellular processes that affect, and are affected by, pathogenic variants at the molecular level are fundamentally conserved. Experimental and computational approaches have revealed that a substantial fraction of disease mutations are not simple loss-of-function alleles. Rather, these pathogenic variants disrupt protein function in more subtle ways by tuning protein folding pathways, altering subcellular trafficking, interrupting signaling cascades, and rewiring highly connected interaction networks. Focusing mainly on Mendelian disorders, this review discusses the common mechanisms by which deleterious mutations disrupt protein function and how these disruptions can be exploited in the development of novel therapies.
Collapse
Affiliation(s)
- Mikko Taipale
- a Donnelly Centre, Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 3E1, Canada.,b Molecular Architecture of Life Program, Canadian Institute for Advanced Research, Toronto, ON M5S 1M1, Canada
| |
Collapse
|
32
|
Huang KL, Mashl RJ, Wu Y, Ritter DI, Wang J, Oh C, Paczkowska M, Reynolds S, Wyczalkowski MA, Oak N, Scott AD, Krassowski M, Cherniack AD, Houlahan KE, Jayasinghe R, Wang LB, Zhou DC, Liu D, Cao S, Kim YW, Koire A, McMichael JF, Hucthagowder V, Kim TB, Hahn A, Wang C, McLellan MD, Al-Mulla F, Johnson KJ, Lichtarge O, Boutros PC, Raphael B, Lazar AJ, Zhang W, Wendl MC, Govindan R, Jain S, Wheeler D, Kulkarni S, Dipersio JF, Reimand J, Meric-Bernstam F, Chen K, Shmulevich I, Plon SE, Chen F, Ding L. Pathogenic Germline Variants in 10,389 Adult Cancers. Cell 2018; 173:355-370.e14. [PMID: 29625052 PMCID: PMC5949147 DOI: 10.1016/j.cell.2018.03.039] [Citation(s) in RCA: 579] [Impact Index Per Article: 82.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2017] [Revised: 02/24/2018] [Accepted: 03/15/2018] [Indexed: 12/20/2022]
Abstract
We conducted the largest investigation of predisposition variants in cancer to date, discovering 853 pathogenic or likely pathogenic variants in 8% of 10,389 cases from 33 cancer types. Twenty-one genes showed single or cross-cancer associations, including novel associations of SDHA in melanoma and PALB2 in stomach adenocarcinoma. The 659 predisposition variants and 18 additional large deletions in tumor suppressors, including ATM, BRCA1, and NF1, showed low gene expression and frequent (43%) loss of heterozygosity or biallelic two-hit events. We also discovered 33 such variants in oncogenes, including missenses in MET, RET, and PTPN11 associated with high gene expression. We nominated 47 additional predisposition variants from prioritized VUSs supported by multiple evidences involving case-control frequency, loss of heterozygosity, expression effect, and co-localization with mutations and modified residues. Our integrative approach links rare predisposition variants to functional consequences, informing future guidelines of variant classification and germline genetic testing in cancer.
Collapse
Affiliation(s)
- Kuan-Lin Huang
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA; McDonnell Genome Institute, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - R Jay Mashl
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA; McDonnell Genome Institute, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - Yige Wu
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA; McDonnell Genome Institute, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - Deborah I Ritter
- Baylor College of Medicine and Texas Children's Hospital, Houston, TX, USA
| | - Jiayin Wang
- School of Management, Xi'an Jiaotong University, Xi'an, Shanxi, China
| | - Clara Oh
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - Marta Paczkowska
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | | | - Matthew A Wyczalkowski
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA; McDonnell Genome Institute, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - Ninad Oak
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Adam D Scott
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA; McDonnell Genome Institute, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - Michal Krassowski
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | | | - Kathleen E Houlahan
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada; Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Reyka Jayasinghe
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA; McDonnell Genome Institute, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - Liang-Bo Wang
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA; McDonnell Genome Institute, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - Daniel Cui Zhou
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA; McDonnell Genome Institute, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - Di Liu
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - Song Cao
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA; McDonnell Genome Institute, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - Young Won Kim
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Amanda Koire
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Joshua F McMichael
- McDonnell Genome Institute, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | | | - Tae-Beom Kim
- Departments of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Abigail Hahn
- Institute for Systems Biology, Seattle, WA 98109, USA
| | - Chen Wang
- Department of Health Sciences Research and Department of Obstetrics and Gynecology, Mayo Clinic College of Medicine, Rochester, MN 55905 USA
| | - Michael D McLellan
- McDonnell Genome Institute, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - Fahd Al-Mulla
- Dasman Diabetes Institute and Molecular Pathology Laboratory, Kuwait University, Kuwait
| | - Kimberly J Johnson
- Brown School Master of Public Health Program, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - Olivier Lichtarge
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Paul C Boutros
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada; Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Benjamin Raphael
- Lewis-Sigler Institute, Princeton University, Princeton, NJ 08544, USA
| | - Alexander J Lazar
- Departments of Pathology and Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Wei Zhang
- Department of Cancer Biology and Center for Genomics and Personalized Medicine Research, Wake Forest School of Medicine, Winston Salem, NC 27157 USA
| | - Michael C Wendl
- McDonnell Genome Institute, Washington University in St. Louis, Saint Louis, MO 63108, USA; Department of Genetics, Washington University in St. Louis, Saint Louis, MO 63108, USA; Department of Mathematics, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - Ramaswamy Govindan
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - Sanjay Jain
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - David Wheeler
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Shashikant Kulkarni
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Baylor Genetics, Houston, TX 77021, USA
| | - John F Dipersio
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA; Siteman Cancer Center, Washington University in St. Louis, Saint Louis, MO 63108, USA
| | - Jüri Reimand
- Computational Biology Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada; Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Funda Meric-Bernstam
- Department of Investigational Cancer Therapeutics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Ken Chen
- Departments of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | | | - Sharon E Plon
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Feng Chen
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA; Siteman Cancer Center, Washington University in St. Louis, Saint Louis, MO 63108, USA.
| | - Li Ding
- Department of Medicine, Washington University in St. Louis, Saint Louis, MO 63108, USA; McDonnell Genome Institute, Washington University in St. Louis, Saint Louis, MO 63108, USA; Department of Genetics, Washington University in St. Louis, Saint Louis, MO 63108, USA; Siteman Cancer Center, Washington University in St. Louis, Saint Louis, MO 63108, USA.
| |
Collapse
|
33
|
Ancien F, Pucci F, Godfroid M, Rooman M. Prediction and interpretation of deleterious coding variants in terms of protein structural stability. Sci Rep 2018. [PMID: 29540703 PMCID: PMC5852127 DOI: 10.1038/s41598-018-22531-2] [Citation(s) in RCA: 62] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
The classification of human genetic variants into deleterious and neutral is a challenging issue, whose complexity is rooted in the large variety of biophysical mechanisms that can be responsible for disease conditions. For non-synonymous mutations in structured proteins, one of these is the protein stability change, which can lead to loss of protein structure or function. We developed a stability-driven knowledge-based classifier that uses protein structure, artificial neural networks and solvent accessibility-dependent combinations of statistical potentials to predict whether destabilizing or stabilizing mutations are disease-causing. Our predictor yields a balanced accuracy of 71% in cross validation. As expected, it has a very high positive predictive value of 89%: it predicts with high accuracy the subset of mutations that are deleterious because of stability issues, but is by construction unable of classifying variants that are deleterious for other reasons. Its combination with an evolutionary-based predictor increases the balanced accuracy up to 75%, and allowed predicting more than 1/4 of the variants with 95% positive predictive value. Our method, called SNPMuSiC, can be used with both experimental and modeled structures and compares favorably with other prediction tools on several independent test sets. It constitutes a step towards interpreting variant effects at the molecular scale. SNPMuSiC is freely available at https://soft.dezyme.com/.
Collapse
Affiliation(s)
- François Ancien
- Department of BioModeling, BioInformatics & BioProcesses, Université Libre de Bruxelles (ULB), CP 165/61, Roosevelt Avenue 50, 1050, Brussels, Belgium. .,Interuniversity Institute of Bioinformatics in Brussels, ULB, CP 263, Triumph Bld, 1050, Brussels, Belgium.
| | - Fabrizio Pucci
- Department of BioModeling, BioInformatics & BioProcesses, Université Libre de Bruxelles (ULB), CP 165/61, Roosevelt Avenue 50, 1050, Brussels, Belgium. .,Interuniversity Institute of Bioinformatics in Brussels, ULB, CP 263, Triumph Bld, 1050, Brussels, Belgium.
| | - Maxime Godfroid
- Department of BioModeling, BioInformatics & BioProcesses, Université Libre de Bruxelles (ULB), CP 165/61, Roosevelt Avenue 50, 1050, Brussels, Belgium.,Institute of General Microbiology, Kiel University, Am Botanischen Garten 11, 24118, Kiel, Germany
| | - Marianne Rooman
- Department of BioModeling, BioInformatics & BioProcesses, Université Libre de Bruxelles (ULB), CP 165/61, Roosevelt Avenue 50, 1050, Brussels, Belgium. .,Interuniversity Institute of Bioinformatics in Brussels, ULB, CP 263, Triumph Bld, 1050, Brussels, Belgium.
| |
Collapse
|
34
|
Sivley RM, Dou X, Meiler J, Bush WS, Capra JA. Comprehensive Analysis of Constraint on the Spatial Distribution of Missense Variants in Human Protein Structures. Am J Hum Genet 2018; 102:415-426. [PMID: 29455857 PMCID: PMC5985282 DOI: 10.1016/j.ajhg.2018.01.017] [Citation(s) in RCA: 56] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2017] [Accepted: 01/24/2018] [Indexed: 11/17/2022] Open
Abstract
The spatial distribution of genetic variation within proteins is shaped by evolutionary constraint and provides insight into the functional importance of protein regions and the potential pathogenicity of protein alterations. Here, we comprehensively evaluate the 3D spatial patterns of human germline and somatic variation in 6,604 experimentally derived protein structures and 33,144 computationally derived homology models covering 77% of all human proteins. Using a systematic approach, we quantify differences in the spatial distributions of neutral germline variants, disease-causing germline variants, and recurrent somatic variants. Neutral missense variants exhibit a general trend toward spatial dispersion, which is driven by constraint on core residues. In contrast, germline disease-causing variants are generally clustered in protein structures and form clusters more frequently than recurrent somatic variants identified from tumor sequencing. In total, we identify 215 proteins with significant spatial constraints on the distribution of disease-causing missense variants in experimentally derived protein structures, only 65 (30%) of which have been previously reported. This analysis identifies many clusters not detectable from sequence information alone; only 12% of proteins with significant clustering in 3D were identified from similar analyses of linear protein sequence. Furthermore, spatial analyses of mutations in homology-based structural models are highly correlated with those from experimentally derived structures, supporting the use of computationally derived models. Our approach highlights significant differences in the spatial constraints on different classes of mutations in protein structure and identifies regions of potential function within individual proteins.
Collapse
Affiliation(s)
- R Michael Sivley
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN 37232, USA
| | - Xiaoyi Dou
- Department of Computer Science, Vanderbilt University, Nashville, TN 37212, USA
| | - Jens Meiler
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN 37232, USA; Department of Chemistry, Vanderbilt University, Nashville, TN 37212, USA; Center for Structural Biology, Vanderbilt University, Nashville, TN 37212, USA
| | - William S Bush
- Department for Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH 44106, USA; Institute for Computational Biology, Case Western Reserve University, Cleveland, OH 44106, USA.
| | - John A Capra
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN 37232, USA; Department of Computer Science, Vanderbilt University, Nashville, TN 37212, USA; Center for Structural Biology, Vanderbilt University, Nashville, TN 37212, USA; Department of Biological Sciences, Vanderbilt University, Nashville, TN 37232, USA; Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37232, USA.
| |
Collapse
|
35
|
|
36
|
Uehara T, Takenouchi T, Kosaki R, Kurosawa K, Mizuno S, Kosaki K. Redefining the phenotypic spectrum of de novo heterozygous CDK13 variants: Three patients without cardiac defects. Eur J Med Genet 2017; 61:243-247. [PMID: 29222009 DOI: 10.1016/j.ejmg.2017.12.004] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2017] [Revised: 11/08/2017] [Accepted: 12/04/2017] [Indexed: 11/16/2022]
Abstract
Recently, 7 patients with de novo constitutional non-synonymous mutations in the CDK13 gene were ascertained through a trio exome analysis of a large cohort of 610 patients with congenital cardiac diseases. Despite another report describing 9 additional patients, the clinical spectrum of this condition has yet to be defined. Herein, we report 3 patients with heterozygous constitutional CDK13 mutations, who were ascertained through exome analysis of children with intellectual disability and minor anomalies, who lacked cardiac anomalies. Two patients had a c.2149G > A, p.Gly717Arg mutation, and one had a c.2525A > G, p. Asn842Ser mutation. A review of the previously described patients and those described herein has enabled the following points to be clarified. First, congenital heart diseases are not an essential feature (13/19). Second, nasal features may help syndromic recognition (14/16). Third, widely spaced and peg-shaped teeth may represent a previously unappreciated diagnostic clue for this newly identified syndrome. Here, we show that p.Gly717Arg represents a hotspot in addition to p.Asn842Ser. We suggest that this CDK13-related disorder may represent a clinically recognizable syndrome.
Collapse
Affiliation(s)
- Tomoko Uehara
- Center for Medical Genetics, Keio University School of Medicine, Tokyo, Japan
| | - Toshiki Takenouchi
- Department of Pediatrics, Keio University School of Medicine, Tokyo, Japan
| | - Rika Kosaki
- Division of Medical Genetics, National Center for Child Health and Development, Tokyo, Japan
| | - Kenji Kurosawa
- Division of Medical Genetics, Kanagawa Children's Medical Center, Yokohama, Japan
| | - Seiji Mizuno
- Department of Pediatrics, Central Hospital, Aichi Human Service Center, Aichi, Japan
| | - Kenjiro Kosaki
- Center for Medical Genetics, Keio University School of Medicine, Tokyo, Japan.
| |
Collapse
|
37
|
Baker PJ, De Nardo D, Moghaddas F, Tran LS, Bachem A, Nguyen T, Hayman T, Tye H, Vince JE, Bedoui S, Ferrero RL, Masters SL. Posttranslational Modification as a Critical Determinant of Cytoplasmic Innate Immune Recognition. Physiol Rev 2017; 97:1165-1209. [DOI: 10.1152/physrev.00026.2016] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2016] [Revised: 02/27/2017] [Accepted: 02/28/2017] [Indexed: 12/21/2022] Open
Abstract
Cell surface innate immune receptors can directly detect a variety of extracellular pathogens to which cytoplasmic innate immune sensors are rarely exposed. Instead, within the cytoplasm, the environment is rife with cellular machinery and signaling pathways that are indirectly perturbed by pathogenic microbes to activate intracellular sensors, such as pyrin, NLRP1, NLRP3, or NLRC4. Therefore, subtle changes in key intracellular processes such as phosphorylation, ubiquitination, and other pathways leading to posttranslational protein modification are key determinants of innate immune recognition in the cytoplasm. This concept is critical to establish the “guard hypothesis” whereby otherwise homeostatic pathways that keep innate immune sensors at bay are released in response to alterations in their posttranslational modification status. Originally identified in plants, evidence that a similar guardlike mechanism exists in humans has recently been identified, whereby a mutation that prevents phosphorylation of the innate immune sensor pyrin triggers a dominantly inherited autoinflammatory disease. It is also noteworthy that even when a cytoplasmic innate immune sensor has a direct ligand, such as bacterial peptidoglycan (NOD1 or NOD2), RNA (RIG-I or MDA5), or DNA (cGAS or IFI16), it can still be influenced by posttranslational modification to dramatically alter its response. Therefore, due to their existence in the cytoplasmic milieu, posttranslational modification is a key determinant of intracellular innate immune receptor functionality.
Collapse
Affiliation(s)
- Paul J. Baker
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; Hudson Institute of Medical Research, Monash University, Centre for Innate Immunity and Infectious Diseases, Clayton, Victoria, Australia; The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia; and Departments of Medical Biology and of Microbiology and Immunology, The University of Melbourne, Parkville, Australia
| | - Dominic De Nardo
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; Hudson Institute of Medical Research, Monash University, Centre for Innate Immunity and Infectious Diseases, Clayton, Victoria, Australia; The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia; and Departments of Medical Biology and of Microbiology and Immunology, The University of Melbourne, Parkville, Australia
| | - Fiona Moghaddas
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; Hudson Institute of Medical Research, Monash University, Centre for Innate Immunity and Infectious Diseases, Clayton, Victoria, Australia; The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia; and Departments of Medical Biology and of Microbiology and Immunology, The University of Melbourne, Parkville, Australia
| | - Le Son Tran
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; Hudson Institute of Medical Research, Monash University, Centre for Innate Immunity and Infectious Diseases, Clayton, Victoria, Australia; The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia; and Departments of Medical Biology and of Microbiology and Immunology, The University of Melbourne, Parkville, Australia
| | - Annabell Bachem
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; Hudson Institute of Medical Research, Monash University, Centre for Innate Immunity and Infectious Diseases, Clayton, Victoria, Australia; The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia; and Departments of Medical Biology and of Microbiology and Immunology, The University of Melbourne, Parkville, Australia
| | - Tan Nguyen
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; Hudson Institute of Medical Research, Monash University, Centre for Innate Immunity and Infectious Diseases, Clayton, Victoria, Australia; The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia; and Departments of Medical Biology and of Microbiology and Immunology, The University of Melbourne, Parkville, Australia
| | - Thomas Hayman
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; Hudson Institute of Medical Research, Monash University, Centre for Innate Immunity and Infectious Diseases, Clayton, Victoria, Australia; The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia; and Departments of Medical Biology and of Microbiology and Immunology, The University of Melbourne, Parkville, Australia
| | - Hazel Tye
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; Hudson Institute of Medical Research, Monash University, Centre for Innate Immunity and Infectious Diseases, Clayton, Victoria, Australia; The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia; and Departments of Medical Biology and of Microbiology and Immunology, The University of Melbourne, Parkville, Australia
| | - James E. Vince
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; Hudson Institute of Medical Research, Monash University, Centre for Innate Immunity and Infectious Diseases, Clayton, Victoria, Australia; The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia; and Departments of Medical Biology and of Microbiology and Immunology, The University of Melbourne, Parkville, Australia
| | - Sammy Bedoui
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; Hudson Institute of Medical Research, Monash University, Centre for Innate Immunity and Infectious Diseases, Clayton, Victoria, Australia; The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia; and Departments of Medical Biology and of Microbiology and Immunology, The University of Melbourne, Parkville, Australia
| | - Richard L. Ferrero
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; Hudson Institute of Medical Research, Monash University, Centre for Innate Immunity and Infectious Diseases, Clayton, Victoria, Australia; The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia; and Departments of Medical Biology and of Microbiology and Immunology, The University of Melbourne, Parkville, Australia
| | - Seth L. Masters
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; Hudson Institute of Medical Research, Monash University, Centre for Innate Immunity and Infectious Diseases, Clayton, Victoria, Australia; The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia; and Departments of Medical Biology and of Microbiology and Immunology, The University of Melbourne, Parkville, Australia
| |
Collapse
|
38
|
Steward CA, Parker APJ, Minassian BA, Sisodiya SM, Frankish A, Harrow J. Genome annotation for clinical genomic diagnostics: strengths and weaknesses. Genome Med 2017; 9:49. [PMID: 28558813 PMCID: PMC5448149 DOI: 10.1186/s13073-017-0441-1] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
The Human Genome Project and advances in DNA sequencing technologies have revolutionized the identification of genetic disorders through the use of clinical exome sequencing. However, in a considerable number of patients, the genetic basis remains unclear. As clinicians begin to consider whole-genome sequencing, an understanding of the processes and tools involved and the factors to consider in the annotation of the structure and function of genomic elements that might influence variant identification is crucial. Here, we discuss and illustrate the strengths and weaknesses of approaches for the annotation and classification of important elements of protein-coding genes, other genomic elements such as pseudogenes and the non-coding genome, comparative-genomic approaches for inferring gene function, and new technologies for aiding genome annotation, as a practical guide for clinicians when considering pathogenic sequence variation. Complete and accurate annotation of structure and function of genome features has the potential to reduce both false-negative (from missing annotation) and false-positive (from incorrect annotation) errors in causal variant identification in exome and genome sequences. Re-analysis of unsolved cases will be necessary as newer technology improves genome annotation, potentially improving the rate of diagnosis.
Collapse
Affiliation(s)
- Charles A Steward
- Congenica Ltd, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1DR, UK. .,The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | | | - Berge A Minassian
- Department of Pediatrics (Neurology), University of Texas Southwestern, Dallas, TX, USA.,Program in Genetics and Genome Biology and Department of Paediatrics (Neurology), The Hospital for Sick Children and University of Toronto, Toronto, Canada
| | - Sanjay M Sisodiya
- Department of Clinical and Experimental Epilepsy, UCL Institute of Neurology, London, WC1N 3BG, UK.,Chalfont Centre for Epilepsy, Chesham Lane, Chalfont St Peter, Buckinghamshire, SL9 0RJ, UK
| | - Adam Frankish
- The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Jennifer Harrow
- The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,Illumina Inc, Great Chesterford, Essex, CB10 1XL, UK
| |
Collapse
|
39
|
Chakravorty S, Hegde M. Gene and Variant Annotation for Mendelian Disorders in the Era of Advanced Sequencing Technologies. Annu Rev Genomics Hum Genet 2017; 18:229-256. [PMID: 28415856 DOI: 10.1146/annurev-genom-083115-022545] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Comprehensive annotations of genetic and noncoding regions and corresponding accurate variant classification for Mendelian diseases are the next big challenge in the new genomic era of personalized medicine. Progress in the development of faster and more accurate pipelines for genome annotation and variant classification will lead to the discovery of more novel disease associations and candidate therapeutic targets. This ultimately will facilitate better patient recruitment in clinical trials. In this review, we describe the trends in research at the intersection of basic and clinical genomics that aims to increase understanding of overall genomic complexity, complex inheritance patterns of disease, and patient-phenotype-specific genomic associations. We describe the emerging field of translational functional genomics, which integrates other functional "-omics" approaches that support next-generation sequencing genomic data in order to facilitate personalized diagnostics, disease management, biomarker discovery, and medicine. We also discuss the utility of this integrated approach for diagnostic clinics and medical databases and its role in the future of personalized medicine.
Collapse
Affiliation(s)
- Samya Chakravorty
- Department of Human Genetics, Emory University School of Medicine, Atlanta, Georgia 30322;
| | - Madhuri Hegde
- Department of Human Genetics, Emory University School of Medicine, Atlanta, Georgia 30322;
| |
Collapse
|
40
|
Özdemir BC, Dotto GP. Racial Differences in Cancer Susceptibility and Survival: More Than the Color of the Skin? Trends Cancer 2017; 3:181-197. [PMID: 28718431 DOI: 10.1016/j.trecan.2017.02.002] [Citation(s) in RCA: 123] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2016] [Revised: 02/03/2017] [Accepted: 02/06/2017] [Indexed: 12/14/2022]
Abstract
Epidemiological studies point to race as a determining factor in cancer susceptibility. In US registries recording cancer incidence and survival by race (distinguishing 'black versus white'), individuals of African ancestry have a globally increased risk of malignancies compared with Caucasians and Asian Americans. Differences in socioeconomic status and health-care access play a key role. However, the lesser disease susceptibility of Hispanic populations with comparable lifestyles and socioeconomic status as African Americans (Hispanic paradox) points to the concomitant importance of genetic determinants. Here, we overview the molecular basis of racial disparity in cancer susceptibility ranging from genetic polymorphisms and cancer-driver gene mutations to obesity, chronic inflammation, and immune responses. We discuss implications for race-adapted cancer screening programs and clinical trials to reduce disparities in cancer burden.
Collapse
Affiliation(s)
- Berna C Özdemir
- Department of Oncology, Centre Hospitalier Universitaire Vaudois, Rue du Bugnon 46, 1011 Lausanne, Switzerland
| | - Gian-Paolo Dotto
- Department of Biochemistry, University of Lausanne, Chemin des Boveresses 155, 1066 Épalinges, Switzerland; Harvard Dermatology Department and Cutaneous Biology Research Center, Massachusetts General Hospital, 55 Fruit Street, Boston, MA 02129, USA.
| |
Collapse
|
41
|
Abou-Abbass H, Abou-El-Hassan H, Bahmad H, Zibara K, Zebian A, Youssef R, Ismail J, Zhu R, Zhou S, Dong X, Nasser M, Bahmad M, Darwish H, Mechref Y, Kobeissy F. Glycosylation and other PTMs alterations in neurodegenerative diseases: Current status and future role in neurotrauma. Electrophoresis 2016; 37:1549-61. [PMID: 26957254 PMCID: PMC4962686 DOI: 10.1002/elps.201500585] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2015] [Revised: 02/28/2016] [Accepted: 02/29/2016] [Indexed: 12/12/2022]
Abstract
Traumatic brain injuries (TBIs) present a chief public health threat affecting nations worldwide. As numbers of patients afflicted by TBI are expected to rise, the necessity to increase our understanding of the pathophysiological mechanism(s) as a result of TBI mounts. TBI is known to augment the risk of developing a number of neurodegenerative diseases (NDs) such as Alzheimer's disease (AD) and Parkinson's disease (PD). Hence, it is rational to assume that a common mechanistic ground links the pathophysiology of NDs to that of TBIs. Through this review, we aim to identify the protein-protein interactions, differential proteins expression, and PTMs, mainly glycosylation, that are involved in the pathogenesis of both ND and TBI. OVID and PubMed have been rigorously searched to identify studies that utilized advanced proteomic platforms (MS based) and systems biology tools to unfold the mechanism(s) behind ND in an attempt to unveil the mysterious biological processes that occur postinjury. Various PTMs have been found to be common between TBI and AD, whereas no similarities have been found between TBI and PD. Phosphorylated tau protein, glycosylated amyloid precursor protein, and many other modifications appear to be common in both TBI and AD. PTMs, differential protein profiles, and altered biological pathways appear to have critical roles in ND processes by interfering with their pathological condition in a manner similar to TBI. Advancement in glycoproteomic studies pertaining to ND and TBI is urgently needed in order to develop better diagnostic tools, therapies, and more favorable prognoses.
Collapse
Affiliation(s)
- Hussein Abou-Abbass
- Department of Biochemistry and Molecular Genetics, Faculty of Medicine, American University of Beirut, Beirut, Lebanon
- Faculty of Medicine, Beirut Arab University, Beirut, Lebanon
| | | | - Hisham Bahmad
- Faculty of Medicine, Beirut Arab University, Beirut, Lebanon
- Department of Anatomy, Cell Biology and Physiological Sciences, Faculty of Medicine, American University of Beirut, Beirut, Lebanon
| | - Kazem Zibara
- ER045 - Laboratory of Stem Cells, DSST, Lebanese University, Beirut, Lebanon
- Department of Biology, Faculty of Sciences-I, Lebanese University, Beirut, Lebanon
| | - Abir Zebian
- Department of Biochemistry and Molecular Genetics, Faculty of Medicine, American University of Beirut, Beirut, Lebanon
| | - Rabab Youssef
- Department of Biochemistry and Molecular Genetics, Faculty of Medicine, American University of Beirut, Beirut, Lebanon
| | - Joy Ismail
- Department of Anatomy, Cell Biology and Physiological Sciences, Faculty of Medicine, American University of Beirut, Beirut, Lebanon
| | - Rui Zhu
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, TX, USA
| | - Shiyue Zhou
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, TX, USA
| | - Xue Dong
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, TX, USA
| | - Mayse Nasser
- Faculty of Medicine, Beirut Arab University, Beirut, Lebanon
| | - Marwan Bahmad
- Faculty of Medicine, Beirut Arab University, Beirut, Lebanon
| | - Hala Darwish
- Faculty of Medicine-School of Nursing, American University of Beirut, New York, NY, USA
| | - Yehia Mechref
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, TX, USA
| | - Firas Kobeissy
- Department of Biochemistry and Molecular Genetics, Faculty of Medicine, American University of Beirut, Beirut, Lebanon
| |
Collapse
|
42
|
Frequent mutations in acetylation and ubiquitination sites suggest novel driver mechanisms of cancer. Genome Med 2016; 8:55. [PMID: 27175787 PMCID: PMC4864925 DOI: 10.1186/s13073-016-0311-2] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2015] [Accepted: 04/19/2016] [Indexed: 12/14/2022] Open
Abstract
Background Discovery of cancer drivers is a major goal of cancer research. Driver genes and pathways are often predicted using mutation frequency, assuming that statistically significant recurrence of specific somatic mutations across independent samples indicates their importance in cancer. However, many mutations, including known cancer drivers, are not observed at high frequency. Fortunately, abundant information is available about functional “active sites” in proteins that can be integrated with mutations to predict cancer driver genes, even based on low frequency mutations. Further, considering active site information predicts detailed biochemical mechanisms impacted by the mutations. Post-translational modifications (PTMs) are active sites that are regulatory switches in proteins and pathways. We analyzed acetylation and ubiquitination, two important PTM types often involved in chromatin organization and protein degradation, to find proteins that are significantly affected by tumor somatic mutations. Methods We performed computational analyses of acetylation and ubiquitination sites in a pan-cancer dataset of 3200 tumor samples from The Cancer Genome Atlas (TCGA). These analyses were targeted at different levels of biological organization including individual genes, pathway annotated gene sets, and protein-protein interaction networks. Results Acetylation and ubiquitination site mutations are enriched in cancer with significantly stronger evolutionary conservation and accumulation in protein domains. Gene-focused analysis with the ActiveDriver method reveals significant co-occurrences of acetylation and ubiquitination PTMs and mutation hotspots in known oncoproteins (TP53, AKT1, IDH1) and highlights candidate cancer driver genes with PTM-related mechanisms (e.g. several histone proteins and the splicing factor SF3B1). Pathway analysis shows that PTM mutations in acetylation and ubiquitination sites accumulate in cancer-related processes such as cell cycle, apoptosis, chromatin regulation, and metabolism. Integrated mutation analysis of clinical information and protein interaction networks suggests that many PTM-specific mutations associate with decreased patient survival. Conclusions Mutation analysis of acetylation and ubiquitination PTM sites reveals their importance in cancer. As PTM networks are increasingly mapped and related enzymes are often druggable, deeper investigation of specific associated mutations may lead to the discovery of treatment-relevant cellular mechanisms. Electronic supplementary material The online version of this article (doi:10.1186/s13073-016-0311-2) contains supplementary material, which is available to authorized users.
Collapse
|
43
|
Reimand J, Arak T, Adler P, Kolberg L, Reisberg S, Peterson H, Vilo J. g:Profiler-a web server for functional interpretation of gene lists (2016 update). Nucleic Acids Res 2016; 44:W83-9. [PMID: 27098042 PMCID: PMC4987867 DOI: 10.1093/nar/gkw199] [Citation(s) in RCA: 919] [Impact Index Per Article: 102.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2016] [Accepted: 03/13/2016] [Indexed: 12/13/2022] Open
Abstract
Functional enrichment analysis is a key step in interpreting gene lists discovered in diverse high-throughput experiments. g:Profiler studies flat and ranked gene lists and finds statistically significant Gene Ontology terms, pathways and other gene function related terms. Translation of hundreds of gene identifiers is another core feature of g:Profiler. Since its first publication in 2007, our web server has become a popular tool of choice among basic and translational researchers. Timeliness is a major advantage of g:Profiler as genome and pathway information is synchronized with the Ensembl database in quarterly updates. g:Profiler supports 213 species including mammals and other vertebrates, plants, insects and fungi. The 2016 update of g:Profiler introduces several novel features. We have added further functional datasets to interpret gene lists, including transcription factor binding site predictions, Mendelian disease annotations, information about protein expression and complexes and gene mappings of human genetic polymorphisms. Besides the interactive web interface, g:Profiler can be accessed in computational pipelines using our R package, Python interface and BioJS component. g:Profiler is freely available at http://biit.cs.ut.ee/gprofiler/.
Collapse
Affiliation(s)
- Jüri Reimand
- Ontario Institute for Cancer Research, 661 University Avenue, Toronto, ON M5G 0A3, Canada Department of Medical Biophysics, University of Toronto, 101 College Street, Toronto, ON M5G 1L7, Canada
| | - Tambet Arak
- Institute of Computer Science, University of Tartu, Liivi 2, 50409 Tartu, Estonia
| | - Priit Adler
- Institute of Computer Science, University of Tartu, Liivi 2, 50409 Tartu, Estonia
| | - Liis Kolberg
- Institute of Computer Science, University of Tartu, Liivi 2, 50409 Tartu, Estonia
| | - Sulev Reisberg
- Institute of Computer Science, University of Tartu, Liivi 2, 50409 Tartu, Estonia
| | - Hedi Peterson
- Institute of Computer Science, University of Tartu, Liivi 2, 50409 Tartu, Estonia
| | - Jaak Vilo
- Institute of Computer Science, University of Tartu, Liivi 2, 50409 Tartu, Estonia
| |
Collapse
|
44
|
Xiao Q, Miao B, Bi J, Wang Z, Li Y. Prioritizing functional phosphorylation sites based on multiple feature integration. Sci Rep 2016; 6:24735. [PMID: 27090940 PMCID: PMC4835696 DOI: 10.1038/srep24735] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2015] [Accepted: 04/04/2016] [Indexed: 12/15/2022] Open
Abstract
Protein phosphorylation is an important type of post-translational modification that is involved in a variety of biological activities. Most phosphorylation events occur on serine, threonine and tyrosine residues in eukaryotes. In recent years, many phosphorylation sites have been identified as a result of advances in mass-spectrometric techniques. However, a large percentage of phosphorylation sites may be non-functional. Systematically prioritizing functional sites from a large number of phosphorylation sites will be increasingly important for the study of their biological roles. This study focused on exploring the intrinsic features of functional phosphorylation sites to predict whether a phosphosite is likely to be functional. We found significant differences in the distribution of evolutionary conservation, kinase association, disorder score, and secondary structure between known functional and background phosphorylation datasets. We built four different types of classifiers based on the most representative features and found that their performances were similar. We also prioritized 213,837 human phosphorylation sites from a variety of phosphorylation databases, which will be helpful for subsequent functional studies. All predicted results are available for query and download on our website (Predict Functional Phosphosites, PFP, http://pfp.biosino.org/).
Collapse
Affiliation(s)
- Qingyu Xiao
- Key Lab of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, P. R. China.,University of Chinese Academy of Sciences, Beijing, P. R. China
| | - Benpeng Miao
- Key Lab of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, P. R. China.,University of Chinese Academy of Sciences, Beijing, P. R. China
| | - Jie Bi
- University of Chinese Academy of Sciences, Beijing, P. R. China.,Key Lab of Systems Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, P. R. China
| | - Zhen Wang
- Key Lab of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, P. R. China
| | - Yixue Li
- Key Lab of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, P. R. China.,Shanghai Center for Bioinformation Technology, Shanghai Industrial Technology Institute, Shanghai, P. R. China
| |
Collapse
|
45
|
Keegan S, Cortens JP, Beavis RC, Fenyö D. g2pDB: A Database Mapping Protein Post-Translational Modifications to Genomic Coordinates. J Proteome Res 2016; 15:983-90. [PMID: 26842767 DOI: 10.1021/acs.jproteome.5b01018] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Large scale proteomics have made it possible to broadly screen samples for the presence of many types of post-translational modifications, such as phosphorylation, acetylation, and ubiquitination. This type of data has allowed the localization of these modifications to either a specific site on a proteolytically generated peptide or to within a small domain on the peptide. The resulting modification acceptor sites can then be mapped onto the appropriate protein sequences and the information archived. This paper describes the usage of a very large archive of experimental observations of human post-translational modifications to create a map of the most reproducible modification observations onto the complete set of human protein sequences. This set of modification acceptor sites was then directly translated into the genomic coordinates for the codons for the residues at those sites. We constructed the database g2pDB using this protein-to-codon site mapping information. The information in g2pDB has been made available through a RESTful-style API, allowing researchers to determine which specific protein modifications would be perturbed by a set of observed nucleotide variants determined by high throughput DNA or RNA sequencing.
Collapse
Affiliation(s)
- Sarah Keegan
- Center for Health Informatics and Bioinformatics, New York University Medical School , 227 East 30 Street, New York, New York 10016, United States
| | - John P Cortens
- Department of Biochemistry and Medical Genetics, University of Manitoba, Faculty of Health Sciences , 744 Bannatyne Avenue, Winnipeg, MB R3E 0W3, Canada
| | - Ronald C Beavis
- Department of Biochemistry and Medical Genetics, University of Manitoba, Faculty of Health Sciences , 744 Bannatyne Avenue, Winnipeg, MB R3E 0W3, Canada
| | - David Fenyö
- Center for Health Informatics and Bioinformatics, New York University Medical School , 227 East 30 Street, New York, New York 10016, United States
| |
Collapse
|
46
|
Valimberti I, Tiberti M, Lambrughi M, Sarcevic B, Papaleo E. E2 superfamily of ubiquitin-conjugating enzymes: constitutively active or activated through phosphorylation in the catalytic cleft. Sci Rep 2015; 5:14849. [PMID: 26463729 PMCID: PMC4604453 DOI: 10.1038/srep14849] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2015] [Accepted: 08/19/2015] [Indexed: 12/22/2022] Open
Abstract
Protein phosphorylation is a modification that offers a dynamic and reversible mechanism to regulate the majority of cellular processes. Numerous diseases are associated with aberrant regulation of phosphorylation-induced switches. Phosphorylation is emerging as a mechanism to modulate ubiquitination by regulating key enzymes in this pathway. The molecular mechanisms underpinning how phosphorylation regulates ubiquitinating enzymes, however, are elusive. Here, we show the high conservation of a functional site in E2 ubiquitin-conjugating enzymes. In catalytically active E2s, this site contains aspartate or a phosphorylatable serine and we refer to it as the conserved E2 serine/aspartate (CES/D) site. Molecular simulations of substrate-bound and -unbound forms of wild type, mutant and phosphorylated E2s, provide atomistic insight into the role of the CES/D residue for optimal E2 activity. Both the size and charge of the side group at the site play a central role in aligning the substrate lysine toward E2 catalytic cysteine to control ubiquitination efficiency. The CES/D site contributes to the fingerprint of the E2 superfamily. We propose that E2 enzymes can be divided into constitutively active or regulated families. E2s characterized by an aspartate at the CES/D site signify constitutively active E2s, whereas those containing a serine can be regulated by phosphorylation.
Collapse
Affiliation(s)
- Ilaria Valimberti
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza 2, 20126, Milan (Italy)
| | - Matteo Tiberti
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza 2, 20126, Milan (Italy)
| | - Matteo Lambrughi
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza 2, 20126, Milan (Italy)
| | - Boris Sarcevic
- Cell Cycle and Cancer Unit, St. Vincent's Institute of Medical Research and The Department of Medicine, St. Vincent's Hospital, The University of Melbourne, Fitzroy, Melbourne, Victoria 3065, Australia
| | - Elena Papaleo
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza 2, 20126, Milan (Italy)
| |
Collapse
|
47
|
MIMP: predicting the impact of mutations on kinase-substrate phosphorylation. Nat Methods 2015; 12:531-3. [PMID: 25938373 DOI: 10.1038/nmeth.3396] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2014] [Accepted: 03/30/2015] [Indexed: 12/14/2022]
|