1
|
Guo G, Wang X, Zhang Y, Li T. Sequence variations of phase-separating proteins and resources for studying biomolecular condensates. Acta Biochim Biophys Sin (Shanghai) 2023; 55:1119-1132. [PMID: 37464880 PMCID: PMC10423696 DOI: 10.3724/abbs.2023131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Accepted: 06/06/2023] [Indexed: 07/20/2023] Open
Abstract
Phase separation (PS) is an important mechanism underlying the formation of biomolecular condensates. Physiological condensates are associated with numerous biological processes, such as transcription, immunity, signaling, and synaptic transmission. Changes in particular amino acids or segments can disturb the protein's phase behavior and interactions with other biomolecules in condensates. It is thus presumed that variations in the phase-separating-prone domains can significantly impact the properties and functions of condensates. The dysfunction of condensates contributes to a number of pathological processes. Pharmacological perturbation of these condensates is proposed as a promising way to restore physiological states. In this review, we characterize the variations observed in PS proteins that lead to aberrant biomolecular compartmentalization. We also showcase recent advancements in bioinformatics of membraneless organelles (MLOs), focusing on available databases useful for screening PS proteins and describing endogenous condensates, guiding researchers to seek the underlying pathogenic mechanisms of biomolecular condensates.
Collapse
Affiliation(s)
- Gaigai Guo
- Department of Biomedical InformaticsSchool of Basic Medical SciencesPeking University Health Science CenterBeijing100191China
| | - Xinxin Wang
- Department of Biomedical InformaticsSchool of Basic Medical SciencesPeking University Health Science CenterBeijing100191China
| | - Yi Zhang
- Department of Biomedical InformaticsSchool of Basic Medical SciencesPeking University Health Science CenterBeijing100191China
| | - Tingting Li
- Department of Biomedical InformaticsSchool of Basic Medical SciencesPeking University Health Science CenterBeijing100191China
- Key Laboratory for NeuroscienceMinistry of Education/National Health Commission of ChinaPeking UniversityBeijing100191China
| |
Collapse
|
2
|
Lee LYH, Loscalzo J. Network Medicine in Pathobiology. THE AMERICAN JOURNAL OF PATHOLOGY 2019; 189:1311-1326. [PMID: 31014954 DOI: 10.1016/j.ajpath.2019.03.009] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Accepted: 03/05/2019] [Indexed: 12/11/2022]
Abstract
The past decade has witnessed exponential growth in the generation of high-throughput human data across almost all known dimensions of biological systems. The discipline of network medicine has rapidly evolved in parallel, providing an unbiased, comprehensive biological framework through which to interrogate and integrate systematically these large-scale, multi-omic data to enhance our understanding of disease mechanisms and to design drugs that reflect a deep knowledge of molecular pathobiology. In this review, we discuss the key principles of network medicine and the human disease network and explore the latest applications of network medicine in this multi-omic era. We also highlight the current conceptual and technological challenges, which serve as exciting opportunities by which to improve and expand the network-based applications beyond the artificial boundaries of the current state of human pathobiology.
Collapse
Affiliation(s)
| | - Joseph Loscalzo
- Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts.
| |
Collapse
|
3
|
The contribution of intrinsically disordered regions to protein function, cellular complexity, and human disease. Biochem Soc Trans 2017; 44:1185-1200. [PMID: 27911701 PMCID: PMC5095923 DOI: 10.1042/bst20160172] [Citation(s) in RCA: 257] [Impact Index Per Article: 36.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2016] [Revised: 07/20/2016] [Accepted: 07/22/2016] [Indexed: 12/23/2022]
Abstract
In the 1960s, Christian Anfinsen postulated that the unique three-dimensional structure of a protein is determined by its amino acid sequence. This work laid the foundation for the sequence–structure–function paradigm, which states that the sequence of a protein determines its structure, and structure determines function. However, a class of polypeptide segments called intrinsically disordered regions does not conform to this postulate. In this review, I will first describe established and emerging ideas about how disordered regions contribute to protein function. I will then discuss molecular principles by which regulatory mechanisms, such as alternative splicing and asymmetric localization of transcripts that encode disordered regions, can increase the functional versatility of proteins. Finally, I will discuss how disordered regions contribute to human disease and the emergence of cellular complexity during organismal evolution.
Collapse
|
4
|
Tay AP, Pang CNI, Winter DL, Wilkins MR. PTMOracle: A Cytoscape App for Covisualizing and Coanalyzing Post-Translational Modifications in Protein Interaction Networks. J Proteome Res 2017; 16:1988-2003. [PMID: 28349685 DOI: 10.1021/acs.jproteome.6b01052] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Post-translational modifications of proteins (PTMs) act as key regulators of protein activity and of protein-protein interactions (PPIs). To date, it has been difficult to comprehensively explore functional links between PTMs and PPIs. To address this, we developed PTMOracle, a Cytoscape app for coanalyzing PTMs within PPI networks. PTMOracle also allows extensive data to be integrated and coanalyzed with PPI networks, allowing the role of domains, motifs, and disordered regions to be considered. For proteins of interest, or a whole proteome, PTMOracle can generate network visualizations to reveal complex PTM-associated relationships. This is assisted by OraclePainter for coloring proteins by modifications, OracleTools for network analytics, and OracleResults for exploring tabulated findings. To illustrate the use of PTMOracle, we investigate PTM-associated relationships and their role in PPIs in four case studies. In the yeast interactome and its rich set of PTMs, we construct and explore histone-associated and domain-domain interaction networks and show how integrative approaches can predict kinases involved in phosphodegrons. In the human interactome, a phosphotyrosine-associated network is analyzed but highlights the sparse nature of human PPI networks and lack of PTM-associated data. PTMOracle is open source and available at the Cytoscape app store: http://apps.cytoscape.org/apps/ptmoracle .
Collapse
Affiliation(s)
- Aidan P Tay
- Systems Biology Initiative, School of Biotechnology and Biomolecular Sciences, The University of New South Wales , Sydney, New South Wales 2052, Australia
| | - Chi Nam Ignatius Pang
- Systems Biology Initiative, School of Biotechnology and Biomolecular Sciences, The University of New South Wales , Sydney, New South Wales 2052, Australia
| | - Daniel L Winter
- Systems Biology Initiative, School of Biotechnology and Biomolecular Sciences, The University of New South Wales , Sydney, New South Wales 2052, Australia
| | - Marc R Wilkins
- Systems Biology Initiative, School of Biotechnology and Biomolecular Sciences, The University of New South Wales , Sydney, New South Wales 2052, Australia
| |
Collapse
|
5
|
Charitou T, Bryan K, Lynn DJ. Using biological networks to integrate, visualize and analyze genomics data. Genet Sel Evol 2016; 48:27. [PMID: 27036106 PMCID: PMC4818439 DOI: 10.1186/s12711-016-0205-1] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2015] [Accepted: 03/16/2016] [Indexed: 12/22/2022] Open
Abstract
Network biology is a rapidly developing area of biomedical research and reflects the current view that complex phenotypes, such as disease susceptibility, are not the result of single gene mutations that act in isolation but are rather due to the perturbation of a gene’s network context. Understanding the topology of these molecular interaction networks and identifying the molecules that play central roles in their structure and regulation is a key to understanding complex systems. The falling cost of next-generation sequencing is now enabling researchers to routinely catalogue the molecular components of these networks at a genome-wide scale and over a large number of different conditions. In this review, we describe how to use publicly available bioinformatics tools to integrate genome-wide ‘omics’ data into a network of experimentally-supported molecular interactions. In addition, we describe how to visualize and analyze these networks to identify topological features of likely functional relevance, including network hubs, bottlenecks and modules. We show that network biology provides a powerful conceptual approach to integrate and find patterns in genome-wide genomic data but we also discuss the limitations and caveats of these methods, of which researchers adopting these methods must remain aware.
Collapse
Affiliation(s)
- Theodosia Charitou
- EMBL Australia Group, Infection and Immunity, South Australian Health and Medical Research Institute (SAHMRI), North Terrace, Adelaide, SA, 5000, Australia.,Systems Biology Ireland, University College Dublin, Belfield 4, Ireland.,Teagasc, The Agriculture and Food Development Authority, Co Meath, Ireland
| | - Kenneth Bryan
- EMBL Australia Group, Infection and Immunity, South Australian Health and Medical Research Institute (SAHMRI), North Terrace, Adelaide, SA, 5000, Australia
| | - David J Lynn
- EMBL Australia Group, Infection and Immunity, South Australian Health and Medical Research Institute (SAHMRI), North Terrace, Adelaide, SA, 5000, Australia. .,School of Medicine, Flinders University, Bedford Park, SA, 5042, Australia.
| |
Collapse
|
6
|
Tagore S, De RK. Evolutionary growth of certain metabolic pathways involved in the functioning of GAD and INS genes in Type 1 Diabetes Mellitus: Their architecture and stability. Comput Biol Med 2015; 61:19-35. [DOI: 10.1016/j.compbiomed.2015.03.012] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2014] [Revised: 03/12/2015] [Accepted: 03/13/2015] [Indexed: 11/25/2022]
|
7
|
Tay AP, Pang CNI, Twine NA, Hart-Smith G, Harkness L, Kassem M, Wilkins MR. Proteomic Validation of Transcript Isoforms, Including Those Assembled from RNA-Seq Data. J Proteome Res 2015; 14:3541-54. [PMID: 25961807 DOI: 10.1021/pr5011394] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Human proteome analysis now requires an understanding of protein isoforms. We recently published the PG Nexus pipeline, which facilitates high confidence validation of exons and splice junctions by integrating genomics and proteomics data. Here we comprehensively explore how RNA-seq transcriptomics data, and proteomic analysis of the same sample, can identify protein isoforms. RNA-seq data from human mesenchymal (hMSC) stem cells were analyzed with our new TranscriptCoder tool to generate a database of protein isoform sequences. MS/MS data from matching hMSC samples were then matched against the TranscriptCoder-derived database, along with Ensembl and the neXtProt database. Querying the TranscriptCoder-derived or Ensembl database could unambiguously identify ∼450 protein isoforms, with isoform-specific proteotypic peptides, including candidate hMSC-specific isoforms for the genes DPYSL2 and FXR1. Where isoform-specific peptides did not exist, groups of nonisoform-specific proteotypic peptides could specifically identify many isoforms. In both the above cases, isoforms will be detectable with targeted MS/MS assays. Unfortunately, our analysis also revealed that some isoforms will be difficult to identify unambiguously as they do not have peptides that are sufficiently distinguishing. We covisualize mRNA isoforms and peptides in a genome browser to illustrate the above situations. Mass spectrometry data is available via ProteomeXchange (PXD001449).
Collapse
Affiliation(s)
- Aidan P Tay
- Systems Biology Initiative, The University of New South Wales , Sydney, New South Wales 2052, Australia.,School of Biotechnology and Biomolecular Sciences, The University of New South Wales , Sydney, New South Wales 2052, Australia
| | - Chi Nam Ignatius Pang
- Systems Biology Initiative, The University of New South Wales , Sydney, New South Wales 2052, Australia.,School of Biotechnology and Biomolecular Sciences, The University of New South Wales , Sydney, New South Wales 2052, Australia
| | - Natalie A Twine
- Systems Biology Initiative, The University of New South Wales , Sydney, New South Wales 2052, Australia.,School of Biotechnology and Biomolecular Sciences, The University of New South Wales , Sydney, New South Wales 2052, Australia
| | - Gene Hart-Smith
- Systems Biology Initiative, The University of New South Wales , Sydney, New South Wales 2052, Australia.,School of Biotechnology and Biomolecular Sciences, The University of New South Wales , Sydney, New South Wales 2052, Australia
| | - Linda Harkness
- Endocrine Research Laboratory (KMEB), Department of Endocrinology and Metabolism, Odense University Hospital & University of Southern Denmark , Odense 5230, Denmark
| | - Moustapha Kassem
- Endocrine Research Laboratory (KMEB), Department of Endocrinology and Metabolism, Odense University Hospital & University of Southern Denmark , Odense 5230, Denmark
| | - Marc R Wilkins
- Systems Biology Initiative, The University of New South Wales , Sydney, New South Wales 2052, Australia.,School of Biotechnology and Biomolecular Sciences, The University of New South Wales , Sydney, New South Wales 2052, Australia
| |
Collapse
|
8
|
Schramm SJ, Jayaswal V, Goel A, Li SS, Yang YH, Mann GJ, Wilkins MR. Molecular interaction networks for the analysis of human disease: utility, limitations, and considerations. Proteomics 2014; 13:3393-405. [PMID: 24166987 DOI: 10.1002/pmic.201200570] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2012] [Revised: 09/11/2013] [Accepted: 10/07/2013] [Indexed: 01/01/2023]
Abstract
High-throughput '-omics' data can be combined with large-scale molecular interaction networks, for example, protein-protein interaction networks, to provide a unique framework for the investigation of human molecular biology. Interest in these integrative '-omics' methods is growing rapidly because of their potential to understand complexity and association with disease; such approaches have a focus on associations between phenotype and "network-type." The potential of this research is enticing, yet there remain a series of important considerations. Here, we discuss interaction data selection, data quality, the relative merits of using data from large high-throughput studies versus a meta-database of smaller literature-curated studies, and possible issues of sociological or inspection bias in interaction data. Other work underway, especially international consortia to establish data formats, quality standards and address data redundancy, and the improvements these efforts are making to the field, is also evaluated. We present options for researchers intending to use large-scale molecular interaction networks as a functional context for protein or gene expression data, including microRNAs, especially in the context of human disease.
Collapse
Affiliation(s)
- Sarah-Jane Schramm
- Sydney Medical School, Westmead Millennium Institute for Medical Research, The University of Sydney, Sydney, NSW, Australia; Melanoma Institute Australia, Sydney, NSW, Australia
| | | | | | | | | | | | | |
Collapse
|
9
|
Van Roey K, Uyar B, Weatheritt RJ, Dinkel H, Seiler M, Budd A, Gibson TJ, Davey NE. Short Linear Motifs: Ubiquitous and Functionally Diverse Protein Interaction Modules Directing Cell Regulation. Chem Rev 2014; 114:6733-78. [DOI: 10.1021/cr400585q] [Citation(s) in RCA: 293] [Impact Index Per Article: 29.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Affiliation(s)
- Kim Van Roey
- Structural
and Computational Biology Unit, European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | - Bora Uyar
- Structural
and Computational Biology Unit, European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | - Robert J. Weatheritt
- MRC
Laboratory of Molecular Biology (LMB), Francis Crick Avenue, Cambridge Biomedical Campus, Cambridge CB2 0QH, United Kingdom
| | - Holger Dinkel
- Structural
and Computational Biology Unit, European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | - Markus Seiler
- Structural
and Computational Biology Unit, European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | - Aidan Budd
- Structural
and Computational Biology Unit, European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | - Toby J. Gibson
- Structural
and Computational Biology Unit, European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | - Norman E. Davey
- Structural
and Computational Biology Unit, European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
- Department
of Physiology, University of California, San Francisco, San Francisco, California 94143, United States
| |
Collapse
|
10
|
Mining large-scale response networks reveals 'topmost activities' in Mycobacterium tuberculosis infection. Sci Rep 2014; 3:2302. [PMID: 23892477 PMCID: PMC3725478 DOI: 10.1038/srep02302] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2013] [Accepted: 07/10/2013] [Indexed: 02/02/2023] Open
Abstract
Mycobacterium tuberculosis owes its high pathogenic potential to its ability to evade host immune responses and thrive inside the macrophage. The outcome of infection is largely determined by the cellular response comprising a multitude of molecular events. The complexity and inter-relatedness in the processes makes it essential to adopt systems approaches to study them. In this work, we construct a comprehensive network of infection-related processes in a human macrophage comprising 1888 proteins and 14,016 interactions. We then compute response networks based on available gene expression profiles corresponding to states of health, disease and drug treatment. We use a novel formulation for mining response networks that has led to identifying highest activities in the cell. Highest activity paths provide mechanistic insights into pathogenesis and response to treatment. The approach used here serves as a generic framework for mining dynamic changes in genome-scale protein interaction networks.
Collapse
|
11
|
Elzawahry A, Patil A, Kumagai Y, Suzuki Y, Nakai K. Innate immunity interactome dynamics. GENE REGULATION AND SYSTEMS BIOLOGY 2014; 8:1-15. [PMID: 24453478 PMCID: PMC3885269 DOI: 10.4137/grsb.s12850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Innate immune response involves protein–protein interactions, deoxyribonucleic acid (DNA)–protein interactions and signaling cascades. So far, thousands of protein–protein interactions have been curated as a static interaction map. However, protein–protein interactions involved in innate immune response are dynamic. We recorded the dynamics in the interactome during innate immune response by combining gene expression data of lipopolysaccharide (LPS)-stimulated dendritic cells with protein–protein interactions data. We identified the differences in interactome during innate immune response by constructing differential networks and identifying protein modules, which were up-/down-regulated at each stage during the innate immune response. For each protein complex, we identified enriched biological processes and pathways. In addition, we identified core interactions that are conserved throughout the innate immune response and their enriched gene ontology terms and pathways. We defined two novel measures to assess the differences between network maps at different time points. We found that the protein interaction network at 1 hour after LPS stimulation has the highest interactions protein ratio, which indicates a role for proteins with large number of interactions in innate immune response. A pairwise differential matrix allows for the global visualization of the differences between different networks. We investigated the toll-like receptor subnetwork and found that S100A8 is down-regulated in dendritic cells after LPS stimulation. Identified protein complexes have a crucial role not only in innate immunity, but also in circadian rhythms, pathways involved in cancer, and p53 pathways. The study confirmed previous work that reported a strong correlation between cancer and immunity.
Collapse
Affiliation(s)
- Asmaa Elzawahry
- Institute of Medical Science, The University of Tokyo, Minato-ku, Tokyo, Japan. ; Graduate school of Frontier Sciences, The University of Tokyo, Kashiwa-shi, Chiba, Japan
| | - Ashwini Patil
- Institute of Medical Science, The University of Tokyo, Minato-ku, Tokyo, Japan
| | - Yutaro Kumagai
- Laboratory of Host Defence, WPI Immunology Frontier Research Center, Osaka University, Suita, Osaka, Japan. ; Global Center of Excellence Program, Frontier Biomedical Science Underlying Organelle Network Biology, Osaka University, Suita, Osaka, Japan
| | - Yutaka Suzuki
- Graduate school of Frontier Sciences, The University of Tokyo, Kashiwa-shi, Chiba, Japan
| | - Kenta Nakai
- Institute of Medical Science, The University of Tokyo, Minato-ku, Tokyo, Japan. ; Graduate school of Frontier Sciences, The University of Tokyo, Kashiwa-shi, Chiba, Japan
| |
Collapse
|
12
|
Pang CNI, Tay AP, Aya C, Twine NA, Harkness L, Hart-Smith G, Chia SZ, Chen Z, Deshpande NP, Kaakoush NO, Mitchell HM, Kassem M, Wilkins MR. Tools to covisualize and coanalyze proteomic data with genomes and transcriptomes: validation of genes and alternative mRNA splicing. J Proteome Res 2013; 13:84-98. [PMID: 24152167 DOI: 10.1021/pr400820p] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Direct links between proteomic and genomic/transcriptomic data are not frequently made, partly because of lack of appropriate bioinformatics tools. To help address this, we have developed the PG Nexus pipeline. The PG Nexus allows users to covisualize peptides in the context of genomes or genomic contigs, along with RNA-seq reads. This is done in the Integrated Genome Viewer (IGV). A Results Analyzer reports the precise base position where LC-MS/MS-derived peptides cover genes or gene isoforms, on the chromosomes or contigs where this occurs. In prokaryotes, the PG Nexus pipeline facilitates the validation of genes, where annotation or gene prediction is available, or the discovery of genes using a "virtual protein"-based unbiased approach. We illustrate this with a comprehensive proteogenomics analysis of two strains of Campylobacter concisus . For higher eukaryotes, the PG Nexus facilitates gene validation and supports the identification of mRNA splice junction boundaries and splice variants that are protein-coding. This is illustrated with an analysis of splice junctions covered by human phosphopeptides, and other examples of relevance to the Chromosome-Centric Human Proteome Project. The PG Nexus is open-source and available from https://github.com/IntersectAustralia/ap11_Samifier. It has been integrated into Galaxy and made available in the Galaxy tool shed.
Collapse
Affiliation(s)
- Chi Nam Ignatius Pang
- Systems Biology Initiative, The University of New South Wales , Sydney, New South Wales 2052, Australia
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Promiscuity as a functional trait: intrinsically disordered regions as central players of interactomes. Biochem J 2013; 454:361-9. [PMID: 23988124 DOI: 10.1042/bj20130545] [Citation(s) in RCA: 133] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Because of their pervasiveness in eukaryotic genomes and their unique properties, understanding the role that ID (intrinsically disordered) regions in proteins play in the interactome is essential for gaining a better understanding of the network. Especially critical in determining this role is their ability to bind more than one partner using the same region. Studies have revealed that proteins containing ID regions tend to take a central role in protein interaction networks; specifically, they act as hubs, interacting with multiple different partners across time and space, allowing for the co-ordination of many cellular activities. There appear to be three different modules within ID regions responsible for their functionally promiscuous behaviour: MoRFs (molecular recognition features), SLiMs (small linear motifs) and LCRs (low complexity regions). These regions allow for functionality such as engaging in the formation of dynamic heteromeric structures which can serve to increase local activity of an enzyme or store a collection of functionally related molecules for later use. However, the use of promiscuity does not come without a cost: a number of diseases that have been associated with ID-containing proteins seem to be caused by undesirable interactions occurring upon altered expression of the ID-containing protein.
Collapse
|
14
|
Jayaswal V, Schramm SJ, Mann GJ, Wilkins MR, Yang YH. VAN: an R package for identifying biologically perturbed networks via differential variability analysis. BMC Res Notes 2013; 6:430. [PMID: 24156242 PMCID: PMC4015612 DOI: 10.1186/1756-0500-6-430] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2013] [Accepted: 10/18/2013] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND Large-scale molecular interaction networks are dynamic in nature and are of special interest in the analysis of complex diseases, which are characterized by network-level perturbations rather than changes in individual genes/proteins. The methods developed for the identification of differentially expressed genes or gene sets are not suitable for network-level analyses. Consequently, bioinformatics approaches that enable a joint analysis of high-throughput transcriptomics datasets and large-scale molecular interaction networks for identifying perturbed networks are gaining popularity. Typically, these approaches require the sequential application of multiple bioinformatics techniques - ID mapping, network analysis, and network visualization. Here, we present the Variability Analysis in Networks (VAN) software package: a collection of R functions to streamline this bioinformatics analysis. FINDINGS VAN determines whether there are network-level perturbations across biological states of interest. It first identifies hubs (densely connected proteins/microRNAs) in a network and then uses them to extract network modules (comprising of a hub and all its interaction partners). The function identifySignificantHubs identifies dysregulated modules (i.e. modules with changes in expression correlation between a hub and its interaction partners) using a single expression and network dataset. The function summarizeHubData identifies dysregulated modules based on a meta-analysis of multiple expression and/or network datasets. VAN also converts protein identifiers present in a MITAB-formatted interaction network to gene identifiers (UniProt identifier to Entrez identifier or gene symbol using the function generatePpiMap) and generates microRNA-gene interaction networks using TargetScan and Microcosm databases (generateMicroRnaMap). The function obtainCancerInfo is used to identify hubs (corresponding to significantly perturbed modules) that are already causally associated with cancer(s) in the Cancer Gene Census database. Additionally, VAN supports the visualization of changes to network modules in R and Cytoscape (visualizeNetwork and obtainPairSubset, respectively). We demonstrate the utility of VAN using a gene expression data from metastatic melanoma and a protein-protein interaction network from the Human Protein Reference Database. CONCLUSIONS Our package provides a comprehensive and user-friendly platform for the integrative analysis of -omics data to identify disease-associated network modules. This bioinformatics approach, which is essentially focused on the question of explaining phenotype with a 'network type' and in particular, how regulation is changing among different states of interest, is relevant to many questions including those related to network perturbations across developmental timelines.
Collapse
Affiliation(s)
- Vivek Jayaswal
- School of Mathematics and Statistics, The University of Sydney, Sydney, NSW, Australia
| | - Sarah-Jane Schramm
- Westmead Millennium Institute for Medical Research, Sydney Medical School, The University of Sydney, Sydney, NSW, Australia
- Melanoma Institute Australia, Sydney, NSW, Australia
| | - Graham J Mann
- Westmead Millennium Institute for Medical Research, Sydney Medical School, The University of Sydney, Sydney, NSW, Australia
- Melanoma Institute Australia, Sydney, NSW, Australia
| | - Marc R Wilkins
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
- Systems Biology Initiative, University of New South Wales, Sydney, NSW, Australia
| | - Yee Hwa Yang
- School of Mathematics and Statistics, The University of Sydney, Sydney, NSW, Australia
- Melanoma Institute Australia, Sydney, NSW, Australia
| |
Collapse
|
15
|
Fearnley LG, Davis MJ, Ragan MA, Nielsen LK. Extracting reaction networks from databases-opening Pandora's box. Brief Bioinform 2013; 15:973-83. [PMID: 23946492 PMCID: PMC4239801 DOI: 10.1093/bib/bbt058] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Large quantities of information describing the mechanisms of biological pathways continue to be collected in publicly available databases. At the same time, experiments have increased in scale, and biologists increasingly use pathways defined in online databases to interpret the results of experiments and generate hypotheses. Emerging computational techniques that exploit the rich biological information captured in reaction systems require formal standardized descriptions of pathways to extract these reaction networks and avoid the alternative: time-consuming and largely manual literature-based network reconstruction. Here, we systematically evaluate the effects of commonly used knowledge representations on the seemingly simple task of extracting a reaction network describing signal transduction from a pathway database. We show that this process is in fact surprisingly difficult, and the pathway representations adopted by various knowledge bases have dramatic consequences for reaction network extraction, connectivity, capture of pathway crosstalk and in the modelling of cell-cell interactions. Researchers constructing computational models built from automatically extracted reaction networks must therefore consider the issues we outline in this review to maximize the value of existing pathway knowledge.
Collapse
|
16
|
Buljan M, Chalancon G, Dunker AK, Bateman A, Balaji S, Fuxreiter M, Babu MM. Alternative splicing of intrinsically disordered regions and rewiring of protein interactions. Curr Opin Struct Biol 2013; 23:443-50. [DOI: 10.1016/j.sbi.2013.03.006] [Citation(s) in RCA: 145] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2013] [Revised: 03/19/2013] [Accepted: 03/25/2013] [Indexed: 12/31/2022]
|
17
|
Csermely P, Korcsmáros T, Kiss HJM, London G, Nussinov R. Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review. Pharmacol Ther 2013; 138:333-408. [PMID: 23384594 PMCID: PMC3647006 DOI: 10.1016/j.pharmthera.2013.01.016] [Citation(s) in RCA: 506] [Impact Index Per Article: 46.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Accepted: 01/22/2013] [Indexed: 02/02/2023]
Abstract
Despite considerable progress in genome- and proteome-based high-throughput screening methods and in rational drug design, the increase in approved drugs in the past decade did not match the increase of drug development costs. Network description and analysis not only give a systems-level understanding of drug action and disease complexity, but can also help to improve the efficiency of drug design. We give a comprehensive assessment of the analytical tools of network topology and dynamics. The state-of-the-art use of chemical similarity, protein structure, protein-protein interaction, signaling, genetic interaction and metabolic networks in the discovery of drug targets is summarized. We propose that network targeting follows two basic strategies. The "central hit strategy" selectively targets central nodes/edges of the flexible networks of infectious agents or cancer cells to kill them. The "network influence strategy" works against other diseases, where an efficient reconfiguration of rigid networks needs to be achieved by targeting the neighbors of central nodes/edges. It is shown how network techniques can help in the identification of single-target, edgetic, multi-target and allo-network drug target candidates. We review the recent boom in network methods helping hit identification, lead selection optimizing drug efficacy, as well as minimizing side-effects and drug toxicity. Successful network-based drug development strategies are shown through the examples of infections, cancer, metabolic diseases, neurodegenerative diseases and aging. Summarizing >1200 references we suggest an optimized protocol of network-aided drug development, and provide a list of systems-level hallmarks of drug quality. Finally, we highlight network-related drug development trends helping to achieve these hallmarks by a cohesive, global approach.
Collapse
Affiliation(s)
- Peter Csermely
- Department of Medical Chemistry, Semmelweis University, P.O. Box 260, H-1444 Budapest 8, Hungary.
| | | | | | | | | |
Collapse
|
18
|
Colak R, Kim T, Michaut M, Sun M, Irimia M, Bellay J, Myers CL, Blencowe BJ, Kim PM. Distinct types of disorder in the human proteome: functional implications for alternative splicing. PLoS Comput Biol 2013; 9:e1003030. [PMID: 23633940 PMCID: PMC3635989 DOI: 10.1371/journal.pcbi.1003030] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2012] [Accepted: 02/26/2013] [Indexed: 01/07/2023] Open
Abstract
Intrinsically disordered regions have been associated with various cellular processes and are implicated in several human diseases, but their exact roles remain unclear. We previously defined two classes of conserved disordered regions in budding yeast, referred to as "flexible" and "constrained" conserved disorder. In flexible disorder, the property of disorder has been positionally conserved during evolution, whereas in constrained disorder, both the amino acid sequence and the property of disorder have been conserved. Here, we show that flexible and constrained disorder are widespread in the human proteome, and are particularly common in proteins with regulatory functions. Both classes of disordered sequences are highly enriched in regions of proteins that undergo tissue-specific (TS) alternative splicing (AS), but not in regions of proteins that undergo general (i.e., not tissue-regulated) AS. Flexible disorder is more highly enriched in TS alternative exons, whereas constrained disorder is more highly enriched in exons that flank TS alternative exons. These latter regions are also significantly more enriched in potential phosphosites and other short linear motifs associated with cell signaling. We further show that cancer driver mutations are significantly enriched in regions of proteins associated with TS and general AS. Collectively, our results point to distinct roles for TS alternative exons and flanking exons in the dynamic regulation of protein interaction networks in response to signaling activity, and they further suggest that alternatively spliced regions of proteins are often functionally altered by mutations responsible for cancer.
Collapse
Affiliation(s)
- Recep Colak
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - TaeHyung Kim
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Magali Michaut
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
| | - Mark Sun
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Manuel Irimia
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
| | - Jeremy Bellay
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Chad L. Myers
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Benjamin J. Blencowe
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
- * E-mail: (BJB); (PMK)
| | - Philip M. Kim
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, Minnesota, United States of America
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- * E-mail: (BJB); (PMK)
| |
Collapse
|
19
|
Diss G, Filteau M, Freschi L, Leducq JB, Rochette S, Torres-Quiroz F, Landry CR. Integrative avenues for exploring the dynamics and evolution of protein interaction networks. Curr Opin Biotechnol 2013; 24:775-83. [PMID: 23571097 DOI: 10.1016/j.copbio.2013.02.023] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2012] [Revised: 02/14/2013] [Accepted: 02/24/2013] [Indexed: 01/09/2023]
Abstract
Over the past decade, the study of protein interaction networks (PINs) has shed light on the organizing principles of living cells. However, PINs have been mostly mapped in one single condition. We outline three of the most promising avenues of investigation in this field, namely the study of first, how PINs are rewired by mutations and environmental perturbations; secondly, how inter-species interactions affect PIN achitectures; thirdly, what mechanisms and forces drive PIN evolution. These investigations will unravel the dynamics and condition dependence of PINs and will thus lead to a better functional annotation of network architecture. One major challenge to reach these goals is the integration of PINs with other cellular regulatory networks in the context of complex cellular phenotypes.
Collapse
Affiliation(s)
- Guillaume Diss
- Département de Biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), PROTEO, Université Laval, Québec, Canada G1V 0A6
| | | | | | | | | | | | | |
Collapse
|
20
|
Van Roey K, Dinkel H, Weatheritt RJ, Gibson TJ, Davey NE. The switches.ELM resource: a compendium of conditional regulatory interaction interfaces. Sci Signal 2013; 6:rs7. [PMID: 23550212 DOI: 10.1126/scisignal.2003345] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Short linear motifs (SLiMs) are protein interaction sites that play an important role in cell regulation by controlling protein activity, localization, and local abundance. The functionality of a SLiM can be modulated in a context-dependent manner to induce a gain, loss, or exchange of binding partners, which will affect the function of the SLiM-containing protein. As such, these conditional interactions underlie molecular decision-making in cell signaling. We identified multiple types of pre- and posttranslational switch mechanisms that can regulate the function of a SLiM and thereby control its interactions. The collected examples of experimentally characterized SLiM-based switch mechanisms were curated in the freely accessible switches.ELM resource (http://switches.elm.eu.org). On the basis of these examples, we defined and integrated rules to analyze SLiMs for putative regulatory switch mechanisms. We applied these rules to known validated SLiMs, providing evidence that more than half of these are likely to be pre- or posttranslationally regulated. In addition, we showed that posttranslationally modified sites are enriched around SLiMs, which enables cooperative and integrative regulation of protein interaction interfaces. We foresee switches.ELM complementing available resources to extend our knowledge of the molecular mechanisms underlying cell signaling.
Collapse
Affiliation(s)
- Kim Van Roey
- Structural and Computational Biology Unit, European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, D-69117 Heidelberg, Germany
| | | | | | | | | |
Collapse
|
21
|
Tretyakov K, Goldberg T, Jin VX, Horton P. Summary of talks and papers at ISCB-Asia/SCCG 2012. BMC Genomics 2013. [PMCID: PMC3639071 DOI: 10.1186/1471-2164-14-s2-i1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Abstract
The second ISCB-Asia conference of the International Society for Computational Biology took place December 17-19, 2012, in Shenzhen, China. The conference was co-hosted by BGI as the first Shenzhen Conference on Computational Genomics (SCCG).
45 talks were presented at ISCB-Asia/SCCG 2012. The topics covered included software tools, reproducible computing, next-generation sequencing data analysis, transcription and mRNA regulation, protein structure and function, cancer genomics and personalized medicine. Nine of the proceedings track talks are included as full papers in this supplement.
In this report we first give a short overview of the conference by listing some statistics and visualizing the talk abstracts as word clouds. Then we group the talks by topic and briefly summarize each one, providing references to related publications whenever possible. Finally, we close with a few comments on the success of this conference.
Collapse
|
22
|
Wertheim B, Beukeboom L, van de Zande L. Polyploidy in Animals: Effects of Gene Expression on Sex Determination, Evolution and Ecology. Cytogenet Genome Res 2013; 140:256-69. [DOI: 10.1159/000351998] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
|
23
|
Wang C, Taciroglu A, Maetschke SR, Nelson CC, Ragan MA, Davis MJ. mCOPA: analysis of heterogeneous features in cancer expression data. J Clin Bioinforma 2012; 2:22. [PMID: 23216803 PMCID: PMC3553066 DOI: 10.1186/2043-9113-2-22] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2012] [Accepted: 12/03/2012] [Indexed: 12/13/2022] Open
Abstract
Background Cancer outlier profile analysis (COPA) has proven to be an effective approach to analyzing cancer expression data, leading to the discovery of the TMPRSS2 and ETS family gene fusion events in prostate cancer. However, the original COPA algorithm did not identify down-regulated outliers, and the currently available R package implementing the method is similarly restricted to the analysis of over-expressed outliers. Here we present a modified outlier detection method, mCOPA, which contains refinements to the outlier-detection algorithm, identifies both over- and under-expressed outliers, is freely available, and can be applied to any expression dataset. Results We compare our method to other feature-selection approaches, and demonstrate that mCOPA frequently selects more-informative features than do differential expression or variance-based feature selection approaches, and is able to recover observed clinical subtypes more consistently. We demonstrate the application of mCOPA to prostate cancer expression data, and explore the use of outliers in clustering, pathway analysis, and the identification of tumour suppressors. We analyse the under-expressed outliers to identify known and novel prostate cancer tumour suppressor genes, validating these against data in Oncomine and the Cancer Gene Index. We also demonstrate how a combination of outlier analysis and pathway analysis can identify molecular mechanisms disrupted in individual tumours. Conclusions We demonstrate that mCOPA offers advantages, compared to differential expression or variance, in selecting outlier features, and that the features so selected are better able to assign samples to clinically annotated subtypes. Further, we show that the biology explored by outlier analysis differs from that uncovered in differential expression or variance analysis. mCOPA is an important new tool for the exploration of cancer datasets and the discovery of new cancer subtypes, and can be combined with pathway and functional analysis approaches to discover mechanisms underpinning heterogeneity in cancers.
Collapse
Affiliation(s)
- Chenwei Wang
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, 4072, Australia.
| | | | | | | | | | | |
Collapse
|
24
|
Burgess DJ. Alternative splicing: proteomic rewiring through transcriptomic diversity. Nat Rev Genet 2012; 13:518-9. [PMID: 22805699 DOI: 10.1038/nrg3288] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|