1
|
Huang CH, Zaenudin E, Tsai JJP, Kurubanjerdjit N, Dessie EY, Ng KL. Dissecting molecular network structures using a network subgraph approach. PeerJ 2020; 8:e9556. [PMID: 33005483 PMCID: PMC7512139 DOI: 10.7717/peerj.9556] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Accepted: 06/25/2020] [Indexed: 11/20/2022] Open
Abstract
Biological processes are based on molecular networks, which exhibit biological functions through interactions of genetic elements or proteins. This study presents a graph-based method to characterize molecular networks by decomposing the networks into directed multigraphs: network subgraphs. Spectral graph theory, reciprocity and complexity measures were used to quantify the network subgraphs. Graph energy, reciprocity and cyclomatic complexity can optimally specify network subgraphs with some degree of degeneracy. Seventy-one molecular networks were analyzed from three network types: cancer networks, signal transduction networks, and cellular processes. Molecular networks are built from a finite number of subgraph patterns and subgraphs with large graph energies are not present, which implies a graph energy cutoff. In addition, certain subgraph patterns are absent from the three network types. Thus, the Shannon entropy of the subgraph frequency distribution is not maximal. Furthermore, frequently-observed subgraphs are irreducible graphs. These novel findings warrant further investigation and may lead to important applications. Finally, we observed that cancer-related cellular processes are enriched with subgraph-associated driver genes. Our study provides a systematic approach for dissecting biological networks and supports the conclusion that there are organizational principles underlying molecular networks.
Collapse
Affiliation(s)
- Chien-Hung Huang
- Department of Computer Science and Information Engineering, National Formosa University, Yunlin, Taiwan
| | - Efendi Zaenudin
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung, Taiwan.,Research Center for Informatics, Indonesian Institute of Sciences, Bandung, Indonesia
| | - Jeffrey J P Tsai
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung, Taiwan
| | | | - Eskezeia Y Dessie
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung, Taiwan
| | - Ka-Lok Ng
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung, Taiwan.,Department of Medical Research, China Medical University Hospital, China Medical University, Taichung, Taiwan
| |
Collapse
|
2
|
Koutrouli M, Karatzas E, Paez-Espino D, Pavlopoulos GA. A Guide to Conquer the Biological Network Era Using Graph Theory. Front Bioeng Biotechnol 2020; 8:34. [PMID: 32083072 PMCID: PMC7004966 DOI: 10.3389/fbioe.2020.00034] [Citation(s) in RCA: 108] [Impact Index Per Article: 21.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2019] [Accepted: 01/15/2020] [Indexed: 12/24/2022] Open
Abstract
Networks are one of the most common ways to represent biological systems as complex sets of binary interactions or relations between different bioentities. In this article, we discuss the basic graph theory concepts and the various graph types, as well as the available data structures for storing and reading graphs. In addition, we describe several network properties and we highlight some of the widely used network topological features. We briefly mention the network patterns, motifs and models, and we further comment on the types of biological and biomedical networks along with their corresponding computer- and human-readable file formats. Finally, we discuss a variety of algorithms and metrics for network analyses regarding graph drawing, clustering, visualization, link prediction, perturbation, and network alignment as well as the current state-of-the-art tools. We expect this review to reach a very broad spectrum of readers varying from experts to beginners while encouraging them to enhance the field further.
Collapse
Affiliation(s)
- Mikaela Koutrouli
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
| | - Evangelos Karatzas
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
- Department of Informatics and Telecommunications, University of Athens, Athens, Greece
| | - David Paez-Espino
- Lawrence Berkeley National Laboratory, Department of Energy, Joint Genome Institute, Walnut Creek, CA, United States
| | | |
Collapse
|
3
|
Non-coding RNA regulatory networks. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2019; 1863:194417. [PMID: 31493559 DOI: 10.1016/j.bbagrm.2019.194417] [Citation(s) in RCA: 311] [Impact Index Per Article: 51.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Revised: 08/13/2019] [Accepted: 08/13/2019] [Indexed: 02/06/2023]
Abstract
It is well established that the vast majority of human RNA transcripts do not encode for proteins and that non-coding RNAs regulate cell physiology and shape cellular functions. A subset of them is involved in gene regulation at different levels, from epigenetic gene silencing to post-transcriptional regulation of mRNA stability. Notably, the aberrant expression of many non-coding RNAs has been associated with aggressive pathologies. Rapid advances in network biology indicates that the robustness of cellular processes is the result of specific properties of biological networks such as scale-free degree distribution and hierarchical modularity, suggesting that regulatory network analyses could provide new insights on gene regulation and dysfunction mechanisms. In this study we present an overview of public repositories where non-coding RNA-regulatory interactions are collected and annotated, we discuss unresolved questions for data integration and we recall existing resources to build and analyse networks.
Collapse
|
4
|
Cantone M, Küspert M, Reiprich S, Lai X, Eberhardt M, Göttle P, Beyer F, Azim K, Küry P, Wegner M, Vera J. A gene regulatory architecture that controls region-independent dynamics of oligodendrocyte differentiation. Glia 2019; 67:825-843. [PMID: 30730593 DOI: 10.1002/glia.23569] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2018] [Revised: 10/16/2018] [Accepted: 10/25/2018] [Indexed: 12/18/2022]
Abstract
Oligodendrocytes (OLs) facilitate information processing in the vertebrate central nervous system via axonal ensheathment. The structure and dynamics of the regulatory network that mediates oligodendrogenesis are poorly understood. We employed bioinformatics and meta-analysis of high-throughput datasets to reconstruct a regulatory network underpinning OL differentiation. From this network, we identified families of feedforward loops comprising the transcription factors (TFs) Olig2, Sox10, and Tcf7l2 and their targets. Among the targets, we found eight other TFs related to OL differentiation, suggesting a hierarchical architecture in which some TFs (Olig2, Sox10, and Tcf7l2) regulate via feedforward loops the expression of others (Sox2, Sox6, Sox11, Nkx2-2, Nkx6-2, Hes5, Myt1, and Myrf). Model simulations with a kinetic model reproduced the mechanisms of OL differentiation only when in the model, Sox10-mediated repression of Tcf7l2 by miR-338/miR-155 was introduced, a prediction confirmed in genetic functional experiments. Additional model simulations suggested that OLs from dorsal regions emerge through BMP/Sox9 signaling.
Collapse
Affiliation(s)
- Martina Cantone
- Laboratory of Systems Tumor Immunology, Hautklinik, Universitätsklinikum Erlangen and Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany.,Faculty of Mechanical Engineering, Specialty Division for Systems Biotechnology, Technische Universität München, Munich, Germany
| | - Melanie Küspert
- Institut für Biochemie, Emil-Fischer-Zentrum, Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany
| | - Simone Reiprich
- Institut für Biochemie, Emil-Fischer-Zentrum, Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany
| | - Xin Lai
- Laboratory of Systems Tumor Immunology, Hautklinik, Universitätsklinikum Erlangen and Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| | - Martin Eberhardt
- Laboratory of Systems Tumor Immunology, Hautklinik, Universitätsklinikum Erlangen and Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| | - Peter Göttle
- Neuroregeneration, Department of Neurology, Medical Faculty, Heinrich-Heine University, Düsseldorf, Germany
| | - Felix Beyer
- Neuroregeneration, Department of Neurology, Medical Faculty, Heinrich-Heine University, Düsseldorf, Germany
| | - Kasum Azim
- Neuroregeneration, Department of Neurology, Medical Faculty, Heinrich-Heine University, Düsseldorf, Germany
| | - Patrick Küry
- Neuroregeneration, Department of Neurology, Medical Faculty, Heinrich-Heine University, Düsseldorf, Germany
| | - Michael Wegner
- Institut für Biochemie, Emil-Fischer-Zentrum, Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany
| | - Julio Vera
- Laboratory of Systems Tumor Immunology, Hautklinik, Universitätsklinikum Erlangen and Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| |
Collapse
|
5
|
Zhang J, Duy Le T, Liu L, He J, Li J. Identifying miRNA synergistic regulatory networks in heterogeneous human data via network motifs. MOLECULAR BIOSYSTEMS 2016; 12:454-63. [PMID: 26660849 DOI: 10.1039/c5mb00562k] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Understanding the synergism of multiple microRNAs (miRNAs) in gene regulation can provide important insights into the mechanisms of complex human diseases caused by miRNA regulation. Therefore, it is important to identify miRNA synergism and study miRNA characteristics in miRNA synergistic regulatory networks. A number of methods have been proposed to identify miRNA synergism. However, most of the methods only use downstream target genes of miRNAs to infer miRNA synergism when miRNAs can also be regulated by upstream transcription factors (TFs) at the transcriptional level. Additionally, most methods are based on statistical associations identified from data without considering the causal nature of gene regulation. In this paper, we present a causality based framework, called mirSRN (miRNA synergistic regulatory network), to infer miRNA synergism in human molecular systems by considering both downstream miRNA targets and upstream TF regulation. We apply the proposed framework to two real world datasets and discover that almost all the top 10 miRNAs with the largest node degree in the mirSRNs are associated with different human diseases, including cancer, and that the mirSRNs are approximately scale-free and small-world networks. We also find that most miRNAs in the networks are frequently synergistic with other miRNAs, and miRNAs related to the same disease are likely to be synergistic and in a cluster linked to a biological function. Synergistic miRNA pairs show higher co-expression level, and may have potential functional relationships indicating collaboration between the miRNAs. Functional validation of the identified synergistic miRNAs demonstrates that these miRNAs cause different kinds of diseases. These results deepen our understanding of the biological meaning of miRNA synergism.
Collapse
Affiliation(s)
- Junpeng Zhang
- School of Engineering, Dali University, Dali, Yunnan 671003, P. R. China.
| | - Thuc Duy Le
- School of Information Technology and Mathematical Sciences, University of South Australia, Mawson Lakes, SA 5095, Australia.
| | - Lin Liu
- School of Information Technology and Mathematical Sciences, University of South Australia, Mawson Lakes, SA 5095, Australia.
| | - Jianfeng He
- Institute of Biomedical Engineering, Kunming University of Science and Technology, Kunming, Yunnan 650500, P. R. China
| | - Jiuyong Li
- School of Information Technology and Mathematical Sciences, University of South Australia, Mawson Lakes, SA 5095, Australia.
| |
Collapse
|
6
|
Zhang J, Le TD, Liu L, He J, Li J. A novel framework for inferring condition-specific TF and miRNA co-regulation of protein-protein interactions. Gene 2015; 577:55-64. [PMID: 26611531 DOI: 10.1016/j.gene.2015.11.023] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2015] [Revised: 10/16/2015] [Accepted: 11/17/2015] [Indexed: 12/11/2022]
Abstract
Recent studies have shown that transcription factors (TFs) and microRNAs (miRNAs), while independently regulate their downstream targets, collaborate with each other to regulate gene expression. However, their synergistic roles in protein-protein interactions (PPIs) remain mostly unknown. In this paper, we present a novel framework (called CoRePPI) for inferring TF and miRNA co-regulation of PPIs. Particularly, CoRePPI is aimed at discovering the co-regulation specific to a condition of interest, by using heterogeneous data, including miRNA and messenger RNA (mRNA) expression profiles, putative miRNA targets, TF targets and PPIs. CoRePPI firstly finds the network motifs indicating the co-regulation of PPIs by TFs and miRNAs in tumor and normal conditions separately. Then by identifying the differential motifs found in one condition but not in the other, it builds the networks consisting of TFs, miRNAs and their co-regulated PPIs specific to different conditions respectively. To validate CoRePPI, we apply it to the Pan-Cancer dataset which includes the expression profiles of 12 cancer types from TCGA. Through network topology analysis, we found that the tumor and normal CoRePPI networks are scale-free. Furthermore, the results of differential and intersected network analysis between the tumor and normal CoRePPI networks suggest that only a small fraction of the regulatory relationships between TFs and miRNAs are conserved in both conditions but they co-regulate different downstream PPIs in tumor and normal conditions; and in different conditions the majority of the regulatory relationships between TFs and miRNAs are different although they may regulate the same PPIs in their respective conditions. The CoRePPI sub-networks constructed for the three types of cancers (breast cancer, lung cancer and ovarian cancer) are all scale-free, and the intersection of these CoRePPI sub-networks can be utilized as the biomarker CoRePPI sub-network of the three types of cancers. The PPI enrichment analyses of the tumor and normal CoRePPI networks suggest that the co-regulating TFs and miRNAs are significantly associated with the specific biological processes, diseases and pathways. In addition, comparing with the two non-condition-specific approaches, the tumor CoRePPI network is found to have the most enriched cancer-related PPIs. Altogether, the results uncover the combined regulatory patterns of TFs and miRNAs on the PPIs, and may provide new insights for research in cancer-associated TFs and miRNAs.
Collapse
Affiliation(s)
- Junpeng Zhang
- School of Engineering, Dali University, Dali, Yunnan 671003, China.
| | - Thuc Duy Le
- School of Information Technology and Mathematical Sciences, University of South Australia, Mawson Lakes, SA 5095, Australia
| | - Lin Liu
- School of Information Technology and Mathematical Sciences, University of South Australia, Mawson Lakes, SA 5095, Australia
| | - Jianfeng He
- School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, Yunnan, 650500, China
| | - Jiuyong Li
- School of Information Technology and Mathematical Sciences, University of South Australia, Mawson Lakes, SA 5095, Australia.
| |
Collapse
|
7
|
Hasan MM, Kahveci T. Indexing a protein-protein interaction network expedites network alignment. BMC Bioinformatics 2015; 16:326. [PMID: 26453444 PMCID: PMC4600321 DOI: 10.1186/s12859-015-0756-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2015] [Accepted: 09/30/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Network query problem aligns a small query network with an arbitrarily large target network. The complexity of this problem grows exponentially with the number of nodes in the query network if confidence in the optimality of result is desired. Scaling this problem to large query and target networks remains to be a challenge. RESULTS In this article, we develop a novel index structure that dramatically reduces the cost of the network query problem. Our index structure maintains a small set of reference networks where each reference network is a small, carefully chosen subnetwork from the target network. Along with each reference, we also store all of its non-overlapping and statistically significant alignments with the target network. Given a query network, we first align the query with the reference networks. If the alignment with a reference network yields a sufficiently large score, we compute an upper-bound to the alignment score between the query and the target using the alignments of that reference and the target (which is stored in our index). If the upper-bound is large enough, we employ a second round of alignment between the query and the target by respecting the mapping found in the first alignment. Our experiments on protein-protein interaction networks demonstrate that our index achieves a significant speed-up in running time over the state-of-the-art methods such as ColT. The alignment subnetworks obtained by our method are also statistically significant. Finally, we observe that our method finds biologically and statistically significant alignments across multiple species. CONCLUSIONS We developed a reference network based indexing structure that accelerates network query and produces functionally and statistically significant results.
Collapse
Affiliation(s)
- Md Mahmudul Hasan
- Department of Computer & Information Science and Engineering, University of Florida, Gainesville FL, 32611, USA.
| | - Tamer Kahveci
- Department of Computer & Information Science and Engineering, University of Florida, Gainesville FL, 32611, USA.
| |
Collapse
|
8
|
Rinnone F, Micale G, Bonnici V, Bader GD, Shasha D, Ferro A, Pulvirenti A, Giugno R. NetMatchStar: an enhanced Cytoscape network querying app. F1000Res 2015; 4:479. [PMID: 26594341 DOI: 10.12688/f1000research.6656.1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/20/2015] [Indexed: 02/03/2023] Open
Abstract
We present NetMatchStar, a Cytoscape app to find all the occurrences of a query graph in a network and check for its significance as a motif with respect to seven different random models. The query can be uploaded or built from scratch using Cytoscape facilities. The app significantly enhances the previous NetMatch in style, performance and functionality. Notably NetMatchStar allows queries with wildcards.
Collapse
Affiliation(s)
- Fabio Rinnone
- Department of Math and Computer Science, University of Catania, Catania, 95125, Italy
| | - Giovanni Micale
- Department of Math and Computer Science, University of Catania, Catania, 95125, Italy
| | - Vincenzo Bonnici
- Department of Computer Science, University of Verona, Verona, 37134, Italy
| | - Gary D Bader
- The Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada
| | - Dennis Shasha
- Department of Computer Science, Courant Institute of Mathematical Sciences, New York University, New York, NY, 10012, USA
| | - Alfredo Ferro
- Department of Clinical and Experimental Medicine, University of Catania, Catania, 95125, Italy
| | - Alfredo Pulvirenti
- Department of Clinical and Experimental Medicine, University of Catania, Catania, 95125, Italy
| | - Rosalba Giugno
- Department of Clinical and Experimental Medicine, University of Catania, Catania, 95125, Italy
| |
Collapse
|
9
|
Rinnone F, Micale G, Bonnici V, Bader GD, Shasha D, Ferro A, Pulvirenti A, Giugno R. NetMatchStar: an enhanced Cytoscape network querying app. F1000Res 2015; 4:479. [PMID: 26594341 PMCID: PMC4642848 DOI: 10.12688/f1000research.6656.2] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 09/25/2015] [Indexed: 02/03/2023] Open
Abstract
We present NetMatchStar, a Cytoscape app to find all the occurrences of a query graph in a network and check for its significance as a motif with respect to seven different random models. The query can be uploaded or built from scratch using Cytoscape facilities. The app significantly enhances the previous NetMatch in style, performance and functionality. Notably NetMatchStar allows queries with wildcards.
Collapse
Affiliation(s)
- Fabio Rinnone
- Department of Math and Computer Science, University of Catania, Catania, 95125, Italy
| | - Giovanni Micale
- Department of Math and Computer Science, University of Catania, Catania, 95125, Italy
| | - Vincenzo Bonnici
- Department of Computer Science, University of Verona, Verona, 37134, Italy
| | - Gary D Bader
- The Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada
| | - Dennis Shasha
- Department of Computer Science, Courant Institute of Mathematical Sciences, New York University, New York, NY, 10012, USA
| | - Alfredo Ferro
- Department of Clinical and Experimental Medicine, University of Catania, Catania, 95125, Italy
| | - Alfredo Pulvirenti
- Department of Clinical and Experimental Medicine, University of Catania, Catania, 95125, Italy
| | - Rosalba Giugno
- Department of Clinical and Experimental Medicine, University of Catania, Catania, 95125, Italy
| |
Collapse
|
10
|
Micale G, Ferro A, Pulvirenti A, Giugno R. SPECTRA: An Integrated Knowledge Base for Comparing Tissue and Tumor-Specific PPI Networks in Human. Front Bioeng Biotechnol 2015; 3:58. [PMID: 26005672 PMCID: PMC4424906 DOI: 10.3389/fbioe.2015.00058] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Accepted: 04/17/2015] [Indexed: 12/11/2022] Open
Abstract
Protein–protein interaction (PPI) networks available in public repositories usually represent relationships between proteins within the cell. They ignore the specific set of tissues or tumors where the interactions take place. Indeed, proteins can form tissue-selective complexes, while they remain inactive in other tissues. For these reasons, a great attention has been recently paid to tissue-specific PPI networks, in which nodes are proteins of the global PPI network whose corresponding genes are preferentially expressed in specific tissues. In this paper, we present SPECTRA, a knowledge base to build and compare tissue or tumor-specific PPI networks. SPECTRA integrates gene expression and protein interaction data from the most authoritative online repositories. We also provide tools for visualizing and comparing such networks, in order to identify the expression and interaction changes of proteins across tissues, or between the normal and pathological states of the same tissue. SPECTRA is available as a web server at http://alpha.dmi.unict.it/spectra.
Collapse
Affiliation(s)
- Giovanni Micale
- Department of Computer Science, University of Pisa , Pisa , Italy
| | - Alfredo Ferro
- Department of Clinical and Molecular Biomedicine, University of Catania , Catania , Italy
| | - Alfredo Pulvirenti
- Department of Clinical and Molecular Biomedicine, University of Catania , Catania , Italy
| | - Rosalba Giugno
- Department of Clinical and Molecular Biomedicine, University of Catania , Catania , Italy
| |
Collapse
|
11
|
Ryall KA, Tan AC. Systems biology approaches for advancing the discovery of effective drug combinations. J Cheminform 2015; 7:7. [PMID: 25741385 PMCID: PMC4348553 DOI: 10.1186/s13321-015-0055-9] [Citation(s) in RCA: 103] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2014] [Accepted: 02/02/2015] [Indexed: 01/23/2023] Open
Abstract
Complex diseases like cancer are regulated by large, interconnected networks with many pathways affecting cell proliferation, invasion, and drug resistance. However, current cancer therapy predominantly relies on the reductionist approach of one gene-one disease. Combinations of drugs may overcome drug resistance by limiting mutations and induction of escape pathways, but given the enormous number of possible drug combinations, strategies to reduce the search space and prioritize experiments are needed. In this review, we focus on the use of computational modeling, bioinformatics and high-throughput experimental methods for discovery of drug combinations. We highlight cutting-edge systems approaches, including large-scale modeling of cell signaling networks, network motif analysis, statistical association-based models, identifying correlations in gene signatures, functional genomics, and high-throughput combination screens. We also present a list of publicly available data and resources to aid in discovery of drug combinations. Integration of these systems approaches will enable faster discovery and translation of clinically relevant drug combinations. Spectrum of Systems Biology Approaches for Drug Combinations. ![]()
Collapse
Affiliation(s)
- Karen A Ryall
- Translational Bioinformatics and Cancer Systems Biology Laboratory, Division of Medical Oncology, Department of Medicine, School of Medicine, University of Colorado Anschutz Medical Campus, 12801 E.17th Ave., L18-8116, Aurora, CO 80045 USA
| | - Aik Choon Tan
- Translational Bioinformatics and Cancer Systems Biology Laboratory, Division of Medical Oncology, Department of Medicine, School of Medicine, University of Colorado Anschutz Medical Campus, 12801 E.17th Ave., L18-8116, Aurora, CO 80045 USA ; Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO USA ; Department of Computer Science and Engineering, Korea University, Seoul, South Korea
| |
Collapse
|
12
|
Hsieh WT, Tzeng KR, Ciou JS, Tsai JJ, Kurubanjerdjit N, Huang CH, Ng KL. Transcription factor and microRNA-regulated network motifs for cancer and signal transduction networks. BMC SYSTEMS BIOLOGY 2015; 9 Suppl 1:S5. [PMID: 25707690 PMCID: PMC4331680 DOI: 10.1186/1752-0509-9-s1-s5] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Abstract Conclusions To validate the role of network motifs in cancer formation, several examples are presented which demonstrated the effectiveness of the present approach. A web-based platform has been set up which can be accessed at: http://ppi.bioinfo.asia.edu.tw/pathway/. It is very likely that our results can supply very specific CMS missing information for certain cancer types, it is an indispensable tool for cancer biology research.
Collapse
|
13
|
Cardelli L. Morphisms of reaction networks that couple structure to function. BMC SYSTEMS BIOLOGY 2014; 8:84. [PMID: 25128194 PMCID: PMC4236760 DOI: 10.1186/1752-0509-8-84] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/25/2014] [Accepted: 07/04/2014] [Indexed: 11/10/2022]
Abstract
BACKGROUND The mechanisms underlying complex biological systems are routinely represented as networks. Network kinetics is widely studied, and so is the connection between network structure and behavior. However, similarity of mechanism is better revealed by relationships between network structures. RESULTS We define morphisms (mappings) between reaction networks that establish structural connections between them. Some morphisms imply kinetic similarity, and yet their properties can be checked statically on the structure of the networks. In particular we can determine statically that a complex network will emulate a simpler network: it will reproduce its kinetics for all corresponding choices of reaction rates and initial conditions. We use this property to relate the kinetics of many common biological networks of different sizes, also relating them to a fundamental population algorithm. CONCLUSIONS Structural similarity between reaction networks can be revealed by network morphisms, elucidating mechanistic and functional aspects of complex networks in terms of simpler networks.
Collapse
Affiliation(s)
- Luca Cardelli
- Microsoft Research, 21 Station Road, Cambridge CB1 2FB, UK.
| |
Collapse
|
14
|
Micale G, Pulvirenti A, Giugno R, Ferro A. GASOLINE: a Greedy And Stochastic algorithm for optimal Local multiple alignment of Interaction NEtworks. PLoS One 2014; 9:e98750. [PMID: 24911103 PMCID: PMC4049608 DOI: 10.1371/journal.pone.0098750] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2014] [Accepted: 05/07/2014] [Indexed: 11/19/2022] Open
Abstract
The analysis of structure and dynamics of biological networks plays a central role in understanding the intrinsic complexity of biological systems. Biological networks have been considered a suitable formalism to extend evolutionary and comparative biology. In this paper we present GASOLINE, an algorithm for multiple local network alignment based on statistical iterative sampling in connection to a greedy strategy. GASOLINE overcomes the limits of current approaches by producing biologically significant alignments within a feasible running time, even for very large input instances. The method has been extensively tested on a database of real and synthetic biological networks. A comprehensive comparison with state-of-the art algorithms clearly shows that GASOLINE yields the best results in terms of both reliability of alignments and running time on real biological networks and results comparable in terms of quality of alignments on synthetic networks. GASOLINE has been developed in Java, and is available, along with all the computed alignments, at the following URL: http://ferrolab.dmi.unict.it/gasoline/gasoline.html.
Collapse
Affiliation(s)
- Giovanni Micale
- Department of Computer Science, University of Pisa, Pisa, Italy
| | - Alfredo Pulvirenti
- Department of Clinical and Molecular Biomedicine, University of Catania, Catania, Italy
- * E-mail:
| | - Rosalba Giugno
- Department of Clinical and Molecular Biomedicine, University of Catania, Catania, Italy
| | - Alfredo Ferro
- Department of Clinical and Molecular Biomedicine, University of Catania, Catania, Italy
| |
Collapse
|
15
|
Panni S, Rombo SE. Searching for repetitions in biological networks: methods, resources and tools. Brief Bioinform 2013; 16:118-36. [PMID: 24300112 DOI: 10.1093/bib/bbt084] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
We present here a compact overview of the data, models and methods proposed for the analysis of biological networks based on the search for significant repetitions. In particular, we concentrate on three problems widely studied in the literature: 'network alignment', 'network querying' and 'network motif extraction'. We provide (i) details of the experimental techniques used to obtain the main types of interaction data, (ii) descriptions of the models and approaches introduced to solve such problems and (iii) pointers to both the available databases and software tools. The intent is to lay out a useful roadmap for identifying suitable strategies to analyse cellular data, possibly based on the joint use of different interaction data types or analysis techniques.
Collapse
|
16
|
Manczinger M, Kemény L. Novel factors in the pathogenesis of psoriasis and potential drug candidates are found with systems biology approach. PLoS One 2013; 8:e80751. [PMID: 24303025 PMCID: PMC3841158 DOI: 10.1371/journal.pone.0080751] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2013] [Accepted: 10/15/2013] [Indexed: 01/12/2023] Open
Abstract
Psoriasis is a multifactorial inflammatory skin disease characterized by increased proliferation of keratinocytes, activation of immune cells and susceptibility to metabolic syndrome. Systems biology approach makes it possible to reveal novel important factors in the pathogenesis of the disease. Protein-protein, protein-DNA, merged (containing both protein-protein and protein-DNA interactions) and chemical-protein interaction networks were constructed consisting of differentially expressed genes (DEG) between lesional and non-lesional skin samples of psoriatic patients and/or the encoded proteins. DEGs were determined by microarray meta-analysis using MetaOMICS package. We used STRING for protein-protein, CisRED for protein-DNA and STITCH for chemical-protein interaction network construction. General network-, cluster- and motif-analysis were carried out in each network. Many DEG-coded proteins (CCNA2, FYN, PIK3R1, CTGF, F3) and transcription factors (AR, TFDP1, MEF2A, MECOM) were identified as central nodes, suggesting their potential role in psoriasis pathogenesis. CCNA2, TFDP1 and MECOM might play role in the hyperproliferation of keratinocytes, whereas FYN may be involved in the disturbed immunity in psoriasis. AR can be an important link between inflammation and insulin resistance, while MEF2A has role in insulin signaling. A controller sub-network was constructed from interlinked positive feedback loops that with the capability to maintain psoriatic lesional phenotype. Analysis of chemical-protein interaction networks detected 34 drugs with previously confirmed disease-modifying effects, 23 drugs with some experimental evidences, and 21 drugs with case reports suggesting their positive or negative effects. In addition, 99 unpublished drug candidates were also found, that might serve future treatments for psoriasis.
Collapse
Affiliation(s)
- Máté Manczinger
- Department of Dermatology and Allergology, University of Szeged, Szeged, Hungary
- * E-mail:
| | - Lajos Kemény
- Department of Dermatology and Allergology, University of Szeged, Szeged, Hungary
- Dermatological Research Group of the Hungarian Academy of Sciences, Szeged, Hungary
| |
Collapse
|
17
|
Giugno R, Bonnici V, Bombieri N, Pulvirenti A, Ferro A, Shasha D. GRAPES: a software for parallel searching on biological graphs targeting multi-core architectures. PLoS One 2013; 8:e76911. [PMID: 24167551 PMCID: PMC3805575 DOI: 10.1371/journal.pone.0076911] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2013] [Accepted: 08/26/2013] [Indexed: 11/19/2022] Open
Abstract
Biological applications, from genomics to ecology, deal with graphs that represents the structure of interactions. Analyzing such data requires searching for subgraphs in collections of graphs. This task is computationally expensive. Even though multicore architectures, from commodity computers to more advanced symmetric multiprocessing (SMP), offer scalable computing power, currently published software implementations for indexing and graph matching are fundamentally sequential. As a consequence, such software implementations (i) do not fully exploit available parallel computing power and (ii) they do not scale with respect to the size of graphs in the database. We present GRAPES, software for parallel searching on databases of large biological graphs. GRAPES implements a parallel version of well-established graph searching algorithms, and introduces new strategies which naturally lead to a faster parallel searching system especially for large graphs. GRAPES decomposes graphs into subcomponents that can be efficiently searched in parallel. We show the performance of GRAPES on representative biological datasets containing antiviral chemical compounds, DNA, RNA, proteins, protein contact maps and protein interactions networks.
Collapse
Affiliation(s)
- Rosalba Giugno
- Department Clinical and Molecular Biomedicine, University of Catania, Catania, Italy
- * E-mail:
| | - Vincenzo Bonnici
- Department Computer Science, University of Verona, Verona, Italy
| | - Nicola Bombieri
- Department Computer Science, University of Verona, Verona, Italy
| | - Alfredo Pulvirenti
- Department Clinical and Molecular Biomedicine, University of Catania, Catania, Italy
| | - Alfredo Ferro
- Department Clinical and Molecular Biomedicine, University of Catania, Catania, Italy
| | - Dennis Shasha
- Courant Institute of Mathematical Sciences, New York University, New York, New York, United States of America
| |
Collapse
|
18
|
Abstract
Motivation: BioPAX is a standard language for representing complex cellular processes, including metabolic networks, signal transduction and gene regulation. Owing to the inherent complexity of a BioPAX model, searching for a specific type of subnetwork can be non-trivial and difficult. Results: We developed an open source and extensible framework for defining and searching graph patterns in BioPAX models. We demonstrate its use with a sample pattern that captures directed signaling relations between proteins. We provide search results for the pattern obtained from the Pathway Commons database and compare these results with the current data in signaling databases SPIKE and SignaLink. Results show that a pattern search in public pathway data can identify a substantial amount of signaling relations that do not exist in signaling databases. Availability: BioPAX-pattern software was developed in Java. Source code and documentation is freely available at http://code.google.com/p/biopax-pattern under Lesser GNU Public License. Contact:patternsearch@cbio.mskcc.org Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Özgün Babur
- Computational Biology Center, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, USA, Tri-Institutional Training Program in Computational Biology and Medicine, New York, NY 10065, USA and Banting and Best Department of Medical Research, The Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | | | | | | | | | | |
Collapse
|
19
|
RMOD: a tool for regulatory motif detection in signaling network. PLoS One 2013; 8:e68407. [PMID: 23874612 PMCID: PMC3710000 DOI: 10.1371/journal.pone.0068407] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2012] [Accepted: 05/29/2013] [Indexed: 11/21/2022] Open
Abstract
Regulatory motifs are patterns of activation and inhibition that appear repeatedly in various signaling networks and that show specific regulatory properties. However, the network structures of regulatory motifs are highly diverse and complex, rendering their identification difficult. Here, we present a RMOD, a web-based system for the identification of regulatory motifs and their properties in signaling networks. RMOD finds various network structures of regulatory motifs by compressing the signaling network and detecting the compressed forms of regulatory motifs. To apply it into a large-scale signaling network, it adopts a new subgraph search algorithm using a novel data structure called path-tree, which is a tree structure composed of isomorphic graphs of query regulatory motifs. This algorithm was evaluated using various sizes of signaling networks generated from the integration of various human signaling pathways and it showed that the speed and scalability of this algorithm outperforms those of other algorithms. RMOD includes interactive analysis and auxiliary tools that make it possible to manipulate the whole processes from building signaling network and query regulatory motifs to analyzing regulatory motifs with graphical illustration and summarized descriptions. As a result, RMOD provides an integrated view of the regulatory motifs and mechanism underlying their regulatory motif activities within the signaling network. RMOD is freely accessible online at the following URL: http://pks.kaist.ac.kr/rmod.
Collapse
|
20
|
Csermely P, Korcsmáros T, Kiss HJM, London G, Nussinov R. Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review. Pharmacol Ther 2013; 138:333-408. [PMID: 23384594 PMCID: PMC3647006 DOI: 10.1016/j.pharmthera.2013.01.016] [Citation(s) in RCA: 522] [Impact Index Per Article: 43.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Accepted: 01/22/2013] [Indexed: 02/02/2023]
Abstract
Despite considerable progress in genome- and proteome-based high-throughput screening methods and in rational drug design, the increase in approved drugs in the past decade did not match the increase of drug development costs. Network description and analysis not only give a systems-level understanding of drug action and disease complexity, but can also help to improve the efficiency of drug design. We give a comprehensive assessment of the analytical tools of network topology and dynamics. The state-of-the-art use of chemical similarity, protein structure, protein-protein interaction, signaling, genetic interaction and metabolic networks in the discovery of drug targets is summarized. We propose that network targeting follows two basic strategies. The "central hit strategy" selectively targets central nodes/edges of the flexible networks of infectious agents or cancer cells to kill them. The "network influence strategy" works against other diseases, where an efficient reconfiguration of rigid networks needs to be achieved by targeting the neighbors of central nodes/edges. It is shown how network techniques can help in the identification of single-target, edgetic, multi-target and allo-network drug target candidates. We review the recent boom in network methods helping hit identification, lead selection optimizing drug efficacy, as well as minimizing side-effects and drug toxicity. Successful network-based drug development strategies are shown through the examples of infections, cancer, metabolic diseases, neurodegenerative diseases and aging. Summarizing >1200 references we suggest an optimized protocol of network-aided drug development, and provide a list of systems-level hallmarks of drug quality. Finally, we highlight network-related drug development trends helping to achieve these hallmarks by a cohesive, global approach.
Collapse
Affiliation(s)
- Peter Csermely
- Department of Medical Chemistry, Semmelweis University, P.O. Box 260, H-1444 Budapest 8, Hungary.
| | | | | | | | | |
Collapse
|
21
|
Bonnici V, Giugno R, Pulvirenti A, Shasha D, Ferro A. A subgraph isomorphism algorithm and its application to biochemical data. BMC Bioinformatics 2013; 14 Suppl 7:S13. [PMID: 23815292 PMCID: PMC3633016 DOI: 10.1186/1471-2105-14-s7-s13] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Background Graphs can represent biological networks at the molecular, protein, or species level. An important query is to find all matches of a pattern graph to a target graph. Accomplishing this is inherently difficult (NP-complete) and the efficiency of heuristic algorithms for the problem may depend upon the input graphs. The common aim of existing algorithms is to eliminate unsuccessful mappings as early as and as inexpensively as possible. Results We propose a new subgraph isomorphism algorithm which applies a search strategy to significantly reduce the search space without using any complex pruning rules or domain reduction procedures. We compare our method with the most recent and efficient subgraph isomorphism algorithms (VFlib, LAD, and our C++ implementation of FocusSearch which was originally distributed in Modula2) on synthetic, molecules, and interaction networks data. We show a significant reduction in the running time of our approach compared with these other excellent methods and show that our algorithm scales well as memory demands increase. Conclusions Subgraph isomorphism algorithms are intensively used by biochemical tools. Our analysis gives a comprehensive comparison of different software approaches to subgraph isomorphism highlighting their weaknesses and strengths. This will help researchers make a rational choice among methods depending on their application. We also distribute an open-source package including our system and our own C++ implementation of FocusSearch together with all the used datasets (http://ferrolab.dmi.unict.it/ri.html). In future work, our findings may be extended to approximate subgraph isomorphism algorithms.
Collapse
Affiliation(s)
- Vincenzo Bonnici
- Dept. Computer Science - University of Verona, Verona, 37134, Italy
| | | | | | | | | |
Collapse
|
22
|
Gonçalves E, van Iersel M, Saez-Rodriguez J. CySBGN: a Cytoscape plug-in to integrate SBGN maps. BMC Bioinformatics 2013; 14:17. [PMID: 23324051 PMCID: PMC3599859 DOI: 10.1186/1471-2105-14-17] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2012] [Accepted: 12/27/2012] [Indexed: 12/23/2022] Open
Abstract
Background A standard graphical notation is essential to facilitate exchange of network representations of biological processes. Towards this end, the Systems Biology Graphical Notation (SBGN) has been proposed, and it is already supported by a number of tools. However, support for SBGN in Cytoscape, one of the most widely used platforms in biology to visualise and analyse networks, is limited, and in particular it is not possible to import SBGN diagrams. Results We have developed CySBGN, a Cytoscape plug-in that extends the use of Cytoscape visualisation and analysis features to SBGN maps. CySBGN adds support for Cytoscape users to visualize any of the three complementary SBGN languages: Process Description, Entity Relationship, and Activity Flow. The interoperability with other tools (CySBML plug-in and Systems Biology Format Converter) was also established allowing an automated generation of SBGN diagrams based on previously imported SBML models. The plug-in was tested using a suite of 53 different test cases that covers almost all possible entities, shapes, and connections. A rendering comparison with other tools that support SBGN was performed. To illustrate the interoperability with other Cytoscape functionalities, we present two analysis examples, shortest path calculation, and motif identification in a metabolic network. Conclusions CySBGN imports, modifies and analyzes SBGN diagrams in Cytoscape, and thus allows the application of the large palette of tools and plug-ins in this platform to networks and pathways in SBGN format.
Collapse
|
23
|
Abstract
SUMMARY CySBML is a plugin designed to work with Systems Biology Markup Language (SBML) in Cytoscape having the following features: SBML import, support of the SBML layout and qualitative model packages, navigation in network layouts based on SBML structure, access to MIRIAM and SBO-based annotations and SBML validation. CySBML includes an importer for BioModels to load SBML from standard repositories. AVAILABILITY AND IMPLEMENTATION Freely available for non-commercial purposes through the Cytoscape plugin manager or for download at http://sourceforge.net/projects/cysbml/. CONTACT cysbml-team@lists.sourceforge.net SUPPLEMENTARY INFORMATION Tutorial, usage guide, installation instructions and additional figures are available for download at http://www.charite.de/sysbio/people/koenig/software/cysbml/.
Collapse
Affiliation(s)
- Matthias König
- Department of Biochemistry, University Medicine Charité Berlin, 13347, Berlin, Germany.
| | | | | |
Collapse
|
24
|
Sahraeian SME, Yoon BJ. RESQUE: network reduction using semi-Markov random walk scores for efficient querying of biological networks. ACTA ACUST UNITED AC 2012; 28:2129-36. [PMID: 22730436 DOI: 10.1093/bioinformatics/bts341] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
MOTIVATION Recent technological advances in measuring molecular interactions have resulted in an increasing number of large-scale biological networks. Translation of these enormous network data into meaningful biological insights requires efficient computational techniques that can unearth the biological information that is encoded in the networks. One such example is network querying, which aims to identify similar subnetwork regions in a large target network that are similar to a given query network. Network querying tools can be used to identify novel biological pathways that are homologous to known pathways, thereby enabling knowledge transfer across different organisms. RESULTS In this article, we introduce an efficient algorithm for querying large-scale biological networks, called RESQUE. The proposed algorithm adopts a semi-Markov random walk (SMRW) model to probabilistically estimate the correspondence scores between nodes that belong to different networks. The target network is iteratively reduced based on the estimated correspondence scores, which are also iteratively re-estimated to improve accuracy until the best matching subnetwork emerges. We demonstrate that the proposed network querying scheme is computationally efficient, can handle any network query with an arbitrary topology and yields accurate querying results. AVAILABILITY The source code of RESQUE is freely available at http://www.ece.tamu.edu/~bjyoon/RESQUE/
Collapse
|
25
|
Li Y, Gong P, Perkins EJ, Zhang C, Wang N. RefNetBuilder: a platform for construction of integrated reference gene regulatory networks from expressed sequence tags. BMC Bioinformatics 2011; 12 Suppl 10:S20. [PMID: 22166047 PMCID: PMC3236843 DOI: 10.1186/1471-2105-12-s10-s20] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
Background Gene Regulatory Networks (GRNs) provide integrated views of gene interactions that control biological processes. Many public databases contain biological interactions extracted from experimentally validated literature reports, but most furnish only information for a few genetic model organisms. In order to provide a bioinformatic tool for researchers who work with non-model organisms, we developed RefNetBuilder, a new platform that allows construction of putative reference pathways or GRNs from expressed sequence tags (ESTs). Results RefNetBuilder was designed to have the flexibility to extract and archive pathway or GRN information from public databases such as the Kyoto Encyclopedia of Genes and Genomes (KEGG). It features sequence alignment tools such as BLAST to allow mapping ESTs to pathways and GRNs in model organisms. A scoring algorithm was incorporated to rank and select the best match for each query EST. We validated RefNetBuilder using DNA sequences of Caenorhabditis elegans, a model organism having manually curated KEGG pathways. Using the earthworm Eisenia fetida as an example, we demonstrated the functionalities and features of RefNetBuilder. Conclusions The RefNetBuilder provides a standalone application for building reference GRNs for non-model organisms on a number of operating system platforms with standard desktop computer hardware. As a new bioinformatic tool aimed for constructing putative GRNs for non-model organisms that have only ESTs available, RefNetBuilder is especially useful to explore pathway- or network-related information in these organisms.
Collapse
Affiliation(s)
- Ying Li
- School of Computing, University of Southern Mississippi, Hattiesburg, MS 39406, USA
| | | | | | | | | |
Collapse
|
26
|
Le DH, Kwon YK. NetDS: a Cytoscape plugin to analyze the robustness of dynamics and feedforward/feedback loop structures of biological networks. ACTA ACUST UNITED AC 2011; 27:2767-8. [PMID: 21828085 DOI: 10.1093/bioinformatics/btr466] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
SUMMARY NetDS is a novel Cytoscape plugin that conveniently simulates dynamics related to robustness, and examines structural properties with respect to feedforward/feedback loops. It can evaluate how robustly a network sustains a stable state against mutations by employing a Boolean network model. In addition, the plugin can examine all feedforward/feedback loops appearing in a network and determine whether or not a pair of loops is coupled. Random networks can also be generated to evaluate whether or not an interesting finding in real biological networks is significantly random. AVAILABILITY NetDS is freely available for non-commercial purposes at http://netds.sourceforge.net/. CONTACT kwonyk@ulsan.ac.kr SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Duc-Hau Le
- School of Electrical Engineering, University of Ulsan, 93, Daehak-ro, Nam-gu, Ulsan 680-749
| | | |
Collapse
|
27
|
Abstract
We consider the problem of similarity queries in biological network databases. Given a database of networks, similarity query returns all the database networks whose similarity (i.e. alignment score) to a given query network is at least a specified similarity cutoff value. Alignment of two networks is a very costly operation, which makes exhaustive comparison of all the database networks with a query impractical. To tackle this problem, we develop a novel indexing method, named RINQ (Reference-based Indexing for Biological Network Queries). Our method uses a set of reference networks to eliminate a large portion of the database quickly for each query. A reference network is a small biological network. We precompute and store the alignments of all the references with all the database networks. When our database is queried, we align the query network with all the reference networks. Using these alignments, we calculate a lower bound and an approximate upper bound to the alignment score of each database network with the query network. With the help of upper and lower bounds, we eliminate the majority of the database networks without aligning them to the query network. We also quickly identify a small portion of these as guaranteed to be similar to the query. We perform pairwise alignment only for the remaining networks. We also propose a supervised method to pick references that have a large chance of filtering the unpromising database networks. Extensive experimental evaluation suggests that (i) our method reduced the running time of a single query on a database of around 300 networks from over 2 days to only 8 h; (ii) our method outperformed the state of the art method Closure Tree and SAGA by a factor of three or more; and (iii) our method successfully identified statistically and biologically significant relationships across networks and organisms.
Collapse
Affiliation(s)
- Günhan Gülsoy
- Computer and Information Sciences and Engineering Department, University of Florida, Gainesville, FL 32611, USA.
| | | |
Collapse
|
28
|
Ferraro N, Palopoli L, Panni S, Rombo SE. Asymmetric comparison and querying of biological networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011; 8:876-889. [PMID: 21321368 DOI: 10.1109/tcbb.2011.29] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
Comparing and querying the protein-protein interaction (PPI) networks of different organisms is important to infer knowledge about conservation across species. Known methods that perform these tasks operate symmetrically, i.e., they do not assign a distinct role to the input PPI networks. However, in most cases, the input networks are indeed distinguishable on the basis of how the corresponding organism is biologically well characterized. In this paper a new idea is developed, that is, to exploit differences in the characterization of organisms at hand in order to devise methods for comparing their PPI networks. We use the PPI network (called Master) of the best characterized organism as a fingerprint to guide the alignment process to the second input network (called Slave), so that generated results preferably retain the structural characteristics of the Master network. Technically, this is obtained by generating from the Master a finite automaton, called alignment model, which is then fed with (a linearization of) the Slave for the purpose of extracting, via the Viterbi algorithm, matching subgraphs. We propose an approach able to perform global alignment and network querying, and we apply it on PPI networks. We tested our method showing that the results it returns are biologically relevant.
Collapse
Affiliation(s)
- Nicola Ferraro
- Department of Electronics, Computer Science and Systems, University of Calabria, Via Pietro Bucci, Arcavacata di Rende (CS) 87036, Italy.
| | | | | | | |
Collapse
|
29
|
Fionda V, Palopoli L. Biological network querying techniques: analysis and comparison. J Comput Biol 2011; 18:595-625. [PMID: 21417941 DOI: 10.1089/cmb.2009.0144] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Research in systems biology has made available large amounts of data about interactions among cell building blocks (e.g., proteins, genes). To properly look up these data and mine useful information, the design and development of automatic tools has become crucial. These tools leverage Biological Networks as a formal model to encode molecular interactions. Biological networks can be fed as input to graph-based techniques useful to infer new information about cellular activity and evolutive processes of the species. In this context, a rather interesting family of techniques is that of network querying. Network querying tools search a whole biological network to identify conserved occurrences of a given query module for transferring biological knowledge. Indeed, inasmuch as the query network generally encodes a well-characterized functional module, its occurrences in the queried network suggest that the latter (and, as such, the corresponding organism) features the function encoded by the former. The aim of this paper is that of analyzing and comparing tools devised to query biological networks. This analysis is intended to help in understanding problems and research issues, state of the art and opportunities for researchers working in this area.
Collapse
Affiliation(s)
- Valeria Fionda
- Faculty of Computer Science, Free University of Bozen-Bolzano, Bozen-Bolzano, Italy.
| | | |
Collapse
|
30
|
Wallace IM, Bader GD, Giaever G, Nislow C. Displaying chemical information on a biological network using Cytoscape. Methods Mol Biol 2011; 781:363-76. [PMID: 21877291 DOI: 10.1007/978-1-61779-276-2_18] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Cytoscape is an open-source software package that is widely used to integrate and visualize diverse data sets in biology. This chapter explains how to use Cytoscape to integrate open-source chemical information with a biological network. By visualizing information about known compound-target interactions in the context of a biological network of interest, one can rapidly identify novel avenues to perturb the system with compounds and, for example, potentially identify therapeutically relevant targets. Herein, two different protocols are explained in detail, with no prior knowledge of Cytoscape assumed, which demonstrate how to incorporate data from the ChEMBL database with either a gene-gene or a protein-protein interaction network. ChEMBL is a very large, open-source repository of compound-target information available from the European Molecular Biology Laboratory.
Collapse
Affiliation(s)
- Iain M Wallace
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | | | | | | |
Collapse
|
31
|
Babu MM. Early Career Research Award Lecture. Structure, evolution and dynamics of transcriptional regulatory networks. Biochem Soc Trans 2010; 38:1155-78. [PMID: 20863280 DOI: 10.1042/bst0381155] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
The availability of entire genome sequences and the wealth of literature on gene regulation have enabled researchers to model an organism's transcriptional regulation system in the form of a network. In such a network, TFs (transcription factors) and TGs (target genes) are represented as nodes and regulatory interactions between TFs and TGs are represented as directed links. In the present review, I address the following topics pertaining to transcriptional regulatory networks. (i) Structure and organization: first, I introduce the concept of networks and discuss our understanding of the structure and organization of transcriptional networks. (ii) Evolution: I then describe the different mechanisms and forces that influence network evolution and shape network structure. (iii) Dynamics: I discuss studies that have integrated information on dynamics such as mRNA abundance or half-life, with data on transcriptional network in order to elucidate general principles of regulatory network dynamics. In particular, I discuss how cell-to-cell variability in the expression level of TFs could permit differential utilization of the same underlying network by distinct members of a genetically identical cell population. Finally, I conclude by discussing open questions for future research and highlighting the implications for evolution, development, disease and applications such as genetic engineering.
Collapse
Affiliation(s)
- M Madan Babu
- MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 0QH, UK.
| |
Collapse
|
32
|
Itzhaki Z, Akiva E, Margalit H. Preferential use of protein domain pairs as interaction mediators: order and transitivity. Bioinformatics 2010; 26:2564-70. [DOI: 10.1093/bioinformatics/btq495] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
33
|
Bruckner S, Hüffner F, Karp RM, Shamir R, Sharan R. Topology-free querying of protein interaction networks. J Comput Biol 2010; 17:237-52. [PMID: 20377443 DOI: 10.1089/cmb.2009.0170] [Citation(s) in RCA: 86] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
In the network querying problem, one is given a protein complex or pathway of species A and a protein-protein interaction network of species B; the goal is to identify subnetworks of B that are similar to the query in terms of sequence, topology, or both. Existing approaches mostly depend on knowledge of the interaction topology of the query in the network of species A; however, in practice, this topology is often not known. To address this problem, we develop a topology-free querying algorithm, which we call Torque. Given a query, represented as a set of proteins, Torque seeks a matching set of proteins that are sequence-similar to the query proteins and span a connected region of the network, while allowing both insertions and deletions. The algorithm uses alternatively dynamic programming and integer linear programming for the search task. We test Torque with queries from yeast, fly, and human, where we compare it to the QNet topology-based approach, and with queries from less studied species, where only topology-free algorithms apply. Torque detects many more matches than QNet, while giving results that are highly functionally coherent.
Collapse
Affiliation(s)
- Sharon Bruckner
- School of Computer Science, Tel Aviv University, Tel Aviv, Israel.
| | | | | | | | | |
Collapse
|
34
|
Janky R, Helden JV, Babu MM. Investigating transcriptional regulation: from analysis of complex networks to discovery of cis-regulatory elements. Methods 2009; 48:277-86. [PMID: 19450688 DOI: 10.1016/j.ymeth.2009.04.022] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2009] [Revised: 04/17/2009] [Accepted: 04/18/2009] [Indexed: 10/20/2022] Open
Abstract
Regulation of gene expression at the transcriptional level is a fundamental mechanism that is well conserved in all cellular systems. Due to advances in large-scale experimental analyses, we now have a wealth of information on gene regulation such as mRNA expression level across multiple conditions, genome-wide location data of transcription factors and data on transcription factor binding sites. This knowledge can be used to reconstruct transcriptional regulatory networks. Such networks are usually represented as directed graphs where regulatory interactions are depicted as directed edges from the transcription factor nodes to the target gene nodes. This abstract representation allows us to apply graph theory to study transcriptional regulation at global and local levels, to predict regulatory motifs and regulatory modules such as regulons and to compare the regulatory network of different genomes. Here we review some of the available computational methodologies for studying transcriptional regulatory networks as well as their interpretation.
Collapse
Affiliation(s)
- Rekin's Janky
- Structural Studies Division, Medical Research Council - Laboratory of Molecular Biology, Hills Road, Cambridge CB20QH, United Kingdom.
| | | | | |
Collapse
|
35
|
Banks E, Nabieva E, Chazelle B, Singh M. Organization of physical interactomes as uncovered by network schemas. PLoS Comput Biol 2008; 4:e1000203. [PMID: 18949022 PMCID: PMC2561054 DOI: 10.1371/journal.pcbi.1000203] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2008] [Accepted: 09/09/2008] [Indexed: 11/18/2022] Open
Abstract
Large-scale protein-protein interaction networks provide new opportunities for understanding cellular organization and functioning. We introduce network schemas to elucidate shared mechanisms within interactomes. Network schemas specify descriptions of proteins and the topology of interactions among them. We develop algorithms for systematically uncovering recurring, over-represented schemas in physical interaction networks. We apply our methods to the S. cerevisiae interactome, focusing on schemas consisting of proteins described via sequence motifs and molecular function annotations and interacting with one another in one of four basic network topologies. We identify hundreds of recurring and over-represented network schemas of various complexity, and demonstrate via graph-theoretic representations how more complex schemas are organized in terms of their lower-order constituents. The uncovered schemas span a wide range of cellular activities, with many signaling and transport related higher-order schemas. We establish the functional importance of the schemas by showing that they correspond to functionally cohesive sets of proteins, are enriched in the frequency with which they have instances in the H. sapiens interactome, and are useful for predicting protein function. Our findings suggest that network schemas are a powerful paradigm for organizing, interrogating, and annotating cellular networks.
Collapse
Affiliation(s)
- Eric Banks
- Department of Computer Science & Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Elena Nabieva
- Department of Computer Science & Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Bernard Chazelle
- Department of Computer Science & Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Mona Singh
- Department of Computer Science & Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| |
Collapse
|
36
|
Gao J, Ade AS, Tarcea VG, Weymouth TE, Mirel BR, Jagadish HV, States DJ. Integrating and annotating the interactome using the MiMI plugin for cytoscape. ACTA ACUST UNITED AC 2008; 25:137-8. [PMID: 18812364 DOI: 10.1093/bioinformatics/btn501] [Citation(s) in RCA: 103] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
UNLABELLED The MiMI molecular interaction repository integrates data from multiple sources, resolves interactions to standard gene names and symbols, links to annotation data from GO, MeSH and PubMed and normalizes the descriptions of interaction type. Here, we describe a Cytoscape plugin that retrieves interaction and annotation data from MiMI and links out to multiple data sources and tools. Community annotation of the interactome is supported. AVAILABILITY MiMI plugin v3.0.1 can be installed from within Cytoscape 2.6 using the Cytoscape plugin manager in 'Network and Attribute I/0' category. The plugin is also preloaded when Cytoscape is launched using Java WebStart at http://mimi.ncibi.org by querying a gene and clicking 'View in MiMI Plugin for Cytoscape' link.
Collapse
Affiliation(s)
- Jing Gao
- National Center for Integrative Biomedical Informatics, Center for Computational Medicine and Biology, University of Michigan, Ann Arbor, MI 48109, USA
| | | | | | | | | | | | | |
Collapse
|
37
|
Banks E, Nabieva E, Peterson R, Singh M. NetGrep: fast network schema searches in interactomes. Genome Biol 2008; 9:R138. [PMID: 18801179 PMCID: PMC2592716 DOI: 10.1186/gb-2008-9-9-r138] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2008] [Revised: 08/22/2008] [Accepted: 09/18/2008] [Indexed: 11/10/2022] Open
Abstract
NetGrep (http://genomics.princeton.edu/singhlab/netgrep/) is a system for searching protein interaction networks for matches to user-supplied 'network schemas'. Each schema consists of descriptions of proteins (for example, their molecular functions or putative domains) along with the desired topology and types of interactions among them. Schemas can thus describe domain-domain interactions, signaling and regulatory pathways, or more complex network patterns. NetGrep provides an advanced graphical interface for specifying schemas and fast algorithms for extracting their matches.
Collapse
Affiliation(s)
- Eric Banks
- Department of Computer Science, Princeton University, 35 Olden Street, Princeton, NJ 08540, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Carl Icahn Lab, Princeton, NJ 08544, USA
| | - Elena Nabieva
- Department of Computer Science, Princeton University, 35 Olden Street, Princeton, NJ 08540, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Carl Icahn Lab, Princeton, NJ 08544, USA
| | - Ryan Peterson
- Department of Computer Science, Princeton University, 35 Olden Street, Princeton, NJ 08540, USA
- Current address: Department of Computer Science, Cornell University, 4130 Upson Hall, Ithaca, NY 14853, USA
| | - Mona Singh
- Department of Computer Science, Princeton University, 35 Olden Street, Princeton, NJ 08540, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Carl Icahn Lab, Princeton, NJ 08544, USA
| |
Collapse
|
38
|
Protein-Protein Interaction Network Querying by a “Focus and Zoom” Approach. COMMUNICATIONS IN COMPUTER AND INFORMATION SCIENCE 2008. [DOI: 10.1007/978-3-540-70600-7_25] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
|
39
|
Abstract
Interactions are the essence of all biomolecules because they cannot fulfill their roles without interacting with other molecules. Hence, mapping the interactions of biomolecules can be useful for understanding their roles and functions. Furthermore, the development of molecular based systems biology requires an understanding of the biomolecular interactions. In recent years, the mapping of protein-protein interactions in different species has been reported, but few reports have focused on the large-scale mapping of protein-protein interactions in human. Here, we review the developments in protein interaction mapping and we discuss issues and strategies for the mapping of the human protein interactome.
Collapse
|
40
|
Ferro A, Giugno R, Mongiovì M, Pulvirenti A, Skripin D, Shasha D. GraphFind: enhancing graph searching by low support data mining techniques. BMC Bioinformatics 2008; 9 Suppl 4:S10. [PMID: 18460171 PMCID: PMC2367637 DOI: 10.1186/1471-2105-9-s4-s10] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Biomedical and chemical databases are large and rapidly growing in size. Graphs naturally model such kinds of data. To fully exploit the wealth of information in these graph databases, a key role is played by systems that search for all exact or approximate occurrences of a query graph. To deal efficiently with graph searching, advanced methods for indexing, representation and matching of graphs have been proposed. RESULTS This paper presents GraphFind. The system implements efficient graph searching algorithms together with advanced filtering techniques that allow approximate search. It allows users to select candidate subgraphs rather than entire graphs. It implements an effective data storage based also on low-support data mining. CONCLUSIONS GraphFind is compared with Frowns, GraphGrep and gIndex. Experiments show that GraphFind outperforms the compared systems on a very large collection of small graphs. The proposed low-support mining technique which applies to any searching system also allows a significant index space reduction.
Collapse
Affiliation(s)
- Alfredo Ferro
- Dipartimento di Matematica e Informatica, Università di Catania, Catania, 95125, Italy.
| | | | | | | | | | | |
Collapse
|
41
|
Biomolecular network querying: a promising approach in systems biology. BMC SYSTEMS BIOLOGY 2008; 2:5. [PMID: 18205908 PMCID: PMC2245906 DOI: 10.1186/1752-0509-2-5] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/08/2008] [Accepted: 01/18/2008] [Indexed: 11/10/2022]
Abstract
The rapid accumulation of various network-related data from multiple species and conditions (e.g. disease versus normal) provides unprecedented opportunities to study the function and evolution of biological systems. Comparison of biomolecular networks between species or conditions is a promising approach to understanding the essential mechanisms used by living organisms. Computationally, the basic goal of this network comparison or 'querying' is to uncover identical or similar subnetworks by mapping the queried network (e.g. a pathway or functional module) to another network or network database. Such comparative analysis may reveal biologically or clinically important pathways or regulatory networks. In particular, we argue that user-friendly tools for network querying will greatly enhance our ability to study the fundamental properties of biomolecular networks at a system-wide level.
Collapse
|
42
|
Cline MS, Smoot M, Cerami E, Kuchinsky A, Landys N, Workman C, Christmas R, Avila-Campilo I, Creech M, Gross B, Hanspers K, Isserlin R, Kelley R, Killcoyne S, Lotia S, Maere S, Morris J, Ono K, Pavlovic V, Pico AR, Vailaya A, Wang PL, Adler A, Conklin BR, Hood L, Kuiper M, Sander C, Schmulevich I, Schwikowski B, Warner GJ, Ideker T, Bader GD. Integration of biological networks and gene expression data using Cytoscape. Nat Protoc 2007; 2:2366-82. [PMID: 17947979 PMCID: PMC3685583 DOI: 10.1038/nprot.2007.324] [Citation(s) in RCA: 1870] [Impact Index Per Article: 103.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Cytoscape is a free software package for visualizing, modeling and analyzing molecular and genetic interaction networks. This protocol explains how to use Cytoscape to analyze the results of mRNA expression profiling, and other functional genomics and proteomics experiments, in the context of an interaction network obtained for genes of interest. Five major steps are described: (i) obtaining a gene or protein network, (ii) displaying the network using layout algorithms, (iii) integrating with gene expression and other functional attributes, (iv) identifying putative complexes and functional modules and (v) identifying enriched Gene Ontology annotations in the network. These steps provide a broad sample of the types of analyses performed by Cytoscape.
Collapse
Affiliation(s)
- Melissa S Cline
- Institut Pasteur, 25-28 rue du Docteur Roux, 75724 Paris cedex 15, France
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|