1
|
Zhou W, Ma X, Jia X, Zheng J, Yan L, Fu Y. Construction and comprehensive analysis of the biological network related to rheumatoid arthritis-related interstitial lung disease. Int J Rheum Dis 2023; 26:132-144. [PMID: 36261881 DOI: 10.1111/1756-185x.14466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Revised: 08/22/2022] [Accepted: 10/01/2022] [Indexed: 01/04/2023]
Abstract
OBJECTIVE Interstitial lung disease (ILD) is a severe manifestation of rheumatoid arthritis (RA), which is characterized by low survival time post-diagnosis. Thus, it is important to explore the role of gene regulation related with ILD. METHOD Constructed a RA-ILD-related long chain noncoding RNA - messenger RNA (lncRNA-mRNA) network (ILD-LMN), based on ILD- and RA-related genes. We analyzed the topological properties of the resulting network. RESULT The results for network modularization and functional analysis showed that ILD-LMN performed basic and specific functions in ILD pathology. Furthermore, differential expression and correlation analysis of hub nodes revealed highly correlated competitive endogenous RNA regulatory relationships with important roles in pathological regulation. Following this, statistical analysis of disease-related single nucleotide polymorphisms (SNPs) in hub lncRNAs revealed that some of transcription factor-related SNPs were significantly associated with the expression of lncRNA. In fact, these SNPs exhibited significant differential expression in disease and normal samples. CONCLUSION These results suggest that ILD-LMN has important implications in the study of disease. Altogether, the study of RA- and ILD-related lncRNA and genes on the basis of biological network would assist in providing better treatment opportunities for ILD patients. Additionally, it would promote further research on treatment of the disease.
Collapse
Affiliation(s)
- Wei Zhou
- Department of Rheumatology, Liaocheng People's Hospital, Liaocheng, China
| | - Xianghui Ma
- Department of Rheumatology, Dongying People's Hospital, Dongying, China
| | - Xiaodong Jia
- Department of the Key Laboratory of Ophthalmology, Liaocheng People's Hospital, Liaocheng, China
| | - Juan Zheng
- Department of the Key Laboratory of Ophthalmology, Liaocheng People's Hospital, Liaocheng, China
| | - Lili Yan
- Department of the Key Laboratory of Ophthalmology, Liaocheng People's Hospital, Liaocheng, China
| | - Yanfa Fu
- Department of Rheumatology, Liaocheng People's Hospital, Liaocheng, China
| |
Collapse
|
2
|
Li Q, Milenkovic T. Supervised Prediction of Aging-Related Genes From a Context-Specific Protein Interaction Subnetwork. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2484-2498. [PMID: 33929964 DOI: 10.1109/tcbb.2021.3076961] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Human aging is linked to many prevalent diseases. The aging process is highly influenced by genetic factors. Hence, it is important to identify human aging-related genes. We focus on supervised prediction of such genes. Gene expression-based methods for this purpose study genes in isolation from each other. While protein-protein interaction (PPI) network-based methods for this purpose account for interactions between genes' protein products, current PPI network data are context-unspecific, spanning different biological conditions. Instead, here, we focus on an aging-specific subnetwork of the entire PPI network, obtained by integrating aging-specific gene expression data and PPI network data. The potential of such data integration has been recognized but mostly in the context of cancer. So, we are the first to propose a supervised learning framework for predicting aging-related genes from an aging-specific PPI subnetwork. In a systematic and comprehensive evaluation, we find that in many of the evaluation tests: (i) using an aging-specific subnetwork indeed yields more accurate aging-related gene predictions than using the entire network, and (ii) predictive methods from our framework that have not previously been used for supervised prediction of aging-related genes outperform existing prominent methods for the same purpose. These results justify the need for our framework.
Collapse
|
3
|
Mitsche D, Penrose MD. Limit theory of combinatorial optimization for random geometric graphs. ANN APPL PROBAB 2021. [DOI: 10.1214/20-aap1661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
4
|
Alofairi AA, Mabrouk E, Elsemman IE. Constraint-based models for dominating protein interaction networks. IET Syst Biol 2021; 15:148-162. [PMID: 34048146 PMCID: PMC8675806 DOI: 10.1049/syb2.12021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Revised: 05/10/2021] [Accepted: 05/11/2021] [Indexed: 11/19/2022] Open
Abstract
The minimum dominating set (MDSet) comprises the smallest number of graph nodes, where other graph nodes are connected with at least one MDSet node. The MDSet has been successfully applied to extract proteins that control protein–protein interaction (PPI) networks and to reveal the correlation between structural analysis and biological functions. Although the PPI network contains many MDSets, the identification of multiple MDSets is an NP‐complete problem, and it is difficult to determine the best MDSets, enriched with biological functions. Therefore, the MDSet model needs to be further expanded and validated to find constrained solutions that differ from those generated by the traditional models. Moreover, by identifying the critical set of the network, the set of nodes common to all MDSets can be time‐consuming. Herein, the authors adopted the minimisation of metabolic adjustment (MOMA) algorithm to develop a new framework, called maximisation of interaction adjustment (MOIA). In MOIA, they provide three models; the first one generates two MDSets with a minimum number of shared proteins, the second model generates constrained multiple MDSets (k‐MDSets), and the third model generates user‐defined MDSets, containing the maximum number of essential genes and/or other important genes of the PPI network. In practice, these models significantly reduce the cost of finding the critical set and classifying the graph nodes. Herein, the authors termed the critical set as the k‐critical set, where k is the number of MDSets generated by the proposed model. Then, they defined a new set of proteins called the (k−1)‐critical set, where each node belongs to (k−1) MDSets. This set has been shown to be as important as the k‐critical set and contains many essential genes, transcription factors, and protein kinases as the k‐critical set. The (k−1)‐critical set can be used to extend the search for drug target proteins. Based on the performance of the MOIA models, the authors believe the proposed methods contribute to answering key questions about the MDSets of PPI networks, and their results and analysis can be extended to other network types.
Collapse
Affiliation(s)
- Adel A Alofairi
- Department of Computer Science and Information Technology, Faculty of Science, Ibb University, Ibb, Yemen.,Department of Mathematics, Faculty of Science, Assiut University, Assiut, Egypt
| | - Emad Mabrouk
- Department of Mathematics, Faculty of Science, Assiut University, Assiut, Egypt.,College of Engineering and Technology, American University of the Middle East, Kuwait, Kuwait
| | - Ibrahim E Elsemman
- Department of Information Systems, Faculty of Computers and Information, Assiut University, Assiut, Egypt
| |
Collapse
|
5
|
Rai S, Bhatia V, Bhatnagar S. Drug repurposing for hyperlipidemia associated disorders: An integrative network biology and machine learning approach. Comput Biol Chem 2021; 92:107505. [PMID: 34030115 DOI: 10.1016/j.compbiolchem.2021.107505] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 04/21/2021] [Accepted: 05/05/2021] [Indexed: 12/31/2022]
Abstract
Hyperlipidemia causes diseases like cardiovascular disease, cancer, Type II Diabetes and Alzheimer's disease. Drugs that specifically target HL associated diseases are required for treatment. 34 KEGG pathways targeted by lipid lowering drugs were used to construct a directed protein-protein interaction network and driver nodes were determined using CytoCtrlAnalyser plugin of Cytoscape 3.6. The involvement of driver nodes of HL in other diseases was verified using GWAS. The central nodes of the network and 34 overrepresented pathways had a critical role in Hyperlipidemia. The PI3K-AKT signalling pathway, non-essentiality, non-centrality and approved drug target status were the predominant features of the driver nodes. Next, a Random Forest classifier was trained on 1445 molecular descriptors calculated using PaDEL for 50 approved lipid lowering and 84 lipid raising drugs as the positive and negative training set respectively. The classifier showed average accuracy of 76.8 % during 5-fold cross validation with AUC of 0.79 ± 0.06 for the ROC curve. The classifier was applied to select molecules with favourable properties for lipid lowering from the 130 approved drugs interacting with the identified driver nodes. We have integrated diverse network data and machine learning to predict repurposing of nine drugs for treatment of HL associated diseases.
Collapse
Affiliation(s)
- Sneha Rai
- Computational and Structural Biology Laboratory, Division of Biotechnology, Netaji Subhas Institute of Technology, Dwarka, New Delhi, 110078, India; Department of Biotechnology, Noida Institute of Engineering and Technology, Greater Noida, India
| | - Venugopal Bhatia
- Computational and Structural Biology Laboratory, Division of Biotechnology, Netaji Subhas Institute of Technology, Dwarka, New Delhi, 110078, India
| | - Sonika Bhatnagar
- Computational and Structural Biology Laboratory, Division of Biotechnology, Netaji Subhas Institute of Technology, Dwarka, New Delhi, 110078, India; Computational and Structural Biology Laboratory, Department of Biological Sciences and Engineering, Netaji Subhas University of Technology Dwarka, New Delhi 110078, India.
| |
Collapse
|
6
|
Newaz K, Wright G, Piland J, Li J, Clark PL, Emrich SJ, Milenković T. Network analysis of synonymous codon usage. Bioinformatics 2020; 36:4876-4884. [PMID: 32609328 DOI: 10.1093/bioinformatics/btaa603] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2019] [Revised: 05/05/2020] [Accepted: 06/22/2020] [Indexed: 12/25/2022] Open
Abstract
MOTIVATION Most amino acids are encoded by multiple synonymous codons, some of which are used more rarely than others. Analyses of positions of such rare codons in protein sequences revealed that rare codons can impact co-translational protein folding and that positions of some rare codons are evolutionarily conserved. Analyses of their positions in protein 3-dimensional structures, which are richer in biochemical information than sequences alone, might further explain the role of rare codons in protein folding. RESULTS We model protein structures as networks and use network centrality to measure the structural position of an amino acid. We first validate that amino acids buried within the structural core are network-central, and those on the surface are not. Then, we study potential differences between network centralities and thus structural positions of amino acids encoded by conserved rare, non-conserved rare and commonly used codons. We find that in 84% of proteins, the three codon categories occupy significantly different structural positions. We examine protein groups showing different codon centrality trends, i.e. different relationships between structural positions of the three codon categories. We see several cases of all proteins from our data with some structural or functional property being in the same group. Also, we see a case of all proteins in some group having the same property. Our work shows that codon usage is linked to the final protein structure and thus possibly to co-translational protein folding. AVAILABILITY AND IMPLEMENTATION https://nd.edu/∼cone/CodonUsage/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Khalique Newaz
- Department of Computer Science and Engineering.,Center for Network and Data Science.,Eck institute for Global Health
| | - Gabriel Wright
- Department of Computer Science and Engineering.,Eck institute for Global Health
| | - Jacob Piland
- Department of Computer Science and Engineering.,Center for Network and Data Science.,Eck institute for Global Health
| | - Jun Li
- Department of Applied and Computational Mathematics and Statistics
| | - Patricia L Clark
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Scott J Emrich
- Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, TN 37996, USA
| | - Tijana Milenković
- Department of Computer Science and Engineering.,Center for Network and Data Science.,Eck institute for Global Health
| |
Collapse
|
7
|
Complex Network Characterization Using Graph Theory and Fractal Geometry: The Case Study of Lung Cancer DNA Sequences. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10093037] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
This paper discusses an approach developed for exploiting the local elementary movements of evolution to study complex networks in terms of shared common embedding and, consequently, shared fractal properties. This approach can be useful for the analysis of lung cancer DNA sequences and their properties by using the concepts of graph theory and fractal geometry. The proposed method advances a renewed consideration of network complexity both on local and global scales. Several researchers have illustrated the advantages of fractal mathematics, as well as its applicability to lung cancer research. Nevertheless, many researchers and clinicians continue to be unaware of its potential. Therefore, this paper aims to examine the underlying assumptions of fractals and analyze the fractal dimension and related measurements for possible application to complex networks and, especially, to the lung cancer network. The strict relationship between the lung cancer network properties and the fractal dimension is proved. Results show that the fractal dimension decreases in the lung cancer network while the topological properties of the network increase in the lung cancer network. Finally, statistical and topological significance between the complexity of the network and lung cancer network is shown.
Collapse
|
8
|
Liu S, Hachen D, Lizardo O, Poellabauer C, Striegel A, Milenković T. The power of dynamic social networks to predict individuals' mental health. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2020; 25:635-646. [PMID: 31797634 PMCID: PMC6924569] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Precision medicine has received attention both in and outside the clinic. We focus on the latter, by exploiting the relationship between individuals' social interactions and their mental health to predict one's likelihood of being depressed or anxious from rich dynamic social network data. Existing studies differ from our work in at least one aspect: they do not model social interaction data as a network; they do so but analyze static network data; they examine "correlation" between social networks and health but without making any predictions; or they study other individual traits but not mental health. In a comprehensive evaluation, we show that our predictive model that uses dynamic social network data is superior to its static network as well as non-network equivalents when run on the same data. Supplementary material for this work is available at https://nd.edu/~cone/NetHealth/PSB_SM.pdf.
Collapse
Affiliation(s)
- Shikang Liu
- Department of Computer Science and Engineering, University of Notre Dame
| | - David Hachen
- Department of Sociology, University of Notre Dame
| | - Omar Lizardo
- Department of Sociology, University of California, Los Angeles
| | | | - Aaron Striegel
- Department of Computer Science and Engineering, University of Notre Dame
| | - Tijana Milenković
- Department of Computer Science and Engineering, University of Notre Dame
| |
Collapse
|
9
|
Bouamama S, Blum C, Fages JG. An algorithm based on ant colony optimization for the minimum connected dominating set problem. Appl Soft Comput 2019. [DOI: 10.1016/j.asoc.2019.04.028] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
10
|
Network controllability analysis of intracellular signalling reveals viruses are actively controlling molecular systems. Sci Rep 2019; 9:2066. [PMID: 30765882 PMCID: PMC6375943 DOI: 10.1038/s41598-018-38224-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2018] [Accepted: 12/21/2018] [Indexed: 12/19/2022] Open
Abstract
In recent years control theory has been applied to biological systems with the aim of identifying the minimum set of molecular interactions that can drive the network to a required state. However, in an intra-cellular network it is unclear how control can be achieved in practice. To address this limitation we use viral infection, specifically human immunodeficiency virus type 1 (HIV-1) and hepatitis C virus (HCV), as a paradigm to model control of an infected cell. Using a large human signalling network comprised of over 6000 human proteins and more than 34000 directed interactions, we compared two states: normal/uninfected and infected. Our network controllability analysis demonstrates how a virus efficiently brings the dynamically organised host system into its control by mostly targeting existing critical control nodes, requiring fewer nodes than in the uninfected network. The lower number of control nodes is presumably to optimise exploitation of specific sub-systems needed for virus replication and/or involved in the host response to infection. Viral infection of the human system also permits discrimination between available network-control models, which demonstrates that the minimum dominating set (MDS) method better accounts for how the biological information and signals are organised during infection by identifying most viral proteins as critical driver nodes compared to the maximum matching (MM) method. Furthermore, the host driver nodes identified by MDS are distributed throughout the pathways enabling effective control of the cell via the high ‘control centrality’ of the viral and targeted host nodes. Our results demonstrate that control theory gives a more complete and dynamic understanding of virus exploitation of the host system when compared with previous analyses limited to static single-state networks.
Collapse
|
11
|
Nacher JC, Ishitsuka M, Miyazaki S, Akutsu T. Finding and analysing the minimum set of driver nodes required to control multilayer networks. Sci Rep 2019; 9:576. [PMID: 30679639 PMCID: PMC6345816 DOI: 10.1038/s41598-018-37046-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Accepted: 11/30/2018] [Indexed: 02/07/2023] Open
Abstract
It is difficult to control multilayer networks in situations with real-world complexity. Here, we first define the multilayer control problem in terms of the minimum dominating set (MDS) controllability framework and mathematically demonstrate that simple formulas can be used to estimate the size of the minimum dominating set in multilayer (MDSM) complex networks. Second, we develop a new algorithm that efficiently identifies the MDSM in up to 6 layers, with several thousand nodes in each layer network. Interestingly, the findings reveal that the MDSM size for similar networks does not significantly differ from that required to control a single network. This result opens future directions for controlling, for example, multiple species by identifying a common set of enzymes or proteins for drug targeting. We apply our methods to 70 genome-wide metabolic networks across major plant lineages, unveiling some relationships between controllability in multilayer networks and metabolic functions at the genome scale.
Collapse
Affiliation(s)
- Jose C Nacher
- Department of Information Science, Toho University, Funabashi, Chiba, 274-8510, Japan.
| | - Masayuki Ishitsuka
- Department of Information Science, Toho University, Funabashi, Chiba, 274-8510, Japan
| | - Shuichi Miyazaki
- Academic Center for Computing Media Studies, Kyoto University, Kyoto, 606-8501, Japan
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, 611-0011, Japan.
| |
Collapse
|
12
|
Eguchi R, Karim MB, Hu P, Sato T, Ono N, Kanaya S, Altaf-Ul-Amin M. An integrative network-based approach to identify novel disease genes and pathways: a case study in the context of inflammatory bowel disease. BMC Bioinformatics 2018; 19:264. [PMID: 30005591 PMCID: PMC6043997 DOI: 10.1186/s12859-018-2251-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Accepted: 06/18/2018] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND There are different and complicated associations between genes and diseases. Finding the causal associations between genes and specific diseases is still challenging. In this work we present a method to predict novel associations of genes and pathways with inflammatory bowel disease (IBD) by integrating information of differential gene expression, protein-protein interaction and known disease genes related to IBD. RESULTS We downloaded IBD gene expression data from NCBI's Gene Expression Omnibus, performed statistical analysis to determine differentially expressed genes, collected known IBD genes from DisGeNet database, which were used to construct a IBD related PPI network with HIPPIE database. We adapted our graph-based clustering algorithm DPClusO to cluster the disease PPI network. We evaluated the statistical significance of the identified clusters in the context of determining the richness of IBD genes using Fisher's exact test and predicted novel genes related to IBD. We showed 93.8% of our predictions are correct in the context of other databases and published literatures related to IBD. CONCLUSIONS Finding disease-causing genes is necessary for developing drugs with synergistic effect targeting many genes simultaneously. Here we present an approach to identify novel disease genes and pathways and discuss our approach in the context of IBD. The approach can be generalized to find disease-associated genes for other diseases.
Collapse
Affiliation(s)
- Ryohei Eguchi
- Graduate School of Science and Technology & NAIST Data Science Center, Nara Institute of Science and Technology, Nara, Japan
| | - Mohammand Bozlul Karim
- Graduate School of Science and Technology & NAIST Data Science Center, Nara Institute of Science and Technology, Nara, Japan
| | - Pingzhao Hu
- Department of Biochemistry and Medical Genetics, University of Manitoba, Winnipeg, Canada.,George and Fay Yee Centre for Healthcare Innovation, University of Manitoba, Winnipeg, Canada.,Department of Electrical and Computer Engineering, University of Manitoba, Winnipeg, Canada
| | - Tetsuo Sato
- Graduate School of Science and Technology & NAIST Data Science Center, Nara Institute of Science and Technology, Nara, Japan.,Department of Radiological Technology, Gunma Prefectural College of Health Sciences, Gunma, Japan
| | - Naoaki Ono
- Graduate School of Science and Technology & NAIST Data Science Center, Nara Institute of Science and Technology, Nara, Japan
| | - Shigehiko Kanaya
- Graduate School of Science and Technology & NAIST Data Science Center, Nara Institute of Science and Technology, Nara, Japan
| | - Md Altaf-Ul-Amin
- Graduate School of Science and Technology & NAIST Data Science Center, Nara Institute of Science and Technology, Nara, Japan.
| |
Collapse
|
13
|
Sharma A, Halu A, Decano JL, Padi M, Liu YY, Prasad RB, Fadista J, Santolini M, Menche J, Weiss ST, Vidal M, Silverman EK, Aikawa M, Barabási AL, Groop L, Loscalzo J. Controllability in an islet specific regulatory network identifies the transcriptional factor NFATC4, which regulates Type 2 Diabetes associated genes. NPJ Syst Biol Appl 2018; 4:25. [PMID: 29977601 PMCID: PMC6028434 DOI: 10.1038/s41540-018-0057-0] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2017] [Revised: 04/09/2018] [Accepted: 05/04/2018] [Indexed: 01/14/2023] Open
Abstract
Probing the dynamic control features of biological networks represents a new frontier in capturing the dysregulated pathways in complex diseases. Here, using patient samples obtained from a pancreatic islet transplantation program, we constructed a tissue-specific gene regulatory network and used the control centrality (Cc) concept to identify the high control centrality (HiCc) pathways, which might serve as key pathobiological pathways for Type 2 Diabetes (T2D). We found that HiCc pathway genes were significantly enriched with modest GWAS p-values in the DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) study. We identified variants regulating gene expression (expression quantitative loci, eQTL) of HiCc pathway genes in islet samples. These eQTL genes showed higher levels of differential expression compared to non-eQTL genes in low, medium, and high glucose concentrations in rat islets. Among genes with highly significant eQTL evidence, NFATC4 belonged to four HiCc pathways. We asked if the expressions of T2D-associated candidate genes from GWAS and literature are regulated by Nfatc4 in rat islets. Extensive in vitro silencing of Nfatc4 in rat islet cells displayed reduced expression of 16, and increased expression of four putative downstream T2D genes. Overall, our approach uncovers the mechanistic connection of NFATC4 with downstream targets including a previously unknown one, TCF7L2, and establishes the HiCc pathways' relationship to T2D.
Collapse
Affiliation(s)
- Amitabh Sharma
- 1Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115 USA.,2Center for Complex Network Research and Department of Physics, Northeastern University, Boston, MA 02115 USA.,3Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA 02215 USA.,4Center for Interdisciplinary Cardiovascular Sciences, Cardiovascular Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02215 USA
| | - Arda Halu
- 1Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115 USA.,4Center for Interdisciplinary Cardiovascular Sciences, Cardiovascular Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02215 USA
| | - Julius L Decano
- 4Center for Interdisciplinary Cardiovascular Sciences, Cardiovascular Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02215 USA
| | - Megha Padi
- 5Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ 85721 USA
| | - Yang-Yu Liu
- 1Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115 USA
| | - Rashmi B Prasad
- 6Lund University Diabetes Center, Department of Clinical Sciences, Diabetes & Endocrinology, Skåne University Hospital Malmö, Lund University, Malmö, 20502 Sweden
| | - Joao Fadista
- 6Lund University Diabetes Center, Department of Clinical Sciences, Diabetes & Endocrinology, Skåne University Hospital Malmö, Lund University, Malmö, 20502 Sweden
| | - Marc Santolini
- 1Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115 USA.,2Center for Complex Network Research and Department of Physics, Northeastern University, Boston, MA 02115 USA
| | - Jörg Menche
- 2Center for Complex Network Research and Department of Physics, Northeastern University, Boston, MA 02115 USA.,7 CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, 1090 Austria
| | - Scott T Weiss
- 1Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115 USA
| | - Marc Vidal
- 3Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA 02215 USA.,8Department of Genetics, Harvard Medical School, Boston, MA 02115 USA
| | - Edwin K Silverman
- 1Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115 USA
| | - Masanori Aikawa
- 4Center for Interdisciplinary Cardiovascular Sciences, Cardiovascular Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02215 USA
| | - Albert-László Barabási
- 1Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115 USA.,2Center for Complex Network Research and Department of Physics, Northeastern University, Boston, MA 02115 USA.,3Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA 02215 USA.,9Center for Network Science, Central European University, Nador u. 9, 1051 Budapest, Hungary
| | - Leif Groop
- 6Lund University Diabetes Center, Department of Clinical Sciences, Diabetes & Endocrinology, Skåne University Hospital Malmö, Lund University, Malmö, 20502 Sweden.,10Department of Clinical Sciences, Islet cell physiology, Skåne University Hospital Malmö, Lund University, Malmö, 20502 Sweden
| | - Joseph Loscalzo
- 11Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115 USA
| |
Collapse
|
14
|
Choo SM, Ban B, Joo JI, Cho KH. The phenotype control kernel of a biomolecular regulatory network. BMC SYSTEMS BIOLOGY 2018; 12:49. [PMID: 29622038 PMCID: PMC5887232 DOI: 10.1186/s12918-018-0576-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/12/2017] [Accepted: 03/27/2018] [Indexed: 12/23/2022]
Abstract
Background Controlling complex molecular regulatory networks is getting a growing attention as it can provide a systematic way of driving any cellular state to a desired cell phenotypic state. A number of recent studies suggested various control methods, but there is still deficiency in finding out practically useful control targets that ensure convergence of any initial network state to one of attractor states corresponding to a desired cell phenotype. Results To find out practically useful control targets, we introduce a new concept of phenotype control kernel (PCK) for a Boolean network, defined as the collection of all minimal sets of control nodes having their fixed state values that can generate all possible control sets which eventually drive any initial state to one of attractor states corresponding to a particular cell phenotype of interest. We also present a detailed method with which we can identify PCK in a systematic way based on the layered network and converging tree of a given network. We identify all candidates for control nodes from the layered network and then hierarchically search for all possible minimal sets by using the converging tree. We show the usefulness of PCK by applying it to cell proliferation and apoptosis signaling networks and comparing the results with other control methods. PCK is the unique control method for Boolean network models that can be used to identify all possible minimal sets of control nodes. Interestingly, many of the minimal sets have only one or two control nodes. Conclusions Based on the new concept of PCK, we can identify all possible minimal sets of control nodes that can drive any molecular network state to one of multiple attractor states representing a same desired cell phenotype. Electronic supplementary material The online version of this article (10.1186/s12918-018-0576-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Sang-Mok Choo
- Department of Mathematics, University of Ulsan, Ulsan, 44610, Republic of Korea
| | - Byunghyun Ban
- Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea
| | - Jae Il Joo
- Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea
| | - Kwang-Hyun Cho
- Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea.
| |
Collapse
|
15
|
Liu S, Hachen D, Lizardo O, Poellabauer C, Striegel A, Milenković T. Network analysis of the NetHealth data: exploring co-evolution of individuals' social network positions and physical activities. APPLIED NETWORK SCIENCE 2018; 3:45. [PMID: 30465021 PMCID: PMC6223883 DOI: 10.1007/s41109-018-0103-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2018] [Accepted: 09/25/2018] [Indexed: 05/03/2023]
Abstract
Understanding the relationship between individuals' social networks and health could help devise public health interventions for reducing incidence of unhealthy behaviors or increasing prevalence of healthy ones. In this context, we explore the co-evolution of individuals' social network positions and physical activities. We are able to do so because the NetHealth study at the University of Notre Dame has generated both high-resolution longitudinal social network (e.g., SMS) data and high-resolution longitudinal health-related behavioral (e.g., Fitbit physical activity) data. We examine trait differences between (i) users whose social network positions (i.e., centralities) change over time versus those whose centralities remain stable, (ii) users whose Fitbit physical activities change over time versus those whose physical activities remain stable, and (iii) users whose centralities and their physical activities co-evolve, i.e., correlate with each other over time. We find that centralities of a majority of all nodes change with time. These users do not show any trait difference compared to time-stable users. However, if out of all users whose centralities change with time we focus on those whose physical activities also change with time, then the resulting users are more likely to be introverted than time-stable users. Moreover, users whose centralities and physical activities both change with time and whose evolving centralities are significantly correlated (i.e., co-evolve) with evolving physical activities are more likely to be introverted as well as anxious compared to those users who are time-stable and do not have a co-evolution relationship. Our network analysis framework reveals several links between individuals' social network structure, health-related behaviors, and the other (e.g., personality) traits. In the future, our study could lead to development of a predictive model of social network structure from behavioral/trait information and vice versa.
Collapse
Affiliation(s)
- Shikang Liu
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, 46556 IN USA
| | - David Hachen
- Department of Sociology, University of Notre Dame, Notre Dame, 46556 IN USA
| | - Omar Lizardo
- Department of Sociology, University of Notre Dame, Notre Dame, 46556 IN USA
| | - Christian Poellabauer
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, 46556 IN USA
| | - Aaron Striegel
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, 46556 IN USA
| | - Tijana Milenković
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, 46556 IN USA
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, 46556 IN USA
- Interdisciplinary Center for Network Science and Applications (iCeNSA), University of Notre Dame, Notre Dame, 46556 IN USA
| |
Collapse
|
16
|
GRAFENE: Graphlet-based alignment-free network approach integrates 3D structural and sequence (residue order) data to improve protein structural comparison. Sci Rep 2017; 7:14890. [PMID: 29097661 PMCID: PMC5668259 DOI: 10.1038/s41598-017-14411-y] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2016] [Accepted: 10/11/2017] [Indexed: 12/26/2022] Open
Abstract
Initial protein structural comparisons were sequence-based. Since amino acids that are distant in the sequence can be close in the 3-dimensional (3D) structure, 3D contact approaches can complement sequence approaches. Traditional 3D contact approaches study 3D structures directly and are alignment-based. Instead, 3D structures can be modeled as protein structure networks (PSNs). Then, network approaches can compare proteins by comparing their PSNs. These can be alignment-based or alignment-free. We focus on the latter. Existing network alignment-free approaches have drawbacks: 1) They rely on naive measures of network topology. 2) They are not robust to PSN size. They cannot integrate 3) multiple PSN measures or 4) PSN data with sequence data, although this could improve comparison because the different data types capture complementary aspects of the protein structure. We address this by: 1) exploiting well-established graphlet measures via a new network alignment-free approach, 2) introducing normalized graphlet measures to remove the bias of PSN size, 3) allowing for integrating multiple PSN measures, and 4) using ordered graphlets to combine the complementary PSN data and sequence (specifically, residue order) data. We compare synthetic networks and real-world PSNs more accurately and faster than existing network (alignment-free and alignment-based), 3D contact, or sequence approaches.
Collapse
|
17
|
Ishitsuka M, Akutsu T, Nacher JC. Critical controllability analysis of directed biological networks using efficient graph reduction. Sci Rep 2017; 7:14361. [PMID: 29084972 PMCID: PMC5662738 DOI: 10.1038/s41598-017-14334-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Accepted: 10/06/2017] [Indexed: 01/02/2023] Open
Abstract
Network science has recently integrated key concepts from control theory and has applied them to the analysis of the controllability of complex networks. One of the proposed frameworks uses the Minimum Dominating Set (MDS) approach, which has been successfully applied to the identification of cancer-related proteins and in analyses of large-scale undirected networks, such as proteome-wide protein interaction networks. However, many real systems are better represented by directed networks. Therefore, fast algorithms are required for the application of MDS to directed networks. Here, we propose an algorithm that utilises efficient graph reduction to identify critical control nodes in large-scale directed complex networks. The algorithm is 176-fold faster than existing methods and increases the computable network size to 65,000 nodes. We then applied the developed algorithm to metabolic pathways consisting of 70 plant species encompassing major plant lineages ranging from algae to angiosperms and to signalling pathways from C. elegans, D. melanogaster and H. sapiens. The analysis not only identified functional pathways enriched with critical control molecules but also showed that most control categories are largely conserved across evolutionary time, from green algae and early basal plants to modern angiosperm plant lineages.
Collapse
Affiliation(s)
- Masayuki Ishitsuka
- Department of Information Science, Faculty of Science, Toho University, Funabashi, 274-8510, Japan
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, 611-0011, Japan
| | - Jose C Nacher
- Department of Information Science, Faculty of Science, Toho University, Funabashi, 274-8510, Japan.
| |
Collapse
|
18
|
Yoo B, Faisal FE, Chen H, Milenkovic T. Improving Identification of Key Players in Aging via Network De-Noising and Core Inference. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:1056-1069. [PMID: 26529776 DOI: 10.1109/tcbb.2015.2495170] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Current "ground truth" knowledge about human aging has been obtained by transferring aging-related knowledge from well-studied model species via sequence homology or by studying human gene expression data. Since proteins function by interacting with each other, analyzing protein-protein interaction (PPI) networks in the context of aging is promising. Unlike existing static network research of aging, since cellular functioning is dynamic, we recently integrated the static human PPI network with aging-related gene expression data to form dynamic, age-specific networks. Then, we predicted as key players in aging those proteins whose network topologies significantly changed with age. Since current networks are noisy , here, we use link prediction to de-noise the human network and predict improved key players in aging from the de-noised data. Indeed, de-noising gives more significant overlap between the predicted data and the "ground truth" aging-related data. Yet, we obtain novel predictions, which we validate in the literature. Also, we improve the predictions by an alternative strategy: removing "redundant" edges from the age-specific networks and using the resulting age-specific network "cores" to study aging. We produce new knowledge from dynamic networks encompassing multiple data types, via network de-noising or core inference, complementing the existing knowledge obtained from sequence or expression data.
Collapse
|
19
|
Gladilin E. Graph-theoretical model of global human interactome reveals enhanced long-range communicability in cancer networks. PLoS One 2017; 12:e0170953. [PMID: 28141819 PMCID: PMC5283687 DOI: 10.1371/journal.pone.0170953] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2016] [Accepted: 01/13/2017] [Indexed: 12/22/2022] Open
Abstract
Malignant transformation is known to involve substantial rearrangement of the molecular genetic landscape of the cell. A common approach to analysis of these alterations is a reductionist one and consists of finding a compact set of differentially expressed genes or associated signaling pathways. However, due to intrinsic tumor heterogeneity and tissue specificity, biomarkers defined by a small number of genes/pathways exhibit substantial variability. As an alternative to compact differential signatures, global features of genetic cell machinery are conceivable. Global network descriptors suggested in previous works are, however, known to potentially be biased by overrepresentation of interactions between frequently studied genes-proteins. Here, we construct a cellular network of 74538 directional and differential gene expression weighted protein-protein and gene regulatory interactions, and perform graph-theoretical analysis of global human interactome using a novel, degree-independent feature—the normalized total communicability (NTC). We apply this framework to assess differences in total information flow between different cancer (BRCA/COAD/GBM) and non-cancer interactomes. Our experimental results reveal that different cancer interactomes are characterized by significant enhancement of long-range NTC, which arises from circulation of information flow within robustly organized gene subnetworks. Although enhancement of NTC emerges in different cancer types from different genomic profiles, we identified a subset of 90 common genes that are related to elevated NTC in all studied tumors. Our ontological analysis shows that these genes are associated with enhanced cell division, DNA replication, stress response, and other cellular functions and processes typically upregulated in cancer. We conclude that enhancement of long-range NTC manifested in the correlated activity of genes whose tight coordination is required for survival and proliferation of all tumor cells, and, thus, can be seen as a graph-theoretical equivalent to some hallmarks of cancer. The computational framework for differential network analysis presented herein is of potential interest for a wide range of network perturbation problems given by single or multiple gene-protein activation-inhibition.
Collapse
Affiliation(s)
- Evgeny Gladilin
- Division of Theoretical Bioinformatics, German Cancer Research Center, Berliner Str. 41, 69120 Heidelberg, Germany
- BioQuant and IPMB, University Heidelberg, Im Neuenheimer Feld 267, 69120 Heidelberg, Germany
- * E-mail:
| |
Collapse
|
20
|
Karabekmez ME, Kirdar B. A novel topological centrality measure capturing biologically important proteins. MOLECULAR BIOSYSTEMS 2016; 12:666-73. [PMID: 26699451 DOI: 10.1039/c5mb00732a] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Topological centrality in protein interaction networks and its biological implications have widely been investigated in the past. In the present study, a novel metric of centrality-weighted sum of loads eigenvector centrality (WSL-EC)-based on graph spectra is defined and its performance in identifying topologically and biologically important nodes is comparatively investigated with common metrics of centrality in a human protein-protein interaction network. The metric can capture nodes from peripherals of the network differently from conventional eigenvector centrality. Different metrics were found to selectively identify hub sets that are significantly associated with different biological processes. The widely accepted metrics degree centrality, betweenness centrality, subgraph centrality and eigenvector centrality are subject to a bias towards super-hubs, whereas WSL-EC is not affected by the presence of super-hubs. WSL-EC outperforms other metrics of centrality in detecting biologically central nodes such as pathogen-interacting, cancer, ageing, HIV-1 or disease-related proteins and proteins involved in immune system processes and autoimmune diseases in the human interactome.
Collapse
Affiliation(s)
| | - Betul Kirdar
- Bogazici University, Department of Chemical Engineering, Istanbul, Turkey.
| |
Collapse
|
21
|
Abstract
We consider the observability model in networks with arbitrary topologies. We introduce a system of coupled nonlinear equations, valid under the locally treelike ansatz, to describe the size of the largest observable cluster as a function of the fraction of directly observable nodes present in the network. We perform a systematic analysis on 95 real-world graphs and compare our theoretical predictions with numerical simulations of the observability model. Our method provides almost perfect predictions in the majority of the cases, even for networks with very large values of the clustering coefficient. Potential applications of our theory include the development of efficient and scalable algorithms for real-time surveillance of social networks, and monitoring of technological networks.
Collapse
Affiliation(s)
- Yang Yang
- Department of Physics and Astronomy, Northwestern University, Evanston, Illinois 60208, USA
| | - Filippo Radicchi
- Center for Complex Networks and Systems Research, School of Informatics and Computing, Indiana University, Bloomington, Indiana 47408, USA
| |
Collapse
|
22
|
Zhang XF, Ou-Yang L, Dai DQ, Wu MY, Zhu Y, Yan H. Comparative analysis of housekeeping and tissue-specific driver nodes in human protein interaction networks. BMC Bioinformatics 2016; 17:358. [PMID: 27612563 PMCID: PMC5016887 DOI: 10.1186/s12859-016-1233-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2015] [Accepted: 08/31/2016] [Indexed: 12/31/2022] Open
Abstract
Background Several recent studies have used the Minimum Dominating Set (MDS) model to identify driver nodes, which provide the control of the underlying networks, in protein interaction networks. There may exist multiple MDS configurations in a given network, thus it is difficult to determine which one represents the real set of driver nodes. Because these previous studies only focus on static networks and ignore the contextual information on particular tissues, their findings could be insufficient or even be misleading. Results In this study, we develop a Collective-Influence-corrected Minimum Dominating Set (CI-MDS) model which takes into account the collective influence of proteins. By integrating molecular expression profiles and static protein interactions, 16 tissue-specific networks are established as well. We then apply the CI-MDS model to each tissue-specific network to detect MDS proteins. It generates almost the same MDSs when it is solved using different optimization algorithms. In addition, we classify MDS proteins into Tissue-Specific MDS (TS-MDS) proteins and HouseKeeping MDS (HK-MDS) proteins based on the number of tissues in which they are expressed and identified as MDS proteins. Notably, we find that TS-MDS proteins and HK-MDS proteins have significantly different topological and functional properties. HK-MDS proteins are more central in protein interaction networks, associated with more functions, evolving more slowly and subjected to a greater number of post-translational modifications than TS-MDS proteins. Unlike TS-MDS proteins, HK-MDS proteins significantly correspond to essential genes, ageing genes, virus-targeted proteins, transcription factors and protein kinases. Moreover, we find that besides HK-MDS proteins, many TS-MDS proteins are also linked to disease related genes, suggesting the tissue specificity of human diseases. Furthermore, functional enrichment analysis reveals that HK-MDS proteins carry out universally necessary biological processes and TS-MDS proteins usually involve in tissue-dependent functions. Conclusions Our study uncovers key features of TS-MDS proteins and HK-MDS proteins, and is a step forward towards a better understanding of the controllability of human interactomes. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1233-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Xiao-Fei Zhang
- School of Mathematics and Statistics & Hubei Key Laboratory of Mathematical Sciences, Central China Normal University, Luoyu Road, Wuhan, 430079, China
| | - Le Ou-Yang
- College of Information Engineering, Shenzhen University, Nanhai Ave 3688, Shenzhen, 518060, China
| | - Dao-Qing Dai
- Intelligent Data Center and Department of Mathematics, Sun Yat-Sen University, Xingang West Road, Guangzhou, 510275, China.
| | - Meng-Yun Wu
- School of Statistics and Management, Shanghai University of Finance and Economics, Guoding Road, Shanghai, 200433, China
| | - Yuan Zhu
- School of Automation, China University of Geosciences, Lumo Road, Wuhan, 430074, China
| | - Hong Yan
- Department of Electronic and Engineering, City University of Hong Kong, Tat Chee Avenue, Hong Kong, China
| |
Collapse
|
23
|
Nazarieh M, Wiese A, Will T, Hamed M, Helms V. Identification of key player genes in gene regulatory networks. BMC SYSTEMS BIOLOGY 2016; 10:88. [PMID: 27599550 PMCID: PMC5011974 DOI: 10.1186/s12918-016-0329-5] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Accepted: 08/19/2016] [Indexed: 12/01/2022]
Abstract
Background Identifying the gene regulatory networks governing the workings and identity of cells is one of the main challenges in understanding processes such as cellular differentiation, reprogramming or cancerogenesis. One particular challenge is to identify the main drivers and master regulatory genes that control such cell fate transitions. In this work, we reformulate this problem as the optimization problems of computing a Minimum Dominating Set and a Minimum Connected Dominating Set for directed graphs. Results Both MDS and MCDS are applied to the well-studied gene regulatory networks of the model organisms E. coli and S. cerevisiae and to a pluripotency network for mouse embryonic stem cells. The results show that MCDS can capture most of the known key player genes identified so far in the model organisms. Moreover, this method suggests an additional small set of transcription factors as novel key players for governing the cell-specific gene regulatory network which can also be investigated with regard to diseases. To this aim, we investigated the ability of MCDS to define key drivers in breast cancer. The method identified many known drug targets as members of the MDS and MCDS. Conclusions This paper proposes a new method to identify key player genes in gene regulatory networks. The Java implementation of the heuristic algorithm explained in this paper is available as a Cytoscape plugin at http://apps.cytoscape.org/apps/mcds. The SageMath programs for solving integer linear programming formulations used in the paper are available at https://github.com/maryamNazarieh/KeyRegulatoryGenesand as supplementary material. Electronic supplementary material The online version of this article (doi:10.1186/s12918-016-0329-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Maryam Nazarieh
- Center for Bioinformatics, Saarland University, Saarbruecken, Germany.,Graduate School of Computer Science, Saarland University, Saarbruecken, Germany
| | - Andreas Wiese
- Max Planck Institut fuer Informatik (MPII), Saarbruecken, Germany
| | - Thorsten Will
- Center for Bioinformatics, Saarland University, Saarbruecken, Germany.,Graduate School of Computer Science, Saarland University, Saarbruecken, Germany
| | - Mohamed Hamed
- Center for Bioinformatics, Saarland University, Saarbruecken, Germany.,Institute for Biostatistics and Informatics in Medicine and Ageing Research, University of Rostock, Rostock, Germany
| | - Volkhard Helms
- Center for Bioinformatics, Saarland University, Saarbruecken, Germany.
| |
Collapse
|
24
|
Rund SSC, Yoo B, Alam C, Green T, Stephens MT, Zeng E, George GF, Sheppard AD, Duffield GE, Milenković T, Pfrender ME. Genome-wide profiling of 24 hr diel rhythmicity in the water flea, Daphnia pulex: network analysis reveals rhythmic gene expression and enhances functional gene annotation. BMC Genomics 2016; 17:653. [PMID: 27538446 PMCID: PMC4991082 DOI: 10.1186/s12864-016-2998-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2016] [Accepted: 08/05/2016] [Indexed: 11/16/2022] Open
Abstract
Background Marine and freshwater zooplankton exhibit daily rhythmic patterns of behavior and physiology which may be regulated directly by the light:dark (LD) cycle and/or a molecular circadian clock. One of the best-studied zooplankton taxa, the freshwater crustacean Daphnia, has a 24 h diel vertical migration (DVM) behavior whereby the organism travels up and down through the water column daily. DVM plays a critical role in resource tracking and the behavioral avoidance of predators and damaging ultraviolet radiation. However, there is little information at the transcriptional level linking the expression patterns of genes to the rhythmic physiology/behavior of Daphnia. Results Here we analyzed genome-wide temporal transcriptional patterns from Daphnia pulex collected over a 44 h time period under a 12:12 LD cycle (diel) conditions using a cosine-fitting algorithm. We used a comprehensive network modeling and analysis approach to identify novel co-regulated rhythmic genes that have similar network topological properties and functional annotations as rhythmic genes identified by the cosine-fitting analyses. Furthermore, we used the network approach to predict with high accuracy novel gene-function associations, thus enhancing current functional annotations available for genes in this ecologically relevant model species. Our results reveal that genes in many functional groupings exhibit 24 h rhythms in their expression patterns under diel conditions. We highlight the rhythmic expression of immunity, oxidative detoxification, and sensory process genes. We discuss differences in the chronobiology of D. pulex from other well-characterized terrestrial arthropods. Conclusions This research adds to a growing body of literature suggesting the genetic mechanisms governing rhythmicity in crustaceans may be divergent from other arthropod lineages including insects. Lastly, these results highlight the power of using a network analysis approach to identify differential gene expression and provide novel functional annotation. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2998-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Samuel S C Rund
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, 46556, USA.,Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, 46556, USA.,Centre for Immunity, Infection and Evolution, Institute of Evolution, University of Edinburgh, Edinburgh, EH9 3FL, UK.,Institute of Immunology and Infection Research, School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3FL, UK
| | - Boyoung Yoo
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, 46556, USA.,Present Address: Department of Computer Science, Stanford University, Stanford, CA, 94305, USA
| | - Camille Alam
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Taryn Green
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Melissa T Stephens
- Notre Dame Genomics and Bioinformatics Core Facility, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Erliang Zeng
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, 46556, USA.,Notre Dame Genomics and Bioinformatics Core Facility, University of Notre Dame, Notre Dame, IN, 46556, USA.,Present Address: Department of Biology, University of South Dakota, Vermillion, SD, 57069, USA.,Present Address: Department of Computer Science, University of South Dakota, Vermillion, SD, 57069, USA
| | - Gary F George
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, 46556, USA.,Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Aaron D Sheppard
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, 46556, USA.,Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Giles E Duffield
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, 46556, USA.,Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Tijana Milenković
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, 46556, USA.,Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, 46556, USA.,Interdisciplinary Center for Network Science and Applications (iCeNSA), University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Michael E Pfrender
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, 46556, USA. .,Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, 46556, USA. .,Notre Dame Environmental Change Initiative, University of Notre Dame, Notre Dame, IN, 46556, USA.
| |
Collapse
|
25
|
Takemoto K, Akutsu T. Analysis of the Effect of Degree Correlation on the Size of Minimum Dominating Sets in Complex Networks. PLoS One 2016; 11:e0157868. [PMID: 27327273 PMCID: PMC4915616 DOI: 10.1371/journal.pone.0157868] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2016] [Accepted: 06/06/2016] [Indexed: 01/09/2023] Open
Abstract
Network controllability is an important topic in wide-ranging research fields. However, the relationship between controllability and network structure is poorly understood, although degree heterogeneity is known to determine the controllability. We focus on the size of a minimum dominating set (MDS), a measure of network controllability, and investigate the effect of degree-degree correlation, which is universally observed in real-world networks, on the size of an MDS. We show that disassortativity or negative degree-degree correlation reduces the size of an MDS using analytical treatments and numerical simulation, whereas positive correlations hardly affect the size of an MDS. This result suggests that disassortativity enhances network controllability. Furthermore, apart from the controllability issue, the developed techniques provide new ways of analyzing complex networks with degree-degree correlations.
Collapse
Affiliation(s)
- Kazuhiro Takemoto
- Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, Iizuka, Fukuoka, Japan
- * E-mail: (KT); (TA)
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto, Japan
- * E-mail: (KT); (TA)
| |
Collapse
|
26
|
Mayer G, Marcus K, Eisenacher M, Kohl M. Boolean modeling techniques for protein co-expression networks in systems medicine. Expert Rev Proteomics 2016; 13:555-69. [PMID: 27105325 DOI: 10.1080/14789450.2016.1181546] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
INTRODUCTION Application of systems biology/systems medicine approaches is promising for proteomics/biomedical research, but requires selection of an adequate modeling type. AREAS COVERED This article reviews the existing Boolean network modeling approaches, which provide in comparison with alternative modeling techniques several advantages for the processing of proteomics data. Application of methods for inference, reduction and validation of protein co-expression networks that are derived from quantitative high-throughput proteomics measurements is presented. It's also shown how Boolean models can be used to derive system-theoretic characteristics that describe both the dynamical behavior of such networks as a whole and the properties of different cell states (e.g. healthy or diseased cell states). Furthermore, application of methods derived from control theory is proposed in order to simulate the effects of therapeutic interventions on such networks, which is a promising approach for the computer-assisted discovery of biomarkers and drug targets. Finally, the clinical application of Boolean modeling analyses is discussed. Expert commentary: Boolean modeling of proteomics data is still in its infancy. Progress in this field strongly depends on provision of a repository with public access to relevant reference models. Also required are community supported standards that facilitate input of both proteomics and patient related data (e.g. age, gender, laboratory results, etc.).
Collapse
Affiliation(s)
- Gerhard Mayer
- a Medizinisches Proteom Center (MPC) , Ruhr-Universität Bochum , Bochum , Germany
| | - Katrin Marcus
- a Medizinisches Proteom Center (MPC) , Ruhr-Universität Bochum , Bochum , Germany
| | - Martin Eisenacher
- a Medizinisches Proteom Center (MPC) , Ruhr-Universität Bochum , Bochum , Germany
| | - Michael Kohl
- a Medizinisches Proteom Center (MPC) , Ruhr-Universität Bochum , Bochum , Germany
| |
Collapse
|
27
|
Ishitsuka M, Akutsu T, Nacher JC. Critical controllability in proteome-wide protein interaction network integrating transcriptome. Sci Rep 2016; 6:23541. [PMID: 27040162 PMCID: PMC4819195 DOI: 10.1038/srep23541] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2015] [Accepted: 02/29/2016] [Indexed: 01/09/2023] Open
Abstract
Recently, the number of essential gene entries has considerably increased. However, little is known about the relationships between essential genes and their functional roles in critical network control at both the structural (protein interaction network) and dynamic (transcriptional) levels, in part because the large size of the network prevents extensive computational analysis. Here, we present an algorithm that identifies the critical control set of nodes by reducing the computational time by 180 times and by expanding the computable network size up to 25 times, from 1,000 to 25,000 nodes. The developed algorithm allows a critical controllability analysis of large integrated systems composed of a transcriptome- and proteome-wide protein interaction network for the first time. The data-driven analysis captures a direct triad association of the structural controllability of genes, lethality and dynamic synchronization of co-expression. We believe that the identified optimized critical network control subsets may be of interest as drug targets; thus, they may be useful for drug design and development.
Collapse
Affiliation(s)
- Masayuki Ishitsuka
- Department of Information Science, Faculty of Science, Toho University, Funabashi, 274-8510, Japan
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji 611-0011, Japan
| | - Jose C Nacher
- Department of Information Science, Faculty of Science, Toho University, Funabashi, 274-8510, Japan
| |
Collapse
|
28
|
Nacher JC, Akutsu T. Minimum dominating set-based methods for analyzing biological networks. Methods 2016; 102:57-63. [PMID: 26773457 DOI: 10.1016/j.ymeth.2015.12.017] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2015] [Revised: 11/16/2015] [Accepted: 12/16/2015] [Indexed: 12/24/2022] Open
Abstract
The fast increase of 'multi-omics' data does not only pose a computational challenge for its analysis but also requires novel algorithmic methodologies to identify complex biological patterns and decipher the ultimate roots of human disorders. To that end, the massive integration of omics data with disease phenotypes is offering a new window into the cell functionality. The minimum dominating set (MDS) approach has rapidly emerged as a promising algorithmic method to analyze complex biological networks integrated with human disorders, which can be composed of a variety of omics data, from proteomics and transcriptomics to metabolomics. Here we review the main theoretical foundations of the methodology and the key algorithms, and examine the recent applications in which biological systems are analyzed by using the MDS approach.
Collapse
Affiliation(s)
- Jose C Nacher
- Department of Information Science, Faculty of Science, Toho University, Miyama 2-2-1, Funabashi, Chiba 274-8510, Japan.
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto 611-0011, Japan.
| |
Collapse
|
29
|
Kagami H, Akutsu T, Maegawa S, Hosokawa H, Nacher JC. Determining Associations between Human Diseases and non-coding RNAs with Critical Roles in Network Control. Sci Rep 2015; 5:14577. [PMID: 26459019 PMCID: PMC4602215 DOI: 10.1038/srep14577] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 09/03/2015] [Indexed: 12/19/2022] Open
Abstract
Deciphering the association between life molecules and human diseases is currently an important task in systems biology. Research over the past decade has unveiled that the human genome is almost entirely transcribed, producing a vast number of non-protein-coding RNAs (ncRNAs) with potential regulatory functions. More recent findings suggest that many diseases may not be exclusively linked to mutations in protein-coding genes. The combination of these arguments poses the question of whether ncRNAs that play a critical role in network control are also enriched with disease-associated ncRNAs. To address this question, we mapped the available annotated information of more than 350 human disorders to the largest collection of human ncRNA-protein interactions, which define a bipartite network of almost 93,000 interactions. Using a novel algorithmic-based controllability framework applied to the constructed bipartite network, we found that ncRNAs engaged in critical network control are also statistically linked to human disorders (P-value of P = 9.8 × 10−109). Taken together, these findings suggest that the addition of those genes that encode optimized subsets of ncRNAs engaged in critical control within the pool of candidate genes could aid disease gene prioritization studies.
Collapse
Affiliation(s)
- Haruna Kagami
- Department of Information Science, Faculty of Science, Toho University, Funabashi, 274-8510, Japan
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji 611-0011, Japan
| | - Shingo Maegawa
- Department of Intelligence Science and Technology, Graduate School of Informatics, Kyoto University, Kyoto 606-8501, Japan
| | - Hiroshi Hosokawa
- Department of Intelligence Science and Technology, Graduate School of Informatics, Kyoto University, Kyoto 606-8501, Japan
| | - Jose C Nacher
- Department of Information Science, Faculty of Science, Toho University, Funabashi, 274-8510, Japan
| |
Collapse
|
30
|
Nguyen TP, Priami C, Caberlotto L. Novel drug target identification for the treatment of dementia using multi-relational association mining. Sci Rep 2015; 5:11104. [PMID: 26154857 PMCID: PMC4495601 DOI: 10.1038/srep11104] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2015] [Accepted: 05/13/2015] [Indexed: 12/12/2022] Open
Abstract
Dementia is a neurodegenerative condition of the brain in which there is a progressive and permanent loss of cognitive and mental performance. Despite the fact that the number of people with dementia worldwide is steadily increasing and regardless of the advances in the molecular characterization of the disease, current medical treatments for dementia are purely symptomatic and hardly effective. We present a novel multi-relational association mining method that integrates the huge amount of scientific data accumulated in recent years to predict potential novel targets for innovative therapeutic treatment of dementia. Owing to the ability of processing large volumes of heterogeneous data, our method achieves a high performance and predicts numerous drug targets including several serine threonine kinase and a G-protein coupled receptor. The predicted drug targets are mainly functionally related to metabolism, cell surface receptor signaling pathways, immune response, apoptosis, and long-term memory. Among the highly represented kinase family and among the G-protein coupled receptors, DLG4 (PSD-95), and the bradikynin receptor 2 are highlighted also for their proposed role in memory and cognition, as described in previous studies. These novel putative targets hold promises for the development of novel therapeutic approaches for the treatment of dementia.
Collapse
Affiliation(s)
- Thanh-Phuong Nguyen
- The Microsoft Research, University of Trento Centre for Computational Systems Biology (COSBI), Piazza Manifattura 1, 38068, Rovereto, Italy
- Life Sciences Research Unit, University of Luxembourg, 162 A, avenue de la Faïencerie, L-1511 Luxembourg
| | - Corrado Priami
- The Microsoft Research, University of Trento Centre for Computational Systems Biology (COSBI), Piazza Manifattura 1, 38068, Rovereto, Italy
- Department of Mathematics, University of Trento, Via Sommarive, 14-38123 Povo, Italy
| | - Laura Caberlotto
- The Microsoft Research, University of Trento Centre for Computational Systems Biology (COSBI), Piazza Manifattura 1, 38068, Rovereto, Italy
| |
Collapse
|
31
|
Hulovatyy Y, Chen H, Milenković T. Exploring the structure and function of temporal networks with dynamic graphlets. Bioinformatics 2015; 31:i171-80. [PMID: 26072480 PMCID: PMC4765862 DOI: 10.1093/bioinformatics/btv227] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
MOTIVATION With increasing availability of temporal real-world networks, how to efficiently study these data? One can model a temporal network as a single aggregate static network, or as a series of time-specific snapshots, each being an aggregate static network over the corresponding time window. Then, one can use established methods for static analysis on the resulting aggregate network(s), but losing in the process valuable temporal information either completely, or at the interface between different snapshots, respectively. Here, we develop a novel approach for studying a temporal network more explicitly, by capturing inter-snapshot relationships. RESULTS We base our methodology on well-established graphlets (subgraphs), which have been proven in numerous contexts in static network research. We develop new theory to allow for graphlet-based analyses of temporal networks. Our new notion of dynamic graphlets is different from existing dynamic network approaches that are based on temporal motifs (statistically significant subgraphs). The latter have limitations: their results depend on the choice of a null network model that is required to evaluate the significance of a subgraph, and choosing a good null model is non-trivial. Our dynamic graphlets overcome the limitations of the temporal motifs. Also, when we aim to characterize the structure and function of an entire temporal network or of individual nodes, our dynamic graphlets outperform the static graphlets. Clearly, accounting for temporal information helps. We apply dynamic graphlets to temporal age-specific molecular network data to deepen our limited knowledge about human aging. AVAILABILITY AND IMPLEMENTATION http://www.nd.edu/∼cone/DG.
Collapse
Affiliation(s)
- Y Hulovatyy
- Department of Computer Science and Engineering, Interdisciplinary Center for Network Science and Applications, and ECK Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| | - H Chen
- Department of Computer Science and Engineering, Interdisciplinary Center for Network Science and Applications, and ECK Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| | - T Milenković
- Department of Computer Science and Engineering, Interdisciplinary Center for Network Science and Applications, and ECK Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| |
Collapse
|
32
|
Crawford J, Sun Y, Milenković T. Fair evaluation of global network aligners. Algorithms Mol Biol 2015; 10:19. [PMID: 26060505 PMCID: PMC4460690 DOI: 10.1186/s13015-015-0050-8] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2014] [Accepted: 05/10/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Analogous to genomic sequence alignment, biological network alignment identifies conserved regions between networks of different species. Then, function can be transferred from well- to poorly-annotated species between aligned network regions. Network alignment typically encompasses two algorithmic components: node cost function (NCF), which measures similarities between nodes in different networks, and alignment strategy (AS), which uses these similarities to rapidly identify high-scoring alignments. Different methods use both different NCFs and different ASs. Thus, it is unclear whether the superiority of a method comes from its NCF, its AS, or both. We already showed on state-of-the-art methods, MI-GRAAL and IsoRankN, that combining NCF of one method and AS of another method can give a new superior method. Here, we evaluate MI-GRAAL against a newer approach, GHOST, by mixing-and-matching the methods' NCFs and ASs to potentially further improve alignment quality. While doing so, we approach important questions that have not been asked systematically thus far. First, we ask how much of the NCF information should come from protein sequence data compared to network topology data. Existing methods determine this parameter more-less arbitrarily, which could affect alignment quality. Second, when topological information is used in NCF, we ask how large the size of the neighborhoods of the compared nodes should be. Existing methods assume that the larger the neighborhood size, the better. RESULTS Our findings are as follows. MI-GRAAL's NCF is superior to GHOST's NCF, while the performance of the methods' ASs is data-dependent. Thus, for data on which GHOST's AS is superior to MI-GRAAL's AS, the combination of MI-GRAAL's NCF and GHOST's AS represents a new superior method. Also, which amount of sequence information is used within NCF does not affect alignment quality, while the inclusion of topological information is crucial for producing good alignments. Finally, larger neighborhood sizes are preferred, but often, it is the second largest size that is superior. Using this size instead of the largest one would decrease computational complexity. CONCLUSION Taken together, our results represent general recommendations for a fair evaluation of network alignment methods and in particular of two-stage NCF-AS approaches.
Collapse
|
33
|
Zhang XF, Ou-Yang L, Zhu Y, Wu MY, Dai DQ. Determining minimum set of driver nodes in protein-protein interaction networks. BMC Bioinformatics 2015; 16:146. [PMID: 25947063 PMCID: PMC4428234 DOI: 10.1186/s12859-015-0591-3] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2014] [Accepted: 04/22/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Recently, several studies have drawn attention to the determination of a minimum set of driver proteins that are important for the control of the underlying protein-protein interaction (PPI) networks. In general, the minimum dominating set (MDS) model is widely adopted. However, because the MDS model does not generate a unique MDS configuration, multiple different MDSs would be generated when using different optimization algorithms. Therefore, among these MDSs, it is difficult to find out the one that represents the true driver set of proteins. RESULTS To address this problem, we develop a centrality-corrected minimum dominating set (CC-MDS) model which includes heterogeneity in degree and betweenness centralities of proteins. Both the MDS model and the CC-MDS model are applied on three human PPI networks. Unlike the MDS model, the CC-MDS model generates almost the same sets of driver proteins when we implement it using different optimization algorithms. The CC-MDS model targets more high-degree and high-betweenness proteins than the uncorrected counterpart. The more central position allows CC-MDS proteins to be more important in maintaining the overall network connectivity than MDS proteins. To indicate the functional significance, we find that CC-MDS proteins are involved in, on average, more protein complexes and GO annotations than MDS proteins. We also find that more essential genes, aging genes, disease-associated genes and virus-targeted genes appear in CC-MDS proteins than in MDS proteins. As for the involvement in regulatory functions, the sets of CC-MDS proteins show much stronger enrichment of transcription factors and protein kinases. The results about topological and functional significance demonstrate that the CC-MDS model can capture more driver proteins than the MDS model. CONCLUSIONS Based on the results obtained, the CC-MDS model presents to be a powerful tool for the determination of driver proteins that can control the underlying PPI networks. The software described in this paper and the datasets used are available at https://github.com/Zhangxf-ccnu/CC-MDS .
Collapse
Affiliation(s)
- Xiao-Fei Zhang
- School of Mathematics and Statistics, Central China Normal University, Luoyu Road, Wuhan, 430079, China.
| | - Le Ou-Yang
- Intelligent Data Center and Department of Mathematics, Sun Yat-Sen University, Xingang West Road, Guangzhou, 510275, China.
| | - Yuan Zhu
- School of Mathematics and Statistics, Guangdong University of Finance and Economics, ChiSha Road, Guangzhou, 510320, China.
| | - Meng-Yun Wu
- School of Statistics and Management, Shanghai University of Finance and Economics, Guoding Road, Shanghai, 200433, China.
| | - Dao-Qing Dai
- Intelligent Data Center and Department of Mathematics, Sun Yat-Sen University, Xingang West Road, Guangzhou, 510275, China.
| |
Collapse
|
34
|
Khuri S, Wuchty S. Essentiality and centrality in protein interaction networks revisited. BMC Bioinformatics 2015; 16:109. [PMID: 25880655 PMCID: PMC4411940 DOI: 10.1186/s12859-015-0536-x] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2014] [Accepted: 03/13/2015] [Indexed: 12/29/2022] Open
Abstract
Background Minimum dominating sets (MDSet) of protein interaction networks allow the control of underlying protein interaction networks through their topological placement. While essential proteins are enriched in MDSets, we hypothesize that the statistical properties of biological functions of essential genes are enhanced when we focus on essential MDSet proteins (e-MDSet). Results Here, we determined minimum dominating sets of proteins (MDSet) in interaction networks of E. coli, S. cerevisiae and H. sapiens, defined as subsets of proteins whereby each remaining protein can be reached by a single interaction. We compared several topological and functional parameters of essential, MDSet, and essential MDSet (e-MDSet) proteins. In particular, we observed that their topological placement allowed e-MDSet proteins to provide a positive correlation between degree and lethality, connect more protein complexes, and have a stronger impact on network resilience than essential proteins alone. In comparison to essential proteins we further found that interactions between e-MDSet proteins appeared more frequently within complexes, while interactions of e-MDSet proteins between complexes were depleted. Finally, these e-MDSet proteins classified into functional groupings that play a central role in survival and adaptability. Conclusions The determination of e-MDSet of an organism highlights a set of proteins that enhances the enrichment signals of biological functions of essential proteins. As a consequence, we surmise that e-MDSets may provide a new method of evaluating the core proteins of an organism.
Collapse
Affiliation(s)
- Sawsan Khuri
- Department of Computer Science, University of Miami, Coral Gables, FL, 33146, USA. .,Center for Computational Science, University of Miami, Coral Gables, FL, 33146, USA.
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Coral Gables, FL, 33146, USA. .,Center for Computational Science, University of Miami, Coral Gables, FL, 33146, USA.
| |
Collapse
|
35
|
Sun Y, Crawford J, Tang J, Milenković T. Simultaneous Optimization of both Node and Edge Conservation in Network Alignment via WAVE. LECTURE NOTES IN COMPUTER SCIENCE 2015. [DOI: 10.1007/978-3-662-48221-6_2] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
36
|
Faisal FE, Zhao H, Milenkovic T. Global Network Alignment in the Context of Aging. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015; 12:40-52. [PMID: 26357077 DOI: 10.1109/tcbb.2014.2326862] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Analogous to sequence alignment, network alignment (NA) can be used to transfer biological knowledge across species between conserved network regions. NA faces two algorithmic challenges: 1) Which cost function to use to capture "similarities" between nodes in different networks? 2) Which alignment strategy to use to rapidly identify "high-scoring" alignments from all possible alignments? We "break down" existing state-of-the-art methods that use both different cost functions and different alignment strategies to evaluate each combination of their cost functions and alignment strategies. We find that a combination of the cost function of one method and the alignment strategy of another method beats the existing methods. Hence, we propose this combination as a novel superior NA method. Then, since human aging is hard to study experimentally due to long lifespan, we use NA to transfer aging-related knowledge from well annotated model species to poorly annotated human. By doing so, we produce novel human aging-related knowledge, which complements currently available knowledge about aging that has been obtained mainly by sequence alignment. We demonstrate significant similarity between topological and functional properties of our novel predictions and those of known aging-related genes. We are the first to use NA to learn more about aging.
Collapse
|
37
|
Sun K, Gonçalves JP, Larminie C, Przulj N. Predicting disease associations via biological network analysis. BMC Bioinformatics 2014; 15:304. [PMID: 25228247 PMCID: PMC4174675 DOI: 10.1186/1471-2105-15-304] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2013] [Accepted: 08/19/2014] [Indexed: 12/11/2022] Open
Abstract
Background Understanding the relationship between diseases based on the underlying biological mechanisms is one of the greatest challenges in modern biology and medicine. Exploring disease-disease associations by using system-level biological data is expected to improve our current knowledge of disease relationships, which may lead to further improvements in disease diagnosis, prognosis and treatment. Results We took advantage of diverse biological data including disease-gene associations and a large-scale molecular network to gain novel insights into disease relationships. We analysed and compared four publicly available disease-gene association datasets, then applied three disease similarity measures, namely annotation-based measure, function-based measure and topology-based measure, to estimate the similarity scores between diseases. We systematically evaluated disease associations obtained by these measures against a statistical measure of comorbidity which was derived from a large number of medical patient records. Our results show that the correlation between our similarity measures and comorbidity scores is substantially higher than expected at random, confirming that our similarity measures are able to recover comorbidity associations. We also demonstrated that our predicted disease associations correlated with disease associations generated from genome-wide association studies significantly higher than expected at random. Furthermore, we evaluated our predicted disease associations via mining the literature on PubMed, and presented case studies to demonstrate how these novel disease associations can be used to enhance our current knowledge of disease relationships. Conclusions We present three similarity measures for predicting disease associations. The strong correlation between our predictions and known disease associations demonstrates the ability of our measures to provide novel insights into disease relationships. Electronic supplementary material The online version of this article (doi:10.1186/1471-2105-15-304) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | - Nataša Przulj
- Department of Computing, Imperial College London, London, SW7 2AZ, UK.
| |
Collapse
|
38
|
Abstract
A fundamental goal of systems biology is to create models that describe relationships between biological components. Networks are an increasingly popular approach to this problem. However, a scientist interested in modeling biological (e.g., gene expression) data as a network is quickly confounded by the fundamental problem: how to construct the network? It is fairly easy to construct a network, but is it the network for the problem being considered? This is an important problem with three fundamental issues: How to weight edges in the network in order to capture actual biological interactions? What is the effect of the type of biological experiment used to collect the data from which the network is constructed? How to prune the weighted edges (or what cut-off to apply)? Differences in the construction of networks could lead to different biological interpretations. Indeed, we find that there are statistically significant dissimilarities in the functional content and topology between gene co-expression networks constructed using different edge weighting methods, data types, and edge cut-offs. We show that different types of known interactions, such as those found through Affinity Capture-Luminescence or Synthetic Lethality experiments, appear in significantly varying amounts in networks constructed in different ways. Hence, we demonstrate that different biological questions may be answered by the different networks. Consequently, we posit that the approach taken to build a network can be matched to biological questions to get targeted answers. More study is required to understand the implications of different network inference approaches and to draw reliable conclusions from networks used in the field of systems biology.
Collapse
|
39
|
Pinto JP, Machado RSR, Xavier JM, Futschik ME. Targeting molecular networks for drug research. Front Genet 2014; 5:160. [PMID: 24926314 PMCID: PMC4045242 DOI: 10.3389/fgene.2014.00160] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2014] [Accepted: 05/14/2014] [Indexed: 01/18/2023] Open
Abstract
The study of molecular networks has recently moved into the limelight of biomedical research. While it has certainly provided us with plenty of new insights into cellular mechanisms, the challenge now is how to modify or even restructure these networks. This is especially true for human diseases, which can be regarded as manifestations of distorted states of molecular networks. Of the possible interventions for altering networks, the use of drugs is presently the most feasible. In this mini-review, we present and discuss some exemplary approaches of how analysis of molecular interaction networks can contribute to pharmacology (e.g., by identifying new drug targets or prediction of drug side effects), as well as list pointers to relevant resources and software to guide future research. We also outline recent progress in the use of drugs for in vitro reprogramming of cells, which constitutes an example par excellence for altering molecular interaction networks with drugs.
Collapse
Affiliation(s)
- José P Pinto
- SysBioLab, Centre for Molecular and Structural Biomedicine, Universidade do Algarve Faro, Portugal
| | - Rui S R Machado
- SysBioLab, Centre for Molecular and Structural Biomedicine, Universidade do Algarve Faro, Portugal
| | - Joana M Xavier
- SysBioLab, Centre for Molecular and Structural Biomedicine, Universidade do Algarve Faro, Portugal
| | - Matthias E Futschik
- SysBioLab, Centre for Molecular and Structural Biomedicine, Universidade do Algarve Faro, Portugal ; Centre of Marine Sciences, Universidade do Algarve Faro, Portugal
| |
Collapse
|
40
|
Hulovatyy Y, Solava RW, Milenković T. Revealing missing parts of the interactome via link prediction. PLoS One 2014; 9:e90073. [PMID: 24594900 PMCID: PMC3940777 DOI: 10.1371/journal.pone.0090073] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2013] [Accepted: 01/29/2014] [Indexed: 12/20/2022] Open
Abstract
Protein interaction networks (PINs) are often used to "learn" new biological function from their topology. Since current PINs are noisy, their computational de-noising via link prediction (LP) could improve the learning accuracy. LP uses the existing PIN topology to predict missing and spurious links. Many of existing LP methods rely on shared immediate neighborhoods of the nodes to be linked. As such, they have limitations. Thus, in order to comprehensively study what are the topological properties of nodes in PINs that dictate whether the nodes should be linked, we introduce novel sensitive LP measures that are expected to overcome the limitations of the existing methods. We systematically evaluate the new and existing LP measures by introducing "synthetic" noise into PINs and measuring how accurate the measures are in reconstructing the original PINs. Also, we use the LP measures to de-noise the original PINs, and we measure biological correctness of the de-noised PINs with respect to functional enrichment of the predicted interactions. Our main findings are: 1) LP measures that favor nodes which are both "topologically similar" and have large shared extended neighborhoods are superior; 2) using more network topology often though not always improves LP accuracy; and 3) LP improves biological correctness of the PINs, plus we validate a significant portion of the predicted interactions in independent, external PIN data sources. Ultimately, we are less focused on identifying a superior method but more on showing that LP improves biological correctness of PINs, which is its ultimate goal in computational biology. But we note that our new methods outperform each of the existing ones with respect to at least one evaluation criterion. Alarmingly, we find that the different criteria often disagree in identifying the best method(s), which has important implications for LP communities in any domain, including social networks.
Collapse
Affiliation(s)
- Yuriy Hulovatyy
- Department of Computer Science and Engineering, ECK Institute for Global Health, and Interdisciplinary Center for Network Science and Applications, University of Notre Dame, Notre Dame, Indiana, United States of America
| | - Ryan W. Solava
- Department of Computer Science and Engineering, ECK Institute for Global Health, and Interdisciplinary Center for Network Science and Applications, University of Notre Dame, Notre Dame, Indiana, United States of America
| | - Tijana Milenković
- Department of Computer Science and Engineering, ECK Institute for Global Health, and Interdisciplinary Center for Network Science and Applications, University of Notre Dame, Notre Dame, Indiana, United States of America
- * E-mail:
| |
Collapse
|
41
|
Abstract
MOTIVATION Because susceptibility to diseases increases with age, studying aging gains importance. Analyses of gene expression or sequence data, which have been indispensable for investigating aging, have been limited to studying genes and their protein products in isolation, ignoring their connectivities. However, proteins function by interacting with other proteins, and this is exactly what biological networks (BNs) model. Thus, analyzing the proteins' BN topologies could contribute to the understanding of aging. Current methods for analyzing systems-level BNs deal with their static representations, even though cells are dynamic. For this reason, and because different data types can give complementary biological insights, we integrate current static BNs with aging-related gene expression data to construct dynamic age-specific BNs. Then, we apply sensitive measures of topology to the dynamic BNs to study cellular changes with age. RESULTS While global BN topologies do not significantly change with age, local topologies of a number of genes do. We predict such genes to be aging-related. We demonstrate credibility of our predictions by (i) observing significant overlap between our predicted aging-related genes and 'ground truth' aging-related genes; (ii) observing significant overlap between functions and diseases that are enriched in our aging-related predictions and those that are enriched in 'ground truth' aging-related data; (iii) providing evidence that diseases which are enriched in our aging-related predictions are linked to human aging; and (iv) validating our high-scoring novel predictions in the literature. AVAILABILITY AND IMPLEMENTATION Software executables are available upon request.
Collapse
Affiliation(s)
- Fazle E Faisal
- Department of Computer Science and Engineering, ECK Institute for Global Health and Interdisciplinary Center for Network Science and Applications, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Tijana Milenković
- Department of Computer Science and Engineering, ECK Institute for Global Health and Interdisciplinary Center for Network Science and Applications, University of Notre Dame, Notre Dame, IN 46556, USA
| |
Collapse
|
42
|
Pradhan MP, Desai A, Palakal MJ. Systems biology approach to stage-wise characterization of epigenetic genes in lung adenocarcinoma. BMC SYSTEMS BIOLOGY 2013; 7:141. [PMID: 24369052 PMCID: PMC3882327 DOI: 10.1186/1752-0509-7-141] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/01/2013] [Accepted: 12/16/2013] [Indexed: 12/12/2022]
Abstract
Background Epigenetics refers to the reversible functional modifications of the genome that do not correlate to changes in the DNA sequence. The aim of this study is to understand DNA methylation patterns across different stages of lung adenocarcinoma (LUAD). Results Our study identified 72, 93 and 170 significant DNA methylated genes in Stages I, II and III respectively. A set of common 34 significant DNA methylated genes located in the promoter section of the true CpG islands were found across stages, and these were: HOX genes, FOXG1, GRIK3, HAND2, PRKCB, etc. Of the total significant DNA methylated genes, 65 correlated with transcription function. The epigenetic analysis identified the following novel genes across all stages: PTGDR, TLX3, and POU4F2. The stage-wise analysis observed the appearance of NEUROG1 gene in Stage I and its re-appearance in Stage III. The analysis showed similar epigenetic pattern across Stage I and Stage III. Pathway analysis revealed important signaling and metabolic pathways of LUAD to correlate with epigenetics. Epigenetic subnetwork analysis identified a set of seven conserved genes across all stages: UBC, KRAS, PIK3CA, PIK3R3, RAF1, BRAF, and RAP1A. A detailed literature analysis elucidated epigenetic genes like FOXG1, HLA-G, and NKX6-2 to be known as prognostic targets. Conclusion Integrating epigenetic information for genes with expression data can be useful for comprehending in-depth disease mechanism and for the ultimate goal of better target identification.
Collapse
Affiliation(s)
| | | | - Mathew J Palakal
- School of Informatics and Computing, Indiana University Purdue University Indianapolis, Indianapolis IN, USA.
| |
Collapse
|
43
|
Abstract
MOTIVATION Small-induced subgraphs called graphlets are emerging as a possible tool for exploration of global and local structure of networks and for analysis of roles of individual nodes. One of the obstacles to their wider use is the computational complexity of algorithms for their discovery and counting. RESULTS We propose a new combinatorial method for counting graphlets and orbit signatures of network nodes. The algorithm builds a system of equations that connect counts of orbits from graphlets with up to five nodes, which allows to compute all orbit counts by enumerating just a single one. This reduces its practical time complexity in sparse graphs by an order of magnitude as compared with the existing pure enumeration-based algorithms. AVAILABILITY AND IMPLEMENTATION Source code is available freely at http://www.biolab.si/supp/orca/orca.html.
Collapse
Affiliation(s)
- Tomaž Hočevar
- Faculty of Computer and Information Science, University of Ljubljana, SI-1000 Ljubljana, Slovenia
| | | |
Collapse
|
44
|
Tian D, Choi KP. Sharp bounds and normalization of Wiener-type indices. PLoS One 2013; 8:e78448. [PMID: 24260118 PMCID: PMC3832646 DOI: 10.1371/journal.pone.0078448] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2013] [Accepted: 09/11/2013] [Indexed: 11/21/2022] Open
Abstract
Complex networks abound in physical, biological and social sciences. Quantifying a network’s topological structure facilitates network exploration and analysis, and network comparison, clustering and classification. A number of Wiener type indices have recently been incorporated as distance-based descriptors of complex networks, such as the R package QuACN. Wiener type indices are known to depend both on the network’s number of nodes and topology. To apply these indices to measure similarity of networks of different numbers of nodes, normalization of these indices is needed to correct the effect of the number of nodes in a network. This paper aims to fill this gap. Moreover, we introduce an -Wiener index of network , denoted by . This notion generalizes the Wiener index to a very wide class of Wiener type indices including all known Wiener type indices. We identify the maximum and minimum of over a set of networks with nodes. We then introduce our normalized-version of -Wiener index. The normalized -Wiener indices were demonstrated, in a number of experiments, to improve significantly the hierarchical clustering over the non-normalized counterparts.
Collapse
Affiliation(s)
- Dechao Tian
- Department of Statistics and Applied Probability, National University of Singapore, Singapore, Singapore
| | - Kwok Pui Choi
- Department of Statistics and Applied Probability, National University of Singapore, Singapore, Singapore
- Department of Mathematics, National University of Singapore, Singapore, Singapore
- * E-mail:
| |
Collapse
|
45
|
Pluchon PF, Fouqueau T, Crezé C, Laurent S, Briffotaux J, Hogrel G, Palud A, Henneke G, Godfroy A, Hausner W, Thomm M, Nicolas J, Flament D. An extended network of genomic maintenance in the archaeon Pyrococcus abyssi highlights unexpected associations between eucaryotic homologs. PLoS One 2013; 8:e79707. [PMID: 24244547 PMCID: PMC3820547 DOI: 10.1371/journal.pone.0079707] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2013] [Accepted: 09/24/2013] [Indexed: 11/18/2022] Open
Abstract
In Archaea, the proteins involved in the genetic information processing pathways, including DNA replication, transcription, and translation, share strong similarities with those of eukaryotes. Characterizations of components of the eukaryotic-type replication machinery complex provided many interesting insights into DNA replication in both domains. In contrast, DNA repair processes of hyperthermophilic archaea are less well understood and very little is known about the intertwining between DNA synthesis, repair and recombination pathways. The development of genetic system in hyperthermophilic archaea is still at a modest stage hampering the use of complementary approaches of reverse genetics and biochemistry to elucidate the function of new candidate DNA repair gene. To gain insights into genomic maintenance processes in hyperthermophilic archaea, a protein-interaction network centred on informational processes of Pyrococcus abyssi was generated by affinity purification coupled with mass spectrometry. The network consists of 132 interactions linking 87 proteins. These interactions give insights into the connections of DNA replication with recombination and repair, leading to the discovery of new archaeal components and of associations between eucaryotic homologs. Although this approach did not allow us to clearly delineate new DNA pathways, it provided numerous clues towards the function of new molecular complexes with the potential to better understand genomic maintenance processes in hyperthermophilic archaea. Among others, we found new potential partners of the replication clamp and demonstrated that the single strand DNA binding protein, Replication Protein A, enhances the transcription rate, in vitro, of RNA polymerase. This interaction map provides a valuable tool to explore new aspects of genome integrity in Archaea and also potentially in Eucaryotes.
Collapse
Affiliation(s)
- Pierre-François Pluchon
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Thomas Fouqueau
- Lehrstuhl für Mikrobiologie, Universität Regensburg, Regensburg, Germany
| | - Christophe Crezé
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Sébastien Laurent
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Julien Briffotaux
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Gaëlle Hogrel
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Adeline Palud
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Ghislaine Henneke
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Anne Godfroy
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
| | - Winfried Hausner
- Lehrstuhl für Mikrobiologie, Universität Regensburg, Regensburg, Germany
| | - Michael Thomm
- Lehrstuhl für Mikrobiologie, Universität Regensburg, Regensburg, Germany
| | - Jacques Nicolas
- IRISA-INRIA, Campus de Beaulieu, Rennes, France
- * E-mail: (DF); (JN)
| | - Didier Flament
- Ifremer, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- Université de Bretagne Occidentale, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- CNRS, UMR6197, Laboratoire de Microbiologie des Environnements Extrêmes, Plouzané, France
- * E-mail: (DF); (JN)
| |
Collapse
|
46
|
Csermely P, Korcsmáros T, Kiss HJM, London G, Nussinov R. Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review. Pharmacol Ther 2013; 138:333-408. [PMID: 23384594 PMCID: PMC3647006 DOI: 10.1016/j.pharmthera.2013.01.016] [Citation(s) in RCA: 506] [Impact Index Per Article: 46.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Accepted: 01/22/2013] [Indexed: 02/02/2023]
Abstract
Despite considerable progress in genome- and proteome-based high-throughput screening methods and in rational drug design, the increase in approved drugs in the past decade did not match the increase of drug development costs. Network description and analysis not only give a systems-level understanding of drug action and disease complexity, but can also help to improve the efficiency of drug design. We give a comprehensive assessment of the analytical tools of network topology and dynamics. The state-of-the-art use of chemical similarity, protein structure, protein-protein interaction, signaling, genetic interaction and metabolic networks in the discovery of drug targets is summarized. We propose that network targeting follows two basic strategies. The "central hit strategy" selectively targets central nodes/edges of the flexible networks of infectious agents or cancer cells to kill them. The "network influence strategy" works against other diseases, where an efficient reconfiguration of rigid networks needs to be achieved by targeting the neighbors of central nodes/edges. It is shown how network techniques can help in the identification of single-target, edgetic, multi-target and allo-network drug target candidates. We review the recent boom in network methods helping hit identification, lead selection optimizing drug efficacy, as well as minimizing side-effects and drug toxicity. Successful network-based drug development strategies are shown through the examples of infections, cancer, metabolic diseases, neurodegenerative diseases and aging. Summarizing >1200 references we suggest an optimized protocol of network-aided drug development, and provide a list of systems-level hallmarks of drug quality. Finally, we highlight network-related drug development trends helping to achieve these hallmarks by a cohesive, global approach.
Collapse
Affiliation(s)
- Peter Csermely
- Department of Medical Chemistry, Semmelweis University, P.O. Box 260, H-1444 Budapest 8, Hungary.
| | | | | | | | | |
Collapse
|
47
|
Abstract
Molecular network data are increasingly becoming available, necessitating the development of well performing computational tools for their analyses. Such tools enabled conceptually different approaches for exploring human diseases to be undertaken, in particular, those that study the relationship between a multitude of biomolecules within a cell. Hence, a new field of network biology has emerged as part of systems biology, aiming to untangle the complexity of cellular network organization. We survey current network analysis methods that aim to give insight into human disease.
Collapse
Affiliation(s)
- Vuk Janjić
- Department of Computing, Imperial College London, 180 Queen's Gate, SW7 2AZ London, UK
| | | |
Collapse
|
48
|
Solava RW, Michaels RP, Milenkovic T. Graphlet-based edge clustering reveals pathogen-interacting proteins. Bioinformatics 2012; 28:i480-i486. [PMID: 22962470 PMCID: PMC3436803 DOI: 10.1093/bioinformatics/bts376] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
MOTIVATION Prediction of protein function from protein interaction networks has received attention in the post-genomic era. A popular strategy has been to cluster the network into functionally coherent groups of proteins and assign the entire cluster with a function based on functions of its annotated members. Traditionally, network research has focused on clustering of nodes. However, clustering of edges may be preferred: nodes belong to multiple functional groups, but clustering of nodes typically cannot capture the group overlap, while clustering of edges can. Clustering of adjacent edges that share many neighbors was proposed recently, outperforming different node clustering methods. However, since some biological processes can have characteristic 'signatures' throughout the network, not just locally, it may be of interest to consider edges that are not necessarily adjacent. RESULTS We design a sensitive measure of the 'topological similarity' of edges that can deal with edges that are not necessarily adjacent. We cluster edges that are similar according to our measure in different baker's yeast protein interaction networks, outperforming existing node and edge clustering approaches. We apply our approach to the human network to predict new pathogen-interacting proteins. This is important, since these proteins represent drug target candidates. AVAILABILITY Software executables are freely available upon request. CONTACT tmilenko@nd.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- R W Solava
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, USA
| | | | | |
Collapse
|