1
|
Begga A, Escolano Ruiz F, Lozano MÁ. Edge-Centric Embeddings of Digraphs: Properties and Stability Under Sparsification. ENTROPY (BASEL, SWITZERLAND) 2025; 27:304. [PMID: 40149228 PMCID: PMC11941605 DOI: 10.3390/e27030304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/25/2024] [Revised: 02/28/2025] [Accepted: 03/11/2025] [Indexed: 03/29/2025]
Abstract
In this paper, we define and characterize the embedding of edges and higher-order entities in directed graphs (digraphs) and relate these embeddings to those of nodes. Our edge-centric approach consists of the following: (a) Embedding line digraphs (or their iterated versions); (b) Exploiting the rank properties of these embeddings to show that edge/path similarity can be posed as a linear combination of node similarities; (c) Solving scalability issues through digraph sparsification; (d) Evaluating the performance of these embeddings for classification and clustering. We commence by identifying the motive behind the need for edge-centric approaches. Then we proceed to introduce all the elements of the approach, and finally, we validate it. Our edge-centric embedding entails a top-down mining of links, instead of inferring them from the similarities of node embeddings. This analysis is key to discovering inter-subgraph links that hold the whole graph connected, i.e., central edges. Using directed graphs (digraphs) allows us to cluster edge-like hubs and authorities. In addition, since directed edges inherit their labels from destination (origin) nodes, their embedding provides a proxy representation for node classification and clustering as well. This representation is obtained by embedding the line digraph of the original one. The line digraph provides nice formal properties with respect to the original graph; in particular, it produces more entropic latent spaces. With these properties at hand, we can relate edge embeddings to node embeddings. The main contribution of this paper is to set and prove the linearity theorem, which poses each element of the transition matrix for an edge embedding as a linear combination of the elements of the transition matrix for the node embedding. As a result, the rank preservation property explains why embedding the line digraph and using the labels of the destination nodes provides better classification and clustering performances than embedding the nodes of the original graph. In other words, we do not only facilitate edge mining but enforce node classification and clustering. However, computing the line digraph is challenging, and a sparsification strategy is implemented for the sake of scalability. Our experimental results show that the line digraph representation of the sparsified input graph is quite stable as we increase the sparsification level, and also that it outperforms the original (node-centric) representation. For the sake of simplicity, our theorem relies on node2vec-like (factorization) embeddings. However, we also include several experiments showing how line digraphs may improve the performance of Graph Neural Networks (GNNs), also following the principle of maximum entropy.
Collapse
Affiliation(s)
- Ahmed Begga
- Department of Computer Science and Artificial Intelligence, University of Alicante, 03690 Alicante, Spain; (F.E.R.); (M.Á.L.)
| | | | | |
Collapse
|
2
|
Mara AC, Lijffijt J, De Bie T. An Empirical Evaluation of Network Representation Learning Methods. BIG DATA 2024; 12:518-537. [PMID: 35271383 DOI: 10.1089/big.2021.0107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Network representation learning methods map network nodes to vectors in an embedding space that can preserve specific properties and enable traditional downstream prediction tasks. The quality of the representations learned is then generally showcased through results on these downstream tasks. Commonly used benchmark tasks such as link prediction or network reconstruction, however, present complex evaluation pipelines and an abundance of design choices. This, together with a lack of standardized evaluation setups, can obscure the real progress in the field. In this article, we aim at investigating the impact on the performance of a variety of such design choices and perform an extensive and consistent evaluation that can shed light on the state-of-the-art on network representation learning. Our evaluation reveals that only limited progress has been made in recent years, with embedding-based approaches struggling to outperform basic heuristics in many scenarios.
Collapse
Affiliation(s)
| | - Jefrey Lijffijt
- Department of Electronics and Information Systems, Ghent University, Ghent, Belgium
| | - Tijl De Bie
- Department of Electronics and Information Systems, Ghent University, Ghent, Belgium
| |
Collapse
|
3
|
Feng X, Ma Z, Yu C, Xin R. MRNDR: Multihead Attention-Based Recommendation Network for Drug Repurposing. J Chem Inf Model 2024; 64:2654-2669. [PMID: 38373300 DOI: 10.1021/acs.jcim.3c01726] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2024]
Abstract
As is well-known, the process of developing new drugs is extremely expensive, whereas drug repurposing represents a promising approach to augment the efficiency of new drug development. While this method can indeed spare us from expensive drug toxicity and safety experiments, it still demands a substantial amount of time to carry out precise efficacy experiments for specific diseases, thereby consuming a significant quantity of resources. Therefore, if we can prescreen potential other indications for selected drugs, it could result in substantial cost savings. In light of this, this paper introduces a drug repurposing recommendation model called MRNDR, which stands for Multi-head attention-based Recommendation Network for Drug Repurposing. This model serves as a prediction tool for drug-disease relationships, leveraging the multihead self-attention mechanism that demonstrates robust generalization capabilities. These capabilities stem not only from our extensive million-level training data set, BioRE (Biology Recommended Entity data), but also from the utilization of the WRDS (Weighted Representation Distance Score) algorithm proposed by us. The MRNDR model has achieved new state-of-the-art results on the GP-KG public data set, with an MRR (Mean Reciprocal Rank) score of 0.308 and a Hits@10 score of 0.628. This represents significant improvements of 4.7% (MRR) and 18.1% (Hits@10) over the current best-performing models. Additionally, to further validate the practical utility of the model, we examined results recommended by MRNDR that were not present in the training data set. Some of these recommendations have undergone clinical trials, as evidenced by their presence on ClinicalTrials.gov and the China Clinical Trials Center, indirectly confirming the applicability of MRNDR. The MRNDR model can predict the reusability of candidate drugs, reducing the need for manual expert assessments and enabling efficient drug repurposing.
Collapse
Affiliation(s)
- Xin Feng
- School of Science, Jilin Institute of Chemical Technology, Jilin 130000, P.R. China
- State Key Laboratory of Inorganic Synthesis and Preparative Chemistry, College of Chemistry, Jilin University, Changchun 130012, P.R. China
- Department of Epidemiology and Biostatistics, School of Public Health, Jilin University, Changchun 130012, P.R. China
| | - Zhansen Ma
- College of Information and Control Engineering, Jilin Institute of Chemical Technology, Jilin 130000, P.R. China
| | - Cuinan Yu
- College of Computer Science and Technology, and Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, P.R. China
| | - Ruihao Xin
- College of Information and Control Engineering, Jilin Institute of Chemical Technology, Jilin 130000, P.R. China
- College of Computer Science and Technology, and Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, P.R. China
| |
Collapse
|
4
|
Xie S, Saba L, Jiang H, Bringas OR, Oghbaie M, Stefano LD, Sherman V, LaCava J. Multiparameter screen optimizes immunoprecipitation. Biotechniques 2024; 76:145-152. [PMID: 38425263 PMCID: PMC11091867 DOI: 10.2144/btn-2023-0051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/02/2024] Open
Abstract
Immunoprecipitation (IP) coupled with mass spectrometry effectively maps protein-protein interactions when genome-wide, affinity-tagged cell collections are used. Such studies have recorded significant portions of the compositions of physiological protein complexes, providing draft 'interactomes'; yet many constituents of protein complexes still remain uncharted. This gap exists partly because high-throughput approaches cannot optimize each IP. A key challenge for IP optimization is stabilizing in vivo interactions during the transfer from cells to test tubes; failure to do so leads to the loss of genuine interactions during the IP and subsequent failure to detect. Our high-content screening method explores the relationship between in vitro chemical conditions and IP outcomes, enabling rapid empirical optimization of conditions for capturing target macromolecular assemblies.
Collapse
Affiliation(s)
- Shaoshuai Xie
- European Research Institute for the Biology of Ageing, University Medical Centre Groningen, Groningen, 9713AV, The Netherlands
| | - Leila Saba
- European Research Institute for the Biology of Ageing, University Medical Centre Groningen, Groningen, 9713AV, The Netherlands
| | - Hua Jiang
- Laboratory of Cellular & Structural Biology, The Rockefeller University, New York, NY 10065, USA
| | - Omar R Bringas
- European Research Institute for the Biology of Ageing, University Medical Centre Groningen, Groningen, 9713AV, The Netherlands
| | - Mehrnoosh Oghbaie
- European Research Institute for the Biology of Ageing, University Medical Centre Groningen, Groningen, 9713AV, The Netherlands
- Laboratory of Cellular & Structural Biology, The Rockefeller University, New York, NY 10065, USA
| | - Luciano Di Stefano
- European Research Institute for the Biology of Ageing, University Medical Centre Groningen, Groningen, 9713AV, The Netherlands
| | - Vadim Sherman
- High Energy Physics Instrument Shop, The Rockefeller University, New York, NY 10065, USA
| | - John LaCava
- European Research Institute for the Biology of Ageing, University Medical Centre Groningen, Groningen, 9713AV, The Netherlands
- Laboratory of Cellular & Structural Biology, The Rockefeller University, New York, NY 10065, USA
| |
Collapse
|
5
|
Wang X, Zhao H, Chen H. Improved Skip-Gram Based on Graph Structure Information. SENSORS (BASEL, SWITZERLAND) 2023; 23:6527. [PMID: 37514822 PMCID: PMC10383593 DOI: 10.3390/s23146527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 07/16/2023] [Accepted: 07/17/2023] [Indexed: 07/30/2023]
Abstract
Applying the Skip-gram to graph representation learning has become a widely researched topic in recent years. Prior works usually focus on the migration application of the Skip-gram model, while Skip-gram in graph representation learning, initially applied to word embedding, is left insufficiently explored. To compensate for the shortcoming, we analyze the difference between word embedding and graph embedding and reveal the principle of graph representation learning through a case study to explain the essential idea of graph embedding intuitively. Through the case study and in-depth understanding of graph embeddings, we propose Graph Skip-gram, an extension of the Skip-gram model using graph structure information. Graph Skip-gram can be combined with a variety of algorithms for excellent adaptability. Inspired by word embeddings in natural language processing, we design a novel feature fusion algorithm to fuse node vectors based on node vector similarity. We fully articulate the ideas of our approach on a small network and provide extensive experimental comparisons, including multiple classification tasks and link prediction tasks, demonstrating that our proposed approach is more applicable to graph representation learning.
Collapse
Affiliation(s)
- Xiaojie Wang
- School of Computer Science, China West Normal University, Nanchong 637002, China
| | - Haijun Zhao
- School of Computer Science, China West Normal University, Nanchong 637002, China
| | - Huayue Chen
- School of Computer Science, China West Normal University, Nanchong 637002, China
| |
Collapse
|
6
|
Mapping the common gene networks that underlie related diseases. Nat Protoc 2023:10.1038/s41596-022-00797-1. [PMID: 36653526 DOI: 10.1038/s41596-022-00797-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Accepted: 11/21/2022] [Indexed: 01/19/2023]
Abstract
A longstanding goal of biomedicine is to understand how alterations in molecular and cellular networks give rise to the spectrum of human diseases. For diseases with shared etiology, understanding the common causes allows for improved diagnosis of each disease, development of new therapies and more comprehensive identification of disease genes. Accordingly, this protocol describes how to evaluate the extent to which two diseases, each characterized by a set of mapped genes, are colocalized in a reference gene interaction network. This procedure uses network propagation to measure the network 'distance' between gene sets. For colocalized diseases, the network can be further analyzed to extract common gene communities at progressive granularities. In particular, we show how to: (1) obtain input gene sets and a reference gene interaction network; (2) identify common subnetworks of genes that encompass or are in close proximity to all gene sets; (3) use multiscale community detection to identify systems and pathways represented by each common subnetwork to generate a network colocalized systems map; (4) validate identified genes and systems using a mouse variant database; and (5) visualize and further investigate select genes, interactions and systems for relevance to phenotype(s) of interest. We demonstrate the utility of this approach by identifying shared biological mechanisms underlying autism and congenital heart disease. However, this protocol is general and can be applied to any gene sets attributed to diseases or other phenotypes with suspected joint association. A typical NetColoc run takes less than an hour. Software and documentation are available at https://github.com/ucsd-ccbb/NetColoc .
Collapse
|
7
|
Vora DS, Kalakoti Y, Sundar D. Computational Methods and Deep Learning for Elucidating Protein Interaction Networks. Methods Mol Biol 2023; 2553:285-323. [PMID: 36227550 DOI: 10.1007/978-1-0716-2617-7_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Protein interactions play a critical role in all biological processes, but experimental identification of protein interactions is a time- and resource-intensive process. The advances in next-generation sequencing and multi-omics technologies have greatly benefited large-scale predictions of protein interactions using machine learning methods. A wide range of tools have been developed to predict protein-protein, protein-nucleic acid, and protein-drug interactions. Here, we discuss the applications, methods, and challenges faced when employing the various prediction methods. We also briefly describe ways to overcome the challenges and prospective future developments in the field of protein interaction biology.
Collapse
Affiliation(s)
- Dhvani Sandip Vora
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, Hauz Khas, New Delhi, India
| | - Yogesh Kalakoti
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, Hauz Khas, New Delhi, India
| | - Durai Sundar
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, Hauz Khas, New Delhi, India.
- School of Artificial Intelligence, Indian Institute of Technology Delhi, Hauz Khas, New Delhi, India.
| |
Collapse
|
8
|
Zhu J, Wang L, Guo Z, Zhang T, Zhang P. Transcriptome analysis of intestine from alk-SMase knockout mice reveals the effect of alk-SMase. Cancer Cell Int 2022; 22:344. [DOI: 10.1186/s12935-022-02764-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Accepted: 10/11/2022] [Indexed: 11/10/2022] Open
Abstract
Abstract
Objective
Intestinal alkaline sphingomyelinase (alk-SMase) generates ceramide and inactivates platelet-activating factor associated with digestion and inhibition of cancer. There is few study to analyze the correlated function and characterize the genes related to alk-SMase comprehensively. We characterised transcriptome landscapes of intestine tissues from alk-SMase knockout (KO) mice aiming to identify novel associated genes and research targets.
Methods
We performed the high-resolution RNA sequencing of alk-SMase KO mice and compared them to wild type (WT) mice. Differentially expressed genes (DEGs) for the training group were screened. Functional enrichment analysis of the DEGs between KO mice and WT mice was implemented using the Database for Annotation, Visualization and Integrated Discovery (DAVID). An integrated protein–protein interaction (PPI) and Kyoto Encyclopedia of Genes and Genomes (KEGG) network was chose to study the relationship of differentially expressed gene. Moreover, quantitative real-time polymerase chain reaction (qPCR) was further used to validate the accuracy of RNA-seq technology.
Results
Our RNA-seq data found 97 differentially expressed mRNAs between the WT mice and alk-SMase gene NPP7 KO mice, in which 32 were significantly up-regulated and 65 were down-regulated, including protein coding genes, non-coding RNAs. Notably, the results of gene ontology functional enrichment analysis indicated that DEGs were functionally associated with the immune response, regulation of cell proliferation and development related terms. Additionally, an integrated network analysis was shown that some modules was significantly related to alk-SMase and with accordance of previously results. We chose 6 of these genes randomly were validated the accuracy of RNA-seq technology using qPCR and 2 genes showed difference significantly (P < 0.05).
Conclusions
We investigated the potential biological significant of alk-SMase with high resolution genome-wide transcriptome of alk-SMase knockout mice. The results revealed new insight into the functional modules related to alk-SMase was involved in the intestinal related diseases.
Collapse
|
9
|
Somogyvári M, Khatatneh S, Sőti C. Hsp90: From Cellular to Organismal Proteostasis. Cells 2022; 11:cells11162479. [PMID: 36010556 PMCID: PMC9406713 DOI: 10.3390/cells11162479] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 07/27/2022] [Accepted: 07/28/2022] [Indexed: 11/16/2022] Open
Abstract
Assuring a healthy proteome is indispensable for survival and organismal health. Proteome disbalance and the loss of the proteostasis buffer are hallmarks of various diseases. The essential molecular chaperone Hsp90 is a regulator of the heat shock response via HSF1 and a stabilizer of a plethora of signaling proteins. In this review, we summarize the role of Hsp90 in the cellular and organismal regulation of proteome maintenance.
Collapse
|
10
|
Zhu J, Wang L, Li X, Lan D, Song L, Li Y, Cheng Y, Zhang P. Transcriptome analysis alk-SMase knockout mice reveals the effect of alkaline sphingomyelinase on liver. Biochem Biophys Rep 2022; 30:101240. [PMID: 35360085 PMCID: PMC8961189 DOI: 10.1016/j.bbrep.2022.101240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2021] [Revised: 02/21/2022] [Accepted: 02/26/2022] [Indexed: 11/25/2022] Open
|
11
|
Jiang H, Chiang CY, Chen Z, Nathan S, D'Agostino G, Paulo JA, Song G, Zhu H, Gabelli SB, Cole PA. Enzymatic analysis of WWP2 E3 ubiquitin ligase using protein microarrays identifies autophagy-related substrates. J Biol Chem 2022; 298:101854. [PMID: 35331737 PMCID: PMC9034101 DOI: 10.1016/j.jbc.2022.101854] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Revised: 03/15/2022] [Accepted: 03/16/2022] [Indexed: 11/30/2022] Open
Abstract
WWP2 is a HECT E3 ligase that targets protein Lys residues for ubiquitination and is comprised of an N-terminal C2 domain, four central WW domains, and a C-terminal catalytic HECT domain. The peptide segment between the middle WW domains, the 2,3-linker, is known to autoinhibit the catalytic domain, and this autoinhibition can be relieved by phosphorylation at Tyr369. Several protein substrates of WWP2 have been identified, including the tumor suppressor lipid phosphatase PTEN, but the full substrate landscape and biological functions of WWP2 remain to be elucidated. Here, we used protein microarray technology and the activated enzyme phosphomimetic mutant WWP2Y369E to identify potential WWP2 substrates. We identified 31 substrate hits for WWP2Y369E using protein microarrays, of which three were known autophagy receptors (NDP52, OPTN, and SQSTM1). These three hits were validated with in vitro and cell-based transfection assays and the Lys ubiquitination sites on these proteins were mapped by mass spectrometry. Among the mapped ubiquitin sites on these autophagy receptors, many had been previously identified in the endogenous proteins. Finally, we observed that WWP2 KO SH-SH5Y neuroblastoma cells using CRISPR-Cas9 showed a defect in mitophagy, which could be rescued by WWP2Y369E transfection. These studies suggest that WWP2-mediated ubiquitination of the autophagy receptors NDP52, OPTN, and SQSTM1 may positively contribute to the regulation of autophagy.
Collapse
Affiliation(s)
- Hanjie Jiang
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA; Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts, USA; Department of Pharmacology and Molecular Sciences, Johns Hopkins School of Medicine, Baltimore, Maryland, USA
| | - Claire Y Chiang
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA; Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts, USA
| | - Zan Chen
- Department of Pharmacology and Molecular Sciences, Johns Hopkins School of Medicine, Baltimore, Maryland, USA; Department of Cell Biology, Harvard Medical School, Boston, Massachusetts, USA; Department of Biophysics and Biophysical Chemistry, The Johns Hopkins School of Medicine, Baltimore, Maryland, USA
| | - Sara Nathan
- Department of Biophysics and Biophysical Chemistry, The Johns Hopkins School of Medicine, Baltimore, Maryland, USA
| | - Gabriel D'Agostino
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA; Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts, USA
| | - Joao A Paulo
- Department of Cell Biology, Harvard Medical School, Boston, Massachusetts, USA
| | - Guang Song
- Department of Pharmacology and Molecular Sciences, Johns Hopkins School of Medicine, Baltimore, Maryland, USA
| | - Heng Zhu
- Department of Pharmacology and Molecular Sciences, Johns Hopkins School of Medicine, Baltimore, Maryland, USA; Department of Oncology, The Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
| | - Sandra B Gabelli
- Department of Biophysics and Biophysical Chemistry, The Johns Hopkins School of Medicine, Baltimore, Maryland, USA; Department of Oncology, The Johns Hopkins University School of Medicine, Baltimore, Maryland, USA; Department of Medicine, The Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
| | - Philip A Cole
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA; Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts, USA; Department of Pharmacology and Molecular Sciences, Johns Hopkins School of Medicine, Baltimore, Maryland, USA.
| |
Collapse
|
12
|
Rodriguez-Lopez M, Anver S, Cotobal C, Kamrad S, Malecki M, Correia-Melo C, Hoti M, Townsend S, Marguerat S, Pong SK, Wu MY, Montemayor L, Howell M, Ralser M, Bähler J. Functional profiling of long intergenic non-coding RNAs in fission yeast. eLife 2022; 11:e76000. [PMID: 34984977 PMCID: PMC8730722 DOI: 10.7554/elife.76000] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Accepted: 12/02/2021] [Indexed: 12/19/2022] Open
Abstract
Eukaryotic genomes express numerous long intergenic non-coding RNAs (lincRNAs) that do not overlap any coding genes. Some lincRNAs function in various aspects of gene regulation, but it is not clear in general to what extent lincRNAs contribute to the information flow from genotype to phenotype. To explore this question, we systematically analysed cellular roles of lincRNAs in Schizosaccharomyces pombe. Using seamless CRISPR/Cas9-based genome editing, we deleted 141 lincRNA genes to broadly phenotype these mutants, together with 238 diverse coding-gene mutants for functional context. We applied high-throughput colony-based assays to determine mutant growth and viability in benign conditions and in response to 145 different nutrient, drug, and stress conditions. These analyses uncovered phenotypes for 47.5% of the lincRNAs and 96% of the protein-coding genes. For 110 lincRNA mutants, we also performed high-throughput microscopy and flow cytometry assays, linking 37% of these lincRNAs with cell-size and/or cell-cycle control. With all assays combined, we detected phenotypes for 84 (59.6%) of all lincRNA deletion mutants tested. For complementary functional inference, we analysed colony growth of strains ectopically overexpressing 113 lincRNA genes under 47 different conditions. Of these overexpression strains, 102 (90.3%) showed altered growth under certain conditions. Clustering analyses provided further functional clues and relationships for some of the lincRNAs. These rich phenomics datasets associate lincRNA mutants with hundreds of phenotypes, indicating that most of the lincRNAs analysed exert cellular functions in specific environmental or physiological contexts. This study provides groundwork to further dissect the roles of these lincRNAs in the relevant conditions.
Collapse
Affiliation(s)
- Maria Rodriguez-Lopez
- University College London, Institute of Healthy Ageing and Department of Genetics, Evolution & EnvironmentLondonUnited Kingdom
| | - Shajahan Anver
- University College London, Institute of Healthy Ageing and Department of Genetics, Evolution & EnvironmentLondonUnited Kingdom
| | - Cristina Cotobal
- University College London, Institute of Healthy Ageing and Department of Genetics, Evolution & EnvironmentLondonUnited Kingdom
| | - Stephan Kamrad
- University College London, Institute of Healthy Ageing and Department of Genetics, Evolution & EnvironmentLondonUnited Kingdom
- The Francis Crick Institute, Molecular Biology of Metabolism LaboratoryLondonUnited Kingdom
- Charité Universitätsmedizin Berlin, Institute of BiochemistryBerlinGermany
| | - Michal Malecki
- University College London, Institute of Healthy Ageing and Department of Genetics, Evolution & EnvironmentLondonUnited Kingdom
| | - Clara Correia-Melo
- The Francis Crick Institute, Molecular Biology of Metabolism LaboratoryLondonUnited Kingdom
| | - Mimoza Hoti
- University College London, Institute of Healthy Ageing and Department of Genetics, Evolution & EnvironmentLondonUnited Kingdom
| | - StJohn Townsend
- University College London, Institute of Healthy Ageing and Department of Genetics, Evolution & EnvironmentLondonUnited Kingdom
- The Francis Crick Institute, Molecular Biology of Metabolism LaboratoryLondonUnited Kingdom
| | - Samuel Marguerat
- University College London, Institute of Healthy Ageing and Department of Genetics, Evolution & EnvironmentLondonUnited Kingdom
| | - Sheng Kai Pong
- University College London, Institute of Healthy Ageing and Department of Genetics, Evolution & EnvironmentLondonUnited Kingdom
| | - Mary Y Wu
- The Francis Crick Institute, High Throughput ScreeningLondonUnited Kingdom
| | - Luis Montemayor
- University College London, Institute of Healthy Ageing and Department of Genetics, Evolution & EnvironmentLondonUnited Kingdom
| | - Michael Howell
- The Francis Crick Institute, High Throughput ScreeningLondonUnited Kingdom
| | - Markus Ralser
- The Francis Crick Institute, Molecular Biology of Metabolism LaboratoryLondonUnited Kingdom
- Charité Universitätsmedizin Berlin, Institute of BiochemistryBerlinGermany
| | - Jürg Bähler
- University College London, Institute of Healthy Ageing and Department of Genetics, Evolution & EnvironmentLondonUnited Kingdom
| |
Collapse
|
13
|
Zhou K, Zhang S, Wang Y, Cohen KB, Kim JD, Luo Q, Yao X, Zhou X, Xia J. High-quality gene/disease embedding in a multi-relational heterogeneous graph after a joint matrix/tensor decomposition. J Biomed Inform 2022; 126:103973. [PMID: 34995810 DOI: 10.1016/j.jbi.2021.103973] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 10/20/2021] [Accepted: 12/04/2021] [Indexed: 10/19/2022]
Abstract
MOTIVATION Node embedding of biological entity network has been widely investigated for the downstream application scenarios. To embed full semantics of gene and disease, a multi-relational heterogeneous graph is considered in a scenario where uni-relation between gene/disease and other heterogeneous entities are abundant while multi-relation between gene and disease is relatively sparse. After introducing this novel graph format, it is illuminative to design a specific data integration algorithm to fully capture the graph information and bring embeddings with high quality. RESULTS First, a typical multi-relational triple dataset was introduced, which carried significant association between gene and disease. Second, we curated all human genes and diseases in seven mainstream datasets and constructed a large-scale gene-disease network, which compromising 163,024 nodes and 25,265,607 edges, and relates to 27,165 genes, 2,665 diseases, 15,067 chemicals, 108,023 mutations, 2,363 pathways, and 7.732 phenotypes. Third, we proposed a Joint Decomposition of Heterogeneous Matrix and Tensor (JDHMT) model, which integrated all heterogeneous data resources and obtained embedding for each gene or disease. Forth, a visualized intrinsic evaluation was performed, which investigated the embeddings in terms of interpretable data clustering. Furthermore, an extrinsic evaluation was performed in the form of linking prediction. Both intrinsic and extrinsic evaluation results showed that JDHMT model outperformed other eleven state-of-the-art (SOTA) methods which are under relation-learning, proximity-preserving or message-passing paradigms. Finally, the constructed gene-disease network, embedding results and codes were made available. DATA AND CODES AVAILABILITY The constructed massive gene-disease network is available at: https://hzaubionlp.com/heterogeneous-biological-network/. The codes are available at: https://github.com/bionlp-hzau/JDHMT.
Collapse
Affiliation(s)
- Kaiyin Zhou
- Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, Hubei, China
| | - Sheng Zhang
- College of Science, Huazhong Agricultural University, Wuhan 430070, Hubei, China
| | - Yuxing Wang
- Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, Hubei, China
| | - Kevin Bretonnel Cohen
- School of Medicine, University of Colorado Denver, Anschutz Medical Campus, Aurora, CO, USA
| | - Jin-Dong Kim
- Database Center for Life Science (DBCLS), Research Organization of Information and Systems (ROIS), Kashiwa, Tokyo, Japan
| | - Qi Luo
- College of Science, Huazhong Agricultural University, Wuhan 430070, Hubei, China
| | - Xinzhi Yao
- Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, Hubei, China
| | - Xingyu Zhou
- Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, Hubei, China
| | - Jingbo Xia
- Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, Hubei, China.
| |
Collapse
|
14
|
Amara A, Hadj Taieb MA, Ben Aouicha M. Network representation learning systematic review: Ancestors and current development state. MACHINE LEARNING WITH APPLICATIONS 2021. [DOI: 10.1016/j.mlwa.2021.100130] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
|
15
|
Sitagliptin: a potential drug for the treatment of COVID-19? ACTA PHARMACEUTICA (ZAGREB, CROATIA) 2021; 71:175-184. [PMID: 33151168 DOI: 10.2478/acph-2021-0013] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 04/15/2020] [Indexed: 02/06/2023]
Abstract
Recently, an outbreak of a fatal coronavirus, SARS-CoV-2, has emerged from China and is rapidly spreading worldwide. Possible interaction of SARS-CoV-2 with DPP4 peptidase may partly contribute to the viral pathogenesis. An integrative bioinformatics approach starting with mining the biomedical literature for high confidence DPP4-protein/gene associations followed by functional analysis using network analysis and pathway enrichment was adopted. The results indicate that the identified DPP4 networks are highly enriched in viral processes required for viral entry and infection, and as a result, we propose DPP4 as an important putative target for the treatment of COVID-19. Additionally, our protein-chemical interaction networks identified important interactions between DPP4 and sitagliptin. We conclude that sitagliptin may be beneficial for the treatment of COVID-19 disease, either as monotherapy or in combination with other therapies, especially for diabetic patients and patients with pre-existing cardiovascular conditions who are already at higher risk of COVID-19 mortality.
Collapse
|
16
|
Muzio G, O’Bray L, Borgwardt K. Biological network analysis with deep learning. Brief Bioinform 2021; 22:1515-1530. [PMID: 33169146 PMCID: PMC7986589 DOI: 10.1093/bib/bbaa257] [Citation(s) in RCA: 96] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 08/26/2020] [Accepted: 09/11/2020] [Indexed: 12/17/2022] Open
Abstract
Recent advancements in experimental high-throughput technologies have expanded the availability and quantity of molecular data in biology. Given the importance of interactions in biological processes, such as the interactions between proteins or the bonds within a chemical compound, this data is often represented in the form of a biological network. The rise of this data has created a need for new computational tools to analyze networks. One major trend in the field is to use deep learning for this goal and, more specifically, to use methods that work with networks, the so-called graph neural networks (GNNs). In this article, we describe biological networks and review the principles and underlying algorithms of GNNs. We then discuss domains in bioinformatics in which graph neural networks are frequently being applied at the moment, such as protein function prediction, protein-protein interaction prediction and in silico drug discovery and development. Finally, we highlight application areas such as gene regulatory networks and disease diagnosis where deep learning is emerging as a new tool to answer classic questions like gene interaction prediction and automatic disease prediction from data.
Collapse
Affiliation(s)
- Giulia Muzio
- Machine Learning and Computational Biology Lab at ETH Zürich
| | - Leslie O’Bray
- Machine Learning and Computational Biology Lab at ETH Zürich
| | | |
Collapse
|
17
|
Muzio G, O'Bray L, Borgwardt K. Biological network analysis with deep learning. Brief Bioinform 2021; 22:1515-1530. [PMID: 33169146 DOI: 10.1145/3447548.3467442] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 08/26/2020] [Accepted: 09/11/2020] [Indexed: 05/28/2023] Open
Abstract
Recent advancements in experimental high-throughput technologies have expanded the availability and quantity of molecular data in biology. Given the importance of interactions in biological processes, such as the interactions between proteins or the bonds within a chemical compound, this data is often represented in the form of a biological network. The rise of this data has created a need for new computational tools to analyze networks. One major trend in the field is to use deep learning for this goal and, more specifically, to use methods that work with networks, the so-called graph neural networks (GNNs). In this article, we describe biological networks and review the principles and underlying algorithms of GNNs. We then discuss domains in bioinformatics in which graph neural networks are frequently being applied at the moment, such as protein function prediction, protein-protein interaction prediction and in silico drug discovery and development. Finally, we highlight application areas such as gene regulatory networks and disease diagnosis where deep learning is emerging as a new tool to answer classic questions like gene interaction prediction and automatic disease prediction from data.
Collapse
Affiliation(s)
- Giulia Muzio
- Machine Learning and Computational Biology Lab at ETH Zürich
| | - Leslie O'Bray
- Machine Learning and Computational Biology Lab at ETH Zürich
| | | |
Collapse
|
18
|
Sabbah DA, Hajjo R, Sweidan K. Review on Epidermal Growth Factor Receptor (EGFR) Structure, Signaling Pathways, Interactions, and Recent Updates of EGFR Inhibitors. Curr Top Med Chem 2021; 20:815-834. [PMID: 32124699 DOI: 10.2174/1568026620666200303123102] [Citation(s) in RCA: 282] [Impact Index Per Article: 70.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2019] [Revised: 11/21/2019] [Accepted: 12/10/2019] [Indexed: 12/13/2022]
Abstract
The epidermal growth factor receptor (EGFR) belongs to the ERBB family of tyrosine kinase receptors. EGFR signaling cascade is a key regulator in cell proliferation, differentiation, division, survival, and cancer development. In this review, the EGFR structure and its mutations, signaling pathway, ligand binding and EGFR dimerization, EGF/EGFR interaction, and the progress in the development of EGFR inhibitors have been explored.
Collapse
Affiliation(s)
- Dima A Sabbah
- Department of Pharmacy, Faculty of Pharmacy, Al-Zaytoonah University of Jordan, P.O. Box 130, Amman 11733, Jordan
| | - Rima Hajjo
- Department of Pharmacy, Faculty of Pharmacy, Al-Zaytoonah University of Jordan, P.O. Box 130, Amman 11733, Jordan
| | - Kamal Sweidan
- Department of Chemistry, The University of Jordan, Amman 11942, Jordan
| |
Collapse
|
19
|
Hajjo R, Sabbah DA, Bardaweel SK. Chemocentric Informatics Analysis: Dexamethasone Versus Combination Therapy for COVID-19. ACS OMEGA 2020; 5:29765-29779. [PMID: 33251412 PMCID: PMC7689662 DOI: 10.1021/acsomega.0c03597] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Accepted: 11/03/2020] [Indexed: 05/08/2023]
Abstract
COVID-19 is a biphasic infectious disease with no approved vaccine or pharmacotherapy. The first drug that has shown promise in reducing COVID-19 mortality in severely-ill patients is dexamethasone, a cheap, well-known anti-inflammatory glucocorticoid, approved for the treatment of inflammatory conditions including respiratory diseases such as asthma and tuberculosis. However, about 80% of COVID-19 patients requiring oxygenation, and about 67% of patients on ventilators, are not responsive to dexamethasone therapy mainly. Additionally, using higher doses of dexamethasone for prolonged periods of time can lead to severe side effects and some patients may develop corticosteroid resistance leading to treatment failure. In order to increase the therapeutic efficacy of dexamethasone in COVID-19 patients, while minimizing dexamethasone-related complications that could result from using higher doses of the drug, we applied a chemocentric informatics approach to identify combination therapies. Our results indicated that combining dexamethasone with fast long-acting beta-2 adrenergic agonists (LABAs), such as formoterol and salmeterol, can ease respiratory symptoms hastily, until dexamethasone's anti-inflammatory and immunosuppressant effects kick in. Our studies demonstrated that LABAs and dexamethasone (or other glucocorticoids) exert synergistic effects that will augment both anti-inflammatory and fibronectin-mediated anticoagulant effects. We also propose other alternatives to LABAs that are supported by sound systems biology evidence, such as nitric oxide. Other drugs such as sevoflurane and treprostinil interact with the SARS-CoV-2 interactome and deserve further exploration. Moreover, our chemocentric informatics approach provides systems biology evidence that combination therapies for COVID-19 will have higher chances of perturbing the SARS-CoV-2 human interactome, which may negatively impact COVID-19 disease pathways.
Collapse
Affiliation(s)
- Rima Hajjo
- Department
of Pharmacy, Faculty of Pharmacy, Al-Zaytoonah
University of Jordan, P.O. Box 130, Amman 11733, Jordan
| | - Dima A. Sabbah
- Department
of Pharmacy, Faculty of Pharmacy, Al-Zaytoonah
University of Jordan, P.O. Box 130, Amman 11733, Jordan
| | - Sanaa K. Bardaweel
- Department
of Pharmaceutical Sciences, School of Pharmacy, University of Jordan, Amman 11942, Jordan
| |
Collapse
|
20
|
Hajjo R, Tropsha A. A Systems Biology Workflow for Drug and Vaccine Repurposing: Identifying Small-Molecule BCG Mimics to Reduce or Prevent COVID-19 Mortality. Pharm Res 2020; 37:212. [PMID: 33025261 PMCID: PMC7537965 DOI: 10.1007/s11095-020-02930-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Accepted: 09/17/2020] [Indexed: 12/18/2022]
Abstract
PURPOSE Coronavirus disease 2019 (COVID-19) is expected to continue to cause worldwide fatalities until the World population develops 'herd immunity', or until a vaccine is developed and used as a prevention. Meanwhile, there is an urgent need to identify alternative means of antiviral defense. Bacillus Calmette-Guérin (BCG) vaccine that has been recognized for its off-target beneficial effects on the immune system can be exploited to boast immunity and protect from emerging novel viruses. METHODS We developed and employed a systems biology workflow capable of identifying small-molecule antiviral drugs and vaccines that can boast immunity and affect a wide variety of viral disease pathways to protect from the fatal consequences of emerging viruses. RESULTS Our analysis demonstrates that BCG vaccine affects the production and maturation of naïve T cells resulting in enhanced, long-lasting trained innate immune responses that can provide protection against novel viruses. We have identified small-molecule BCG mimics, including antiviral drugs such as raltegravir and lopinavir as high confidence hits. Strikingly, our top hits emetine and lopinavir were independently validated by recent experimental findings that these compounds inhibit the growth of SARS-CoV-2 in vitro. CONCLUSIONS Our results provide systems biology support for using BCG and small-molecule BCG mimics as putative vaccine and drug candidates against emergent viruses including SARS-CoV-2.
Collapse
Affiliation(s)
- Rima Hajjo
- Department of Pharmacy - Computational Chemical Biology, Faculty of Pharmacy, Al-Zaytoonah University of Jordan, P.O. Box 130, Amman, 11733, Jordan.
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, UNC Eshelman School of Pharmacy, UNC Chapel Hill, Chapel Hill, North Carolina, 27599, USA
| |
Collapse
|
21
|
Wu Y, Tang H, Si R, Xia S, Wang R, Wang Q. MICA enhances sensitivity to cisplatin in patients with extensive small cell lung cancer via downregulation of ABCG2. Oncol Lett 2020; 20:1143-1152. [PMID: 32724354 PMCID: PMC7377106 DOI: 10.3892/ol.2020.11646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2019] [Accepted: 04/09/2020] [Indexed: 11/24/2022] Open
Abstract
Immunotherapy utilizing natural killer cell-activated receptor natural-killer group-2 member D ligands (NKG2DLs) has had preclinical success in the treatment of small cell lung cancer. The present study aimed to investigate the association between NKG2Ls and chemoresistance. The mRNA expression of six NKG2DLs associated with progression-free survival time (PFS) and first-line chemotherapy were assessed in the present study. Major histocompatibility complex class I polypeptide-related sequence A (MICA)-overexpressing NCI-H446 cell line was constructed, and the mRNA expression levels of 11 genes associated with chemotherapy sensitivity were determined. The results demonstrated that MICA was positively and significantly associated with PFS. Furthermore, MICA expression was 1.6 times higher in patients with prolonged PFS compared with the rapid chemoresistance group. ATP binding cassette subfamily G member 2 (ABCG2) mRNA expression was associated with chemotherapy resistance and significantly downregulated in the cell line overexpressing MICA. Moreover, following addition of nicardipine (an ABCG2 inhibitor), chemotherapeutic sensitivity increased in the MICA-overexpressing cell line. Taken together, the results of the present study suggested that MICA may enhance the chemosensitivity of patients with extensive small cell lung cancer by downregulating ABCG2.
Collapse
Affiliation(s)
- Yufeng Wu
- Department of Internal Medicine, Henan Cancer Hospital, The Affiliated Cancer Hospital of Zhengzhou University, Zhengzhou, Henan 450008, P.R. China
| | - Hong Tang
- Department of Internal Medicine, Henan Cancer Hospital, The Affiliated Cancer Hospital of Zhengzhou University, Zhengzhou, Henan 450008, P.R. China
| | - Ruirui Si
- Department of Health Center, Henan Airport Group Co., Ltd., Zhengzhou, Henan 450000, P.R. China
| | - Suhua Xia
- Department of Oncology, The First Affiliated Hospital of Soochow University, Suzhou, Jiangsu 215006, P.R. China
| | - Ruilin Wang
- Department of Internal Medicine, Henan Cancer Hospital, The Affiliated Cancer Hospital of Zhengzhou University, Zhengzhou, Henan 450008, P.R. China
| | - Qiming Wang
- Department of Internal Medicine, Henan Cancer Hospital, The Affiliated Cancer Hospital of Zhengzhou University, Zhengzhou, Henan 450008, P.R. China
| |
Collapse
|
22
|
Liu W, Fang Y, Shi Y, Cheng Y, Sun C, Cui D. The interaction of histone modification related H3F3B and NSD2 genes increases the susceptibility to schizophrenia in a Chinese population. Prog Neuropsychopharmacol Biol Psychiatry 2020; 101:109918. [PMID: 32169559 DOI: 10.1016/j.pnpbp.2020.109918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/08/2019] [Revised: 03/06/2020] [Accepted: 03/09/2020] [Indexed: 10/24/2022]
Abstract
The role of histone modifications in the pathogenesis of schizophrenia has been proposed previously. H3F3B is a member of the histone 3. NSD2 is a histone methyltransferase that mediates dimethylation of Histone 3 lysine 36 (H3K36me2). The aim of the current study was to explore the associations between SNPs within H3F3B gene (rs60700976, rs3214028) and NSD2 gene (rs13148597, rs75820801) and the susceptibility to schizophrenia in a Chinese population. A total of 810 patients and 490 healthy controls were recruited and genetic association analyses were performed. The H3F3B gene polymorphisms rs3214028 and rs60700976 were significantly associated with schizophrenia. Rs60700976 was also associated with psychotic symptoms in schizophrenia patients. Furthermore, we found the interaction between NSD2 gene and H3F3B gene was related to the susceptibility to schizophrenia. The corresponding best three-locus model was H3F3B (rs60700976) - NSD2 (rs75820801, rs13148597), and the high-risk genotype combination was rs13148597(CC)- rs60700976(GG)-rs75820801(TT) (OR = 1.388[1.091-1.766], P = .007). The low-risk genotype combination was rs13148597(CC)-rs60700976(GG)-rs75820801(CT) (OR = 0.57 [0.330-0.985], P = .042). Our findings provided the preliminary evidence that the histone modification related H3F3B and NSD2 genes may confer the susceptibility to schizophrenia in a Chinese population.
Collapse
Affiliation(s)
- Wenxin Liu
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China; College of Life Sciences, Shanghai Normal University, Shanghai, China
| | - Yu Fang
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Yuan Shi
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Ying Cheng
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Chuanwen Sun
- College of Life Sciences, Shanghai Normal University, Shanghai, China.
| | - Donghong Cui
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Brain Science and Technology Research Center, Shanghai Jiao Tong University, China.
| |
Collapse
|
23
|
Abstract
In this study, we deal with the problem of biological network alignment (NA), which aims to find a node mapping between species' molecular networks that uncovers similar network regions, thus allowing for the transfer of functional knowledge between the aligned nodes. We provide evidence that current NA methods, which assume that topologically similar nodes (i.e., nodes whose network neighborhoods are isomorphic-like) have high functional relatedness, do not actually end up aligning functionally related nodes. That is, we show that the current topological similarity assumption does not hold well. Consequently, we argue that a paradigm shift is needed with how the NA problem is approached. So, we redefine NA as a data-driven framework, called TARA (data-driven NA), which attempts to learn the relationship between topological relatedness and functional relatedness without assuming that topological relatedness corresponds to topological similarity. TARA makes no assumptions about what nodes should be aligned, distinguishing it from existing NA methods. Specifically, TARA trains a classifier to predict whether two nodes from different networks are functionally related based on their network topological patterns (features). We find that TARA is able to make accurate predictions. TARA then takes each pair of nodes that are predicted as related to be part of an alignment. Like traditional NA methods, TARA uses this alignment for the across-species transfer of functional knowledge. TARA as currently implemented uses topological but not protein sequence information for functional knowledge transfer. In this context, we find that TARA outperforms existing state-of-the-art NA methods that also use topological information, WAVE and SANA, and even outperforms or complements a state-of-the-art NA method that uses both topological and sequence information, PrimAlign. Hence, adding sequence information to TARA, which is our future work, is likely to further improve its performance. The software and data are available at http://www.nd.edu/~cone/TARA/.
Collapse
Affiliation(s)
- Shawn Gu
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, United States of America
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, United States of America
- Center for Network and Data Science, University of Notre Dame, Notre Dame, IN, United States of America
| | - Tijana Milenković
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, United States of America
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, United States of America
- Center for Network and Data Science, University of Notre Dame, Notre Dame, IN, United States of America
| |
Collapse
|
24
|
Liu Q, Zhu X, Lindström M, Shi Y, Zheng J, Hao X, Gustafsson CM, Liu B. Yeast mismatch repair components are required for stable inheritance of gene silencing. PLoS Genet 2020; 16:e1008798. [PMID: 32469861 PMCID: PMC7286534 DOI: 10.1371/journal.pgen.1008798] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2019] [Revised: 06/10/2020] [Accepted: 04/26/2020] [Indexed: 11/19/2022] Open
Abstract
Alterations in epigenetic silencing have been associated with ageing and tumour formation. Although substantial efforts have been made towards understanding the mechanisms of gene silencing, novel regulators in this process remain to be identified. To systematically search for components governing epigenetic silencing, we developed a genome-wide silencing screen for yeast (Saccharomyces cerevisiae) silent mating type locus HMR. Unexpectedly, the screen identified the mismatch repair (MMR) components Pms1, Mlh1, and Msh2 as being required for silencing at this locus. We further found that the identified genes were also required for proper silencing in telomeres. More intriguingly, the MMR mutants caused a redistribution of Sir2 deacetylase, from silent mating type loci and telomeres to rDNA regions. As a consequence, acetylation levels at histone positions H3K14, H3K56, and H4K16 were increased at silent mating type loci and telomeres but were decreased in rDNA regions. Moreover, knockdown of MMR components in human HEK293T cells increased subtelomeric DUX4 gene expression. Our work reveals that MMR components are required for stable inheritance of gene silencing patterns and establishes a link between the MMR machinery and the control of epigenetic silencing.
Collapse
Affiliation(s)
- Qian Liu
- Department of Chemistry and Molecular Biology, University of Gothenburg, Medicinaregatan, Goteborg, Sweden
| | - Xuefeng Zhu
- Institute of Biomedicine, University of Gothenburg, Goteborg, Sweden
- * E-mail: (XZ); (BL)
| | - Michelle Lindström
- Department of Chemistry and Molecular Biology, University of Gothenburg, Medicinaregatan, Goteborg, Sweden
| | - Yonghong Shi
- Institute of Biomedicine, University of Gothenburg, Goteborg, Sweden
| | - Ju Zheng
- Department of Chemistry and Molecular Biology, University of Gothenburg, Medicinaregatan, Goteborg, Sweden
- Department of Biology, Functional Biology, KU Leuven, Heverlee, Belgium
| | - Xinxin Hao
- Department of Chemistry and Molecular Biology, University of Gothenburg, Medicinaregatan, Goteborg, Sweden
| | | | - Beidong Liu
- Department of Chemistry and Molecular Biology, University of Gothenburg, Medicinaregatan, Goteborg, Sweden
- Center for Large-scale cell-based screening, Faculty of Science, University of Gothenburg, Medicinaregatan, Goteborg, Sweden
- * E-mail: (XZ); (BL)
| |
Collapse
|
25
|
Yi Y, Fang Y, Wu K, Liu Y, Zhang W. Comprehensive gene and pathway analysis of cervical cancer progression. Oncol Lett 2020; 19:3316-3332. [PMID: 32256826 PMCID: PMC7074609 DOI: 10.3892/ol.2020.11439] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Accepted: 01/15/2020] [Indexed: 12/15/2022] Open
Abstract
Cervical Cancer is one of the leading causes of cancer-associated mortality in women. The present study aimed to identify key genes and pathways involved in cervical cancer (CC) progression, via a comprehensive bioinformatics analysis. The GSE63514 dataset from the Gene Expression Omnibus database was analyzed for hub genes and cancer progression was divided into four phases (phases I-IV). Pathway enrichment, protein-protein interaction (PPI) and pathway crosstalk analyses were performed, to identify key genes and pathways using a criterion nodal degree ≥5. Gene pathway analysis was determined by mapping the key genes into the key pathways. Co-expression between key genes and their effect on overall survival (OS) time was assessed using The Cancer Genome Atlas database. A total of 3,446 differentially expressed genes with 107 hub genes were identified within the four phases. A total of 14 key genes with 11 key pathways were obtained, following extraction of ≥5 degree nodes from the PPI and pathway crosstalk networks. Gene pathway analysis revealed that CDK1 and CCNB1 regulated the cell cycle and were activated in phase I. Notably, the following terms, 'pathways in cancer', 'focal adhesion' and the 'PI3K-Akt signaling pathway' ranked the highest in phases II-IV. Furthermore, FN1, ITGB1 and MMP9 may be associated with metastasis of tumor cells. STAT1 was indicated to predominantly function at the phase IV via cancer-associated signaling pathways, including 'pathways in cancer' and 'Toll-like receptor signaling pathway'. Survival analysis revealed that high ITGB1 and FN1 expression levels resulted in significantly worse OS. CDK1 and CCNB1 were revealed to regulate proliferation and differentiation through the cell cycle and viral tumorigenesis, while FN1 and ITGB1, which may be developed as novel prognostic factors, were co-expressed to induce metastasis via cancer-associated signaling pathways, including PI3K-Art signaling pathway, and focal adhesion in CC; however, the underlying molecular mechanisms require further research.
Collapse
Affiliation(s)
- Yuexiong Yi
- Department of Obstetrics and Gynecology, Zhongnan Hospital of Wuhan University, Wuhan, Hubei 430071, P.R. China
| | - Yan Fang
- Department of Obstetrics and Gynecology, Zhongnan Hospital of Wuhan University, Wuhan, Hubei 430071, P.R. China
| | - Kejia Wu
- Department of Obstetrics and Gynecology, Zhongnan Hospital of Wuhan University, Wuhan, Hubei 430071, P.R. China
| | - Yanyan Liu
- Department of Obstetrics and Gynecology, Zhongnan Hospital of Wuhan University, Wuhan, Hubei 430071, P.R. China
| | - Wei Zhang
- Department of Obstetrics and Gynecology, Zhongnan Hospital of Wuhan University, Wuhan, Hubei 430071, P.R. China
- Correspondence to: Professor Wei Zhang, Department of Obstetrics and Gynecology, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan, Hubei 430071, P.R. China, E-mail:
| |
Collapse
|
26
|
Vijayan V, Gu S, Krebs ET, Meng L, MilenkoviĆ T. Pairwise Versus Multiple Global Network Alignment. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2020; 8:41961-41974. [PMID: 33747670 PMCID: PMC7971151 DOI: 10.1109/access.2020.2976487] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Biological network alignment (NA) aims to identify similar regions between molecular networks of different species. NA can be local or global. Just as the recent trend in the NA field, we also focus on global NA, which can be pairwise (PNA) and multiple (MNA). PNA produces aligned node pairs between two networks. MNA produces aligned node clusters between more than two networks. Recently, the focus has shifted from PNA to MNA, because MNA captures conserved regions between more networks than PNA (and MNA is thus hypothesized to yield higher-quality alignments), though at higher computational complexity. The issue is that, due to the different outputs of PNA and MNA, a PNA method is only compared to other PNA methods, and an MNA method is only compared to other MNA methods. Comparison of PNA against MNA must be done to evaluate whether MNA indeed yields higher-quality alignments, as only this would justify MNA's higher computational complexity. We introduce a framework that allows for this. We evaluate eight prominent PNA and MNA methods, on synthetic and real-world biological networks, using topological and functional alignment quality measures. We compare PNA against MNA in both a pairwise (native to PNA) and multiple (native to MNA) manner. PNA is expected to perform better under the pairwise evaluation framework. Indeed this is what we find. MNA is expected to perform better under the multiple evaluation framework. Shockingly, we find this not always to hold; PNA is often better than MNA in this framework, depending on the choice of evaluation test.
Collapse
Affiliation(s)
- Vipin Vijayan
- Center for Network and Data Science, Department of Computer Science and Engineering, Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Shawn Gu
- Center for Network and Data Science, Department of Computer Science and Engineering, Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Eric T Krebs
- Center for Network and Data Science, Department of Computer Science and Engineering, Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Lei Meng
- Center for Network and Data Science, Department of Computer Science and Engineering, Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Tijana MilenkoviĆ
- Center for Network and Data Science, Department of Computer Science and Engineering, Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| |
Collapse
|
27
|
Sanchez R, Mackenzie SA. Integrative Network Analysis of Differentially Methylated and Expressed Genes for Biomarker Identification in Leukemia. Sci Rep 2020; 10:2123. [PMID: 32034170 PMCID: PMC7005804 DOI: 10.1038/s41598-020-58123-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2019] [Accepted: 01/07/2020] [Indexed: 02/01/2023] Open
Abstract
Genome-wide DNA methylation and gene expression are commonly altered in pediatric acute lymphoblastic leukemia (PALL). Integrated network analysis of cytosine methylation and expression datasets has the potential to provide deeper insights into the complex disease states and their causes than individual disconnected analyses. With the purpose of identifying reliable cancer-associated methylation signal in gene regions from leukemia patients, we present an integrative network analysis of differentially methylated (DMGs) and differentially expressed genes (DEGs). The application of a novel signal detection-machine learning approach to methylation analysis of whole genome bisulfite sequencing (WGBS) data permitted a high level of methylation signal resolution in cancer-associated genes and pathways. This integrative network analysis approach revealed that gene expression and methylation consistently targeted the same gene pathways relevant to cancer: Pathways in cancer, Ras signaling pathway, PI3K-Akt signaling pathway, and Rap1 signaling pathway, among others. Detected gene hubs and hub sub-networks were integrated by signature loci associated with cancer that include, for example, NOTCH1, RAC1, PIK3CD, BCL2, and EGFR. Statistical analysis disclosed a stochastic deterministic relationship between methylation and gene expression within the set of genes simultaneously identified as DEGs and DMGs, where larger values of gene expression changes were probabilistically associated with larger values of methylation changes. Concordance analysis of the overlap between enriched pathways in DEG and DMG datasets revealed statistically significant agreement between gene expression and methylation changes. These results support the potential identification of reliable and stable methylation biomarkers at genes for cancer diagnosis and prognosis.
Collapse
Affiliation(s)
- Robersy Sanchez
- Department of Biology, The Pennsylvania State University, University Park, PA, 16802, USA.
| | - Sally A Mackenzie
- Department of Biology, The Pennsylvania State University, University Park, PA, 16802, USA. .,Department of Plant Science, The Pennsylvania State University, University Park, PA, 16802, USA.
| |
Collapse
|
28
|
Heidari F, Papagelis M. Evolving network representation learning based on random walks. APPLIED NETWORK SCIENCE 2020; 5:18. [PMID: 32215318 PMCID: PMC7081665 DOI: 10.1007/s41109-020-00257-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 02/12/2020] [Indexed: 05/22/2023]
Abstract
Large-scale network mining and analysis is key to revealing the underlying dynamics of networks, not easily observable before. Lately, there is a fast-growing interest in learning low-dimensional continuous representations of networks that can be utilized to perform highly accurate and scalable graph mining tasks. A family of these methods is based on performing random walks on a network to learn its structural features and providing the sequence of random walks as input to a deep learning architecture to learn a network embedding. While these methods perform well, they can only operate on static networks. However, in real-world, networks are evolving, as nodes and edges are continuously added or deleted. As a result, any previously obtained network representation will now be outdated having an adverse effect on the accuracy of the network mining task at stake. The naive approach to address this problem is to re-apply the embedding method of choice every time there is an update to the network. But this approach has serious drawbacks. First, it is inefficient, because the embedding method itself is computationally expensive. Then, the network mining task outcome obtained by the subsequent network representations are not directly comparable to each other, due to the randomness involved in the new set of random walks involved each time. In this paper, we propose EvoNRL, a random-walk based method for learning representations of evolving networks. The key idea of our approach is to first obtain a set of random walks on the current state of network. Then, while changes occur in the evolving network's topology, to dynamically update the random walks in reserve, so they do not introduce any bias. That way we are in position of utilizing the updated set of random walks to continuously learn accurate mappings from the evolving network to a low-dimension network representation. Moreover, we present an analytical method for determining the right time to obtain a new representation of the evolving network that balances accuracy and time performance. A thorough experimental evaluation is performed that demonstrates the effectiveness of our method against sensible baselines and varying conditions.
Collapse
|
29
|
|
30
|
Xue D, Zhang Y, Wang Y, Wang J, An F, Sun X, Yu Z. Quantitative proteomic analysis of sperm in unexplained recurrent pregnancy loss. Reprod Biol Endocrinol 2019; 17:52. [PMID: 31288842 PMCID: PMC6617596 DOI: 10.1186/s12958-019-0496-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/22/2019] [Accepted: 06/28/2019] [Indexed: 02/18/2023] Open
Abstract
BACKGROUND Recurrent pregnancy loss (RPL) refers to two or more spontaneous abortions that occur consecutively with the same spouse. RPL severely affects human reproduction health, and causes extreme physical and mental suffering to patients and their families. METHODS We used isobaric tags for relative and absolute quantitation (iTRAQ), which was coupled with liquid chromatography mass spectrometry (LC-MS) proteomic analysis, in order to identify differentially expressed proteins. Moreover, we used western blot to analyze differentially expressed proteins. RESULTS Of the 2350 non-redundant proteins identified, 38 proteins were significantly altered and were identified as potential biomarkers for RPL. The protein-protein interaction network constructed using GeneMANIA revealed that 35.55% displayed similar co-expression, 30.87% were predicted, and 20.95% had physical interaction characteristics. Based on Gene ontology classification and KEGG pathway enrichment analyses, the majority of these differentially expressed proteins were found to be related to biological regulation, metabolic and cellular processes, protein binding and different enzymes activities, as well as disorder of fat and glucose metabolic pathways. It is noteworthy that three metabolism related biomarkers (HK1, ACLY, and FASN) were further confirmed through western blot analysis. CONCLUSIONS These results suggest that these differentially expressed proteins may be used as biomarkers for RPL, and related signaling pathways may play crucial roles in male induced RPL.
Collapse
Affiliation(s)
- Dena Xue
- Center for Reproductive Medicine, Shandong Provincial Hospital Affiliated to Shandong University, Jinan, 250001, Shandong, China
- Department of Reproductive Medicine, Affiliated Hospital of Weifang Medical University, Weifang, Shandong Province, People's Republic of China
| | - Yi Zhang
- Department of Reproductive Medicine, Affiliated Hospital of Weifang Medical University, Weifang, Shandong Province, People's Republic of China
| | - Yixin Wang
- Department of Reproductive Medicine, Affiliated Hospital of Weifang Medical University, Weifang, Shandong Province, People's Republic of China
| | - Jun Wang
- Department of Reproductive Medicine, Affiliated Hospital of Weifang Medical University, Weifang, Shandong Province, People's Republic of China
| | - Fengxiao An
- Department of Reproductive Medicine, Affiliated Hospital of Weifang Medical University, Weifang, Shandong Province, People's Republic of China
| | - Xiaowei Sun
- Department of Reproductive Medicine, Affiliated Hospital of Weifang Medical University, Weifang, Shandong Province, People's Republic of China
| | - Zhenhai Yu
- Department of Reproductive Medicine, Affiliated Hospital of Weifang Medical University, Weifang, Shandong Province, People's Republic of China.
| |
Collapse
|
31
|
He D, Liu L, Wang Y, Sheng M. A Novel Genes Signature Associated with the Progression of Polycystic Ovary Syndrome. Pathol Oncol Res 2019; 26:575-582. [PMID: 31278444 DOI: 10.1007/s12253-019-00676-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/25/2018] [Accepted: 05/27/2019] [Indexed: 12/16/2022]
Abstract
To identify genes involving in the pathogenesis of polycystic ovary syndrome (PCOS). In this study, the comprehensive analysis of GSE8157 was downloaded. Overlapping genes of differentially expressed genes (DEGs) were identified, and enrichment analysis for these genes was performed. A modular network of differentially expressed genes was constructed by weighted gene co-expression network analyses (WGCNA), and a total of 322 differentially expressed genes in 5 stable modules were screened. The correlations of genes of the stable modules in BioGRID 3.4, STRING 10.5, HPRD9 databases were screened, and the interaction network of 104 DEGs was constructed. In addition, some genes and the key words were searched in CTD. A total of 596 differentially expressed genes were screened, including 379 genes that were up-regulated in case group and down-regulated in control group and treat group, and 217 genes that were down-regulated in case group and up-regulated in control group and treat group. The differentially expressed genes were enriched in PPAR signaling pathway, Neuroactive ligand-receptor interaction, cAMP signaling pathway, of which pathways were involved in the cancer development. Finally, 7 important target genes were identified, such as APOC3 was interacted with pioglitazone, ADCY2 involved in cAMP signaling pathway, and the genes (C3AR1, HRH2, GRIA1, MLNR and TAAR2) involved in neuroactive ligand-receptor interaction. In addition, the important target genes were significantly differential expression. These results implied that the 7 important target genes were played an important role in the development and progression of PCOS. Our study implied that genes had played a key role in the development and progression of PCOS, the results showed that microarray can be use as a method for the discovery of new biomarkers and therapeutic targets for PCOS.
Collapse
Affiliation(s)
- Dongyun He
- Reproductive Medical Center, Department of Gynecology and Obstetrics, China-Japan Union Hospital of Jilin University, No.126, Xiantai Road, Changchun, 130031, China
| | - Li Liu
- Reproductive Medical Center, Department of Gynecology and Obstetrics, China-Japan Union Hospital of Jilin University, No.126, Xiantai Road, Changchun, 130031, China
| | - Yang Wang
- Department of Dermatology, The Affiliated Hospital of Changchun University of Chinese Medicine, Changchun, 130031, China
| | - Minjia Sheng
- Reproductive Medical Center, Department of Gynecology and Obstetrics, China-Japan Union Hospital of Jilin University, No.126, Xiantai Road, Changchun, 130031, China.
| |
Collapse
|
32
|
Abstract
BACKGROUND Biological networks describes the mechanisms which govern cellular functions. Temporal networks show how these networks evolve over time. Studying the temporal progression of network topologies is of utmost importance since it uncovers how a network evolves and how it resists to external stimuli and internal variations. Two temporal networks have co-evolving subnetworks if the evolving topologies of these subnetworks remain similar to each other as the network topology evolves over a period of time. In this paper, we consider the problem of identifying co-evolving subnetworks given a pair of temporal networks, which aim to capture the evolution of molecules and their interactions over time. Although this problem shares some characteristics of the well-known network alignment problems, it differs from existing network alignment formulations as it seeks a mapping of the two network topologies that is invariant to temporal evolution of the given networks. This is a computationally challenging problem as it requires capturing not only similar topologies between two networks but also their similar evolution patterns. RESULTS We present an efficient algorithm, Tempo, for solving identifying co-evolving subnetworks with two given temporal networks. We formally prove the correctness of our method. We experimentally demonstrate that Tempo scales efficiently with the size of network as well as the number of time points, and generates statistically significant alignments-even when evolution rates of given networks are high. Our results on a human aging dataset demonstrate that Tempo identifies novel genes contributing to the progression of Alzheimer's, Huntington's and Type II diabetes, while existing methods fail to do so. CONCLUSIONS Studying temporal networks in general and human aging specifically using Tempo enables us to identify age related genes from non age related genes successfully. More importantly, Tempo takes the network alignment problem one huge step forward by moving beyond the classical static network models.
Collapse
Affiliation(s)
- Rasha Elhesha
- University of Florida, CISE Department, Gainesville, Florida, 32611, US
| | - Aisharjya Sarkar
- University of Florida, CISE Department, Gainesville, Florida, 32611, US
| | - Christina Boucher
- University of Florida, CISE Department, Gainesville, Florida, 32611, US
| | - Tamer Kahveci
- University of Florida, CISE Department, Gainesville, Florida, 32611, US.
| |
Collapse
|
33
|
Sachdev K, Gupta MK. A comprehensive review of feature based methods for drug target interaction prediction. J Biomed Inform 2019; 93:103159. [PMID: 30926470 DOI: 10.1016/j.jbi.2019.103159] [Citation(s) in RCA: 85] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2018] [Revised: 03/25/2019] [Accepted: 03/26/2019] [Indexed: 12/22/2022]
Abstract
Drug target interaction is a prominent research area in the field of drug discovery. It refers to the recognition of interactions between chemical compounds and the protein targets in the human body. Wet lab experiments to identify these interactions are expensive as well as time consuming. The computational methods of interaction prediction help limit the search space for these experiments. These computational methods can be divided into ligand based approaches, docking approaches and chemogenomic approaches. In this review, we aim to describe the various feature based chemogenomic methods for drug target interaction prediction. It provides a comprehensive overview of the various techniques, datasets, tools and metrics. The feature based methods have been categorized, explained and compared. A novel framework for drug target interaction prediction has also been proposed that aims to improve the performance of existing methods. To the best of our knowledge, this is the first comprehensive review focusing only on feature based methods of drug target interaction.
Collapse
Affiliation(s)
- Kanica Sachdev
- Computer Science and Engineering Department, SMVDU, J&K, India.
| | | |
Collapse
|
34
|
Dasgupta N, Kumar Thakur B, Chakraborty A, Das S. Butyrate-Induced In Vitro Colonocyte Differentiation Network Model Identifies ITGB1, SYK, CDKN2A, CHAF1A, and LRP1 as the Prognostic Markers for Colorectal Cancer Recurrence. Nutr Cancer 2018; 71:257-271. [PMID: 30475060 DOI: 10.1080/01635581.2018.1540715] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Numerous mechanisms are believed to contribute to the role of dietary fiber-derived butyrate in the protection against the development of colorectal cancers (CRCs). To identify the most crucial butyrate-regulated genes, we exploited whole genome microarray of HT-29 cells differentiated in vitro by butyrate treatment. Butyrate differentiates HT-29 cells by relaxing the perturbation, caused by mutations of Adenomatous polyposis coli (APC) and TP53 genes, the most frequent mutations observed in CRC. We constructed protein-protein interaction network (PPIN) with the differentially expressed genes after butyrate treatment and extracted the hub genes from the PPIN, which also participated in the APC-TP53 network. The idea behind this approach was that the expression of these hub genes also regulated cell differentiation, and subsequently CRC prognosis by evading the APC-TP53 mutational effect. We used mRNA expression profile of these critical hub genes from seven large CRC cohorts. Logistic Regression showed strong evidence for association of these common hubs with CRC recurrence. In this study, we exploited PPIN to reduce the dimension of microarray biologically and identified five prognostic markers for the CRC recurrence, which were validated across different datasets. Moreover, these five biomarkers we identified increase the predictive value of the TNM staging for CRC recurrence.
Collapse
Affiliation(s)
- Nirmalya Dasgupta
- a Tumor Initiation and Maintenance Program , Sanford Burnham Prebys Medical Discovery Institute , La Jolla , California, USA.,b Department of Clinical Medicine , National Institute of Cholera and Enteric Diseases , Beliaghata , Kolkata, India
| | - Bhupesh Kumar Thakur
- b Department of Clinical Medicine , National Institute of Cholera and Enteric Diseases , Beliaghata , Kolkata, India.,c Department of Immunology , University of Toronto , Toronto , Ontario, CANADA
| | - Abhijit Chakraborty
- d Division of Vaccine Discovery , La Jolla Institute for Allergy and Immunology , La Jolla , California, USA
| | - Santasabuj Das
- b Department of Clinical Medicine , National Institute of Cholera and Enteric Diseases , Beliaghata , Kolkata, India.,e Biomedical Informatics Centre, National Institute of Cholera and Enteric Diseases , Beliaghata , Kolkata, India
| |
Collapse
|
35
|
Haas JG, Weber J, Gonzalez O, Zimmer R, Griffiths SJ. Antiviral activity of the mineralocorticoid receptor NR3C2 against Herpes simplex virus Type 1 (HSV-1) infection. Sci Rep 2018; 8:15876. [PMID: 30367157 PMCID: PMC6203759 DOI: 10.1038/s41598-018-34241-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2018] [Accepted: 10/11/2018] [Indexed: 01/23/2023] Open
Abstract
Analysis of a genome-scale RNA interference screen of host factors affecting herpes simplex virus type 1 (HSV-1) revealed that the mineralocorticoid receptor (MR) inhibits HSV-1 replication. As a ligand-activated transcription factor the MR regulates sodium transport and blood pressure in the kidney in response to aldosterone, but roles have recently been elucidated for the MR in other cellular processes. Here, we show that the MR and other members of the mineralocorticoid signalling pathway including HSP90 and FKBP4, possess anti-viral activity against HSV-1 independent of their effect on sodium transport, as shown by sodium channel inhibitors. Expression of the MR is upregulated upon infection in an interferon (IFN) and viral transcriptional activator VP16-dependent fashion. Furthermore, the MR and VP16, together with the cellular co-activator Oct-1, transactivate the hormone response element (HRE) present in the MR promoter and those of its transcriptional targets. As the MR induces IFN expression, our data suggests the MR is involved in a positive feedback loop that controls HSV-1 infection.
Collapse
Affiliation(s)
- Jürgen G Haas
- Division of Infection and Pathway Medicine, University of Edinburgh, Edinburgh, EH16 4SB, UK
| | - Julia Weber
- Division of Infection and Pathway Medicine, University of Edinburgh, Edinburgh, EH16 4SB, UK
| | - Orland Gonzalez
- Institute for Informatics, Ludwig-Maximilians Universität München, 80333, München, Germany
| | - Ralf Zimmer
- Institute for Informatics, Ludwig-Maximilians Universität München, 80333, München, Germany
| | - Samantha J Griffiths
- Division of Infection and Pathway Medicine, University of Edinburgh, Edinburgh, EH16 4SB, UK.
| |
Collapse
|
36
|
Kalva S, Bindusree G, Alexander V, Madasamy P. Interactome based biomarker discovery for irritable bowel syndrome—A systems biology approach. Comput Biol Chem 2018; 76:218-224. [PMID: 30071397 DOI: 10.1016/j.compbiolchem.2018.07.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2017] [Revised: 11/12/2017] [Accepted: 07/04/2018] [Indexed: 02/07/2023]
|
37
|
Vijayan V, Milenkovic T. Multiple Network Alignment via MultiMAGNA+. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1669-1682. [PMID: 28829315 DOI: 10.1109/tcbb.2017.2740381] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Network alignment (NA) aims to find a node mapping that identifies topologically or functionally similar network regions between molecular networks of different species. Analogous to genomic sequence alignment, NA can be used to transfer biological knowledge from well- to poorly-studied species between aligned network regions. Pairwise NA (PNA) finds similar regions between two networks while multiple NA (MNA) can align more than two networks. We focus on MNA. Existing MNA methods aim to maximize total similarity over all aligned nodes (node conservation). Then, they evaluate alignment quality by measuring the amount of conserved edges, but only after the alignment is constructed. Directly optimizing edge conservation during alignment construction in addition to node conservation may result in superior alignments. Thus, we present a novel MNA method called multiMAGNA++ that can achieve this. Indeed, multiMAGNA++ outperforms or is on par with existing MNA methods, while often completing faster than existing methods. That is, multiMAGNA++ scales well to larger network data and can be parallelized effectively. During method evaluation, we also introduce new MNA quality measures to allow for more fair MNA method comparison compared to the existing alignment quality measures. The multiMAGNA++ code is available on the method's web page at http://nd.edu/~cone/multiMAGNA++/.
Collapse
|
38
|
Gu S, Johnson J, Faisal FE, Milenković T. From homogeneous to heterogeneous network alignment via colored graphlets. Sci Rep 2018; 8:12524. [PMID: 30131590 PMCID: PMC6104050 DOI: 10.1038/s41598-018-30831-w] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2018] [Accepted: 08/07/2018] [Indexed: 11/19/2022] Open
Abstract
Network alignment (NA) compares networks with the goal of finding a node mapping that uncovers highly similar (conserved) network regions. Existing NA methods are homogeneous, i.e., they can deal only with networks containing nodes and edges of one type. Due to increasing amounts of heterogeneous network data with nodes or edges of different types, we extend three recent state-of-the-art homogeneous NA methods, WAVE, MAGNA++, and SANA, to allow for heterogeneous NA for the first time. We introduce several algorithmic novelties. Namely, these existing methods compute homogeneous graphlet-based node similarities and then find high-scoring alignments with respect to these similarities, while simultaneously maximizing the amount of conserved edges. Instead, we extend homogeneous graphlets to their heterogeneous counterparts, which we then use to develop a new measure of heterogeneous node similarity. Also, we extend S3, a state-of-the-art measure of edge conservation for homogeneous NA, to its heterogeneous counterpart. Then, we find high-scoring alignments with respect to our heterogeneous node similarity and edge conservation measures. In evaluations on synthetic and real-world biological networks, our proposed heterogeneous NA methods lead to higher-quality alignments and better robustness to noise in the data than their homogeneous counterparts. The software and data from this work is available at https://nd.edu/~cone/colored_graphlets/.
Collapse
Affiliation(s)
- Shawn Gu
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - John Johnson
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Fazle E Faisal
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, 46556, USA
- Eck Institute for Global Health and Interdisciplinary Center for Network Science and Applications (iCeNSA), University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Tijana Milenković
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, 46556, USA.
- Eck Institute for Global Health and Interdisciplinary Center for Network Science and Applications (iCeNSA), University of Notre Dame, Notre Dame, IN, 46556, USA.
| |
Collapse
|
39
|
Rizzolo K, Huen J, Kumar A, Phanse S, Vlasblom J, Kakihara Y, Zeineddine HA, Minic Z, Snider J, Wang W, Pons C, Seraphim TV, Boczek EE, Alberti S, Costanzo M, Myers CL, Stagljar I, Boone C, Babu M, Houry WA. Features of the Chaperone Cellular Network Revealed through Systematic Interaction Mapping. Cell Rep 2018; 20:2735-2748. [PMID: 28903051 DOI: 10.1016/j.celrep.2017.08.074] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2017] [Revised: 07/21/2017] [Accepted: 08/23/2017] [Indexed: 10/18/2022] Open
Abstract
A comprehensive view of molecular chaperone function in the cell was obtained through a systematic global integrative network approach based on physical (protein-protein) and genetic (gene-gene or epistatic) interaction mapping. This allowed us to decipher interactions involving all core chaperones (67) and cochaperones (15) of Saccharomyces cerevisiae. Our analysis revealed the presence of a large chaperone functional supercomplex, which we named the naturally joined (NAJ) chaperone complex, encompassing Hsp40, Hsp70, Hsp90, AAA+, CCT, and small Hsps. We further found that many chaperones interact with proteins that form foci or condensates under stress conditions. Using an in vitro reconstitution approach, we demonstrate condensate formation for the highly conserved AAA+ ATPases Rvb1 and Rvb2, which are part of the R2TP complex that interacts with Hsp90. This expanded view of the chaperone network in the cell clearly demonstrates the distinction between chaperones having broad versus narrow substrate specificities in protein homeostasis.
Collapse
Affiliation(s)
- Kamran Rizzolo
- Department of Biochemistry, University of Toronto, Toronto, ON M5G 1M1, Canada
| | - Jennifer Huen
- Department of Biochemistry, University of Toronto, Toronto, ON M5G 1M1, Canada
| | - Ashwani Kumar
- Department of Computer Science, University of Regina, Regina, SK S4S 0A2, Canada
| | - Sadhna Phanse
- The Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada; Department of Biochemistry, University of Regina, Regina, SK S4S 0A2, Canada
| | - James Vlasblom
- Department of Biochemistry, University of Regina, Regina, SK S4S 0A2, Canada
| | - Yoshito Kakihara
- Department of Biochemistry, University of Toronto, Toronto, ON M5G 1M1, Canada
| | | | - Zoran Minic
- Department of Biochemistry, University of Regina, Regina, SK S4S 0A2, Canada
| | - Jamie Snider
- The Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
| | - Wen Wang
- Department of Computer Science & Engineering, University of Minnesota-Twin Cities, Minneapolis, MN 55455, USA; Program in Bioinformatics and Computational Biology, University of Minnesota-Twin Cities, Minneapolis, MN 55455, USA
| | - Carles Pons
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute for Science and Technology, Barcelona, Catalonia, Spain
| | - Thiago V Seraphim
- Department of Biochemistry, University of Toronto, Toronto, ON M5G 1M1, Canada; Department of Biochemistry, University of Regina, Regina, SK S4S 0A2, Canada
| | - Edgar Erik Boczek
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
| | - Simon Alberti
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
| | - Michael Costanzo
- The Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
| | - Chad L Myers
- Department of Computer Science & Engineering, University of Minnesota-Twin Cities, Minneapolis, MN 55455, USA; Program in Bioinformatics and Computational Biology, University of Minnesota-Twin Cities, Minneapolis, MN 55455, USA
| | - Igor Stagljar
- Department of Biochemistry, University of Toronto, Toronto, ON M5G 1M1, Canada; The Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 3E1, Canada
| | - Charles Boone
- The Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 3E1, Canada
| | - Mohan Babu
- Department of Biochemistry, University of Regina, Regina, SK S4S 0A2, Canada.
| | - Walid A Houry
- Department of Biochemistry, University of Toronto, Toronto, ON M5G 1M1, Canada; Department of Chemistry, University of Toronto, Toronto, ON M5S 3H6, Canada.
| |
Collapse
|
40
|
|
41
|
Systematic Evaluation of Molecular Networks for Discovery of Disease Genes. Cell Syst 2018; 6:484-495.e5. [PMID: 29605183 DOI: 10.1016/j.cels.2018.03.001] [Citation(s) in RCA: 181] [Impact Index Per Article: 25.9] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2017] [Revised: 12/19/2017] [Accepted: 02/28/2018] [Indexed: 12/27/2022]
Abstract
Gene networks are rapidly growing in size and number, raising the question of which networks are most appropriate for particular applications. Here, we evaluate 21 human genome-wide interaction networks for their ability to recover 446 disease gene sets identified through literature curation, gene expression profiling, or genome-wide association studies. While all networks have some ability to recover disease genes, we observe a wide range of performance with STRING, ConsensusPathDB, and GIANT networks having the best performance overall. A general tendency is that performance scales with network size, suggesting that new interaction discovery currently outweighs the detrimental effects of false positives. Correcting for size, we find that the DIP network provides the highest efficiency (value per interaction). Based on these results, we create a parsimonious composite network with both high efficiency and performance. This work provides a benchmark for selection of molecular networks in human disease research.
Collapse
|
42
|
Bisdemethoxycurcumin exerts pro-apoptotic effects in human pancreatic adenocarcinoma cells through mitochondrial dysfunction and a GRP78-dependent pathway. Oncotarget 2018; 7:83641-83656. [PMID: 27845899 PMCID: PMC5347794 DOI: 10.18632/oncotarget.13272] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2016] [Accepted: 10/14/2016] [Indexed: 12/27/2022] Open
Abstract
Pancreatic cancer is a highly aggressive malignancy, which is intrinsically resistant to current chemotherapies. Herein, we investigate whether bisdemethoxycurcumin (BDMC), a derivative of curcumin, potentiates gemcitabine in human pancreatic cancer cells. The result suggests that BDMC sensitizes gemcitabine by inducing mitochondrial dysfunctions and apoptosis in PANC-1 and MiaPaCa-2 pancreatic cancer cells. Utilizing two-dimensional gel electrophoresis and mass spectrometry, we identify 13 essential proteins with significantly altered expressions in response to gemcitabine alone or combined with BDMC. Protein-protein interaction network analysis pinpoints glucose-regulated protein 78 (GRP78) as the key hub activated by BDMC. We then reveal that BDMC upregulates GRP78 and facilitates apoptosis through eIF2α/CHOP pathway. Moreover, DJ-1 and prohibitin, two identified markers of chemoresistance, are increased by gemcitabine in PANC-1 cells. This could be meaningfully reversed by BDMC, suggesting that BDMC partially offsets the chemoresistance induced by gemcitabine. In summary, these findings show that BDMC promotes apoptosis through a GRP78-dependent pathway and mitochondrial dysfunctions, and potentiates the antitumor effect of gemcitabine in human pancreatic cancer cells.
Collapse
|
43
|
Chasapis CT. Shared gene-network signatures between the human heavy metal proteome and neurological disorders and cancer types. Metallomics 2018; 10:1678-1686. [DOI: 10.1039/c8mt00271a] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
In this work, for the first time, the human heavy metal proteome was predicted.
Collapse
Affiliation(s)
- Christos T. Chasapis
- Institute of Chemical Engineering Sciences
- Foundation for Research & Technology – Hellas (FORTH/ICE-HT)
- Patras
- Greece
| |
Collapse
|
44
|
Csabai L, Ölbei M, Budd A, Korcsmáros T, Fazekas D. SignaLink: Multilayered Regulatory Networks. Methods Mol Biol 2018; 1819:53-73. [PMID: 30421399 DOI: 10.1007/978-1-4939-8618-7_3] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Biological networks are graphs used to represent the inner workings of a biological system. Networks describe the relationships of the elements of biological systems using edges and nodes. However, the resulting representation of the system can sometimes be too simplistic to usefully model reality. By combining several different interaction types within one larger multilayered biological network, tools such as SignaLink provide a more nuanced view than those relying on single-layer networks (where edges only describe one kind of interaction). Multilayered networks display connections between multiple networks (i.e., protein-protein interactions and their transcriptional and posttranscriptional regulators), each one of them describing a specific set of connections. Multilayered networks also allow us to depict cross talk between cellular systems, which is a more realistic way of describing molecular interactions. They can be used to collate networks from different sources into one multilayered structure, which makes them useful as an analytic tool as well.
Collapse
Affiliation(s)
| | - Márton Ölbei
- Earlham Institute, Norwich Research Park, Norwich, UK.,Quadram Institute, Norwich Research Park, Norwich, UK
| | - Aidan Budd
- Earlham Institute, Norwich Research Park, Norwich, UK
| | - Tamás Korcsmáros
- Eötvös Loránd University, Budapest, Hungary. .,Earlham Institute, Norwich Research Park, Norwich, UK. .,Quadram Institute, Norwich Research Park, Norwich, UK.
| | - Dávid Fazekas
- Eötvös Loránd University, Budapest, Hungary.,Earlham Institute, Norwich Research Park, Norwich, UK
| |
Collapse
|
45
|
Majewska M, Wysokińska H, Kuźma Ł, Szymczyk P. Eukaryotic and prokaryotic promoter databases as valuable tools in exploring the regulation of gene transcription: a comprehensive overview. Gene 2017; 644:38-48. [PMID: 29104165 DOI: 10.1016/j.gene.2017.10.079] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2017] [Revised: 07/26/2017] [Accepted: 10/27/2017] [Indexed: 01/02/2023]
Abstract
The complete exploration of the regulation of gene expression remains one of the top-priority goals for researchers. As the regulation is mainly controlled at the level of transcription by promoters, study on promoters and findings are of great importance. This review summarizes forty selected databases that centralize experimental and theoretical knowledge regarding the organization of promoters, interacting transcription factors (TFs) and microRNAs (miRNAs) in many eukaryotic and prokaryotic species. The presented databases offer researchers valuable support in elucidating the regulation of gene transcription.
Collapse
Affiliation(s)
- Małgorzata Majewska
- Department of Biology and Pharmaceutical Botany, Medical University of Lodz, 90-151 Lodz, Poland.
| | - Halina Wysokińska
- Department of Biology and Pharmaceutical Botany, Medical University of Lodz, 90-151 Lodz, Poland
| | - Łukasz Kuźma
- Department of Biology and Pharmaceutical Botany, Medical University of Lodz, 90-151 Lodz, Poland
| | - Piotr Szymczyk
- Department of Pharmaceutical Biotechnology, Medical University of Lodz, 90-151 Lodz, Poland
| |
Collapse
|
46
|
Rietman EA, Scott JG, Tuszynski JA, Klement GL. Personalized anticancer therapy selection using molecular landscape topology and thermodynamics. Oncotarget 2017; 8:18735-18745. [PMID: 27793055 PMCID: PMC5386643 DOI: 10.18632/oncotarget.12932] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2016] [Accepted: 10/12/2016] [Indexed: 01/19/2023] Open
Abstract
Personalized anticancer therapy requires continuous consolidation of emerging bioinformatics data into meaningful and accurate information streams. The use of novel mathematical and physical approaches, namely topology and thermodynamics can enable merging differing data types for improved accuracy in selecting therapeutic targets. We describe a method that uses chemical thermodynamics and two topology measures to link RNA-seq data from individual patients with academically curated protein-protein interaction networks to select clinically relevant targets for treatment of low-grade glioma (LGG). We show that while these three histologically distinct tumor types (astrocytoma, oligoastrocytoma, and oligodendroglioma) may share potential therapeutic targets, the majority of patients would benefit from more individualized therapies. The method involves computing Gibbs free energy of the protein-protein interaction network and applying a topological filtration on the energy landscape to produce a subnetwork known as persistent homology. We then determine the most likely best target for therapeutic intervention using a topological measure of the network known as Betti number. We describe the algorithm and discuss its application to several patients.
Collapse
Affiliation(s)
- Edward A Rietman
- BINDS Laboratory, College of Information and Computer Sciences, University of Massachusetts Amherst, Amherst, MA, USA
| | - Jacob G Scott
- Wolfson Center for Mathematical Biology, Mathematical Institute, University of Oxford, Oxford, UK.,Department of Integrated Mathematical Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, USA
| | - Jack A Tuszynski
- Department of Oncology, Faculty of Medicine and Dentistry, University of Alberta, Edmonton, Alberta, Canada.,Department of Physics, University of Alberta, Edmonton, Alberta, Canada
| | - Giannoula Lakka Klement
- Molecular Oncology Research Institute, Tufts Medical Center, Boston, MA, USA.,Pediatric Hematology Oncology, Floating Hospital for Children at Tufts Medical Center, Boston, MA, USA.,Sackler School of Graduate Biomedical Sciences at Tufts University, Boston, MA, USA
| |
Collapse
|
47
|
Yoo B, Faisal FE, Chen H, Milenkovic T. Improving Identification of Key Players in Aging via Network De-Noising and Core Inference. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:1056-1069. [PMID: 26529776 DOI: 10.1109/tcbb.2015.2495170] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Current "ground truth" knowledge about human aging has been obtained by transferring aging-related knowledge from well-studied model species via sequence homology or by studying human gene expression data. Since proteins function by interacting with each other, analyzing protein-protein interaction (PPI) networks in the context of aging is promising. Unlike existing static network research of aging, since cellular functioning is dynamic, we recently integrated the static human PPI network with aging-related gene expression data to form dynamic, age-specific networks. Then, we predicted as key players in aging those proteins whose network topologies significantly changed with age. Since current networks are noisy , here, we use link prediction to de-noise the human network and predict improved key players in aging from the de-noised data. Indeed, de-noising gives more significant overlap between the predicted data and the "ground truth" aging-related data. Yet, we obtain novel predictions, which we validate in the literature. Also, we improve the predictions by an alternative strategy: removing "redundant" edges from the age-specific networks and using the resulting age-specific network "cores" to study aging. We produce new knowledge from dynamic networks encompassing multiple data types, via network de-noising or core inference, complementing the existing knowledge obtained from sequence or expression data.
Collapse
|
48
|
Dai X, Hua T, Hong T. Integrated diagnostic network construction reveals a 4-gene panel and 5 cancer hallmarks driving breast cancer heterogeneity. Sci Rep 2017; 7:6827. [PMID: 28754978 PMCID: PMC5533795 DOI: 10.1038/s41598-017-07189-6] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2017] [Accepted: 06/23/2017] [Indexed: 12/26/2022] Open
Abstract
Breast cancer encompasses a group of heterogeneous diseases, each associated with distinct clinical implications. Dozens of molecular biomarkers capable of categorizing tumors into clinically relevant subgroups have been proposed which, though considerably contribute in precision medicine, complicate our understandings toward breast cancer subtyping and its clinical translation. To decipher the networking of markers with diagnostic roles on breast carcinomas, we constructed the diagnostic networks by incorporating 6 publically available gene expression datasets with protein interaction data retrieved from BioGRID on previously identified 1015 genes with breast cancer subtyping roles. The Greedy algorithm and mutual information were used to construct the integrated diagnostic network, resulting in 37 genes enclosing 43 interactions. Four genes, FAM134B, KIF2C, ALCAM, KIF1A, were identified having comparable subtyping efficacies with the initial 1015 genes evaluated by hierarchical clustering and cross validations that deploy support vector machine and k nearest neighbor algorithms. Pathway, Gene Ontology, and proliferation marker enrichment analyses collectively suggest 5 primary cancer hallmarks driving breast cancer differentiation, with those contributing to uncontrolled proliferation being the most prominent. Our results propose a 37-gene integrated diagnostic network implicating 5 cancer hallmarks that drives breast cancer heterogeneity and, in particular, a 4-gene panel with clinical diagnostic translation potential.
Collapse
Affiliation(s)
- Xiaofeng Dai
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, Wuxi, China.
- The Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China.
| | - Tongyan Hua
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, Wuxi, China
- The Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China
| | - Tingting Hong
- Department of medical oncology, the affiliated hospital of Jiangnan University, the fourth people's hospital of Wuxi, Wuxi, China
| |
Collapse
|
49
|
Tripathi S, Lloyd-Price J, Ribeiro A, Yli-Harja O, Dehmer M, Emmert-Streib F. sgnesR: An R package for simulating gene expression data from an underlying real gene network structure considering delay parameters. BMC Bioinformatics 2017; 18:325. [PMID: 28676075 PMCID: PMC5496254 DOI: 10.1186/s12859-017-1731-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2016] [Accepted: 06/15/2017] [Indexed: 01/04/2023] Open
Abstract
Background sgnesR (Stochastic Gene Network Expression Simulator in R) is an R package that provides an interface to simulate gene expression data from a given gene network using the stochastic simulation algorithm (SSA). The package allows various options for delay parameters and can easily included in reactions for promoter delay, RNA delay and Protein delay. A user can tune these parameters to model various types of reactions within a cell. As examples, we present two network models to generate expression profiles. We also demonstrated the inference of networks and the evaluation of association measure of edge and non-edge components from the generated expression profiles. Results The purpose of sgnesR is to enable an easy to use and a quick implementation for generating realistic gene expression data from biologically relevant networks that can be user selected. Conclusions sgnesR is freely available for academic use. The R package has been tested for R 3.2.0 under Linux, Windows and Mac OS X. Electronic supplementary material The online version of this article (doi:10.1186/s12859-017-1731-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Shailesh Tripathi
- Predictive Medicine and Data Analytics Lab, Department of Signal Processing, Tampere University of Technology, Tampere, Finland
| | - Jason Lloyd-Price
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Harvard University, Boston, USA.,Laboratory of Biosystem Dynamics, Department of Signal Processing, Tampere University of Technology, Tampere, Finland
| | - Andre Ribeiro
- Laboratory of Biosystem Dynamics, Department of Signal Processing, Tampere University of Technology, Tampere, Finland.,Institute of Biosciences and Medical Technology, Tampere, Finland
| | - Olli Yli-Harja
- Institute of Biosciences and Medical Technology, Tampere, Finland.,Computational Systems Biology, Department of Signal Processing, Tampere University of Technology, Tampere, Finland
| | - Matthias Dehmer
- Institute for Theoretical Informatics, Mathematics and Operations Research, Department of Computer Science, Universität der Bundeswehr München, Munich, Germany
| | - Frank Emmert-Streib
- Predictive Medicine and Data Analytics Lab, Department of Signal Processing, Tampere University of Technology, Tampere, Finland. .,Institute of Biosciences and Medical Technology, Tampere, Finland.
| |
Collapse
|
50
|
Le DH, Pham VH. HGPEC: a Cytoscape app for prediction of novel disease-gene and disease-disease associations and evidence collection based on a random walk on heterogeneous network. BMC SYSTEMS BIOLOGY 2017; 11:61. [PMID: 28619054 PMCID: PMC5472867 DOI: 10.1186/s12918-017-0437-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/03/2017] [Accepted: 05/31/2017] [Indexed: 12/31/2022]
Abstract
Background Finding gene-disease and disease-disease associations play important roles in the biomedical area and many prioritization methods have been proposed for this goal. Among them, approaches based on a heterogeneous network of genes and diseases are considered state-of-the-art ones, which achieve high prediction performance and can be used for diseases with/without known molecular basis. Results Here, we developed a Cytoscape app, namely HGPEC, based on a random walk with restart algorithm on a heterogeneous network of genes and diseases. This app can prioritize candidate genes and diseases by employing a heterogeneous network consisting of a network of genes/proteins and a phenotypic disease similarity network. Based on the rankings, novel disease-gene and disease-disease associations can be identified. These associations can be supported with network- and rank-based visualization as well as evidences and annotations from biomedical data. A case study on prediction of novel breast cancer-associated genes and diseases shows the abilities of HGPEC. In addition, we showed prominence in the performance of HGPEC compared to other tools for prioritization of candidate disease genes. Conclusions Taken together, our app is expected to effectively predict novel disease-gene and disease-disease associations and support network- and rank-based visualization as well as biomedical evidences for such the associations. Electronic supplementary material The online version of this article (doi:10.1186/s12918-017-0437-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Duc-Hau Le
- Vinmec Research Institute of Stem Cell and Gene Technology, 458 Minh Khai, Hai Ba Trung, Hanoi, Vietnam.,Thuyloi University, 175 Tay Son, Dong Da, Hanoi, Vietnam
| | - Van-Huy Pham
- Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam.
| |
Collapse
|