1
|
Li B, Li X, Tang X, Wang J. Prediction and Evaluation of Coronavirus and Human Protein-Protein Interactions Integrating Five Different Computational Methods. Proteins 2025. [PMID: 40231383 DOI: 10.1002/prot.26826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2024] [Revised: 03/08/2025] [Accepted: 03/26/2025] [Indexed: 04/16/2025]
Abstract
The high lethality and infectiousness of coronaviruses, particularly SARS-Cov-2, pose a significant threat to human society. Understanding coronaviruses, especially the interactions between these viruses and humans, is crucial for mitigating the coronavirus pandemic. In this study, we conducted a comprehensive comparison and evaluation of five prevalent computational methods: interolog mapping, domain-domain interaction methodology, domain-motif interaction methodology, structure-based approaches, and machine learning techniques. These methods were assessed using unbiased datasets that include C1, C2h, C2v, and C3 test sets. Ultimately, we integrated these five methodologies into a unified model for predicting protein-protein interactions (PPIs) between coronaviruses and human proteins. Our final model demonstrates relatively better performance, particularly with the C2v and C3 test sets, which are frequently used datasets in practical applications. Based on this model, we further established a high-confidence PPI network between coronaviruses and humans, consisting of 18,012 interactions between 3843 human proteins and 129 coronavirus proteins. The reliability of our predictions was further validated through the current knowledge framework and network analysis. This study is anticipated to enhance mechanistic understanding of the coronavirus-human relationship a while facilitating the rediscovery of antiviral drug targets. The source codes and datasets are accessible at https://github.com/covhppilab/CoVHPPI.
Collapse
Affiliation(s)
- Binghua Li
- Key Laboratory of Smart Farming for Agricultural Animals, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
- College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Xiaoyu Li
- Key Laboratory of Smart Farming for Agricultural Animals, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
- College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Xian Tang
- Key Laboratory of Smart Farming for Agricultural Animals, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
- College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Jia Wang
- Key Laboratory of Smart Farming for Agricultural Animals, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
- College of Informatics, Huazhong Agricultural University, Wuhan, China
| |
Collapse
|
2
|
Anyaegbunam UA, Vagiona AC, ten Cate V, Bauer K, Schmidlin T, Distler U, Tenzer S, Araldi E, Bindila L, Wild P, Andrade-Navarro MA. A Map of the Lipid-Metabolite-Protein Network to Aid Multi-Omics Integration. Biomolecules 2025; 15:484. [PMID: 40305217 PMCID: PMC12024871 DOI: 10.3390/biom15040484] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2025] [Revised: 03/13/2025] [Accepted: 03/20/2025] [Indexed: 05/02/2025] Open
Abstract
The integration of multi-omics data offers transformative potential for elucidating complex molecular mechanisms underlying biological processes and diseases. In this study, we developed a lipid-metabolite-protein network that combines a protein-protein interaction network and enzymatic and genetic interactions of proteins with metabolites and lipids to provide a unified framework for multi-omics integration. Using hyperbolic embedding, the network visualizes connections across omics layers, accessible through a user-friendly Shiny R (version 1.10.0) software package. This framework ranks molecules across omics layers based on functional proximity, enabling intuitive exploration. Application in a cardiovascular disease (CVD) case study identified lipids and metabolites associated with CVD-related proteins. The analysis confirmed known associations, like cholesterol esters and sphingomyelin, and highlighted potential novel biomarkers, such as 4-imidazoleacetate and indoleacetaldehyde. Furthermore, we used the network to analyze empagliflozin's temporal effects on lipid metabolism. Functional enrichment analysis of proteins associated with lipid signatures revealed dynamic shifts in biological processes, with early effects impacting phospholipid metabolism and long-term effects affecting sphingolipid biosynthesis. Our framework offers a versatile tool for hypothesis generation, functional analysis, and biomarker discovery. By bridging molecular layers, this approach advances our understanding of disease mechanisms and therapeutic effects, with broad applications in computational biology and precision medicine.
Collapse
Affiliation(s)
- Uchenna Alex Anyaegbunam
- Computational Biology and Data Mining Group (CBDM), Institute of Organismic and Molecular Evolution (iOME), Johannes Gutenberg University, 55122 Mainz, Germany
| | - Aimilia-Christina Vagiona
- Computational Biology and Data Mining Group (CBDM), Institute of Organismic and Molecular Evolution (iOME), Johannes Gutenberg University, 55122 Mainz, Germany
| | - Vincent ten Cate
- Preventive Cardiology and Preventive Medicine, Department of Cardiology, University Medical Center, Johannes-Gutenberg University Mainz, Langenbeckstr. 1, 55131 Mainz, Germany
- Clinical Epidemiology and Systems Medicine, Center for Thrombosis and Hemostasis (CTH), University Medical Center, 55131 Mainz, Germany
- German Center for Cardiovascular Research (DZHK), Partner Site Rhine Main, University Medical Center, Johannes-Gutenberg University Mainz, 55131 Mainz, Germany
| | - Katrin Bauer
- Preventive Cardiology and Preventive Medicine, Department of Cardiology, University Medical Center, Johannes-Gutenberg University Mainz, Langenbeckstr. 1, 55131 Mainz, Germany
- German Center for Cardiovascular Research (DZHK), Partner Site Rhine Main, University Medical Center, Johannes-Gutenberg University Mainz, 55131 Mainz, Germany
- Computational Systems Medicine, Center for Thrombosis and Hemostasis (CTH), 55131 Mainz, Germany
| | - Thierry Schmidlin
- Institute of Immunology, University Medical Center, Johannes-Gutenberg University Mainz, 55131 Mainz, Germany
- Research Centre for Immunotherapy (FZI), University Medical Center, Johannes-Gutenberg University Mainz, 55131 Mainz, Germany
| | - Ute Distler
- Institute of Immunology, University Medical Center, Johannes-Gutenberg University Mainz, 55131 Mainz, Germany
- Research Centre for Immunotherapy (FZI), University Medical Center, Johannes-Gutenberg University Mainz, 55131 Mainz, Germany
| | - Stefan Tenzer
- Institute of Immunology, University Medical Center, Johannes-Gutenberg University Mainz, 55131 Mainz, Germany
- Research Centre for Immunotherapy (FZI), University Medical Center, Johannes-Gutenberg University Mainz, 55131 Mainz, Germany
| | - Elisa Araldi
- Preventive Cardiology and Preventive Medicine, Department of Cardiology, University Medical Center, Johannes-Gutenberg University Mainz, Langenbeckstr. 1, 55131 Mainz, Germany
- German Center for Cardiovascular Research (DZHK), Partner Site Rhine Main, University Medical Center, Johannes-Gutenberg University Mainz, 55131 Mainz, Germany
- Computational Systems Medicine, Center for Thrombosis and Hemostasis (CTH), 55131 Mainz, Germany
- Systems Medicine Laboratory, Department of Medicine and Surgery, University of Parma, 43121 Parma, Italy
| | - Laura Bindila
- Institute of Physiological Chemistry, University Medical Center, 55131 Mainz, Germany
| | - Philipp Wild
- Preventive Cardiology and Preventive Medicine, Department of Cardiology, University Medical Center, Johannes-Gutenberg University Mainz, Langenbeckstr. 1, 55131 Mainz, Germany
- Clinical Epidemiology and Systems Medicine, Center for Thrombosis and Hemostasis (CTH), University Medical Center, 55131 Mainz, Germany
- German Center for Cardiovascular Research (DZHK), Partner Site Rhine Main, University Medical Center, Johannes-Gutenberg University Mainz, 55131 Mainz, Germany
| | - Miguel A. Andrade-Navarro
- Computational Biology and Data Mining Group (CBDM), Institute of Organismic and Molecular Evolution (iOME), Johannes Gutenberg University, 55122 Mainz, Germany
| |
Collapse
|
3
|
Vagiona AC, Notopoulou S, Zdráhal Z, Gonçalves-Kulik M, Petrakis S, Andrade-Navarro MA. Prediction of protein interactions with function in protein (de-)phosphorylation. PLoS One 2025; 20:e0319084. [PMID: 40029919 PMCID: PMC11875375 DOI: 10.1371/journal.pone.0319084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2024] [Accepted: 01/28/2025] [Indexed: 03/06/2025] Open
Abstract
Protein-protein interactions (PPIs) form a complex network called "interactome" that regulates many functions in the cell. In recent years, there is an increasing accumulation of evidence supporting the existence of a hyperbolic geometry underlying the network representation of complex systems such as the interactome. In particular, it has been shown that the embedding of the human Protein-Interaction Network (hPIN) in hyperbolic space (H2) captures biologically relevant information. Here we explore whether this mapping contains information that would allow us to predict the function of PPIs, more specifically interactions related to post-translational modification (PTM). We used a random forest algorithm to predict PTM-related directed PPIs, concretely, protein phosphorylation and dephosphorylation, based on hyperbolic properties and centrality measures of the hPIN mapped in H2. To evaluate the efficacy of our algorithm, we predicted PTM-related PPIs of ataxin-1, a protein which is responsible for Spinocerebellar Ataxia type 1 (SCA1). Proteomics analysis in a cellular model revealed that several of the predicted PTM-PPIs were indeed dysregulated in a SCA1-related disease network. A compact cluster composed of ataxin-1, its dysregulated PTM-PPIs and their common upstream regulators may represent critical interactions for disease pathology. Thus, our algorithm may infer phosphorylation activity on proteins through directed PPIs.
Collapse
Affiliation(s)
- Aimilia-Christina Vagiona
- Faculty of Biology, Insitute of Organismic and Molecular Evolution, Johannes Gutenberg University, Biozentrum I, Mainz, Germany
| | - Sofia Notopoulou
- Institute of Applied Biosciences/Centre for Research and Technology Hellas, Thessaloniki, Greece
| | - Zbyněk Zdráhal
- Central European Institute of Technology, Masaryk University, Brno, Czech Republic
| | - Mariane Gonçalves-Kulik
- Faculty of Biology, Insitute of Organismic and Molecular Evolution, Johannes Gutenberg University, Biozentrum I, Mainz, Germany
| | - Spyros Petrakis
- Institute of Applied Biosciences/Centre for Research and Technology Hellas, Thessaloniki, Greece
| | - Miguel A. Andrade-Navarro
- Faculty of Biology, Insitute of Organismic and Molecular Evolution, Johannes Gutenberg University, Biozentrum I, Mainz, Germany
| |
Collapse
|
4
|
Ambreen S, Umar M, Noor A, Jain H, Ali R. Advanced AI and ML frameworks for transforming drug discovery and optimization: With innovative insights in polypharmacology, drug repurposing, combination therapy and nanomedicine. Eur J Med Chem 2025; 284:117164. [PMID: 39721292 DOI: 10.1016/j.ejmech.2024.117164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Revised: 11/24/2024] [Accepted: 11/27/2024] [Indexed: 12/28/2024]
Abstract
Artificial Intelligence (AI) and Machine Learning (ML) are transforming drug discovery by overcoming traditional challenges like high costs, time-consuming, and frequent failures. AI-driven approaches streamline key phases, including target identification, lead optimization, de novo drug design, and drug repurposing. Frameworks such as deep neural networks (DNNs), convolutional neural networks (CNNs), and deep reinforcement learning (DRL) models have shown promise in identifying drug targets, optimizing delivery systems, and accelerating drug repurposing. Generative adversarial networks (GANs) and variational autoencoders (VAEs) aid de novo drug design by creating novel drug-like compounds with desired properties. Case studies, such as DDR1 kinase inhibitors designed using generative models and CDK20 inhibitors developed via structure-based methods, highlight AI's ability to produce highly specific therapeutics. Models like SNF-CVAE and DeepDR further advance drug repurposing by uncovering new therapeutic applications for existing drugs. Advanced ML algorithms enhance precision in predicting drug efficacy, toxicity, and ADME-Tox properties, reducing development costs and improving drug-target interactions. AI also supports polypharmacology by optimizing multi-target drug interactions and enhances combination therapy through predictions of drug synergies and antagonisms. In nanomedicine, AI models like CURATE.AI and the Hartung algorithm optimize personalized treatments by predicting toxicological risks and real-time dosing adjustments with high accuracy. Despite its potential, challenges like data quality, model interpretability, and ethical concerns must be addressed. High-quality datasets, transparent models, and unbiased algorithms are essential for reliable AI applications. As AI continues to evolve, it is poised to revolutionize drug discovery and personalized medicine, advancing therapeutic development and patient care.
Collapse
Affiliation(s)
- Subiya Ambreen
- Department of Pharmaceutical Chemistry, Delhi Institute of Pharmaceutical Sciences and Research (DIPSAR), DPSRU, Pushp Vihar, New Delhi, 110017, India
| | - Mohammad Umar
- Department of Pharmaceutical Chemistry, Delhi Institute of Pharmaceutical Sciences and Research (DIPSAR), DPSRU, Pushp Vihar, New Delhi, 110017, India
| | - Aaisha Noor
- Department of Pharmaceutical Chemistry, Delhi Institute of Pharmaceutical Sciences and Research (DIPSAR), DPSRU, Pushp Vihar, New Delhi, 110017, India
| | - Himangini Jain
- Department of Pharmaceutical Chemistry, Delhi Institute of Pharmaceutical Sciences and Research (DIPSAR), DPSRU, Pushp Vihar, New Delhi, 110017, India
| | - Ruhi Ali
- Department of Pharmaceutical Chemistry, Delhi Institute of Pharmaceutical Sciences and Research (DIPSAR), DPSRU, Pushp Vihar, New Delhi, 110017, India.
| |
Collapse
|
5
|
Guo Y, Liu L, Lin A. Improving the identification of cancer driver modules using deep features learned from multi-omics data. Comput Biol Med 2025; 184:109322. [PMID: 39522132 DOI: 10.1016/j.compbiomed.2024.109322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 10/14/2024] [Accepted: 10/21/2024] [Indexed: 11/16/2024]
Abstract
Identifying the cancer driver modules or pathways is crucial to understanding the fundamental mechanisms of cancer occurrence and progression. The rapid abundance of cancer omics data provides unprecedented opportunities to study the driver modules in cancer, and many computational methods have been developed in recent years. However, most existing methods have limitations in considering different types of cancer omics data and cannot effectively learn informative omics features for integrated identification of driver modules. In this paper, we introduce a new integrated framework to accurately identify the cancer driver modules by integrating the protein-protein interaction network, transcriptional regulatory network, gene expression and mutation data in cancer. We first develop a series of methods to learn the deep features of functional connectivity between genes in each omics data and then construct an integrated gene functional coherence network. Furthermore, we present a two-step module mining method to efficiently identify the cancer driver modules from the integrated gene functional coherence network. Systematic experiments in three cancer types demonstrate that the proposed framework can obtain more significant driver modules than most existing methods, and some identified driver modules are associated with clinical survival phenotypes.
Collapse
Affiliation(s)
- Yang Guo
- School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China.
| | - Lingling Liu
- School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
| | - Aofeng Lin
- School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
| |
Collapse
|
6
|
González-Fernández M, Perry C, Gerhards NM, Francica P, Rottenberg S. Docetaxel response in BRCA1,p53-deficient mammary tumor cells is affected by Huntingtin and BAP1. Proc Natl Acad Sci U S A 2024; 121:e2402849121. [PMID: 39705313 DOI: 10.1073/pnas.2402849121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Accepted: 10/30/2024] [Indexed: 12/22/2024] Open
Abstract
Taxanes are frequently used anticancer drugs known to kill tumor cells by inducing mitotic aberrations and segregation defects. A defining feature of specific cancers, notably triple-negative breast cancer (TNBC) and particularly those deficient in BRCA1, is chromosomal instability (CIN). Here, we focused on understanding the mechanisms of docetaxel-induced cytotoxicity, especially in the context of BRCA1-deficient TNBC. Using functional genetic screens in CIN+ cells, we identified genes that mediate docetaxel response and found an interaction between Huntingtin (HTT) and BRCA1-associated protein-1 (BAP1). We employed Brca1-/-;p53-/- mammary tumor cells, derived from genetically engineered mouse tumors that closely mimic the human disease, to investigate the role of these genes in CIN+ BRCA1-deficient cells. Specifically, we observed that loss of HTT sensitizes CIN+ BRCA1-deficient mammary tumor cells to docetaxel by shortening mitotic spindle poles and increasing spindle multipolarity. In contrast, BAP1 depletion protected cells against these spindle aberrations by restoring spindle length and enhancing mitotic clustering of the extra centrosomes. In conclusion, our findings shed light on the roles of HTT and BAP1 in controlling mitotic spindle multipolarity and centrosome clustering, specifically in the absence of BRCA1. This affects the response to microtubule-targeting agents and suggests that further studies of the interaction of these genes with the mitotic spindle may provide useful insights into how to target CIN+ cells, particularly in the challenging therapeutic landscape of BRCA1-deficient TNBC.
Collapse
Affiliation(s)
- Martín González-Fernández
- Department of Infectious Diseases and Pathobiology, Institute of Animal Pathology, Vetsuisse Faculty, University of Bern, 3012 Bern, Switzerland
- Bern Center for Precision Medicine, Department of Biomedical Research, University of Bern, 3012 Bern, Switzerland
| | - Carmen Perry
- Department of Infectious Diseases and Pathobiology, Institute of Animal Pathology, Vetsuisse Faculty, University of Bern, 3012 Bern, Switzerland
- Bern Center for Precision Medicine, Department of Biomedical Research, University of Bern, 3012 Bern, Switzerland
| | - Nora Merete Gerhards
- Department of Infectious Diseases and Pathobiology, Institute of Animal Pathology, Vetsuisse Faculty, University of Bern, 3012 Bern, Switzerland
| | - Paola Francica
- Department of Infectious Diseases and Pathobiology, Institute of Animal Pathology, Vetsuisse Faculty, University of Bern, 3012 Bern, Switzerland
- Bern Center for Precision Medicine, Department of Biomedical Research, University of Bern, 3012 Bern, Switzerland
| | - Sven Rottenberg
- Department of Infectious Diseases and Pathobiology, Institute of Animal Pathology, Vetsuisse Faculty, University of Bern, 3012 Bern, Switzerland
- Bern Center for Precision Medicine, Department of Biomedical Research, University of Bern, 3012 Bern, Switzerland
| |
Collapse
|
7
|
Volzhenin K, Bittner L, Carbone A. SENSE-PPI reconstructs interactomes within, across, and between species at the genome scale. iScience 2024; 27:110371. [PMID: 39055916 PMCID: PMC11269938 DOI: 10.1016/j.isci.2024.110371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 05/04/2024] [Accepted: 06/21/2024] [Indexed: 07/28/2024] Open
Abstract
Ab initio computational reconstructions of protein-protein interaction (PPI) networks will provide invaluable insights into cellular systems, enabling the discovery of novel molecular interactions and elucidating biological mechanisms within and between organisms. Leveraging the latest generation protein language models and recurrent neural networks, we present SENSE-PPI, a sequence-based deep learning model that efficiently reconstructs ab initio PPIs, distinguishing partners among tens of thousands of proteins and identifying specific interactions within functionally similar proteins. SENSE-PPI demonstrates high accuracy, limited training requirements, and versatility in cross-species predictions, even with non-model organisms and human-virus interactions. Its performance decreases for phylogenetically more distant model and non-model organisms, but signal alteration is very slow. In this regard, it demonstrates the important role of parameters in protein language models. SENSE-PPI is very fast and can test 10,000 proteins against themselves in a matter of hours, enabling the reconstruction of genome-wide proteomes.
Collapse
Affiliation(s)
- Konstantin Volzhenin
- Sorbonne Université, CNRS, IBPS, UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France
| | - Lucie Bittner
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Muséum national d’Histoire naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, Paris, France
- Institut Universitaire de France, Paris, France
| | - Alessandra Carbone
- Sorbonne Université, CNRS, IBPS, UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France
- Institut Universitaire de France, Paris, France
| |
Collapse
|
8
|
Lapcik P, Stacey RG, Potesil D, Kulhanek P, Foster LJ, Bouchal P. Global Interactome Mapping Reveals Pro-tumorigenic Interactions of NF-κB in Breast Cancer. Mol Cell Proteomics 2024; 23:100744. [PMID: 38417630 PMCID: PMC10988130 DOI: 10.1016/j.mcpro.2024.100744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 02/01/2024] [Accepted: 02/23/2024] [Indexed: 03/01/2024] Open
Abstract
NF-κB pathway is involved in inflammation; however, recent data shows its role also in cancer development and progression, including metastasis. To understand the role of NF-κB interactome dynamics in cancer, we study the complexity of breast cancer interactome in luminal A breast cancer model and its rearrangement associated with NF-κB modulation. Liquid chromatography-mass spectrometry measurement of 160 size-exclusion chromatography fractions identifies 5460 protein groups. Seven thousand five hundred sixty eight interactions among these proteins have been reconstructed by PrInCE algorithm, of which 2564 have been validated in independent datasets. NF-κB modulation leads to rearrangement of protein complexes involved in NF-κB signaling and immune response, cell cycle regulation, and DNA replication. Central NF-κB transcription regulator RELA co-elutes with interactors of NF-κB activator PRMT5, and these complexes are confirmed by AlphaPulldown prediction. A complementary immunoprecipitation experiment recapitulates RELA interactions with other NF-κB factors, associating NF-κB inhibition with lower binding of NF-κB activators to RELA. This study describes a network of pro-tumorigenic protein interactions and their rearrangement upon NF-κB inhibition with potential therapeutic implications in tumors with high NF-κB activity.
Collapse
Affiliation(s)
- Petr Lapcik
- Department of Biochemistry, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - R Greg Stacey
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada
| | - David Potesil
- Proteomics Core Facility, Central European Institute of Technology, Masaryk University, Brno, Czech Republic
| | - Petr Kulhanek
- National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Leonard J Foster
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada; Department of Biochemistry and Molecular Biology, University of British Columbia, Vancouver, Canada
| | - Pavel Bouchal
- Department of Biochemistry, Faculty of Science, Masaryk University, Brno, Czech Republic.
| |
Collapse
|
9
|
Jia P, Zhang F, Wu C, Li M. A comprehensive review of protein-centric predictors for biomolecular interactions: from proteins to nucleic acids and beyond. Brief Bioinform 2024; 25:bbae162. [PMID: 38739759 PMCID: PMC11089422 DOI: 10.1093/bib/bbae162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2024] [Revised: 02/17/2024] [Accepted: 03/31/2024] [Indexed: 05/16/2024] Open
Abstract
Proteins interact with diverse ligands to perform a large number of biological functions, such as gene expression and signal transduction. Accurate identification of these protein-ligand interactions is crucial to the understanding of molecular mechanisms and the development of new drugs. However, traditional biological experiments are time-consuming and expensive. With the development of high-throughput technologies, an increasing amount of protein data is available. In the past decades, many computational methods have been developed to predict protein-ligand interactions. Here, we review a comprehensive set of over 160 protein-ligand interaction predictors, which cover protein-protein, protein-nucleic acid, protein-peptide and protein-other ligands (nucleotide, heme, ion) interactions. We have carried out a comprehensive analysis of the above four types of predictors from several significant perspectives, including their inputs, feature profiles, models, availability, etc. The current methods primarily rely on protein sequences, especially utilizing evolutionary information. The significant improvement in predictions is attributed to deep learning methods. Additionally, sequence-based pretrained models and structure-based approaches are emerging as new trends.
Collapse
Affiliation(s)
- Pengzhen Jia
- School of Computer Science and Engineering, Central South University, 932 Lushan Road(S), Changsha 410083, China
| | - Fuhao Zhang
- School of Computer Science and Engineering, Central South University, 932 Lushan Road(S), Changsha 410083, China
- College of Information Engineering, Northwest A&F University, No. 3 Taicheng Road, Yangling, Shaanxi 712100, China
| | - Chaojin Wu
- School of Computer Science and Engineering, Central South University, 932 Lushan Road(S), Changsha 410083, China
| | - Min Li
- School of Computer Science and Engineering, Central South University, 932 Lushan Road(S), Changsha 410083, China
| |
Collapse
|
10
|
Shao L, Zhao Y, Heinrich M, Prieto-Garcia JM, Manzoni C. Active natural compounds perturb the melanoma risk-gene network. G3 (BETHESDA, MD.) 2024; 14:jkad274. [PMID: 38035793 PMCID: PMC10849364 DOI: 10.1093/g3journal/jkad274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 10/27/2023] [Accepted: 11/09/2023] [Indexed: 12/02/2023]
Abstract
Cutaneous melanoma is an aggressive type of skin cancer with a complex genetic landscape caused by the malignant transformation of melanocytes. This study aimed at providing an in silico network model based on the systematic profiling of the melanoma-associated genes considering germline mutations, somatic mutations, and genome-wide association study signals accounting for a total of 232 unique melanoma risk genes. A protein-protein interaction network was constructed using the melanoma risk genes as seeds and evaluated to describe the functional landscape in which the melanoma genes operate within the cellular milieu. Not only were the majority of the melanoma risk genes able to interact with each other at the protein level within the core of the network, but this showed significant enrichment for genes whose expression is altered in human melanoma specimens. Functional annotation showed the melanoma risk network to be significantly associated with processes related to DNA metabolism and telomeres, DNA damage and repair, cellular ageing, and response to radiation. We further explored whether the melanoma risk network could be used as an in silico tool to predict the efficacy of anti-melanoma phytochemicals, that are considered active molecules with potentially less systemic toxicity than classical cytotoxic drugs. A significant portion of the melanoma risk network showed differential expression when SK-MEL-28 human melanoma cells were exposed to the phytochemicals harmine and berberine chloride. This reinforced our hypothesis that the network modeling approach not only provides an alternative way to identify molecular pathways relevant to disease but it may also represent an alternative screening approach to prioritize potentially active compounds.
Collapse
Affiliation(s)
- Luying Shao
- Department of Pharmaceutical and Biological Chemistry, UCL School of Pharmacy, WC1N 1AX London, UK
| | - Yibo Zhao
- Department of Pharmacology, UCL School of Pharmacy, WC1N 1AX London, UK
| | - Michael Heinrich
- Department of Pharmaceutical and Biological Chemistry, UCL School of Pharmacy, WC1N 1AX London, UK
- Chinese Medicine Research Center, and Department of Pharmaceutical Sciences and Chinese Medicine Resources, College of Chinese Medicine, China Medical University, Taichung City 404333, Taiwan
| | - Jose M Prieto-Garcia
- School of Pharmacy and Biomolecular Sciences, Liverpool John Moores University, L3 3AF Liverpool, UK
| | - Claudia Manzoni
- Department of Pharmacology, UCL School of Pharmacy, WC1N 1AX London, UK
| |
Collapse
|
11
|
Davey R, Donahue C, Kesari A, Thakur N, Wang L, Hulsey-Stubbs S, Williams C, Kirby C, Leung D, Aryal U, Basler C, LaCount D. A protein-proximity screen reveals Ebola virus co-opts the mRNA decapping complex through the scaffold protein EDC4. RESEARCH SQUARE 2024:rs.3.rs-3838220. [PMID: 38352529 PMCID: PMC10862950 DOI: 10.21203/rs.3.rs-3838220/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/21/2024]
Abstract
The interaction of host and Ebola virus (EBOV) proteins is required for establishing infection of the cell. To identify protein binding partners, a proximity-dependent protein interaction screen was performed for six EBOV proteins. Hits were computationally mapped onto a human protein-protein interactome and then annotated with viral proteins to reveal known and previously undescribed EBOV-host protein interactions and processes. Importantly, this approach efficiently arranged proteins into functional complexes associated with single viral proteins. Focused characterization of interactions between EBOV VP35 and the mRNA decapping complex demonstrated that VP35 binds the scaffold protein EDC4 through the C-terminal subdomain, with each protein found associated in EBOV-infected cells. Mechanistically, depletion of three components of the complex each similarly inhibited viral replication by reducing early viral RNA synthesis. Overall, we demonstrate successful identification of EBOV protein interaction with entire cellular machines, providing a deeper understanding of replication mechanism for therapeutic intervention.
Collapse
|
12
|
Lucena-Padros H, Bravo-Gil N, Tous C, Rojano E, Seoane-Zonjic P, Fernández RM, Ranea JAG, Antiñolo G, Borrego S. Bioinformatics Prediction for Network-Based Integrative Multi-Omics Expression Data Analysis in Hirschsprung Disease. Biomolecules 2024; 14:164. [PMID: 38397401 PMCID: PMC10886964 DOI: 10.3390/biom14020164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 01/15/2024] [Accepted: 01/27/2024] [Indexed: 02/25/2024] Open
Abstract
Hirschsprung's disease (HSCR) is a rare developmental disorder in which enteric ganglia are missing along a portion of the intestine. HSCR has a complex inheritance, with RET as the major disease-causing gene. However, the pathogenesis of HSCR is still not completely understood. Therefore, we applied a computational approach based on multi-omics network characterization and clustering analysis for HSCR-related gene/miRNA identification and biomarker discovery. Protein-protein interaction (PPI) and miRNA-target interaction (MTI) networks were analyzed by DPClusO and BiClusO, respectively, and finally, the biomarker potential of miRNAs was computationally screened by miRNA-BD. In this study, a total of 55 significant gene-disease modules were identified, allowing us to propose 178 new HSCR candidate genes and two biological pathways. Moreover, we identified 12 key miRNAs with biomarker potential among 137 predicted HSCR-associated miRNAs. Functional analysis of new candidates showed that enrichment terms related to gene ontology (GO) and pathways were associated with HSCR. In conclusion, this approach has allowed us to decipher new clues of the etiopathogenesis of HSCR, although molecular experiments are further needed for clinical validations.
Collapse
Affiliation(s)
- Helena Lucena-Padros
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville, University Hospital Virgen del Rocío/CSIC/University of Seville, 41013 Seville, Spain
| | - Nereida Bravo-Gil
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville, University Hospital Virgen del Rocío/CSIC/University of Seville, 41013 Seville, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER), 41013 Seville, Spain
| | - Cristina Tous
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville, University Hospital Virgen del Rocío/CSIC/University of Seville, 41013 Seville, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER), 41013 Seville, Spain
| | - Elena Rojano
- Department of Molecular Biology and Biochemistry, University of Malaga, 29010 Malaga, Spain
- Biomedical Research Institute of Malaga, IBIMA, 29010 Malaga, Spain
| | - Pedro Seoane-Zonjic
- Department of Molecular Biology and Biochemistry, University of Malaga, 29010 Malaga, Spain
- Biomedical Research Institute of Malaga, IBIMA, 29010 Malaga, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER), 29071 Malaga, Spain
| | - Raquel María Fernández
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville, University Hospital Virgen del Rocío/CSIC/University of Seville, 41013 Seville, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER), 41013 Seville, Spain
| | - Juan A. G. Ranea
- Department of Molecular Biology and Biochemistry, University of Malaga, 29010 Malaga, Spain
- Biomedical Research Institute of Malaga, IBIMA, 29010 Malaga, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER), 29071 Malaga, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES), Instituto de Salud Carlos III (ISCIII), 28029 Madrid, Spain
| | - Guillermo Antiñolo
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville, University Hospital Virgen del Rocío/CSIC/University of Seville, 41013 Seville, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER), 41013 Seville, Spain
| | - Salud Borrego
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville, University Hospital Virgen del Rocío/CSIC/University of Seville, 41013 Seville, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER), 41013 Seville, Spain
| |
Collapse
|
13
|
Gkekas I, Vagiona AC, Pechlivanis N, Kastrinaki G, Pliatsika K, Iben S, Xanthopoulos K, Psomopoulos FE, Andrade-Navarro MA, Petrakis S. Intranuclear inclusions of polyQ-expanded ATXN1 sequester RNA molecules. Front Mol Neurosci 2023; 16:1280546. [PMID: 38125008 PMCID: PMC10730666 DOI: 10.3389/fnmol.2023.1280546] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Accepted: 11/16/2023] [Indexed: 12/23/2023] Open
Abstract
Spinocerebellar ataxia type 1 (SCA1) is an autosomal dominant neurodegenerative disease caused by a trinucleotide (CAG) repeat expansion in the ATXN1 gene. It is characterized by the presence of polyglutamine (polyQ) intranuclear inclusion bodies (IIBs) within affected neurons. In order to investigate the impact of polyQ IIBs in SCA1 pathogenesis, we generated a novel protein aggregation model by inducible overexpression of the mutant ATXN1(Q82) isoform in human neuroblastoma SH-SY5Y cells. Moreover, we developed a simple and reproducible protocol for the efficient isolation of insoluble IIBs. Biophysical characterization showed that polyQ IIBs are enriched in RNA molecules which were further identified by next-generation sequencing. Finally, a protein interaction network analysis indicated that sequestration of essential RNA transcripts within ATXN1(Q82) IIBs may affect the ribosome resulting in error-prone protein synthesis and global proteome instability. These findings provide novel insights into the molecular pathogenesis of SCA1, highlighting the role of polyQ IIBs and their impact on critical cellular processes.
Collapse
Affiliation(s)
- Ioannis Gkekas
- Centre for Research and Technology Hellas, Institute of Applied Biosciences, Thessaloniki, Greece
- Laboratory of Pharmacology, School of Pharmacy, Faculty of Health Sciences, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | | | - Nikolaos Pechlivanis
- Centre for Research and Technology Hellas, Institute of Applied Biosciences, Thessaloniki, Greece
| | - Georgia Kastrinaki
- Aerosol and Particle Technology Laboratory, Centre for Research and Technology Hellas, Chemical Process and Energy Resources Institute, Thessaloniki, Greece
| | - Katerina Pliatsika
- Centre for Research and Technology Hellas, Institute of Applied Biosciences, Thessaloniki, Greece
- Laboratory of Pharmacology, School of Pharmacy, Faculty of Health Sciences, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Sebastian Iben
- Department of Dermatology and Allergic Diseases, University of Ulm, Ulm, Germany
| | - Konstantinos Xanthopoulos
- Centre for Research and Technology Hellas, Institute of Applied Biosciences, Thessaloniki, Greece
- Laboratory of Pharmacology, School of Pharmacy, Faculty of Health Sciences, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Fotis E. Psomopoulos
- Centre for Research and Technology Hellas, Institute of Applied Biosciences, Thessaloniki, Greece
| | | | - Spyros Petrakis
- Centre for Research and Technology Hellas, Institute of Applied Biosciences, Thessaloniki, Greece
| |
Collapse
|
14
|
Rajarajan S, Snijesh VP, Anupama CE, Nair MG, Mavatkar AD, Naidu CM, Patil S, Nimbalkar VP, Alexander A, Pillai M, Jolly MK, Sabarinathan R, Ramesh RS, Bs S, Prabhu JS. An androgen receptor regulated gene score is associated with epithelial to mesenchymal transition features in triple negative breast cancers. Transl Oncol 2023; 37:101761. [PMID: 37603927 PMCID: PMC10465938 DOI: 10.1016/j.tranon.2023.101761] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 07/13/2023] [Accepted: 08/07/2023] [Indexed: 08/23/2023] Open
Abstract
BACKGROUND Androgen receptor (AR) is considered a marker of better prognosis in hormone receptor positive breast cancers (BC), however, its role in triple negative breast cancer (TNBC) is controversial. This may be attributed to intrinsic molecular differences or scoring methods for AR positivity. We derived AR regulated gene score and examined its utility in BC subtypes. METHODS AR regulated genes were derived by applying a bioinformatic pipeline on publicly available microarray data sets of AR+ BC cell lines and gene score was calculated as average expression of six AR regulated genes. Tumors were divided into AR high and low based on gene score and associations with clinical parameters, circulating androgens, survival and epithelial to mesenchymal transition (EMT) markers were examined, further evaluated in invitro models and public datasets. RESULTS 53% (133/249) tumors were classified as AR gene score high and were associated with significantly better clinical parameters, disease-free survival (86.13 vs 72.69 months, log rank p = 0.032) when compared to AR low tumors. 36% of TNBC (N = 66) were AR gene score high with higher expression of EMT markers (p = 0.024) and had high intratumoral levels of 5α-reductase, enzyme involved in intracrine androgen metabolism. In MDA-MB-453 treated with dihydrotestosterone, SLUG expression increased, E-cadherin decreased with increase in migration and these changes were reversed with bicalutamide. Similar results were obtained in public datasets. CONCLUSION Deciphering the role of AR in BC is difficult based on AR protein levels alone. Our results support the context dependent function of AR in driving better prognosis in ER positive tumors and EMT features in TNBC tumors.
Collapse
Affiliation(s)
- Savitha Rajarajan
- Division of Molecular Medicine, St. John's Research Institute, St. John's Medical College, Bengaluru, India; Centre for Doctoral Studies, Manipal Academy of Higher Education (MAHE), Manipal, India
| | - V P Snijesh
- Division of Molecular Medicine, St. John's Research Institute, St. John's Medical College, Bengaluru, India; Centre for Doctoral Studies, Manipal Academy of Higher Education (MAHE), Manipal, India
| | - C E Anupama
- Division of Molecular Medicine, St. John's Research Institute, St. John's Medical College, Bengaluru, India
| | - Madhumathy G Nair
- Division of Molecular Medicine, St. John's Research Institute, St. John's Medical College, Bengaluru, India
| | - Apoorva D Mavatkar
- Division of Molecular Medicine, St. John's Research Institute, St. John's Medical College, Bengaluru, India
| | - Chandrakala M Naidu
- Division of Molecular Medicine, St. John's Research Institute, St. John's Medical College, Bengaluru, India
| | - Sharada Patil
- Division of Molecular Medicine, St. John's Research Institute, St. John's Medical College, Bengaluru, India
| | - Vidya P Nimbalkar
- Division of Molecular Medicine, St. John's Research Institute, St. John's Medical College, Bengaluru, India
| | - Annie Alexander
- Division of Molecular Medicine, St. John's Research Institute, St. John's Medical College, Bengaluru, India
| | - Maalavika Pillai
- Centre for BioSystems Science and Engineering, Indian Institute of Science, Bengaluru, India
| | - Mohit Kumar Jolly
- Centre for BioSystems Science and Engineering, Indian Institute of Science, Bengaluru, India
| | | | - Rakesh S Ramesh
- Department of Surgical Oncology, St. John's Medical College, Bengaluru, India
| | - Srinath Bs
- Department of Surgery, Sri Shankara Cancer Hospital and Research Centre, Bangalore, India
| | - Jyothi S Prabhu
- Division of Molecular Medicine, St. John's Research Institute, St. John's Medical College, Bengaluru, India.
| |
Collapse
|
15
|
Jiang A, Handley RR, Lehnert K, Snell RG. From Pathogenesis to Therapeutics: A Review of 150 Years of Huntington's Disease Research. Int J Mol Sci 2023; 24:13021. [PMID: 37629202 PMCID: PMC10455900 DOI: 10.3390/ijms241613021] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 08/15/2023] [Accepted: 08/18/2023] [Indexed: 08/27/2023] Open
Abstract
Huntington's disease (HD) is a debilitating neurodegenerative genetic disorder caused by an expanded polyglutamine-coding (CAG) trinucleotide repeat in the huntingtin (HTT) gene. HD behaves as a highly penetrant dominant disorder likely acting through a toxic gain of function by the mutant huntingtin protein. Widespread cellular degeneration of the medium spiny neurons of the caudate nucleus and putamen are responsible for the onset of symptomology that encompasses motor, cognitive, and behavioural abnormalities. Over the past 150 years of HD research since George Huntington published his description, a plethora of pathogenic mechanisms have been proposed with key themes including excitotoxicity, dopaminergic imbalance, mitochondrial dysfunction, metabolic defects, disruption of proteostasis, transcriptional dysregulation, and neuroinflammation. Despite the identification and characterisation of the causative gene and mutation and significant advances in our understanding of the cellular pathology in recent years, a disease-modifying intervention has not yet been clinically approved. This review includes an overview of Huntington's disease, from its genetic aetiology to clinical presentation and its pathogenic manifestation. An updated view of molecular mechanisms and the latest therapeutic developments will also be discussed.
Collapse
Affiliation(s)
- Andrew Jiang
- Applied Translational Genetics Group, Centre for Brain Research, School of Biological Sciences, The University of Auckland, Auckland 1010, New Zealand; (R.R.H.); (K.L.); (R.G.S.)
| | | | | | | |
Collapse
|
16
|
Shimada MK. Splicing Modulators Are Involved in Human Polyglutamine Diversification via Protein Complexes Shuttling between Nucleus and Cytoplasm. Int J Mol Sci 2023; 24:ijms24119622. [PMID: 37298574 DOI: 10.3390/ijms24119622] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/24/2023] [Accepted: 05/30/2023] [Indexed: 06/12/2023] Open
Abstract
Length polymorphisms of polyglutamine (polyQs) in triplet-repeat-disease-causing genes have diversified during primate evolution despite them conferring a risk of human-specific diseases. To explain the evolutionary process of this diversification, there is a need to focus on mechanisms by which rapid evolutionary changes can occur, such as alternative splicing. Proteins that can bind polyQs are known to act as splicing factors and may provide clues about the rapid evolutionary process. PolyQs are also characterized by the formation of intrinsically disordered (ID) regions, so I hypothesized that polyQs are involved in the transportation of various molecules between the nucleus and cytoplasm to regulate mechanisms characteristic of humans such as neural development. To determine target molecules for empirical research to understand the evolutionary change, I explored protein-protein interactions (PPIs) involving the relevant proteins. This study identified pathways related to polyQ binding as hub proteins scattered across various regulatory systems, including regulation via PQBP1, VCP, or CREBBP. Nine ID hub proteins with both nuclear and cytoplasmic localization were found. Functional annotations suggested that ID proteins containing polyQs are involved in regulating transcription and ubiquitination by flexibly changing PPI formation. These findings explain the relationships among splicing complex, polyQ length variations, and modifications in neural development.
Collapse
Affiliation(s)
- Makoto K Shimada
- Center for Medical Science, Fujita Health University, Toyoake 470-1192, Japan
| |
Collapse
|
17
|
Zheng J, Yang X, Huang Y, Yang S, Wuchty S, Zhang Z. Deep learning-assisted prediction of protein-protein interactions in Arabidopsis thaliana. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2023; 114:984-994. [PMID: 36919205 DOI: 10.1111/tpj.16188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 02/20/2023] [Accepted: 03/09/2023] [Indexed: 05/27/2023]
Abstract
Currently, the experimentally identified interactome of Arabidopsis (Arabidopsis thaliana) is still far from complete, suggesting that computational prediction methods can complement experimental techniques. Motivated by the prosperity and success of deep learning algorithms and natural language processing techniques, we introduce an integrative deep learning framework, DeepAraPPI, allowing us to predict protein-protein interactions (PPIs) of Arabidopsis utilizing sequence, domain and Gene Ontology (GO) information. Our current DeepAraPPI comprises: (i) a word2vec encoding-based Siamese recurrent convolutional neural network (RCNN) model; (ii) a Domain2vec encoding-based multiple-layer perceptron (MLP) model; and (iii) a GO2vec encoding-based MLP model. Finally, DeepAraPPI combines the prediction results of the three individual predictors through a logistic regression model. Compiling high-quality positive and negative training and test samples by applying strict filtering strategies, DeepAraPPI shows superior performance compared with existing state-of-the-art Arabidopsis PPI prediction methods. DeepAraPPI also provides better cross-species predictive ability in rice (Oryza sativa) than traditional machine learning methods, although the overall performance in cross-species prediction remains to be improved. DeepAraPPI is freely accessible at http://zzdlab.com/deeparappi/. In the meantime, we have also made the source code and data sets of DeepAraPPI available at https://github.com/zjy1125/DeepAraPPI.
Collapse
Affiliation(s)
- Jingyan Zheng
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Xiaodi Yang
- Department of Hematology, Peking University First Hospital, Beijing, 100034, China
| | - Yan Huang
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Shiping Yang
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Miami, FL, 33146, USA
- Department of Biology, University of Miami, Miami, FL, 33146, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, 33136, USA
- Institute of Data Science and Computing, University of Miami, Miami, FL, 33146, USA
| | - Ziding Zhang
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| |
Collapse
|
18
|
Huang Y, Wuchty S, Zhou Y, Zhang Z. SGPPI: structure-aware prediction of protein-protein interactions in rigorous conditions with graph convolutional network. Brief Bioinform 2023; 24:6995378. [PMID: 36682013 DOI: 10.1093/bib/bbad020] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Revised: 11/17/2022] [Accepted: 01/05/2023] [Indexed: 01/23/2023] Open
Abstract
While deep learning (DL)-based models have emerged as powerful approaches to predict protein-protein interactions (PPIs), the reliance on explicit similarity measures (e.g. sequence similarity and network neighborhood) to known interacting proteins makes these methods ineffective in dealing with novel proteins. The advent of AlphaFold2 presents a significant opportunity and also a challenge to predict PPIs in a straightforward way based on monomer structures while controlling bias from protein sequences. In this work, we established Structure and Graph-based Predictions of Protein Interactions (SGPPI), a structure-based DL framework for predicting PPIs, using the graph convolutional network. In particular, SGPPI focused on protein patches on the protein-protein binding interfaces and extracted the structural, geometric and evolutionary features from the residue contact map to predict PPIs. We demonstrated that our model outperforms traditional machine learning methods and state-of-the-art DL-based methods using non-representation-bias benchmark datasets. Moreover, our model trained on human dataset can be reliably transferred to predict yeast PPIs, indicating that SGPPI can capture converging structural features of protein interactions across various species. The implementation of SGPPI is available at https://github.com/emerson106/SGPPI.
Collapse
Affiliation(s)
- Yan Huang
- State Key Laboratory of Livestock and Poultry Biotechnology Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China
- Department of Biomedical Informatics, Ministry of Education Key Laboratory of Molecular Cardiovascular Sciences, Center for Non-Coding RNA Medicine, School of Basic Medical Sciences, Peking University, Beijing 100191, China
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Coral Gables, FL 33146, USA
- Department of Biology, University of Miami, Coral Gables, FL 33146, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL 33136, USA
- Institute of Data Science and Computing, University of Miami, Coral Gables, FL 33146, USA
| | - Yuan Zhou
- Department of Biomedical Informatics, Ministry of Education Key Laboratory of Molecular Cardiovascular Sciences, Center for Non-Coding RNA Medicine, School of Basic Medical Sciences, Peking University, Beijing 100191, China
| | - Ziding Zhang
- State Key Laboratory of Livestock and Poultry Biotechnology Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| |
Collapse
|
19
|
Janacova L, Stenckova M, Lapcik P, Hrachovinova S, Bouchalova P, Potesil D, Hrstka R, Müller P, Bouchal P. Catechol-O-methyl transferase suppresses cell invasion and interplays with MET signaling in estrogen dependent breast cancer. Sci Rep 2023; 13:1285. [PMID: 36690660 PMCID: PMC9870911 DOI: 10.1038/s41598-023-28078-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Accepted: 01/12/2023] [Indexed: 01/25/2023] Open
Abstract
Catechol-O-methyl transferase (COMT) is involved in detoxification of catechol estrogens, playing cancer-protective role in cells producing or utilizing estrogen. Moreover, COMT suppressed migration potential of breast cancer (BC) cells. To delineate COMT role in metastasis of estrogen receptor (ER) dependent BC, we investigated the effect of COMT overexpression on invasion, transcriptome, proteome and interactome of MCF7 cells, a luminal A BC model, stably transduced with lentiviral vector carrying COMT gene (MCF7-COMT). 2D and 3D assays revealed that COMT overexpression associates with decreased cell invasion (p < 0.0001 for Transwell assay, p < 0.05 for spheroid formation). RNA-Seq and LC-DIA-MS/MS proteomics identified genes associated with invasion (FTO, PIR, TACSTD2, ANXA3, KRT80, S100P, PREX1, CLEC3A, LCP1) being downregulated in MCF7-COMT cells, while genes associated with less aggressive phenotype (RBPMS, ROBO2, SELENBP, EPB41L2) were upregulated both at transcript (|log2FC|> 1, adj. p < 0.05) and protein (|log2FC|> 0.58, q < 0.05) levels. Importantly, proteins driving MET signaling were less abundant in COMT overexpressing cells, and pull-down confirmed interaction between COMT and Kunitz-type protease inhibitor 2 (SPINT2), a negative regulator of MET (log2FC = 5.10, q = 1.04-7). In conclusion, COMT may act as tumor suppressor in ER dependent BC not only by detoxification of catechol estrogens but also by suppressing cell invasion and interplay with MET pathway.
Collapse
Affiliation(s)
- Lucia Janacova
- Department of Biochemistry, Faculty of Science, Masaryk University, Kamenice 5, 62500, Brno, Czech Republic
| | - Michaela Stenckova
- Research Centre for Applied Molecular Oncology, Masaryk Memorial Cancer Institute, Brno, Czech Republic
| | - Petr Lapcik
- Department of Biochemistry, Faculty of Science, Masaryk University, Kamenice 5, 62500, Brno, Czech Republic
| | - Sarka Hrachovinova
- Department of Biochemistry, Faculty of Science, Masaryk University, Kamenice 5, 62500, Brno, Czech Republic
| | - Pavla Bouchalova
- Department of Biochemistry, Faculty of Science, Masaryk University, Kamenice 5, 62500, Brno, Czech Republic
| | - David Potesil
- Proteomics Core Facility, Central European Institute for Technology, Masaryk University, Brno, Czech Republic
| | - Roman Hrstka
- Research Centre for Applied Molecular Oncology, Masaryk Memorial Cancer Institute, Brno, Czech Republic
| | - Petr Müller
- Research Centre for Applied Molecular Oncology, Masaryk Memorial Cancer Institute, Brno, Czech Republic
| | - Pavel Bouchal
- Department of Biochemistry, Faculty of Science, Masaryk University, Kamenice 5, 62500, Brno, Czech Republic.
| |
Collapse
|
20
|
Kebir S, Ullrich V, Berger P, Dobersalske C, Langer S, Rauschenbach L, Trageser D, Till A, Lorbeer FK, Wieland A, Wilhelm-Buchstab T, Ahmad A, Fröhlich H, Cima I, Prasad S, Matschke J, Jendrossek V, Remke M, Grüner BM, Roesch A, Siveke JT, Herold-Mende C, Blau T, Keyvani K, van Landeghem FK, Pietsch T, Felsberg J, Reifenberger G, Weller M, Sure U, Brüstle O, Simon M, Glas M, Scheffler B. A Sequential Targeting Strategy Interrupts AKT-Driven Subclone-Mediated Progression in Glioblastoma. Clin Cancer Res 2023; 29:488-500. [PMID: 36239995 PMCID: PMC9843437 DOI: 10.1158/1078-0432.ccr-22-0611] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 09/10/2022] [Accepted: 10/07/2022] [Indexed: 01/21/2023]
Abstract
PURPOSE Therapy resistance and fatal disease progression in glioblastoma are thought to result from the dynamics of intra-tumor heterogeneity. This study aimed at identifying and molecularly targeting tumor cells that can survive, adapt, and subclonally expand under primary therapy. EXPERIMENTAL DESIGN To identify candidate markers and to experimentally access dynamics of subclonal progression in glioblastoma, we established a discovery cohort of paired vital cell samples obtained before and after primary therapy. We further used two independent validation cohorts of paired clinical tissues to test our findings. Follow-up preclinical treatment strategies were evaluated in patient-derived xenografts. RESULTS We describe, in clinical samples, an archetype of rare ALDH1A1+ tumor cells that enrich and acquire AKT-mediated drug resistance in response to standard-of-care temozolomide (TMZ). Importantly, we observe that drug resistance of ALDH1A1+ cells is not intrinsic, but rather an adaptive mechanism emerging exclusively after TMZ treatment. In patient cells and xenograft models of disease, we recapitulate the enrichment of ALDH1A1+ cells under the influence of TMZ. We demonstrate that their subclonal progression is AKT-driven and can be interfered with by well-timed sequential rather than simultaneous antitumor combination strategy. CONCLUSIONS Drug-resistant ALDH1A1+/pAKT+ subclones accumulate in patient tissues upon adaptation to TMZ therapy. These subclones may therefore represent a dynamic target in glioblastoma. Our study proposes the combination of TMZ and AKT inhibitors in a sequential treatment schedule as a rationale for future clinical investigation.
Collapse
Affiliation(s)
- Sied Kebir
- DKFZ-Division Translational Neurooncology at the WTZ, DKTK Partner Site, University Hospital Essen, Essen, Germany
- German Cancer Consortium (DKTK)
- Division of Clinical Neurooncology, Department of Neurology, University Hospital Essen, Essen, Germany
- West German Cancer Center (WTZ), University Hospital Essen, Essen, Germany
| | - Vivien Ullrich
- DKFZ-Division Translational Neurooncology at the WTZ, DKTK Partner Site, University Hospital Essen, Essen, Germany
- German Cancer Consortium (DKTK)
- West German Cancer Center (WTZ), University Hospital Essen, Essen, Germany
| | - Pia Berger
- DKFZ-Division Translational Neurooncology at the WTZ, DKTK Partner Site, University Hospital Essen, Essen, Germany
- German Cancer Consortium (DKTK)
- West German Cancer Center (WTZ), University Hospital Essen, Essen, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Celia Dobersalske
- DKFZ-Division Translational Neurooncology at the WTZ, DKTK Partner Site, University Hospital Essen, Essen, Germany
- German Cancer Consortium (DKTK)
- West German Cancer Center (WTZ), University Hospital Essen, Essen, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Sarah Langer
- DKFZ-Division Translational Neurooncology at the WTZ, DKTK Partner Site, University Hospital Essen, Essen, Germany
- German Cancer Consortium (DKTK)
- West German Cancer Center (WTZ), University Hospital Essen, Essen, Germany
| | - Laurèl Rauschenbach
- DKFZ-Division Translational Neurooncology at the WTZ, DKTK Partner Site, University Hospital Essen, Essen, Germany
- German Cancer Consortium (DKTK)
- West German Cancer Center (WTZ), University Hospital Essen, Essen, Germany
- Department of Neurosurgery and Spine Surgery, University Hospital Essen, Essen, Germany
| | - Daniel Trageser
- Institute of Reconstructive Neurobiology, University of Bonn Medical Faculty & University Hospital Bonn, Bonn, Germany
- LIFE & BRAIN GmbH, Cellomics Unit, Bonn, Germany
| | - Andreas Till
- Institute of Reconstructive Neurobiology, University of Bonn Medical Faculty & University Hospital Bonn, Bonn, Germany
| | - Franziska K. Lorbeer
- Institute of Reconstructive Neurobiology, University of Bonn Medical Faculty & University Hospital Bonn, Bonn, Germany
| | - Anja Wieland
- Institute of Reconstructive Neurobiology, University of Bonn Medical Faculty & University Hospital Bonn, Bonn, Germany
| | | | - Ashar Ahmad
- Bonn-Aachen International Center for IT (B-IT), University of Bonn, Bonn, Germany
| | - Holger Fröhlich
- Bonn-Aachen International Center for IT (B-IT), University of Bonn, Bonn, Germany
- Department of Bioinformatics, Fraunhofer SCAI, Schloss Birlinghoven, Sankt Augustin, Germany
| | - Igor Cima
- DKFZ-Division Translational Neurooncology at the WTZ, DKTK Partner Site, University Hospital Essen, Essen, Germany
- German Cancer Consortium (DKTK)
- West German Cancer Center (WTZ), University Hospital Essen, Essen, Germany
| | - Shruthi Prasad
- DKFZ-Division Translational Neurooncology at the WTZ, DKTK Partner Site, University Hospital Essen, Essen, Germany
- German Cancer Consortium (DKTK)
- West German Cancer Center (WTZ), University Hospital Essen, Essen, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Johann Matschke
- Institute of Cell Biology (Cancer Research), University Hospital Essen, Essen, Germany
| | - Verena Jendrossek
- Institute of Cell Biology (Cancer Research), University Hospital Essen, Essen, Germany
| | - Marc Remke
- German Cancer Consortium (DKTK)
- Pediatric Neuro-Oncogenomics, University Hospital Düsseldorf, Düsseldorf, Germany
| | - Barbara M. Grüner
- German Cancer Consortium (DKTK)
- Department of Medical Oncology, West German Cancer Center, University Hospital Essen, University Duisburg-Essen, Essen, Germany
| | - Alexander Roesch
- German Cancer Consortium (DKTK)
- West German Cancer Center (WTZ), University Hospital Essen, Essen, Germany
- Department of Dermatology, University Hospital Essen, Essen, Germany
| | - Jens T. Siveke
- German Cancer Consortium (DKTK)
- West German Cancer Center (WTZ), University Hospital Essen, Essen, Germany
- Bridge Institute of Experimental Tumor Therapy, West German Cancer Center, University Hospital Essen, Essen, Germany
- Division of Solid Tumor Translational Oncology, German Cancer Consortium (DKTK, Partner Site Essen) and German Cancer Research Center, DKFZ, Heidelberg, Germany
| | - Christel Herold-Mende
- Division of Neurosurgical Research, Department of Neurosurgery, University of Heidelberg, Heidelberg, Germany
| | - Tobias Blau
- Institute of Neuropathology, University of Duisburg-Essen, Essen, Germany
| | - Kathy Keyvani
- Institute of Neuropathology, University of Duisburg-Essen, Essen, Germany
| | | | - Torsten Pietsch
- Institute of Neuropathology, University of Bonn, Bonn, Germany
| | - Jörg Felsberg
- Institute of Neuropathology, Heinrich Heine University, Düsseldorf, Germany
| | - Guido Reifenberger
- German Cancer Consortium (DKTK)
- Institute of Neuropathology, Heinrich Heine University, Düsseldorf, Germany
| | - Michael Weller
- Department of Neurology, University Hospital and University of Zurich, Zurich, Switzerland
| | - Ulrich Sure
- German Cancer Consortium (DKTK)
- West German Cancer Center (WTZ), University Hospital Essen, Essen, Germany
- Department of Neurosurgery and Spine Surgery, University Hospital Essen, Essen, Germany
| | - Oliver Brüstle
- Institute of Reconstructive Neurobiology, University of Bonn Medical Faculty & University Hospital Bonn, Bonn, Germany
- LIFE & BRAIN GmbH, Cellomics Unit, Bonn, Germany
| | - Matthias Simon
- Department of Neurosurgery, University of Bonn Medical Center, Bonn, Germany
- Department of Neurosurgery, Bethel Clinic, University of Bielefeld Medical Center, OWL, Bielefeld, Germany
| | - Martin Glas
- DKFZ-Division Translational Neurooncology at the WTZ, DKTK Partner Site, University Hospital Essen, Essen, Germany
- German Cancer Consortium (DKTK)
- Division of Clinical Neurooncology, Department of Neurology, University Hospital Essen, Essen, Germany
- West German Cancer Center (WTZ), University Hospital Essen, Essen, Germany
| | - Björn Scheffler
- DKFZ-Division Translational Neurooncology at the WTZ, DKTK Partner Site, University Hospital Essen, Essen, Germany
- German Cancer Consortium (DKTK)
- West German Cancer Center (WTZ), University Hospital Essen, Essen, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
- Center of Medical Biotechnology (ZMB), University Duisburg-Essen, Essen, Germany
- Corresponding Author: Björn Scheffler, Professor for Translational Oncology, DKFZ-Division of Translational Neurooncology at the West German Cancer Center (WTZ), DKTK Partner Site, University Hospital Essen, University Duisburg-Essen, Hufelandstraße 55, WTZ-F, UG 01.041, Essen D-45147, Germany. Phone: 49 (0)201-723-8130; Fax: 49 (0)201-723-6752; E-mail:
| |
Collapse
|
21
|
Vora DS, Kalakoti Y, Sundar D. Computational Methods and Deep Learning for Elucidating Protein Interaction Networks. Methods Mol Biol 2023; 2553:285-323. [PMID: 36227550 DOI: 10.1007/978-1-0716-2617-7_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Protein interactions play a critical role in all biological processes, but experimental identification of protein interactions is a time- and resource-intensive process. The advances in next-generation sequencing and multi-omics technologies have greatly benefited large-scale predictions of protein interactions using machine learning methods. A wide range of tools have been developed to predict protein-protein, protein-nucleic acid, and protein-drug interactions. Here, we discuss the applications, methods, and challenges faced when employing the various prediction methods. We also briefly describe ways to overcome the challenges and prospective future developments in the field of protein interaction biology.
Collapse
Affiliation(s)
- Dhvani Sandip Vora
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, Hauz Khas, New Delhi, India
| | - Yogesh Kalakoti
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, Hauz Khas, New Delhi, India
| | - Durai Sundar
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, Hauz Khas, New Delhi, India.
- School of Artificial Intelligence, Indian Institute of Technology Delhi, Hauz Khas, New Delhi, India.
| |
Collapse
|
22
|
Kimothi D, Biyani P, Hogan JM, Davis MJ. Sequence Representations and Their Utility for Predicting Protein-Protein Interactions. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:646-657. [PMID: 34941517 DOI: 10.1109/tcbb.2021.3137325] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Protein-Protein Interactions (PPIs) are a crucial mechanism underpinning the function of the cell. So far, a wide range of machine-learning based methods have been proposed for predicting these relationships. Their success is heavily dependent on the construction of the underlying feature vectors, with most using a set of physico-chemical properties derived from the sequence. Few work directly with the sequence itself. In this paper, we explore the utility of sequence embeddings for predicting protein-protein interactions. We construct a protein pair feature vector by concatenating the embeddings of their constituent sequence. These feature vectors are then used as input to a binary classifier to make predictions. To learn sequence embeddings, we use two established Word2Vec based methods - Seq2Vec and BioVec - and we also introduce a novel feature construction method called SuperVecNW. The embeddings generated through SuperVecNW capture some network information in addition to the contextual information present in the sequences. We test the efficacy of our proposed approach on human and yeast PPI datasets and on three well-known networks: CD9, the Ras-Raf-Mek-Erk-Elk-Srf pathway, and a Wnt-related network. We demonstrate that low dimensional sequence embeddings provide better results than most alternative representations based on physico-chemical properties while offering a far simple approach to feature vector construction.
Collapse
|
23
|
Soleymani F, Paquet E, Viktor H, Michalowski W, Spinello D. Protein-protein interaction prediction with deep learning: A comprehensive review. Comput Struct Biotechnol J 2022; 20:5316-5341. [PMID: 36212542 PMCID: PMC9520216 DOI: 10.1016/j.csbj.2022.08.070] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 08/29/2022] [Accepted: 08/30/2022] [Indexed: 11/15/2022] Open
Abstract
Most proteins perform their biological function by interacting with themselves or other molecules. Thus, one may obtain biological insights into protein functions, disease prevalence, and therapy development by identifying protein-protein interactions (PPI). However, finding the interacting and non-interacting protein pairs through experimental approaches is labour-intensive and time-consuming, owing to the variety of proteins. Hence, protein-protein interaction and protein-ligand binding problems have drawn attention in the fields of bioinformatics and computer-aided drug discovery. Deep learning methods paved the way for scientists to predict the 3-D structure of proteins from genomes, predict the functions and attributes of a protein, and modify and design new proteins to provide desired functions. This review focuses on recent deep learning methods applied to problems including predicting protein functions, protein-protein interaction and their sites, protein-ligand binding, and protein design.
Collapse
Affiliation(s)
- Farzan Soleymani
- Department of Mechanical Engineering, University of Ottawa, Ottawa, ON, Canada
| | - Eric Paquet
- National Research Council, 1200 Montreal Road, Ottawa, ON K1A 0R6, Canada
| | - Herna Viktor
- School of Electrical Engineering and Computer Science, University of Ottawa, ON, Canada
| | | | - Davide Spinello
- Department of Mechanical Engineering, University of Ottawa, Ottawa, ON, Canada
| |
Collapse
|
24
|
Weng Z, Yue Z, Zhu Y, Chen JY. DEMA: a distance-bounded energy-field minimization algorithm to model and layout biomolecular networks with quantitative features. Bioinformatics 2022; 38:i359-i368. [PMID: 35758816 PMCID: PMC9235497 DOI: 10.1093/bioinformatics/btac261] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
SUMMARY In biology, graph layout algorithms can reveal comprehensive biological contexts by visually positioning graph nodes in their relevant neighborhoods. A layout software algorithm/engine commonly takes a set of nodes and edges and produces layout coordinates of nodes according to edge constraints. However, current layout engines normally do not consider node, edge or node-set properties during layout and only curate these properties after the layout is created. Here, we propose a new layout algorithm, distance-bounded energy-field minimization algorithm (DEMA), to natively consider various biological factors, i.e., the strength of gene-to-gene association, the gene's relative contribution weight and the functional groups of genes, to enhance the interpretation of complex network graphs. In DEMA, we introduce a parameterized energy model where nodes are repelled by the network topology and attracted by a few biological factors, i.e., interaction coefficient, effect coefficient and fold change of gene expression. We generalize these factors as gene weights, protein-protein interaction weights, gene-to-gene correlations and the gene set annotations-four parameterized functional properties used in DEMA. Moreover, DEMA considers further attraction/repulsion/grouping coefficient to enable different preferences in generating network views. Applying DEMA, we performed two case studies using genetic data in autism spectrum disorder and Alzheimer's disease, respectively, for gene candidate discovery. Furthermore, we implement our algorithm as a plugin to Cytoscape, an open-source software platform for visualizing networks; hence, it is convenient. Our software and demo can be freely accessed at http://discovery.informatics.uab.edu/dema. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Zhenyu Weng
- Communication and Information Security Lab, Institute of Big Data Technologies, Shenzhen Graduate School, Peking University, Shenzhen 518055, China
| | - Zongliang Yue
- Informatics Institute, School of Medicine, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | - Yuesheng Zhu
- Communication and Information Security Lab, Institute of Big Data Technologies, Shenzhen Graduate School, Peking University, Shenzhen 518055, China
| | - Jake Yue Chen
- Informatics Institute, School of Medicine, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| |
Collapse
|
25
|
Hephzibah Cathryn R, Udhaya Kumar S, Younes S, Zayed H, George Priya Doss C. A review of bioinformatics tools and web servers in different microarray platforms used in cancer research. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2022; 131:85-164. [PMID: 35871897 DOI: 10.1016/bs.apcsb.2022.05.002] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Over the past decade, conventional lab work strategies have gradually shifted from being limited to a laboratory setting towards a bioinformatics era to help manage and process the vast amounts of data generated by omics technologies. The present work outlines the latest contributions of bioinformatics in analyzing microarray data and their application to cancer. We dissect different microarray platforms and their use in gene expression in cancer models. We highlight how computational advances empowered the microarray technology in gene expression analysis. The study on protein-protein interaction databases classified into primary, derived, meta-database, and prediction databases describes the strategies to curate and predict novel interaction networks in silico. In addition, we summarize the areas of bioinformatics where neural graph networks are currently being used, such as protein functions, protein interaction prediction, and in silico drug discovery and development. We also discuss the role of deep learning as a potential tool in the prognosis, diagnosis, and treatment of cancer. Integrating these resources efficiently, practically, and ethically is likely to be the most challenging task for the healthcare industry over the next decade; however, we believe that it is achievable in the long term.
Collapse
Affiliation(s)
- R Hephzibah Cathryn
- Laboratory of Integrative Genomics, Department of Integrative Biology, School of Biosciences and Technology, Vellore Institute of Technology, Vellore, India
| | - S Udhaya Kumar
- Laboratory of Integrative Genomics, Department of Integrative Biology, School of Biosciences and Technology, Vellore Institute of Technology, Vellore, India
| | - Salma Younes
- Department of Biomedical Sciences, College of Health and Sciences, Qatar University, QU Health, Doha, Qatar
| | - Hatem Zayed
- Department of Biomedical Sciences, College of Health and Sciences, Qatar University, QU Health, Doha, Qatar
| | - C George Priya Doss
- Laboratory of Integrative Genomics, Department of Integrative Biology, School of Biosciences and Technology, Vellore Institute of Technology, Vellore, India.
| |
Collapse
|
26
|
YENMİŞ G, BEŞLİ N. In vitro ve in silico analizi ile metforminin meme tümörü hücrelerinde protein profili üzerindeki etkinliği. EGE TIP DERGISI 2022. [DOI: 10.19161/etd.1126777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Aim: This study aimed to uncover the varieties in protein profiles of Met in breast tumor (BT) cells by assessment of in vitro and in silico analysis.
Materials and Methods: Here, the cells obtained from mastectomy patients were cultured, the effective Met-dose was determined as 25 mM through cell viability and BrdU tests. Protein identification in the breast tumor cells was implemented by employing LC-MS/MS technology.
Results: The expression of SSR3, THAP3, FTH1, NEFM, ANP32A, ANP32B, KRT7 proteins was significantly decreased whereas the GARS protein increased in the 25 mM Met group compared to the Non-Met (0 mM) control group. In silico analysis, we analyzed the probable interactions of all these proteins with each other and other proteins, to evaluate the analysis of the larger protein network, and which metabolic pathway proteins are involved in.
Conclusion: The stated proteomics analysis in our study proposes a better understanding of the prognosis of breast cancer and future studies to investigate the effect of metformin in this field on proteomic pathways in other sorts of cancer.
Collapse
Affiliation(s)
- Güven YENMİŞ
- Department of Medical Biology, Faculty of Medicine, Biruni University, Istanbul, Turkiye
| | - Nail BEŞLİ
- Department of Medical Biology, Faculty of Medicine, University of Health Sciences, Istanbul, Turkiye
| |
Collapse
|
27
|
In silico Methods for Identification of Potential Therapeutic Targets. Interdiscip Sci 2022; 14:285-310. [PMID: 34826045 PMCID: PMC8616973 DOI: 10.1007/s12539-021-00491-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Revised: 10/19/2021] [Accepted: 11/01/2021] [Indexed: 11/01/2022]
Abstract
AbstractAt the initial stage of drug discovery, identifying novel targets with maximal efficacy and minimal side effects can improve the success rate and portfolio value of drug discovery projects while simultaneously reducing cycle time and cost. However, harnessing the full potential of big data to narrow the range of plausible targets through existing computational methods remains a key issue in this field. This paper reviews two categories of in silico methods—comparative genomics and network-based methods—for finding potential therapeutic targets among cellular functions based on understanding their related biological processes. In addition to describing the principles, databases, software, and applications, we discuss some recent studies and prospects of the methods. While comparative genomics is mostly applied to infectious diseases, network-based methods can be applied to infectious and non-infectious diseases. Nonetheless, the methods often complement each other in their advantages and disadvantages. The information reported here guides toward improving the application of big data-driven computational methods for therapeutic target discovery.
Graphical abstract
Collapse
|
28
|
Vagiona AC, Mier P, Petrakis S, Andrade-Navarro MA. Analysis of Huntington's Disease Modifiers Using the Hyperbolic Mapping of the Protein Interaction Network. Int J Mol Sci 2022; 23:5853. [PMID: 35628660 PMCID: PMC9144261 DOI: 10.3390/ijms23105853] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 05/19/2022] [Accepted: 05/19/2022] [Indexed: 02/05/2023] Open
Abstract
Huntington's disease (HD) is caused by the production of a mutant huntingtin (HTT) with an abnormally long poly-glutamine (polyQ) tract, forming aggregates and inclusions in neurons. Previous work by us and others has shown that an increase or decrease in polyQ-triggered aggregates can be passive simply due to the interaction of proteins with the aggregates. To search for proteins with active (functional) effects, which might be more effective in finding therapies and mechanisms of HD, we selected among the proteins that interact with HTT a total of 49 pairs of proteins that, while being paralogous to each other (and thus expected to have similar passive interaction with HTT), are located in different regions of the protein interaction network (suggesting participation in different pathways or complexes). Three of these 49 pairs contained members with opposite effects on HD, according to the literature. The negative members of the three pairs, MID1, IKBKG, and IKBKB, interact with PPP2CA and TUBB, which are known negative factors in HD, as well as with HSP90AA1 and RPS3. The positive members of the three pairs interact with HSPA9. Our results provide potential HD modifiers of functional relevance and reveal the dynamic aspect of paralog evolution within the interaction network.
Collapse
Affiliation(s)
- Aimilia-Christina Vagiona
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany; (A.-C.V.); (P.M.)
| | - Pablo Mier
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany; (A.-C.V.); (P.M.)
| | - Spyros Petrakis
- Institute of Applied Biosciences/Centre for Research and Technology Hellas, 57001 Thessaloniki, Greece;
| | - Miguel A. Andrade-Navarro
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany; (A.-C.V.); (P.M.)
| |
Collapse
|
29
|
Ren P, Yang X, Wang T, Hou Y, Zhang Z. Proteome-wide prediction and analysis of the Cryptosporidium parvum protein-protein interaction network through integrative methods. Comput Struct Biotechnol J 2022; 20:2322-2331. [PMID: 35615014 PMCID: PMC9120227 DOI: 10.1016/j.csbj.2022.05.017] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 05/08/2022] [Accepted: 05/09/2022] [Indexed: 11/03/2022] Open
Abstract
By combining a sequence embedding technique (i.e., Doc2Vec) and a di-peptide composition representation to convert protein sequences into feature vectors, we proposed an RF classifier trained on the Plasmodium falciparum dataset for predicting Cryptosporidium parvum PPIs. A high-confidence Cryptosporidium parvum PPI network was identified by conjoining interolog mapping, domain-domain interaction-based inference, and the RF classifier. Some detected hub proteins and functional modules provided clues for an in-depth biological understanding of Cryptosporidium parvum.
As one of the most studied Apicomplexan parasite Cryptosporidium, Cryptosporidium parvum (C. parvum) causes worldwide serious diarrhea disease cryptosporidiosis, which can be deadly to immunodeficiency individuals, newly born children, and animals. Proteome-wide identification of protein–protein interactions (PPIs) has proven valuable in the systematic understanding of the genome-phenome relationship. However, the PPIs of C. parvum are largely unknown because of the limited experimental studies carried out. Therefore, we took full advantage of three bioinformatics methods, i.e., interolog mapping (IM), domain-domain interaction (DDI)-based inference, and machine learning (ML) method, to jointly predict PPIs of C. parvum. Due to the lack of experimental PPIs of C. parvum, we used the PPI data of Plasmodium falciparum (P. falciparum), which owned the largest number of PPIs in Apicomplexa, to train an ML model to infer C. parvum PPIs. We utilized consistent results of these three methods as the predicted high-confidence PPI network, which contains 4,578 PPIs covering 554 proteins. To further explore the biological significance of the constructed PPI network, we also conducted essential network and protein functional analysis, mainly focusing on hub proteins and functional modules. We anticipate the constructed PPI network can become an important data resource to accelerate the functional genomics studies of C. parvum as well as offer new hints to the target discovery in developing drugs/vaccines.
Collapse
|
30
|
Chen F, Ni C, Wang X, Cheng R, Pan C, Wang Y, Liang J, Zhang J, Cheng J, Chin YE, Zhou Y, Wang Z, Guo Y, Chen S, Htun S, Mathes EF, de Alba Campomanes AG, Slavotinek AM, Zhang S, Li M, Yao Z. S1P defects cause a new entity of cataract, alopecia, oral mucosal disorder, and psoriasis-like syndrome. EMBO Mol Med 2022; 14:e14904. [PMID: 35362222 PMCID: PMC9081911 DOI: 10.15252/emmm.202114904] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Revised: 02/28/2022] [Accepted: 03/08/2022] [Indexed: 11/17/2022] Open
Abstract
In this report, we discovered a new entity named cataract, alopecia, oral mucosal disorder, and psoriasis‐like (CAOP) syndrome in two unrelated and ethnically diverse patients. Furthermore, patient 1 failed to respond to regular treatment. We found that CAOP syndrome was caused by an autosomal recessive defect in the mitochondrial membrane‐bound transcription factor peptidase/site‐1 protease (MBTPS1, S1P). Mitochondrial abnormalities were observed in patient 1 with CAOP syndrome. Furthermore, we found that S1P is a novel mitochondrial protein that forms a trimeric complex with ETFA/ETFB. S1P enhances ETFA/ETFB flavination and maintains its stability. Patient S1P variants destabilize ETFA/ETFB, impair mitochondrial respiration, decrease fatty acid β‐oxidation activity, and shift mitochondrial oxidative phosphorylation (OXPHOS) to glycolysis. Mitochondrial dysfunction and inflammatory lesions in patient 1 were significantly ameliorated by riboflavin supplementation, which restored the stability of ETFA/ETFB. Our study discovered that mutations in MBTPS1 resulted in a new entity of CAOP syndrome and elucidated the mechanism of the mutations in the new disease.
Collapse
Affiliation(s)
- Fuying Chen
- Department of Dermatology, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China.,Institute of Dermatology, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Cheng Ni
- Department of Dermatology, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China.,Institute of Dermatology, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Xiaoxiao Wang
- Department of Dermatology, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China.,Institute of Dermatology, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Ruhong Cheng
- Department of Dermatology, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China.,Institute of Dermatology, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Chaolan Pan
- Department of Dermatology, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China.,Institute of Dermatology, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Yumeng Wang
- Department of Dermatology, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China.,Institute of Dermatology, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Jianying Liang
- Department of Dermatology, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Jia Zhang
- Department of Dermatology, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Jinke Cheng
- Shanghai Key Laboratory for Tumor Microenvironment and Inflammation, Department of Biochemistry and Molecular Cell Biology, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Y Eugene Chin
- Instituteof Health Sciences, Chinese Academy of Sciences, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Yi Zhou
- Department of gastroenterology, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Zhen Wang
- Department of Dermatology, Children's Hospital of Shanghai Jiaotong University, Shanghai, China
| | - Yiran Guo
- Center for Data Driven Discovery in Biomedicine, Children's Hospital of Philadelphia, PA, USA
| | - She Chen
- NHC Key Laboratory of Glycoconjugate Research, Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fudan University, Shanghai, China
| | - Stephanie Htun
- Division of Genetics, Department of Pediatrics, University of California San Francisco, San Francisco, CA, USA
| | - Erin F Mathes
- Departments of Dermatology and Pediatrics, University California, San Francisco, CA, USA
| | | | - Anne M Slavotinek
- Division of Genetics, Department of Pediatrics, University of California San Francisco, San Francisco, CA, USA
| | - Si Zhang
- NHC Key Laboratory of Glycoconjugate Research, Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fudan University, Shanghai, China
| | - Ming Li
- Department of Dermatology, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China.,Institute of Dermatology, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Zhirong Yao
- Department of Dermatology, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China.,Institute of Dermatology, Shanghai Jiaotong University School of Medicine, Shanghai, China
| |
Collapse
|
31
|
Casadio R, Martelli PL, Savojardo C. Machine learning solutions for predicting protein–protein interactions. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Rita Casadio
- Biocomputing Group University of Bologna Bologna Italy
| | | | | |
Collapse
|
32
|
Andrades R, Recamonde-Mendoza M. Machine learning methods for prediction of cancer driver genes: a survey paper. Brief Bioinform 2022; 23:6551145. [PMID: 35323900 DOI: 10.1093/bib/bbac062] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Revised: 02/06/2022] [Accepted: 02/08/2022] [Indexed: 12/21/2022] Open
Abstract
Identifying the genes and mutations that drive the emergence of tumors is a critical step to improving our understanding of cancer and identifying new directions for disease diagnosis and treatment. Despite the large volume of genomics data, the precise detection of driver mutations and their carrying genes, known as cancer driver genes, from the millions of possible somatic mutations remains a challenge. Computational methods play an increasingly important role in discovering genomic patterns associated with cancer drivers and developing predictive models to identify these elements. Machine learning (ML), including deep learning, has been the engine behind many of these efforts and provides excellent opportunities for tackling remaining gaps in the field. Thus, this survey aims to perform a comprehensive analysis of ML-based computational approaches to identify cancer driver mutations and genes, providing an integrated, panoramic view of the broad data and algorithmic landscape within this scientific problem. We discuss how the interactions among data types and ML algorithms have been explored in previous solutions and outline current analytical limitations that deserve further attention from the scientific community. We hope that by helping readers become more familiar with significant developments in the field brought by ML, we may inspire new researchers to address open problems and advance our knowledge towards cancer driver discovery.
Collapse
Affiliation(s)
- Renan Andrades
- Institute of Informatics, Universidade Federal do Rio Grande do Sul, Porto Alegre/RS, Brazil.,Bioinformatics Core, Hospital de Clínicas de Porto Alegre, Porto Alegre/RS, Brazil
| | - Mariana Recamonde-Mendoza
- Institute of Informatics, Universidade Federal do Rio Grande do Sul, Porto Alegre/RS, Brazil.,Bioinformatics Core, Hospital de Clínicas de Porto Alegre, Porto Alegre/RS, Brazil
| |
Collapse
|
33
|
Bima AIH, Elsamanoudy AZ, Albaqami WF, Khan Z, Parambath SV, Al-Rayes N, Kaipa PR, Elango R, Banaganapalli B, Shaik NA. Integrative system biology and mathematical modeling of genetic networks identifies shared biomarkers for obesity and diabetes. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2022; 19:2310-2329. [PMID: 35240786 DOI: 10.3934/mbe.2022107] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Obesity and type 2 and diabetes mellitus (T2D) are two dual epidemics whose shared genetic pathological mechanisms are still far from being fully understood. Therefore, this study is aimed at discovering key genes, molecular mechanisms, and new drug targets for obesity and T2D by analyzing the genome wide gene expression data with different computational biology approaches. In this study, the RNA-sequencing data of isolated primary human adipocytes from individuals who are lean, obese, and T2D was analyzed by an integrated framework consisting of gene expression, protein interaction network (PIN), tissue specificity, and druggability approaches. Our findings show a total of 1932 unique differentially expressed genes (DEGs) across the diabetes versus obese group comparison (p≤0.05). The PIN analysis of these 1932 DEGs identified 190 high centrality network (HCN) genes, which were annotated against 3367 GO terms and functional pathways, like response to insulin signaling, phosphorylation, lipid metabolism, glucose metabolism, etc. (p≤0.05). By applying additional PIN and topological parameters to 190 HCN genes, we further mapped 25 high confidence genes, functionally connected with diabetes and obesity traits. Interestingly, ERBB2, FN1, FYN, HSPA1A, HBA1, and ITGB1 genes were found to be tractable by small chemicals, antibodies, and/or enzyme molecules. In conclusion, our study highlights the potential of computational biology methods in correlating expression data to topological parameters, functional relationships, and druggability characteristics of the candidate genes involved in complex metabolic disorders with a common etiological basis.
Collapse
Affiliation(s)
- Abdulhadi Ibrahim H Bima
- Department of Clinical Biochemistry, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Ayman Zaky Elsamanoudy
- Department of Clinical Biochemistry, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
- Department of Medical Biochemistry and Molecular Biology, Faculty of Medicine, Mansoura University, Mansoura, Egypt
| | - Walaa F Albaqami
- Department of Science, Prince Sultan Military College of Health Sciences, Dhahran, Saudi Arabia
| | - Zeenath Khan
- Department of Science, Prince Sultan Military College of Health Sciences, Dhahran, Saudi Arabia
| | | | - Nuha Al-Rayes
- Department of Medical Laboratory Technology, Faculty of Applied Medical Sciences, King Abdulaziz University, Jeddah, Saudi Arabia
- Princess Al-Jawhara Al-Brahim Center of Excellence in Research of Hereditary Disorders, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Prabhakar Rao Kaipa
- Department of Genetics, College of Science, Osmania University, Hyderabad, India
| | - Ramu Elango
- Princess Al-Jawhara Al-Brahim Center of Excellence in Research of Hereditary Disorders, King Abdulaziz University, Jeddah, Saudi Arabia
- Department of Genetic Medicine, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Babajan Banaganapalli
- Princess Al-Jawhara Al-Brahim Center of Excellence in Research of Hereditary Disorders, King Abdulaziz University, Jeddah, Saudi Arabia
- Department of Genetic Medicine, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Noor A Shaik
- Princess Al-Jawhara Al-Brahim Center of Excellence in Research of Hereditary Disorders, King Abdulaziz University, Jeddah, Saudi Arabia
- Department of Genetic Medicine, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
| |
Collapse
|
34
|
From complete cross-docking to partners identification and binding sites predictions. PLoS Comput Biol 2022; 18:e1009825. [PMID: 35089918 PMCID: PMC8827487 DOI: 10.1371/journal.pcbi.1009825] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2021] [Revised: 02/09/2022] [Accepted: 01/11/2022] [Indexed: 11/19/2022] Open
Abstract
Proteins ensure their biological functions by interacting with each other. Hence, characterising protein interactions is fundamental for our understanding of the cellular machinery, and for improving medicine and bioengineering. Over the past years, a large body of experimental data has been accumulated on who interacts with whom and in what manner. However, these data are highly heterogeneous and sometimes contradictory, noisy, and biased. Ab initio methods provide a means to a "blind" protein-protein interaction network reconstruction. Here, we report on a molecular cross-docking-based approach for the identification of protein partners. The docking algorithm uses a coarse-grained representation of the protein structures and treats them as rigid bodies. We applied the approach to a few hundred of proteins, in the unbound conformations, and we systematically investigated the influence of several key ingredients, such as the size and quality of the interfaces, and the scoring function. We achieved some significant improvement compared to previous works, and a very high discriminative power on some specific functional classes. We provide a readout of the contributions of shape and physico-chemical complementarity, interface matching, and specificity, in the predictions. In addition, we assessed the ability of the approach to account for protein surface multiple usages, and we compared it with a sequence-based deep learning method. This work may contribute to guiding the exploitation of the large amounts of protein structural models now available toward the discovery of unexpected partners and their complex structure characterisation.
Collapse
|
35
|
Hu X, Feng C, Ling T, Chen M. Deep learning frameworks for protein–protein interaction prediction. Comput Struct Biotechnol J 2022; 20:3223-3233. [PMID: 35832624 PMCID: PMC9249595 DOI: 10.1016/j.csbj.2022.06.025] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 05/27/2022] [Accepted: 06/12/2022] [Indexed: 11/26/2022] Open
|
36
|
Paul M, Anand A. A New Family of Similarity Measures for Scoring Confidence of Protein Interactions Using Gene Ontology. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:19-30. [PMID: 34029194 DOI: 10.1109/tcbb.2021.3083150] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
The large-scale protein-protein interaction (PPI) data has the potential to play a significant role in the endeavor of understanding cellular processes. However, the presence of a considerable fraction of false positives is a bottleneck in realizing this potential. There have been continuous efforts to utilize complementary resources for scoring confidence of PPIs in a manner that false positive interactions get a low confidence score. Gene Ontology (GO), a taxonomy of biological terms to represent the properties of gene products and their relations, has been widely used for this purpose. We utilize GO to introduce a new set of specificity measures: Relative Depth Specificity (RDS), Relative Node-based Specificity (RNS), and Relative Edge-based Specificity (RES), leading to a new family of similarity measures. We use these similarity measures to obtain a confidence score for each PPI. We evaluate the new measures using four different benchmarks. We show that all the three measures are quite effective. Notably, RNS and RES more effectively distinguish true PPIs from false positives than the existing alternatives. RES also shows a robust set-discriminating power and can be useful for protein functional clustering as well.
Collapse
|
37
|
Stivala A, Lomi A. Testing biological network motif significance with exponential random graph models. APPLIED NETWORK SCIENCE 2021; 6:91. [PMID: 34841042 PMCID: PMC8608783 DOI: 10.1007/s41109-021-00434-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 11/08/2021] [Indexed: 06/13/2023]
Abstract
UNLABELLED Analysis of the structure of biological networks often uses statistical tests to establish the over-representation of motifs, which are thought to be important building blocks of such networks, related to their biological functions. However, there is disagreement as to the statistical significance of these motifs, and there are potential problems with standard methods for estimating this significance. Exponential random graph models (ERGMs) are a class of statistical model that can overcome some of the shortcomings of commonly used methods for testing the statistical significance of motifs. ERGMs were first introduced into the bioinformatics literature over 10 years ago but have had limited application to biological networks, possibly due to the practical difficulty of estimating model parameters. Advances in estimation algorithms now afford analysis of much larger networks in practical time. We illustrate the application of ERGM to both an undirected protein-protein interaction (PPI) network and directed gene regulatory networks. ERGM models indicate over-representation of triangles in the PPI network, and confirm results from previous research as to over-representation of transitive triangles (feed-forward loop) in an E. coli and a yeast regulatory network. We also confirm, using ERGMs, previous research showing that under-representation of the cyclic triangle (feedback loop) can be explained as a consequence of other topological features. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1007/s41109-021-00434-y.
Collapse
Affiliation(s)
- Alex Stivala
- Institute of Computational Science, Università della Svizzera italiana, Via Giuseppe Buffi 13, 6900 Lugano, Switzerland
| | - Alessandro Lomi
- Institute of Computational Science, Università della Svizzera italiana, Via Giuseppe Buffi 13, 6900 Lugano, Switzerland
- The University of Exeter Business School, Rennes Drive, Exeter, EX4 4PU UK
| |
Collapse
|
38
|
Venkatraman DL, Pulimamidi D, Shukla HG, Hegde SR. Tumor relevant protein functional interactions identified using bipartite graph analyses. Sci Rep 2021; 11:21530. [PMID: 34728699 PMCID: PMC8563864 DOI: 10.1038/s41598-021-00879-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Accepted: 09/30/2021] [Indexed: 12/02/2022] Open
Abstract
An increased surge of -omics data for the diseases such as cancer allows for deriving insights into the affiliated protein interactions. We used bipartite network principles to build protein functional associations of the differentially regulated genes in 18 cancer types. This approach allowed us to combine expression data to functional associations in many cancers simultaneously. Further, graph centrality measures suggested the importance of upregulated genes such as BIRC5, UBE2C, BUB1B, KIF20A and PTH1R in cancer. Pathway analysis of the high centrality network nodes suggested the importance of the upregulation of cell cycle and replication associated proteins in cancer. Some of the downregulated high centrality proteins include actins, myosins and ATPase subunits. Among the transcription factors, mini-chromosome maintenance proteins (MCMs) and E2F family proteins appeared prominently in regulating many differentially regulated genes. The projected unipartite networks of the up and downregulated genes were comprised of 37,411 and 41,756 interactions, respectively. The conclusions obtained by collating these interactions revealed pan-cancer as well as subtype specific protein complexes and clusters. Therefore, we demonstrate that incorporating expression data from multiple cancers into bipartite graphs validates existing cancer associated mechanisms as well as directs to novel interactions and pathways.
Collapse
Affiliation(s)
| | - Deepshika Pulimamidi
- Institute of Bioinformatics and Applied Biotechnology (IBAB), Bengaluru, 560 100, India
| | - Harsh G Shukla
- Institute of Bioinformatics and Applied Biotechnology (IBAB), Bengaluru, 560 100, India
| | - Shubhada R Hegde
- Institute of Bioinformatics and Applied Biotechnology (IBAB), Bengaluru, 560 100, India.
| |
Collapse
|
39
|
Arici MK, Tuncbag N. Performance Assessment of the Network Reconstruction Approaches on Various Interactomes. Front Mol Biosci 2021; 8:666705. [PMID: 34676243 PMCID: PMC8523993 DOI: 10.3389/fmolb.2021.666705] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Accepted: 07/14/2021] [Indexed: 01/04/2023] Open
Abstract
Beyond the list of molecules, there is a necessity to collectively consider multiple sets of omic data and to reconstruct the connections between the molecules. Especially, pathway reconstruction is crucial to understanding disease biology because abnormal cellular signaling may be pathological. The main challenge is how to integrate the data together in an accurate way. In this study, we aim to comparatively analyze the performance of a set of network reconstruction algorithms on multiple reference interactomes. We first explored several human protein interactomes, including PathwayCommons, OmniPath, HIPPIE, iRefWeb, STRING, and ConsensusPathDB. The comparison is based on the coverage of each interactome in terms of cancer driver proteins, structural information of protein interactions, and the bias toward well-studied proteins. We next used these interactomes to evaluate the performance of network reconstruction algorithms including all-pair shortest path, heat diffusion with flux, personalized PageRank with flux, and prize-collecting Steiner forest (PCSF) approaches. Each approach has its own merits and weaknesses. Among them, PCSF had the most balanced performance in terms of precision and recall scores when 28 pathways from NetPath were reconstructed using the listed algorithms. Additionally, the reference interactome affects the performance of the network reconstruction approaches. The coverage and disease- or tissue-specificity of each interactome may vary, which may result in differences in the reconstructed networks.
Collapse
Affiliation(s)
- M Kaan Arici
- Graduate School of Informatics, Middle East Technical University, Ankara, Turkey.,Foot and Mouth Diseases Institute, Ministry of Agriculture and Forestry, Ankara, Turkey
| | - Nurcan Tuncbag
- Chemical and Biological Engineering, College of Engineering, Koc University, Istanbul, Turkey.,School of Medicine, Koc University, Istanbul, Turkey
| |
Collapse
|
40
|
Verma A, Ebanks K, Fok CY, Lewis PA, Bettencourt C, Bandopadhyay R. In silico comparative analysis of LRRK2 interactomes from brain, kidney and lung. Brain Res 2021; 1765:147503. [PMID: 33915162 PMCID: PMC8212912 DOI: 10.1016/j.brainres.2021.147503] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 03/06/2021] [Accepted: 04/23/2021] [Indexed: 01/11/2023]
Abstract
Mutations in LRRK2 are the most frequent cause of familial Parkinson's disease (PD), with common LRRK2 non-coding variants also acting as risk factors for idiopathic PD. Currently, therapeutic agents targeting LRRK2 are undergoing advanced clinical trials in humans, however, it is important to understand the wider implications of LRRK2 targeted treatments given that LRRK2 is expressed in diverse tissues including the brain, kidney and lungs. This presents challenges to treatment in terms of effects on peripheral organ functioning, thus, protein interactors of LRRK2 could be targeted in lieu to optimize therapeutic effects. Herein an in-silico analysis of LRRK2 direct interactors in brain tissue from various brain regionswas conducted along with a comparative analysis of the LRRK2 interactome in the brain, kidney, and lung tissues. This was carried out based on curated protein-protein interaction (PPI) data from protein interaction databases such as HIPPIE, human gene/protein expression databases and Gene ontology (GO) enrichment analysis using Bingo. Seven targets (MAP2K6, MATK, MAPT, PAK6, SH3GL2, CDC42EP3 and CHGB) were found to be viable objectives for LRRK2 based investigations for PD that would have minimal impact on optimal functioning within peripheral organs. Specifically, MAPT, CHGB, PAK6, and SH3GL2 interacted with LRRK2 in the brain and kidney but not in lung tissue whilst LRRK2-MAP2K6 interacted only in the cerebellum and MATK-LRRK2 interaction was absent in kidney tissues. CDC42EP3 expression levels were low in brain tissues compared to kidney/lung. The results of this computational analysis suggest new avenues for experimental investigations towards LRRK2-targeted therapeutics.
Collapse
Affiliation(s)
- Amrita Verma
- Reta Lila Weston Institute of Neurological Studies, Department of Clinical and Movement Neuroscience, UCL Queen Square Institute of Neurology, London WC1N 1PJ, United Kingdom
| | - Kirsten Ebanks
- Reta Lila Weston Institute of Neurological Studies, Department of Clinical and Movement Neuroscience, UCL Queen Square Institute of Neurology, London WC1N 1PJ, United Kingdom
| | - Chi-Yee Fok
- Reta Lila Weston Institute of Neurological Studies, Department of Clinical and Movement Neuroscience, UCL Queen Square Institute of Neurology, London WC1N 1PJ, United Kingdom
| | - Patrick A Lewis
- Royal Veterinary College, Royal College Street, London NW10TV, United Kingdom; Department of Neurodegenerative Disease and Queen Square Brain Bank, UCL Queen Square Institute of Neurology, London WC1N 1PJ, United Kingdom; Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, United States
| | - Conceicao Bettencourt
- Department of Neurodegenerative Disease and Queen Square Brain Bank, UCL Queen Square Institute of Neurology, London WC1N 1PJ, United Kingdom
| | - Rina Bandopadhyay
- Reta Lila Weston Institute of Neurological Studies, Department of Clinical and Movement Neuroscience, UCL Queen Square Institute of Neurology, London WC1N 1PJ, United Kingdom.
| |
Collapse
|
41
|
Lang B, Yang JS, Garriga-Canut M, Speroni S, Aschern M, Gili M, Hoffmann T, Tartaglia GG, Maurer SP. Matrix-screening reveals a vast potential for direct protein-protein interactions among RNA binding proteins. Nucleic Acids Res 2021; 49:6702-6721. [PMID: 34133714 PMCID: PMC8266617 DOI: 10.1093/nar/gkab490] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Revised: 04/23/2021] [Accepted: 05/20/2021] [Indexed: 01/02/2023] Open
Abstract
RNA-binding proteins (RBPs) are crucial factors of post-transcriptional gene regulation and their modes of action are intensely investigated. At the center of attention are RNA motifs that guide where RBPs bind. However, sequence motifs are often poor predictors of RBP-RNA interactions in vivo. It is hence believed that many RBPs recognize RNAs as complexes, to increase specificity and regulatory possibilities. To probe the potential for complex formation among RBPs, we assembled a library of 978 mammalian RBPs and used rec-Y2H matrix screening to detect direct interactions between RBPs, sampling > 600 K interactions. We discovered 1994 new interactions and demonstrate that interacting RBPs bind RNAs adjacently in vivo. We further find that the mRNA binding region and motif preferences of RBPs deviate, depending on their adjacently binding interaction partners. Finally, we reveal novel RBP interaction networks among major RNA processing steps and show that splicing impairing RBP mutations observed in cancer rewire spliceosomal interaction networks. The dataset we provide will be a valuable resource for understanding the combinatorial interactions of RBPs with RNAs and the resulting regulatory outcomes.
Collapse
Affiliation(s)
- Benjamin Lang
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology (BIST), Doctor Aiguader 88, Barcelona 08003, Spain.,Department of Structural Biology and Center of Excellence for Data-Driven Discovery, St. Jude Children's Research Hospital, 262 Danny Thomas Place, Memphis, TN 38105, USA
| | - Jae-Seong Yang
- Centre de Recerca en Agrigenòmica, Consortium CSIC-IRTA-UAB-UB (CRAG), Cerdanyola del Vallès, 08193 Barcelona, Spain
| | - Mireia Garriga-Canut
- Division of Engineering, New York University Abu Dhabi (NYUAD), Abu Dhabi 129188, UAE
| | - Silvia Speroni
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology (BIST), Doctor Aiguader 88, Barcelona 08003, Spain
| | - Moritz Aschern
- Centre de Recerca en Agrigenòmica, Consortium CSIC-IRTA-UAB-UB (CRAG), Cerdanyola del Vallès, 08193 Barcelona, Spain
| | - Maria Gili
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology (BIST), Doctor Aiguader 88, Barcelona 08003, Spain
| | - Tobias Hoffmann
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology (BIST), Doctor Aiguader 88, Barcelona 08003, Spain
| | - Gian Gaetano Tartaglia
- Center for Human Technologies, Istituto Italiano di Tecnologia, Via Enrico Melen 83, 16152, Genoa, Italy.,Biology and Biotechnology Department "Charles Darwin", Sapienza University of Rome, P.le A. Moro 5, Rome 00185, Italy
| | - Sebastian P Maurer
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology (BIST), Doctor Aiguader 88, Barcelona 08003, Spain.,Universitat Pompeu Fabra (UPF), Department of Experimental and Health Sciences, Barcelona, Spain
| |
Collapse
|
42
|
Alanis-Lobato G, Möllmann JS, Schaefer MH, Andrade-Navarro MA. MIPPIE: the mouse integrated protein-protein interaction reference. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2021; 2020:5850252. [PMID: 32496562 PMCID: PMC7271249 DOI: 10.1093/database/baaa035] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/25/2019] [Revised: 04/02/2020] [Accepted: 04/29/2020] [Indexed: 12/13/2022]
Abstract
Cells operate and react to environmental signals thanks to a complex network of protein–protein interactions (PPIs), the malfunction of which can severely disrupt cellular homeostasis. As a result, mapping and analyzing protein networks are key to advancing our understanding of biological processes and diseases. An invaluable part of these endeavors has been the house mouse (Mus musculus), the mammalian model organism par excellence, which has provided insights into human biology and disorders. The importance of investigating PPI networks in the context of mouse prompted us to develop the Mouse Integrated Protein–Protein Interaction rEference (MIPPIE). MIPPIE inherits a robust infrastructure from HIPPIE, its sister database of human PPIs, allowing for the assembly of reliable networks supported by different evidence sources and high-quality experimental techniques. MIPPIE networks can be further refined with tissue, directionality and effect information through a user-friendly web interface. Moreover, all MIPPIE data and meta-data can be accessed via a REST web service or downloaded as text files, thus facilitating the integration of mouse PPIs into follow-up bioinformatics pipelines.
Collapse
Affiliation(s)
- Gregorio Alanis-Lobato
- Faculty of Biology, Johannes Gutenberg University, Biozentrum I, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany.,Human Embryo and Stem Cell Laboratory, The Francis Crick Institute, 1 Midland Road, NW1 1AT London, UK
| | - Jannik S Möllmann
- Faculty of Biology, Johannes Gutenberg University, Biozentrum I, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
| | - Martin H Schaefer
- Department of Experimental Oncology, European Institute of Oncology IRCCS, Via Adamello 16, 20139 Milan, Italy
| | - Miguel A Andrade-Navarro
- Faculty of Biology, Johannes Gutenberg University, Biozentrum I, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
| |
Collapse
|
43
|
Freitas A, Aroso M, Rocha S, Ferreira R, Vitorino R, Gomez-Lazaro M. Bioinformatic analysis of the human brain extracellular matrix proteome in neurodegenerative disorders. Eur J Neurosci 2021; 53:4016-4033. [PMID: 34013613 DOI: 10.1111/ejn.15316] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Revised: 05/12/2021] [Accepted: 05/14/2021] [Indexed: 12/22/2022]
Abstract
Alzheimer's, Parkinson's, and Huntington's diseases are characterized by selective degeneration of specific brain areas. Although increasing number of studies report alteration of the extracellular matrix on these diseases, an exhaustive characterization at the brain's matrix level might contribute to the development of more efficient cell restoration therapies. In that regard, proteomics-based studies are a powerful approach to uncover matrix changes. However, to date, the majority of proteomics studies report no or only a few brain matrix proteins with altered expression. This study aims to reveal the changes in the brain extracellular matrix by integrating several proteomics-based studies performed with postmortem tissue. In total, 67 matrix proteins with altered expression were collected. By applying a bioinformatic approach, we were able to reveal the dysregulated biological processes. Among them are processes related to the organization of the extracellular matrix, glycosaminoglycans and proteoglycans' metabolism, blood coagulation, and response to injury and oxidative stress. In addition, a protein was found altered in all three diseases-collagen type I alpha 2-and its binding partners further identified. A ClueGO network was created, depicting the GO groups associated with these binding partners, uncovering the processes that may consequently be affected. These include cellular adhesion, cell signaling through membrane receptors, inflammatory processes, and apoptotic cell death in response to oxidative stress. Overall, we were able to associate the contribution of the modification of extracellular matrix components to essential biological processes, highlighting the investment needed on proteomics studies with specific focus on the extracellular matrix in neurodegeneration.
Collapse
Affiliation(s)
- Ana Freitas
- i3S -Instituto de Investigação e Inovação em Saúde, Universidade do Porto, Porto, Portugal.,INEB -Instituto de Engenharia Biomédica, Universidade do Porto, Porto, Portugal.,FMUP - Faculdade de Medicina, Universidade do Porto, Porto, Portugal
| | - Miguel Aroso
- i3S -Instituto de Investigação e Inovação em Saúde, Universidade do Porto, Porto, Portugal.,INEB -Instituto de Engenharia Biomédica, Universidade do Porto, Porto, Portugal
| | - Sara Rocha
- i3S -Instituto de Investigação e Inovação em Saúde, Universidade do Porto, Porto, Portugal.,INEB -Instituto de Engenharia Biomédica, Universidade do Porto, Porto, Portugal
| | - Rita Ferreira
- QOPNA &, LAQV, Department of Chemistry, University of Aveiro, Aveiro, Portugal
| | - Rui Vitorino
- Department of Medical Sciences, iBiMED, University of Aveiro, Aveiro, Portugal.,Department of Physiology and Cardiothoracic Surgery, Faculty of Medicine, University of Porto, Porto, Portugal
| | - Maria Gomez-Lazaro
- i3S -Instituto de Investigação e Inovação em Saúde, Universidade do Porto, Porto, Portugal.,INEB -Instituto de Engenharia Biomédica, Universidade do Porto, Porto, Portugal
| |
Collapse
|
44
|
Xiang Z, Gong W, Li Z, Yang X, Wang J, Wang H. Predicting Protein-Protein Interactions via Gated Graph Attention Signed Network. Biomolecules 2021; 11:799. [PMID: 34071437 PMCID: PMC8228288 DOI: 10.3390/biom11060799] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Revised: 05/24/2021] [Accepted: 05/26/2021] [Indexed: 01/01/2023] Open
Abstract
Protein-protein interactions (PPIs) play a key role in signal transduction and pharmacogenomics, and hence, accurate PPI prediction is crucial. Graph structures have received increasing attention owing to their outstanding performance in machine learning. In practice, PPIs can be expressed as a signed network (i.e., graph structure), wherein the nodes in the network represent proteins, and edges represent the interactions (positive or negative effects) of protein nodes. PPI predictions can be realized by predicting the links of the signed network; therefore, the use of gated graph attention for signed networks (SN-GGAT) is proposed herein. First, the concept of graph attention network (GAT) is applied to signed networks, in which "attention" represents the weight of neighbor nodes, and GAT updates the node features through the weighted aggregation of neighbor nodes. Then, the gating mechanism is defined and combined with the balance theory to obtain the high-order relations of protein nodes to improve the attention effect, making the attention mechanism follow the principle of "low-order high attention, high-order low attention, different signs opposite". PPIs are subsequently predicted on the Saccharomyces cerevisiae core dataset and the Human dataset. The test results demonstrate that the proposed method exhibits strong competitiveness.
Collapse
Affiliation(s)
- Zhijie Xiang
- School of Information Science and Engineering, Shandong Normal University, Jinan 250014, China; (Z.X.); (W.G.); (Z.L.); (X.Y.); (J.W.)
| | - Weijia Gong
- School of Information Science and Engineering, Shandong Normal University, Jinan 250014, China; (Z.X.); (W.G.); (Z.L.); (X.Y.); (J.W.)
| | - Zehui Li
- School of Information Science and Engineering, Shandong Normal University, Jinan 250014, China; (Z.X.); (W.G.); (Z.L.); (X.Y.); (J.W.)
| | - Xue Yang
- School of Information Science and Engineering, Shandong Normal University, Jinan 250014, China; (Z.X.); (W.G.); (Z.L.); (X.Y.); (J.W.)
| | - Jihua Wang
- School of Information Science and Engineering, Shandong Normal University, Jinan 250014, China; (Z.X.); (W.G.); (Z.L.); (X.Y.); (J.W.)
| | - Hong Wang
- School of Information Science and Engineering, Shandong Normal University, Jinan 250014, China; (Z.X.); (W.G.); (Z.L.); (X.Y.); (J.W.)
- Shandong Provincial Key Laboratory for Distributed Computer Software Novel Technology, Shandong Normal University, Jinan 250014, China
| |
Collapse
|
45
|
Basu M, Wang K, Ruppin E, Hannenhalli S. Predicting tissue-specific gene expression from whole blood transcriptome. SCIENCE ADVANCES 2021; 7:eabd6991. [PMID: 33811070 PMCID: PMC11057699 DOI: 10.1126/sciadv.abd6991] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/07/2020] [Accepted: 02/12/2021] [Indexed: 06/12/2023]
Abstract
Complex diseases are mediated via transcriptional dysregulation in multiple tissues. Thus, knowing an individual's tissue-specific gene expression can provide critical information about her health. Unfortunately, for most tissues, the transcriptome cannot be obtained without invasive procedures. Could we, however, infer an individual's tissue-specific expression from her whole blood transcriptome? Here, we rigorously address this question. We find that an individual's whole blood transcriptome can significantly predict tissue-specific expression levels for ~60% of the genes on average across 32 tissues, with up to 81% of the genes in skeletal muscle. The tissue-specific expression inferred from the blood transcriptome is almost as good as the actual measured tissue expression in predicting disease state for six different complex disorders, including hypertension and type 2 diabetes, substantially surpassing the blood transcriptome. The code for tissue-specific gene expression prediction, TEEBoT, is provided, enabling others to study its potential translational value in other indications.
Collapse
Affiliation(s)
- Mahashweta Basu
- Institute for Genome Sciences, University of Maryland, Baltimore, MD, USA
| | - Kun Wang
- Cancer Data Science Laboratory, National Cancer Institute, NIH, Bethesda, MD, USA
| | - Eytan Ruppin
- Cancer Data Science Laboratory, National Cancer Institute, NIH, Bethesda, MD, USA.
| | - Sridhar Hannenhalli
- Cancer Data Science Laboratory, National Cancer Institute, NIH, Bethesda, MD, USA.
| |
Collapse
|
46
|
Muzio G, O’Bray L, Borgwardt K. Biological network analysis with deep learning. Brief Bioinform 2021; 22:1515-1530. [PMID: 33169146 PMCID: PMC7986589 DOI: 10.1093/bib/bbaa257] [Citation(s) in RCA: 96] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 08/26/2020] [Accepted: 09/11/2020] [Indexed: 12/17/2022] Open
Abstract
Recent advancements in experimental high-throughput technologies have expanded the availability and quantity of molecular data in biology. Given the importance of interactions in biological processes, such as the interactions between proteins or the bonds within a chemical compound, this data is often represented in the form of a biological network. The rise of this data has created a need for new computational tools to analyze networks. One major trend in the field is to use deep learning for this goal and, more specifically, to use methods that work with networks, the so-called graph neural networks (GNNs). In this article, we describe biological networks and review the principles and underlying algorithms of GNNs. We then discuss domains in bioinformatics in which graph neural networks are frequently being applied at the moment, such as protein function prediction, protein-protein interaction prediction and in silico drug discovery and development. Finally, we highlight application areas such as gene regulatory networks and disease diagnosis where deep learning is emerging as a new tool to answer classic questions like gene interaction prediction and automatic disease prediction from data.
Collapse
Affiliation(s)
- Giulia Muzio
- Machine Learning and Computational Biology Lab at ETH Zürich
| | - Leslie O’Bray
- Machine Learning and Computational Biology Lab at ETH Zürich
| | | |
Collapse
|
47
|
Muzio G, O'Bray L, Borgwardt K. Biological network analysis with deep learning. Brief Bioinform 2021; 22:1515-1530. [PMID: 33169146 DOI: 10.1145/3447548.3467442] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 08/26/2020] [Accepted: 09/11/2020] [Indexed: 05/28/2023] Open
Abstract
Recent advancements in experimental high-throughput technologies have expanded the availability and quantity of molecular data in biology. Given the importance of interactions in biological processes, such as the interactions between proteins or the bonds within a chemical compound, this data is often represented in the form of a biological network. The rise of this data has created a need for new computational tools to analyze networks. One major trend in the field is to use deep learning for this goal and, more specifically, to use methods that work with networks, the so-called graph neural networks (GNNs). In this article, we describe biological networks and review the principles and underlying algorithms of GNNs. We then discuss domains in bioinformatics in which graph neural networks are frequently being applied at the moment, such as protein function prediction, protein-protein interaction prediction and in silico drug discovery and development. Finally, we highlight application areas such as gene regulatory networks and disease diagnosis where deep learning is emerging as a new tool to answer classic questions like gene interaction prediction and automatic disease prediction from data.
Collapse
Affiliation(s)
- Giulia Muzio
- Machine Learning and Computational Biology Lab at ETH Zürich
| | - Leslie O'Bray
- Machine Learning and Computational Biology Lab at ETH Zürich
| | | |
Collapse
|
48
|
Levchenko A, Kanapin A, Samsonova A, Fedorenko OY, Kornetova EG, Nurgaliev T, Mazo GE, Semke AV, Kibitov AO, Bokhan NA, Gainetdinov RR, Ivanova SA. A genome-wide association study identifies a gene network associated with paranoid schizophrenia and antipsychotics-induced tardive dyskinesia. Prog Neuropsychopharmacol Biol Psychiatry 2021; 105:110134. [PMID: 33065217 DOI: 10.1016/j.pnpbp.2020.110134] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/26/2020] [Revised: 09/10/2020] [Accepted: 10/06/2020] [Indexed: 02/06/2023]
Abstract
In the present study we conducted a genome-wide association study (GWAS) in a cohort of 505 patients with paranoid schizophrenia (SCZ), of which 95 had tardive dyskinesia (TD), and 503 healthy controls. Using data generated by the PsychENCODE Consortium (PEC) and other bioinformatic databases, we revealed a gene network, implicated in neurodevelopment and brain function, associated with both these disorders. Almost all these genes are in gene or isoform co-expression PEC network modules important for the functioning of the brain; the activity of these networks is also altered in SCZ, bipolar disorder and autism spectrum disorders. The associated PEC network modules are enriched for gene ontology terms relevant to the brain development and function (CNS development, neuron development, axon ensheathment, synapse, synaptic vesicle cycle, and signaling receptor activity) and to the immune system (inflammatory response). Results of the present study suggest that orofacial and limbtruncal types of TD seem to share the molecular network with SCZ. Paranoid SCZ and abnormal involuntary movements that indicate the orofacial type of TD are associated with the same genomic loci on chromosomes 3p22.2, 8q21.13, and 13q14.2. The limbtruncal type of TD is associated with a locus on chromosome 3p13 where the best functional candidate is FOXP1, a high-confidence SCZ gene. The results of this study shed light on common pathogenic mechanisms for SCZ and TD, and indicate that the pathogenesis of the orofacial and limbtruncal types of TD might be driven by interacting genes implicated in neurodevelopment.
Collapse
Affiliation(s)
- Anastasia Levchenko
- Theodosius Dobzhansky Center for Genome Bioinformatics, Saint Petersburg State University, Saint Petersburg, Russia.
| | - Alexander Kanapin
- Theodosius Dobzhansky Center for Genome Bioinformatics, Saint Petersburg State University, Saint Petersburg, Russia
| | - Anastasia Samsonova
- Theodosius Dobzhansky Center for Genome Bioinformatics, Saint Petersburg State University, Saint Petersburg, Russia
| | - Olga Yu Fedorenko
- Mental Health Research Institute, Tomsk National Research Medical Center, Russian Academy of Sciences, Tomsk, Russia; National Research Tomsk Polytechnic University, Tomsk, Russia
| | - Elena G Kornetova
- Mental Health Research Institute, Tomsk National Research Medical Center, Russian Academy of Sciences, Tomsk, Russia; Siberian State Medical University, Tomsk, Russia
| | | | - Galina E Mazo
- Department of Endocrine Psychiatry, V.M. Bekhterev National Medical Research Center for Psychiatry and Neurology, Saint Petersburg, Russia
| | - Arkadiy V Semke
- Mental Health Research Institute, Tomsk National Research Medical Center, Russian Academy of Sciences, Tomsk, Russia
| | - Alexander O Kibitov
- Department of Endocrine Psychiatry, V.M. Bekhterev National Medical Research Center for Psychiatry and Neurology, Saint Petersburg, Russia; Laboratory of Molecular Genetics, Serbsky National Medical Research Center on Psychiatry and Addictions, Moscow, Russia
| | - Nikolay A Bokhan
- Mental Health Research Institute, Tomsk National Research Medical Center, Russian Academy of Sciences, Tomsk, Russia; Siberian State Medical University, Tomsk, Russia; National Research Tomsk State University, Tomsk, Russia
| | - Raul R Gainetdinov
- Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg, Russia
| | - Svetlana A Ivanova
- Mental Health Research Institute, Tomsk National Research Medical Center, Russian Academy of Sciences, Tomsk, Russia; National Research Tomsk Polytechnic University, Tomsk, Russia; Siberian State Medical University, Tomsk, Russia
| |
Collapse
|
49
|
Luo P, Chen B, Liao B, Wu F. Predicting disease‐associated genes: Computational methods, databases, and evaluations. WIRES DATA MINING AND KNOWLEDGE DISCOVERY 2021; 11. [DOI: 10.1002/widm.1383] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2019] [Accepted: 06/13/2020] [Indexed: 09/09/2024]
Abstract
AbstractComplex diseases are associated with a set of genes (called disease genes), the identification of which can help scientists uncover the mechanisms of diseases and develop new drugs and treatment strategies. Due to the huge cost and time of experimental identification techniques, many computational algorithms have been proposed to predict disease genes. Although several review publications in recent years have discussed many computational methods, some of them focus on cancer driver genes while others focus on biomolecular networks, which only cover a specific aspect of existing methods. In this review, we summarize existing methods and classify them into three categories based on their rationales. Then, the algorithms, biological data, and evaluation methods used in the computational prediction are discussed. Finally, we highlight the limitations of existing methods and point out some future directions for improving these algorithms. This review could help investigators understand the principles of existing methods, and thus develop new methods to advance the computational prediction of disease genes.This article is categorized under:Technologies > Machine LearningTechnologies > PredictionAlgorithmic Development > Biological Data Mining
Collapse
Affiliation(s)
- Ping Luo
- Division of Biomedical Engineering University of Saskatchewan Saskatoon Canada
- Princess Margaret Cancer Centre University Health Network Toronto Canada
| | - Bolin Chen
- School of Computer Science and Technology Northwestern Polytechnical University China
| | - Bo Liao
- School of Mathematics and Statistics Hainan Normal University Haikou China
| | - Fang‐Xiang Wu
- Department of Mechanical Engineering and Department of Computer Science University of Saskatchewan Saskatoon Canada
| |
Collapse
|
50
|
Kitani T, Maddipatla SC, Madupuri R, Greco C, Hartmann J, Baraniuk JN, Vasudevan S. In Search of Newer Targets for Inflammatory Bowel Disease: A Systems and a Network Medicine Approach. NETWORK AND SYSTEMS MEDICINE 2021. [DOI: 10.1089/nsm.2020.0012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Affiliation(s)
- Takashi Kitani
- Department of Neurology, Molecular and Cellular Biology, Georgetown University Medical Center, Washington, District of Columbia, USA
| | - Sushma C. Maddipatla
- Department of Biochemistry, Molecular and Cellular Biology, Georgetown University Medical Center, Washington, District of Columbia, USA
| | - Ramya Madupuri
- Department of Biochemistry, Molecular and Cellular Biology, Georgetown University Medical Center, Washington, District of Columbia, USA
| | - Christopher Greco
- Department of Biochemistry, Molecular and Cellular Biology, Georgetown University Medical Center, Washington, District of Columbia, USA
| | - Jonathan Hartmann
- Dahlgren Memorial Library, Graduate Health and Life Sciences Research Library, Georgetown University Medical Center, Washington, District of Columbia, USA
| | - James N. Baraniuk
- Division of Rheumatology, Immunology and Allergy, Department of Medicine, Georgetown University Medical Center, Washington, District of Columbia, USA
| | - Sona Vasudevan
- Department of Biochemistry, Molecular and Cellular Biology, Georgetown University Medical Center, Washington, District of Columbia, USA
| |
Collapse
|