1
|
Vagiona AC, Notopoulou S, Zdráhal Z, Gonçalves-Kulik M, Petrakis S, Andrade-Navarro MA. Prediction of protein interactions with function in protein (de-)phosphorylation. PLoS One 2025; 20:e0319084. [PMID: 40029919 PMCID: PMC11875375 DOI: 10.1371/journal.pone.0319084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2024] [Accepted: 01/28/2025] [Indexed: 03/06/2025] Open
Abstract
Protein-protein interactions (PPIs) form a complex network called "interactome" that regulates many functions in the cell. In recent years, there is an increasing accumulation of evidence supporting the existence of a hyperbolic geometry underlying the network representation of complex systems such as the interactome. In particular, it has been shown that the embedding of the human Protein-Interaction Network (hPIN) in hyperbolic space (H2) captures biologically relevant information. Here we explore whether this mapping contains information that would allow us to predict the function of PPIs, more specifically interactions related to post-translational modification (PTM). We used a random forest algorithm to predict PTM-related directed PPIs, concretely, protein phosphorylation and dephosphorylation, based on hyperbolic properties and centrality measures of the hPIN mapped in H2. To evaluate the efficacy of our algorithm, we predicted PTM-related PPIs of ataxin-1, a protein which is responsible for Spinocerebellar Ataxia type 1 (SCA1). Proteomics analysis in a cellular model revealed that several of the predicted PTM-PPIs were indeed dysregulated in a SCA1-related disease network. A compact cluster composed of ataxin-1, its dysregulated PTM-PPIs and their common upstream regulators may represent critical interactions for disease pathology. Thus, our algorithm may infer phosphorylation activity on proteins through directed PPIs.
Collapse
Affiliation(s)
- Aimilia-Christina Vagiona
- Faculty of Biology, Insitute of Organismic and Molecular Evolution, Johannes Gutenberg University, Biozentrum I, Mainz, Germany
| | - Sofia Notopoulou
- Institute of Applied Biosciences/Centre for Research and Technology Hellas, Thessaloniki, Greece
| | - Zbyněk Zdráhal
- Central European Institute of Technology, Masaryk University, Brno, Czech Republic
| | - Mariane Gonçalves-Kulik
- Faculty of Biology, Insitute of Organismic and Molecular Evolution, Johannes Gutenberg University, Biozentrum I, Mainz, Germany
| | - Spyros Petrakis
- Institute of Applied Biosciences/Centre for Research and Technology Hellas, Thessaloniki, Greece
| | - Miguel A. Andrade-Navarro
- Faculty of Biology, Insitute of Organismic and Molecular Evolution, Johannes Gutenberg University, Biozentrum I, Mainz, Germany
| |
Collapse
|
2
|
Pržulj N, Malod-Dognin N. Simplicity within biological complexity. BIOINFORMATICS ADVANCES 2025; 5:vbae164. [PMID: 39927291 PMCID: PMC11805345 DOI: 10.1093/bioadv/vbae164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Revised: 10/01/2024] [Accepted: 10/23/2024] [Indexed: 02/11/2025]
Abstract
Motivation Heterogeneous, interconnected, systems-level, molecular (multi-omic) data have become increasingly available and key in precision medicine. We need to utilize them to better stratify patients into risk groups, discover new biomarkers and targets, repurpose known and discover new drugs to personalize medical treatment. Existing methodologies are limited and a paradigm shift is needed to achieve quantitative and qualitative breakthroughs. Results In this perspective paper, we survey the literature and argue for the development of a comprehensive, general framework for embedding of multi-scale molecular network data that would enable their explainable exploitation in precision medicine in linear time. Network embedding methods (also called graph representation learning) map nodes to points in low-dimensional space, so that proximity in the learned space reflects the network's topology-function relationships. They have recently achieved unprecedented performance on hard problems of utilizing few omic data in various biomedical applications. However, research thus far has been limited to special variants of the problems and data, with the performance depending on the underlying topology-function network biology hypotheses, the biomedical applications, and evaluation metrics. The availability of multi-omic data, modern graph embedding paradigms and compute power call for a creation and training of efficient, explainable and controllable models, having no potentially dangerous, unexpected behaviour, that make a qualitative breakthrough. We propose to develop a general, comprehensive embedding framework for multi-omic network data, from models to efficient and scalable software implementation, and to apply it to biomedical informatics, focusing on precision medicine and personalized drug discovery. It will lead to a paradigm shift in the computational and biomedical understanding of data and diseases that will open up ways to solve some of the major bottlenecks in precision medicine and other domains.
Collapse
Affiliation(s)
- Nataša Pržulj
- Computational Biology Department, Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, 00000, United Arabic Emirates
- Barcelona Supercomputing Center, Barcelona 08034, Spain
- Department of Computer Science, University College London, London WC1E6BT, United Kingdom
- ICREA, Pg. Lluís Companys 23, Barcelona 08010, Spain
| | | |
Collapse
|
3
|
Gkekas I, Vagiona AC, Pechlivanis N, Kastrinaki G, Pliatsika K, Iben S, Xanthopoulos K, Psomopoulos FE, Andrade-Navarro MA, Petrakis S. Intranuclear inclusions of polyQ-expanded ATXN1 sequester RNA molecules. Front Mol Neurosci 2023; 16:1280546. [PMID: 38125008 PMCID: PMC10730666 DOI: 10.3389/fnmol.2023.1280546] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Accepted: 11/16/2023] [Indexed: 12/23/2023] Open
Abstract
Spinocerebellar ataxia type 1 (SCA1) is an autosomal dominant neurodegenerative disease caused by a trinucleotide (CAG) repeat expansion in the ATXN1 gene. It is characterized by the presence of polyglutamine (polyQ) intranuclear inclusion bodies (IIBs) within affected neurons. In order to investigate the impact of polyQ IIBs in SCA1 pathogenesis, we generated a novel protein aggregation model by inducible overexpression of the mutant ATXN1(Q82) isoform in human neuroblastoma SH-SY5Y cells. Moreover, we developed a simple and reproducible protocol for the efficient isolation of insoluble IIBs. Biophysical characterization showed that polyQ IIBs are enriched in RNA molecules which were further identified by next-generation sequencing. Finally, a protein interaction network analysis indicated that sequestration of essential RNA transcripts within ATXN1(Q82) IIBs may affect the ribosome resulting in error-prone protein synthesis and global proteome instability. These findings provide novel insights into the molecular pathogenesis of SCA1, highlighting the role of polyQ IIBs and their impact on critical cellular processes.
Collapse
Affiliation(s)
- Ioannis Gkekas
- Centre for Research and Technology Hellas, Institute of Applied Biosciences, Thessaloniki, Greece
- Laboratory of Pharmacology, School of Pharmacy, Faculty of Health Sciences, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | | | - Nikolaos Pechlivanis
- Centre for Research and Technology Hellas, Institute of Applied Biosciences, Thessaloniki, Greece
| | - Georgia Kastrinaki
- Aerosol and Particle Technology Laboratory, Centre for Research and Technology Hellas, Chemical Process and Energy Resources Institute, Thessaloniki, Greece
| | - Katerina Pliatsika
- Centre for Research and Technology Hellas, Institute of Applied Biosciences, Thessaloniki, Greece
- Laboratory of Pharmacology, School of Pharmacy, Faculty of Health Sciences, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Sebastian Iben
- Department of Dermatology and Allergic Diseases, University of Ulm, Ulm, Germany
| | - Konstantinos Xanthopoulos
- Centre for Research and Technology Hellas, Institute of Applied Biosciences, Thessaloniki, Greece
- Laboratory of Pharmacology, School of Pharmacy, Faculty of Health Sciences, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Fotis E. Psomopoulos
- Centre for Research and Technology Hellas, Institute of Applied Biosciences, Thessaloniki, Greece
| | | | - Spyros Petrakis
- Centre for Research and Technology Hellas, Institute of Applied Biosciences, Thessaloniki, Greece
| |
Collapse
|
4
|
Vagiona AC, Mier P, Petrakis S, Andrade-Navarro MA. Analysis of Huntington's Disease Modifiers Using the Hyperbolic Mapping of the Protein Interaction Network. Int J Mol Sci 2022; 23:5853. [PMID: 35628660 PMCID: PMC9144261 DOI: 10.3390/ijms23105853] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 05/19/2022] [Accepted: 05/19/2022] [Indexed: 02/05/2023] Open
Abstract
Huntington's disease (HD) is caused by the production of a mutant huntingtin (HTT) with an abnormally long poly-glutamine (polyQ) tract, forming aggregates and inclusions in neurons. Previous work by us and others has shown that an increase or decrease in polyQ-triggered aggregates can be passive simply due to the interaction of proteins with the aggregates. To search for proteins with active (functional) effects, which might be more effective in finding therapies and mechanisms of HD, we selected among the proteins that interact with HTT a total of 49 pairs of proteins that, while being paralogous to each other (and thus expected to have similar passive interaction with HTT), are located in different regions of the protein interaction network (suggesting participation in different pathways or complexes). Three of these 49 pairs contained members with opposite effects on HD, according to the literature. The negative members of the three pairs, MID1, IKBKG, and IKBKB, interact with PPP2CA and TUBB, which are known negative factors in HD, as well as with HSP90AA1 and RPS3. The positive members of the three pairs interact with HSPA9. Our results provide potential HD modifiers of functional relevance and reveal the dynamic aspect of paralog evolution within the interaction network.
Collapse
Affiliation(s)
- Aimilia-Christina Vagiona
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany; (A.-C.V.); (P.M.)
| | - Pablo Mier
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany; (A.-C.V.); (P.M.)
| | - Spyros Petrakis
- Institute of Applied Biosciences/Centre for Research and Technology Hellas, 57001 Thessaloniki, Greece;
| | - Miguel A. Andrade-Navarro
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany; (A.-C.V.); (P.M.)
| |
Collapse
|
5
|
Gurbuz O, Alanis-Lobato G, Picart-Armada S, Sun M, Haslinger C, Lawless N, Fernandez-Albert F. Knowledge Graphs for Indication Expansion: An Explainable Target-Disease Prediction Method. Front Genet 2022; 13:814093. [PMID: 35360842 PMCID: PMC8963915 DOI: 10.3389/fgene.2022.814093] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 01/28/2022] [Indexed: 11/19/2022] Open
Abstract
Indication expansion aims to find new indications for existing targets in order to accelerate the process of launching a new drug for a disease on the market. The rapid increase in data types and data sources for computational drug discovery has fostered the use of semantic knowledge graphs (KGs) for indication expansion through target centric approaches, or in other words, target repositioning. Previously, we developed a novel method to construct a KG for indication expansion studies, with the aim of finding and justifying alternative indications for a target gene of interest. In contrast to other KGs, ours combines human-curated full-text literature and gene expression data from biomedical databases to encode relationships between genes, diseases, and tissues. Here, we assessed the suitability of our KG for explainable target-disease link prediction using a glass-box approach. To evaluate the predictive power of our KG, we applied shortest path with tissue information- and embedding-based prediction methods to a graph constructed with information published before or during 2010. We also obtained random baselines by applying the shortest path predictive methods to KGs with randomly shuffled node labels. Then, we evaluated the accuracy of the top predictions using gene-disease links reported after 2010. In addition, we investigated the contribution of the KG’s tissue expression entity to the prediction performance. Our experiments showed that shortest path-based methods significantly outperform the random baselines and embedding-based methods outperform the shortest path predictions. Importantly, removing the tissue expression entity from the KG severely impacts the quality of the predictions, especially those produced by the embedding approaches. Finally, since the interpretability of the predictions is crucial in indication expansion, we highlight the advantages of our glass-box model through the examination of example candidate target-disease predictions.
Collapse
Affiliation(s)
- Ozge Gurbuz
- Discovery Research Coordination Germany, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an der Riss, Germany
- *Correspondence: Ozge Gurbuz, ; Francesc Fernandez-Albert,
| | - Gregorio Alanis-Lobato
- Global Computational Biology and Data Sciences, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an der Riss, Germany
| | - Sergio Picart-Armada
- Global Computational Biology and Data Sciences, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an der Riss, Germany
| | - Miao Sun
- Global Computational Biology and Data Sciences, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an der Riss, Germany
| | - Christian Haslinger
- Global Computational Biology and Data Sciences, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an der Riss, Germany
| | - Nathan Lawless
- Global Computational Biology and Data Sciences, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an der Riss, Germany
| | - Francesc Fernandez-Albert
- Global Computational Biology and Data Sciences, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an der Riss, Germany
- *Correspondence: Ozge Gurbuz, ; Francesc Fernandez-Albert,
| |
Collapse
|
6
|
Liu J, Zhu H, Qiu J. Locally Adjust Networks Based on Connectivity and Semantic Similarities for Disease Module Detection. Front Genet 2021; 12:726596. [PMID: 34759955 PMCID: PMC8575408 DOI: 10.3389/fgene.2021.726596] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Accepted: 09/22/2021] [Indexed: 11/13/2022] Open
Abstract
For studying the pathogenesis of complex diseases, it is important to identify the disease modules in the system level. Since the protein-protein interaction (PPI) networks contain a number of incomplete and incorrect interactome, most existing methods often lead to many disease proteins isolating from disease modules. In this paper, we propose an effective disease module identification method IDMCSS, where the used human PPI networks are obtained by adding some potential missing interactions from existing PPI networks, as well as removing some potential incorrect interactions. In IDMCSS, a network adjustment strategy is developed to add or remove links around disease proteins based on both topological and semantic information. Next, neighboring proteins of disease proteins are prioritized according to a suggested similarity between each of them and disease proteins, and the protein with the largest similarity with disease proteins is added into a candidate disease protein set one by one. The stopping criterion is set to the boundary of the disease proteins. Finally, the connected subnetwork having the largest number of disease proteins is selected as a disease module. Experimental results on asthma demonstrate the effectiveness of the method in comparison to existing algorithms for disease module identification. It is also shown that the proposed IDMCSS can obtain the disease modules having crucial biological processes of asthma and 12 targets for drug intervention can be predicted.
Collapse
Affiliation(s)
- Jia Liu
- State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing, China
| | - Huole Zhu
- Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, School of Artificial Intelligence, Anhui University, Hefei, China
- Information Materials and Intelligent Sensing Laboratory of Anhui Province, School of Artificial Intelligence, Anhui University, Hefei, China
| | - Jianfeng Qiu
- Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, School of Artificial Intelligence, Anhui University, Hefei, China
- Information Materials and Intelligent Sensing Laboratory of Anhui Province, School of Artificial Intelligence, Anhui University, Hefei, China
| |
Collapse
|
7
|
Vakil V, Trappe W. Drug Combinations: Mathematical Modeling and Networking Methods. Pharmaceutics 2019; 11:E208. [PMID: 31052580 PMCID: PMC6571786 DOI: 10.3390/pharmaceutics11050208] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2019] [Revised: 04/24/2019] [Accepted: 04/27/2019] [Indexed: 12/14/2022] Open
Abstract
Treatments consisting of mixtures of pharmacological agents have been shown to have superior effects to treatments involving single compounds. Given the vast amount of possible combinations involving multiple drugs and the restrictions in time and resources required to test all such combinations in vitro, mathematical methods are essential to model the interactive behavior of the drug mixture and the target, ultimately allowing one to better predict the outcome of the combination. In this review, we investigate various mathematical methods that model combination therapies. This survey includes the methods that focus on predicting the outcome of drug combinations with respect to synergism and antagonism, as well as the methods that explore the dynamics of combination therapy and its role in combating drug resistance. This comprehensive investigation of the mathematical methods includes models that employ pharmacodynamics equations, those that rely on signaling and how the underlying chemical networks are affected by the topological structure of the target proteins, and models that are based on stochastic models for evolutionary dynamics. Additionally, this article reviews computational methods including mathematical algorithms, machine learning, and search algorithms that can identify promising combinations of drug compounds. A description of existing data and software resources is provided that can support investigations in drug combination therapies. Finally, the article concludes with a summary of future directions for investigation by the research community.
Collapse
Affiliation(s)
- Vahideh Vakil
- WINLAB, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA.
| | - Wade Trappe
- WINLAB, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA.
| |
Collapse
|
8
|
Amell A, Roso-Llorach A, Palomero L, Cuadras D, Galván-Femenía I, Serra-Musach J, Comellas F, de Cid R, Pujana MA, Violán C. Disease networks identify specific conditions and pleiotropy influencing multimorbidity in the general population. Sci Rep 2018; 8:15970. [PMID: 30374096 PMCID: PMC6206057 DOI: 10.1038/s41598-018-34361-3] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Accepted: 10/15/2018] [Indexed: 01/16/2023] Open
Abstract
Multimorbidity is an emerging topic in public health policy because of its increasing prevalence and socio-economic impact. However, the age- and gender-dependent trends of disease associations at fine resolution, and the underlying genetic factors, remain incompletely understood. Here, by analyzing disease networks from electronic medical records of primary health care, we identify key conditions and shared genetic factors influencing multimorbidity. Three types of diseases are outlined: "central", which include chronic and non-chronic conditions, have higher cumulative risks of disease associations; "community roots" have lower cumulative risks, but inform on continuing clustered disease associations with age; and "seeds of bursts", which most are chronic, reveal outbreaks of disease associations leading to multimorbidity. The diseases with a major impact on multimorbidity are caused by genes that occupy central positions in the network of human disease genes. Alteration of lipid metabolism connects breast cancer, diabetic neuropathy and nutritional anemia. Evaluation of key disease associations by a genome-wide association study identifies shared genetic factors and further supports causal commonalities between nervous system diseases and nutritional anemias. This study also reveals many shared genetic signals with other diseases. Collectively, our results depict novel population-based multimorbidity patterns, identify key diseases within them, and highlight pleiotropy influencing multimorbidity.
Collapse
Affiliation(s)
- A Amell
- Department of Mathematics, Technical University of Catalonia, Castelldefels, Barcelona, 08860, Catalonia, Spain
| | - A Roso-Llorach
- Jordi Gol University Institute for Research Primary Healthcare (IDIAP Jordi Gol), Barcelona, 08007, Catalonia, Spain
- Autonomous University of Barcelona, Bellaterra, 08193, Catalonia, Spain
| | - L Palomero
- ProCURE, Catalan Institute of Oncology (ICO), Oncobell, Bellvitge Institute for Biomedical Research (IDIBELL), L'Hospitalet del Llobregat, Barcelona, 08908, Catalonia, Spain
| | - D Cuadras
- Statistics Department, Foundation Sant Joan de Déu, Esplugues, 08950, Catalonia, Spain
| | - I Galván-Femenía
- GCAT-Genomes for Life, Germans Trias i Pujol Health Sciences Research Institute (IGTP), Program for Predictive and Personalized Medicine of Cancer (IMPPC), Badalona, 08916, Catalonia, Spain
| | - J Serra-Musach
- ProCURE, Catalan Institute of Oncology (ICO), Oncobell, Bellvitge Institute for Biomedical Research (IDIBELL), L'Hospitalet del Llobregat, Barcelona, 08908, Catalonia, Spain
| | - F Comellas
- Department of Mathematics, Technical University of Catalonia, Castelldefels, Barcelona, 08860, Catalonia, Spain
| | - R de Cid
- GCAT-Genomes for Life, Germans Trias i Pujol Health Sciences Research Institute (IGTP), Program for Predictive and Personalized Medicine of Cancer (IMPPC), Badalona, 08916, Catalonia, Spain.
| | - M A Pujana
- ProCURE, Catalan Institute of Oncology (ICO), Oncobell, Bellvitge Institute for Biomedical Research (IDIBELL), L'Hospitalet del Llobregat, Barcelona, 08908, Catalonia, Spain.
| | - C Violán
- Jordi Gol University Institute for Research Primary Healthcare (IDIAP Jordi Gol), Barcelona, 08007, Catalonia, Spain.
- Autonomous University of Barcelona, Bellaterra, 08193, Catalonia, Spain.
| |
Collapse
|