1
|
Fischer SN, Claussen ER, Kourtis S, Sdelci S, Orchard S, Hermjakob H, Kustatscher G, Drew K. hu.MAP3.0: atlas of human protein complexes by integration of >25,000 proteomic experiments. Mol Syst Biol 2025:10.1038/s44320-025-00121-5. [PMID: 40425816 DOI: 10.1038/s44320-025-00121-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2024] [Revised: 05/07/2025] [Accepted: 05/09/2025] [Indexed: 05/29/2025] Open
Abstract
Macromolecular protein complexes carry out most cellular functions. Unfortunately, we lack the subunit composition for many human protein complexes. To address this gap we integrated >25,000 mass spectrometry experiments using a machine learning approach to identify >15,000 human protein complexes. We show our map of protein complexes is highly accurate and more comprehensive than previous maps, placing nearly 70% of human proteins into their physical contexts. We globally characterize our complexes using mass spectrometry based protein covariation data (ProteomeHD.2) and identify covarying complexes suggesting common functional associations. hu.MAP3.0 generates testable functional hypotheses for 472 uncharacterized proteins which we support using AlphaFold modeling. Additionally, we use AlphaFold modeling to identify 5871 mutually exclusive proteins in hu.MAP3.0 complexes suggesting complexes serve different functional roles depending on their subunit composition. We identify expression as the primary way cells and organisms relieve the conflict of mutually exclusive subunits. Finally, we import our complexes to EMBL-EBI's Complex Portal ( https://www.ebi.ac.uk/complexportal/home ) and provide complexes through our hu.MAP3.0 web interface ( https://humap3.proteincomplexes.org/ ). We expect our resource to be highly impactful to the broader research community.
Collapse
Affiliation(s)
- Samantha N Fischer
- Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL, 60607, USA
| | - Erin R Claussen
- Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL, 60607, USA
| | - Savvas Kourtis
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Sara Sdelci
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Sandra Orchard
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Georg Kustatscher
- Centre for Cell Biology, University of Edinburgh, Edinburgh, EH9 3BF, UK
| | - Kevin Drew
- Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL, 60607, USA.
| |
Collapse
|
2
|
Ajadee A, Mahmud S, Ali MA, Mollah MMH, Ahmmed R, Mollah MNH. In-silico discovery of type-2 diabetes-causing host key genes that are associated with the complexity of monkeypox and repurposing common drugs. Brief Bioinform 2025; 26:bbaf215. [PMID: 40370100 PMCID: PMC12078936 DOI: 10.1093/bib/bbaf215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2024] [Revised: 04/11/2025] [Accepted: 04/21/2025] [Indexed: 05/16/2025] Open
Abstract
Monkeypox (Mpox) is a major global human health threat after COVID-19. Its treatment becomes complicated with type-2 diabetes (T2D). It may happen due to the influence of both disease-causing common host key genes (cHKGs). Therefore, it is necessary to explore both disease-causing cHKGs to reveal their shared pathogenetic mechanisms and candidate drugs as their common treatments without adverse side effect. This study aimed to address these issues. At first, 3 transcriptomics datasets for each of Mpox and 6 T2D datasets were analyzed and found 52 common host differentially expressed genes (cHDEGs) that can separate both T2D and Mpox patients from the control samples. Then top-ranked six cHDEGs (HSP90AA1, B2M, IGF1R, ALD1HA1, ASS1, and HADHA) were detected as the T2D-causing cHKGs that are associated with the complexity of Mpox through the protein-protein interaction network analysis. Then common pathogenetic processes between T2D and Mpox were disclosed by cHKG-set enrichment analysis with biological processes, molecular functions, cellular components and Kyoto Encyclopedia of Genes and Genomes pathways, and regulatory network analysis with transcription factors and microRNAs. Finally, cHKG-guided top-ranked three drug molecules (tecovirimat, vindoline, and brincidofovir) were recommended as the repurposable common therapeutic agents for both Mpox and T2D by molecular docking. The absorption, distribution, metabolism, excretion, and toxicity and drug-likeness analysis of these drug molecules indicated their good pharmacokinetics properties. The 100-ns molecular dynamics simulation results (root mean square deviation, root mean square fluctuation, and molecular mechanics generalized born surface area) with the top-ranked three complexes ASS1-tecovirimat, ALDH1A1-vindoline, and B2M-brincidofovir exhibited good pharmacodynamics properties. Therefore, the results provided in this article might be important resources for diagnosis and therapies of Mpox patients who are also suffering from T2D.
Collapse
Affiliation(s)
- Alvira Ajadee
- Bioinformatics Lab (Dry), Department of Statistics, University of Rajshahi, Rajshahi 6205, Bangladesh
| | - Sabkat Mahmud
- Bioinformatics Lab (Dry), Department of Statistics, University of Rajshahi, Rajshahi 6205, Bangladesh
| | - Md Ahad Ali
- Bioinformatics Lab (Dry), Department of Statistics, University of Rajshahi, Rajshahi 6205, Bangladesh
- Department of Chemistry, University of Rajshahi, Rajshahi 6205, Bangladesh
| | - Md Manir Hossain Mollah
- Department of Physical Sciences, Independent University Bangladesh, Bashundhara Residential Area, Dhaka 1245, Bangladesh
| | - Reaz Ahmmed
- Bioinformatics Lab (Dry), Department of Statistics, University of Rajshahi, Rajshahi 6205, Bangladesh
- Department of Biochemistry and Molecular Biology, University of Rajshahi, Rajshahi 6205, Bangladesh
| | - Md Nurul Haque Mollah
- Bioinformatics Lab (Dry), Department of Statistics, University of Rajshahi, Rajshahi 6205, Bangladesh
| |
Collapse
|
3
|
Apostolakou AE, Douska DE, Litou ZI, Trougakos IP, Iconomidou VA. Co-Deposited Proteins in Alzheimer's Disease as a Potential Treasure Trove for Drug Repurposing. Molecules 2025; 30:1736. [PMID: 40333680 PMCID: PMC12029215 DOI: 10.3390/molecules30081736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2025] [Revised: 03/30/2025] [Accepted: 04/10/2025] [Indexed: 05/09/2025] Open
Abstract
Alzheimer's disease (AD) affects an increasing number of people as the human population ages. The main pathological feature of AD, amyloid plaques, consists of the key protein amyloid-β and other co-deposited proteins. These co-deposited proteins and their protein interactors could hold some additional functional insights into AD pathophysiology. For this work, proteins found on amyloid plaques were collected from the AmyCo database. A protein-protein and protein-drug interaction network was constructed with data from the IntAct and DrugBank databases, respectively. In total, there were 12 proteins co-deposited on amyloid plaques that reportedly interact with 513 other proteins and are targets of 72 drugs. These drugs were shown to be almost entirely distinct from the panel of drugs currently approved by the FDA for AD and their corresponding protein targets. In conclusion, this work demonstrates the potential for drug repurposing of drugs that target proteins found in amyloid plaques.
Collapse
Affiliation(s)
| | | | | | | | - Vassiliki A. Iconomidou
- Section of Cell Biology and Biophysics, Department of Biology, School of Science, National and Kapodistrian University of Athens, Panepistimiopolis, 157 01 Athens, Greece; (A.E.A.); (Z.I.L.); (I.P.T.)
| |
Collapse
|
4
|
Orchard SE. What have Data Standards ever done for us? Mol Cell Proteomics 2025:100933. [PMID: 40024375 DOI: 10.1016/j.mcpro.2025.100933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2024] [Revised: 02/21/2025] [Accepted: 02/24/2025] [Indexed: 03/04/2025] Open
Abstract
The Human Proteome Organization (HUPO) Proteomics Standards Initiative (PSI) has been successfully developing guidelines, data formats, and controlled vocabularies for both the field of molecular interaction and that of mass spectrometry for more than 20 years. This review explores some of the ways that the proteomics community has benefitted from the development of community standards and takes a look at some of the tools and resources that have been improved or developed as a result of the work of the HUPO-PSI.
Collapse
Affiliation(s)
- S E Orchard
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| |
Collapse
|
5
|
Chowdhury S, Fong SS, Uetz P. The protein interactome of Escherichia coli carbohydrate metabolism. PLoS One 2025; 20:e0315240. [PMID: 39903745 PMCID: PMC11793828 DOI: 10.1371/journal.pone.0315240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Accepted: 11/21/2024] [Indexed: 02/06/2025] Open
Abstract
We investigate how protein-protein interactions (PPIs) can regulate carbohydrate metabolism in Escherichia coli. We specifically investigated the stoichiometry of 378 PPIs involving carbohydrate metabolic enzymes. In 48 interactions, the interactors were much more abundant than the enzyme and are thus likely to affect enzyme activity and carbohydrate metabolism. Many of these PPIs are conserved across thousands of bacteria including pathogens and microbial species. E. coli adapts to different cellular environments by adjusting the quantities of the interacting proteins (25 PPIs) in a way that the protein-enzyme interaction (PEI) is a likely mechanism to regulate its metabolism in specific environments. We predict 3 PPIs (RpsB-AdhE, DcyD-NanE and MinE-Yccx) previously not known to regulate metabolism.
Collapse
Affiliation(s)
- Shomeek Chowdhury
- Center for Integrative Life Sciences Education, Virginia Commonwealth University, Richmond, VA, United States of America
| | - Stephen S. Fong
- Center for Integrative Life Sciences Education, Virginia Commonwealth University, Richmond, VA, United States of America
| | - Peter Uetz
- Center for Biological Data Science, School of Life Sciences, Virginia Commonwealth University, Richmond, VA, United States of America
| |
Collapse
|
6
|
Smith JR, Tutaj MA, Thota J, Lamers L, Gibson AC, Kundurthi A, Gollapally VR, Brodie KC, Zacher S, Laulederkind SJF, Hayman GT, Wang SJ, Tutaj M, Kaldunski ML, Vedi M, Demos WM, De Pons JL, Dwinell MR, Kwitek AE. Standardized pipelines support and facilitate integration of diverse datasets at the Rat Genome Database. Database (Oxford) 2025; 2025:baae132. [PMID: 39841812 PMCID: PMC11753291 DOI: 10.1093/database/baae132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Revised: 11/01/2024] [Accepted: 12/30/2024] [Indexed: 01/24/2025]
Abstract
The Rat Genome Database (RGD) is a multispecies knowledgebase which integrates genetic, multiomic, phenotypic, and disease data across 10 mammalian species. To support cross-species, multiomics studies and to enhance and expand on data manually extracted from the biomedical literature by the RGD team of expert curators, RGD imports and integrates data from multiple sources. These include major databases and a substantial number of domain-specific resources, as well as direct submissions by individual researchers. The incorporation of these diverse datatypes is handled by a growing list of automated import, export, data processing, and quality control pipelines. This article outlines the development over time of a standardized infrastructure for automated RGD pipelines with a summary of key design decisions and a focus on lessons learned.
Collapse
Affiliation(s)
- Jennifer R Smith
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Marek A Tutaj
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Jyothi Thota
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Logan Lamers
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Adam C Gibson
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Akhilanand Kundurthi
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Varun Reddy Gollapally
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Kent C Brodie
- Clinical and Translational Science Institute, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Stacy Zacher
- Finance and Administration, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Stanley J F Laulederkind
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - G Thomas Hayman
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Shur-Jen Wang
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Monika Tutaj
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Mary L Kaldunski
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Mahima Vedi
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Wendy M Demos
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Jeffrey L De Pons
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Melinda R Dwinell
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| | - Anne E Kwitek
- Rat Genome Database, Department of Physiology, Medical College of Wisconsin, 8701 Watertown Plank Rd, Milwaukee, WI 53226, United States
| |
Collapse
|
7
|
Kiouri DP, Batsis GC, Chasapis CT. Structure-Based Approaches for Protein-Protein Interaction Prediction Using Machine Learning and Deep Learning. Biomolecules 2025; 15:141. [PMID: 39858535 PMCID: PMC11763140 DOI: 10.3390/biom15010141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2024] [Revised: 01/11/2025] [Accepted: 01/14/2025] [Indexed: 01/27/2025] Open
Abstract
Protein-Protein Interaction (PPI) prediction plays a pivotal role in understanding cellular processes and uncovering molecular mechanisms underlying health and disease. Structure-based PPI prediction has emerged as a robust alternative to sequence-based methods, offering greater biological accuracy by integrating three-dimensional spatial and biochemical features. This work summarizes the recent advances in computational approaches leveraging protein structure information for PPI prediction, focusing on machine learning (ML) and deep learning (DL) techniques. These methods not only improve predictive accuracy but also provide insights into functional sites, such as binding and catalytic residues. However, challenges such as limited high-resolution structural data and the need for effective negative sampling persist. Through the integration of experimental and computational tools, structure-based prediction paves the way for comprehensive proteomic network analysis, holding promise for advancements in drug discovery, biomarker identification, and personalized medicine. Future directions include enhancing scalability and dataset reliability to expand these approaches across diverse proteomes.
Collapse
Affiliation(s)
- Despoina P. Kiouri
- Institute of Chemical Biology, National Hellenic Research Foundation, 11635 Athens, Greece; (D.P.K.); (G.C.B.)
- Laboratory of Organic Chemistry, Department of Chemistry, National and Kapodistrian University of Athens, 15772 Athens, Greece
| | - Georgios C. Batsis
- Institute of Chemical Biology, National Hellenic Research Foundation, 11635 Athens, Greece; (D.P.K.); (G.C.B.)
| | - Christos T. Chasapis
- Institute of Chemical Biology, National Hellenic Research Foundation, 11635 Athens, Greece; (D.P.K.); (G.C.B.)
| |
Collapse
|
8
|
Panni S, Pizzolotto R. Integrated Analysis of microRNA Targets Reveals New Insights into Transcriptional-Post-Transcriptional Regulatory Cross-Talk. BIOLOGY 2025; 14:43. [PMID: 39857274 PMCID: PMC11762646 DOI: 10.3390/biology14010043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2024] [Revised: 01/02/2025] [Accepted: 01/06/2025] [Indexed: 01/27/2025]
Abstract
It is becoming increasingly clear that microRNAs are key players in gene regulatory networks, modulating gene expression at post-transcriptional level. Their involvement in almost all cellular processes predicts their role in diseases, and several microRNA-based therapeutics are currently undergoing clinical testing. Despite their undeniable relevance and the substantial body of literature demonstrating their role in cancer and other pathologies, the identification of functional interactions is still challenging. To address this issue, several resources have been developed to collect information from the literature, according to different criteria and reliability scores. In the present study, we have constructed a network of verified microRNA-mRNA interactions by integrating strong-evidence couples from different resources. Our analysis of the resulting network reveals that only one-fifth of the human genes exhibits experimental validated regulation by microRNAs. A very small subset of them is controlled by more than 20 microRNAs, and these hubs are highly enriched of pivotal transcription factors and regulatory proteins, strongly suggesting a complex interplay and a combinatorial effect between transcriptional and post-transcriptional gene control. Data analysis also reveals that several microRNAs control multiple targets involved in the same pathway or biological process, likely contributing to the coordinated control of the protein levels.
Collapse
Affiliation(s)
- Simona Panni
- Dipartimento di Biologia Ecologia Scienze della Terra (DiBEST), Università della Calabria, 87036 Rende, CS, Italy;
| | | |
Collapse
|
9
|
Szklarczyk D, Nastou K, Koutrouli M, Kirsch R, Mehryary F, Hachilif R, Hu D, Peluso ME, Huang Q, Fang T, Doncheva NT, Pyysalo S, Bork P, Jensen LJ, von Mering C. The STRING database in 2025: protein networks with directionality of regulation. Nucleic Acids Res 2025; 53:D730-D737. [PMID: 39558183 PMCID: PMC11701646 DOI: 10.1093/nar/gkae1113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2024] [Revised: 10/18/2024] [Accepted: 10/29/2024] [Indexed: 11/20/2024] Open
Abstract
Proteins cooperate, regulate and bind each other to achieve their functions. Understanding the complex network of their interactions is essential for a systems-level description of cellular processes. The STRING database compiles, scores and integrates protein-protein association information drawn from experimental assays, computational predictions and prior knowledge. Its goal is to create comprehensive and objective global networks that encompass both physical and functional interactions. Additionally, STRING provides supplementary tools such as network clustering and pathway enrichment analysis. The latest version, STRING 12.5, introduces a new 'regulatory network', for which it gathers evidence on the type and directionality of interactions using curated pathway databases and a fine-tuned language model parsing the literature. This update enables users to visualize and access three distinct network types-functional, physical and regulatory-separately, each applicable to distinct research needs. In addition, the pathway enrichment detection functionality has been updated, with better false discovery rate corrections, redundancy filtering and improved visual displays. The resource now also offers improved annotations of clustered networks and provides users with downloadable network embeddings, which facilitate the use of STRING networks in machine learning and allow cross-species transfer of protein information. The STRING database is available online at https://string-db.org/.
Collapse
Affiliation(s)
- Damian Szklarczyk
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
- SIB Swiss Institute of Bioinformatics, Amphipôle, Quartier UNIL-Sorge, 1015 Lausanne, Switzerland
| | - Katerina Nastou
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen N, Denmark
| | - Mikaela Koutrouli
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen N, Denmark
| | - Rebecca Kirsch
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen N, Denmark
| | - Farrokh Mehryary
- TurkuNLP Lab, Department of Computing, University of Turku, Vesilinnantie 5, 20014 Turku, Finland
| | - Radja Hachilif
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
- SIB Swiss Institute of Bioinformatics, Amphipôle, Quartier UNIL-Sorge, 1015 Lausanne, Switzerland
| | - Dewei Hu
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen N, Denmark
| | - Matteo E Peluso
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
- SIB Swiss Institute of Bioinformatics, Amphipôle, Quartier UNIL-Sorge, 1015 Lausanne, Switzerland
| | - Qingyao Huang
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
- SIB Swiss Institute of Bioinformatics, Amphipôle, Quartier UNIL-Sorge, 1015 Lausanne, Switzerland
| | - Tao Fang
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
- SIB Swiss Institute of Bioinformatics, Amphipôle, Quartier UNIL-Sorge, 1015 Lausanne, Switzerland
| | - Nadezhda T Doncheva
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen N, Denmark
| | - Sampo Pyysalo
- TurkuNLP Lab, Department of Computing, University of Turku, Vesilinnantie 5, 20014 Turku, Finland
| | - Peer Bork
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg, Germany
- Max Delbrück Centre for Molecular Medicine, Robert-Rössle-Strasse 10, 13125 Berlin, Germany
- Department of Bioinformatics, Biozentrum, University of Würzburg, Am Hubland, 97074 Würzburg, Germany
| | - Lars J Jensen
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen N, Denmark
| | - Christian von Mering
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
- SIB Swiss Institute of Bioinformatics, Amphipôle, Quartier UNIL-Sorge, 1015 Lausanne, Switzerland
| |
Collapse
|
10
|
Balu S, Huget S, Medina Reyes JJ, Ragueneau E, Panneerselvam K, Fischer SN, Claussen ER, Kourtis S, Combe C, Meldal BHM, Perfetto L, Rappsilber J, Kustatscher G, Drew K, Orchard S, Hermjakob H. Complex portal 2025: predicted human complexes and enhanced visualisation tools for the comparison of orthologous and paralogous complexes. Nucleic Acids Res 2025; 53:D644-D650. [PMID: 39558156 PMCID: PMC11701666 DOI: 10.1093/nar/gkae1085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2024] [Revised: 10/16/2024] [Accepted: 10/24/2024] [Indexed: 11/20/2024] Open
Abstract
The Complex Portal (www.ebi.ac.uk/complexportal) is a manually curated reference database for molecular complexes. It is a unifying web resource linking aggregated data on composition, topology and the function of macromolecular complexes from 28 species. In addition to significantly extending the number of manually curated complexes, we have massively extended the coverage of the human complexome through the incorporation of high confidence assemblies predicted by machine-learning algorithms trained on large-scale experimental data. The current content of the portal comprising 2150 human complexes has been augmented by 14 964 machine-learning (ML) predicted complexes from hu.MAP3.0. We have refactored the website to enable easy search and filtering of these different classes of protein complexes and have implemented the Complex Navigator, a visualisation tool to facilitate comparison of related complexes in the context of orthology or paralogy. We have embedded the Rhea reaction visualisation tool into the website to enable users to view the catalytic activity of enzyme complexes.
Collapse
Affiliation(s)
- Sucharitha Balu
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Susie Huget
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Juan Jose Medina Reyes
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Eliot Ragueneau
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Kalpana Panneerselvam
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Samantha N Fischer
- Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL 60607, USA
| | - Erin R Claussen
- Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL 60607, USA
| | | | - Colin W Combe
- Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh EH9 3BF, UK
| | | | - Livia Perfetto
- University of Rome La Sapienza, department of Biology and Biotechnologies “C. Darwin”, Rome, Italy
| | - Juri Rappsilber
- Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL 60607, USA
- Technische Universität Berlin, Chair of Bioanalytics, 10623 Berlin, Germany
| | - Georg Kustatscher
- Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh EH9 3BF, UK
| | - Kevin Drew
- Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL 60607, USA
| | - Sandra Orchard
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
11
|
Samarasinghe K, Kotlyar M, Vallet S, Hayes C, Naba A, Jurisica I, Lisacek F, Ricard-Blum S. MatrixDB 2024: an increased coverage of extracellular matrix interactions, a new Network Explorer and a new web interface. Nucleic Acids Res 2025; 53:D1677-D1682. [PMID: 39558161 PMCID: PMC11701626 DOI: 10.1093/nar/gkae1088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2024] [Revised: 10/23/2024] [Accepted: 10/27/2024] [Indexed: 11/20/2024] Open
Abstract
MatrixDB, a member of the International Molecular Exchange consortium (IMEx), is a curated interaction database focused on interactions established by extracellular matrix (ECM) constituents including proteins, proteoglycans, glycosaminoglycans and ECM bioactive fragments. The architecture of MatrixDB was upgraded to ease interaction data export, allow versioning and programmatic access and ensure sustainability. The new version of the database includes more than twice the number of manually curated and experimentally-supported interactions. High-confidence predicted interactions were imported from the Integrated Interactions Database to increase the coverage of the ECM interactome. ECM and ECM-associated proteins of five species (human, murine, bovine, avian and zebrafish) were annotated with matrisome divisions and categories, which are used for computational analyses of ECM -omic datasets. Biological pathways from the Reactome Pathway Knowledgebase were also added to the biomolecule description. New transcriptomic and expanded proteomic datasets were imported in MatrixDB to generate cell- and tissue-specific ECM networks using the newly developed in-house Network Explorer integrated in the database. MatrixDB is freely available at https://matrixdb.univ-lyon1.fr.
Collapse
Affiliation(s)
| | - Max Kotlyar
- Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute and Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, University Health Network, Toronto, ON M5T 0S8, Canada
| | - Sylvain D Vallet
- Institut de Biologie Structurale, UMR 5075, CEA, CNRS, Université Grenoble Alpes, Grenoble 38000, France
| | | | - Alexandra Naba
- Department of Physiology and Biophysics, University of Illinois Chicago, Chicago, IL 60612, USA
| | - Igor Jurisica
- Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute and Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, University Health Network, Toronto, ON M5T 0S8, Canada
- Departments of Medical Biophysics and Computer Science, and the Faculty of Dentistry, University of Toronto, Toronto, Ontario, Canada
- Institute of Neuroimmunology, Slovak Academy of Sciences, Bratislava, Slovakia
| | | | - Sylvie Ricard-Blum
- Institut de Chimie et Biochimie Moléculaires et Supramoléculaires (ICBMS), UMR 5246, CNRS, Université Lyon 1, Villeurbanne 69622, France
| |
Collapse
|
12
|
Coelho NM, Riahi P, Wang Y, Ali A, Norouzi M, Kotlyar M, Jurisica I, McCulloch CA. The major vault protein integrates adhesion-driven signals to regulate collagen remodeling. Cell Signal 2024; 124:111447. [PMID: 39368789 DOI: 10.1016/j.cellsig.2024.111447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2024] [Revised: 09/21/2024] [Accepted: 09/30/2024] [Indexed: 10/07/2024]
Abstract
DDR1 interacts with fibrillar collagen and can affect β1 integrin-dependent signaling, but the mechanism that mediates functional interactions between these two different receptors is not defined. We searched for molecules that link DDR1 and β1 integrin-dependent signaling in response to collagen binding. The activation of DDR1 by binding to fibrillar collagen reduced by 5-fold, β1 integrin-dependent ERK phosphorylation that leads to MMP1 expression. In contrast, pharmacological inhibition of DDR1 or culturing cells on fibronectin restored ERK phosphorylation and MMP1 expression mediated by the β1 integrin. A phospho-site screen indicated that collagen-induced DDR1 activation inhibited β1 integrin-dependent ERK signaling by regulating autophosphorylation of focal adhesion kinase (FAK). Immunoprecipitation, mass spectrometry, and protein-protein interaction mapping showed that while DDR1 and FAK do not interact directly, the major vault protein (MVP) binds DDR1 and FAK depending on the substrate. MVP associated with DDR1 in cells expressing β1 integrin that were cultured on collagen. Knockdown of MVP restored ERK activation and MMP1 expression in DDR1-expressing cells cultured on collagen. Immunostaining of invasive cancers in human colon showed colocalization of DDR1 with MVP. These data indicate that MVP interactions with DDR1 and FAK contribute to the regulation of β1 integrin-dependent signaling pathways that drive collagen degradation.
Collapse
Affiliation(s)
- Nuno M Coelho
- Faculty of Dentistry, University of Toronto, Toronto, ON, Canada
| | - Pardis Riahi
- Faculty of Dentistry, University of Toronto, Toronto, ON, Canada
| | - Yongqiang Wang
- Faculty of Dentistry, University of Toronto, Toronto, ON, Canada
| | - Aiman Ali
- Faculty of Dentistry, University of Toronto, Toronto, ON, Canada
| | - Masoud Norouzi
- Faculty of Dentistry, University of Toronto, Toronto, ON, Canada
| | - Max Kotlyar
- Osteoarthritis Research Program, Schroeder Arthritis Institute and Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, UHN, Toronto, ON, Canada
| | - Igor Jurisica
- Faculty of Dentistry, University of Toronto, Toronto, ON, Canada; Osteoarthritis Research Program, Schroeder Arthritis Institute and Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, UHN, Toronto, ON, Canada; Departments of Medical Biophysics and Computer Science, University of Toronto, ON, Canada
| | | |
Collapse
|
13
|
Fischer SN, Claussen ER, Kourtis S, Sdelci S, Orchard S, Hermjakob H, Kustatscher G, Drew K. hu.MAP3.0: Atlas of human protein complexes by integration of > 25,000 proteomic experiments. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.11.617930. [PMID: 39464102 PMCID: PMC11507723 DOI: 10.1101/2024.10.11.617930] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/29/2024]
Abstract
Macromolecular protein complexes carry out most functions in the cell including essential functions required for cell survival. Unfortunately, we lack the subunit composition for all human protein complexes. To address this gap we integrated >25,000 mass spectrometry experiments using a machine learning approach to identify > 15,000 human protein complexes. We show our map of protein complexes is highly accurate and more comprehensive than previous maps, placing ~75% of human proteins into their physical contexts. We globally characterize our complexes using protein co-variation data (ProteomeHD.2) and identify co-varying complexes suggesting common functional associations. Our map also generates testable functional hypotheses for 472 uncharacterized proteins which we support using AlphaFold modeling. Additionally, we use AlphaFold modeling to identify 511 mutually exclusive protein pairs in hu.MAP3.0 complexes suggesting complexes serve different functional roles depending on their subunit composition. We identify expression as the primary way cells and organisms relieve the conflict of mutually exclusive subunits. Finally, we import our complexes to EMBL-EBI's Complex Portal (https://www.ebi.ac.uk/complexportal/home) as well as provide complexes through our hu.MAP3.0 web interface (https://humap3.proteincomplexes.org/). We expect our resource to be highly impactful to the broader research community.
Collapse
Affiliation(s)
- Samantha N. Fischer
- Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL 60607
| | - Erin R. Claussen
- Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL 60607
| | - Savvas Kourtis
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Sara Sdelci
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Sandra Orchard
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Georg Kustatscher
- Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh EH9 3BF, UK
| | - Kevin Drew
- Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL 60607
| |
Collapse
|
14
|
Martinez K, Agirre J, Akune Y, Aoki-Kinoshita KF, Arighi C, Axelsen KB, Bolton E, Bordeleau E, Edwards NJ, Fadda E, Feizi T, Hayes C, Ives CM, Joshi HJ, Krishna Prasad K, Kossida S, Lisacek F, Liu Y, Lütteke T, Ma J, Malik A, Martin M, Mehta AY, Neelamegham S, Panneerselvam K, Ranzinger R, Ricard-Blum S, Sanou G, Shanker V, Thomas PD, Tiemeyer M, Urban J, Vita R, Vora J, Yamamoto Y, Mazumder R. Functional implications of glycans and their curation: insights from the workshop held at the 16th Annual International Biocuration Conference in Padua, Italy. Database (Oxford) 2024; 2024:baae073. [PMID: 39137905 PMCID: PMC11321244 DOI: 10.1093/database/baae073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Revised: 06/24/2024] [Accepted: 07/10/2024] [Indexed: 08/15/2024]
Abstract
Dynamic changes in protein glycosylation impact human health and disease progression. However, current resources that capture disease and phenotype information focus primarily on the macromolecules within the central dogma of molecular biology (DNA, RNA, proteins). To gain a better understanding of organisms, there is a need to capture the functional impact of glycans and glycosylation on biological processes. A workshop titled "Functional impact of glycans and their curation" was held in conjunction with the 16th Annual International Biocuration Conference to discuss ongoing worldwide activities related to glycan function curation. This workshop brought together subject matter experts, tool developers, and biocurators from over 20 projects and bioinformatics resources. Participants discussed four key topics for each of their resources: (i) how they curate glycan function-related data from publications and other sources, (ii) what type of data they would like to acquire, (iii) what data they currently have, and (iv) what standards they use. Their answers contributed input that provided a comprehensive overview of state-of-the-art glycan function curation and annotations. This report summarizes the outcome of discussions, including potential solutions and areas where curators, data wranglers, and text mining experts can collaborate to address current gaps in glycan and glycosylation annotations, leveraging each other's work to improve their respective resources and encourage impactful data sharing among resources. Database URL: https://wiki.glygen.org/Glycan_Function_Workshop_2023.
Collapse
Affiliation(s)
- Karina Martinez
- Department of Biochemistry & Molecular Medicine, The George Washington University School of Medicine and Health Sciences, 2300 I St. NW, Washington, DC 20052, United States
| | - Jon Agirre
- York Structural Biology Laboratory, Department of Chemistry, University of York, Wentworth Way, York YO10 5DD, United Kingdom
| | - Yukie Akune
- The Glycosciences Laboratory, Imperial College London, Hammersmith Campus, Du Cane Road, London W12 0NN, United Kingdom
| | - Kiyoko F Aoki-Kinoshita
- Glycan and Life Systems Integration Center (GaLSIC), Soka University, 1-236 Tangi-machi, Hachioji, Tokyo 192-8577, Japan
| | - Cecilia Arighi
- Department of Computer and Information Sciences, University of Delaware, 18 Amstel Ave, Newark, DE 19716, United States
| | - Kristian B Axelsen
- Swiss-Prot Group, Swiss Institute of Bioinformatics (SIB), CMU, 1 rue Michel Servet, Geneva 4 1211, Switzerland
| | - Evan Bolton
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, United States
| | - Emily Bordeleau
- Michael Smith Laboratories, The University of British Columbia, 2185 East Mall, Vancouver, British Columbia V6T 1Z4, Canada
| | - Nathan J Edwards
- Department of Biochemistry and Molecular & Cellular Biology, Georgetown University, 2115 Wisconsin Ave NW, Washington, DC 20007, United States
| | - Elisa Fadda
- Department of Chemistry and Hamilton Institute, Maynooth University, Kilcock Road, Maynooth, Co. Kildare W23 AH3Y, Ireland
| | - Ten Feizi
- The Glycosciences Laboratory, Imperial College London, Hammersmith Campus, Du Cane Road, London W12 0NN, United Kingdom
| | - Catherine Hayes
- Proteome Informatics Group, Swiss Institute of Bioinformatics (SIB), route de Drize 7, Geneva CH-1227, Switzerland
| | - Callum M Ives
- Department of Chemistry and Hamilton Institute, Maynooth University, Kilcock Road, Maynooth, Co. Kildare W23 AH3Y, Ireland
| | - Hiren J Joshi
- Copenhagen Center for Glycomics, Department of Cellular and Molecular Medicine, Faculty of Health Sciences, University of Copenhagen, Blegdamsvej 3, Copenhagen DK-2200, Denmark
| | - Khakurel Krishna Prasad
- ELI Beamlines Facility, The Extreme Light Infrastructure ERIC, Za Radnicí 835, Dolní Břežany 25241, Czech Republic
| | - Sofia Kossida
- IMGT, The International ImMunoGeneTics Information System, National Center for Scientific Research (CNRS), Institute of Human Genetics (IGH), University of Montpellier (UM), 141 rue de la Cardonille, Montpellier 34 090, France
| | - Frederique Lisacek
- Proteome Informatics Group, Swiss Institute of Bioinformatics (SIB), route de Drize 7, Geneva CH-1227, Switzerland
| | - Yan Liu
- The Glycosciences Laboratory, Imperial College London, Hammersmith Campus, Du Cane Road, London W12 0NN, United Kingdom
| | - Thomas Lütteke
- Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Gießen, Frankfurter Str. 100, Gießen 35392, Germany
| | - Junfeng Ma
- Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University Medical Center, 3900 Reservior Road NW, Washington, DC 20007, United States
| | - Adnan Malik
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Maria Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Akul Y Mehta
- Department of Surgery, Beth Israel Deaconess Medical Center, National Center for Functional Glycomics, Harvard Medical School, 330 Brookline Avenue, Boston, MA 02215, United States
| | - Sriram Neelamegham
- Departments of Chemical & Biological Engineering, Biomedical Engineering and Medicine, University at Buffalo, State University of New York, 906 Furnas Hall, Buffalo, NY 14260, United States
| | - Kalpana Panneerselvam
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - René Ranzinger
- Complex Carbohydrate Research Center, University of Georgia, 315 Riverbend Rd, Athens, GA 30602, United States
| | - Sylvie Ricard-Blum
- Institute of Molecular and Supramolecular Chemistry and Biochemistry (ICBMS), UMR 5246, University Lyon 1, CNRS, 43 Boulevard du 11 novembre 1918, Villeurbanne cedex F-69622, France
| | - Gaoussou Sanou
- IMGT, The International ImMunoGeneTics Information System, National Center for Scientific Research (CNRS), Institute of Human Genetics (IGH), University of Montpellier (UM), 141 rue de la Cardonille, Montpellier 34 090, France
| | - Vijay Shanker
- Department of Computer and Information Sciences, University of Delaware, 18 Amstel Ave, Newark, DE 19716, United States
| | - Paul D Thomas
- Department of Population and Public Health Sciences, University of Southern California, 2001 N Soto Street, Los Angeles, CA 90032, United States
| | - Michael Tiemeyer
- Complex Carbohydrate Research Center, University of Georgia, 315 Riverbend Rd, Athens, GA 30602, United States
| | - James Urban
- Department of Chemistry and Molecular Biology, University of Gothenburg, Medicinaregatan 7 B, Gothenburg 41390, Sweden
| | - Randi Vita
- Immune Epitope Database and Analysis Project, La Jolla Institute for Allergy & Immunology, 9420 Athena Circle, La Jolla, CA 92037, United States
| | - Jeet Vora
- Department of Biochemistry & Molecular Medicine, The George Washington University School of Medicine and Health Sciences, 2300 I St. NW, Washington, DC 20052, United States
| | - Yasunori Yamamoto
- Database Center for Life Science, Joint Support-Center for Data Science Research, Research Organization of Information and Systems, 178-4-4 Wakashiba, Kashiwa, Chiba 277-0871, Japan
| | - Raja Mazumder
- Department of Biochemistry & Molecular Medicine, The George Washington University School of Medicine and Health Sciences, 2300 I St. NW, Washington, DC 20052, United States
| |
Collapse
|
15
|
Ahsan T, Shoily SS, Ahmed T, Sajib AA. Role of the redox state of the Pirin-bound cofactor on interaction with the master regulators of inflammation and other pathways. PLoS One 2023; 18:e0289158. [PMID: 38033031 PMCID: PMC10688961 DOI: 10.1371/journal.pone.0289158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Accepted: 07/10/2023] [Indexed: 12/02/2023] Open
Abstract
Persistent cellular stress induced perpetuation and uncontrolled amplification of inflammatory response results in a shift from tissue repair toward collateral damage, significant alterations of tissue functions, and derangements of homeostasis which in turn can lead to a large number of acute and chronic pathological conditions, such as chronic heart failure, atherosclerosis, myocardial infarction, neurodegenerative diseases, diabetes, rheumatoid arthritis, and cancer. Keeping the vital role of balanced inflammation in maintaining tissue integrity in mind, the way to combating inflammatory diseases may be through identification and characterization of mediators of inflammation that can be targeted without hampering normal body function. Pirin (PIR) is a non-heme iron containing protein having two different conformations depending on the oxidation state of the iron. Through exploration of the Pirin interactome and using molecular docking approaches, we identified that the Fe2+-bound Pirin directly interacts with BCL3, NFKBIA, NFIX and SMAD9 with more resemblance to the native binding pose and higher affinity than the Fe3+-bound form. In addition, Pirin appears to have a function in the regulation of inflammation, the transition between the canonical and non-canonical NF-κB pathways, and the remodeling of the actin cytoskeleton. Moreover, Pirin signaling appears to have a critical role in tumor invasion and metastasis, as well as metabolic and neuro-pathological complications. There are regulatory variants in PIR that can influence expression of not only PIR but also other genes, including VEGFD and ACE2. Disparity exists between South Asian and European populations in the frequencies of variant alleles at some of these regulatory loci that may lead to differential occurrence of Pirin-mediated pathogenic conditions.
Collapse
Affiliation(s)
- Tamim Ahsan
- Molecular Biotechnology Division, National Institute of Biotechnology, Savar, Dhaka, Bangladesh
| | - Sabrina Samad Shoily
- Department of Genetic Engineering & Biotechnology, University of Dhaka, Dhaka, Bangladesh
| | - Tasnim Ahmed
- Department of Genetic Engineering & Biotechnology, University of Dhaka, Dhaka, Bangladesh
| | - Abu Ashfaqur Sajib
- Department of Genetic Engineering & Biotechnology, University of Dhaka, Dhaka, Bangladesh
| |
Collapse
|
16
|
Brixi G, Ye T, Hong L, Wang T, Monticello C, Lopez-Barbosa N, Vincoff S, Yudistyra V, Zhao L, Haarer E, Chen T, Pertsemlidis S, Palepu K, Bhat S, Christopher J, Li X, Liu T, Zhang S, Petersen L, DeLisa MP, Chatterjee P. SaLT&PepPr is an interface-predicting language model for designing peptide-guided protein degraders. Commun Biol 2023; 6:1081. [PMID: 37875551 PMCID: PMC10598214 DOI: 10.1038/s42003-023-05464-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 10/13/2023] [Indexed: 10/26/2023] Open
Abstract
Protein-protein interactions (PPIs) are critical for biological processes and predicting the sites of these interactions is useful for both computational and experimental applications. We present a Structure-agnostic Language Transformer and Peptide Prioritization (SaLT&PepPr) pipeline to predict interaction interfaces from a protein sequence alone for the subsequent generation of peptidic binding motifs. Our model fine-tunes the ESM-2 protein language model (pLM) with a per-position prediction task to identify PPI sites using data from the PDB, and prioritizes motifs which are most likely to be involved within inter-chain binding. By only using amino acid sequence as input, our model is competitive with structural homology-based methods, but exhibits reduced performance compared with deep learning models that input both structural and sequence features. Inspired by our previous results using co-crystals to engineer target-binding "guide" peptides, we curate PPI databases to identify partners for subsequent peptide derivation. Fusing guide peptides to an E3 ubiquitin ligase domain, we demonstrate degradation of endogenous β-catenin, 4E-BP2, and TRIM8, and highlight the nanomolar binding affinity, low off-targeting propensity, and function-altering capability of our best-performing degraders in cancer cells. In total, our study suggests that prioritizing binders from natural interactions via pLMs can enable programmable protein targeting and modulation.
Collapse
Affiliation(s)
- Garyk Brixi
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | - Tianzheng Ye
- Robert F. Smith School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY, USA
| | - Lauren Hong
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | - Tian Wang
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | - Connor Monticello
- Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY, USA
| | - Natalia Lopez-Barbosa
- Robert F. Smith School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY, USA
| | - Sophia Vincoff
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | - Vivian Yudistyra
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | - Lin Zhao
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | - Elena Haarer
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | - Tianlai Chen
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | | | - Kalyan Palepu
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | - Suhaas Bhat
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | | | - Xinning Li
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | - Tong Liu
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | - Sue Zhang
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | - Lillian Petersen
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | - Matthew P DeLisa
- Robert F. Smith School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY, USA
- Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY, USA
- Cornell Institute of Biotechnology, Cornell University, Ithaca, NY, USA
| | - Pranam Chatterjee
- Department of Biomedical Engineering, Duke University, Durham, NC, USA.
- Department of Computer Science, Duke University, Durham, NC, USA.
- Department of Biostatistics and Bioinformatics, Duke University, Durham, NC, USA.
| |
Collapse
|
17
|
Panni S, Panneerselvam K, Porras P, Duesbury M, Perfetto L, Licata L, Hermjakob H, Orchard S. The landscape of microRNA interaction annotation: analysis of three rare disorders as a case study. Database (Oxford) 2023; 2023:baad066. [PMID: 37819683 PMCID: PMC10566539 DOI: 10.1093/database/baad066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 08/29/2023] [Accepted: 09/15/2023] [Indexed: 10/13/2023]
Abstract
In recent years, a huge amount of data on ncRNA interactions has been described in scientific papers and databases. Although considerable effort has been made to annotate the available knowledge in public repositories, there are still significant discrepancies in how different resources capture and interpret data on ncRNA functional and physical associations. In the present paper, we present a collection of microRNA-mRNA interactions annotated from the scientific literature following recognized standard criteria and focused on microRNAs, which regulate genes associated with rare diseases as a case study. The list of protein-coding genes with a known role in specific rare diseases was retrieved from the Genome England PanelApp, and associated microRNA-mRNA interactions were annotated in the IntAct database and compared with other datasets. RNAcentral identifiers were used for unambiguous, stable identification of ncRNAs. The information about the interaction was enhanced by a detailed description of the cell types and experimental conditions, providing a computer-interpretable summary of the published data, integrated with the huge amount of protein interactions already gathered in the database. Furthermore, for each interaction, the binding sites of the microRNA are precisely mapped on a well-defined mRNA transcript of the target gene. This information is crucial to conceive and design optimal microRNA mimics or inhibitors to interfere in vivo with a deregulated process. As these approaches become more feasible, high-quality, reliable networks of microRNA interactions are needed to help, for instance, in the selection of the best target to be inhibited and to predict potential secondary off-target effects. Database URL https://www.ebi.ac.uk/intact.
Collapse
Affiliation(s)
- Simona Panni
- Dipartimento di Biologia Ecologia e Scienze della Terra, Università della Calabria, Rende 87036, Italy
| | - Kalpana Panneerselvam
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus Hinxton, Cambridge CB10 1SD, UK
| | - Pablo Porras
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus Hinxton, Cambridge CB10 1SD, UK
- Astra Zeneca, Data Office, Data Science and AI, UK Academy House, 136 Hills Road, Cambridge CB2 8PA, UK
| | - Margaret Duesbury
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus Hinxton, Cambridge CB10 1SD, UK
| | - Livia Perfetto
- Department of Biology and Biotechnologies “Charles Darwin”, La Sapienza University, Rome, Italy
| | - Luana Licata
- Department of Biology, University of Tor Vergata, Rome, Italy
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus Hinxton, Cambridge CB10 1SD, UK
| | - Sandra Orchard
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
18
|
Bowler-Barnett EH, Fan J, Luo J, Magrane M, Martin MJ, Orchard S. UniProt and Mass Spectrometry-Based Proteomics-A 2-Way Working Relationship. Mol Cell Proteomics 2023; 22:100591. [PMID: 37301379 PMCID: PMC10404557 DOI: 10.1016/j.mcpro.2023.100591] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 05/20/2023] [Accepted: 06/07/2023] [Indexed: 06/12/2023] Open
Abstract
The human proteome comprises of all of the proteins produced by the sequences translated from the human genome with additional modifications in both sequence and function caused by nonsynonymous variants and posttranslational modifications including cleavage of the initial transcript into smaller peptides and polypeptides. The UniProtKB database (www.uniprot.org) is the world's leading high-quality, comprehensive and freely accessible resource of protein sequence and functional information and presents a summary of experimentally verified, or computationally predicted, functional information added by our expert biocuration team for each protein in the proteome. Researchers in the field of mass spectrometry-based proteomics both consume and add to the body of data available in UniProtKB, and this review highlights the information we provide to this community and the knowledge we in turn obtain from groups via deposition of large-scale datasets in public domain databases.
Collapse
Affiliation(s)
- E H Bowler-Barnett
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, United Kingdom
| | - J Fan
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, United Kingdom
| | - J Luo
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, United Kingdom
| | - M Magrane
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, United Kingdom
| | - M J Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, United Kingdom
| | - S Orchard
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, United Kingdom.
| |
Collapse
|
19
|
Licata L, Via A, Turina P, Babbi G, Benevenuta S, Carta C, Casadio R, Cicconardi A, Facchiano A, Fariselli P, Giordano D, Isidori F, Marabotti A, Martelli PL, Pascarella S, Pinelli M, Pippucci T, Russo R, Savojardo C, Scafuri B, Valeriani L, Capriotti E. Resources and tools for rare disease variant interpretation. Front Mol Biosci 2023; 10:1169109. [PMID: 37234922 PMCID: PMC10206239 DOI: 10.3389/fmolb.2023.1169109] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2023] [Accepted: 04/25/2023] [Indexed: 05/28/2023] Open
Abstract
Collectively, rare genetic disorders affect a substantial portion of the world's population. In most cases, those affected face difficulties in receiving a clinical diagnosis and genetic characterization. The understanding of the molecular mechanisms of these diseases and the development of therapeutic treatments for patients are also challenging. However, the application of recent advancements in genome sequencing/analysis technologies and computer-aided tools for predicting phenotype-genotype associations can bring significant benefits to this field. In this review, we highlight the most relevant online resources and computational tools for genome interpretation that can enhance the diagnosis, clinical management, and development of treatments for rare disorders. Our focus is on resources for interpreting single nucleotide variants. Additionally, we present use cases for interpreting genetic variants in clinical settings and review the limitations of these results and prediction tools. Finally, we have compiled a curated set of core resources and tools for analyzing rare disease genomes. Such resources and tools can be utilized to develop standardized protocols that will enhance the accuracy and effectiveness of rare disease diagnosis.
Collapse
Affiliation(s)
- Luana Licata
- Department of Biology, University of Rome Tor Vergata, Roma, Italy
| | - Allegra Via
- Department of Biochemical Sciences “A. Rossi Fanelli”, University of Rome “La Sapienza”, Roma, Italy
| | - Paola Turina
- Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| | - Giulia Babbi
- Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| | | | - Claudio Carta
- National Centre for Rare Diseases, Istituto Superiore di Sanità, Roma, Italy
| | - Rita Casadio
- Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| | - Andrea Cicconardi
- Department of Physics, University of Genova, Genova, Italy
- Italiano di Tecnologia—IIT, Genova, Italy
| | - Angelo Facchiano
- National Research Council, Institute of Food Science, Avellino, Italy
| | - Piero Fariselli
- Department of Medical Sciences, University of Torino, Torino, Italy
| | - Deborah Giordano
- National Research Council, Institute of Food Science, Avellino, Italy
| | - Federica Isidori
- Medical Genetics Unit, IRCCS Azienda Ospedaliero-Universitaria di Bologna, Bologna, Italy
| | - Anna Marabotti
- Department of Chemistry and Biology “A. Zambelli”, University of Salerno, Fisciano, SA, Italy
| | - Pier Luigi Martelli
- Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| | - Stefano Pascarella
- Department of Biochemical Sciences “A. Rossi Fanelli”, University of Rome “La Sapienza”, Roma, Italy
| | - Michele Pinelli
- Department of Molecular Medicine and Medical Biotechnology, University of Naples Federico II, Napoli, Italy
| | - Tommaso Pippucci
- Medical Genetics Unit, IRCCS Azienda Ospedaliero-Universitaria di Bologna, Bologna, Italy
| | - Roberta Russo
- Department of Molecular Medicine and Medical Biotechnology, University of Naples Federico II, Napoli, Italy
- CEINGE Biotecnologie Avanzate Franco Salvatore, Napoli, Italy
| | - Castrense Savojardo
- Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| | - Bernardina Scafuri
- Department of Chemistry and Biology “A. Zambelli”, University of Salerno, Fisciano, SA, Italy
| | | | - Emidio Capriotti
- Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| |
Collapse
|
20
|
Omenn GS, Lane L, Overall CM, Pineau C, Packer NH, Cristea IM, Lindskog C, Weintraub ST, Orchard S, Roehrl MH, Nice E, Liu S, Bandeira N, Chen YJ, Guo T, Aebersold R, Moritz RL, Deutsch EW. The 2022 Report on the Human Proteome from the HUPO Human Proteome Project. J Proteome Res 2023; 22:1024-1042. [PMID: 36318223 PMCID: PMC10081950 DOI: 10.1021/acs.jproteome.2c00498] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
The 2022 Metrics of the Human Proteome from the HUPO Human Proteome Project (HPP) show that protein expression has now been credibly detected (neXtProt PE1 level) for 18 407 (93.2%) of the 19 750 predicted proteins coded in the human genome, a net gain of 50 since 2021 from data sets generated around the world and reanalyzed by the HPP. Conversely, the number of neXtProt PE2, PE3, and PE4 missing proteins has been reduced by 78 from 1421 to 1343. This represents continuing experimental progress on the human proteome parts list across all the chromosomes, as well as significant reclassifications. Meanwhile, applying proteomics in a vast array of biological and clinical studies continues to yield significant findings and growing integration with other omics platforms. We present highlights from the Chromosome-Centric HPP, Biology and Disease-driven HPP, and HPP Resource Pillars, compare features of mass spectrometry and Olink and Somalogic platforms, note the emergence of translation products from ribosome profiling of small open reading frames, and discuss the launch of the initial HPP Grand Challenge Project, "A Function for Each Protein".
Collapse
Affiliation(s)
- Gilbert S. Omenn
- University of Michigan, Ann Arbor, Michigan 48109, United States
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Lydie Lane
- CALIPHO Group, SIB Swiss Institute of Bioinformatics and University of Geneva, 1015 Lausanne, Switzerland
| | | | - Charles Pineau
- French Institute of Health and Medical Research, 35042 RENNES Cedex, France
| | - Nicolle H. Packer
- Macquarie University, Sydney, NSW 2109, Australia
- Griffith University’s Institute for Glycomics, Sydney, NSW 2109, Australia
| | | | | | - Susan T. Weintraub
- University of Texas Health Science Center-San Antonio, San Antonio, Texas 78229-3900, United States
| | - Sandra Orchard
- EMBL-EBI, Hinxton, Cambridgeshire, CB10 1SD, United Kingdom
| | - Michael H.A. Roehrl
- Memorial Sloan Kettering Cancer Center, New York, New York, 10065, United States
| | | | - Siqi Liu
- BGI Group, Shenzhen 518083, China
| | - Nuno Bandeira
- University of California, San Diego, La Jolla, California 92093, United States
| | - Yu-Ju Chen
- National Taiwan University, Academia Sinica, Nankang, Taipei 11529, Taiwan
| | - Tiannan Guo
- Westlake University Guomics Laboratory of Big Proteomic Data, Hangzhou 310024, Zhejiang Province, China
| | - Ruedi Aebersold
- Institute of Molecular Systems Biology in ETH Zurich, 8092 Zurich, Switzerland
| | - Robert L. Moritz
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Eric W. Deutsch
- Institute for Systems Biology, Seattle, Washington 98109, United States
| |
Collapse
|
21
|
Barrio-Hernandez I, Schwartzentruber J, Shrivastava A, Del-Toro N, Gonzalez A, Zhang Q, Mountjoy E, Suveges D, Ochoa D, Ghoussaini M, Bradley G, Hermjakob H, Orchard S, Dunham I, Anderson CA, Porras P, Beltrao P. Network expansion of genetic associations defines a pleiotropy map of human cell biology. Nat Genet 2023; 55:389-398. [PMID: 36823319 PMCID: PMC10011132 DOI: 10.1038/s41588-023-01327-9] [Citation(s) in RCA: 41] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Accepted: 01/30/2023] [Indexed: 02/25/2023]
Abstract
Interacting proteins tend to have similar functions, influencing the same organismal traits. Interaction networks can be used to expand the list of candidate trait-associated genes from genome-wide association studies. Here, we performed network-based expansion of trait-associated genes for 1,002 human traits showing that this recovers known disease genes or drug targets. The similarity of network expansion scores identifies groups of traits likely to share an underlying genetic and biological process. We identified 73 pleiotropic gene modules linked to multiple traits, enriched in genes involved in processes such as protein ubiquitination and RNA processing. In contrast to gene deletion studies, pleiotropy as defined here captures specifically multicellular-related processes. We show examples of modules linked to human diseases enriched in genes with known pathogenic variants that can be used to map targets of approved drugs for repurposing. Finally, we illustrate the use of network expansion scores to study genes at inflammatory bowel disease genome-wide association study loci, and implicate inflammatory bowel disease-relevant genes with strong functional and genetic support.
Collapse
Affiliation(s)
- Inigo Barrio-Hernandez
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Jeremy Schwartzentruber
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
- Wellcome Sanger Institute, Cambridge, UK
| | - Anjali Shrivastava
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Noemi Del-Toro
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Asier Gonzalez
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Qian Zhang
- Wellcome Sanger Institute, Cambridge, UK
| | - Edward Mountjoy
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Daniel Suveges
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - David Ochoa
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Maya Ghoussaini
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Glyn Bradley
- Computational Biology, Genomic Sciences, GSK, Stevenage, UK
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Sandra Orchard
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Ian Dunham
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
- Wellcome Sanger Institute, Cambridge, UK
| | - Carl A Anderson
- Open Targets, Cambridge, UK
- Wellcome Sanger Institute, Cambridge, UK
| | - Pablo Porras
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
- Open Targets, Cambridge, UK
| | - Pedro Beltrao
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK.
- Open Targets, Cambridge, UK.
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland.
| |
Collapse
|
22
|
Deutsch EW, Vizcaíno JA, Jones AR, Binz PA, Lam H, Klein J, Bittremieux W, Perez-Riverol Y, Tabb DL, Walzer M, Ricard-Blum S, Hermjakob H, Neumann S, Mak TD, Kawano S, Mendoza L, Van Den Bossche T, Gabriels R, Bandeira N, Carver J, Pullman B, Sun Z, Hoffmann N, Shofstahl J, Zhu Y, Licata L, Quaglia F, Tosatto SCE, Orchard SE. Proteomics Standards Initiative at Twenty Years: Current Activities and Future Work. J Proteome Res 2023; 22:287-301. [PMID: 36626722 PMCID: PMC9903322 DOI: 10.1021/acs.jproteome.2c00637] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Indexed: 01/11/2023]
Abstract
The Human Proteome Organization (HUPO) Proteomics Standards Initiative (PSI) has been successfully developing guidelines, data formats, and controlled vocabularies (CVs) for the proteomics community and other fields supported by mass spectrometry since its inception 20 years ago. Here we describe the general operation of the PSI, including its leadership, working groups, yearly workshops, and the document process by which proposals are thoroughly and publicly reviewed in order to be ratified as PSI standards. We briefly describe the current state of the many existing PSI standards, some of which remain the same as when originally developed, some of which have undergone subsequent revisions, and some of which have become obsolete. Then the set of proposals currently being developed are described, with an open call to the community for participation in the forging of the next generation of standards. Finally, we describe some synergies and collaborations with other organizations and look to the future in how the PSI will continue to promote the open sharing of data and thus accelerate the progress of the field of proteomics.
Collapse
Affiliation(s)
- Eric W. Deutsch
- Institute
for Systems Biology, Seattle, Washington 98109, United States
| | - Juan Antonio Vizcaíno
- European
Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Andrew R. Jones
- Institute
of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Pierre-Alain Binz
- Clinical
Chemistry Service, Lausanne University Hospital, 1011 976 Lausanne, Switzerland
| | - Henry Lam
- Department
of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong 999077, P. R. China.
| | - Joshua Klein
- Program for
Bioinformatics, Boston University, Boston, Massachusetts 02215, United States
| | - Wout Bittremieux
- Skaggs
School
of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
- Department
of Computer Science, University of Antwerp, 2020 Antwerpen, Belgium
| | - Yasset Perez-Riverol
- European
Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - David L. Tabb
- SA MRC
Centre for TB Research, DST/NRF Centre of Excellence for Biomedical
TB Research, Division of Molecular Biology and Human Genetics, Faculty
of Medicine and Health Sciences, Stellenbosch
University, Cape Town 7602, South Africa
| | - Mathias Walzer
- European
Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Sylvie Ricard-Blum
- Univ.
Lyon, Université Lyon 1, ICBMS, UMR 5246, 69622 Villeurbanne, France
| | - Henning Hermjakob
- European
Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Steffen Neumann
- Bioinformatics
and Scientific Data, Leibniz Institute of
Plant Biochemistry, 06120 Halle, Germany
- German
Centre for Integrative Biodiversity Research (iDiv), 04103 Halle-Jena-Leipzig, Germany
| | - Tytus D. Mak
- Mass Spectrometry
Data Center, National Institute of Standards
and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United
States
| | - Shin Kawano
- Database
Center for Life Science, Joint Support Center for Data Science Research, Research Organization of Information and Systems, Chiba 277-0871, Japan
- Faculty
of Contemporary Society, Toyama University
of International Studies, Toyama 930-1292, Japan
- School
of Frontier Engineering, Kitasato University, Sagamihara 252-0373, Japan
| | - Luis Mendoza
- Institute
for Systems Biology, Seattle, Washington 98109, United States
| | - Tim Van Den Bossche
- VIB-UGent
Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium
- Department
of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, 9052 Ghent, Belgium
| | - Ralf Gabriels
- VIB-UGent
Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium
- Department
of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, 9052 Ghent, Belgium
| | - Nuno Bandeira
- Skaggs
School
of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
- Center
for Computational Mass Spectrometry, Department of Computer Science
and Engineering, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego 92093-0404, United States
| | - Jeremy Carver
- Center
for Computational Mass Spectrometry, Department of Computer Science
and Engineering, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego 92093-0404, United States
| | - Benjamin Pullman
- Center
for Computational Mass Spectrometry, Department of Computer Science
and Engineering, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego 92093-0404, United States
| | - Zhi Sun
- Institute
for Systems Biology, Seattle, Washington 98109, United States
| | - Nils Hoffmann
- Institute
for Bio- and Geosciences (IBG-5), Forschungszentrum
Jülich GmbH, 52428 Jülich, Germany
| | - Jim Shofstahl
- Thermo
Fisher Scientific, 355 River Oaks Parkway, San Jose, California 95134, United States
| | - Yunping Zhu
- National
Center for Protein Sciences (Beijing), Beijing
Institute of Lifeomics, #38, Life Science Park, Changping District, Beijing 102206, China
| | - Luana Licata
- Fondazione
Human Technopole, 20157 Milan, Italy
- Department
of Biology, University of Rome Tor Vergata, 00133 Rome, Italy
| | - Federica Quaglia
- Institute
of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR-IBIOM), 70126 Bari, Italy
- Department
of Biomedical Sciences, University of Padova, 35131 Padova, Italy
| | | | - Sandra E. Orchard
- European
Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| |
Collapse
|
23
|
UniProt: the Universal Protein Knowledgebase in 2023. Nucleic Acids Res 2023; 51:D523-D531. [PMID: 36408920 PMCID: PMC9825514 DOI: 10.1093/nar/gkac1052] [Citation(s) in RCA: 2952] [Impact Index Per Article: 1476.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Revised: 10/05/2022] [Accepted: 10/25/2022] [Indexed: 11/22/2022] Open
Abstract
The aim of the UniProt Knowledgebase is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this publication we describe enhancements made to our data processing pipeline and to our website to adapt to an ever-increasing information content. The number of sequences in UniProtKB has risen to over 227 million and we are working towards including a reference proteome for each taxonomic group. We continue to extract detailed annotations from the literature to update or create reviewed entries, while unreviewed entries are supplemented with annotations provided by automated systems using a variety of machine-learning techniques. In addition, the scientific community continues their contributions of publications and annotations to UniProt entries of their interest. Finally, we describe our new website (https://www.uniprot.org/), designed to enhance our users' experience and make our data easily accessible to the research community. This interface includes access to AlphaFold structures for more than 85% of all entries as well as improved visualisations for subcellular localisation of proteins.
Collapse
|
24
|
Baltoumas FA, Sofras D, Apostolakou AE, Litou ZI, Iconomidou VA. NucEnvDB: A Database of Nuclear Envelope Proteins and Their Interactions. MEMBRANES 2023; 13:62. [PMID: 36676869 PMCID: PMC9861991 DOI: 10.3390/membranes13010062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/10/2022] [Revised: 12/27/2022] [Accepted: 12/28/2022] [Indexed: 06/17/2023]
Abstract
The nuclear envelope (NE) is a double-membrane system surrounding the nucleus of eukaryotic cells. A large number of proteins are localized in the NE, performing a wide variety of functions, from the bidirectional exchange of molecules between the cytoplasm and the nucleus to chromatin tethering, genome organization, regulation of signaling cascades, and many others. Despite its importance, several aspects of the NE, including its protein-protein interactions, remain understudied. In this work, we present NucEnvDB, a publicly available database of NE proteins and their interactions. Each database entry contains useful annotation including a description of its position in the NE, its interactions with other proteins, and cross-references to major biological repositories. In addition, the database provides users with a number of visualization and analysis tools, including the ability to construct and visualize protein-protein interaction networks and perform functional enrichment analysis for clusters of NE proteins and their interaction partners. The capabilities of NucEnvDB and its analysis tools are showcased by two informative case studies, exploring protein-protein interactions in Hutchinson-Gilford progeria and during SARS-CoV-2 infection at the level of the nuclear envelope.
Collapse
Affiliation(s)
- Fotis A. Baltoumas
- Section of Cell Biology & Biophysics, Department of Biology, School of Sciences, National & Kapodistrian University of Athens, Panepistimiopolis, 15701 Athens, Greece
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 34 Fleming St., 16672 Athens, Greece
| | - Dimitrios Sofras
- Section of Cell Biology & Biophysics, Department of Biology, School of Sciences, National & Kapodistrian University of Athens, Panepistimiopolis, 15701 Athens, Greece
- Laboratory of Molecular Cell Biology, KU Leuven, Kasteelpark Arenberg 31—Box 2438, 3001 Leuven, Belgium
| | - Avgi E. Apostolakou
- Section of Cell Biology & Biophysics, Department of Biology, School of Sciences, National & Kapodistrian University of Athens, Panepistimiopolis, 15701 Athens, Greece
| | - Zoi I. Litou
- Section of Cell Biology & Biophysics, Department of Biology, School of Sciences, National & Kapodistrian University of Athens, Panepistimiopolis, 15701 Athens, Greece
| | - Vassiliki A. Iconomidou
- Section of Cell Biology & Biophysics, Department of Biology, School of Sciences, National & Kapodistrian University of Athens, Panepistimiopolis, 15701 Athens, Greece
| |
Collapse
|
25
|
Iuchi H, Kawasaki J, Kubo K, Fukunaga T, Hokao K, Yokoyama G, Ichinose A, Suga K, Hamada M. Bioinformatics approaches for unveiling virus-host interactions. Comput Struct Biotechnol J 2023; 21:1774-1784. [PMID: 36874163 PMCID: PMC9969756 DOI: 10.1016/j.csbj.2023.02.044] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 02/22/2023] [Accepted: 02/22/2023] [Indexed: 03/03/2023] Open
Abstract
The coronavirus disease-2019 (COVID-19) pandemic has elucidated major limitations in the capacity of medical and research institutions to appropriately manage emerging infectious diseases. We can improve our understanding of infectious diseases by unveiling virus-host interactions through host range prediction and protein-protein interaction prediction. Although many algorithms have been developed to predict virus-host interactions, numerous issues remain to be solved, and the entire network remains veiled. In this review, we comprehensively surveyed algorithms used to predict virus-host interactions. We also discuss the current challenges, such as dataset biases toward highly pathogenic viruses, and the potential solutions. The complete prediction of virus-host interactions remains difficult; however, bioinformatics can contribute to progress in research on infectious diseases and human health.
Collapse
Affiliation(s)
- Hitoshi Iuchi
- Waseda Research Institute for Science and Engineering, Waseda University, Tokyo 169-8555, Japan.,Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 169-8555, Japan
| | - Junna Kawasaki
- Faculty of Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
| | - Kento Kubo
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 169-8555, Japan.,School of Advanced Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
| | - Tsukasa Fukunaga
- Waseda Institute for Advanced Study, Waseda University, Nishi Waseda, Shinjuku-ku, Tokyo 169-0051, Japan
| | - Koki Hokao
- School of Advanced Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
| | - Gentaro Yokoyama
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 169-8555, Japan.,School of Advanced Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
| | - Akiko Ichinose
- Waseda Research Institute for Science and Engineering, Waseda University, Tokyo 169-8555, Japan
| | - Kanta Suga
- School of Advanced Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
| | - Michiaki Hamada
- Waseda Research Institute for Science and Engineering, Waseda University, Tokyo 169-8555, Japan.,Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 169-8555, Japan.,School of Advanced Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan.,Graduate School of Medicine, Nippon Medical School, Tokyo 113-8602, Japan
| |
Collapse
|
26
|
Ricard-Blum S. Building, Visualizing, and Analyzing Glycosaminoglycan-Protein Interaction Networks. Methods Mol Biol 2023; 2619:211-224. [PMID: 36662472 DOI: 10.1007/978-1-0716-2946-8_15] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
This chapter describes how to generate, visualize, and analyze interaction networks of glycosaminoglycans (GAGs), which are linear polyanionic polysaccharides mostly located at the cell surface and in the extracellular matrix. The protocol is divided into three major steps: (1) the collection of GAG-mediated interaction data, (2) the visualization of GAG interaction networks, and (3) the computational enrichment analyses of these networks to identify their overrepresented features (e.g., protein domains, location, molecular functions, and biological pathways) compared to a reference proteome. These analyses are critical to interpret GAG interactomic datasets, decipher their specificities and functions, and ultimately identify GAG-protein interactions to target for therapeutic purpose.
Collapse
Affiliation(s)
- Sylvie Ricard-Blum
- ICBMS, UMR 5246 University Lyon 1, CNRS, Institute of Molecular and Supramolecular Chemistry and Biochemistry, Villeurbanne Cedex, France.
| |
Collapse
|
27
|
Gosset S, Glatigny A, Gallopin M, Yi Z, Salé M, Mucchielli-Giorgi MH. APPINetwork: an R package for building and computational analysis of protein-protein interaction networks. PeerJ 2022; 10:e14204. [PMID: 36353604 PMCID: PMC9639416 DOI: 10.7717/peerj.14204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Accepted: 09/19/2022] [Indexed: 11/06/2022] Open
Abstract
Background Protein-protein interactions (PPIs) are essential to almost every process in a cell. Analysis of PPI networks gives insights into the functional relationships among proteins and may reveal important hub proteins and sub-networks corresponding to functional modules. Several good tools have been developed for PPI network analysis but they have certain limitations. Most tools are suited for studying PPI in only a small number of model species, and do not allow second-order networks to be built, or offer relevant functions for their analysis. To overcome these limitations, we have developed APPINetwork (Analysis of Protein-protein Interaction Networks). The aim was to produce a generic and user-friendly package for building and analyzing a PPI network involving proteins of interest from any species as long they are stored in a database. Methods APPINetwork is an open-source R package. It can be downloaded and installed on the collaborative development platform GitLab (https://forgemia.inra.fr/GNet/appinetwork). A graphical user interface facilitates its use. Graphical windows, buttons, and scroll bars allow the user to select or enter an organism name, choose data files and network parameters or methods dedicated to network analysis. All functions are implemented in R, except for the script identifying all proteins involved in the same biological process (developed in C) and the scripts formatting the BioGRID data file and generating the IDs correspondence file (implemented in Python 3). PPI information comes from private resources or different public databases (such as IntAct, BioGRID, and iRefIndex). The package can be deployed on Linux and macOS operating systems (OS). Deployment on Windows is possible but it requires the prior installation of Rtools and Python 3. Results APPINetwork allows the user to build a PPI network from selected public databases and add their own PPI data. In this network, the proteins have unique identifiers resulting from the standardization of the different identifiers specific to each database. In addition to the construction of the first-order network, APPINetwork offers the possibility of building a second-order network centered on the proteins of interest (proteins known for their role in the biological process studied or subunits of a complex protein) and provides the number and type of experiments that have highlighted each PPI, as well as references to articles containing experimental evidence. Conclusion More than a tool for PPI network building, APPINetwork enables the analysis of the resultant network, by searching either for the community of proteins involved in the same biological process or for the assembly intermediates of a protein complex. Results of these analyses are provided in easily exportable files. Examples files and a user manual describing each step of the process come with the package.
Collapse
Affiliation(s)
- Simon Gosset
- Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), Gif-sur-Yvette, France
- Université de Paris, Institute of Plant Sciences Paris-Saclay (IPS2), Gif-sur-Yvette, France
| | - Annie Glatigny
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Gif-sur-Yvette, France
| | - Mélina Gallopin
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Gif-sur-Yvette, France
| | - Zhou Yi
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Gif-sur-Yvette, France
| | - Marion Salé
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Gif-sur-Yvette, France
| | - Marie-Hélène Mucchielli-Giorgi
- Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), Gif-sur-Yvette, France
- Université de Paris, Institute of Plant Sciences Paris-Saclay (IPS2), Gif-sur-Yvette, France
| |
Collapse
|
28
|
de Crécy-lagard V, Amorin de Hegedus R, Arighi C, Babor J, Bateman A, Blaby I, Blaby-Haas C, Bridge AJ, Burley SK, Cleveland S, Colwell LJ, Conesa A, Dallago C, Danchin A, de Waard A, Deutschbauer A, Dias R, Ding Y, Fang G, Friedberg I, Gerlt J, Goldford J, Gorelik M, Gyori BM, Henry C, Hutinet G, Jaroch M, Karp PD, Kondratova L, Lu Z, Marchler-Bauer A, Martin MJ, McWhite C, Moghe GD, Monaghan P, Morgat A, Mungall CJ, Natale DA, Nelson WC, O’Donoghue S, Orengo C, O’Toole KH, Radivojac P, Reed C, Roberts RJ, Rodionov D, Rodionova IA, Rudolf JD, Saleh L, Sheynkman G, Thibaud-Nissen F, Thomas PD, Uetz P, Vallenet D, Carter EW, Weigele PR, Wood V, Wood-Charlson EM, Xu J. A roadmap for the functional annotation of protein families: a community perspective. Database (Oxford) 2022; 2022:baac062. [PMID: 35961013 PMCID: PMC9374478 DOI: 10.1093/database/baac062] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Revised: 06/28/2022] [Accepted: 08/03/2022] [Indexed: 12/23/2022]
Abstract
Over the last 25 years, biology has entered the genomic era and is becoming a science of 'big data'. Most interpretations of genomic analyses rely on accurate functional annotations of the proteins encoded by more than 500 000 genomes sequenced to date. By different estimates, only half the predicted sequenced proteins carry an accurate functional annotation, and this percentage varies drastically between different organismal lineages. Such a large gap in knowledge hampers all aspects of biological enterprise and, thereby, is standing in the way of genomic biology reaching its full potential. A brainstorming meeting to address this issue funded by the National Science Foundation was held during 3-4 February 2022. Bringing together data scientists, biocurators, computational biologists and experimentalists within the same venue allowed for a comprehensive assessment of the current state of functional annotations of protein families. Further, major issues that were obstructing the field were identified and discussed, which ultimately allowed for the proposal of solutions on how to move forward.
Collapse
Affiliation(s)
- Valérie de Crécy-lagard
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | | | - Cecilia Arighi
- Department of Computer and Information Sciences, University of Delaware, Newark, DE 19713, USA
| | - Jill Babor
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Ian Blaby
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Crysten Blaby-Haas
- Biology Department, Brookhaven National Laboratory, Upton, NY 11973, USA
| | - Alan J Bridge
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, Geneva 4 CH-1211, Switzerland
| | - Stephen K Burley
- RCSB Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Stacey Cleveland
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | - Lucy J Colwell
- Departmenf of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK
| | - Ana Conesa
- Spanish National Research Council, Institute for Integrative Systems Biology, Paterna, Valencia 46980, Spain
| | - Christian Dallago
- TUM (Technical University of Munich) Department of Informatics, Bioinformatics & Computational Biology, i12, Boltzmannstr. 3, Garching/Munich 85748, Germany
| | - Antoine Danchin
- School of Biomedical Sciences, Li KaShing Faculty of Medicine, The University of Hong Kong, 21 Sassoon Road, Pokfulam, SAR Hong Kong 999077, China
| | - Anita de Waard
- Research Collaboration Unit, Elsevier, Jericho, VT 05465, USA
| | - Adam Deutschbauer
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Raquel Dias
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | - Yousong Ding
- Department of Medicinal Chemistry, Center for Natural Products, Drug Discovery and Development, University of Florida, Gainesville, FL 32610, USA
| | - Gang Fang
- NYU-Shanghai, Shanghai 200120, China
| | - Iddo Friedberg
- Department of Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA 50011, USA
| | - John Gerlt
- Institute for Genomic Biology and Departments of Biochemistry and Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Joshua Goldford
- Physics of Living Systems, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Mark Gorelik
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | - Benjamin M Gyori
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA 02115, USA
| | - Christopher Henry
- Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA
| | - Geoffrey Hutinet
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | - Marshall Jaroch
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | - Peter D Karp
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025, USA
| | | | - Zhiyong Lu
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), 8600 Rockville Pike, Bethesda, MD 20817, USA
| | - Aron Marchler-Bauer
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), 8600 Rockville Pike, Bethesda, MD 20817, USA
| | - Maria-Jesus Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Claire McWhite
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA
| | - Gaurav D Moghe
- Plant Biology Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| | - Paul Monaghan
- Department of Agricultural Education and Communication, University of Florida, Gainesville, FL 32611, USA
| | - Anne Morgat
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, Geneva 4 CH-1211, Switzerland
| | - Christopher J Mungall
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Darren A Natale
- Georgetown University Medical Center, Washington, DC 20007, USA
| | - William C Nelson
- Biological Sciences Division, Pacific Northwest National Laboratories, Richland, WA 99354, USA
| | - Seán O’Donoghue
- School of Biotechnology and Biomolecular Sciences, University of NSW, Sydney, NSW 2052, Australia
| | - Christine Orengo
- Department of Structural and Molecular Biology, University College London, London WC1E 6BT, UK
| | | | - Predrag Radivojac
- Khoury College of Computer Sciences, Northeastern University, Boston, MA 02115, USA
| | - Colbie Reed
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | | | - Dmitri Rodionov
- Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA 92037, USA
| | - Irina A Rodionova
- Department of Bioengineering, Division of Engineering, University of California at San Diego, La Jolla, CA 92093-0412, USA
| | - Jeffrey D Rudolf
- Department of Chemistry, University of Florida, Gainesville, FL 32611, USA
| | - Lana Saleh
- New England Biolabs, Ipswich, MA 01938, USA
| | - Gloria Sheynkman
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| | - Francoise Thibaud-Nissen
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), 8600 Rockville Pike, Bethesda, MD 20817, USA
| | - Paul D Thomas
- Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA 90033, USA
| | - Peter Uetz
- Center for Biological Data Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - David Vallenet
- LABGeM, Génomique Métabolique, CEA, Genoscope, Institut François Jacob, Université d’Évry, Université Paris-Saclay, CNRS, Evry 91057, France
| | - Erica Watson Carter
- Department of Plant Pathology, University of Florida Citrus Research and Education Center, 700 Experiment Station Rd., Lake Alfred, FL 33850, USA
| | | | - Valerie Wood
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK
| | - Elisha M Wood-Charlson
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Jin Xu
- Department of Plant Pathology, University of Florida Citrus Research and Education Center, 700 Experiment Station Rd., Lake Alfred, FL 33850, USA
| |
Collapse
|
29
|
Glycosaminoglycan interaction networks and databases. Curr Opin Struct Biol 2022; 74:102355. [DOI: 10.1016/j.sbi.2022.102355] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2021] [Revised: 02/02/2022] [Accepted: 02/04/2022] [Indexed: 12/14/2022]
|
30
|
Vallet SD, Berthollier C, Ricard-Blum S. The glycosaminoglycan interactome 2.0. Am J Physiol Cell Physiol 2022; 322:C1271-C1278. [PMID: 35544698 DOI: 10.1152/ajpcell.00095.2022] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Glycosaminoglycans (GAGs) are complex linear polysaccharides, which are covalently attached to core proteins (except for hyaluronan) to form proteoglycans. They play key roles in the organization of the extracellular matrix, and at the cell surface where they contribute to the regulation of cell signaling and of cell adhesion. To explore the mechanisms and pathways underlying their functions, we have generated an expanded dataset of 4290 interactions corresponding to 3464 unique GAG-binding proteins, four times more than the first version of the GAG interactome (Vallet and Ricard-Blum, 2021 J Histochem Cytochem 69:93-104). The increased size of the GAG network is mostly due to the addition of GAG-binding proteins captured from cell lysates and biological fluids by affinity chromatography and identified by mass spectrometry. We review here the interaction repertoire of natural GAGs and of synthetic sulfated hyaluronan, the specificity and molecular functions of GAG-binding proteins, and the biological processes and pathways they are involved in. This dataset is also used to investigate the differences between proteins binding to iduronic acid-containing GAGs (dermatan sulfate and heparin/heparan sulfate) and those interacting with GAGs lacking iduronic acid (chondroitin sulfate, hyaluronan, and keratan sulfate).
Collapse
|
31
|
Saha D, Iannuccelli M, Brun C, Zanzoni A, Licata L. The Intricacy of the Viral-Human Protein Interaction Networks: Resources, Data, and Analyses. Front Microbiol 2022; 13:849781. [PMID: 35531299 PMCID: PMC9069133 DOI: 10.3389/fmicb.2022.849781] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2022] [Accepted: 03/11/2022] [Indexed: 11/18/2022] Open
Abstract
Viral infections are one of the major causes of human diseases that cause yearly millions of deaths and seriously threaten global health, as we have experienced with the COVID-19 pandemic. Numerous approaches have been adopted to understand viral diseases and develop pharmacological treatments. Among them, the study of virus-host protein-protein interactions is a powerful strategy to comprehend the molecular mechanisms employed by the virus to infect the host cells and to interact with their components. Experimental protein-protein interactions described in the scientific literature have been systematically captured into several molecular interaction databases. These data are organized in structured formats and can be easily downloaded by users to perform further bioinformatic and network studies. Network analysis of available virus-host interactomes allow us to understand how the host interactome is perturbed upon viral infection and what are the key host proteins targeted by the virus and the main cellular pathways that are subverted. In this review, we give an overview of publicly available viral-human protein-protein interactions resources and the community standards, curation rules and adopted ontologies. A description of the main virus-human interactome available is provided, together with the main network analyses that have been performed. We finally discuss the main limitations and future challenges to assess the quality and reliability of protein-protein interaction datasets and resources.
Collapse
Affiliation(s)
- Deeya Saha
- Aix-Marseille Univ., Inserm, TAGC, UMR_S1090, Marseille, France
| | | | - Christine Brun
- Aix-Marseille Univ., Inserm, TAGC, UMR_S1090, Marseille, France
- CNRS, Marseille, France
| | - Andreas Zanzoni
- Aix-Marseille Univ., Inserm, TAGC, UMR_S1090, Marseille, France
- *Correspondence: Andreas Zanzoni,
| | - Luana Licata
- Department of Biology, University of Rome Tor Vergata, Rome, Italy
- Luana Licata,
| |
Collapse
|
32
|
Xie VC, Styles MJ, Dickinson BC. Methods for the directed evolution of biomolecular interactions. Trends Biochem Sci 2022; 47:403-416. [PMID: 35427479 PMCID: PMC9022280 DOI: 10.1016/j.tibs.2022.01.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2021] [Revised: 12/27/2021] [Accepted: 01/13/2022] [Indexed: 02/06/2023]
Abstract
Noncovalent interactions between biomolecules such as proteins and nucleic acids coordinate all cellular processes through changes in proximity. Tools that perturb these interactions are and will continue to be highly valuable for basic and translational scientific endeavors. By taking cues from natural systems, such as the adaptive immune system, we can design directed evolution platforms that can generate proteins that bind to biomolecules of interest. In recent years, the platforms used to direct the evolution of biomolecular binders have greatly expanded the range of types of interactions one can evolve. Herein, we review recent advances in methods to evolve protein-protein, protein-RNA, and protein-DNA interactions.
Collapse
Affiliation(s)
| | - Matthew J Styles
- Department of Chemistry, The University of Chicago, Chicago, IL 60637, USA
| | - Bryan C Dickinson
- Department of Chemistry, The University of Chicago, Chicago, IL 60637, USA.
| |
Collapse
|
33
|
van der Velde KJ, Singh G, Kaliyaperumal R, Liao X, de Ridder S, Rebers S, Kerstens HHD, de Andrade F, van Reeuwijk J, De Gruyter FE, Hiltemann S, Ligtvoet M, Weiss MM, van Deutekom HWM, Jansen AML, Stubbs AP, Vissers LELM, Laros JFJ, van Enckevort E, Stemkens D, 't Hoen PAC, Beliën JAM, van Gijn ME, Swertz MA. FAIR Genomes metadata schema promoting Next Generation Sequencing data reuse in Dutch healthcare and research. Sci Data 2022; 9:169. [PMID: 35418585 PMCID: PMC9008059 DOI: 10.1038/s41597-022-01265-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2021] [Accepted: 03/25/2022] [Indexed: 11/08/2022] Open
Abstract
The genomes of thousands of individuals are profiled within Dutch healthcare and research each year. However, this valuable genomic data, associated clinical data and consent are captured in different ways and stored across many systems and organizations. This makes it difficult to discover rare disease patients, reuse data for personalized medicine and establish research cohorts based on specific parameters. FAIR Genomes aims to enable NGS data reuse by developing metadata standards for the data descriptions needed to FAIRify genomic data while also addressing ELSI issues. We developed a semantic schema of essential data elements harmonized with international FAIR initiatives. The FAIR Genomes schema v1.1 contains 110 elements in 9 modules. It reuses common ontologies such as NCIT, DUO and EDAM, only introducing new terms when necessary. The schema is represented by a YAML file that can be transformed into templates for data entry software (EDC) and programmatic interfaces (JSON, RDF) to ease genomic data sharing in research and healthcare. The schema, documentation and MOLGENIS reference implementation are available at https://fairgenomes.org .
Collapse
Affiliation(s)
- K Joeri van der Velde
- University of Groningen and University Medical Center Groningen, Genomics Coordination Center, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
- University of Groningen and University Medical Center Groningen, Department of Genetics, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
| | - Gurnoor Singh
- Radboud University Medical Center, Radboud Institute for Molecular Life Sciences, Center for Molecular and Biomolecular Informatics, Geert Grooteplein 28, 6525 GA, Nijmegen, The Netherlands
| | - Rajaram Kaliyaperumal
- Leiden University Medical Center, Department of Human Genetics, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands
| | - XiaoFeng Liao
- Radboud University Medical Center, Radboud Institute for Molecular Life Sciences, Center for Molecular and Biomolecular Informatics, Geert Grooteplein 28, 6525 GA, Nijmegen, The Netherlands
| | - Sander de Ridder
- Amsterdam University Medical Center, University of Amsterdam, Department of Pathology, Meibergdreef 9, 1105 AZ, Amsterdam, The Netherlands
| | - Susanne Rebers
- The Netherlands Cancer Institute, Division of Molecular Pathology, Plesmanlaan 121, 1066 CX, Amsterdam, The Netherlands
| | - Hindrik H D Kerstens
- Prinses Máxima Center for Pediatric Oncology, Kemmeren group, Heidelberglaan 25, 3584 CS, Utrecht, The Netherlands
| | - Fernanda de Andrade
- University of Groningen and University Medical Center Groningen, Genomics Coordination Center, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
| | - Jeroen van Reeuwijk
- Radboud University Medical Center, Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Geert Grooteplein 10, 6525 GA, Nijmegen, The Netherlands
| | - Fini E De Gruyter
- University Medical Center Utrecht, Department of Genetics, Heidelberglaan 100, 3584 CX, Utrecht, The Netherlands
| | - Saskia Hiltemann
- Erasmus Medical Center, Department of Pathology, Doctor Molewaterplein 40, 3015 GD, Rotterdam, The Netherlands
| | - Maarten Ligtvoet
- Nictiz - Dutch competence centre for electronic exchange of health and care information, Oude Middenweg 55, 2491 AC, The Hague, The Netherlands
| | - Marjan M Weiss
- Radboud University Medical Center, Department of Human Genetics, Geert Grooteplein 10, 6525 GA, Nijmegen, The Netherlands
| | - Hanneke W M van Deutekom
- University Medical Center Utrecht, Department of Genetics, Heidelberglaan 100, 3584 CX, Utrecht, The Netherlands
| | - Anne M L Jansen
- University Medical Center Utrecht, Department of Pathology, Heidelberglaan 100, 3584 CX, Utrecht, The Netherlands
| | - Andrew P Stubbs
- Erasmus Medical Center, Department of Pathology, Doctor Molewaterplein 40, 3015 GD, Rotterdam, The Netherlands
| | - Lisenka E L M Vissers
- Radboud University Medical Center, Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Geert Grooteplein 10, 6525 GA, Nijmegen, The Netherlands
| | - Jeroen F J Laros
- Leiden University Medical Center, Department of Human Genetics, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands
- Leiden University Medical Center, Department of Clinical Genetics, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands
- Rijksinstituut voor Volksgezondheid en Milieu, Antonie van Leeuwenhoeklaan 9, 3721 MA, Bilthoven, The Netherlands
| | - Esther van Enckevort
- University of Groningen and University Medical Center Groningen, Genomics Coordination Center, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
| | - Daphne Stemkens
- VSOP - Patient Alliance for Rare and Genetic Diseases The Netherlands, Koninginnelaan 23, 3762 DA, Soest, The Netherlands
| | - Peter A C 't Hoen
- Radboud University Medical Center, Radboud Institute for Molecular Life Sciences, Center for Molecular and Biomolecular Informatics, Geert Grooteplein 28, 6525 GA, Nijmegen, The Netherlands
| | - Jeroen A M Beliën
- Amsterdam University Medical Center, Vrije Universiteit Amsterdam, Department of Pathology, De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
| | - Mariëlle E van Gijn
- University of Groningen and University Medical Center Groningen, Department of Genetics, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
| | - Morris A Swertz
- University of Groningen and University Medical Center Groningen, Genomics Coordination Center, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands.
- University of Groningen and University Medical Center Groningen, Department of Genetics, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands.
| |
Collapse
|
34
|
Melkonian M, Juigné C, Dameron O, Rabut G, Becker E. Towards a reproducible interactome: semantic-based detection of redundancies to unify protein-protein interaction databases. Bioinformatics 2022; 38:1685-1691. [PMID: 35015827 DOI: 10.1093/bioinformatics/btac013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 11/29/2021] [Accepted: 01/06/2022] [Indexed: 02/04/2023] Open
Abstract
MOTIVATION Information on protein-protein interactions is collected in numerous primary databases with their own curation process. Several meta-databases aggregate primary databases to provide more exhaustive datasets. In addition to exhaustivity, aggregation contributes to reliability by providing an overview of the various studies and detection methods supporting an interaction. However, interactions listed in different primary databases are partly redundant because some publications reporting protein-protein interactions have been curated by multiple primary databases. Mere aggregation can thus introduce a bias if these redundancies are not identified and eliminated. To overcome this bias, meta-databases rely on the Molecular Interaction ontology that describes interaction detection methods, but they do not fully take advantage of the ontology's rich semantics, which leads to systematically overestimating interaction reproducibility. RESULTS We propose a precise definition of explicit and implicit redundancy and show that both can be easily detected using Semantic Web technologies. We apply this process to a dataset from the Agile Protein Interactomes DataServer (APID) meta-database and show that while explicit redundancies were detected by the APID aggregation process, about 15% of APID entries are implicitly redundant and should not be taken into account when presenting confidence-related metrics. More than 90% of implicit redundancies result from the aggregation of distinct primary databases, whereas the remaining occurs between entries of a single database. Finally, we build a 'reproducible interactome' with interactions that have been reproduced by multiple methods or publications. The size of the reproducible interactome is drastically impacted by removing redundancies for both yeast (-59%) and human (-56%), and we show that this is largely due to implicit redundancies. AVAILABILITY AND IMPLEMENTATION Software, data and results are available at https://gitlab.com/nnet56/reproducible-interactome, https://reproducible-interactome.genouest.org/, Zenodo (https://doi.org/10.5281/zenodo.5595037) and NDEx (https://doi.org/10.18119/N94302 and https://doi.org/10.18119/N97S4D). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Marc Melkonian
- Univ Rennes, Inria, CNRS, IRISA - UMR 6074, F-35000 Rennes, France.,Univ Rennes, CNRS, IGDR - UMR 6290, F-35000 Rennes, France
| | - Camille Juigné
- Univ Rennes, Inria, CNRS, IRISA - UMR 6074, F-35000 Rennes, France.,Pegase, Inrae, Institut Agro, 35590 Saint-Gilles, France
| | - Olivier Dameron
- Univ Rennes, Inria, CNRS, IRISA - UMR 6074, F-35000 Rennes, France
| | - Gwenaël Rabut
- Univ Rennes, CNRS, IGDR - UMR 6290, F-35000 Rennes, France
| | | |
Collapse
|
35
|
Kunowska N, Stelzl U. Decoding the cellular effects of genetic variation through interaction proteomics. Curr Opin Chem Biol 2022; 66:102100. [PMID: 34801969 DOI: 10.1016/j.cbpa.2021.102100] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Revised: 10/07/2021] [Accepted: 10/14/2021] [Indexed: 12/24/2022]
Abstract
It is often unclear how genetic variation translates into cellular phenotypes, including how much of the coding variation can be recovered in the proteome. Proteogenomic analyses of heterogenous cell lines revealed that the genetic differences impact mostly the abundance and stoichiometry of protein complexes, with the effects propagating post-transcriptionally via protein interactions onto other subunits. Conversely, large scale binary interaction analyses of missense variants revealed that loss of interaction is widespread and caused by about 50% disease-associated mutations, while deep scanning mutagenesis of binary interactions identified thousands of interaction-deficient variants per interaction. The idea that phenotypes arise from genetic variation through protein-protein interaction is therefore substantiated by both forward and reverse interaction proteomics. With improved methodologies, these two approaches combined can close the knowledge gap between nucleotide sequence variation and its functional consequences on the cellular proteome.
Collapse
Affiliation(s)
- Natalia Kunowska
- Institute of Pharmaceutical Sciences, Pharmaceutical Chemistry, University of Graz, Austria
| | - Ulrich Stelzl
- Institute of Pharmaceutical Sciences, Pharmaceutical Chemistry, University of Graz, Austria; BioTechMed-Graz, Austria; Field of Excellence BioHealth - University of Graz, Austria.
| |
Collapse
|
36
|
Lim SH, Snider J, Birimberg‐Schwartz L, Ip W, Serralha JC, Botelho HM, Lopes‐Pacheco M, Pinto MC, Moutaoufik MT, Zilocchi M, Laselva O, Esmaeili M, Kotlyar M, Lyakisheva A, Tang P, López Vázquez L, Akula I, Aboualizadeh F, Wong V, Grozavu I, Opacak‐Bernardi T, Yao Z, Mendoza M, Babu M, Jurisica I, Gonska T, Bear CE, Amaral MD, Stagljar I. CFTR interactome mapping using the mammalian membrane two-hybrid high-throughput screening system. Mol Syst Biol 2022; 18:e10629. [PMID: 35156780 PMCID: PMC8842165 DOI: 10.15252/msb.202110629] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 01/16/2022] [Accepted: 01/19/2022] [Indexed: 12/19/2022] Open
Abstract
Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) is a chloride and bicarbonate channel in secretory epithelia with a critical role in maintaining fluid homeostasis. Mutations in CFTR are associated with Cystic Fibrosis (CF), the most common lethal autosomal recessive disorder in Caucasians. While remarkable treatment advances have been made recently in the form of modulator drugs directly rescuing CFTR dysfunction, there is still considerable scope for improvement of therapeutic effectiveness. Here, we report the application of a high-throughput screening variant of the Mammalian Membrane Two-Hybrid (MaMTH-HTS) to map the protein-protein interactions of wild-type (wt) and mutant CFTR (F508del), in an effort to better understand CF cellular effects and identify new drug targets for patient-specific treatments. Combined with functional validation in multiple disease models, we have uncovered candidate proteins with potential roles in CFTR function/CF pathophysiology, including Fibrinogen Like 2 (FGL2), which we demonstrate in patient-derived intestinal organoids has a significant effect on CFTR functional expression.
Collapse
Affiliation(s)
- Sang Hyun Lim
- Donnelly CentreUniversity of TorontoTorontoONCanada
- Department of BiochemistryUniversity of TorontoTorontoONCanada
| | - Jamie Snider
- Donnelly CentreUniversity of TorontoTorontoONCanada
| | - Liron Birimberg‐Schwartz
- Programme in Translational MedicineThe Hospital for Sick ChildrenTorontoONCanada
- Division of Gastroenterology, Hepatology and NutritionDepartment of PediatricsUniversity of TorontoTorontoONCanada
| | - Wan Ip
- Programme in Translational MedicineThe Hospital for Sick ChildrenTorontoONCanada
| | - Joana C Serralha
- Faculty of SciencesBioISI‐Biosystems and Integrative Sciences InstituteUniversity of LisboaLisboaPortugal
- Faculty of Life Sciences and MedicineSchool of Bioscience EducationKing’s College LondonLondonUK
| | - Hugo M Botelho
- Faculty of SciencesBioISI‐Biosystems and Integrative Sciences InstituteUniversity of LisboaLisboaPortugal
| | - Miquéias Lopes‐Pacheco
- Faculty of SciencesBioISI‐Biosystems and Integrative Sciences InstituteUniversity of LisboaLisboaPortugal
| | - Madalena C Pinto
- Faculty of SciencesBioISI‐Biosystems and Integrative Sciences InstituteUniversity of LisboaLisboaPortugal
| | - Mohamed Taha Moutaoufik
- Department of Biochemistry, Research and Innovation CentreUniversity of ReginaReginaSKCanada
| | - Mara Zilocchi
- Department of Biochemistry, Research and Innovation CentreUniversity of ReginaReginaSKCanada
| | - Onofrio Laselva
- Department of PhysiologyUniversity of TorontoTorontoONCanada
| | - Mohsen Esmaeili
- Program in Genetics and Genome BiologyThe Hospital for Sick ChildrenTorontoONCanada
| | - Max Kotlyar
- Osteoarthritis Research ProgramDivision of Orthopedic SurgerySchroeder Arthritis InstituteUniversity Health NetworkTorontoONCanada
- Data Science Discovery Centre for Chronic DiseasesKrembil Research InstituteUniversity Health NetworkTorontoONCanada
| | | | | | | | - Indira Akula
- Donnelly CentreUniversity of TorontoTorontoONCanada
| | | | | | - Ingrid Grozavu
- Donnelly CentreUniversity of TorontoTorontoONCanada
- Department of BiochemistryUniversity of TorontoTorontoONCanada
| | | | - Zhong Yao
- Donnelly CentreUniversity of TorontoTorontoONCanada
| | - Meg Mendoza
- Department of Molecular GeneticsUniversity of TorontoTorontoONCanada
| | - Mohan Babu
- Department of Biochemistry, Research and Innovation CentreUniversity of ReginaReginaSKCanada
| | - Igor Jurisica
- Osteoarthritis Research ProgramDivision of Orthopedic SurgerySchroeder Arthritis InstituteUniversity Health NetworkTorontoONCanada
- Data Science Discovery Centre for Chronic DiseasesKrembil Research InstituteUniversity Health NetworkTorontoONCanada
- Departments of Medical Biophysics and Computer ScienceUniversity of TorontoTorontoONCanada
- Institute of NeuroimmunologySlovak Academy of SciencesBratislavaSlovakia
| | - Tanja Gonska
- Programme in Translational MedicineThe Hospital for Sick ChildrenTorontoONCanada
- Division of Gastroenterology, Hepatology and NutritionDepartment of PediatricsUniversity of TorontoTorontoONCanada
| | - Christine E Bear
- Department of BiochemistryUniversity of TorontoTorontoONCanada
- Department of PhysiologyUniversity of TorontoTorontoONCanada
| | - Margarida D Amaral
- Faculty of SciencesBioISI‐Biosystems and Integrative Sciences InstituteUniversity of LisboaLisboaPortugal
| | - Igor Stagljar
- Donnelly CentreUniversity of TorontoTorontoONCanada
- Department of BiochemistryUniversity of TorontoTorontoONCanada
- Department of Molecular GeneticsUniversity of TorontoTorontoONCanada
- Mediterranean Institute for Life SciencesSplitCroatia
- School of MedicineUniversity of SplitSplitCroatia
| |
Collapse
|
37
|
Meldal BHM, Perfetto L, Combe C, Lubiana T, Ferreira Cavalcante JV, Bye-A-Jee H, Waagmeester A, del-Toro N, Shrivastava A, Barrera E, Wong E, Mlecnik B, Bindea G, Panneerselvam K, Willighagen E, Rappsilber J, Porras P, Hermjakob H, Orchard S. Complex Portal 2022: new curation frontiers. Nucleic Acids Res 2022; 50:D578-D586. [PMID: 34718729 PMCID: PMC8689886 DOI: 10.1093/nar/gkab991] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 10/07/2021] [Accepted: 10/10/2021] [Indexed: 01/02/2023] Open
Abstract
The Complex Portal (www.ebi.ac.uk/complexportal) is a manually curated, encyclopaedic database of macromolecular complexes with known function from a range of model organisms. It summarizes complex composition, topology and function along with links to a large range of domain-specific resources (i.e. wwPDB, EMDB and Reactome). Since the last update in 2019, we have produced a first draft complexome for Escherichia coli, maintained and updated that of Saccharomyces cerevisiae, added over 40 coronavirus complexes and increased the human complexome to over 1100 complexes that include approximately 200 complexes that act as targets for viral proteins or are part of the immune system. The display of protein features in ComplexViewer has been improved and the participant table is now colour-coordinated with the nodes in ComplexViewer. Community collaboration has expanded, for example by contributing to an analysis of putative transcription cofactors and providing data accessible to semantic web tools through Wikidata which is now populated with manually curated Complex Portal content through a new bot. Our data license is now CC0 to encourage data reuse. Users are encouraged to get in touch, provide us with feedback and send curation requests through the 'Support' link.
Collapse
Affiliation(s)
- Birgit H M Meldal
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Livia Perfetto
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Fondazione Human Technopole, 20157 Milan, Italy
| | - Colin Combe
- Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh EH9 3BF, UK
| | - Tiago Lubiana
- Department of Clinical and Toxicological Analyses, School of Pharmaceutical Sciences, University of São Paulo, Av. Professor Lineu Prestes 580, CEP 05508-000 São Paulo SP, Brasil
| | - João Vitor Ferreira Cavalcante
- Bioinformatics Multidisciplinary Environment (BioME), Digital Metropolis Institute, Federal University of Rio Grande do Norte, Av. Odilon Gomes de Lima 1722, Capim Macio, 59078-400 Natal/RN, Brasil
| | - Hema Bye-A-Jee
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | - Noemi del-Toro
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Anjali Shrivastava
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Elisabeth Barrera
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Edith Wong
- Department of Genetics, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Bernhard Mlecnik
- Laboratory of Integrative Cancer Immunology, INSERM, 75006 Paris, France
- Equipe Labellisée Ligue Contre le Cancer, 75006 Paris, France
- Centre de Recherche des Cordeliers, Sorbonne Université, Université de Paris, 75006 Paris, France
- Inovarion, 75005 Paris, France
| | - Gabriela Bindea
- Laboratory of Integrative Cancer Immunology, INSERM, 75006 Paris, France
- Equipe Labellisée Ligue Contre le Cancer, 75006 Paris, France
- Centre de Recherche des Cordeliers, Sorbonne Université, Université de Paris, 75006 Paris, France
| | - Kalpana Panneerselvam
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Egon Willighagen
- Dept of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Universiteitssingel 50, 6229 ER Maastricht, The Netherlands
| | - Juri Rappsilber
- Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh EH9 3BF, UK
- Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, 13355 Berlin, Germany
| | - Pablo Porras
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sandra Orchard
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
38
|
Porras P, Orchard S, Licata L. IMEx Databases: Displaying Molecular Interactions into a Single, Standards-Compliant Dataset. Methods Mol Biol 2022; 2449:27-42. [PMID: 35507258 DOI: 10.1007/978-1-0716-2095-3_2] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Molecular interaction databases aim to systematically capture and organize the experimental interaction information described in the scientific literature. These data can then be used to perform network analysis, to assign putative roles to uncharacterized proteins and to investigate their involvement in cellular pathways.This chapter gives a brief overview of publicly available molecular interaction databases and focuses on the members of the IMEx Consortium, on their curation policies and standard data formats. All of the goals achieved by IMEx databases over the last 15 years, the data types provided and the many different ways in which such data can be utilized by the research community, are described in detail. The IMEx databases curate molecular interaction data to the highest caliber, following a detailed curation model and supplying rich metadata by employing common curation rules and harmonized standards. The IMEx Consortium provides comprehensively annotated molecular interaction data integrated into a single, non-redundant, open access dataset.
Collapse
Affiliation(s)
- Pablo Porras
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, UK
| | - Sandra Orchard
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, UK
| | - Luana Licata
- Department of Biology, University of Rome Tor Vergata, Rome, Italy.
| |
Collapse
|
39
|
Del Toro N, Shrivastava A, Ragueneau E, Meldal B, Combe C, Barrera E, Perfetto L, How K, Ratan P, Shirodkar G, Lu O, Mészáros B, Watkins X, Pundir S, Licata L, Iannuccelli M, Pellegrini M, Martin MJ, Panni S, Duesbury M, Vallet SD, Rappsilber J, Ricard-Blum S, Cesareni G, Salwinski L, Orchard S, Porras P, Panneerselvam K, Hermjakob H. The IntAct database: efficient access to fine-grained molecular interaction data. Nucleic Acids Res 2021; 50:D648-D653. [PMID: 34761267 PMCID: PMC8728211 DOI: 10.1093/nar/gkab1006] [Citation(s) in RCA: 156] [Impact Index Per Article: 39.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 10/06/2021] [Accepted: 10/21/2021] [Indexed: 01/18/2023] Open
Abstract
The IntAct molecular interaction database (https://www.ebi.ac.uk/intact) is a curated resource of molecular interactions, derived from the scientific literature and from direct data depositions. As of August 2021, IntAct provides more than one million binary interactions, curated by twelve global partners of the International Molecular Exchange consortium, for which the IntAct database provides a shared curation and dissemination platform. The IMEx curation policy has always emphasised a fine-grained data and curation model, aiming to capture the relevant experimental detail essential for the interpretation of the provided molecular interaction data. Here, we present recent curation focus and progress, as well as a completely redeveloped website which presents IntAct data in a much more user-friendly and detailed way.
Collapse
Affiliation(s)
- Noemi Del Toro
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Anjali Shrivastava
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Eliot Ragueneau
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Birgit Meldal
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Colin Combe
- Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh EH9 3BF, UK
| | - Elisabet Barrera
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Livia Perfetto
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridgeshire CB10 1SD, UK.,Fondazione Human Technopole, Milan 20157, Italy
| | - Karyn How
- UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, CA 90095, USA
| | - Prashansa Ratan
- UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, CA 90095, USA
| | - Gautam Shirodkar
- UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, CA 90095, USA
| | - Odilia Lu
- UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, CA 90095, USA
| | - Bálint Mészáros
- Gibson Group, European Molecular Biology Laboratory, Heidelberg 69117, Germany
| | - Xavier Watkins
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Sangya Pundir
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Luana Licata
- Bioinformatics and Computational Biology Unit, Dept. of Molecular Biology, University of Rome Tor Vergata, Rome, Italy
| | - Marta Iannuccelli
- Bioinformatics and Computational Biology Unit, Dept. of Molecular Biology, University of Rome Tor Vergata, Rome, Italy
| | - Matteo Pellegrini
- Department of Molecular, Cell and Developmental Biology, University of California, Los Angeles, CA 90095, USA
| | - Maria Jesus Martin
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Simona Panni
- Dipartimento di Biologia, Ecologia e Scienze della Terra, Università della Calabria, Rende, Italy
| | - Margaret Duesbury
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridgeshire CB10 1SD, UK.,UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, CA 90095, USA
| | - Sylvain D Vallet
- ICBMS UMR CNRS 5246, University Lyon 1, Lyon, Villeurbanne 69622, France
| | - Juri Rappsilber
- Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh EH9 3BF, UK.,Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, Berlin 13355, Germany
| | - Sylvie Ricard-Blum
- ICBMS UMR CNRS 5246, University Lyon 1, Lyon, Villeurbanne 69622, France
| | - Gianni Cesareni
- Bioinformatics and Computational Biology Unit, Dept. of Molecular Biology, University of Rome Tor Vergata, Rome, Italy
| | - Lukasz Salwinski
- UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, CA 90095, USA
| | - Sandra Orchard
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Pablo Porras
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Kalpana Panneerselvam
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Henning Hermjakob
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, Cambridgeshire CB10 1SD, UK
| |
Collapse
|
40
|
Kotlyar M, Pastrello C, Ahmed Z, Chee J, Varyova Z, Jurisica I. IID 2021: towards context-specific protein interaction analyses by increased coverage, enhanced annotation and enrichment analysis. Nucleic Acids Res 2021; 50:D640-D647. [PMID: 34755877 PMCID: PMC8728267 DOI: 10.1093/nar/gkab1034] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 10/13/2021] [Accepted: 11/03/2021] [Indexed: 01/02/2023] Open
Abstract
Improved bioassays have significantly increased the rate of identifying new protein-protein interactions (PPIs), and the number of detected human PPIs has greatly exceeded early estimates of human interactome size. These new PPIs provide a more complete view of disease mechanisms but precise understanding of how PPIs affect phenotype remains a challenge. It requires knowledge of PPI context (e.g. tissues, subcellular localizations), and functional roles, especially within pathways and protein complexes. The previous IID release focused on PPI context, providing networks with comprehensive tissue, disease, cellular localization, and druggability annotations. The current update adds developmental stages to the available contexts, and provides a way of assigning context to PPIs that could not be previously annotated due to insufficient data or incompatibility with available context categories (e.g. interactions between membrane and cytoplasmic proteins). This update also annotates PPIs with conservation across species, directionality in pathways, membership in large complexes, interaction stability (i.e. stable or transient), and mutation effects. Enrichment analysis is now available for all annotations, and includes multiple options; for example, context annotations can be analyzed with respect to PPIs or network proteins. In addition to tabular view or download, IID provides online network visualization. This update is available at http://ophid.utoronto.ca/iid.
Collapse
Affiliation(s)
- Max Kotlyar
- Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute and Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, University Health Network, Toronto, ON M5T 0S8, Canada
| | - Chiara Pastrello
- Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute and Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, University Health Network, Toronto, ON M5T 0S8, Canada
| | - Zuhaib Ahmed
- Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute and Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, University Health Network, Toronto, ON M5T 0S8, Canada
| | - Justin Chee
- Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute and Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, University Health Network, Toronto, ON M5T 0S8, Canada
| | - Zofia Varyova
- Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute and Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, University Health Network, Toronto, ON M5T 0S8, Canada
| | - Igor Jurisica
- Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute and Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, University Health Network, Toronto, ON M5T 0S8, Canada.,Departments of Medical Biophysics and Computer Science, University of Toronto, Toronto, ON M5S 1A4, Canada.,Institute of Neuroimmunology, Slovak Academy of Sciences, Bratislava, Slovakia
| |
Collapse
|
41
|
Kuiper M, Bonello J, Fernández-Breis JT, Bucher P, Futschik ME, Gaudet P, Kulakovskiy IV, Licata L, Logie C, Lovering RC, Makeev VJ, Orchard S, Panni S, Perfetto L, Sant D, Schulz S, Zerbino DR, Lægreid A. The Gene Regulation Knowledge Commons: The action area of GREEKC. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2021; 1865:194768. [PMID: 34757206 DOI: 10.1016/j.bbagrm.2021.194768] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Revised: 10/18/2021] [Accepted: 10/20/2021] [Indexed: 02/08/2023]
Abstract
The COST Action Gene Regulation Ensemble Effort for the Knowledge Commons (GREEKC, CA15205, www.greekc.org) organized nine workshops in a four-year period, starting September 2016. The workshops brought together a wide range of experts from all over the world working on various parts of the knowledge cycle that is central to understanding gene regulatory mechanisms. The discussions between ontologists, curators, text miners, biologists, bioinformaticians, philosophers and computational scientists spawned a host of activities aimed to update and standardise existing knowledge management workflows, encourage new experimental approaches and thoroughly involve end-users in the process to design the Gene Regulation Knowledge Commons (GRKC). The GREEKC consortium describes its main achievements, contextualised in a state-of-the-art of current tools and resources that today represent the GRKC.
Collapse
Affiliation(s)
- Martin Kuiper
- Systems Biology Group, Department of Biology, Norwegian University of Science and Technology, Trondheim, Norway.
| | - Joseph Bonello
- Faculty of Information & Communication Technology, University of Malta, Msida, Malta
| | | | - Philipp Bucher
- Swiss Institute of Bioinformatics, Quartier Sorge, Bâtiment Amphipôle, 1015 Lausanne, Switzerland
| | - Matthias E Futschik
- Systems Biology and Bioinformatics Laboratory (SysBioLab), Centre of Marine Sciences (CCMAR), University of Algarve, 8005-139 Faro, Portugal
| | - Pascale Gaudet
- SIB Swiss Institute of Bioinformatics, 1 Rue Michel-Servet, 1204 Geneva, Switzerland
| | - Ivan V Kulakovskiy
- Institute of Protein Research, Russian Academy of Sciences, Institutskaya 4, 142290 Pushchino, Russia
| | - Luana Licata
- Department of Biology, University of Rome Tor Vergata, Rome, Italy
| | - Colin Logie
- Department of Molecular Biology, Faculty of Science, Radboud University, PO Box 9101, Nijmegen 6500HG, the Netherlands
| | - Ruth C Lovering
- Functional Gene Annotation, Pre-clinical and Fundamental Science, Institute of Cardiovascular Science, University College London, 5 University Street, London WC1E 6JF, UK
| | - Vsevolod J Makeev
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Gubkina 3, 119991 Moscow, Russia
| | - Sandra Orchard
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Simona Panni
- Department DIBEST, University of Calabria, Rende, Italy
| | - Livia Perfetto
- Fondazione Human Technopole, Department of Biology, Via Cristina Belgioioso, 171, 20157 Milan, Italy
| | - David Sant
- Department of Biomedical Informatics, University of Utah, 421 Wakara Way #140, Salt Lake City, UT 84108, United States
| | - Stefan Schulz
- Institute of Medical Informatics, Statistics and Documentation, Medical University of Graz, Auenbruggerpl. 2, Graz, Austria
| | - Daniel R Zerbino
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Astrid Lægreid
- Department of Clinical and Molecular Medicine, Norwegian University of Science and Technology, 7491 Trondheim, Norway
| | | |
Collapse
|
42
|
Arici MK, Tuncbag N. Performance Assessment of the Network Reconstruction Approaches on Various Interactomes. Front Mol Biosci 2021; 8:666705. [PMID: 34676243 PMCID: PMC8523993 DOI: 10.3389/fmolb.2021.666705] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Accepted: 07/14/2021] [Indexed: 01/04/2023] Open
Abstract
Beyond the list of molecules, there is a necessity to collectively consider multiple sets of omic data and to reconstruct the connections between the molecules. Especially, pathway reconstruction is crucial to understanding disease biology because abnormal cellular signaling may be pathological. The main challenge is how to integrate the data together in an accurate way. In this study, we aim to comparatively analyze the performance of a set of network reconstruction algorithms on multiple reference interactomes. We first explored several human protein interactomes, including PathwayCommons, OmniPath, HIPPIE, iRefWeb, STRING, and ConsensusPathDB. The comparison is based on the coverage of each interactome in terms of cancer driver proteins, structural information of protein interactions, and the bias toward well-studied proteins. We next used these interactomes to evaluate the performance of network reconstruction algorithms including all-pair shortest path, heat diffusion with flux, personalized PageRank with flux, and prize-collecting Steiner forest (PCSF) approaches. Each approach has its own merits and weaknesses. Among them, PCSF had the most balanced performance in terms of precision and recall scores when 28 pathways from NetPath were reconstructed using the listed algorithms. Additionally, the reference interactome affects the performance of the network reconstruction approaches. The coverage and disease- or tissue-specificity of each interactome may vary, which may result in differences in the reconstructed networks.
Collapse
Affiliation(s)
- M Kaan Arici
- Graduate School of Informatics, Middle East Technical University, Ankara, Turkey.,Foot and Mouth Diseases Institute, Ministry of Agriculture and Forestry, Ankara, Turkey
| | - Nurcan Tuncbag
- Chemical and Biological Engineering, College of Engineering, Koc University, Istanbul, Turkey.,School of Medicine, Koc University, Istanbul, Turkey
| |
Collapse
|
43
|
Hollander M, Do T, Will T, Helms V. Detecting Rewiring Events in Protein-Protein Interaction Networks Based on Transcriptomic Data. FRONTIERS IN BIOINFORMATICS 2021; 1:724297. [PMID: 36303788 PMCID: PMC9581068 DOI: 10.3389/fbinf.2021.724297] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2021] [Accepted: 08/23/2021] [Indexed: 12/25/2022] Open
Abstract
Proteins rarely carry out their cellular functions in isolation. Instead, eukaryotic proteins engage in about six interactions with other proteins on average. The aggregated protein interactome of an organism forms a “hairy ball”-type protein-protein interaction (PPI) network. Yet, in a typical human cell, only about half of all proteins are expressed at a particular time. Hence, it has become common practice to prune the full PPI network to the subset of expressed proteins. If RNAseq data is available, one can further resolve the specific protein isoforms present in a cell or tissue. Here, we review various approaches, software tools and webservices that enable users to construct context-specific or tissue-specific PPI networks and how these are rewired between two cellular conditions. We illustrate their different functionalities on the example of the interactions involving the human TNR6 protein. In an outlook, we describe how PPI networks may be integrated with epigenetic data or with data on the activity of splicing factors.
Collapse
|
44
|
Alborzi SZ, Ahmed Nacer A, Najjar H, Ritchie DW, Devignes MD. PPIDomainMiner: Inferring domain-domain interactions from multiple sources of protein-protein interactions. PLoS Comput Biol 2021; 17:e1008844. [PMID: 34370723 PMCID: PMC8376228 DOI: 10.1371/journal.pcbi.1008844] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Revised: 08/19/2021] [Accepted: 07/12/2021] [Indexed: 12/26/2022] Open
Abstract
Many biological processes are mediated by protein-protein interactions (PPIs). Because protein domains are the building blocks of proteins, PPIs likely rely on domain-domain interactions (DDIs). Several attempts exist to infer DDIs from PPI networks but the produced datasets are heterogeneous and sometimes not accessible, while the PPI interactome data keeps growing. We describe a new computational approach called “PPIDM” (Protein-Protein Interactions Domain Miner) for inferring DDIs using multiple sources of PPIs. The approach is an extension of our previously described “CODAC” (Computational Discovery of Direct Associations using Common neighbors) method for inferring new edges in a tripartite graph. The PPIDM method has been applied to seven widely used PPI resources, using as “Gold-Standard” a set of DDIs extracted from 3D structural databases. Overall, PPIDM has produced a dataset of 84,552 non-redundant DDIs. Statistical significance (p-value) is calculated for each source of PPI and used to classify the PPIDM DDIs in Gold (9,175 DDIs), Silver (24,934 DDIs) and Bronze (50,443 DDIs) categories. Dataset comparison reveals that PPIDM has inferred from the 2017 releases of PPI sources about 46% of the DDIs present in the 2020 release of the 3did database, not counting the DDIs present in the Gold-Standard. The PPIDM dataset contains 10,229 DDIs that are consistent with more than 13,300 PPIs extracted from the IMEx database, and nearly 23,300 DDIs (27.5%) that are consistent with more than 214,000 human PPIs extracted from the STRING database. Examples of newly inferred DDIs covering more than 10 PPIs in the IMEx database are provided. Further exploitation of the PPIDM DDI reservoir includes the inventory of possible partners of a protein of interest and characterization of protein interactions at the domain level in combination with other methods. The result is publicly available at http://ppidm.loria.fr/. We revisit at a large scale the question of inferring DDIs from PPIs. Compared to previous studies, we take a unified approach accross multiple sources of PPIs. This approach is a method for inferring new edges in a tripartite graph setting and can be compared to link prediction approaches in knowledge graphs. Aggregation of several sources is performed using an optimized weighted average of the individual scores calculated in each source. A huge dataset of over 84K DDIs is produced which far exceeds the previous datasets. We show that a significant portion of the PPIDM dataset covers a large number of PPIs from curated (IMEx) or non curated (STRING) databases. Such a reservoir of DDIs deserves further exploration and can be combined with high-throughput methods such as cross-linking mass spectrometry to identify plausible protein partners of proteins of interest.
Collapse
|
45
|
Oyagawa CRM, Grimsey NL. Cannabinoid receptor CB 1 and CB 2 interacting proteins: Techniques, progress and perspectives. Methods Cell Biol 2021; 166:83-132. [PMID: 34752341 DOI: 10.1016/bs.mcb.2021.06.011] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Cannabinoid receptors 1 and 2 (CB1 and CB2) are implicated in a range of physiological processes and have gained attention as promising therapeutic targets for a number of diseases. Protein-protein interactions play an integral role in modulating G protein-coupled receptor (GPCR) expression, subcellular distribution and signaling, and the identification and characterization of these will not only improve our understanding of GPCR function and biology, but may provide a novel avenue for therapeutic intervention. A variety of techniques are currently being used to investigate GPCR protein-protein interactions, including Förster/fluorescence and bioluminescence resonance energy transfer (FRET and BRET), proximity ligation assay (PLA), and bimolecular fluorescence complementation (BiFC). However, the reliable application of these methodologies is dependent on the use of appropriate controls and the consideration of the physiological context. Though not as extensively characterized as some other GPCRs, the investigation of CB1 and CB2 interacting proteins is a growing area of interest, and a range of interacting partners have been identified to date. This review summarizes the current state of the literature regarding the cannabinoid receptor interactome, provides commentary on the methodologies and techniques utilized, and discusses future perspectives.
Collapse
Affiliation(s)
- Caitlin R M Oyagawa
- Department of Pharmacology and Clinical Pharmacology, School of Medical Sciences, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand; Centre for Brain Research, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand; Maurice Wilkins Centre for Molecular Biodiscovery, Auckland, New Zealand
| | - Natasha L Grimsey
- Department of Pharmacology and Clinical Pharmacology, School of Medical Sciences, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand; Centre for Brain Research, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand; Maurice Wilkins Centre for Molecular Biodiscovery, Auckland, New Zealand.
| |
Collapse
|
46
|
Berthollier C, Vallet SD, Deniaud M, Clerc O, Ricard-Blum S. Building Protein-Protein and Protein-Glycosaminoglycan Interaction Networks Using MatrixDB, the Extracellular Matrix Interaction Database. Curr Protoc 2021; 1:e47. [PMID: 33794052 DOI: 10.1002/cpz1.47] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The interaction database MatrixDB reports protein-protein and protein-glycosaminoglycan interactions in human, mammalian, and model organisms, involving at least one extracellular matrix (ECM) constituent, namely full-length proteins, ECM multimeric proteins considered as stable complexes, proteoglycans, glycosaminoglycans (GAGs), and bioactive fragments called matricryptins, which are released upon limited proteolysis of ECM proteins. The current version of MatrixDB (as of October 2020) contains 106,543 experimentally supported interactions, with all types of biomolecules combined. MatrixDB is the only database focusing on the curation of ECM protein and GAG interactions. The iNavigator integrated in MatrixDB allows users to build interaction networks online and to filter them according to expression data, quantitative proteomics data, or interaction detection methods. MatrixDB belongs to the International Molecular Exchange (IMEx) consortium, and uses its curation rules to capture interaction data, which are available in standardized exchange formats according to the Human Proteome Organization-Proteomics Standards Initiative (HUPO-PSI). © 2021 Wiley Periodicals LLC. Basic Protocol 1: Browse MatrixDB Basic Protocol 2: Create a list of biomolecules of interest to build interaction networks Basic Protocol 3: Build and export interaction networks of selected biomolecules using the iNavigator Basic Protocol 4: Build specific interaction networks using the iNavigator widgets Basic Protocol 5: Generate 3D models of glycosaminoglycan oligosaccharides using the GAG Builder tool.
Collapse
Affiliation(s)
- Coline Berthollier
- Univ Lyon, University Lyon 1, CNRS, ICBMS, UMR 5246, F-69622, Villeurbanne, France
| | - Sylvain D Vallet
- Univ Lyon, University Lyon 1, CNRS, ICBMS, UMR 5246, F-69622, Villeurbanne, France
| | - Madeline Deniaud
- Univ Lyon, University Lyon 1, CNRS, ICBMS, UMR 5246, F-69622, Villeurbanne, France
| | - Olivier Clerc
- Univ Lyon, University Lyon 1, CNRS, ICBMS, UMR 5246, F-69622, Villeurbanne, France
| | - Sylvie Ricard-Blum
- Univ Lyon, University Lyon 1, CNRS, ICBMS, UMR 5246, F-69622, Villeurbanne, France
| |
Collapse
|
47
|
Cesareni G, Sacco F, Perfetto L. Assembling Disease Networks From Causal Interaction Resources. Front Genet 2021; 12:694468. [PMID: 34178043 PMCID: PMC8226215 DOI: 10.3389/fgene.2021.694468] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Accepted: 05/19/2021] [Indexed: 12/27/2022] Open
Abstract
The development of high-throughput high-content technologies and the increased ease in their application in clinical settings has raised the expectation of an important impact of these technologies on diagnosis and personalized therapy. Patient genomic and expression profiles yield lists of genes that are mutated or whose expression is modulated in specific disease conditions. The challenge remains of extracting from these lists functional information that may help to shed light on the mechanisms that are perturbed in the disease, thus setting a rational framework that may help clinical decisions. Network approaches are playing an increasing role in the organization and interpretation of patients' data. Biological networks are generated by connecting genes or gene products according to experimental evidence that demonstrates their interactions. Till recently most approaches have relied on networks based on physical interactions between proteins. Such networks miss an important piece of information as they lack details on the functional consequences of the interactions. Over the past few years, a number of resources have started collecting causal information of the type protein A activates/inactivates protein B, in a structured format. This information may be represented as signed directed graphs where physiological and pathological signaling can be conveniently inspected. In this review we will (i) present and compare these resources and discuss the different scope in comparison with pathway resources; (ii) compare resources that explicitly capture causality in terms of data content and proteome coverage (iii) review how causal-graphs can be used to extract disease-specific Boolean networks.
Collapse
Affiliation(s)
- Gianni Cesareni
- Department of Biology, University of Rome Tor Vergata, Rome, Italy
| | - Francesca Sacco
- Department of Biology, University of Rome Tor Vergata, Rome, Italy
| | - Livia Perfetto
- Department of Biology, Fondazione Human Technopole, Milan, Italy
| |
Collapse
|
48
|
Ragueneau E, Shrivastava A, Morris JH, Del-Toro N, Hermjakob H, Porras P. IntAct App: a Cytoscape application for molecular interaction network visualisation and analysis. Bioinformatics 2021; 37:3684-3685. [PMID: 33961020 PMCID: PMC8545338 DOI: 10.1093/bioinformatics/btab319] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Revised: 04/08/2021] [Accepted: 04/27/2021] [Indexed: 01/24/2023] Open
Abstract
Summary IntAct App is a Cytoscape 3 application that grants in-depth access to IntAct’s molecular interaction data. It build networks where nodes are interacting molecules (mainly proteins, but also genes, RNA, chemicals…) and edges represent evidence of interaction. Users can query a network by providing its molecules, identified by different fields and optionally include all their interacting partners in the resulting network. The app offers three visualizations: one only displaying interactions, another representing every evidence and the last one emphasizing evidence where mutated versions of proteins were used. Users can also filter networks and click on nodes and edges to access all their related details. Finally, the application supports automation of its main features via Cytoscape commands. Availability and implementation Implementation available at https://apps.cytoscape.org/apps/intactapp, while the source code is available at https://github.com/EBI-IntAct/IntactApp.
Collapse
Affiliation(s)
- Eliot Ragueneau
- EMBL-EBI, IntAct Team, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| | - Anjali Shrivastava
- EMBL-EBI, IntAct Team, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| | - John H Morris
- Resource on Biocomputing, Visualization, and Informatics, University of California, San Francisco, California 94158-2517, United States
| | - Noemi Del-Toro
- EMBL-EBI, IntAct Team, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| | - Henning Hermjakob
- EMBL-EBI, IntAct Team, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| | - Pablo Porras
- EMBL-EBI, IntAct Team, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| |
Collapse
|
49
|
Feuermann M, Boutet E, Morgat A, Axelsen KB, Bansal P, Bolleman J, de Castro E, Coudert E, Gasteiger E, Géhant S, Lieberherr D, Lombardot T, Neto TB, Pedruzzi I, Poux S, Pozzato M, Redaschi N, Bridge A, on behalf of the UniProt Consortium. Diverse Taxonomies for Diverse Chemistries: Enhanced Representation of Natural Product Metabolism in UniProtKB. Metabolites 2021; 11:48. [PMID: 33445429 PMCID: PMC7827101 DOI: 10.3390/metabo11010048] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2020] [Revised: 01/05/2021] [Accepted: 01/07/2021] [Indexed: 01/28/2023] Open
Abstract
The UniProt Knowledgebase UniProtKB is a comprehensive, high-quality, and freely accessible resource of protein sequences and functional annotation that covers genomes and proteomes from tens of thousands of taxa, including a broad range of plants and microorganisms producing natural products of medical, nutritional, and agronomical interest. Here we describe work that enhances the utility of UniProtKB as a support for both the study of natural products and for their discovery. The foundation of this work is an improved representation of natural product metabolism in UniProtKB using Rhea, an expert-curated knowledgebase of biochemical reactions, that is built on the ChEBI (Chemical Entities of Biological Interest) ontology of small molecules. Knowledge of natural products and precursors is captured in ChEBI, enzyme-catalyzed reactions in Rhea, and enzymes in UniProtKB/Swiss-Prot, thereby linking chemical structure data directly to protein knowledge. We provide a practical demonstration of how users can search UniProtKB for protein knowledge relevant to natural products through interactive or programmatic queries using metabolite names and synonyms, chemical identifiers, chemical classes, and chemical structures and show how to federate UniProtKB with other data and knowledge resources and tools using semantic web technologies such as RDF and SPARQL. All UniProtKB data are freely available for download in a broad range of formats for users to further mine or exploit as an annotation source, to enrich other natural product datasets and databases.
Collapse
Affiliation(s)
- Marc Feuermann
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Emmanuel Boutet
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Anne Morgat
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Kristian B. Axelsen
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Parit Bansal
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Jerven Bolleman
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Edouard de Castro
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Elisabeth Coudert
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Elisabeth Gasteiger
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Sébastien Géhant
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Damien Lieberherr
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Thierry Lombardot
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Teresa B. Neto
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Ivo Pedruzzi
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Sylvain Poux
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Monica Pozzato
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Nicole Redaschi
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - Alan Bridge
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
| | - on behalf of the UniProt Consortium
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 Michel-Servet, CH-1211 Geneva 4, Switzerland; (A.M.); (K.B.A.); (P.B.); (J.B.); (E.d.C.); (E.C.); (E.G.); (S.G.); (D.L.); (T.L.); (T.B.N.); (I.P.); (S.P.); (M.P.); (N.R.); (A.B.)
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Protein Information Resource, University of Delaware, 15 Innovation Way, Suite 205, Newark, DE 19711, USA
- Protein Information Resource, Georgetown University Medical Center, 3300 Whitehaven Street NorthWest, Suite 1200, Washington, DC 20007, USA
| |
Collapse
|
50
|
Ali SA, Pastrello C, Kaur N, Peffers MJ, Ormseth MJ, Jurisica I. A Network Biology Approach to Understanding the Tissue-Specific Roles of Non-Coding RNAs in Arthritis. Front Endocrinol (Lausanne) 2021; 12:744747. [PMID: 34803912 PMCID: PMC8595833 DOI: 10.3389/fendo.2021.744747] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Accepted: 10/14/2021] [Indexed: 12/31/2022] Open
Abstract
Discovery of non-coding RNAs continues to provide new insights into some of the key molecular drivers of musculoskeletal diseases. Among these, microRNAs have received widespread attention for their roles in osteoarthritis and rheumatoid arthritis. With evidence to suggest that long non-coding RNAs and circular RNAs function as competing endogenous RNAs to sponge microRNAs, the net effect on gene expression in specific disease contexts can be elusive. Studies to date have focused on elucidating individual long non-coding-microRNA-gene target axes and circular RNA-microRNA-gene target axes, with a paucity of data integrating experimentally validated effects of non-coding RNAs. To address this gap, we curated recent studies reporting non-coding RNA axes in chondrocytes from human osteoarthritis and in fibroblast-like synoviocytes from human rheumatoid arthritis. Using an integrative computational biology approach, we then combined the findings into cell- and disease-specific networks for in-depth interpretation. We highlight some challenges to data integration, including non-existent naming conventions and out-of-date databases for non-coding RNAs, and some successes exemplified by the International Molecular Exchange Consortium for protein interactions. In this perspective article, we suggest that data integration is a useful in silico approach for creating non-coding RNA networks in arthritis and prioritizing interactions for further in vitro and in vivo experimentation in translational research.
Collapse
Affiliation(s)
- Shabana Amanda Ali
- Bone and Joint Center, Department of Orthopaedic Surgery, Henry Ford Health System, Detroit, MI, United States
- Center for Molecular Medicine and Genetics, Wayne State University, Detroit, MI, United States
- *Correspondence: Shabana Amanda Ali, ; Igor Jurisica,
| | - Chiara Pastrello
- Osteoarthritis Research Program, Division of Orthopaedics, Schroeder Arthritis Institute, University Health Network, Toronto, ON, Canada
- Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, University Health Network, Toronto, ON, Canada
| | - Navdeep Kaur
- Bone and Joint Center, Department of Orthopaedic Surgery, Henry Ford Health System, Detroit, MI, United States
| | - Mandy J. Peffers
- Department of Musculoskeletal Biology, Institute of Life Course and Medical Sciences, University of Liverpool, Liverpool, United Kingdom
| | - Michelle J. Ormseth
- Department of Research and Development, Veterans Affairs Medical Center, Nashville, TN, United States
| | - Igor Jurisica
- Osteoarthritis Research Program, Division of Orthopaedics, Schroeder Arthritis Institute, University Health Network, Toronto, ON, Canada
- Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, University Health Network, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Institute of Neuroimmunology, Slovak Academy of Sciences, Bratislava, Slovakia
- *Correspondence: Shabana Amanda Ali, ; Igor Jurisica,
| |
Collapse
|