1
|
Díaz-Santiago E, Moya-García AA, Pérez-García J, Yahyaoui R, Orengo C, Pazos F, Perkins JR, Ranea JAG. Better understanding the phenotypic effects of drugs through shared targets in genetic disease networks. Front Pharmacol 2025; 15:1470931. [PMID: 39911831 PMCID: PMC11794328 DOI: 10.3389/fphar.2024.1470931] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2024] [Accepted: 12/12/2024] [Indexed: 02/07/2025] Open
Abstract
Introduction Most drugs fail during development and there is a clear and unmet need for approaches to better understand mechanistically how drugs exert both their intended and adverse effects. Gaining traction in this field is the use of disease data linking genes with pathological phenotypes and combining this with drugtarget interaction data. Methods We introduce methodology to associate drugs with effects, both intended and adverse, using a tripartite network approach that combines drug-target and target-phenotype data, in which targets can be represented as proteins and protein domains. Results We were able to detect associations for over 140,000 ChEMBL drugs and 3,800 phenotypes, represented as Human Phenotype Ontology (HPO) terms. The overlap of these results with the SIDER databases of known drug side effects was up to 10 times higher than random, depending on the target type, disease database and score threshold used. In terms of overlap with drug-phenotype pairs extracted from the literature, the performance of our methodology was up to 17.47 times greater than random. The top results include phenotype-drug associations that represent intended effects, particularly for cancers such as chronic myelogenous leukemia, which was linked with nilotinib. They also include adverse side effects, such as blurred vision being linked with tetracaine. Discussion This work represents an important advance in our understanding of how drugs cause intended and adverse side effects through their action on disease causing genes and has potential applications for drug development and repositioning.
Collapse
Affiliation(s)
- Elena Díaz-Santiago
- Department of Molecular Biology and Biochemistry, University of Malaga, Malaga, Spain
| | | | - Jesús Pérez-García
- Department of Molecular Biology and Biochemistry, University of Malaga, Malaga, Spain
| | - Raquel Yahyaoui
- Laboratory of Inherited Metabolic Diseases and Newborn Screening, Malaga Regional University Hospital, Malaga, Spain
- Instituto de Investigación Biomédica de Málaga y Plataforma en Nanomedicina-IBIMA Plataforma BIONAND, Malaga, Spain
| | - Christine Orengo
- Department of Structural and Molecular Biology, University College London, London, United Kingdom
| | - Florencio Pazos
- Computational Systems Biology Group, Systems Biology Department, National Centre for Biotechnology (CNB-CSIC), Madrid, Spain
| | - James R. Perkins
- Department of Molecular Biology and Biochemistry, University of Malaga, Malaga, Spain
- Instituto de Investigación Biomédica de Málaga y Plataforma en Nanomedicina-IBIMA Plataforma BIONAND, Malaga, Spain
- CIBER de Enfermedades Raras, Instituto de Salud Carlos III, Madrid, Spain
| | - Juan A. G. Ranea
- Department of Molecular Biology and Biochemistry, University of Malaga, Malaga, Spain
- Instituto de Investigación Biomédica de Málaga y Plataforma en Nanomedicina-IBIMA Plataforma BIONAND, Malaga, Spain
- CIBER de Enfermedades Raras, Instituto de Salud Carlos III, Madrid, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES), Instituto de Salud Carlos III (ISCIII), Madrid, Spain
| |
Collapse
|
2
|
Alías-Segura S, Pazos F, Chagoyen M. Differential expression and co-expression reveal cell types relevant to genetic disorder phenotypes. Bioinformatics 2024; 40:btae646. [PMID: 39468724 PMCID: PMC11549017 DOI: 10.1093/bioinformatics/btae646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2024] [Revised: 10/22/2024] [Accepted: 10/25/2024] [Indexed: 10/30/2024] Open
Abstract
MOTIVATION Knowledge of the specific cell types affected by genetic alterations in rare diseases is crucial for advancing diagnostics and treatments. Despite significant progress, the cell types involved in the majority of rare disease manifestations remain largely unknown. In this study, we integrated scRNA-seq data from non-diseased samples with known genetic disorder genes and phenotypic information to predict the specific cell types disrupted by pathogenic mutations for 482 disease phenotypes. RESULTS We found significant phenotype-cell type associations focusing on differential expression and co-expression mechanisms. Our analysis revealed that 13% of the associations documented in the literature were captured through differential expression, while 42% were elucidated through co-expression analysis, also uncovering potential new associations. These findings underscore the critical role of cellular context in disease manifestation and highlight the potential of single-cell data for the development of cell-aware diagnostics and targeted therapies for rare diseases. AVAILABILITY AND IMPLEMENTATION All code generated in this work is available at https://github.com/SergioAlias/sc-coex.
Collapse
Affiliation(s)
- Sergio Alías-Segura
- Computational Systems Biology Group, Centro Nacional de Biotecnología (CNB-CSIC), Madrid, 28049, Spain
- Department of Molecular Biology and Biochemistry, Science Faculty, University of Málaga, Málaga, 29071, Spain
| | - Florencio Pazos
- Computational Systems Biology Group, Centro Nacional de Biotecnología (CNB-CSIC), Madrid, 28049, Spain
| | - Monica Chagoyen
- Computational Systems Biology Group, Centro Nacional de Biotecnología (CNB-CSIC), Madrid, 28049, Spain
| |
Collapse
|
3
|
Novoa J, Fernandez-Dumont A, Mills ENC, Moreno FJ, Pazos F. Advancing the allergenicity assessment of new proteins using a text mining resource. Food Chem Toxicol 2024; 187:114638. [PMID: 38582341 DOI: 10.1016/j.fct.2024.114638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Revised: 03/11/2024] [Accepted: 03/31/2024] [Indexed: 04/08/2024]
Abstract
With a society increasingly demanding alternative protein food sources, new strategies for evaluating protein safety issues, such as allergenic potential, are needed. Large-scale and systemic studies on allergenic proteins are hindered by the limited and non-harmonized clinical information available for these substances in dedicated databases. A missing key information is that representing the symptomatology of the allergens, especially given in terms of standard vocabularies, that would allow connecting with other biomedical resources to carry out different studies related to human health. In this work, we have generated the first resource with a comprehensive annotation of allergens' symptomatology, using a text-mining approach that extracts significant co-mentions between these entities from the scientific literature (PubMed, ∼36 million abstracts). The method identifies statistically significant co-mentions between the textual descriptions of the two types of entities in the literature as indication of relationship. 1,180 clinical signs extracted from the Human Phenotype Ontology, the Medical Subject Heading terms of PubMed together with other allergen-specific symptoms, were linked to 1,036 unique allergens annotated in two main allergen-related public databases via 14,009 relationships. This novel resource, publicly available through an interactive web interface, could serve as a starting point for future manually curated compilation of allergen symptomatology.
Collapse
Affiliation(s)
- Jorge Novoa
- Computational Systems Biology Group, National Centre for Biotechnology (CNB-CSIC), 28049, Madrid, Spain
| | | | - E N Clare Mills
- School of Biosciences and Medicine, The University of Surrey, Guildford, GU2 7XH, UK
| | - F Javier Moreno
- Instituto de Investigación en Ciencias de La Alimentación (CIAL), CSIC-UAM, CEI (UAM+CSIC), 28049, Madrid, Spain.
| | - Florencio Pazos
- Computational Systems Biology Group, National Centre for Biotechnology (CNB-CSIC), 28049, Madrid, Spain.
| |
Collapse
|
4
|
Novoa J, López-Ibáñez J, Chagoyen M, Ranea JAG, Pazos F. CoMentG: comprehensive retrieval of generic relationships between biomedical concepts from the scientific literature. Database (Oxford) 2024; 2024:baae025. [PMID: 38564426 PMCID: PMC10986793 DOI: 10.1093/database/baae025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 03/01/2024] [Accepted: 03/15/2024] [Indexed: 04/04/2024]
Abstract
The CoMentG resource contains millions of relationships between terms of biomedical interest obtained from the scientific literature. At the core of the system is a methodology for detecting significant co-mentions of concepts in the entire PubMed corpus. That method was applied to nine sets of terms covering the most important classes of biomedical concepts: diseases, symptoms/clinical signs, molecular functions, biological processes, cellular compartments, anatomic parts, cell types, bacteria and chemical compounds. We obtained more than 7 million relationships between more than 74 000 terms, and many types of relationships were not available in any other resource. As the terms were obtained from widely used resources and ontologies, the relationships are given using the standard identifiers provided by them and hence can be linked to other data. A web interface allows users to browse these associations, searching for relationships for a set of terms of interests provided as input, such as between a disease and their associated symptoms, underlying molecular processes or affected tissues. The results are presented in an interactive interface where the user can explore the reported relationships in different ways and follow links to other resources. Database URL: https://csbg.cnb.csic.es/CoMentG/.
Collapse
Affiliation(s)
- Jorge Novoa
- Computational Systems Biology, National Center for Biotechnology (CNB-CSIC), c/ Darwin, 3., Madrid 28049 , Spain
| | - Javier López-Ibáñez
- Computational Systems Biology, National Center for Biotechnology (CNB-CSIC), c/ Darwin, 3., Madrid 28049 , Spain
| | - Mónica Chagoyen
- Computational Systems Biology, National Center for Biotechnology (CNB-CSIC), c/ Darwin, 3., Madrid 28049 , Spain
| | - Juan A G Ranea
- Department of Molecular Biology and Biochemistry, University of Málaga, Avda. Cervantes, 2., Málaga 29071, Spain
- CIBER de Enfermedades Raras (CIBERER), Instituto de Salud Carlos III, Madrid, Spain
- Institute of Biomedical Research in Malaga and platform of nanomedicine (IBIMA platform BIONAND), Malaga 29071, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES), Barcelona 08034, Spain
| | - Florencio Pazos
- Computational Systems Biology, National Center for Biotechnology (CNB-CSIC), c/ Darwin, 3., Madrid 28049 , Spain
| |
Collapse
|
5
|
Lopez-Ibañez J, Pazos F, Chagoyen M. MBROLE3: improved functional enrichment of chemical compounds for metabolomics data analysis. Nucleic Acids Res 2023:7161529. [PMID: 37178003 DOI: 10.1093/nar/gkad405] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 04/17/2023] [Accepted: 05/03/2023] [Indexed: 05/15/2023] Open
Abstract
MBROLE (Metabolites Biological Role) facilitates the biological interpretation of metabolomics experiments. It performs enrichment analysis of a set of chemical compounds through statistical analysis of annotations from several databases. The original MBROLE server was released in 2011 and, since then, different groups worldwide have used it to analyze metabolomics experiments from a variety of organisms. Here we present the latest version of the system, MBROLE3, accessible at http://csbg.cnb.csic.es/mbrole3. This new version contains updated annotations from previously included databases as well as a wide variety of new functional annotations, such as additional pathway databases and Gene Ontology terms. Of special relevance is the inclusion of a new category of annotations, 'indirect annotations', extracted from the scientific literature and from curated chemical-protein associations. The latter allows to analyze enriched annotations of the proteins known to interact with the set of chemical compounds of interest. Results are provided in the form of interactive tables, formatted data to download, and graphical plots.
Collapse
Affiliation(s)
- Javier Lopez-Ibañez
- Computational Systems Biology Group, National Center for Biotechnology (CNB-CSIC), 28049 Madrid, Spain
| | - Florencio Pazos
- Computational Systems Biology Group, National Center for Biotechnology (CNB-CSIC), 28049 Madrid, Spain
| | - Monica Chagoyen
- Computational Systems Biology Group, National Center for Biotechnology (CNB-CSIC), 28049 Madrid, Spain
| |
Collapse
|
6
|
Network-Based Methods for Approaching Human Pathologies from a Phenotypic Point of View. Genes (Basel) 2022; 13:genes13061081. [PMID: 35741843 PMCID: PMC9222217 DOI: 10.3390/genes13061081] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 06/10/2022] [Accepted: 06/14/2022] [Indexed: 01/27/2023] Open
Abstract
Network and systemic approaches to studying human pathologies are helping us to gain insight into the molecular mechanisms of and potential therapeutic interventions for human diseases, especially for complex diseases where large numbers of genes are involved. The complex human pathological landscape is traditionally partitioned into discrete “diseases”; however, that partition is sometimes problematic, as diseases are highly heterogeneous and can differ greatly from one patient to another. Moreover, for many pathological states, the set of symptoms (phenotypes) manifested by the patient is not enough to diagnose a particular disease. On the contrary, phenotypes, by definition, are directly observable and can be closer to the molecular basis of the pathology. These clinical phenotypes are also important for personalised medicine, as they can help stratify patients and design personalised interventions. For these reasons, network and systemic approaches to pathologies are gradually incorporating phenotypic information. This review covers the current landscape of phenotype-centred network approaches to study different aspects of human diseases.
Collapse
|
7
|
Computational Resources for Molecular Biology 2022. J Mol Biol 2022; 434:167625. [PMID: 35569508 DOI: 10.1016/j.jmb.2022.167625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|