1
|
Arowolo O, Salemme V, Suvorov A. Towards Whole Health Toxicology: In-Silico Prediction of Diseases Sensitive to Multi-Chemical Exposures. TOXICS 2022; 10:764. [PMID: 36548597 PMCID: PMC9784704 DOI: 10.3390/toxics10120764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 11/15/2022] [Accepted: 12/07/2022] [Indexed: 06/17/2023]
Abstract
Chemical exposures from diverse sources merge on a limited number of molecular pathways described as toxicity pathways. Changes in the same set of molecular pathways in different cell and tissue types may generate seemingly unrelated health conditions. Today, no approaches are available to predict in an unbiased way sensitivities of different disease states and their combinations to multi-chemical exposures across the exposome. We propose an inductive in-silico workflow where sensitivities of genes to chemical exposures are identified based on the overlap of existing genomic datasets, and data on sensitivities of individual genes is further used to sequentially derive predictions on sensitivities of molecular pathways, disease states, and groups of disease states (syndromes). Our analysis predicts that conditions representing the most significant public health problems are among the most sensitive to cumulative chemical exposures. These conditions include six leading types of cancer in the world (prostatic, breast, stomach, lung, colorectal neoplasms, and hepatocellular carcinoma), obesity, type 2 diabetes, non-alcoholic fatty liver disease, autistic disorder, Alzheimer's disease, hypertension, heart failure, brain and myocardial ischemia, and myocardial infarction. Overall, our predictions suggest that environmental risk factors may be underestimated for the most significant public health problems.
Collapse
Affiliation(s)
- Olatunbosun Arowolo
- Department of Environmental Health Sciences, School of Public Health and Health Sciences, University of Massachusetts, 686 North Pleasant Street, Amherst, MA 01003, USA
| | - Victoria Salemme
- Department of Pharmacology, University of California, 1275 Med Science, Davis, CA 95616, USA
| | - Alexander Suvorov
- Department of Environmental Health Sciences, School of Public Health and Health Sciences, University of Massachusetts, 686 North Pleasant Street, Amherst, MA 01003, USA
| |
Collapse
|
2
|
Iturria-Medina Y, Adewale Q, Khan AF, Ducharme S, Rosa-Neto P, O’Donnell K, Petyuk VA, Gauthier S, De Jager PL, Breitner J, Bennett DA. Unified epigenomic, transcriptomic, proteomic, and metabolomic taxonomy of Alzheimer's disease progression and heterogeneity. SCIENCE ADVANCES 2022; 8:eabo6764. [PMID: 36399579 PMCID: PMC9674284 DOI: 10.1126/sciadv.abo6764] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Accepted: 09/30/2022] [Indexed: 06/16/2023]
Abstract
Alzheimer's disease (AD) is a heterogeneous disorder with abnormalities in multiple biological domains. In an advanced machine learning analysis of postmortem brain and in vivo blood multi-omics molecular data (N = 1863), we integrated epigenomic, transcriptomic, proteomic, and metabolomic profiles into a multilevel biological AD taxonomy. We obtained a personalized multilevel molecular index of AD dementia progression that predicts severity of neuropathologies, and identified three robust molecular-based subtypes that explain much of the pathologic and clinical heterogeneity of AD. These subtypes present distinct patterns of alteration in DNA methylation, RNA, proteins, and metabolites, identifiable in the brain and subsequently in blood. In addition, the genetic variations that predispose to the various AD subtypes in brain predict distinct spatial patterns of alteration in cell types, suggesting a unique influence of each putative AD variant on neuropathological mechanisms. These observations support that an individually tailored multi-omics molecular taxonomy of AD may represent distinct targets for preventive or treatment interventions.
Collapse
Affiliation(s)
- Yasser Iturria-Medina
- Neurology and Neurosurgery Department, Montreal Neurological Institute, Montreal, Canada
- McConnell Brain Imaging Centre, Montreal Neurological Institute, Montreal, Canada
- Ludmer Centre for Neuroinformatics and Mental Health, Montreal, Canada
| | - Quadri Adewale
- Neurology and Neurosurgery Department, Montreal Neurological Institute, Montreal, Canada
- McConnell Brain Imaging Centre, Montreal Neurological Institute, Montreal, Canada
- Ludmer Centre for Neuroinformatics and Mental Health, Montreal, Canada
| | - Ahmed F. Khan
- Neurology and Neurosurgery Department, Montreal Neurological Institute, Montreal, Canada
- McConnell Brain Imaging Centre, Montreal Neurological Institute, Montreal, Canada
- Ludmer Centre for Neuroinformatics and Mental Health, Montreal, Canada
| | - Simon Ducharme
- McConnell Brain Imaging Centre, Montreal Neurological Institute, Montreal, Canada
- Douglas Mental Health University Institute, Department of Psychiatry, McGill University, Montreal, Canada
| | - Pedro Rosa-Neto
- McGill University Research Centre for Studies in Aging, Douglas Research Centre, Montreal, Canada
| | - Kieran O’Donnell
- Ludmer Centre for Neuroinformatics and Mental Health, Montreal, Canada
- Yale School of Medicine, New Haven, CT 06519, USA
| | - Vladislav A. Petyuk
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Serge Gauthier
- McGill University Research Centre for Studies in Aging, Douglas Research Centre, Montreal, Canada
| | - Philip L. De Jager
- Center for Translational and Computational Neuroimmunology, Department of Neurology and Taub Institute for Research on Alzheimer’s Disease and the Aging Brain, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - John Breitner
- Centre for Studies on Prevention of Alzheimer’s Disease (StoP-AD), Douglas Research Centre, Montreal, Canada
- Department of Psychiatry, McGill University, Montreal, Canada
| | - David A. Bennett
- Rush Alzheimer’s Disease Center, Rush University Medical Center, Chicago, IL 60612, USA
- Department of Neurological Sciences, Rush University Medical Center, Chicago, IL 60612, USA
| |
Collapse
|
3
|
Tan Y, Yang C, Wei X, Chen C, Li L, Zheng X. Enhancing Recommendation with Automated Tag Taxonomy Construction in Hyperbolic Space. PROCEEDINGS. INTERNATIONAL CONFERENCE ON DATA ENGINEERING 2022; 2022:1180-1192. [PMID: 38352961 PMCID: PMC10863611 DOI: 10.1109/icde53745.2022.00093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/16/2024]
Abstract
The sparse interactions between users and items on the web have aggravated the difficulty of their representations in recommender systems. Existing approaches leverage tags to alleviate the data sparsity problem, so as to enhance the performance and interpretability of recommendation. However, directly using flat item tags fails to fully exploit the hierarchical relations in data, but tag taxonomies are not always available. To this end, we propose TaxoRec to jointly construct a tag taxonomy automatically and perform recommendation accurately in hyperbolic space. Specifically, we first leverage hyperbolic space and enable the optimization of a discrete taxonomy structure via a representation-aware scoring function and an adaptive clustering algorithm, and preserve the hierarchical structure for interpretability. Then, we propose to capture the complex relations among users, items, and tags in a unified hyperbolic metric space, where a novel tag-enhanced aggregation mechanism and tag-enhanced metric learning algorithm for users and items are defined. Extensive experiments on four real-world benchmark datasets show drastic performance gains brought by our proposed TaxoRec framework, which constantly achieves an average of 7.76% improvement over the state-of-the-art baselines regarding both Recall and NDCG metrics. Insightful case studies also show that our automatically constructed tag taxonomies are highly accurate and interpretable.
Collapse
Affiliation(s)
- Yanchao Tan
- College of Computer Science, Zhejiang University, Hangzhou, China
| | - Carl Yang
- Department of Computer Science, Emory University, Atlanta, United States
| | - Xiangyu Wei
- College of Computer Science, Zhejiang University, Hangzhou, China
| | - Chaochao Chen
- College of Computer Science, Zhejiang University, Hangzhou, China
| | - Longfei Li
- Ant Financial Services Group, Hangzhou, China
| | - Xiaolin Zheng
- College of Computer Science, Zhejiang University, Hangzhou, China
| |
Collapse
|
4
|
Xiang J, Zhang J, Zhao Y, Wu FX, Li M. Biomedical data, computational methods and tools for evaluating disease-disease associations. Brief Bioinform 2022; 23:6522999. [PMID: 35136949 DOI: 10.1093/bib/bbac006] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 01/04/2022] [Accepted: 01/05/2022] [Indexed: 12/12/2022] Open
Abstract
In recent decades, exploring potential relationships between diseases has been an active research field. With the rapid accumulation of disease-related biomedical data, a lot of computational methods and tools/platforms have been developed to reveal intrinsic relationship between diseases, which can provide useful insights to the study of complex diseases, e.g. understanding molecular mechanisms of diseases and discovering new treatment of diseases. Human complex diseases involve both external phenotypic abnormalities and complex internal molecular mechanisms in organisms. Computational methods with different types of biomedical data from phenotype to genotype can evaluate disease-disease associations at different levels, providing a comprehensive perspective for understanding diseases. In this review, available biomedical data and databases for evaluating disease-disease associations are first summarized. Then, existing computational methods for disease-disease associations are reviewed and classified into five groups in terms of the usages of biomedical data, including disease semantic-based, phenotype-based, function-based, representation learning-based and text mining-based methods. Further, we summarize software tools/platforms for computation and analysis of disease-disease associations. Finally, we give a discussion and summary on the research of disease-disease associations. This review provides a systematic overview for current disease association research, which could promote the development and applications of computational methods and tools/platforms for disease-disease associations.
Collapse
Affiliation(s)
- Ju Xiang
- School of Computer Science and Engineering, Central South University, China
| | - Jiashuai Zhang
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, Hunan 410083, China
| | - Yichao Zhao
- School of Computer Science and Engineering, Central South University, China
| | - Fang-Xiang Wu
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, Hunan 410083, China
| | - Min Li
- Division of Biomedical Engineering and Department of Mechanical Engineering at University of Saskatchewan, Saskatoon, Canada
| |
Collapse
|
5
|
Prieto Santamaría L, García Del Valle EP, Zanin M, Hernández Chan GS, Pérez Gallardo Y, Rodríguez-González A. Classifying diseases by using biological features to identify potential nosological models. Sci Rep 2021; 11:21096. [PMID: 34702888 PMCID: PMC8548311 DOI: 10.1038/s41598-021-00554-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Accepted: 10/14/2021] [Indexed: 11/25/2022] Open
Abstract
Established nosological models have provided physicians an adequate enough classification of diseases so far. Such systems are important to correctly identify diseases and treat them successfully. However, these taxonomies tend to be based on phenotypical observations, lacking a molecular or biological foundation. Therefore, there is an urgent need to modernize them in order to include the heterogeneous information that is produced in the present, as could be genomic, proteomic, transcriptomic and metabolic data, leading this way to more comprehensive and robust structures. For that purpose, we have developed an extensive methodology to analyse the possibilities when it comes to generate new nosological models from biological features. Different datasets of diseases have been considered, and distinct features related to diseases, namely genes, proteins, metabolic pathways and genetical variants, have been represented as binary and numerical vectors. From those vectors, diseases distances have been computed on the basis of several metrics. Clustering algorithms have been implemented to group diseases, generating different models, each of them corresponding to the distinct combinations of the previous parameters. They have been evaluated by means of intrinsic metrics, proving that some of them are highly suitable to cover new nosologies. One of the clustering configurations has been deeply analysed, demonstrating its quality and validity in the research context, and further biological interpretations have been made. Such model was particularly generated by OPTICS clustering algorithm, by studying the distance between diseases based on gene sharedness and following cosine index metric. 729 clusters were formed in this model, which obtained a Silhouette coefficient of 0.43.
Collapse
Affiliation(s)
- Lucía Prieto Santamaría
- ETS Ingenieros Informáticos, Universidad Politécnica de Madrid, 28660 Boadilla del Monte, Madrid, Spain. .,Ezeris Networks Global Services S.L., 28028, Madrid, Spain.
| | | | - Massimiliano Zanin
- Instituto de Física Interdisciplinar y Sistemas Complejos, CSIC-UIB, 07122, Palma de Mallorca, Spain
| | | | | | | |
Collapse
|
6
|
Erro R. Genetic discrimination and fa(h)r beyond. Parkinsonism Relat Disord 2020; 75:119-120. [PMID: 32205043 DOI: 10.1016/j.parkreldis.2020.03.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Accepted: 03/09/2020] [Indexed: 11/24/2022]
Affiliation(s)
- Roberto Erro
- Department of Medicine, Surgery and Dentistry "Scuola Medica Salernitana", University of Salerno, Via Allende, 84081, Baronissi, SA, Italy.
| |
Collapse
|
7
|
Yu MK, Ma J, Ono K, Zheng F, Fong SH, Gary A, Chen J, Demchak B, Pratt D, Ideker T. DDOT: A Swiss Army Knife for Investigating Data-Driven Biological Ontologies. Cell Syst 2019; 8:267-273.e3. [PMID: 30878356 PMCID: PMC7042149 DOI: 10.1016/j.cels.2019.02.003] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2018] [Revised: 12/08/2018] [Accepted: 02/08/2019] [Indexed: 01/08/2023]
Abstract
Systems biology requires not only genome-scale data but also methods to integrate these data into interpretable models. Previously, we developed approaches that organize omics data into a structured hierarchy of cellular components and pathways, called a "data-driven ontology." Such hierarchies recapitulate known cellular subsystems and discover new ones. To broadly facilitate this type of modeling, we report the development of a software library called the Data-Driven Ontology Toolkit (DDOT), consisting of a Python package (https://github.com/idekerlab/ddot) to assemble and analyze ontologies and a web application (http://hiview.ucsd.edu) to visualize them. Using DDOT, we programmatically assemble a compendium of ontologies for 652 diseases by integrating gene-disease mappings with a gene similarity network derived from omics data. For example, the ontology for Fanconi anemia describes known and novel disease mechanisms in its hierarchy of 194 genes and 74 subsystems. DDOT provides an easy interface to share ontologies online at the Network Data Exchange.
Collapse
Affiliation(s)
- Michael Ku Yu
- Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA; Graduate Program in Bioinformatics and Systems Biology, University of California, San Diego, La Jolla, CA 92093, USA; Toyota Technological Institute at Chicago, Chicago, IL 60637, USA
| | - Jianzhu Ma
- Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA
| | - Keiichiro Ono
- Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA
| | - Fan Zheng
- Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA
| | - Samson H Fong
- Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA; Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Aaron Gary
- Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA
| | - Jing Chen
- Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA
| | - Barry Demchak
- Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA
| | - Dexter Pratt
- Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA
| | - Trey Ideker
- Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA; Graduate Program in Bioinformatics and Systems Biology, University of California, San Diego, La Jolla, CA 92093, USA; Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|