1
|
Trigila AP, Pisciottano F, Franchini LF. Hearing loss genes reveal patterns of adaptive evolution at the coding and non-coding levels in mammals. BMC Biol 2021; 19:244. [PMID: 34784928 PMCID: PMC8594068 DOI: 10.1186/s12915-021-01170-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Accepted: 10/21/2021] [Indexed: 11/26/2022] Open
Abstract
Background Mammals possess unique hearing capacities that differ significantly from those of the rest of the amniotes. In order to gain insights into the evolution of the mammalian inner ear, we aim to identify the set of genetic changes and the evolutionary forces that underlie this process. We hypothesize that genes that impair hearing when mutated in humans or in mice (hearing loss (HL) genes) must play important roles in the development and physiology of the inner ear and may have been targets of selective forces across the evolution of mammals. Additionally, we investigated if these HL genes underwent a human-specific evolutionary process that could underlie the evolution of phenotypic traits that characterize human hearing. Results We compiled a dataset of HL genes including non-syndromic deafness genes identified by genetic screenings in humans and mice. We found that many genes including those required for the normal function of the inner ear such as LOXHD1, TMC1, OTOF, CDH23, and PCDH15 show strong signatures of positive selection. We also found numerous noncoding accelerated regions in HL genes, and among them, we identified active transcriptional enhancers through functional enhancer assays in transgenic zebrafish. Conclusions Our results indicate that the key inner ear genes and regulatory regions underwent adaptive evolution in the basal branch of mammals and along the human-specific branch, suggesting that they could have played an important role in the functional remodeling of the cochlea. Altogether, our data suggest that morphological and functional evolution could be attained through molecular changes affecting both coding and noncoding regulatory regions. Supplementary Information The online version contains supplementary material available at 10.1186/s12915-021-01170-6.
Collapse
Affiliation(s)
- Anabella P Trigila
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular (INGEBI), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), C1428, Buenos Aires, Argentina
| | - Francisco Pisciottano
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular (INGEBI), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), C1428, Buenos Aires, Argentina.,Current address: Instituto de Biología y Medicina Experimental (IBYME), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), C1428, Buenos Aires, Argentina
| | - Lucía F Franchini
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular (INGEBI), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), C1428, Buenos Aires, Argentina.
| |
Collapse
|
2
|
Giannoula A, Centeno E, Mayer MA, Sanz F, Furlong LI. A system-level analysis of patient disease trajectories based on clinical, phenotypic and molecular similarities. Bioinformatics 2021; 37:1435-1443. [PMID: 33185649 DOI: 10.1093/bioinformatics/btaa964] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Revised: 09/16/2020] [Accepted: 11/03/2020] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Incorporating the temporal dimension into multimorbidity studies has shown to be crucial for achieving a better understanding of the disease associations. Furthermore, due to the multifactorial nature of human disease, exploring disease associations from different perspectives can provide a holistic view to support the study of their aetiology. RESULTS In this work, a temporal systems-medicine approach is proposed for identifying time-dependent multimorbidity patterns from patient disease trajectories, by integrating data from electronic health records with genetic and phenotypic information. Specifically, the disease trajectories are clustered using an unsupervised algorithm based on dynamic time warping and three disease similarity metrics: clinical, genetic and phenotypic. An evaluation method is also presented for quantitatively assessing, in the different disease spaces, both the cluster homogeneity and the respective similarities between the associated diseases within individual trajectories. The latter can facilitate exploring the origin(s) in the identified disease patterns. The proposed integrative methodology can be applied to any longitudinal cohort and disease of interest. In this article, prostate cancer is selected as a use case of medical interest to demonstrate, for the first time, the identification of temporal disease multimorbidities in different disease spaces. AVAILABILITY AND IMPLEMENTATION https://gitlab.com/agiannoula/diseasetrajectories. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Alexia Giannoula
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra, 08003, Barcelona, Spain
| | - Emilio Centeno
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra, 08003, Barcelona, Spain
| | - Miguel-Angel Mayer
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra, 08003, Barcelona, Spain
| | - Ferran Sanz
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra, 08003, Barcelona, Spain
| | - Laura I Furlong
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra, 08003, Barcelona, Spain
| |
Collapse
|
3
|
Biswas S, Mitra P, Rao KS. Relation Prediction of Co-Morbid Diseases Using Knowledge Graph Completion. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:708-717. [PMID: 31295118 DOI: 10.1109/tcbb.2019.2927310] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Co-morbid disease condition refers to the simultaneous presence of one or more diseases along with the primary disease. A patient suffering from co-morbid diseases possess more mortality risk than with a disease alone. So, it is necessary to predict co-morbid disease pairs. In past years, though several methods have been proposed by researchers for predicting the co-morbid diseases, not much work is done in prediction using knowledge graph embedding using tensor factorization. Moreover, the complex-valued vector-based tensor factorization is not being used in any knowledge graph with biological and biomedical entities. We propose a tensor factorization based approach on biological knowledge graphs. Our method introduces the concept of complex-valued embedding in knowledge graphs with biological entities. Here, we build a knowledge graph with disease-gene associations and their corresponding background information. To predict the association between prevalent diseases, we use ComplEx embedding based tensor decomposition method. Besides, we obtain new prevalent disease pairs using the MCL algorithm in a disease-gene-gene network and check their corresponding inter-relations using edge prediction task.
Collapse
|
4
|
Nguyen TV, Eisman JA. Post-GWAS Polygenic Risk Score: Utility and Challenges. JBMR Plus 2020; 4:e10411. [PMID: 33210063 PMCID: PMC7657393 DOI: 10.1002/jbm4.10411] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Revised: 08/23/2020] [Accepted: 09/02/2020] [Indexed: 12/22/2022] Open
Abstract
Over the past decade, through genome‐wide association studies, more than 300 genetic variants have been identified to be associated with either BMD or fracture risk. These genetic variants are common in the general population, but they exert small to modest effects on BMD, suggesting that the utility of any single variant is limited. However, a combination of effect sizes from multiple variants in the form of the polygenic risk score (PRS) can provide a useful indicator of fracture risk beyond that obtained by conventional clinical risk factors. In this perspective, we review the progress of genetics of osteoporosis and approaches for creating PRSs, their uses, and caveats. Recent studies support the idea that the PRS, when integrated into existing fracture prediction models, can help clinicians and patients alike to better assess the fracture risk for an individual, and raise the possibility of precision risk assessment. © 2020 The Authors. JBMR Plus published by Wiley Periodicals LLC on behalf of American Society for Bone and Mineral Research.
Collapse
Affiliation(s)
- Tuan V Nguyen
- Healthy Ageing Theme Garvan Institute of Medical Research Sydney Australia.,St Vincent's Clinical School UNSW Medicine, UNSW Sydney Australia.,School of Medicine Sydney University of Notre Dame Sydney Australia.,School of Biomedical Engineering University of Technology Sydney Australia
| | - John A Eisman
- Healthy Ageing Theme Garvan Institute of Medical Research Sydney Australia.,St Vincent's Clinical School UNSW Medicine, UNSW Sydney Australia.,School of Medicine Sydney University of Notre Dame Sydney Australia
| |
Collapse
|
5
|
Akram P, Liao L. Prediction of comorbid diseases using weighted geometric embedding of human interactome. BMC Med Genomics 2019; 12:161. [PMID: 31888634 PMCID: PMC6936100 DOI: 10.1186/s12920-019-0605-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Accepted: 10/16/2019] [Indexed: 01/20/2023] Open
Abstract
BACKGROUND Comorbidity is the phenomenon of two or more diseases occurring simultaneously not by random chance and presents great challenges to accurate diagnosis and treatment. As an effort toward better understanding the genetic causes of comorbidity, in this work, we have developed a computational method to predict comorbid diseases. Two diseases sharing common genes tend to increase their comorbidity. Previous work shows that after mapping the associated genes onto the human interactome the distance between the two disease modules (subgraphs) is correlated with comorbidity. METHODS To fully incorporate structural characteristics of interactome as features into prediction of comorbidity, our method embeds the human interactome into a high dimensional geometric space with weights assigned to the network edges and uses the projection onto different dimension to "fingerprint" disease modules. A supervised machine learning classifier is then trained to discriminate comorbid diseases versus non-comorbid diseases. RESULTS In cross-validation using a benchmark dataset of more than 10,000 disease pairs, we report that our model achieves remarkable performance of ROC score = 0.90 for comorbidity threshold at relative risk RR = 0 and 0.76 for comorbidity threshold at RR = 1, and significantly outperforms the previous method and the interactome generated by annotated data. To further incorporate prior knowledge pathways association with diseases, we weight the protein-protein interaction network edges according to their frequency of occurring in those pathways in such a way that edges with higher frequency will more likely be selected in the minimum spanning tree for geometric embedding. Such weighted embedding is shown to lead to further improvement of comorbid disease prediction. CONCLUSION The work demonstrates that embedding the two-dimension planar graph of human interactome into a high dimensional geometric space allows for characterizing and capturing disease modules (subgraphs formed by the disease associated genes) from multiple perspectives, and hence provides enriched features for a supervised classifier to discriminate comorbid disease pairs from non-comorbid disease pairs more accurately than based on simply the module separation.
Collapse
Affiliation(s)
- Pakeeza Akram
- School of Electrical Engineering and Computer Science (SEECS), National University of Sciences and Technology (NUST), H-12, Islamabad, Pakistan
- Department of Computer Science, University of Delaware, Newark, USA
| | - Li Liao
- Department of Computer Science, University of Delaware, Newark, USA
| |
Collapse
|
6
|
Chen X, Shi W, Deng L. Prediction of Disease Comorbidity Using HeteSim Scores based on Multiple Heterogeneous Networks. Curr Gene Ther 2019; 19:232-241. [DOI: 10.2174/1566523219666190917155959] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Revised: 06/14/2019] [Accepted: 06/16/2019] [Indexed: 12/25/2022]
Abstract
Background:
Accumulating experimental studies have indicated that disease comorbidity
causes additional pain to patients and leads to the failure of standard treatments compared to patients
who have a single disease. Therefore, accurate prediction of potential comorbidity is essential to design
more efficient treatment strategies. However, only a few disease comorbidities have been discovered
in the clinic.
Objective:
In this work, we propose PCHS, an effective computational method for predicting disease
comorbidity.
Materials and Methods:
We utilized the HeteSim measure to calculate the relatedness score for different
disease pairs in the global heterogeneous network, which integrates six networks based on biological
information, including disease-disease associations, drug-drug interactions, protein-protein interactions
and associations among them. We built the prediction model using the Support Vector Machine
(SVM) based on the HeteSim scores.
Results and Conclusion:
The results showed that PCHS performed significantly better than previous
state-of-the-art approaches and achieved an AUC score of 0.90 in 10-fold cross-validation. Furthermore,
some of our predictions have been verified in literatures, indicating the effectiveness of our method.
Collapse
Affiliation(s)
- Xuegong Chen
- School of Computer Science and Engineering, Central South University, Changsha, 410075, China
| | - Wanwan Shi
- School of Computer Science and Engineering, Central South University, Changsha, 410075, China
| | - Lei Deng
- School of Computer Science and Engineering, Central South University, Changsha, 410075, China
| |
Collapse
|
7
|
Nam Y, Lee DG, Bang S, Kim JH, Kim JH, Shin H. The translational network for metabolic disease - from protein interaction to disease co-occurrence. BMC Bioinformatics 2019; 20:576. [PMID: 31722666 PMCID: PMC6854734 DOI: 10.1186/s12859-019-3106-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2018] [Accepted: 09/20/2019] [Indexed: 02/08/2023] Open
Abstract
Background The recent advances in human disease network have provided insights into establishing the relationships between the genotypes and phenotypes of diseases. In spite of the great progress, it yet remains as only a map of topologies between diseases, but not being able to be a pragmatic diagnostic/prognostic tool in medicine. It can further evolve from a map to a translational tool if it equips with a function of scoring that measures the likelihoods of the association between diseases. Then, a physician, when practicing on a patient, can suggest several diseases that are highly likely to co-occur with a primary disease according to the scores. In this study, we propose a method of implementing ‘n-of-1 utility’ (n potential diseases of one patient) to human disease network—the translational disease network. Results We first construct a disease network by introducing the notion of walk in graph theory to protein-protein interaction network, and then provide a scoring algorithm quantifying the likelihoods of disease co-occurrence given a primary disease. Metabolic diseases, that are highly prevalent but have found only a few associations in previous studies, are chosen as entries of the network. Conclusions The proposed method substantially increased connectivity between metabolic diseases and provided scores of co-occurring diseases. The increase in connectivity turned the disease network info-richer. The result lifted the AUC of random guessing up to 0.72 and appeared to be concordant with the existing literatures on disease comorbidity.
Collapse
Affiliation(s)
- Yonghyun Nam
- Department of Industrial Engineering, Ajou University, 206, World cup-ro, Yeongtong-gu, Suwon-si, Gyeonggi-do, 16499, Republic of Korea
| | - Dong-Gi Lee
- Department of Industrial Engineering, Ajou University, 206, World cup-ro, Yeongtong-gu, Suwon-si, Gyeonggi-do, 16499, Republic of Korea
| | - Sunjoo Bang
- Department of Industrial Engineering, Ajou University, 206, World cup-ro, Yeongtong-gu, Suwon-si, Gyeonggi-do, 16499, Republic of Korea
| | - Ju Han Kim
- Seoul National University Biomedical Informatics (SNUBI), Division of Biomedical Informatics, Seoul National University College of Medicine, 103, Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea
| | - Jae-Hoon Kim
- Department of Industrial Engineering, Ajou University, 206, World cup-ro, Yeongtong-gu, Suwon-si, Gyeonggi-do, 16499, Republic of Korea.
| | - Hyunjung Shin
- Department of Industrial Engineering, Ajou University, 206, World cup-ro, Yeongtong-gu, Suwon-si, Gyeonggi-do, 16499, Republic of Korea.
| |
Collapse
|
8
|
Rana S, Sultana A. Association of the Variant rs7561317 Downstream of the TMEM18 Gene with Overweight/Obesity and Related Anthropometric Traits in a Sample of Pakistani Population. Biochem Genet 2019; 58:257-278. [PMID: 31628562 DOI: 10.1007/s10528-019-09940-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Accepted: 10/01/2019] [Indexed: 12/17/2022]
Abstract
Obesity is a multifactorial disorder and requires favorable environment for its expression. However, some individuals are more prone to weight gain than others in an obesogenic environment. Thus, at individual level, who becomes overweight or obese is mostly determined by genetic factors. The current study was undertaken to explore for the first time the association of the TMEM18 rs7561317 variant with overweight/obesity and related anthropometric, metabolic, physical, and behavioral traits in a sample of Pakistani population with association between the rs7561317 and many traits was not investigated before in any population. The current study involved a total of 612 subjects including 306 overweight/obese and equal number of age- and sex-matched normal weight individuals. Obesity-related parameters were determined and the variant was genotyped by allelic discrimination assay. All the aforementioned associations were assessed by regression analyses adjusted for covariates and corrected for multiple comparisons. The results revealed a significant association of the TMEM18 rs7561317 with overweight/obese phenotype in more than one genetic model. Therefore, h-index (degree of dominance) was calculated, which indicated the recessive mode of inheritance for the above-said association. Similarly, a significant association of the rs7561317 with obesity-related anthropometric traits and clinical surrogate markers of visceral adiposity was observed. Thus, GG genotype of the rs7561317 was found to increase 1.74 times the risk of overweight/obesity in Pakistani population (OR = 1.74, 95% CI 1.210-2.496, p = 0.003) while low physical activity seemed to accentuate the TMEM18 rs7561317-associated risk of overweight/obesity (OR = 2.696, 95% CI 1.485-4.896, p = 0.004).
Collapse
Affiliation(s)
- Sobia Rana
- Molecular Biology and Human Genetics Laboratory, Dr. Panjwani Center for Molecular Medicine and Drug Research (PCMD), International Center for Chemical and Biological Sciences (ICCBS), University of Karachi, Karachi, 75270, Pakistan.
| | - Ayesha Sultana
- Molecular Biology and Human Genetics Laboratory, Dr. Panjwani Center for Molecular Medicine and Drug Research (PCMD), International Center for Chemical and Biological Sciences (ICCBS), University of Karachi, Karachi, 75270, Pakistan
| |
Collapse
|
9
|
Shoshi A, Hofestädt R, Zolotareva O, Friedrichs M, Maier A, Ivanisenko VA, Dosenko VE, Bragina EY. GenCoNet - A Graph Database for the Analysis of Comorbidities by Gene Networks. J Integr Bioinform 2018; 15:/j/jib.2018.15.issue-4/jib-2018-0049/jib-2018-0049.xml. [PMID: 30864352 PMCID: PMC6348742 DOI: 10.1515/jib-2018-0049] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2018] [Accepted: 10/31/2018] [Indexed: 12/11/2022] Open
Abstract
The prevalence of comorbid diseases poses a major health issue for millions of people worldwide and an enormous socio-economic burden for society. The molecular mechanisms for the development of comorbidities need to be investigated. For this purpose, a workflow system was developed to aggregate data on biomedical entities from heterogeneous data sources. The process of integrating and merging all data sources of the workflow system was implemented as a semi-automatic pipeline that provides the import, fusion, and analysis of the highly connected biomedical data in a Neo4j database GenCoNet. As a starting point, data on the common comorbid diseases essential hypertension and bronchial asthma was integrated. GenCoNet (https://genconet.kalis-amts.de) is a curated database that provides a better understanding of hereditary bases of comorbidities.
Collapse
Affiliation(s)
- Alban Shoshi
- Bielefeld University, Bioinformatics/Medical Informatics Department, Bielefeld, Germany
| | - Ralf Hofestädt
- Bielefeld University, Bioinformatics/Medical Informatics Department, Bielefeld, Germany
| | - Olga Zolotareva
- Bielefeld University, Bioinformatics/Medical Informatics Department, Bielefeld, Germany.,Bielefeld University, International Research Group "Computational Methods for the Analysis of the Diversity and Dynamics of Genomes", Bielefeld, Germany
| | - Marcel Friedrichs
- Bielefeld University, Bioinformatics/Medical Informatics Department, Bielefeld, Germany
| | - Alex Maier
- Bielefeld University, Bioinformatics/Medical Informatics Department, Bielefeld, Germany
| | - Vladimir A Ivanisenko
- Institute of Cytology and Genetics, Siberian Branch, Russian Academy of Sciences, Novosibirsk, Russia
| | | | - Elena Yu Bragina
- Research Institute of Medical Genetics, Tomsk NRMC, Tomsk, Russia
| |
Collapse
|
10
|
Fröhlich H, Balling R, Beerenwinkel N, Kohlbacher O, Kumar S, Lengauer T, Maathuis MH, Moreau Y, Murphy SA, Przytycka TM, Rebhan M, Röst H, Schuppert A, Schwab M, Spang R, Stekhoven D, Sun J, Weber A, Ziemek D, Zupan B. From hype to reality: data science enabling personalized medicine. BMC Med 2018; 16:150. [PMID: 30145981 PMCID: PMC6109989 DOI: 10.1186/s12916-018-1122-7] [Citation(s) in RCA: 208] [Impact Index Per Article: 29.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Accepted: 07/09/2018] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Personalized, precision, P4, or stratified medicine is understood as a medical approach in which patients are stratified based on their disease subtype, risk, prognosis, or treatment response using specialized diagnostic tests. The key idea is to base medical decisions on individual patient characteristics, including molecular and behavioral biomarkers, rather than on population averages. Personalized medicine is deeply connected to and dependent on data science, specifically machine learning (often named Artificial Intelligence in the mainstream media). While during recent years there has been a lot of enthusiasm about the potential of 'big data' and machine learning-based solutions, there exist only few examples that impact current clinical practice. The lack of impact on clinical practice can largely be attributed to insufficient performance of predictive models, difficulties to interpret complex model predictions, and lack of validation via prospective clinical trials that demonstrate a clear benefit compared to the standard of care. In this paper, we review the potential of state-of-the-art data science approaches for personalized medicine, discuss open challenges, and highlight directions that may help to overcome them in the future. CONCLUSIONS There is a need for an interdisciplinary effort, including data scientists, physicians, patient advocates, regulatory agencies, and health insurance organizations. Partially unrealistic expectations and concerns about data science-based solutions need to be better managed. In parallel, computational methods must advance more to provide direct benefit to clinical practice.
Collapse
Affiliation(s)
- Holger Fröhlich
- UCB Biosciences GmbH, Alfred-Nobel-Str. Str. 10, 40789 Monheim, Germany
- University of Bonn, Bonn-Aachen International Center for IT, Endenicher Allee 19c, 53115 Bonn, Germany
| | - Rudi Balling
- University of Luxembourg, 6 avenue du Swing, 4367 Belvaux, Luxembourg
| | - Niko Beerenwinkel
- Department of Biosciences and Engineering, ETH Zurich, Mattenstr. 26, 4058 Basel, Switzerland
| | - Oliver Kohlbacher
- University of Tübingen, WSI/ZBIT, Sand 14, 72076 Tübingen, Germany
- Max Planck Institute for Developmental Biology, Max-Planck-Ring 5, 72076 Tübingen, Germany
- Quantitative Biology Center, University of Tübingen, Auf der Morgenstelle 8, 72076 Tübingen, Germany
- Institute for Translational Bioinformatics, University Medical Center Tübingen, Sand 14, 72076 Tübingen, Germany
| | - Santosh Kumar
- Department of Computer Science, University of Memphis, 2222 Dunn Hall, Memphis, TN 38152 USA
| | - Thomas Lengauer
- Max-Planck-Institute for Informatics, 66123 Saarbrücken, Germany
| | - Marloes H. Maathuis
- ETH Zurich, Seminar für Statistik, Rämistrasse 101, 8092 Zurich, Switzerland
| | - Yves Moreau
- University of Leuven, ESAT, Kasteelpark Arenberg 10, 3001 Leuven, Belgium
| | - Susan A. Murphy
- Harvard University, Science Center 400 Suite, Oxford Street, Cambridge, MA 02138-2901 USA
| | - Teresa M. Przytycka
- National Center of Biotechnology Information, National Institute of Health, 8600 Rockville Pike, Bethesda, MD 20894-6075 USA
| | - Michael Rebhan
- Novartis Institutes for Biomedical Research, 4056 Basel, Switzerland
| | - Hannes Röst
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1 Canada
| | - Andreas Schuppert
- RWTH Aachen, Joint Research Center for Computational Biomedicine, Pauwelsstrasse 19, 52074 Aachen, Germany
| | - Matthias Schwab
- Dr. Margarete Fischer-Bosch Institute of Clinical Pharmacology, Aucherbachstrasse 112, 70376 Stuttgart, Germany
- University of Tübingen, Departments of Clinical Pharmacology and of Pharmacy and Biochemistry, Tübingen, Germany
| | - Rainer Spang
- University of Regensburg, Institute of Functional Genomics, Am BioPark 9, 93053 Regensburg, Germany
| | - Daniel Stekhoven
- ETH Zurich, NEXUS Personalized Health Technol., Otto-Stern-Weg 7, 8093 Zurich, Switzerland
| | - Jimeng Sun
- Georgia Tech University, 801 Atlantic Drive, Atlanta, GA 30332-0280 USA
| | - Andreas Weber
- Institute for Computer Science, University of Bonn, Endenicher Allee 19a, 53115 Bonn, Germany
| | - Daniel Ziemek
- Pfizer, Worldwide Research and Development, Linkstraße 10, 10785 Berlin, Germany
| | - Blaz Zupan
- Faculty of Computer and Information Science, University of Ljubljana, Večna pot 113, SI-1000 Ljubljana, Slovenia
| |
Collapse
|
11
|
Brunson JC, Laubenbacher RC. Applications of network analysis to routinely collected health care data: a systematic review. J Am Med Inform Assoc 2018; 25:210-221. [PMID: 29025116 PMCID: PMC6664849 DOI: 10.1093/jamia/ocx052] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2017] [Revised: 04/18/2017] [Accepted: 04/23/2017] [Indexed: 01/21/2023] Open
Abstract
Objective To survey network analyses of datasets collected in the course of routine operations in health care settings and identify driving questions, methods, needs, and potential for future research. Materials and Methods A search strategy was designed to find studies that applied network analysis to routinely collected health care datasets and was adapted to 3 bibliographic databases. The results were grouped according to a thematic analysis of their settings, objectives, data, and methods. Each group received a methodological synthesis. Results The search found 189 distinct studies reported before August 2016. We manually partitioned the sample into 4 groups, which investigated institutional exchange, physician collaboration, clinical co-occurrence, and workplace interaction networks. Several robust and ongoing research programs were discerned within (and sometimes across) the groups. Little interaction was observed between these programs, despite conceptual and methodological similarities. Discussion We use the literature sample to inform a discussion of good practice at this methodological interface, including the concordance of motivations, study design, data, and tools and the validation and standardization of techniques. We then highlight instances of positive feedback between methodological development and knowledge domains and assess the overall cohesion of the sample.
Collapse
|
12
|
He F, Zhu G, Wang YY, Zhao XM, Huang DS. PCID: A Novel Approach for Predicting Disease Comorbidity by Integrating Multi-Scale Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:678-686. [PMID: 27076462 DOI: 10.1109/tcbb.2016.2550443] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Disease comorbidity is the presence of one or more diseases along with a primary disorder, which causes additional pain to patients and leads to the failure of standard treatments compared with single diseases. Therefore, the identification of potential comorbidity can help prevent those comorbid diseases when treating a primary disease. Unfortunately, most of current known disease comorbidities are discovered occasionally in clinic, and our knowledge about comorbidity is far from complete. Despite the fact that many efforts have been made to predict disease comorbidity, the prediction accuracy of existing computational approaches needs to be improved. By investigating the factors underlying disease comorbidity, e.g., mutated genes and rewired protein-protein interactions (PPIs), we here present a novel algorithm to predict disease comorbidity by integrating multi-scale data ranging from genes to phenotypes. Benchmark results on real data show that our approach outperforms existing algorithms, and some of our novel predictions are validated with those reported in literature, indicating the effectiveness and predictive power of our approach. In addition, we identify some pathway and PPI patterns that underlie the co-occurrence between a primary disease and certain disease classes, which can help explain how the comorbidity is initiated from molecular perspectives.
Collapse
|
13
|
HybridRanker: Integrating network topology and biomedical knowledge to prioritize cancer candidate genes. J Biomed Inform 2016; 64:139-146. [DOI: 10.1016/j.jbi.2016.10.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2016] [Revised: 08/13/2016] [Accepted: 10/06/2016] [Indexed: 11/20/2022]
|
14
|
Abstract
This review focuses at the problem of the genetic basis of comorbidity. We discuss the concepts and terms relating to combinations of diseases. The guidelines of the study of comorbidity using modern high throughput methods and approaches of genetics, molecular biology and bioinformatics are designated. In this review we present results of studies showing genetic specificity for the combined phenotypes dif-ferent from the isolated disease, we considergene-gene and gene-environment interactions in comorbidity. We also discuss the role of single nucleotide polymorphisms and structural genome variations in the development of comorbidity. Own results of researching shared genes of inversely comorbid diseases like as bronchial asthma and tuberculosis are presented.
Collapse
Affiliation(s)
- Ye. Yu. Bragina
- Research Institute for Medical Genetics, Tomsk, Russian Federation
| | - M. B. Freidin
- Research Institute for Medical Genetics, Tomsk, Russian Federation
| |
Collapse
|
15
|
Network regularised Cox regression and multiplex network models to predict disease comorbidities and survival of cancer. Comput Biol Chem 2015; 59 Pt B:15-31. [DOI: 10.1016/j.compbiolchem.2015.08.010] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2015] [Revised: 08/21/2015] [Accepted: 08/25/2015] [Indexed: 12/17/2022]
|
16
|
Han SK, Kim I, Hwang J, Kim S. Network Modules of the Cross-Species Genotype-Phenotype Map Reflect the Clinical Severity of Human Diseases. PLoS One 2015; 10:e0136300. [PMID: 26301634 PMCID: PMC4547739 DOI: 10.1371/journal.pone.0136300] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2015] [Accepted: 08/02/2015] [Indexed: 01/09/2023] Open
Abstract
Recent advances in genome sequencing techniques have improved our understanding of the genotype-phenotype relationship between genetic variants and human diseases. However, genetic variations uncovered from patient populations do not provide enough information to understand the mechanisms underlying the progression and clinical severity of human diseases. Moreover, building a high-resolution genotype-phenotype map is difficult due to the diverse genetic backgrounds of the human population. We built a cross-species genotype-phenotype map to explain the clinical severity of human genetic diseases. We developed a data-integrative framework to investigate network modules composed of human diseases mapped with gene essentiality measured from a model organism. Essential and nonessential genes connect diseases of different types which form clusters in the human disease network. In a large patient population study, we found that disease classes enriched with essential genes tended to show a higher mortality rate than disease classes enriched with nonessential genes. Moreover, high disease mortality rates are explained by the multiple comorbid relationships and the high pleiotropy of disease genes found in the essential gene-enriched diseases. Our results reveal that the genotype-phenotype map of a model organism can facilitate the identification of human disease-gene associations and predict human disease progression.
Collapse
Affiliation(s)
- Seong Kyu Han
- Department of Life Sciences, Pohang University of Science and Technology, Pohang, 790–784, Korea
| | - Inhae Kim
- Department of Life Sciences, Pohang University of Science and Technology, Pohang, 790–784, Korea
| | - Jihye Hwang
- Department of IT Convergence and Engineering, Pohang University of Science and Technology, Pohang, 790–784, Korea
| | - Sanguk Kim
- Department of Life Sciences, Pohang University of Science and Technology, Pohang, 790–784, Korea
- * E-mail:
| |
Collapse
|
17
|
Moni MA, Liò P. Network-based analysis of comorbidities risk during an infection: SARS and HIV case studies. BMC Bioinformatics 2014; 15:333. [PMID: 25344230 PMCID: PMC4363349 DOI: 10.1186/1471-2105-15-333] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2013] [Accepted: 09/19/2014] [Indexed: 01/02/2023] Open
Abstract
Background Infections are often associated to comorbidity that increases the risk of medical conditions which can lead to further morbidity and mortality. SARS is a threat which is similar to MERS virus, but the comorbidity is the key aspect to underline their different impacts. One UK doctor says "I’d rather have HIV than diabetes" as life expectancy among diabetes patients is lower than that of HIV. However, HIV has a comorbidity impact on the diabetes. Results We present a quantitative framework to compare and explore comorbidity between diseases. By using neighbourhood based benchmark and topological methods, we have built comorbidity relationships network based on the OMIM and our identified significant genes. Then based on the gene expression, PPI and signalling pathways data, we investigate the comorbidity association of these 2 infective pathologies with other 7 diseases (heart failure, kidney disorder, breast cancer, neurodegenerative disorders, bone diseases, Type 1 and Type 2 diabetes). Phenotypic association is measured by calculating both the Relative Risk as the quantified measures of comorbidity tendency of two disease pairs and the ϕ-correlation to measure the robustness of the comorbidity associations. The differential gene expression profiling strongly suggests that the response of SARS affected patients seems to be mainly an innate inflammatory response and statistically dysregulates a large number of genes, pathways and PPIs subnetworks in different pathologies such as chronic heart failure (21 genes), breast cancer (16 genes) and bone diseases (11 genes). HIV-1 induces comorbidities relationship with many other diseases, particularly strong correlation with the neurological, cancer, metabolic and immunological diseases. Similar comorbidities risk is observed from the clinical information. Moreover, SARS and HIV infections dysregulate 4 genes (ANXA3, GNS, HIST1H1C, RASA3) and 3 genes (HBA1, TFRC, GHITM) respectively that affect the ageing process. It is notable that HIV and SARS similarly dysregulated 11 genes and 3 pathways. Only 4 significantly dysregulated genes are common between SARS-CoV and MERS-CoV, including NFKBIA that is a key regulator of immune responsiveness implicated in susceptibility to infectious and inflammatory diseases. Conclusions Our method presents a ripe opportunity to use data-driven approaches for advancing our current knowledge on disease mechanism and predicting disease comorbidities in a quantitative way. Electronic supplementary material The online version of this article (doi:10.1186/1471-2105-15-333) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Mohammad Ali Moni
- Computer Laboratory, University of Cambridge, William Gates Building, 15 JJ Thomson Avenue, Cambridge CB3 0FD, UK.
| | | |
Collapse
|
18
|
Sun K, Gonçalves JP, Larminie C, Przulj N. Predicting disease associations via biological network analysis. BMC Bioinformatics 2014; 15:304. [PMID: 25228247 PMCID: PMC4174675 DOI: 10.1186/1471-2105-15-304] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2013] [Accepted: 08/19/2014] [Indexed: 12/11/2022] Open
Abstract
Background Understanding the relationship between diseases based on the underlying biological mechanisms is one of the greatest challenges in modern biology and medicine. Exploring disease-disease associations by using system-level biological data is expected to improve our current knowledge of disease relationships, which may lead to further improvements in disease diagnosis, prognosis and treatment. Results We took advantage of diverse biological data including disease-gene associations and a large-scale molecular network to gain novel insights into disease relationships. We analysed and compared four publicly available disease-gene association datasets, then applied three disease similarity measures, namely annotation-based measure, function-based measure and topology-based measure, to estimate the similarity scores between diseases. We systematically evaluated disease associations obtained by these measures against a statistical measure of comorbidity which was derived from a large number of medical patient records. Our results show that the correlation between our similarity measures and comorbidity scores is substantially higher than expected at random, confirming that our similarity measures are able to recover comorbidity associations. We also demonstrated that our predicted disease associations correlated with disease associations generated from genome-wide association studies significantly higher than expected at random. Furthermore, we evaluated our predicted disease associations via mining the literature on PubMed, and presented case studies to demonstrate how these novel disease associations can be used to enhance our current knowledge of disease relationships. Conclusions We present three similarity measures for predicting disease associations. The strong correlation between our predictions and known disease associations demonstrates the ability of our measures to provide novel insights into disease relationships. Electronic supplementary material The online version of this article (doi:10.1186/1471-2105-15-304) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | - Nataša Przulj
- Department of Computing, Imperial College London, London, SW7 2AZ, UK.
| |
Collapse
|
19
|
Paik H, Heo HS, Ban HJ, Cho SB. Unraveling human protein interaction networks underlying co-occurrences of diseases and pathological conditions. J Transl Med 2014; 12:99. [PMID: 24731539 PMCID: PMC4021415 DOI: 10.1186/1479-5876-12-99] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2014] [Accepted: 04/03/2014] [Indexed: 12/18/2022] Open
Abstract
Background Human diseases frequently cause complications such as obesity-induced diabetes and share numbers of pathological conditions, such as inflammation, by dysfunctions of common functional modules, such as protein–protein interactions (PPIs). Methods Our developed pipeline, ICod (Interaction analysis for disease Comorbidity), grades similarities between pairs of disease-related PPIs including comorbid diseases and pathological conditions. ICod displayed a disease similarity network consisting of nodes of disease PPIs and edges of similarity value. As a proof of concept, eight complex diseases and pathological conditions, such as type 2 diabetes, obesity, inflammation, and cancers, were examined to discover whether PPIs shared between diseases were associated with comorbidities. Results By comparing Medicare reports of disease co-occurrences from 31 million patients, the disease similarity network shows that PPIs of pathological conditions, including insulin resistance, and inflammation, overlap significantly with PPIs of various comorbid diseases, including diabetes, obesity, and cancers (p < 0.05). Interestingly, maintaining connectivity between essential genes was more drastically perturbed by removing a node of a disease-related gene rather than a pathological condition-related gene, such as one related to inflammations. Conclusion Thus, PPIs of pathological symptoms are underlying functional modules across diseases accompanying comorbidity phenomena, whereas they contribute only marginally to maintaining interactions between essential genes.
Collapse
Affiliation(s)
| | | | | | - Seong Beom Cho
- Division of Bio-Medical Informatics, Center for Genome Science, National Institute of Health, OHTAC, 187 Osongsaengmyeong2(i)-ro, Gangoe-myeon, Cheongwon-gun, ChoongchungBuk-do, South Korea.
| |
Collapse
|
20
|
López-Álvarez MR, Jones DC, Jiang W, Traherne JA, Trowsdale J. Copy number and nucleotide variation of the LILR family of myelomonocytic cell activating and inhibitory receptors. Immunogenetics 2014; 66:73-83. [PMID: 24257760 PMCID: PMC3894450 DOI: 10.1007/s00251-013-0742-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2013] [Accepted: 10/24/2013] [Indexed: 12/11/2022]
Abstract
Leukocyte immunoglobulin-like receptors (LILR) are cell surface molecules that regulate the activities of myelomonocytic cells through the balance of inhibitory and activation signals. LILR genes are located within the leukocyte receptor complex (LRC) on chromosome 19q13.4 adjacent to KIR genes, which are subject to allelic and copy number variation (CNV). LILRB3 (ILT5) and LILRA6 (ILT8) are highly polymorphic receptors with similar extracellular domains. LILRB3 contains inhibitory ITIM motifs and LILRA6 is coupled to an adaptor with activating ITAM motifs. We analysed the sequences of the extracellular immunoglobulin domain-encoding regions of LILRB3 and LILRA6 in 20 individuals, and determined the copy number of these receptors, in addition to those of other members of the LILR family. We found 41 polymorphic sites within the extracellular domains of LILRB3 and LILRA6. Twenty-four of these sites were common to both receptors. LILRA6, but not LILRB3, exhibited CNV. In 20 out of 48 human cell lines from the International Histocompatibility Working Group, LILRA6 was deleted or duplicated. The only other LILR gene exhibiting genomic aberration was LILRA3, in this case due to a partial deletion.
Collapse
Affiliation(s)
- María R. López-Álvarez
- Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QP UK
- Cambridge Institute for Medical Research, University of Cambridge, Cambridge Biomedical Campus, Wellcome Trust/MRC Building, Hills Road, Cambridge, CB2 0XY UK
| | - Des C. Jones
- Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QP UK
| | - Wei Jiang
- Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QP UK
- Cambridge Institute for Medical Research, University of Cambridge, Cambridge Biomedical Campus, Wellcome Trust/MRC Building, Hills Road, Cambridge, CB2 0XY UK
| | - James A. Traherne
- Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QP UK
- Cambridge Institute for Medical Research, University of Cambridge, Cambridge Biomedical Campus, Wellcome Trust/MRC Building, Hills Road, Cambridge, CB2 0XY UK
| | - John Trowsdale
- Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QP UK
- Cambridge Institute for Medical Research, University of Cambridge, Cambridge Biomedical Campus, Wellcome Trust/MRC Building, Hills Road, Cambridge, CB2 0XY UK
- Immunology Division, Pathology Department, University of Cambridge, Cambridge, CB2 1QP UK
| |
Collapse
|
21
|
Wellness and health omics linked to the environment: the WHOLE approach to personalized medicine. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2014; 799:1-14. [PMID: 24292959 DOI: 10.1007/978-1-4614-8778-4_1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
The WHOLE approach to personalized medicine represents an effort to integrate clinical and genomic profiling jointly into preventative health care and the promotion of wellness. Our premise is that genotypes alone are insufficient to predict health outcomes, since they fail to account for individualized responses to the environment and life history. Instead, integrative genomic approaches incorporating whole genome sequences and transcriptome and epigenome profiles, all combined with extensive clinical data obtained at annual health evaluations, have the potential to provide more informative wellness classification. As with traditional medicine where the physician interprets subclinical signs in light of the person's health history, truly personalized medicine will be founded on algorithms that extract relevant information from genomes but will also require interpretation in light of the triggers, behaviors, and environment that are unique to each person. This chapter discusses some of the major obstacles to implementation, from development of risk scores through integration of diverse omic data types to presentation of results in a format that fosters development of personal health action plans.
Collapse
|
22
|
Abstract
The etiology of skeletal disease is driven by genetic and environmental factors. Genome-wide association studies (GWAS) of osteoporotic phenotypes have identified novel candidate genes, but have only uncovered a small proportion of the trait variance explained. This "missing heritability" is caused by several factors, including the failure to consider gene-by-environmental (G*E) interactions. Some G*E interactions have been investigated, but new approaches to integrate environmental data into genomic studies are needed. Advances in genotyping and meta-analysis techniques now allow combining genotype data from multiple studies, but the measurement of key environmental factors in large human cohorts still lags behind, as do the statistical tools needed to incorporate these measures in genome-wide association meta-studies. This review focuses on discussing ways to enhance G*E interaction studies in humans and how the use of rodent models can inform genetic studies. Understanding G*E interactions will provide opportunities to effectively target intervention strategies for individualized therapy.
Collapse
|