1
|
Wack M, Coulet A, Burgun A, Rance B. Enhancing clinical data warehousing with provenance data to support longitudinal analyses and large file management: The gitOmmix approach for genomic and image data. J Biomed Inform 2025; 163:104788. [PMID: 39952627 DOI: 10.1016/j.jbi.2025.104788] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2024] [Revised: 01/27/2025] [Accepted: 02/04/2025] [Indexed: 02/17/2025]
Abstract
BACKGROUND If hospital Clinical Data Warehouses are to address today's focus in personalized medicine, they need to be able to track patients longitudinally and manage the large data sets generated by whole genome sequencing, RNA analyses, and complex imaging studies. Current Clinical Data Warehouses address neither issue. This paper reports on methods to enrich current systems by providing provenance data allowing patient histories to be followed longitudinally and managing the linking and versioning of large data sets from whatever source. The methods are open source and applicable to any clinical data warehouse system, whether data schema it uses. METHOD We introduce gitOmmix, an approach that overcomes these limitations, and illustrate its usefulness in the management of medical omics data. gitOmmix relies on (i) a file versioning system: git, (ii) an extension that handles large files: git-annex, (iii) a provenance knowledge graph: PROV-O, and (iv) an alignment between the git versioning information and the provenance knowledge graph. RESULTS Capabilities inherited from git and git-annex enable retracing the history of a clinical interpretation back to the patient sample, through supporting data and analyses. In addition, the provenance knowledge graph, aligned with the git versioning information, enables querying and browsing provenance relationships between these elements. CONCLUSION gitOmmix adds a provenance layer to CDWs, while scaling to large files and being agnostic of the CDW system. For these reasons, we think that it is a viable and generalizable solution for omics clinical studies.
Collapse
Affiliation(s)
- Maxime Wack
- Centre de Recherche des Cordeliers, UMRS 1138, Inserm, Université Paris Cité, Sorbonne Université, Paris, France; Inria Paris, Paris, France; Department of Biomedical Informatics, Hôpital Européen Georges Pompidou, AP-HP, Paris, France; Centre Hospitalier National d'Ophtalmologie des Quinze-Vingts, IHU FOReSIGHT, 75012 Paris, France.
| | - Adrien Coulet
- Centre de Recherche des Cordeliers, UMRS 1138, Inserm, Université Paris Cité, Sorbonne Université, Paris, France; Inria Paris, Paris, France.
| | - Anita Burgun
- Centre de Recherche des Cordeliers, UMRS 1138, Inserm, Université Paris Cité, Sorbonne Université, Paris, France; Inria Paris, Paris, France; Department of Biomedical Informatics, Hôpital Européen Georges Pompidou, AP-HP, Paris, France.
| | - Bastien Rance
- Centre de Recherche des Cordeliers, UMRS 1138, Inserm, Université Paris Cité, Sorbonne Université, Paris, France; Inria Paris, Paris, France; Department of Biomedical Informatics, Hôpital Européen Georges Pompidou, AP-HP, Paris, France.
| |
Collapse
|
2
|
Pratiwi N, Ulfah AJ, Rachmadina R, Irham LM, Afief AR, Adikusuma W, Darmawi D, Kemal RA, Rangkuti IF, Savira M. Promising candidate drug target genes for repurposing in cervical cancer: A bioinformatics-based approach. NARRA J 2024; 4:e938. [PMID: 39816079 PMCID: PMC11731801 DOI: 10.52225/narra.v4i3.938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/05/2024] [Accepted: 10/28/2024] [Indexed: 01/18/2025]
Abstract
Cervical cancer is the fourth most common cancer among women globally, and studies have shown that genetic variants play a significant role in its development. A variety of germline and somatic mutations are associated with cervical cancer. However, genomic data derived from these mutations have not been extensively utilized for the development of repurposed drugs for cervical cancer. The objective of this study was to identify novel potential drugs that could be repurposed for cervical cancer treatment through a bioinformatics approach. A comprehensive genomic and bioinformatics database integration strategy was employed to identify potential drug target genes for cervical cancer. Using the GWAS and PheWAS databases, a total of 232 genes associated with cervical cancer were identified. These pharmacological target genes were further refined by applying a biological threshold of six functional annotations. The drug target genes were then cross-referenced with cancer treatment candidates using the DrugBank database. Among the identified genes, LTA, TNFRSF1A, PRKCZ, PDE4B, and PARP were highlighted as promising targets for repurposed drugs. Notably, these five target genes overlapped with 12 drugs that could potentially be repurposed for cervical cancer treatment. Among these, talazoparib, a potent PARP inhibitor, emerged as a particularly promising candidate. Interestingly, talazoparib is currently being investigated for safety and tolerability in other cancers but has not yet been studied in the context of cervical cancer. Further clinical trials are necessary to validate this finding and explore its potential as a repurposed drug for cervical cancer.
Collapse
Affiliation(s)
- Nurfi Pratiwi
- Master Program in Biomedical Sciences, Faculty of Medicine, Universitas Riau, Pekanbaru, Indonesia
- Department of Histology, Faculty of Medicine, Universitas Riau, Pekanbaru, Indonesia
| | - Aida J. Ulfah
- Master Program in Biomedical Sciences, Faculty of Medicine, Universitas Riau, Pekanbaru, Indonesia
| | - Rachmadina Rachmadina
- Master Program in Biomedical Sciences, Faculty of Medicine, Universitas Riau, Pekanbaru, Indonesia
| | - Lalu M. Irham
- Faculty of Pharmacy, Universitas Ahmad Dahlan, Yogyakarta, Indonesia
- Faculty of Pharmacy, Silpakorn University, Nakhon Pathom, Thailand
| | - Arief R. Afief
- Faculty of Pharmacy, Universitas YPIB Majalengka, Majalengka, Indonesia
| | - Wirawan Adikusuma
- Department of Pharmacy, Universitas Muhammadiyah Mataram, Mataram, Indonesia
- Research Center for Computing, Research Organization for Electronics and Informatics, National Research and Innovation Agency (BRIN), Cibinong Science Center, Cibinong, Indonesia
| | - Darmawi Darmawi
- Department of Histology, Faculty of Medicine, Universitas Riau, Pekanbaru, Indonesia
- Graduate School in Biomedical Sciences, Faculty of Medicine, Universitas Riau, Pekanbaru, Indonesia
| | - Rahmat A. Kemal
- Department of Medical Biology, Faculty of Medicine, Universitas Riau, Pekanbaru, Indonesia
| | - Ina F. Rangkuti
- Department of Pathological Anatomy, Faculty of Medicine, Universitas Riau, Pekanbaru, Indonesia
| | - Maya Savira
- Graduate School in Biomedical Sciences, Faculty of Medicine, Universitas Riau, Pekanbaru, Indonesia
- Department of Microbiology, Faculty of Medicine, Universitas Riau, Pekanbaru, Indonesia
| |
Collapse
|
3
|
Zagkos L, Dib MJ, Cronjé HT, Elliott P, Dehghan A, Tzoulaki I, Gill D, Daghlas I. Cerebrospinal Fluid C1-Esterase Inhibitor and Tie-1 Levels Affect Cognitive Performance: Evidence from Proteome-Wide Mendelian Randomization. Genes (Basel) 2024; 15:71. [PMID: 38254961 PMCID: PMC10815381 DOI: 10.3390/genes15010071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 12/29/2023] [Accepted: 01/01/2024] [Indexed: 01/24/2024] Open
Abstract
OBJECTIVE The association of cerebrospinal fluid (CSF) protein levels with cognitive function in the general population remains largely unexplored. We performed Mendelian randomization (MR) analyses to query which CSF proteins may have potential causal effects on cognitive performance. METHODS AND ANALYSIS Genetic associations with CSF proteins were obtained from a genome-wide association study conducted in up to 835 European-ancestry individuals and for cognitive performance from a meta-analysis of GWAS including 257,841 European-ancestry individuals. We performed Mendelian randomization (MR) analyses to test the effect of randomly allocated variation in 154 genetically predicted CSF protein levels on cognitive performance. Findings were validated by performing colocalization analyses and considering cognition-related phenotypes. RESULTS Genetically predicted C1-esterase inhibitor levels in the CSF were associated with a better cognitive performance (SD units of cognitive performance per 1 log-relative fluorescence unit (RFU): 0.23, 95% confidence interval: 0.12 to 0.35, p = 7.91 × 10-5), while tyrosine-protein kinase receptor Tie-1 (sTie-1) levels were associated with a worse cognitive performance (-0.43, -0.62 to -0.23, p = 2.08 × 10-5). These findings were supported by colocalization analyses and by concordant effects on distinct cognition-related and brain-volume measures. CONCLUSIONS Human genetics supports a role for the C1-esterase inhibitor and sTie-1 in cognitive performance.
Collapse
Affiliation(s)
- Loukas Zagkos
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London SW7 2BX, UK; (P.E.); (A.D.); (I.T.); (D.G.)
| | - Marie-Joe Dib
- Division of Cardiovascular Medicine, Hospital of the University of Pennsylvania, Philadelphia, PA 19104, USA;
| | - Héléne T. Cronjé
- Department of Public Health, Section of Epidemiology, University of Copenhagen, 1165 Copenhagen, Denmark;
| | - Paul Elliott
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London SW7 2BX, UK; (P.E.); (A.D.); (I.T.); (D.G.)
- UK Dementia Research Institute at Imperial College London, Hammersmith Hospital, London W1T 7NF, UK
- Medical Research Council Centre for Environment and Health, School of Public Health, Imperial College London, London SW7 2AZ, UK
| | - Abbas Dehghan
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London SW7 2BX, UK; (P.E.); (A.D.); (I.T.); (D.G.)
- UK Dementia Research Institute at Imperial College London, Hammersmith Hospital, London W1T 7NF, UK
- Medical Research Council Centre for Environment and Health, School of Public Health, Imperial College London, London SW7 2AZ, UK
| | - Ioanna Tzoulaki
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London SW7 2BX, UK; (P.E.); (A.D.); (I.T.); (D.G.)
- UK Dementia Research Institute at Imperial College London, Hammersmith Hospital, London W1T 7NF, UK
- Medical Research Council Centre for Environment and Health, School of Public Health, Imperial College London, London SW7 2AZ, UK
- Centre for Systems Biology, Biomedical Research Foundation of the Academy of Athens, 11527 Athens, Greece
| | - Dipender Gill
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London SW7 2BX, UK; (P.E.); (A.D.); (I.T.); (D.G.)
| | - Iyas Daghlas
- Department of Neurology, University of California, San Francisco, San Francisco, CA 94143, USA;
| |
Collapse
|
4
|
Cao Q, Du X, Jiang XY, Tian Y, Gao CH, Liu ZY, Xu T, Tao XX, Lei M, Wang XQ, Ye LL, Duan DD. Phenome-wide association study and precision medicine of cardiovascular diseases in the post-COVID-19 era. Acta Pharmacol Sin 2023; 44:2347-2357. [PMID: 37532784 PMCID: PMC10692238 DOI: 10.1038/s41401-023-01119-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Accepted: 05/29/2023] [Indexed: 08/04/2023]
Abstract
SARS-CoV-2 infection causes injuries of not only the lungs but also the heart and endothelial cells in vasculature of multiple organs, and induces systemic inflammation and immune over-reactions, which makes COVID-19 a disease phenome that simultaneously affects multiple systems. Cardiovascular diseases (CVD) are intrinsic risk and causative factors for severe COVID-19 comorbidities and death. The wide-spread infection and reinfection of SARS-CoV-2 variants and the long-COVID may become a new common threat to human health and propose unprecedented impact on the risk factors, pathophysiology, and pharmacology of many diseases including CVD for a long time. COVID-19 has highlighted the urgent demand for precision medicine which needs new knowledge network to innovate disease taxonomy for more precise diagnosis, therapy, and prevention of disease. A deeper understanding of CVD in the setting of COVID-19 phenome requires a paradigm shift from the current phenotypic study that focuses on the virus or individual symptoms to phenomics of COVID-19 that addresses the inter-connectedness of clinical phenotypes, i.e., clinical phenome. Here, we summarize the CVD manifestations in the full clinical spectrum of COVID-19, and the phenome-wide association study of CVD interrelated to COVID-19. We discuss the underlying biology for CVD in the COVID-19 phenome and the concept of precision medicine with new phenomic taxonomy that addresses the overall pathophysiological responses of the body to the SARS-CoV-2 infection. We also briefly discuss the unique taxonomy of disease as Zheng-hou patterns in traditional Chinese medicine, and their potential implications in precision medicine of CVD in the post-COVID-19 era.
Collapse
Affiliation(s)
- Qian Cao
- Center for Phenomics of Traditional Chinese Medicine, the Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, China
| | - Xin Du
- Center for Phenomics of Traditional Chinese Medicine, the Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, China
| | - Xiao-Yan Jiang
- Center for Phenomics of Traditional Chinese Medicine, the Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, China
| | - Yuan Tian
- Center for Phenomics of Traditional Chinese Medicine, the Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, China
| | - Chen-Hao Gao
- Center for Phenomics of Traditional Chinese Medicine, the Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, China
| | - Zi-Yu Liu
- Center for Phenomics of Traditional Chinese Medicine, the Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, China
| | - Ting Xu
- Center for Phenomics of Traditional Chinese Medicine, the Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, China
| | - Xing-Xing Tao
- Center for Phenomics of Traditional Chinese Medicine, the Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, China
| | - Ming Lei
- Center for Phenomics of Traditional Chinese Medicine, the Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, China
| | - Xiao-Qiang Wang
- Center for Phenomics of Traditional Chinese Medicine, the Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, China
| | - Lingyu Linda Ye
- Center for Phenomics of Traditional Chinese Medicine, the Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, China.
- Institute of Integrated Chinese and Western Medicine, Southwest Medical University, Luzhou, 646000, China.
- Key Laboratory of Autoimmune Diseases and Precision Medicie, People's Hospital of Ningxia Hui Autonomous Region, Yinchuan, 750001, China.
| | - Dayue Darrel Duan
- Center for Phenomics of Traditional Chinese Medicine, the Affiliated Traditional Chinese Medicine Hospital, Southwest Medical University, Luzhou, 646000, China.
- Institute of Integrated Chinese and Western Medicine, Southwest Medical University, Luzhou, 646000, China.
- Key Laboratory of Autoimmune Diseases and Precision Medicie, People's Hospital of Ningxia Hui Autonomous Region, Yinchuan, 750001, China.
- The Department of Pharmacology, University of Nevada Reno School of Medicine, Reno, NV, 89557, USA.
| |
Collapse
|
5
|
Mukerji M. Ayurgenomics-based frameworks in precision and integrative medicine: Translational opportunities. CAMBRIDGE PRISMS. PRECISION MEDICINE 2023; 1:e29. [PMID: 38550940 PMCID: PMC10953754 DOI: 10.1017/pcm.2023.15] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 05/22/2023] [Accepted: 06/11/2023] [Indexed: 11/06/2024]
Abstract
In today's globalized and flat world, a patient can access and seek multiple health and disease management options. A digitally enabled participatory framework that allows an evidence-based informed choice is likely to assume an immense importance in the future. In India, traditional knowledge systems, like Ayurveda, coexist with modern medicine. However, due to limited crosstalk between the clinicians of both disciplines, a patient attempts integrative medicine by seeking both options independently with limited understanding and evidence. There is a need for an integrative medicine platform with a formalized approach, which allows practitioners from the two diverse systems to crosstalk, coexist, and coevolve for an informed cross-referral that benefits the patients. To be successful, this needs frameworks that enable the bridging of disciplines through a common interface with shared ontologies. Ayurgenomics is an emerging discipline that explores the principles and practices of Ayurveda combined with genomics approaches for mainstream integration. The present review highlights how in conjunction with different disciplines and technologies this has provided frameworks for (1) the discovery of molecular correlates to build ontological links between the two systems, (2) the discovery of biomarkers and targets for early actionable interventions, (3) understanding molecular mechanisms of drug action from its usage perspective in Ayurveda with applications in repurposing, (4) understanding the network and P4 medicine perspective of Ayurveda through a common organizing principle, (5) non-invasive stratification of healthy and diseased individuals using a compendium of system-level phenotypes, and (6) developing evidence-based solutions for practice in integrative medicine settings. The concordance between the two contrasting streams has been built through extensive explorations and iterations of the concepts of Ayurveda and genomic observations using state-of-the-art technologies, computational approaches, and model system studies. These highlight the enormous potential of a trans-disciplinary approach in evolving solutions for personalized interventions in integrative medicine settings.
Collapse
Affiliation(s)
- Mitali Mukerji
- Department of Bioscience and Bioengineering, Indian Institute of Technology Jodhpur, Karwar, India
- School of Artificial Intelligence and Data Science (AIDE), Indian Institute of Technology Jodhpur, Karwar, India
| |
Collapse
|
6
|
Ying W. Phenomic Studies on Diseases: Potential and Challenges. PHENOMICS (CHAM, SWITZERLAND) 2023; 3:285-299. [PMID: 36714223 PMCID: PMC9867904 DOI: 10.1007/s43657-022-00089-4] [Citation(s) in RCA: 39] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2021] [Revised: 11/21/2022] [Accepted: 11/24/2022] [Indexed: 01/23/2023]
Abstract
The rapid development of such research field as multi-omics and artificial intelligence (AI) has made it possible to acquire and analyze the multi-dimensional big data of human phenomes. Increasing evidence has indicated that phenomics can provide a revolutionary strategy and approach for discovering new risk factors, diagnostic biomarkers and precision therapies of diseases, which holds profound advantages over conventional approaches for realizing precision medicine: first, the big data of patients' phenomes can provide remarkably richer information than that of the genomes; second, phenomic studies on diseases may expose the correlations among cross-scale and multi-dimensional phenomic parameters as well as the mechanisms underlying the correlations; and third, phenomics-based studies are big data-driven studies, which can significantly enhance the possibility and efficiency for generating novel discoveries. However, phenomic studies on human diseases are still in early developmental stage, which are facing multiple major challenges and tasks: first, there is significant deficiency in analytical and modeling approaches for analyzing the multi-dimensional data of human phenomes; second, it is crucial to establish universal standards for acquirement and management of phenomic data of patients; third, new methods and devices for acquirement of phenomic data of patients under clinical settings should be developed; fourth, it is of significance to establish the regulatory and ethical guidelines for phenomic studies on diseases; and fifth, it is important to develop effective international cooperation. It is expected that phenomic studies on diseases would profoundly and comprehensively enhance our capacity in prevention, diagnosis and treatment of diseases.
Collapse
Affiliation(s)
- Weihai Ying
- Med-X Research Institute and School of Biomedical Engineering, Shanghai Jiao Tong University, 1954 Huashan Road, Shanghai, 200030 China
- Collaborative Innovation Center for Genetics and Development, Shanghai, 200043 China
| |
Collapse
|
7
|
Lisboa RO, Sekula RF, Bezamat M, Deeley K, Santana-da-Silva LC, Vieira AR. Pain perception genes, asthma, and oral health: A reverse genetics study. PLoS One 2022; 17:e0277036. [PMID: 36395102 PMCID: PMC9671307 DOI: 10.1371/journal.pone.0277036] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 10/18/2022] [Indexed: 11/19/2022] Open
Abstract
Pain is an experience of a subjective nature, interpreted in a personal way and according to an extensive palette of factors unique to each individual. Orofacial pain can be acute or chronic and it is usually the main reason for the patient to seek dental care. Pain perception varies widely among individuals. This variability is considered a mosaic of factors, which include biopsychosocial factors and genetic factors. Understanding these differences can be extremely beneficial for pain management in a personalized and more efficient way. We performed association studies to investigate phenotypes associated with genetic markers in pain-related genes in two groups of patients who received more or less anesthesia during dental treatment. The study group was comprised of 1289 individuals participating in the Dental Registry and DNA Repository Project (DRDR) of the University of Pittsburgh, with 900 participants in the group that received the most anesthesia and 389 constituting the comparison group that received less anesthesia. We tested 58 phenotypes and genotypic data of seven SNPs in genes that are associated with pain perception, pain modulation and response to drugs used in pain treatment: COMT (rs4818 and rs6269), GCH1 (rs3783641), DRD2 (rs6276), OPRM1 (rs1799971), SCN9A (rs6746030) and SCN10A (rs6795970). The analysis revealed a protective effect of rs1799971 on asthma in the total sample. rs3783641 was associated with salivary secretion disorders in females who received more anesthesia. rs1799971 was also associated with periodontitis in Whites who received less anesthesia. rs4818 was associated with disease and other tongue conditions in the group composed of Blacks who received less anesthesia. In conclusion, our study implicated variants in pain-related genes in asthma and oral phenotypes.
Collapse
Affiliation(s)
- Rosany O. Lisboa
- Laboratory of Inborn Errors of Metabolism, Institute of Biological Sciences, Federal University of Pará, Pará, Brazil
- Departments of Oral and Craniofacial Sciences, Pediatric Dentistry and Center for Craniofacial and Dental Genetics, School of Dental Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
- Graduate Program in Oncology and Medical Sciences, Federal University of Pará, Pará, Brazil
| | - Raymond F. Sekula
- Department of Neurological Surgery, Columbia University Vagelos School of Medicine, New York, New York, United States of America
| | - Mariana Bezamat
- Departments of Oral and Craniofacial Sciences, Pediatric Dentistry and Center for Craniofacial and Dental Genetics, School of Dental Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Kathleen Deeley
- Departments of Oral and Craniofacial Sciences, Pediatric Dentistry and Center for Craniofacial and Dental Genetics, School of Dental Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Luiz Carlos Santana-da-Silva
- Laboratory of Inborn Errors of Metabolism, Institute of Biological Sciences, Federal University of Pará, Pará, Brazil
- Graduate Program in Oncology and Medical Sciences, Federal University of Pará, Pará, Brazil
| | - Alexandre R. Vieira
- Departments of Oral and Craniofacial Sciences, Pediatric Dentistry and Center for Craniofacial and Dental Genetics, School of Dental Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
8
|
Bennett DA, Du H. An Overview of Methods and Exemplars of the Use of Mendelian Randomisation in Nutritional Research. Nutrients 2022; 14:3408. [PMID: 36014914 PMCID: PMC9412324 DOI: 10.3390/nu14163408] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Revised: 08/12/2022] [Accepted: 08/15/2022] [Indexed: 12/09/2022] Open
Abstract
Objectives: It is crucial to elucidate the causal relevance of nutritional exposures (such as dietary patterns, food intake, macronutrients intake, circulating micronutrients), or biomarkers in non-communicable diseases (NCDs) in order to find effective strategies for NCD prevention. Classical observational studies have found evidence of associations between nutritional exposures and NCD development, but such studies are prone to confounding and other biases. This has direct relevance for translation research, as using unreliable evidence can lead to the failure of trials of nutritional interventions. Facilitated by the availability of large-scale genetic data, Mendelian randomization studies are increasingly used to ascertain the causal relevance of nutritional exposures and biomarkers for many NCDs. Methods: A narrative overview was conducted in order to demonstrate and describe the utility of Mendelian randomization studies, for individuals with little prior knowledge engaged in nutritional epidemiological research. Results: We provide an overview, rationale and basic description of the methods, as well as strengths and limitations of Mendelian randomization studies. We give selected examples from the contemporary nutritional literature where Mendelian randomization has provided useful evidence on the potential causal relevance of nutritional exposures. Conclusions: The selected exemplars demonstrate the importance of well-conducted Mendelian randomization studies as a robust tool to prioritize nutritional exposures for further investigation.
Collapse
Affiliation(s)
- Derrick A. Bennett
- Clinical Trial Service Unit and Epidemiological Studies Unit (CTSU), Nuffield Department of Population Health, University of Oxford, Oxford OX3 7LF, UK
- Medical Research Council Population Health Research Unit, University of Oxford, Oxford OX1 3QR, UK
| | - Huaidong Du
- Clinical Trial Service Unit and Epidemiological Studies Unit (CTSU), Nuffield Department of Population Health, University of Oxford, Oxford OX3 7LF, UK
- Medical Research Council Population Health Research Unit, University of Oxford, Oxford OX1 3QR, UK
| |
Collapse
|
9
|
Crespi B, Yang N. Three laws of teleonometrics. Biol J Linn Soc Lond 2022. [DOI: 10.1093/biolinnean/blac068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Abstract
We define teleonometrics as the theoretical and empirical study of teleonomy. We propose three laws for teleonometrics. The first law describes the hierarchical organization of teleonomic functions across biological levels from genes to individuals. According to this law, the number of goal-directed functions increases from individuals (one goal, maximizing inclusive fitness) to intermediate levels and to genes and alleles (myriad time-, space- and context-dependent goals, depending upon degrees and patterns of pleiotropy). The second law describes the operation of teleonomic functions under trade-offs, coadaptations and negative and positive pleiotropies, which are universal in biological systems. According to this law, the functions of an allele, gene or trait are described and defined by patterns of antagonistic (trading off) and compatible (coadapted) functions. The third law of teleonometrics is that the major transitions in evolution are driven by the origins of novel, emergent goals associated with functional changes and by the breaking and reshaping of trade-offs, especially by mechanisms involving increases in resources or time, and new divisions of labour or function. We illustrate the application of these laws using data from three empirical vignettes, which help to show the usefulness of teleonometric viewpoints for understanding the interfaces between function, trade-offs and dysfunctions manifest as disease.
Collapse
Affiliation(s)
- Bernard Crespi
- Department of Biological Sciences, Simon Fraser University , Burnaby, British Columbia, V5A 1S6 , Canada
| | - Nancy Yang
- Department of Biological Sciences, Simon Fraser University , Burnaby, British Columbia, V5A 1S6 , Canada
| |
Collapse
|
10
|
Heath L, Earls JC, Magis AT, Kornilov SA, Lovejoy JC, Funk CC, Rappaport N, Logsdon BA, Mangravite LM, Kunkle BW, Martin ER, Naj AC, Ertekin-Taner N, Golde TE, Hood L, Price ND. Manifestations of Alzheimer's disease genetic risk in the blood are evident in a multiomic analysis in healthy adults aged 18 to 90. Sci Rep 2022; 12:6117. [PMID: 35413975 PMCID: PMC9005657 DOI: 10.1038/s41598-022-09825-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Accepted: 03/23/2022] [Indexed: 01/18/2023] Open
Abstract
Genetics play an important role in late-onset Alzheimer's Disease (AD) etiology and dozens of genetic variants have been implicated in AD risk through large-scale GWAS meta-analyses. However, the precise mechanistic effects of most of these variants have yet to be determined. Deeply phenotyped cohort data can reveal physiological changes associated with genetic risk for AD across an age spectrum that may provide clues to the biology of the disease. We utilized over 2000 high-quality quantitative measurements obtained from blood of 2831 cognitively normal adult clients of a consumer-based scientific wellness company, each with CLIA-certified whole-genome sequencing data. Measurements included: clinical laboratory blood tests, targeted chip-based proteomics, and metabolomics. We performed a phenome-wide association study utilizing this diverse blood marker data and 25 known AD genetic variants and an AD-specific polygenic risk score (PGRS), adjusting for sex, age, vendor (for clinical labs), and the first four genetic principal components; sex-SNP interactions were also assessed. We observed statistically significant SNP-analyte associations for five genetic variants after correction for multiple testing (for SNPs in or near NYAP1, ABCA7, INPP5D, and APOE), with effects detectable from early adulthood. The ABCA7 SNP and the APOE2 and APOE4 encoding alleles were associated with lipid variability, as seen in previous studies; in addition, six novel proteins were associated with the e2 allele. The most statistically significant finding was between the NYAP1 variant and PILRA and PILRB protein levels, supporting previous functional genomic studies in the identification of a putative causal variant within the PILRA gene. We did not observe associations between the PGRS and any analyte. Sex modified the effects of four genetic variants, with multiple interrelated immune-modulating effects associated with the PICALM variant. In post-hoc analysis, sex-stratified GWAS results from an independent AD case-control meta-analysis supported sex-specific disease effects of the PICALM variant, highlighting the importance of sex as a biological variable. Known AD genetic variation influenced lipid metabolism and immune response systems in a population of non-AD individuals, with associations observed from early adulthood onward. Further research is needed to determine whether and how these effects are implicated in early-stage biological pathways to AD. These analyses aim to complement ongoing work on the functional interpretation of AD-associated genetic variants.
Collapse
Affiliation(s)
- Laura Heath
- Institute for Systems Biology, Seattle, WA, USA.
- Sage Bionetworks, Seattle, WA, USA.
| | - John C Earls
- Institute for Systems Biology, Seattle, WA, USA
- Thorne HealthTech, New York, NY, USA
| | | | | | | | - Cory C Funk
- Institute for Systems Biology, Seattle, WA, USA
| | | | | | | | - Brian W Kunkle
- John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, USA
- Dr. John T. Macdonald Foundation Department of Human Genetics, University of Miami Miller School of Medicine, Miami, FL, USA
| | - Eden R Martin
- John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, USA
- Dr. John T. Macdonald Foundation Department of Human Genetics, University of Miami Miller School of Medicine, Miami, FL, USA
| | - Adam C Naj
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
- Department of Pathology and Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Nilüfer Ertekin-Taner
- Department of Neurology, Mayo Clinic, Jacksonville, FL, USA
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
| | - Todd E Golde
- Department of Neuroscience, College of Medicine, McKnight Brain Institute, Center for Translational Research in Neurodegenerative Disease University of Florida, Gainesville, FL, USA
| | - Leroy Hood
- Institute for Systems Biology, Seattle, WA, USA
- Providence St. Joseph Health, Renton, WA, USA
| | - Nathan D Price
- Institute for Systems Biology, Seattle, WA, USA.
- Thorne HealthTech, New York, NY, USA.
| |
Collapse
|
11
|
New insights into pathogenesis of IgA nephropathy. Int Urol Nephrol 2022; 54:1873-1880. [DOI: 10.1007/s11255-021-03094-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2020] [Accepted: 12/08/2021] [Indexed: 10/19/2022]
|
12
|
Salvatore M, Gu T, Mack JA, Prabhu Sankar S, Patil S, Valley TS, Singh K, Nallamothu BK, Kheterpal S, Lisabeth L, Fritsche LG, Mukherjee B. A Phenome-Wide Association Study (PheWAS) of COVID-19 Outcomes by Race Using the Electronic Health Records Data in Michigan Medicine. J Clin Med 2021; 10:jcm10071351. [PMID: 33805886 PMCID: PMC8037108 DOI: 10.3390/jcm10071351] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Revised: 03/10/2021] [Accepted: 03/17/2021] [Indexed: 12/16/2022] Open
Abstract
Background: We performed a phenome-wide association study to identify pre-existing conditions related to Coronavirus disease 2019 (COVID-19) prognosis across the medical phenome and how they vary by race. Methods: The study is comprised of 53,853 patients who were tested/diagnosed for COVID-19 between 10 March and 2 September 2020 at a large academic medical center. Results: Pre-existing conditions strongly associated with hospitalization were renal failure, pulmonary heart disease, and respiratory failure. Hematopoietic conditions were associated with intensive care unit (ICU) admission/mortality and mental disorders were associated with mortality in non-Hispanic Whites. Circulatory system and genitourinary conditions were associated with ICU admission/mortality in non-Hispanic Blacks. Conclusions: Understanding pre-existing clinical diagnoses related to COVID-19 outcomes informs the need for targeted screening to support specific vulnerable populations to improve disease prevention and healthcare delivery.
Collapse
Affiliation(s)
- Maxwell Salvatore
- Department of Biostatistics, University of Michigan School of Public Health, 1415 Washington Heights, Ann Arbor, MI 48109, USA; (M.S.); (T.G.); (J.A.M.); (S.P.); (L.G.F.)
- Center for Precision Health Data Science, University of Michigan, Ann Arbor, MI 48109, USA;
- Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, MI 48109, USA;
| | - Tian Gu
- Department of Biostatistics, University of Michigan School of Public Health, 1415 Washington Heights, Ann Arbor, MI 48109, USA; (M.S.); (T.G.); (J.A.M.); (S.P.); (L.G.F.)
- Center for Precision Health Data Science, University of Michigan, Ann Arbor, MI 48109, USA;
| | - Jasmine A. Mack
- Department of Biostatistics, University of Michigan School of Public Health, 1415 Washington Heights, Ann Arbor, MI 48109, USA; (M.S.); (T.G.); (J.A.M.); (S.P.); (L.G.F.)
| | - Swaraaj Prabhu Sankar
- Center for Precision Health Data Science, University of Michigan, Ann Arbor, MI 48109, USA;
- Rogel Cancer Center, Michigan Medicine, Ann Arbor, MI 48109, USA
- Data Office for Clinical and Translational Research, University of Michigan, Ann Arbor, MI 41809, USA
| | - Snehal Patil
- Department of Biostatistics, University of Michigan School of Public Health, 1415 Washington Heights, Ann Arbor, MI 48109, USA; (M.S.); (T.G.); (J.A.M.); (S.P.); (L.G.F.)
- Center for Precision Health Data Science, University of Michigan, Ann Arbor, MI 48109, USA;
| | - Thomas S. Valley
- Division of Pulmonary and Critical Care Medicine, University of Michigan Medicine, Ann Arbor, MI 48109, USA;
- Department of Internal Medicine, Michigan Medicine, Ann Arbor, MI 48109, USA;
- Institute for Healthcare Policy and Innovation, University of Michigan, Ann Arbor, MI 48109, USA; (K.S.); (S.K.)
| | - Karandeep Singh
- Institute for Healthcare Policy and Innovation, University of Michigan, Ann Arbor, MI 48109, USA; (K.S.); (S.K.)
- Department of Learning Health Sciences, University of Michigan, Ann Arbor, MI 48109, USA
| | - Brahmajee K. Nallamothu
- Department of Internal Medicine, Michigan Medicine, Ann Arbor, MI 48109, USA;
- Division of Cardiovascular Medicine, Michigan Medicine, Ann Arbor, MI 48109, USA
| | - Sachin Kheterpal
- Institute for Healthcare Policy and Innovation, University of Michigan, Ann Arbor, MI 48109, USA; (K.S.); (S.K.)
- Department of Anesthesiology, Michigan Medicine, Ann Arbor, MI 48109, USA
| | - Lynda Lisabeth
- Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, MI 48109, USA;
| | - Lars G. Fritsche
- Department of Biostatistics, University of Michigan School of Public Health, 1415 Washington Heights, Ann Arbor, MI 48109, USA; (M.S.); (T.G.); (J.A.M.); (S.P.); (L.G.F.)
- Center for Precision Health Data Science, University of Michigan, Ann Arbor, MI 48109, USA;
- Rogel Cancer Center, Michigan Medicine, Ann Arbor, MI 48109, USA
- Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI 48109, USA
| | - Bhramar Mukherjee
- Department of Biostatistics, University of Michigan School of Public Health, 1415 Washington Heights, Ann Arbor, MI 48109, USA; (M.S.); (T.G.); (J.A.M.); (S.P.); (L.G.F.)
- Center for Precision Health Data Science, University of Michigan, Ann Arbor, MI 48109, USA;
- Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, MI 48109, USA;
- Correspondence: ; Tel.: +1-(734)-764-6544
| |
Collapse
|
13
|
Li R, Duan R, Zhang X, Lumley T, Pendergrass S, Bauer C, Hakonarson H, Carrell DS, Smoller JW, Wei WQ, Carroll R, Velez Edwards DR, Wiesner G, Sleiman P, Denny JC, Mosley JD, Ritchie MD, Chen Y, Moore JH. Lossless integration of multiple electronic health records for identifying pleiotropy using summary statistics. Nat Commun 2021; 12:168. [PMID: 33420026 PMCID: PMC7794298 DOI: 10.1038/s41467-020-20211-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Accepted: 11/13/2020] [Indexed: 11/22/2022] Open
Abstract
Increasingly, clinical phenotypes with matched genetic data from bio-bank linked electronic health records (EHRs) have been used for pleiotropy analyses. Thus far, pleiotropy analysis using individual-level EHR data has been limited to data from one site. However, it is desirable to integrate EHR data from multiple sites to improve the detection power and generalizability of the results. Due to privacy concerns, individual-level patients' data are not easily shared across institutions. As a result, we introduce Sum-Share, a method designed to efficiently integrate EHR and genetic data from multiple sites to perform pleiotropy analysis. Sum-Share requires only summary-level data and one round of communication from each site, yet it produces identical test statistics compared with that of pooled individual-level data. Consequently, Sum-Share can achieve lossless integration of multiple datasets. Using real EHR data from eMERGE, Sum-Share is able to identify 1734 potential pleiotropic SNPs for five cardiovascular diseases.
Collapse
Affiliation(s)
- Ruowang Li
- Department of Biostatistics, Epidemiology & Informatics, University of Pennsylvania, Philadelphia, PA, USA
| | - Rui Duan
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Xinyuan Zhang
- Department of Biostatistics, Epidemiology & Informatics, University of Pennsylvania, Philadelphia, PA, USA
| | - Thomas Lumley
- Department of Statistics, University of Auckland, Auckland, New Zealand
| | - Sarah Pendergrass
- Biomedical and Translational Informatics Institute, Geisinger, Danville, PA, USA
| | - Christopher Bauer
- Biomedical and Translational Informatics Institute, Geisinger, Danville, PA, USA
| | - Hakon Hakonarson
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - David S Carrell
- Kaiser Permanente Washington Health Research Institute, Seattle, WA, USA
| | - Jordan W Smoller
- Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Wei-Qi Wei
- Department of Biomedical Informatics, Vanderbilt University Medical Centre, Nashville, TN, USA
| | - Robert Carroll
- Department of Biomedical Informatics, Vanderbilt University Medical Centre, Nashville, TN, USA
| | - Digna R Velez Edwards
- Clinical and Translational Hereditary Cancer Program, Division of Genetic Medicine, Department of Medicine, Vanderbilt-Ingram Cancer Center, Vanderbilt University, Nashville, TN, USA
| | - Georgia Wiesner
- Clinical and Translational Hereditary Cancer Program, Division of Genetic Medicine, Department of Medicine, Vanderbilt-Ingram Cancer Center, Vanderbilt University, Nashville, TN, USA
| | - Patrick Sleiman
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Josh C Denny
- Department of Biomedical Informatics, Vanderbilt University Medical Centre, Nashville, TN, USA
| | - Jonathan D Mosley
- Department of Biomedical Informatics, Vanderbilt University Medical Centre, Nashville, TN, USA
| | - Marylyn D Ritchie
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Yong Chen
- Department of Biostatistics, Epidemiology & Informatics, University of Pennsylvania, Philadelphia, PA, USA.
| | - Jason H Moore
- Department of Biostatistics, Epidemiology & Informatics, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
14
|
Quan Y, Zhang QY, Lv BM, Xu RF, Zhang HY. Genome-wide pathogenesis interpretation using a heat diffusion-based systems genetics method and implications for gene function annotation. Mol Genet Genomic Med 2020; 8:e1456. [PMID: 32869547 PMCID: PMC7549611 DOI: 10.1002/mgg3.1456] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Revised: 07/08/2020] [Accepted: 07/27/2020] [Indexed: 12/27/2022] Open
Abstract
Background Genetics is best dedicated to interpreting pathogenesis and revealing gene functions. The past decade has witnessed unprecedented progress in genetics, particularly in genome‐wide identification of disorder variants through Genome‐Wide Association Studies (GWAS) and Phenome‐Wide Association Studies (PheWAS). However, it is still a great challenge to use GWAS/PheWAS‐derived data to elucidate pathogenesis. Methods In this study, we used HotNet2, a heat diffusion‐based systems genetics algorithm, to calculate the networks for disease genes obtained from GWAS and PheWAS, with an attempt to get deeper insights into disease pathogenesis at a molecular level. Results Through HotNet2 calculation, significant networks for 202 (for GWAS) and 167 (for PheWAS) types of diseases were identified and evaluated, respectively. The GWAS‐derived disease networks exhibit a stronger biomedical relevance than PheWAS counterparts. Therefore, the GWAS‐derived networks were used for pathogenesis interpretation by integrating the accumulated biomedical information. As a result, the pathogenesis for 64 diseases was elucidated in terms of mutation‐caused abnormal transcriptional regulation, and 47 diseases were preliminarily interpreted in terms of mutation‐caused varied protein‐protein interactions. In addition, 3,802 genes (including 46 function‐unknown genes) were assigned with new functions by disease network information, some of which were validated through mice gene knockout experiments. Conclusions Systems genetics algorithm HotNet2 can efficiently establish genotype‐phenotype links at the level of biological networks. Compared with original GWAS/PheWAS results, HotNet2‐calculated disease‐gene associations have stronger biomedical significance, hence provide better interpretations for the pathogenesis of genome‐wide variants, and offer new insights into gene functions as well. These results are also helpful in drug development.
Collapse
Affiliation(s)
- Yuan Quan
- School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen Graduate School, Shenzhen, China.,Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Qing-Ye Zhang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Bo-Min Lv
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Rui-Feng Xu
- School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen Graduate School, Shenzhen, China
| | - Hong-Yu Zhang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| |
Collapse
|
15
|
Beesley LJ, Salvatore M, Fritsche LG, Pandit A, Rao A, Brummett C, Willer CJ, Lisabeth LD, Mukherjee B. The emerging landscape of health research based on biobanks linked to electronic health records: Existing resources, statistical challenges, and potential opportunities. Stat Med 2020; 39:773-800. [PMID: 31859414 PMCID: PMC7983809 DOI: 10.1002/sim.8445] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Revised: 09/10/2019] [Accepted: 11/16/2019] [Indexed: 01/03/2023]
Abstract
Biobanks linked to electronic health records provide rich resources for health-related research. With improvements in administrative and informatics infrastructure, the availability and utility of data from biobanks have dramatically increased. In this paper, we first aim to characterize the current landscape of available biobanks and to describe specific biobanks, including their place of origin, size, and data types. The development and accessibility of large-scale biorepositories provide the opportunity to accelerate agnostic searches, expedite discoveries, and conduct hypothesis-generating studies of disease-treatment, disease-exposure, and disease-gene associations. Rather than designing and implementing a single study focused on a few targeted hypotheses, researchers can potentially use biobanks' existing resources to answer an expanded selection of exploratory questions as quickly as they can analyze them. However, there are many obvious and subtle challenges with the design and analysis of biobank-based studies. Our second aim is to discuss statistical issues related to biobank research such as study design, sampling strategy, phenotype identification, and missing data. We focus our discussion on biobanks that are linked to electronic health records. Some of the analytic issues are illustrated using data from the Michigan Genomics Initiative and UK Biobank, two biobanks with two different recruitment mechanisms. We summarize the current body of literature for addressing these challenges and discuss some standing open problems. This work complements and extends recent reviews about biobank-based research and serves as a resource catalog with analytical and practical guidance for statisticians, epidemiologists, and other medical researchers pursuing research using biobanks.
Collapse
Affiliation(s)
| | | | | | - Anita Pandit
- University of Michigan, Department of Biostatistics
| | - Arvind Rao
- University of Michigan, Department of Computational Medicine and Bioinformatics
| | - Chad Brummett
- University of Michigan, Department of Anesthesiology
| | - Cristen J. Willer
- University of Michigan, Department of Computational Medicine and Bioinformatics
| | | | | |
Collapse
|
16
|
Li R, Duan R, Kember RL, Rader DJ, Damrauer SM, Moore JH, Chen Y. A regression framework to uncover pleiotropy in large-scale electronic health record data. J Am Med Inform Assoc 2019; 26:1083-1090. [PMID: 31529123 DOI: 10.1093/jamia/ocz084] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2018] [Revised: 04/17/2019] [Accepted: 05/16/2019] [Indexed: 11/12/2022] Open
Abstract
OBJECTIVE Pleiotropy, where 1 genetic locus affects multiple phenotypes, can offer significant insights in understanding the complex genotype-phenotype relationship. Although individual genotype-phenotype associations have been thoroughly explored, seemingly unrelated phenotypes can be connected genetically through common pleiotropic loci or genes. However, current analyses of pleiotropy have been challenged by both methodologic limitations and a lack of available suitable data sources. MATERIALS AND METHODS In this study, we propose to utilize a new regression framework, reduced rank regression, to simultaneously analyze multiple phenotypes and genotypes to detect pleiotropic effects. We used a large-scale biobank linked electronic health record data from the Penn Medicine BioBank to select 5 cardiovascular diseases (hypertension, cardiac dysrhythmias, ischemic heart disease, congestive heart failure, and heart valve disorders) and 5 mental disorders (mood disorders; anxiety, phobic and dissociative disorders; alcohol-related disorders; neurological disorders; and delirium dementia) to validate our framework. RESULTS Compared with existing methods, reduced rank regression showed a higher power to distinguish known associated single-nucleotide polymorphisms from random single-nucleotide polymorphisms. In addition, genome-wide gene-based investigation of pleiotropy showed that reduced rank regression was able to identify candidate genetic variants with novel pleiotropic effects compared to existing methods. CONCLUSION The proposed regression framework offers a new approach to account for the phenotype and genotype correlations when identifying pleiotropic effects. By jointly modeling multiple phenotypes and genotypes together, the method has the potential to distinguish confounding from causal genotype and phenotype associations.
Collapse
Affiliation(s)
- Ruowang Li
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Rui Duan
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Rachel L Kember
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Corporal Michael J. Crescenz VA Medical Center, Philadelphia, Pennsylvania, USA
- Regeneron Genetics Center, Tarrytown, New York, USA
| | - Daniel J Rader
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Scott M Damrauer
- Corporal Michael J. Crescenz VA Medical Center, Philadelphia, Pennsylvania, USA
- Department of Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Jason H Moore
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Yong Chen
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
17
|
Gu HF. Genetic and Epigenetic Studies in Diabetic Kidney Disease. Front Genet 2019; 10:507. [PMID: 31231424 PMCID: PMC6566106 DOI: 10.3389/fgene.2019.00507] [Citation(s) in RCA: 55] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2018] [Accepted: 05/08/2019] [Indexed: 01/19/2023] Open
Abstract
Chronic kidney disease is a worldwide health crisis, while diabetic kidney disease (DKD) has become the leading cause of end-stage renal disease (ESRD). DKD is a microvascular complication and occurs in 30–40% of diabetes patients. Epidemiological investigations and clinical observations on the familial clustering and heritability in DKD have highlighted an underlying genetic susceptibility. Furthermore, DKD is a progressive and long-term diabetic complication, in which epigenetic effects and environmental factors interact with an individual’s genetic background. In recent years, researchers have undertaken genetic and epigenetic studies of DKD in order to better understand its molecular mechanisms. In this review, clinical material, research approaches and experimental designs that have been used for genetic and epigenetic studies of DKD are described. Current information from genetic and epigenetic studies of DKD and ESRD in patients with diabetes, including the approaches of genome-wide association study (GWAS) or epigenome-wide association study (EWAS) and candidate gene association analyses, are summarized. Further investigation of molecular defects in DKD with new approaches such as next generation sequencing analysis and phenome-wide association study (PheWAS) is also discussed.
Collapse
Affiliation(s)
- Harvest F Gu
- Center for Pathophysiology, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing, China
| |
Collapse
|
18
|
Abstract
The promise of personalized genomic medicine is that knowledge of a person's gene sequences and activity will facilitate more appropriate medical interventions, particularly drug prescriptions, to reduce the burden of disease. Early successes in oncology and pediatrics have affirmed the power of positive diagnosis and are mostly based on detection of one or a few mutations that drive the specific pathology. However, genetically more complex diseases require the development of polygenic risk scores (PRSs) that have variable accuracy. The rarity of events often means that they have necessarily low precision: many called positives are actually not at risk, and only a fraction of cases are prevented by targeted therapy. In some situations, negative prediction may better define the population at low risk. Here, I review five conditions across a broad spectrum of chronic disease (opioid pain medication, hypertension, type 2 diabetes, major depression, and osteoporotic bone fracture), considering in each case how genetic prediction might be used to target drug prescription. This leads to a call for more research designed to evaluate genetic likelihood of response to therapy and a call for evaluation of PRS, not just in terms of sensitivity and specificity but also with respect to potential clinical efficacy.
Collapse
Affiliation(s)
- Greg Gibson
- Center for Integrative Genomics and School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, United States of America
- * E-mail:
| |
Collapse
|
19
|
Zhang X, Basile AO, Pendergrass SA, Ritchie MD. Real world scenarios in rare variant association analysis: the impact of imbalance and sample size on the power in silico. BMC Bioinformatics 2019; 20:46. [PMID: 30669967 PMCID: PMC6343276 DOI: 10.1186/s12859-018-2591-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2018] [Accepted: 12/26/2018] [Indexed: 11/11/2022] Open
Abstract
Background The development of sequencing techniques and statistical methods provides great opportunities for identifying the impact of rare genetic variation on complex traits. However, there is a lack of knowledge on the impact of sample size, case numbers, the balance of cases vs controls for both burden and dispersion based rare variant association methods. For example, Phenome-Wide Association Studies may have a wide range of case and control sample sizes across hundreds of diagnoses and traits, and with the application of statistical methods to rare variants, it is important to understand the strengths and limitations of the analyses. Results We conducted a large-scale simulation of randomly selected low-frequency protein-coding regions using twelve different balanced samples with an equal number of cases and controls as well as twenty-one unbalanced sample scenarios. We further explored statistical performance of different minor allele frequency thresholds and a range of genetic effect sizes. Our simulation results demonstrate that using an unbalanced study design has an overall higher type I error rate for both burden and dispersion tests compared with a balanced study design. Regression has an overall higher type I error with balanced cases and controls, while SKAT has higher type I error for unbalanced case-control scenarios. We also found that both type I error and power were driven by the number of cases in addition to the case to control ratio under large control group scenarios. Based on our power simulations, we observed that a SKAT analysis with case numbers larger than 200 for unbalanced case-control models yielded over 90% power with relatively well controlled type I error. To achieve similar power in regression, over 500 cases are needed. Moreover, SKAT showed higher power to detect associations in unbalanced case-control scenarios than regression. Conclusions Our results provide important insights into rare variant association study designs by providing a landscape of type I error and statistical power for a wide range of sample sizes. These results can serve as a benchmark for making decisions about study design for rare variant analyses. Electronic supplementary material The online version of this article (10.1186/s12859-018-2591-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Xinyuan Zhang
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Anna O Basile
- Department of Biomedical Informatics, Columbia University, New York, NY, USA
| | - Sarah A Pendergrass
- Biomedical and Translational Informatics Institute, Geisinger, Danville, PA, USA
| | - Marylyn D Ritchie
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA. .,Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, USA.
| |
Collapse
|
20
|
Precision medicine review: rare driver mutations and their biophysical classification. Biophys Rev 2019; 11:5-19. [PMID: 30610579 PMCID: PMC6381362 DOI: 10.1007/s12551-018-0496-2] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2018] [Accepted: 12/18/2018] [Indexed: 02/07/2023] Open
Abstract
How can biophysical principles help precision medicine identify rare driver mutations? A major tenet of pragmatic approaches to precision oncology and pharmacology is that driver mutations are very frequent. However, frequency is a statistical attribute, not a mechanistic one. Rare mutations can also act through the same mechanism, and as we discuss below, “latent driver” mutations may also follow the same route, with “helper” mutations. Here, we review how biophysics provides mechanistic guidelines that extend precision medicine. We outline principles and strategies, especially focusing on mutations that drive cancer. Biophysics has contributed profoundly to deciphering biological processes. However, driven by data science, precision medicine has skirted some of its major tenets. Data science embodies genomics, tissue- and cell-specific expression levels, making it capable of defining genome- and systems-wide molecular disease signatures. It classifies cancer driver genes/mutations and affected pathways, and its associated protein structural data guide drug discovery. Biophysics complements data science. It considers structures and their heterogeneous ensembles, explains how mutational variants can signal through distinct pathways, and how allo-network drugs can be harnessed. Biophysics clarifies how one mutation—frequent or rare—can affect multiple phenotypic traits by populating conformations that favor interactions with other network modules. It also suggests how to identify such mutations and their signaling consequences. Biophysics offers principles and strategies that can help precision medicine push the boundaries to transform our insight into biological processes and the practice of personalized medicine. By contrast, “phenotypic drug discovery,” which capitalizes on physiological cellular conditions and first-in-class drug discovery, may not capture the proper molecular variant. This is because variants of the same protein can express more than one phenotype, and a phenotype can be encoded by several variants.
Collapse
|
21
|
Genomic and Phenomic Research in the 21st Century. Trends Genet 2018; 35:29-41. [PMID: 30342790 DOI: 10.1016/j.tig.2018.09.007] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2018] [Revised: 09/24/2018] [Accepted: 09/25/2018] [Indexed: 02/06/2023]
Abstract
The field of human genomics has changed dramatically over time. Initial genomic studies were predominantly restricted to rare disorders in small families. Over the past decade, researchers changed course from family-based studies and instead focused on common diseases and traits in populations of unrelated individuals. With further advancements in biobanking, computer science, electronic health record (EHR) data, and more affordable high-throughput genomics, we are experiencing a new paradigm in human genomic research. Rapidly changing technologies and resources now make it possible to study thousands of diseases simultaneously at the genomic level. This review will focus on these advancements as scientists begin to incorporate phenome-wide strategies in human genomic research to understand the etiology of human diseases and develop new drugs to treat them.
Collapse
|
22
|
Basile AO, Ritchie MD. Informatics and machine learning to define the phenotype. Expert Rev Mol Diagn 2018; 18:219-226. [PMID: 29431517 DOI: 10.1080/14737159.2018.1439380] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
INTRODUCTION For the past decade, the focus of complex disease research has been the genotype. From technological advancements to the development of analysis methods, great progress has been made. However, advances in our definition of the phenotype have remained stagnant. Phenotype characterization has recently emerged as an exciting area of informatics and machine learning. The copious amounts of diverse biomedical data that have been collected may be leveraged with data-driven approaches to elucidate trait-related features and patterns. Areas covered: In this review, the authors discuss the phenotype in traditional genetic associations and the challenges this has imposed.Approaches for phenotype refinement that can aid in more accurate characterization of traits are also discussed. Further, the authors highlight promising machine learning approaches for establishing a phenotype and the challenges of electronic health record (EHR)-derived data. Expert commentary: The authors hypothesize that through unsupervised machine learning, data-driven approaches can be used to define phenotypes rather than relying on expert clinician knowledge. Through the use of machine learning and an unbiased set of features extracted from clinical repositories, researchers will have the potential to further understand complex traits and identify patient subgroups. This knowledge may lead to more preventative and precise clinical care.
Collapse
Affiliation(s)
- Anna Okula Basile
- a Department of Biochemistry and Molecular Biology , The Pennsylvania State University , State College , PA , USA
| | - Marylyn DeRiggi Ritchie
- a Department of Biochemistry and Molecular Biology , The Pennsylvania State University , State College , PA , USA.,b Department of Genetics , University of Pennsylvania, Perelman School of Medicine , Philadelphia , PA , USA
| |
Collapse
|
23
|
Pasipoularides A. Implementing genome-driven personalized cardiology in clinical practice. J Mol Cell Cardiol 2018; 115:142-157. [PMID: 29343412 PMCID: PMC5820118 DOI: 10.1016/j.yjmcc.2018.01.008] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/16/2017] [Revised: 01/04/2018] [Accepted: 01/12/2018] [Indexed: 12/18/2022]
Abstract
Genomics designates the coordinated investigation of a large number of genes in the context of a biological process or disease. It may be long before we attain comprehensive understanding of the genomics of common complex cardiovascular diseases (CVDs) such as inherited cardiomyopathies, valvular diseases, primary arrhythmogenic conditions, congenital heart syndromes, hypercholesterolemia and atherosclerotic heart disease, hypertensive syndromes, and heart failure with preserved/reduced ejection fraction. Nonetheless, as genomics is evolving rapidly, it is constructive to survey now pertinent concepts and breakthroughs. Today, clinical multimodal electronic medical/health records (EMRs/EHRs) incorporating genomic information establish a continuously-learning, vast knowledge-network with seamless cycling between clinical application and research. It can inform insights into specific pathogenetic pathways, guide biomarker-assisted precise diagnoses and individualized treatments, and stratify prognoses. Complex CVDs blend multiple interacting genomic variants, epigenetics, and environmental risk-factors, engendering progressions of multifaceted disease-manifestations, including clinical symptoms and signs. There is no straight-line linkage between genetic cause(s) or causal gene-variant(s) and disease phenotype(s). Because of interactions involving modifier-gene influences, (micro)-environmental, and epigenetic effects, the same variant may actually produce dissimilar abnormalities in different individuals. Implementing genome-driven personalized cardiology in clinical practice reveals that the study of CVDs at the level of molecules and cells can yield crucial clinical benefits. Complementing evidence-based medicine guidelines from large ("one-size fits all") randomized controlled trials, genomics-based personalized or precision cardiology is a most-creditable paradigm: It provides customizable approaches to prevent, diagnose, and manage CVDs with treatments directly/precisely aimed at causal defects identified by high-throughput genomic technologies. They encompass stem cell and gene therapies exploiting CRISPR-Cas9-gene-editing, and metabolomic-pharmacogenomic therapeutic modalities, precisely fine-tuned for the individual patient. Following the Human Genome Project, many expected genomics technology to provide imminent solutions to intractable medical problems, including CVDs. This eagerness has reaped some disappointment that advances have not yet materialized to the degree anticipated. Undoubtedly, personalized genetic/genomics testing is an emergent technology that should not be applied without supplementary phenotypic/clinical information: Genotype≠Phenotype. However, forthcoming advances in genomics will naturally build on prior attainments and, combined with insights into relevant epigenetics and environmental factors, can plausibly eradicate intractable CVDs, improving human health and well-being.
Collapse
Affiliation(s)
- Ares Pasipoularides
- Consulting Professor of Surgery, Emeritus Faculty of Surgery and of Biomedical Engineering, Duke University School of Medicine and Graduate School, Durham, NC 27710, USA.
| |
Collapse
|
24
|
Abstract
PURPOSE OF REVIEW Over many decades, researchers have been designing studies to investigate the relationship between genotypes and phenotypes to gain an understanding about the effect of genetics on disease. Recently, a high-throughput approach called phenome-wide associations studies (PheWAS) have been extensively used to identify associations between genetic variants and many diseases and traits simultaneously. In this review, we describe the value of PheWAS along with methodological issues and challenges in interpretation for current applications of PheWAS. RECENT FINDINGS PheWAS have uncovered a paradigm to identify new associations for genetic loci across many diseases. The application of PheWAS have been effective with phenotype data from electronic health records, epidemiological studies, and clinical trials data. SUMMARY The key strength of a PheWAS is to identify the association of one or more genetic variants with multiple phenotypes, which can showcase interconnections among the phenotypes due to shared genetic associations. While the PheWAS approach appears promising, there are a number of challenges that need to be addressed to provide additional robustness to PheWAS findings.
Collapse
Affiliation(s)
- Anurag Verma
- Biomedical and Translational Informatics Institute, Geisinger Health System, Danville, PA
- The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA
| | - Marylyn D Ritchie
- Biomedical and Translational Informatics Institute, Geisinger Health System, Danville, PA
- The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA
| |
Collapse
|
25
|
Multiphenotype association study of patients randomized to initiate antiretroviral regimens in AIDS Clinical Trials Group protocol A5202. Pharmacogenet Genomics 2017; 27:101-111. [PMID: 28099408 PMCID: PMC5285297 DOI: 10.1097/fpc.0000000000000263] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Supplemental Digital Content is available in the text. Background High-throughput approaches are increasingly being used to identify genetic associations across multiple phenotypes simultaneously. Here, we describe a pilot analysis that considered multiple on-treatment laboratory phenotypes from antiretroviral therapy-naive patients who were randomized to initiate antiretroviral regimens in a prospective clinical trial, AIDS Clinical Trials Group protocol A5202. Participants and methods From among 5 9545 294 polymorphisms imputed genome-wide, we analyzed 2544, including 2124 annotated in the PharmGKB, and 420 previously associated with traits in the GWAS Catalog. We derived 774 phenotypes on the basis of context from six variables: plasma atazanavir (ATV) pharmacokinetics, plasma efavirenz (EFV) pharmacokinetics, change in the CD4+ T-cell count, HIV-1 RNA suppression, fasting low-density lipoprotein-cholesterol, and fasting triglycerides. Permutation testing assessed the likelihood of associations being by chance alone. Pleiotropy was assessed for polymorphisms with the lowest P-values. Results This analysis included 1181 patients. At P less than 1.5×10−4, most associations were not by chance alone. Polymorphisms with the lowest P-values for EFV pharmacokinetics (CYPB26 rs3745274), low-density lipoprotein -cholesterol (APOE rs7412), and triglyceride (APOA5 rs651821) phenotypes had been associated previously with those traits in previous studies. The association between triglycerides and rs651821 was present with ATV-containing regimens, but not with EFV-containing regimens. Polymorphisms with the lowest P-values for ATV pharmacokinetics, CD4 T-cell count, and HIV-1 RNA phenotypes had not been reported previously to be associated with that trait. Conclusion Using data from a prospective HIV clinical trial, we identified expected genetic associations, potentially novel associations, and at least one context-dependent association. This study supports high-throughput strategies that simultaneously explore multiple phenotypes from clinical trials’ datasets for genetic associations.
Collapse
|
26
|
Zhang YP, Zhang YY, Duan DD. From Genome-Wide Association Study to Phenome-Wide Association Study: New Paradigms in Obesity Research. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2016; 140:185-231. [PMID: 27288830 DOI: 10.1016/bs.pmbts.2016.02.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Obesity is a condition in which excess body fat has accumulated over an extent that increases the risk of many chronic diseases. The current clinical classification of obesity is based on measurement of body mass index (BMI), waist-hip ratio, and body fat percentage. However, these measurements do not account for the wide individual variations in fat distribution, degree of fatness or health risks, and genetic variants identified in the genome-wide association studies (GWAS). In this review, we will address this important issue with the introduction of phenome, phenomics, and phenome-wide association study (PheWAS). We will discuss the new paradigm shift from GWAS to PheWAS in obesity research. In the era of precision medicine, phenomics and PheWAS provide the required approaches to better definition and classification of obesity according to the association of obese phenome with their unique molecular makeup, lifestyle, and environmental impact.
Collapse
Affiliation(s)
- Y-P Zhang
- Pediatric Heart Center, Beijing Anzhen Hospital, Capital Medical University, Beijing, China
| | - Y-Y Zhang
- Department of Cardiology, Changzhou Second People's Hospital, Changzhou, Jiangsu, China
| | - D D Duan
- Laboratory of Cardiovascular Phenomics, Center for Cardiovascular Research, Department of Pharmacology, and Center for Molecular Medicine, University of Nevada School of Medicine, Reno, NV, United States.
| |
Collapse
|
27
|
Biological findings from the PheWAS catalog: focus on connective tissue-related disorders (pelvic floor dysfunction, abdominal hernia, varicose veins and hemorrhoids). Hum Genet 2016; 135:779-95. [DOI: 10.1007/s00439-016-1672-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2016] [Accepted: 04/17/2016] [Indexed: 01/31/2023]
|
28
|
Novel Biomarkers and Treatments of Cardiac Diseases. BIOMED RESEARCH INTERNATIONAL 2016; 2016:1315627. [PMID: 26989677 PMCID: PMC4773528 DOI: 10.1155/2016/1315627] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 12/24/2015] [Accepted: 12/24/2015] [Indexed: 12/01/2022]
|
29
|
Unravelling the human genome-phenome relationship using phenome-wide association studies. Nat Rev Genet 2016; 17:129-45. [PMID: 26875678 DOI: 10.1038/nrg.2015.36] [Citation(s) in RCA: 182] [Impact Index Per Article: 20.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Advances in genotyping technology have, over the past decade, enabled the focused search for common genetic variation associated with human diseases and traits. With the recently increased availability of detailed phenotypic data from electronic health records and epidemiological studies, the impact of one or more genetic variants on the phenome is starting to be characterized both in clinical and population-based settings using phenome-wide association studies (PheWAS). These studies reveal a number of challenges that will need to be overcome to unlock the full potential of PheWAS for the characterization of the complex human genome-phenome relationship.
Collapse
|
30
|
Tenenbaum JD. Translational Bioinformatics: Past, Present, and Future. GENOMICS PROTEOMICS & BIOINFORMATICS 2016; 14:31-41. [PMID: 26876718 PMCID: PMC4792852 DOI: 10.1016/j.gpb.2016.01.003] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/16/2015] [Accepted: 01/20/2016] [Indexed: 02/04/2023]
Abstract
Though a relatively young discipline, translational bioinformatics (TBI) has become a key component of biomedical research in the era of precision medicine. Development of high-throughput technologies and electronic health records has caused a paradigm shift in both healthcare and biomedical research. Novel tools and methods are required to convert increasingly voluminous datasets into information and actionable knowledge. This review provides a definition and contextualization of the term TBI, describes the discipline’s brief history and past accomplishments, as well as current foci, and concludes with predictions of future directions in the field.
Collapse
Affiliation(s)
- Jessica D Tenenbaum
- Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC 27710, USA.
| |
Collapse
|
31
|
Millard LAC, Davies NM, Timpson NJ, Tilling K, Flach PA, Davey Smith G. MR-PheWAS: hypothesis prioritization among potential causal effects of body mass index on many outcomes, using Mendelian randomization. Sci Rep 2015; 5:16645. [PMID: 26568383 PMCID: PMC4644974 DOI: 10.1038/srep16645] [Citation(s) in RCA: 72] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2015] [Accepted: 10/16/2015] [Indexed: 01/03/2023] Open
Abstract
Observational cohort studies can provide rich datasets with a diverse range of phenotypic variables. However, hypothesis-driven epidemiological analyses by definition only test particular hypotheses chosen by researchers. Furthermore, observational analyses may not provide robust evidence of causality, as they are susceptible to confounding, reverse causation and measurement error. Using body mass index (BMI) as an exemplar, we demonstrate a novel extension to the phenome-wide association study (pheWAS) approach, using automated screening with genotypic instruments to screen for causal associations amongst any number of phenotypic outcomes. We used a sample of 8,121 children from the ALSPAC dataset, and tested the linear association of a BMI-associated allele score with 172 phenotypic outcomes (with variable sample sizes). We also performed an instrumental variable analysis to estimate the causal effect of BMI on each phenotype. We found 21 of the 172 outcomes were associated with the allele score at an unadjusted p < 0.05 threshold, and use Bonferroni corrections, permutation testing and estimates of the false discovery rate to consider the strength of results given the number of tests performed. The most strongly associated outcomes included leptin, lipid profile, and blood pressure. We also found novel evidence of effects of BMI on a global self-worth score.
Collapse
Affiliation(s)
- Louise A C Millard
- MRC Integrative Epidemiology Unit (IEU) at the University of Bristol, University of Bristol, Bristol.,Intelligent Systems Laboratory, Department of Computer Science, University of Bristol, UK
| | - Neil M Davies
- MRC Integrative Epidemiology Unit (IEU) at the University of Bristol, University of Bristol, Bristol
| | - Nic J Timpson
- MRC Integrative Epidemiology Unit (IEU) at the University of Bristol, University of Bristol, Bristol
| | - Kate Tilling
- MRC Integrative Epidemiology Unit (IEU) at the University of Bristol, University of Bristol, Bristol
| | - Peter A Flach
- MRC Integrative Epidemiology Unit (IEU) at the University of Bristol, University of Bristol, Bristol.,Intelligent Systems Laboratory, Department of Computer Science, University of Bristol, UK
| | - George Davey Smith
- MRC Integrative Epidemiology Unit (IEU) at the University of Bristol, University of Bristol, Bristol
| |
Collapse
|
32
|
Pendergrass SA, Verma A, Okula A, Hall MA, Crawford DC, Ritchie MD. Phenome-Wide Association Studies: Embracing Complexity for Discovery. Hum Hered 2015. [PMID: 26201697 DOI: 10.1159/000381851] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
The inherent complexity of biological systems can be leveraged for a greater understanding of the impact of genetic architecture on outcomes, traits, and pharmacological response. The genome-wide association study (GWAS) approach has well-developed methods and relatively straight-forward methodologies; however, the bigger picture of the impact of genetic architecture on phenotypic outcome still remains to be elucidated even with an ever-growing number of GWAS performed. Greater consideration of the complexity of biological processes, using more data from the phenome, exposome, and diverse -omic resources, including considering the interplay of pleiotropy and genetic interactions, may provide additional leverage for making the most of the incredible wealth of information available for study. Here, we describe how incorporating greater complexity into analyses through the use of additional phenotypic data and widespread deployment of phenome-wide association studies may provide new insights into genetic factors influencing diseases, traits, and pharmacological response.
Collapse
Affiliation(s)
- Sarah A Pendergrass
- Biomedical and Translational Informatics Program, Geisinger Health System, Danville, Pa., USA
| | | | | | | | | | | |
Collapse
|
33
|
Tyler AL, Crawford DC, Pendergrass SA. The detection and characterization of pleiotropy: discovery, progress, and promise. Brief Bioinform 2015. [PMID: 26223525 DOI: 10.1093/bib/bbv050] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
The impact of a single genetic locus on multiple phenotypes, or pleiotropy, is an important area of research. Biological systems are dynamic complex networks, and these networks exist within and between cells. In humans, the consideration of multiple phenotypes such as physiological traits, clinical outcomes and drug response, in the context of genetic variation, can provide ways of developing a more complete understanding of the complex relationships between genetic architecture and how biological systems function in health and disease. In this article, we describe recent studies exploring the relationships between genetic loci and more than one phenotype. We also cover methodological developments incorporating pleiotropy applied to model organisms as well as humans, and discuss how stepping beyond the analysis of a single phenotype leads to a deeper understanding of complex genetic architecture.
Collapse
|