1
|
Martínez-Magaña JJ, Hurtado-Soriano J, Rivero-Segura NA, Montalvo-Ortiz JL, Garcia-delaTorre P, Becerril-Rojas K, Gomez-Verjan JC. Towards a Novel Frontier in the Use of Epigenetic Clocks in Epidemiology. Arch Med Res 2024; 55:103033. [PMID: 38955096 DOI: 10.1016/j.arcmed.2024.103033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Revised: 05/10/2024] [Accepted: 06/17/2024] [Indexed: 07/04/2024]
Abstract
Health problems associated with aging are a major public health concern for the future. Aging is a complex process with wide intervariability among individuals. Therefore, there is a need for innovative public health strategies that target factors associated with aging and the development of tools to assess the effectiveness of these strategies accurately. Novel approaches to measure biological age, such as epigenetic clocks, have become relevant. These clocks use non-sequential variable information from the genome and employ mathematical algorithms to estimate biological age based on DNA methylation levels. Therefore, in the present study, we comprehensively review the current status of the epigenetic clocks and their associations across the human phenome. We emphasize the potential utility of these tools in an epidemiological context, particularly in evaluating the impact of public health interventions focused on promoting healthy aging. Our review describes associations between epigenetic clocks and multiple traits across the life and health span. Additionally, we highlighted the evolution of studies beyond mere associations to establish causal mechanisms between epigenetic age and disease. We explored the application of epigenetic clocks to measure the efficacy of interventions focusing on rejuvenation.
Collapse
Affiliation(s)
- José Jaime Martínez-Magaña
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA; U.S. Department of Veterans Affairs National Center for Post-Traumatic Stress Disorder, Clinical Neuroscience Division, West Haven, CT, USA; VA Connecticut Healthcare System, West Haven, CT, USA
| | | | | | - Janitza L Montalvo-Ortiz
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA; U.S. Department of Veterans Affairs National Center for Post-Traumatic Stress Disorder, Clinical Neuroscience Division, West Haven, CT, USA; VA Connecticut Healthcare System, West Haven, CT, USA
| | - Paola Garcia-delaTorre
- Unidad de Investigación Epidemiológica y en Servicios de Salud, Área de Envejecimiento, Centro Médico Nacional, Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
| | | | | |
Collapse
|
2
|
Dapas M, Lee YL, Wentworth-Sheilds W, Im HK, Ober C, Schoettler N. Revealing polygenic pleiotropy using genetic risk scores for asthma. HGG ADVANCES 2023; 4:100233. [PMID: 37663543 PMCID: PMC10474095 DOI: 10.1016/j.xhgg.2023.100233] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 08/11/2023] [Indexed: 09/05/2023] Open
Abstract
In this study we examined how genetic risk for asthma associates with different features of the disease and with other medical conditions and traits. Using summary statistics from two multi-ancestry genome-wide association studies of asthma, we modeled polygenic risk scores (PRSs) and validated their predictive performance in the UK Biobank. We then performed phenome-wide association studies of the asthma PRSs with 371 heritable traits in the UK Biobank. We identified 228 total significant associations across a variety of organ systems, including associations that varied by PRS model, sex, age of asthma onset, ancestry, and human leukocyte antigen region alleles. Our results highlight pervasive pleiotropy between asthma and numerous other traits and conditions and elucidate pathways that contribute to asthma and its comorbidities.
Collapse
Affiliation(s)
- Matthew Dapas
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | - Yu Lin Lee
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
- Biological Sciences Collegiate Division, University of Chicago, Chicago, IL, USA
| | | | - Hae Kyung Im
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL, USA
| | - Carole Ober
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | - Nathan Schoettler
- Section of Pulmonary and Critical Care Medicine, Department of Medicine, University of Chicago, Chicago, IL, USA
| |
Collapse
|
3
|
Wang Y, Namba S, Lopera E, Kerminen S, Tsuo K, Läll K, Kanai M, Zhou W, Wu KH, Favé MJ, Bhatta L, Awadalla P, Brumpton B, Deelen P, Hveem K, Lo Faro V, Mägi R, Murakami Y, Sanna S, Smoller JW, Uzunovic J, Wolford BN, Willer C, Gamazon ER, Cox NJ, Surakka I, Okada Y, Martin AR, Hirbo J. Global Biobank analyses provide lessons for developing polygenic risk scores across diverse cohorts. CELL GENOMICS 2023; 3:100241. [PMID: 36777179 PMCID: PMC9903818 DOI: 10.1016/j.xgen.2022.100241] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Revised: 08/28/2022] [Accepted: 12/03/2022] [Indexed: 01/06/2023]
Abstract
Polygenic risk scores (PRSs) have been widely explored in precision medicine. However, few studies have thoroughly investigated their best practices in global populations across different diseases. We here utilized data from Global Biobank Meta-analysis Initiative (GBMI) to explore methodological considerations and PRS performance in 9 different biobanks for 14 disease endpoints. Specifically, we constructed PRSs using pruning and thresholding (P + T) and PRS-continuous shrinkage (CS). For both methods, using a European-based linkage disequilibrium (LD) reference panel resulted in comparable or higher prediction accuracy compared with several other non-European-based panels. PRS-CS overall outperformed the classic P + T method, especially for endpoints with higher SNP-based heritability. Notably, prediction accuracy is heterogeneous across endpoints, biobanks, and ancestries, especially for asthma, which has known variation in disease prevalence across populations. Overall, we provide lessons for PRS construction, evaluation, and interpretation using GBMI resources and highlight the importance of best practices for PRS in the biobank-scale genomics era.
Collapse
Affiliation(s)
- Ying Wang
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Shinichi Namba
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita 565-0871, Japan
| | - Esteban Lopera
- Department of Genetics, UMCG, University of Groningen, Groningen, the Netherlands
| | - Sini Kerminen
- Institute for Molecular Medicine Finland, FIMM, HiLIFE, University of Helsinki, Helsinki, Finland
| | - Kristin Tsuo
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Kristi Läll
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Masahiro Kanai
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita 565-0871, Japan
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Wei Zhou
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Kuan-Han Wu
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48103, USA
| | | | - Laxmi Bhatta
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, 7030 Trondheim, Norway
| | - Philip Awadalla
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Ben Brumpton
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, 7030 Trondheim, Norway
- HUNT Research Centre, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, 7600 Levanger, Norway
- Clinic of Medicine, St. Olav’s Hospital, Trondheim University Hospital, 7030 Trondheim, Norway
| | - Patrick Deelen
- Department of Genetics, UMCG, University of Groningen, Groningen, the Netherlands
- Oncode Institute, Utrecht, the Netherlands
| | - Kristian Hveem
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, 7030 Trondheim, Norway
- HUNT Research Centre, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, 7600 Levanger, Norway
| | - Valeria Lo Faro
- Department of Ophthalmology, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands
- Department of Clinical Genetics, Amsterdam University Medical Center (AMC), Amsterdam, the Netherlands
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Reedik Mägi
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Yoshinori Murakami
- Division of Molecular Pathology, Institute of Medical Science, the University of Tokyo, Tokyo, Japan
| | - Serena Sanna
- Department of Genetics, UMCG, University of Groningen, Groningen, the Netherlands
- Institute for Genetics and Biomedical Research (IRGB), National Research Council (CNR), 09100 Cagliari, Italy
| | - Jordan W. Smoller
- Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
| | | | - Brooke N. Wolford
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48103, USA
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, 7030 Trondheim, Norway
| | - Cristen Willer
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, 7030 Trondheim, Norway
- Department of Internal Medicine, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Biostatistics and Center for Statistical Genetics, and Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Eric R. Gamazon
- Department of Medicine, Division of Genetic Medicine, Vanderbilt University School of Medicine, Nashville, TN, USA
- MRC Epidemiology Unit, University of Cambridge, Cambridge, UK
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Nancy J. Cox
- Department of Medicine, Division of Genetic Medicine, Vanderbilt University School of Medicine, Nashville, TN, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Ida Surakka
- Department of Internal Medicine, University of Michigan, Ann Arbor, MI 48109, USA
| | - Yukinori Okada
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita 565-0871, Japan
- Laboratory for Systems Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC) and Center for Infectious Disease Education and Research (CiDER), Osaka University, Suita 565-0871, Japan
- Department of Genome Informatics, Graduate School of Medicine, the University of Tokyo, Tokyo 113-0033, Japan
| | - Alicia R. Martin
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Jibril Hirbo
- Department of Medicine, Division of Genetic Medicine, Vanderbilt University School of Medicine, Nashville, TN, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
| |
Collapse
|
4
|
Estiri H, Strasser ZH, Murphy SN. High-throughput phenotyping with temporal sequences. J Am Med Inform Assoc 2021; 28:772-781. [PMID: 33313899 DOI: 10.1093/jamia/ocaa288] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Accepted: 11/04/2020] [Indexed: 12/15/2022] Open
Abstract
OBJECTIVE High-throughput electronic phenotyping algorithms can accelerate translational research using data from electronic health record (EHR) systems. The temporal information buried in EHRs is often underutilized in developing computational phenotypic definitions. This study aims to develop a high-throughput phenotyping method, leveraging temporal sequential patterns from EHRs. MATERIALS AND METHODS We develop a representation mining algorithm to extract 5 classes of representations from EHR diagnosis and medication records: the aggregated vector of the records (aggregated vector representation), the standard sequential patterns (sequential pattern mining), the transitive sequential patterns (transitive sequential pattern mining), and 2 hybrid classes. Using EHR data on 10 phenotypes from the Mass General Brigham Biobank, we train and validate phenotyping algorithms. RESULTS Phenotyping with temporal sequences resulted in a superior classification performance across all 10 phenotypes compared with the standard representations in electronic phenotyping. The high-throughput algorithm's classification performance was superior or similar to the performance of previously published electronic phenotyping algorithms. We characterize and evaluate the top transitive sequences of diagnosis records paired with the records of risk factors, symptoms, complications, medications, or vaccinations. DISCUSSION The proposed high-throughput phenotyping approach enables seamless discovery of sequential record combinations that may be difficult to assume from raw EHR data. Transitive sequences offer more accurate characterization of the phenotype, compared with its individual components, and reflect the actual lived experiences of the patients with that particular disease. CONCLUSION Sequential data representations provide a precise mechanism for incorporating raw EHR records into downstream machine learning. Our approach starts with user interpretability and works backward to the technology.
Collapse
Affiliation(s)
- Hossein Estiri
- Harvard Medical School, Boston, Massachusetts, USA.,Massachusetts General Hospital, Boston, Massachusetts, USA.,Mass General Brigham, Boston, Massachusetts, USA
| | - Zachary H Strasser
- Harvard Medical School, Boston, Massachusetts, USA.,Massachusetts General Hospital, Boston, Massachusetts, USA.,Mass General Brigham, Boston, Massachusetts, USA
| | - Shawn N Murphy
- Harvard Medical School, Boston, Massachusetts, USA.,Massachusetts General Hospital, Boston, Massachusetts, USA.,Mass General Brigham, Boston, Massachusetts, USA
| |
Collapse
|
5
|
Alekseyenko AV, Hamidi B, Faith TD, Crandall KA, Powers JG, Metts CL, Madory JE, Carroll SL, Obeid JS, Lenert LA. Each patient is a research biorepository: informatics-enabled research on surplus clinical specimens via the living BioBank. J Am Med Inform Assoc 2021; 28:138-143. [PMID: 33166379 PMCID: PMC7810447 DOI: 10.1093/jamia/ocaa236] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Accepted: 09/09/2020] [Indexed: 02/02/2023] Open
Abstract
The ability to analyze human specimens is the pillar of modern-day translational research. To enhance the research availability of relevant clinical specimens, we developed the Living BioBank (LBB) solution, which allows for just-in-time capture and delivery of phenotyped surplus laboratory medicine specimens. The LBB is a system-of-systems integrating research feasibility databases in i2b2, a real-time clinical data warehouse, and an informatics system for institutional research services management (SPARC). LBB delivers deidentified clinical data and laboratory specimens. We further present an extension to our solution, the Living µBiome Bank, that allows the user to request and receive phenotyped specimen microbiome data. We discuss the details of the implementation of the LBB system and the necessary regulatory oversight for this solution. The conducted institutional focus group of translational investigators indicates an overall positive sentiment towards potential scientific results generated with the use of LBB. Reference implementation of LBB is available at https://LivingBioBank.musc.edu.
Collapse
Affiliation(s)
- Alexander V Alekseyenko
- Biomedical Informatics Center, Medical University of South Carolina, Charleston, South Carolina, USA.,Department of Public Health Sciences, Medical University of South Carolina, Charleston, South Carolina, USA.,Department of Oral Health Sciences, Medical University of South Carolina, Charleston, South Carolina, USA.,Department of Healthcare Leadership and Management, Medical University of South Carolina, Charleston, South Carolina, USA
| | - Bashir Hamidi
- Biomedical Informatics Center, Medical University of South Carolina, Charleston, South Carolina, USA
| | - Trevor D Faith
- Biomedical Informatics Center, Medical University of South Carolina, Charleston, South Carolina, USA
| | - Keith A Crandall
- Department of Biostatistics & Bioinformatics, Computational Biology Institute, Milken Institute School of Public Health, George Washington University, Washington DC, USA
| | | | - Christopher L Metts
- Biomedical Informatics Center, Medical University of South Carolina, Charleston, South Carolina, USA.,Division of Pathology Informatics, Department of Pathology and Laboratory Medicine, Medical University of South Carolina, Charleston, South Carolina, USA.,Department of Pathology and Laboratory Medicine, Medical University of South Carolina, Charleston, South Carolina, USA
| | - James E Madory
- Division of Pathology Informatics, Department of Pathology and Laboratory Medicine, Medical University of South Carolina, Charleston, South Carolina, USA.,Department of Pathology and Laboratory Medicine, Medical University of South Carolina, Charleston, South Carolina, USA
| | - Steven L Carroll
- Department of Pathology and Laboratory Medicine, Medical University of South Carolina, Charleston, South Carolina, USA
| | - Jihad S Obeid
- Biomedical Informatics Center, Medical University of South Carolina, Charleston, South Carolina, USA.,Department of Public Health Sciences, Medical University of South Carolina, Charleston, South Carolina, USA
| | - Leslie A Lenert
- Biomedical Informatics Center, Medical University of South Carolina, Charleston, South Carolina, USA.,Department of Public Health Sciences, Medical University of South Carolina, Charleston, South Carolina, USA.,Department of Medicine, Medical University of South Carolina, Charleston, South Carolina, USA
| |
Collapse
|
6
|
Zhang Y, Leung K, Perera RAPM, Lee CK, Peiris JSM, Wu JT. Harnessing the potential of blood donation archives for influenza surveillance and control. PLoS One 2020; 15:e0233605. [PMID: 32470010 PMCID: PMC7259782 DOI: 10.1371/journal.pone.0233605] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2019] [Accepted: 05/10/2020] [Indexed: 02/04/2023] Open
Abstract
Many blood donation services around the globe maintain large archives of serum and/or plasma specimens of blood donations which could potentially be used for serologic surveillance and risk assessment of influenza. Harnessing this potential requires robust evidence that the outcomes of influenza serology in plasma, which is rarely used, is consistent with that in serum, which is the conventional choice of specimens for influenza serology. We harvested EDTA-plasma specimens from the blood donation archives of Hong Kong Red Cross Transfusion Services, where EDTA is the type of anticoagulant used for plasma collection, compared their antibody titers and responses to that in serum. Influenza A/H1N1/California/7/2009 and A/H3N2/Victoria/208/2009 were the test strains. Our results showed that antibody titers in 609 matched serum/EDTA-plasma specimens (i.e. obtained from the same donor at the same time) had good agreement inferred by Intraclass Correlation Coefficient, the value of which was 0.82 (95% CI: 0.77-0.86) for hemagglutination inhibition assay and 0.95 (95% CI: 0.93-0.96) for microneutralization assay; seroconversion rates (based on hemagglutination inhibition titers) during the 2010 and 2011 influenza seasons in Hong Kong inferred from paired EDTA-plasma were similar to that inferred from paired sera. Our study provided the proof-of-concept that blood donation archives could be leveraged as a valuable source of longitudinal blood specimens for the surveillance, control and risk assessment of both pandemic and seasonal influenza.
Collapse
Affiliation(s)
- Yanyu Zhang
- School of Public Health, WHO Collaborating Center for Infectious Disease Epidemiology and Control, The University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Kathy Leung
- School of Public Health, WHO Collaborating Center for Infectious Disease Epidemiology and Control, The University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Ranawaka A. P. M. Perera
- Center of Influenza Research and School of Public Health, The University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Cheuk-Kwong Lee
- Hong Kong Red Cross Blood Transfusion Service, Hospital Authority, Hong Kong Special Administrative Region, China
| | - J. S. Malik Peiris
- Center of Influenza Research and School of Public Health, The University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Joseph T. Wu
- School of Public Health, WHO Collaborating Center for Infectious Disease Epidemiology and Control, The University of Hong Kong, Hong Kong Special Administrative Region, China
| |
Collapse
|
7
|
Wells QS, Gupta DK, Smith JG, Collins SP, Storrow AB, Ferguson J, Smith ML, Pulley JM, Collier S, Wang X, Roden DM, Gerszten RE, Wang TJ. Accelerating Biomarker Discovery Through Electronic Health Records, Automated Biobanking, and Proteomics. J Am Coll Cardiol 2020; 73:2195-2205. [PMID: 31047008 DOI: 10.1016/j.jacc.2019.01.074] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/28/2018] [Revised: 01/22/2019] [Accepted: 01/23/2019] [Indexed: 01/13/2023]
Abstract
BACKGROUND Circulating biomarkers can facilitate diagnosis and risk stratification for complex conditions such as heart failure (HF). Newer molecular platforms can accelerate biomarker discovery, but they require significant resources for data and sample acquisition. OBJECTIVES The purpose of this study was to test a pragmatic biomarker discovery strategy integrating automated clinical biobanking with proteomics. METHODS Using the electronic health record, the authors identified patients with and without HF, retrieved their discarded plasma samples, and screened these specimens using a DNA aptamer-based proteomic platform (1,129 proteins). Candidate biomarkers were validated in 3 different prospective cohorts. RESULTS In an automated manner, plasma samples from 1,315 patients (31% with HF) were collected. Proteomic analysis of a 96-patient subset identified 9 candidate biomarkers (p < 4.42 × 10-5). Two proteins, angiopoietin-2 and thrombospondin-2, were associated with HF in 3 separate validation cohorts. In an emergency department-based registry of 852 dyspneic patients, the 2 biomarkers improved discrimination of acute HF compared with a clinical score (p < 0.0001) or clinical score plus B-type natriuretic peptide (p = 0.02). In a community-based cohort (n = 768), both biomarkers predicted incident HF independent of traditional risk factors and N-terminal pro-B-type natriuretic peptide (hazard ratio per SD increment: 1.35 [95% confidence interval: 1.14 to 1.61; p = 0.0007] for angiopoietin-2, and 1.37 [95% confidence interval: 1.06 to 1.79; p = 0.02] for thrombospondin-2). Among 30 advanced HF patients, concentrations of both biomarkers declined (80% to 84%) following cardiac transplant (p < 0.001 for both). CONCLUSIONS A novel strategy integrating electronic health records, discarded clinical specimens, and proteomics identified 2 biomarkers that robustly predict HF across diverse clinical settings. This approach could accelerate biomarker discovery for many diseases.
Collapse
Affiliation(s)
- Quinn S Wells
- Vanderbilt Translational and Clinical Cardiovascular Research Center, Vanderbilt University Medical Center, Nashville, Tennessee; Division of Cardiovascular Medicine, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Deepak K Gupta
- Vanderbilt Translational and Clinical Cardiovascular Research Center, Vanderbilt University Medical Center, Nashville, Tennessee; Division of Cardiovascular Medicine, Vanderbilt University Medical Center, Nashville, Tennessee.
| | - J Gustav Smith
- Department of Cardiology, Clinical Sciences, Lund University and Skane University Hospital, Lund, Sweden
| | - Sean P Collins
- Department of Emergency Medicine, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Alan B Storrow
- Department of Emergency Medicine, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Jane Ferguson
- Vanderbilt Translational and Clinical Cardiovascular Research Center, Vanderbilt University Medical Center, Nashville, Tennessee; Division of Cardiovascular Medicine, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Maya Landenhed Smith
- Department of Cardiothoracic Surgery, Clinical Sciences, Lund University and Skane University Hospital, Lund, Sweden
| | - Jill M Pulley
- Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Sarah Collier
- Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Xiaoming Wang
- Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Dan M Roden
- Division of Cardiovascular Medicine, Vanderbilt University Medical Center, Nashville, Tennessee; Departments of Medicine, Pharmacology, and Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Robert E Gerszten
- Division of Cardiovascular Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts
| | - Thomas J Wang
- Vanderbilt Translational and Clinical Cardiovascular Research Center, Vanderbilt University Medical Center, Nashville, Tennessee; Division of Cardiovascular Medicine, Vanderbilt University Medical Center, Nashville, Tennessee
| |
Collapse
|
8
|
Brittain EL, Chan SY. Integration of complex data sources to provide biologic insight into pulmonary vascular disease (2015 Grover Conference Series). Pulm Circ 2016; 6:251-60. [PMID: 27683602 DOI: 10.1086/686995] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
The application of complex data sources to pulmonary vascular diseases is an emerging and promising area of investigation. The use of -omics platforms, in silico modeling of gene networks, and linkage of large human cohorts with DNA biobanks are beginning to bear biologic insight into pulmonary hypertension. These approaches to high-throughput molecular phenotyping offer the possibility of discovering new therapeutic targets and identifying variability in response to therapy that can be leveraged to improve clinical care. Optimizing the methods for analyzing complex data sources and accruing large, well-phenotyped human cohorts linked to biologic data remain significant challenges. Here, we discuss two specific types of complex data sources-gene regulatory networks and DNA-linked electronic medical record cohorts-that illustrate the promise, challenges, and current limitations of these approaches to understanding and managing pulmonary vascular disease.
Collapse
Affiliation(s)
- Evan L Brittain
- Division of Cardiovascular Medicine and Vanderbilt Translational and Clinical Cardiovascular Center, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Stephen Y Chan
- Division of Cardiology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania, USA; and Center for Pulmonary Vascular Biology and Medicine, Pittsburgh Heart, Lung, and Blood Vascular Medicine Institute, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA
| |
Collapse
|
9
|
May JC, Gant-Branum RL, McLean JA. Targeting the untargeted in molecular phenomics with structurally-selective ion mobility-mass spectrometry. Curr Opin Biotechnol 2016; 39:192-197. [PMID: 27132126 DOI: 10.1016/j.copbio.2016.04.013] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2016] [Revised: 04/06/2016] [Accepted: 04/13/2016] [Indexed: 12/25/2022]
Abstract
Systems-wide molecular phenomics is rapidly expanding through technological advances in instrumentation and bioinformatics. Strategies such as structural mass spectrometry, which utilizes size and shape measurements with molecular weight, serve to characterize the sum of molecular expression in biological contexts, where broad-scale measurements are made that are interpreted through big data statistical techniques to reveal underlying patterns corresponding to phenotype. The data density, data dimensionality, data projection, and data interrogation are all critical aspects of these approaches to turn data into salient information. Untargeted molecular phenomics is already having a dramatic impact in discovery science from drug discovery to synthetic biology. It is evident that these emerging techniques will integrate closely in broad efforts aimed at precision medicine.
Collapse
Affiliation(s)
- Jody Christopher May
- Department of Chemistry, Center for Innovative Technology, Vanderbilt Institute of Chemical Biology, Vanderbilt Institute for Integrative Biosystems Research and Education, Vanderbilt University, Nashville, TN 37235, USA
| | - Randi Lee Gant-Branum
- Department of Chemistry, Center for Innovative Technology, Vanderbilt Institute of Chemical Biology, Vanderbilt Institute for Integrative Biosystems Research and Education, Vanderbilt University, Nashville, TN 37235, USA
| | - John Allen McLean
- Department of Chemistry, Center for Innovative Technology, Vanderbilt Institute of Chemical Biology, Vanderbilt Institute for Integrative Biosystems Research and Education, Vanderbilt University, Nashville, TN 37235, USA.
| |
Collapse
|
10
|
Text Mining for Precision Medicine: Bringing Structure to EHRs and Biomedical Literature to Understand Genes and Health. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 939:139-166. [PMID: 27807747 DOI: 10.1007/978-981-10-1503-8_7] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
The key question of precision medicine is whether it is possible to find clinically actionable granularity in diagnosing disease and classifying patient risk. The advent of next-generation sequencing and the widespread adoption of electronic health records (EHRs) have provided clinicians and researchers a wealth of data and made possible the precise characterization of individual patient genotypes and phenotypes. Unstructured text-found in biomedical publications and clinical notes-is an important component of genotype and phenotype knowledge. Publications in the biomedical literature provide essential information for interpreting genetic data. Likewise, clinical notes contain the richest source of phenotype information in EHRs. Text mining can render these texts computationally accessible and support information extraction and hypothesis generation. This chapter reviews the mechanics of text mining in precision medicine and discusses several specific use cases, including database curation for personalized cancer medicine, patient outcome prediction from EHR-derived cohorts, and pharmacogenomic research. Taken as a whole, these use cases demonstrate how text mining enables effective utilization of existing knowledge sources and thus promotes increased value for patients and healthcare systems. Text mining is an indispensable tool for translating genotype-phenotype data into effective clinical care that will undoubtedly play an important role in the eventual realization of precision medicine.
Collapse
|