1
|
Gao X, Li J, Liu X, Peng Q, Jing H, Hadi S, Teschendorff AE, Wang S, Liu F. FastQTLmapping: an ultra-fast and memory efficient package for mQTL-like analysis. BMC Bioinformatics 2025; 26:112. [PMID: 40289089 PMCID: PMC12036243 DOI: 10.1186/s12859-025-06130-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2024] [Accepted: 04/02/2025] [Indexed: 04/30/2025] Open
Abstract
BACKGROUND FastQTLmapping addresses the need for an ultra-fast and memory-efficient solver capable of handling exhaustive multiple regression analysis with a vast number of dependent and explanatory variables, including covariates. This challenge is especially pronounced in methylation quantitative trait loci (mQTL)-like analysis, which typically involves high-dimensional genetic and epigenetic data. RESULTS FastQTLmapping is a precompiled C++ software solution accelerated by Intel MKL and GSL, freely available at https://github.com/Fun-Gene/fastQTLmapping . Compared to state-of-the-art methods (MatrixEQTL, FastQTL, and TensorQTL), fastQTLmapping demonstrated an order of magnitude speed improvement, coupled with a marked reduction in peak memory usage. In a large dataset consisting of 3500 individuals, 8 million SNPs, 0.8 million CpGs, and 20 covariates, fastQTLmapping completed the entire mQTL analysis in 4.5 h with only 13.1 GB peak memory usage. CONCLUSIONS FastQTLmapping effectively expedites comprehensive mQTL analyses by providing a robust and generic approach that accommodates large-scale genomic datasets with covariates. This solution has the potential to streamline mQTL-like studies and inform future method development for efficient computational genomics.
Collapse
Affiliation(s)
- Xingjian Gao
- National Clinical Research Center of Kidney Diseases, Jinling Hospital, Nanjing, Jiangsu, China
| | - Jiarui Li
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Xinxuan Liu
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, China National Center for Bioinformation, Chinese Academy of Sciences, Beijing, China
| | - Qianqian Peng
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Han Jing
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Sibte Hadi
- Department of Forensic Sciences, College of Criminal Justice, Naif Arab University of Security Sciences, 11452, Riyadh, Kingdom of Saudi Arabia
| | - Andrew E Teschendorff
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, China National Center for Bioinformation, Chinese Academy of Sciences, Beijing, China
| | - Sijia Wang
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, China National Center for Bioinformation, Chinese Academy of Sciences, Beijing, China
- Taizhou Institute of Health Sciences, Fudan University, Taizhou, Jiangsu, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| | - Fan Liu
- Department of Forensic Sciences, College of Criminal Justice, Naif Arab University of Security Sciences, 11452, Riyadh, Kingdom of Saudi Arabia.
| |
Collapse
|
2
|
Gao C, Wei H, Zhang K. LORSEN: Fast and Efficient eQTL Mapping With Low Rank Penalized Regression. Front Genet 2021; 12:690926. [PMID: 34868194 PMCID: PMC8636089 DOI: 10.3389/fgene.2021.690926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2021] [Accepted: 10/08/2021] [Indexed: 12/02/2022] Open
Abstract
Characterization of genetic variations that are associated with gene expression levels is essential to understand cellular mechanisms that underline human complex traits. Expression quantitative trait loci (eQTL) mapping attempts to identify genetic variants, such as single nucleotide polymorphisms (SNPs), that affect the expression of one or more genes. With the availability of a large volume of gene expression data, it is necessary and important to develop fast and efficient statistical and computational methods to perform eQTL mapping for such large scale data. In this paper, we proposed a new method, the low rank penalized regression method (LORSEN), for eQTL mapping. We evaluated and compared the performance of LORSEN with two existing methods for eQTL mapping using extensive simulations as well as real data from the HapMap3 project. Simulation studies showed that our method outperformed two commonly used methods for eQTL mapping, LORS and FastLORS, in many scenarios in terms of area under the curve (AUC). We illustrated the usefulness of our method by applying it to SNP variants data and gene expression levels on four chromosomes from the HapMap3 Project.
Collapse
Affiliation(s)
- Cheng Gao
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, United States
| | - Hairong Wei
- College of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI, United States
| | - Kui Zhang
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, United States
| |
Collapse
|
3
|
Thind AS, Monga I, Thakur PK, Kumari P, Dindhoria K, Krzak M, Ranson M, Ashford B. Demystifying emerging bulk RNA-Seq applications: the application and utility of bioinformatic methodology. Brief Bioinform 2021; 22:6330938. [PMID: 34329375 DOI: 10.1093/bib/bbab259] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2021] [Revised: 06/14/2021] [Accepted: 06/18/2021] [Indexed: 12/13/2022] Open
Abstract
Significant innovations in next-generation sequencing techniques and bioinformatics tools have impacted our appreciation and understanding of RNA. Practical RNA sequencing (RNA-Seq) applications have evolved in conjunction with sequence technology and bioinformatic tools advances. In most projects, bulk RNA-Seq data is used to measure gene expression patterns, isoform expression, alternative splicing and single-nucleotide polymorphisms. However, RNA-Seq holds far more hidden biological information including details of copy number alteration, microbial contamination, transposable elements, cell type (deconvolution) and the presence of neoantigens. Recent novel and advanced bioinformatic algorithms developed the capacity to retrieve this information from bulk RNA-Seq data, thus broadening its scope. The focus of this review is to comprehend the emerging bulk RNA-Seq-based analyses, emphasizing less familiar and underused applications. In doing so, we highlight the power of bulk RNA-Seq in providing biological insights.
Collapse
Affiliation(s)
- Amarinder Singh Thind
- University of Wollongong, Wollongong, Australia.,Illawarra Health and Medical Research Institute, Wollongong, Australia
| | - Isha Monga
- Columbia University, New York City, NY, USA
| | | | - Pallawi Kumari
- Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
| | - Kiran Dindhoria
- Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
| | | | - Marie Ranson
- University of Wollongong, Wollongong, Australia.,Illawarra Health and Medical Research Institute, Wollongong, Australia
| | - Bruce Ashford
- University of Wollongong, Wollongong, Australia.,Illawarra Health and Medical Research Institute, Wollongong, Australia
| |
Collapse
|
4
|
Fantauzzi MF, Aguiar JA, Tremblay BJM, Mansfield MJ, Yanagihara T, Chandiramohan A, Revill S, Ryu MH, Carlsten C, Ask K, Stämpfli M, Doxey AC, Hirota JA. Expression of endocannabinoid system components in human airway epithelial cells: impact of sex and chronic respiratory disease status. ERJ Open Res 2020; 6:00128-2020. [PMID: 33344628 PMCID: PMC7737429 DOI: 10.1183/23120541.00128-2020] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2020] [Accepted: 09/18/2020] [Indexed: 12/12/2022] Open
Abstract
Cannabis smoking is the dominant route of delivery, with the airway epithelium functioning as the site of first contact. The endocannabinoid system is responsible for mediating the physiological effects of inhaled phytocannabinoids. The expression of the endocannabinoid system in the airway epithelium and contribution to normal physiological responses remains to be defined. To begin to address this knowledge gap, a curated dataset of 1090 unique human bronchial brushing gene expression profiles was created. The dataset included 616 healthy subjects, 136 subjects with asthma, and 338 subjects with COPD. A 32-gene endocannabinoid signature was analysed across all samples with sex and disease-specific analyses performed. Immunohistochemistry and immunoblots were performed to probe in situ and in vitro protein expression. CB1, CB2, and TRPV1 protein signal is detectable in human airway epithelial cells in situ and in vitro, justifying examining the downstream endocannabinoid pathway. Sex status was associated with differential expression of 7 of 32 genes. In contrast, disease status was associated with differential expression of 21 of 32 genes in people with asthma and 26 of 32 genes in people with COPD. We confirm at the protein level that TRPV1, the most differentially expressed candidate in our analyses, was upregulated in airway epithelial cells from people with asthma relative to healthy subjects. Our data demonstrate that the endocannabinoid system is expressed in human airway epithelial cells with expression impacted by disease status and minimally by sex. The data suggest that cannabis consumers may have differential physiological responses in the respiratory mucosa.
Collapse
Affiliation(s)
- Matthew F Fantauzzi
- Firestone Institute for Respiratory Health - Division of Respirology, Dept of Medicine, McMaster University, Hamilton, ON, Canada.,McMaster Immunology Research Centre, McMaster University, Hamilton, ON, Canada
| | | | | | - Michael J Mansfield
- Genomics and Regulatory Systems Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Japan
| | - Toyoshi Yanagihara
- Firestone Institute for Respiratory Health - Division of Respirology, Dept of Medicine, McMaster University, Hamilton, ON, Canada
| | - Abiram Chandiramohan
- Firestone Institute for Respiratory Health - Division of Respirology, Dept of Medicine, McMaster University, Hamilton, ON, Canada
| | - Spencer Revill
- Firestone Institute for Respiratory Health - Division of Respirology, Dept of Medicine, McMaster University, Hamilton, ON, Canada
| | - Min Hyung Ryu
- Division of Respiratory Medicine, Dept of Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Chris Carlsten
- Division of Respiratory Medicine, Dept of Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Kjetil Ask
- Firestone Institute for Respiratory Health - Division of Respirology, Dept of Medicine, McMaster University, Hamilton, ON, Canada.,McMaster Immunology Research Centre, McMaster University, Hamilton, ON, Canada
| | - Martin Stämpfli
- Firestone Institute for Respiratory Health - Division of Respirology, Dept of Medicine, McMaster University, Hamilton, ON, Canada.,McMaster Immunology Research Centre, McMaster University, Hamilton, ON, Canada
| | - Andrew C Doxey
- Firestone Institute for Respiratory Health - Division of Respirology, Dept of Medicine, McMaster University, Hamilton, ON, Canada.,Dept of Biology, University of Waterloo, Waterloo, ON, Canada
| | - Jeremy A Hirota
- Firestone Institute for Respiratory Health - Division of Respirology, Dept of Medicine, McMaster University, Hamilton, ON, Canada.,McMaster Immunology Research Centre, McMaster University, Hamilton, ON, Canada.,Dept of Biology, University of Waterloo, Waterloo, ON, Canada.,Division of Respiratory Medicine, Dept of Medicine, University of British Columbia, Vancouver, BC, Canada
| |
Collapse
|
5
|
Jeng XJ, Rhyne J, Zhang T, Tzeng JY. Effective SNP ranking improves the performance of eQTL mapping. Genet Epidemiol 2020; 44:611-619. [PMID: 32216117 DOI: 10.1002/gepi.22293] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2019] [Revised: 02/21/2020] [Accepted: 03/11/2020] [Indexed: 11/06/2022]
Abstract
Genome-wide expression quantitative trait loci (eQTLs) mapping explores the relationship between gene expression and DNA variants, such as single-nucleotide polymorphism (SNPs), to understand genetic basis of human diseases. Due to the large number of genes and SNPs that need to be assessed, current methods for eQTL mapping often suffer from low detection power, especially for identifying trans-eQTLs. In this paper, we propose the idea of performing SNP ranking based on the higher criticism statistic, a summary statistic developed in large-scale signal detection. We illustrate how the HC-based SNP ranking can effectively prioritize eQTL signals over noise, greatly reduce the burden of joint modeling, and improve the power for eQTL mapping. Numerical results in simulation studies demonstrate the superior performance of our method compared to existing methods. The proposed method is also evaluated in HapMap eQTL data analysis and the results are compared to a database of known eQTLs.
Collapse
Affiliation(s)
- X Jessie Jeng
- Department of Statistics, North Carolina State University, Raleigh, North Carolina
| | - Jacob Rhyne
- Department of Statistics, North Carolina State University, Raleigh, North Carolina
| | - Teng Zhang
- Department of Statistics, North Carolina State University, Raleigh, North Carolina
| | - Jung-Ying Tzeng
- Department of Statistics, North Carolina State University, Raleigh, North Carolina.,Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina.,Department of Statistics, National Cheng-Kung University, Tainan, Taiwan.,Division of Biostatistics, Institute of Epidemiology and Preventive Medicine, National Taiwan University, Taipei, Taiwan
| |
Collapse
|
6
|
A Multi-Omics Perspective of Quantitative Trait Loci in Precision Medicine. Trends Genet 2020; 36:318-336. [PMID: 32294413 DOI: 10.1016/j.tig.2020.01.009] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Revised: 01/05/2020] [Accepted: 01/21/2020] [Indexed: 02/07/2023]
Abstract
Quantitative trait loci (QTL) analysis is an important approach to investigate the effects of genetic variants identified through an increasing number of large-scale, multidimensional 'omics data sets. In this 'big data' era, the research community has identified a significant number of molecular QTLs (molQTLs) and increased our understanding of their effects. Herein, we review multiple categories of molQTLs, including those associated with transcriptome, post-transcriptional regulation, epigenetics, proteomics, metabolomics, and the microbiome. We summarize approaches to identify molQTLs and to infer their causal effects. We further discuss the integrative analysis of molQTLs through a multi-omics perspective. Our review highlights future opportunities to better understand the functional significance of genetic variants and to utilize the discovery of molQTLs in precision medicine.
Collapse
|
7
|
Lukowski SW, Lloyd-Jones LR, Holloway A, Kirsten H, Hemani G, Yang J, Small K, Zhao J, Metspalu A, Dermitzakis ET, Gibson G, Spector TD, Thiery J, Scholz M, Montgomery GW, Esko T, Visscher PM, Powell JE. Genetic correlations reveal the shared genetic architecture of transcription in human peripheral blood. Nat Commun 2017; 8:483. [PMID: 28883458 PMCID: PMC5589780 DOI: 10.1038/s41467-017-00473-z] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2016] [Accepted: 06/30/2017] [Indexed: 01/29/2023] Open
Abstract
Transcript co-expression is regulated by a combination of shared genetic and environmental factors. Here, we estimate the proportion of co-expression that is due to shared genetic variance. To do so, we estimated the genetic correlations between each pairwise combination of 2469 transcripts that are highly heritable and expressed in whole blood in 1748 unrelated individuals of European ancestry. We identify 556 pairs with a significant genetic correlation of which 77% are located on different chromosomes, and report 934 expression quantitative trait loci, identified in an independent cohort, with significant effects on both transcripts in a genetically correlated pair. We show significant enrichment for transcription factor control and physical proximity through chromatin interactions as possible mechanisms of shared genetic control. Finally, we construct networks of interconnected transcripts and identify their underlying biological functions. Using genetic correlations to investigate transcriptional co-regulation provides valuable insight into the nature of the underlying genetic architecture of gene regulation. Covariance of gene expression pairs is due to a combination of shared genetic and environmental factors. Here the authors estimate the genetic correlation between highly heritable pairs and identify transcription factor control and chromatin interactions as possible mechanisms of correlation.
Collapse
Affiliation(s)
- Samuel W Lukowski
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, 4072, Australia
| | - Luke R Lloyd-Jones
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, 4072, Australia.,Centre for Neurogenetics and Statistical Genomics, Queensland Brain Institute, University of Queensland, Brisbane, QLD, 4072, Australia
| | - Alexander Holloway
- Centre for Neurogenetics and Statistical Genomics, Queensland Brain Institute, University of Queensland, Brisbane, QLD, 4072, Australia
| | - Holger Kirsten
- Institute for Medical Informatics, Statistics and Epidemiology, University of Leipzig, Leipzig, 04107, Germany.,LIFE Leipzig Research Center for Civilization Diseases, University of Leipzig, Leipzig, 04103, Germany
| | - Gibran Hemani
- MRC Integrative Epidemiology Unit (IEU) at the University of Bristol, Bristol, BS8 2BN, UK.,School of Social and Community Medicine, University of Bristol, Bristol, BS8 2BN, UK
| | - Jian Yang
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, 4072, Australia.,Centre for Neurogenetics and Statistical Genomics, Queensland Brain Institute, University of Queensland, Brisbane, QLD, 4072, Australia
| | - Kerrin Small
- Department of Twin Research and Genetic Epidemiology, King's College London, London, SE1 7EH, UK
| | - Jing Zhao
- School of Biology and Center for Integrative Genomics, Georgia Institute of Technology, Atlanta, GA, 30332, USA
| | - Andres Metspalu
- Estonian Genome Center, University of Tartu, Tartu, 51010, Estonia
| | - Emmanouil T Dermitzakis
- Department of Genetic Medicine and Development, University of Geneva, Geneva, CH-1211, Switzerland
| | - Greg Gibson
- School of Biology and Center for Integrative Genomics, Georgia Institute of Technology, Atlanta, GA, 30332, USA
| | - Timothy D Spector
- Department of Twin Research and Genetic Epidemiology, King's College London, London, SE1 7EH, UK
| | - Joachim Thiery
- LIFE Leipzig Research Center for Civilization Diseases, University of Leipzig, Leipzig, 04103, Germany.,Institute of Laboratory Medicine, Clinical Chemistry and Molecular Diagnostics, University of Leipzig, Leipzig, 04103, Germany
| | - Markus Scholz
- Institute for Medical Informatics, Statistics and Epidemiology, University of Leipzig, Leipzig, 04107, Germany.,LIFE Leipzig Research Center for Civilization Diseases, University of Leipzig, Leipzig, 04103, Germany
| | - Grant W Montgomery
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, 4072, Australia.,QIMR Berghofer Medical Research Institute, 300 Herston Road, Brisbane, QLD, 4006, Australia
| | - Tonu Esko
- Estonian Genome Center, University of Tartu, Tartu, 51010, Estonia
| | - Peter M Visscher
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, 4072, Australia.,Centre for Neurogenetics and Statistical Genomics, Queensland Brain Institute, University of Queensland, Brisbane, QLD, 4072, Australia
| | - Joseph E Powell
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, 4072, Australia. .,Centre for Neurogenetics and Statistical Genomics, Queensland Brain Institute, University of Queensland, Brisbane, QLD, 4072, Australia.
| |
Collapse
|
8
|
Yuan L, Zhu L, Guo WL, Zhou X, Zhang Y, Huang Z, Huang DS. Nonconvex Penalty Based Low-Rank Representation and Sparse Regression for eQTL Mapping. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:1154-1164. [PMID: 28114074 DOI: 10.1109/tcbb.2016.2609420] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
This paper addresses the problem of accounting for confounding factors and expression quantitative trait loci (eQTL) mapping in the study of SNP-gene associations. The existing convex penalty based algorithm has limited capacity to keep main information of matrix in the process of reducing matrix rank. We present an algorithm, which use nonconvex penalty based low-rank representation to account for confounding factors and make use of sparse regression for eQTL mapping (NCLRS). The efficiency of the presented algorithm is evaluated by comparing the results of 18 synthetic datasets given by NCLRS and presented algorithm, respectively. The experimental results or biological dataset show that our approach is an effective tool to account for non-genetic effects than currently existing methods.
Collapse
|
9
|
Salerno S, Mehrmohamadi M, Liberti MV, Wan M, Wells MT, Booth JG, Locasale JW. RRmix: A method for simultaneous batch effect correction and analysis of metabolomics data in the absence of internal standards. PLoS One 2017; 12:e0179530. [PMID: 28662051 PMCID: PMC5491020 DOI: 10.1371/journal.pone.0179530] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2016] [Accepted: 05/31/2017] [Indexed: 01/22/2023] Open
Abstract
With the surge of interest in metabolism and the appreciation of its diverse roles in numerous biomedical contexts, the number of metabolomics studies using liquid chromatography coupled to mass spectrometry (LC-MS) approaches has increased dramatically in recent years. However, variation that occurs independently of biological signal and noise (i.e. batch effects) in metabolomics data can be substantial. Standard protocols for data normalization that allow for cross-study comparisons are lacking. Here, we investigate a number of algorithms for batch effect correction and differential abundance analysis, and compare their performance. We show that linear mixed effects models, which account for latent (i.e. not directly measurable) factors, produce satisfactory results in the presence of batch effects without the need for internal controls or prior knowledge about the nature and sources of unwanted variation in metabolomics data. We further introduce an algorithm-RRmix-within the family of latent factor models and illustrate its suitability for differential abundance analysis in the presence of strong batch effects. Together this analysis provides a framework for systematically standardizing metabolomics data.
Collapse
Affiliation(s)
- Stephen Salerno
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Mahya Mehrmohamadi
- Department of Molecular Biology and Genetics, Field of Genomics, Genetics and Development, Cornell University, Ithaca, New York, United States of America
- Duke Cancer Institute, Duke Molecular Physiology Institute and Department of Pharmacology and Cancer Biology, Duke University School of Medicine, Durham, North Carolina, United States of America
| | - Maria V. Liberti
- Duke Cancer Institute, Duke Molecular Physiology Institute and Department of Pharmacology and Cancer Biology, Duke University School of Medicine, Durham, North Carolina, United States of America
- Department of Molecular Biology and Genetics, Field of Biochemistry, Molecular, and Cell Biology, Cornell University, Ithaca, New York, United States of America
| | - Muting Wan
- Department of Statistical Science, Cornell University, Ithaca, New York, United States of America
| | - Martin T. Wells
- Department of Statistical Science, Cornell University, Ithaca, New York, United States of America
| | - James G. Booth
- Department of Statistical Science, Cornell University, Ithaca, New York, United States of America
| | - Jason W. Locasale
- Department of Molecular Biology and Genetics, Field of Genomics, Genetics and Development, Cornell University, Ithaca, New York, United States of America
- Duke Cancer Institute, Duke Molecular Physiology Institute and Department of Pharmacology and Cancer Biology, Duke University School of Medicine, Durham, North Carolina, United States of America
- Department of Molecular Biology and Genetics, Field of Biochemistry, Molecular, and Cell Biology, Cornell University, Ithaca, New York, United States of America
| |
Collapse
|
10
|
Ju JH, Shenoy SA, Crystal RG, Mezey JG. An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci. PLoS Comput Biol 2017; 13:e1005537. [PMID: 28505156 PMCID: PMC5448815 DOI: 10.1371/journal.pcbi.1005537] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Revised: 05/30/2017] [Accepted: 04/28/2017] [Indexed: 11/19/2022] Open
Abstract
Genome-wide expression Quantitative Trait Loci (eQTL) studies in humans have provided numerous insights into the genetics of both gene expression and complex diseases. While the majority of eQTL identified in genome-wide analyses impact a single gene, eQTL that impact many genes are particularly valuable for network modeling and disease analysis. To enable the identification of such broad impact eQTL, we introduce CONFETI: Confounding Factor Estimation Through Independent component analysis. CONFETI is designed to address two conflicting issues when searching for broad impact eQTL: the need to account for non-genetic confounding factors that can lower the power of the analysis or produce broad impact eQTL false positives, and the tendency of methods that account for confounding factors to model broad impact eQTL as non-genetic variation. The key advance of the CONFETI framework is the use of Independent Component Analysis (ICA) to identify variation likely caused by broad impact eQTL when constructing the sample covariance matrix used for the random effect in a mixed model. We show that CONFETI has better performance than other mixed model confounding factor methods when considering broad impact eQTL recovery from synthetic data. We also used the CONFETI framework and these same confounding factor methods to identify eQTL that replicate between matched twin pair datasets in the Multiple Tissue Human Expression Resource (MuTHER), the Depression Genes Networks study (DGN), the Netherlands Study of Depression and Anxiety (NESDA), and multiple tissue types in the Genotype-Tissue Expression (GTEx) consortium. These analyses identified both cis-eQTL and trans-eQTL impacting individual genes, and CONFETI had better or comparable performance to other mixed model confounding factor analysis methods when identifying such eQTL. In these analyses, we were able to identify and replicate a few broad impact eQTL although the overall number was small even when applying CONFETI. In light of these results, we discuss the broad impact eQTL that have been previously reported from the analysis of human data and suggest that considerable caution should be exercised when making biological inferences based on these reported eQTL.
Collapse
Affiliation(s)
- Jin Hyun Ju
- Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, United States of America
- Institute for Computational Biomedicine, Weill Cornell Medical College, New York, NY, United States of America
| | - Sushila A. Shenoy
- Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, United States of America
| | - Ronald G. Crystal
- Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, United States of America
| | - Jason G. Mezey
- Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, United States of America
- Institute for Computational Biomedicine, Weill Cornell Medical College, New York, NY, United States of America
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY, United States of America
- * E-mail:
| |
Collapse
|
11
|
Abstract
The aim of expression Quantitative Trait Locus (eQTL) mapping is the identification of DNA sequence variants that explain variation in gene expression. Given the recent yield of trait-associated genetic variants identified by large-scale genome-wide association analyses (GWAS), eQTL mapping has become a useful tool to understand the functional context where these variants operate and eventually narrow down functional gene targets for disease. Despite its extensive application to complex (polygenic) traits and disease, the majority of eQTL studies still rely on univariate data modeling strategies, i.e., testing for association of all transcript-marker pairs. However these "one at-a-time" strategies are (1) unable to control the number of false-positives when an intricate Linkage Disequilibrium structure is present and (2) are often underpowered to detect the full spectrum of trans-acting regulatory effects. Here we present our viewpoint on the most recent advances on eQTL mapping approaches, with a focus on Bayesian methodology. We review the advantages of the Bayesian approach over frequentist methods and provide an empirical example of polygenic eQTL mapping to illustrate the different properties of frequentist and Bayesian methods. Finally, we discuss how multivariate eQTL mapping approaches have distinctive features with respect to detection of polygenic effects, accuracy, and interpretability of the results.
Collapse
Affiliation(s)
- Martha Imprialou
- Centre for Complement and Inflammation Research, Imperial College London, Hammersmith Hospital, Du Cane Road, London, W12 0NN, UK
| | - Enrico Petretto
- Duke-NUS Medical School, 8 College Road, Singapore, 169857, Singapore.
| | - Leonardo Bottolo
- Department of Medical Genetics, University of Cambridge, Box 238, Lv 6 Addenbrooke's Treatment Centre, Addenbrooke's Hospital, Cambridge Biomedical Campus, Cambridge, CB2 0QQ, UK.
- Department of Mathematics, Imperial College London, 180 Queen's Gate, London, SW7 2AZ, UK.
| |
Collapse
|
12
|
Joshi AD, Andersson C, Buch S, Stender S, Noordam R, Weng LC, Weeke PE, Auer PL, Boehm B, Chen C, Choi H, Curhan G, Denny JC, De Vivo I, Eicher JD, Ellinghaus D, Folsom AR, Fuchs C, Gala M, Haessler J, Hofman A, Hu F, Hunter DJ, Janssen HL, Kang JH, Kooperberg C, Kraft P, Kratzer W, Lieb W, Lutsey PL, Murad SD, Nordestgaard BG, Pasquale LR, Reiner AP, Ridker PM, Rimm E, Rose LM, Shaffer CM, Schafmayer C, Tamimi RM, Uitterlinden AG, Völker U, Völzke H, Wakabayashi Y, Wiggs JL, Zhu J, Roden DM, Stricker BH, Tang W, Teumer A, Hampe J, Tybjærg-Hansen A, Chasman DI, Chan AT, Johnson AD. Four Susceptibility Loci for Gallstone Disease Identified in a Meta-analysis of Genome-Wide Association Studies. Gastroenterology 2016; 151:351-363.e28. [PMID: 27094239 PMCID: PMC4959966 DOI: 10.1053/j.gastro.2016.04.007] [Citation(s) in RCA: 69] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/07/2015] [Revised: 04/06/2016] [Accepted: 04/07/2016] [Indexed: 01/01/2023]
Abstract
BACKGROUND & AIMS A genome-wide association study (GWAS) of 280 cases identified the hepatic cholesterol transporter ABCG8 as a locus associated with risk for gallstone disease, but findings have not been reported from any other GWAS of this phenotype. We performed a large-scale, meta-analysis of GWASs of individuals of European ancestry with available prior genotype data, to identify additional genetic risk factors for gallstone disease. METHODS We obtained per-allele odds ratio (OR) and standard error estimates using age- and sex-adjusted logistic regression models within each of the 10 discovery studies (8720 cases and 55,152 controls). We performed an inverse variance weighted, fixed-effects meta-analysis of study-specific estimates to identify single-nucleotide polymorphisms that were associated independently with gallstone disease. Associations were replicated in 6489 cases and 62,797 controls. RESULTS We observed independent associations for 2 single-nucleotide polymorphisms at the ABCG8 locus: rs11887534 (OR, 1.69; 95% confidence interval [CI], 1.54-1.86; P = 2.44 × 10(-60)) and rs4245791 (OR, 1.27; P = 1.90 × 10(-34)). We also identified and/or replicated associations for rs9843304 in TM4SF4 (OR, 1.12; 95% CI, 1.08-1.16; P = 6.09 × 10(-11)), rs2547231 in SULT2A1 (encodes a sulfoconjugation enzyme that acts on hydroxysteroids and cholesterol-derived sterol bile acids) (OR, 1.17; 95% CI, 1.12-1.21; P = 2.24 × 10(-10)), rs1260326 in glucokinase regulatory protein (OR, 1.12; 95% CI, 1.07-1.17; P = 2.55 × 10(-10)), and rs6471717 near CYP7A1 (encodes an enzyme that catalyzes conversion of cholesterol to primary bile acids) (OR, 1.11; 95% CI, 1.08-1.15; P = 8.84 × 10(-9)). Among individuals of African American and Hispanic American ancestry, rs11887534 and rs4245791 were associated positively with gallstone disease risk, whereas the association for the rs1260326 variant was inverse. CONCLUSIONS In this large-scale GWAS of gallstone disease, we identified 4 loci in genes that have putative functions in cholesterol metabolism and transport, and sulfonylation of bile acids or hydroxysteroids.
Collapse
Affiliation(s)
- Amit D. Joshi
- Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, MA,Division of Gastroenterology, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, MA,Clinical and Translational Epidemiology Unit, Massachusetts General Hospital Boston, MA,To whom correspondence should be addressed: Amit D. Joshi, MBBS, PhD, Clinical and Translational Epidemiology Unit, Division of Gastroenterology, Massachusetts General Hospital, 55 Fruit Street, Boston, Massachusetts 02114, USA. Tel: +1 617 724 7558; Charlotte Andersson, MD, PhD, The Framingham Heart Study, 73 Mt Wayte Avenue, Framingham, Massachusetts 01702, USA. , Andrew T. Chan, MD, MPH, Massachusetts General Hospital and Harvard Medical School, Clinical and Translational Epidemiology Unit, Division of Gastroenterology, GRJ-825C, Boston, Massachusetts 02114, USA. Tel:+1 617 724 0283; Fax: +1 617 726 3673; , Andrew D. Johnson, PhD, Division of Intramural Research, National Heart, Lung and Blood Institute, Cardiovascular Epidemiology and Human Genomics Branch, The Framingham Heart Study, 73 Mt. Wayte Ave., Suite #2, Framingham, MA, 01702, USA. Tel: +1 508 663 4082; Fax: +1 508 626 1262;
| | - Charlotte Andersson
- The National Heart, Lung, and Blood Institute's Framingham Heart Study, Framingham, Massachusetts.
| | - Stephan Buch
- Medical Department 1, University Hospital Dresden, TU Dresden, Dresden Germany
| | - Stefan Stender
- Department of Clinical Biochemistry, Rigshospitalet, Copenhagen, Denmark
| | - Raymond Noordam
- Department of Internal Medicine, Erasmus Medical Center, Rotterdam, the Netherlands,Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands
| | - Lu-Chen Weng
- Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, MN
| | - Peter E. Weeke
- Department of Medicine, Vanderbilt University, Nashville, TN,Department of Cardiology, The Heart Centre, Rigshospitalet, Copenhagen University Hospital, Denmark
| | - Paul L. Auer
- Joseph J. Zilber School of Public Health, University of Wisconsin, Milwaukee,Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - Bernhard Boehm
- Department of Internal Medicine I, Ulm University Hospital, Ulm, Germany
| | - Constance Chen
- Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, MA
| | - Hyon Choi
- Division of Rheumatology, Allergy, and Immunology, Massachusetts General Hospital, Boston, MA
| | - Gary Curhan
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA,Renal Division, Department of Medicine, Brigham and Women’s Hospital, Boston, MA
| | - Joshua C. Denny
- Department of Medicine, Vanderbilt University, Nashville, TN,Department of Biomedical Informatics, Vanderbilt University, Nashville, TN
| | - Immaculata De Vivo
- Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, MA,Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA,Department of Epidemiology, Harvard School of Public Health, Boston, MA
| | - John D. Eicher
- The National Heart, Lung, and Blood Institute’s Framingham Heart Study, Framingham, MA,Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Framingham, MA
| | - David Ellinghaus
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel, Germany
| | - Aaron R. Folsom
- Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, MN
| | - Charles Fuchs
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA,Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA
| | - Manish Gala
- Division of Gastroenterology, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, MA
| | - Jeffrey Haessler
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - Albert Hofman
- Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands
| | - Frank Hu
- Department of Epidemiology, Harvard School of Public Health, Boston, MA,Department of Nutrition, Harvard School of Public Health, Boston, MA
| | - David J. Hunter
- Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, MA,Department of Epidemiology, Harvard School of Public Health, Boston, MA
| | - Harry L.A. Janssen
- Department of Gastroenterology and Hepatology, Erasmus MC, Rotterdam, the Netherlands,Toronto Centre for Liver Disease, Toronto Western and General Hospital, University Health Network, Toronto, Canada
| | - Jae H. Kang
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA
| | - Charles Kooperberg
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - Peter Kraft
- Program in Genetic Epidemiology and Statistical Genetics, Harvard School of Public Health, Boston, MA,Department of Epidemiology, Harvard School of Public Health, Boston, MA
| | - Wolfgang Kratzer
- Department of Internal Medicine I, Ulm University Hospital, Ulm, Germany
| | - Wolfgang Lieb
- Institute of Epidemiology, Christian Albrechts Universität Kiel, Niemannsweg 11, Kiel, Germany
| | - Pamela L. Lutsey
- Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, MN
| | - Sarwa Darwish Murad
- Department of Gastroenterology and Hepatology, Erasmus MC, Rotterdam, the Netherlands
| | - Børge G. Nordestgaard
- The Copenhagen General Population Study and,Department of Clinical Biochemistry, Herlev Hospital, Herlev Denmark,Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Louis R. Pasquale
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA,Department of Ophthalmology, Harvard Medical School, Massachusetts Eye and Ear Infirmary, Boston, MA
| | - Alex P. Reiner
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - Paul M Ridker
- Division of Preventive Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA
| | - Eric Rimm
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA,Department of Epidemiology, Harvard School of Public Health, Boston, MA,Department of Nutrition, Harvard School of Public Health, Boston, MA
| | - Lynda M. Rose
- Division of Preventive Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA
| | | | - Clemens Schafmayer
- Department of General, Abdominal, Thoracic and Transplantation Surgery, University of Kiel, Kiel, Germany
| | - Rulla M. Tamimi
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA,Department of Epidemiology, Harvard School of Public Health, Boston, MA
| | - André G Uitterlinden
- Department of Internal Medicine, Erasmus Medical Center, Rotterdam, the Netherlands,Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands
| | - Uwe Völker
- Department of Functional Genomics, Interfaculty Institute for Genetics and Functional Genomics, University Medicine Greifswald, Germany
| | - Henry Völzke
- Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany,German Center for Cardiovascular Research, Partner Site Greifswald,German Center for Diabetes Research, Site Greifswald
| | - Yoshiyuki Wakabayashi
- The National Heart, Lung, and Blood Institute, DNA Sequencing Core Laboratory, Bethesda, MD
| | - Janey L. Wiggs
- Department of Ophthalmology, Harvard Medical School, Massachusetts Eye and Ear Infirmary, Boston, MA
| | - Jun Zhu
- The National Heart, Lung, and Blood Institute, DNA Sequencing Core Laboratory, Bethesda, MD
| | - Dan M. Roden
- Department of Medicine, Vanderbilt University, Nashville, TN
| | - Bruno H. Stricker
- Department of Internal Medicine, Erasmus Medical Center, Rotterdam, the Netherlands,Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands
| | - Weihong Tang
- Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, MN
| | - Alexander Teumer
- Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
| | - Jochen Hampe
- Medical Department 1, University Hospital Dresden, TU Dresden, Dresden Germany
| | - Anne Tybjærg-Hansen
- Department of Clinical Biochemistry, Rigshospitalet, Copenhagen, Denmark,Department of Clinical Biochemistry, Herlev Hospital, Herlev Denmark
| | - Daniel I. Chasman
- Division of Preventive Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA
| | - Andrew T. Chan
- Division of Gastroenterology, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, MA,Clinical and Translational Epidemiology Unit, Massachusetts General Hospital Boston, MA,Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA,To whom correspondence should be addressed: Amit D. Joshi, MBBS, PhD, Clinical and Translational Epidemiology Unit, Division of Gastroenterology, Massachusetts General Hospital, 55 Fruit Street, Boston, Massachusetts 02114, USA. Tel: +1 617 724 7558; Charlotte Andersson, MD, PhD, The Framingham Heart Study, 73 Mt Wayte Avenue, Framingham, Massachusetts 01702, USA. , Andrew T. Chan, MD, MPH, Massachusetts General Hospital and Harvard Medical School, Clinical and Translational Epidemiology Unit, Division of Gastroenterology, GRJ-825C, Boston, Massachusetts 02114, USA. Tel:+1 617 724 0283; Fax: +1 617 726 3673; , Andrew D. Johnson, PhD, Division of Intramural Research, National Heart, Lung and Blood Institute, Cardiovascular Epidemiology and Human Genomics Branch, The Framingham Heart Study, 73 Mt. Wayte Ave., Suite #2, Framingham, MA, 01702, USA. Tel: +1 508 663 4082; Fax: +1 508 626 1262;
| | - Andrew D. Johnson
- The National Heart, Lung, and Blood Institute’s Framingham Heart Study, Framingham, MA,Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Framingham, MA,To whom correspondence should be addressed: Amit D. Joshi, MBBS, PhD, Clinical and Translational Epidemiology Unit, Division of Gastroenterology, Massachusetts General Hospital, 55 Fruit Street, Boston, Massachusetts 02114, USA. Tel: +1 617 724 7558; Charlotte Andersson, MD, PhD, The Framingham Heart Study, 73 Mt Wayte Avenue, Framingham, Massachusetts 01702, USA. , Andrew T. Chan, MD, MPH, Massachusetts General Hospital and Harvard Medical School, Clinical and Translational Epidemiology Unit, Division of Gastroenterology, GRJ-825C, Boston, Massachusetts 02114, USA. Tel:+1 617 724 0283; Fax: +1 617 726 3673; , Andrew D. Johnson, PhD, Division of Intramural Research, National Heart, Lung and Blood Institute, Cardiovascular Epidemiology and Human Genomics Branch, The Framingham Heart Study, 73 Mt. Wayte Ave., Suite #2, Framingham, MA, 01702, USA. Tel: +1 508 663 4082; Fax: +1 508 626 1262;
| |
Collapse
|
13
|
Luo W, Obeidat M, Di Narzo AF, Chen R, Sin DD, Paré PD, Hao K. Airway Epithelial Expression Quantitative Trait Loci Reveal Genes Underlying Asthma and Other Airway Diseases. Am J Respir Cell Mol Biol 2016; 54:177-87. [PMID: 26102239 DOI: 10.1165/rcmb.2014-0381oc] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Genome-wide association studies (GWASs) have identified loci that are robustly associated with asthma and related phenotypes; however, the molecular mechanisms underlying these associations need to be explored. The most relevant tissues to study the functional consequences of asthma are the airways. We used publically available data to derive expression quantitative trait loci (eQTLs) for human epithelial cells from small and large airways and applied the eQTLs in the interpretation of GWAS results of asthma and related phenotypes. For the small airways (n = 105), we discovered 660 eQTLs at a 10% false discovery rate (FDR), among which 315 eQTLs were not previously reported in a large-scale eQTL study of whole lung tissue. A large fraction of the identified eQTLs is supported by data from Encyclopedia of DNA Elements (ENCODE) showing that the eQTLs reside in regulatory elements (57.5 and 67.6% of cis- and trans-eQTLs, respectively). Published pulmonary GWAS hits were enriched as airway epithelial eQTLs (9.2-fold). Further, genes regulated by asthma GWAS loci in epithelium are significantly enriched in immune response pathways, such as IL-4 signaling (FDR, 5.2 × 10(-4)). The airway epithelial eQTLs described in this study are complementary to previously reported lung eQTLs and represent a powerful resource to link GWAS-associated variants to their regulatory function and thus elucidate the molecular mechanisms underlying asthma and airway-related conditions.
Collapse
Affiliation(s)
- Wei Luo
- 1 College of Computer Science and Technology, Huaqiao University, Xiamen, China.,2 Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Ma'en Obeidat
- 3 The University of British Columbia Center for Heart Lung Innovation, St. Paul's Hospital, Vancouver, British Columbia, Canada.,4 Respiratory Division, Department of Medicine, University of British Columbia, Vancouver, British Columbia, Canada
| | - Antonio Fabio Di Narzo
- 2 Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York.,5 Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, New York; and
| | - Rong Chen
- 2 Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York.,5 Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, New York; and
| | - Don D Sin
- 3 The University of British Columbia Center for Heart Lung Innovation, St. Paul's Hospital, Vancouver, British Columbia, Canada.,4 Respiratory Division, Department of Medicine, University of British Columbia, Vancouver, British Columbia, Canada
| | - Peter D Paré
- 3 The University of British Columbia Center for Heart Lung Innovation, St. Paul's Hospital, Vancouver, British Columbia, Canada.,4 Respiratory Division, Department of Medicine, University of British Columbia, Vancouver, British Columbia, Canada
| | - Ke Hao
- 2 Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York.,5 Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, New York; and.,6 Department of Respiratory Medicine, Shanghai Tenth People's Hospital, Tongji University, Shanghai, China
| |
Collapse
|
14
|
Grünewald TGP, Bernard V, Gilardi-Hebenstreit P, Raynal V, Surdez D, Aynaud MM, Mirabeau O, Cidre-Aranaz F, Tirode F, Zaidi S, Perot G, Jonker AH, Lucchesi C, Le Deley MC, Oberlin O, Marec-Bérard P, Véron AS, Reynaud S, Lapouble E, Boeva V, Rio Frio T, Alonso J, Bhatia S, Pierron G, Cancel-Tassin G, Cussenot O, Cox DG, Morton LM, Machiela MJ, Chanock SJ, Charnay P, Delattre O. Chimeric EWSR1-FLI1 regulates the Ewing sarcoma susceptibility gene EGR2 via a GGAA microsatellite. Nat Genet 2015. [PMID: 26214589 DOI: 10.1038/ng.3363] [Citation(s) in RCA: 126] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Deciphering the ways in which somatic mutations and germline susceptibility variants cooperate to promote cancer is challenging. Ewing sarcoma is characterized by fusions between EWSR1 and members of the ETS gene family, usually EWSR1-FLI1, leading to the generation of oncogenic transcription factors that bind DNA at GGAA motifs. A recent genome-wide association study identified susceptibility variants near EGR2. Here we found that EGR2 knockdown inhibited proliferation, clonogenicity and spheroidal growth in vitro and induced regression of Ewing sarcoma xenografts. Targeted germline deep sequencing of the EGR2 locus in affected subjects and controls identified 291 Ewing-associated SNPs. At rs79965208, the A risk allele connected adjacent GGAA repeats by converting an interspaced GGAT motif into a GGAA motif, thereby increasing the number of consecutive GGAA motifs and thus the EWSR1-FLI1-dependent enhancer activity of this sequence, with epigenetic characteristics of an active regulatory element. EWSR1-FLI1 preferentially bound to the A risk allele, which increased global and allele-specific EGR2 expression. Collectively, our findings establish cooperation between a dominant oncogene and a susceptibility variant that regulates a major driver of Ewing sarcomagenesis.
Collapse
Affiliation(s)
- Thomas G P Grünewald
- Genetics and Biology of Cancers Unit, Institut Curie, PSL Research University, Paris, France.,INSERM U830, Institut Curie Research Center, Paris, France
| | - Virginie Bernard
- Institut Curie Genomics of Excellence (ICGex) Platform, Institut Curie Research Center, Paris, France
| | - Pascale Gilardi-Hebenstreit
- École Normale Supérieure (ENS), Institut de Biologie de l'ENS (IBENS), INSERM U1024, CNRS UMR8197, Paris, France
| | - Virginie Raynal
- Genetics and Biology of Cancers Unit, Institut Curie, PSL Research University, Paris, France.,INSERM U830, Institut Curie Research Center, Paris, France.,Institut Curie Genomics of Excellence (ICGex) Platform, Institut Curie Research Center, Paris, France
| | - Didier Surdez
- Genetics and Biology of Cancers Unit, Institut Curie, PSL Research University, Paris, France.,INSERM U830, Institut Curie Research Center, Paris, France
| | - Marie-Ming Aynaud
- Genetics and Biology of Cancers Unit, Institut Curie, PSL Research University, Paris, France.,INSERM U830, Institut Curie Research Center, Paris, France
| | - Olivier Mirabeau
- Genetics and Biology of Cancers Unit, Institut Curie, PSL Research University, Paris, France.,INSERM U830, Institut Curie Research Center, Paris, France
| | - Florencia Cidre-Aranaz
- Instituto de Investigación de Enfermedades Raras, Instituto de Salud Carlos III, Madrid, Spain
| | - Franck Tirode
- Genetics and Biology of Cancers Unit, Institut Curie, PSL Research University, Paris, France.,INSERM U830, Institut Curie Research Center, Paris, France
| | - Sakina Zaidi
- Genetics and Biology of Cancers Unit, Institut Curie, PSL Research University, Paris, France.,INSERM U830, Institut Curie Research Center, Paris, France
| | - Gaëlle Perot
- INSERM U916 Biology of Sarcomas, Institut Bergonié, Bordeaux, France
| | - Anneliene H Jonker
- Genetics and Biology of Cancers Unit, Institut Curie, PSL Research University, Paris, France.,INSERM U830, Institut Curie Research Center, Paris, France
| | - Carlo Lucchesi
- Genetics and Biology of Cancers Unit, Institut Curie, PSL Research University, Paris, France.,INSERM U830, Institut Curie Research Center, Paris, France
| | - Marie-Cécile Le Deley
- Département d'Epidémiologie et de Biostatistiques, Institut Gustave Roussy, Villejuif, France
| | - Odile Oberlin
- Département de Pédiatrie, Institut Gustave Roussy, Villejuif, France
| | - Perrine Marec-Bérard
- Institute for Pediatric Hematology and Oncology, Leon-Bérard Cancer Center, University of Lyon, Lyon, France
| | - Amélie S Véron
- INSERM U1052, Léon-Bérard Cancer Centre, Cancer Research Center of Lyon, Lyon, France
| | - Stephanie Reynaud
- Unité Génétique Somatique (UGS), Institut Curie Centre Hospitalier, Paris, France
| | - Eve Lapouble
- Unité Génétique Somatique (UGS), Institut Curie Centre Hospitalier, Paris, France
| | - Valentina Boeva
- INSERM U900, Bioinformatics, Biostatistics, Epidemiology and Computational Systems Biology of Cancer, Institut Curie Research Center, Paris, France.,Mines ParisTech, Fontainebleau, France
| | - Thomas Rio Frio
- Institut Curie Genomics of Excellence (ICGex) Platform, Institut Curie Research Center, Paris, France
| | - Javier Alonso
- Instituto de Investigación de Enfermedades Raras, Instituto de Salud Carlos III, Madrid, Spain
| | - Smita Bhatia
- Institute for Cancer Outcomes and Survivorship, School of Medicine, University of Alabama, Birmingham, Alabama, USA
| | - Gaëlle Pierron
- Unité Génétique Somatique (UGS), Institut Curie Centre Hospitalier, Paris, France
| | - Geraldine Cancel-Tassin
- Centre de Recherche sur les Pathologies Prostatiques (CeRePP)-Laboratory for Urology, Research Team 2, UPMC, Hôpital Tenon, Paris, France
| | - Olivier Cussenot
- Centre de Recherche sur les Pathologies Prostatiques (CeRePP)-Laboratory for Urology, Research Team 2, UPMC, Hôpital Tenon, Paris, France
| | - David G Cox
- INSERM U1052, Léon-Bérard Cancer Centre, Cancer Research Center of Lyon, Lyon, France
| | - Lindsay M Morton
- Division of Cancer Epidemiology and Genetics (DCEG), National Cancer Institute (NCI), Bethesda, Maryland, USA
| | - Mitchell J Machiela
- Division of Cancer Epidemiology and Genetics (DCEG), National Cancer Institute (NCI), Bethesda, Maryland, USA
| | - Stephen J Chanock
- Division of Cancer Epidemiology and Genetics (DCEG), National Cancer Institute (NCI), Bethesda, Maryland, USA
| | - Patrick Charnay
- École Normale Supérieure (ENS), Institut de Biologie de l'ENS (IBENS), INSERM U1024, CNRS UMR8197, Paris, France
| | - Olivier Delattre
- Genetics and Biology of Cancers Unit, Institut Curie, PSL Research University, Paris, France.,INSERM U830, Institut Curie Research Center, Paris, France.,Institut Curie Genomics of Excellence (ICGex) Platform, Institut Curie Research Center, Paris, France.,Unité Génétique Somatique (UGS), Institut Curie Centre Hospitalier, Paris, France
| |
Collapse
|
15
|
Albert FW, Kruglyak L. The role of regulatory variation in complex traits and disease. Nat Rev Genet 2015; 16:197-212. [DOI: 10.1038/nrg3891] [Citation(s) in RCA: 675] [Impact Index Per Article: 67.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
16
|
Ganesh S, Chasman D, Larson M, Guo X, Verwoert G, Bis J, Gu X, Smith A, Yang ML, Zhang Y, Ehret G, Rose L, Hwang SJ, Papanicolau G, Sijbrands E, Rice K, Eiriksdottir G, Pihur V, Ridker P, Vasan R, Newton-Cheh C, Raffel LJ, Amin N, Rotter JI, Liu K, Launer LJ, Xu M, Caulfield M, Morrison AC, Johnson AD, Vaidya D, Dehghan A, Li G, Bouchard C, Harris TB, Zhang H, Boerwinkle E, Siscovick DS, Gao W, Uitterlinden AG, Rivadeneira F, Hofman A, Willer CJ, Franco OH, Huo Y, Witteman JC, Munroe PB, Gudnason V, Palmas W, van Duijn C, Fornage M, Levy D, Psaty BM, Chakravarti A, Newton-Cheh C, Johnson T, Gateva V, Tobin M, Bochud M, Coin L, Najjar S, Zhao J, Heath S, Eyheramendy S, Papadakis K, Voight B, Scott L, Zhang F, Farrall M, Tanaka T, Wallace C, Chambers J, Khaw KT, Nilsson P, van der Harst P, Polidoro S, Grobbee D, Onland-Moret N, Bots M, Wain L, Elliott K, Teumer A, Luan J, Lucas G, Kuusisto J, Burton P, Hadley D, McArdle W, Brown M, Dominiczak A, Newhouse S, Samani N, Webster J, Zeggini E, Beckmann J, Bergmann S, Lim N, Song K, Vollenweider P, Waeber G, et alGanesh S, Chasman D, Larson M, Guo X, Verwoert G, Bis J, Gu X, Smith A, Yang ML, Zhang Y, Ehret G, Rose L, Hwang SJ, Papanicolau G, Sijbrands E, Rice K, Eiriksdottir G, Pihur V, Ridker P, Vasan R, Newton-Cheh C, Raffel LJ, Amin N, Rotter JI, Liu K, Launer LJ, Xu M, Caulfield M, Morrison AC, Johnson AD, Vaidya D, Dehghan A, Li G, Bouchard C, Harris TB, Zhang H, Boerwinkle E, Siscovick DS, Gao W, Uitterlinden AG, Rivadeneira F, Hofman A, Willer CJ, Franco OH, Huo Y, Witteman JC, Munroe PB, Gudnason V, Palmas W, van Duijn C, Fornage M, Levy D, Psaty BM, Chakravarti A, Newton-Cheh C, Johnson T, Gateva V, Tobin M, Bochud M, Coin L, Najjar S, Zhao J, Heath S, Eyheramendy S, Papadakis K, Voight B, Scott L, Zhang F, Farrall M, Tanaka T, Wallace C, Chambers J, Khaw KT, Nilsson P, van der Harst P, Polidoro S, Grobbee D, Onland-Moret N, Bots M, Wain L, Elliott K, Teumer A, Luan J, Lucas G, Kuusisto J, Burton P, Hadley D, McArdle W, Brown M, Dominiczak A, Newhouse S, Samani N, Webster J, Zeggini E, Beckmann J, Bergmann S, Lim N, Song K, Vollenweider P, Waeber G, Waterworth D, Yuan X, Groop L, Orho-Melander M, Allione A, Di Gregorio A, Guarrera S, Panico S, Ricceri F, Romanazzi V, Sacerdote C, Vineis P, Barroso I, Sandhu M, Luben R, Crawford G, Jousilahti P, Perola M, Boehnke M, Bonnycastle L, Collins F, Jackson A, Mohlke K, Stringham H, Valle T, Willer C, Bergman R, Morken M, Döring A, Gieger C, Illig T, Meitinger T, Org E, Pfeufer A, Wichmann H, Kathiresan S, Marrugat J, O’Donnell C, Schwartz S, Siscovick D, Subirana I, Freimer N, Hartikainen AL, McCarthy M, O’Reilly P, Peltonen L, Pouta A, de Jong P, Snieder H, van Gilst W, Clarke R, Goel A, Hamsten A, Peden J, Seedorf U, Syvänen AC, Tognoni G, Lakatta E, Sanna S, Scheet P, Schlessinger D, Scuteri A, Dörr M, Ernst F, Felix S, Homuth G, Lorbeer R, Reffelmann T, Rettig R, Völker U, Galan P, Gut I, Hercberg S, Lathrop G, Zeleneka D, Deloukas P, Soranzo N, Williams F, Zhai G, Salomaa V, Laakso M, Elosua R, Forouhi N, Völzke H, Uiterwaal C, van der Schouw Y, Numans M, Matullo G, Navis G, Berglund G, Bingham S, Kooner J, Paterson A, Connell J, Bandinelli S, Ferrucci L, Watkins H, Spector T, Tuomilehto J, Altshuler D, Strachan D, Laan M, Meneton P, Wareham N, Uda M, Jarvelin MR, Mooser V, Melander O, Loos R, Elliott P, Abecasis G, Caulfield M, Munroe P. Effects of long-term averaging of quantitative blood pressure traits on the detection of genetic associations. Am J Hum Genet 2014; 95:49-65. [PMID: 24975945 DOI: 10.1016/j.ajhg.2014.06.002] [Show More Authors] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2014] [Accepted: 06/03/2014] [Indexed: 01/11/2023] Open
Abstract
Blood pressure (BP) is a heritable, quantitative trait with intraindividual variability and susceptibility to measurement error. Genetic studies of BP generally use single-visit measurements and thus cannot remove variability occurring over months or years. We leveraged the idea that averaging BP measured across time would improve phenotypic accuracy and thereby increase statistical power to detect genetic associations. We studied systolic BP (SBP), diastolic BP (DBP), mean arterial pressure (MAP), and pulse pressure (PP) averaged over multiple years in 46,629 individuals of European ancestry. We identified 39 trait-variant associations across 19 independent loci (p < 5 × 10(-8)); five associations (in four loci) uniquely identified by our LTA analyses included those of SBP and MAP at 2p23 (rs1275988, near KCNK3), DBP at 2q11.2 (rs7599598, in FER1L5), and PP at 6p21 (rs10948071, near CRIP3) and 7p13 (rs2949837, near IGFBP3). Replication analyses conducted in cohorts with single-visit BP data showed positive replication of associations and a nominal association (p < 0.05). We estimated a 20% gain in statistical power with long-term average (LTA) as compared to single-visit BP association studies. Using LTA analysis, we identified genetic loci influencing BP. LTA might be one way of increasing the power of genetic associations for continuous traits in extant samples for other phenotypes that are measured serially over time.
Collapse
|
17
|
Agler AH, Crystal RG, Mezey JG, Fuller J, Gao C, Hansen JG, Cassano PA. Differential expression of vitamin E and selenium-responsive genes by disease severity in chronic obstructive pulmonary disease. COPD 2014; 10:450-8. [PMID: 23875740 DOI: 10.3109/15412555.2012.761958] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Antioxidant nutritional status is hypothesized to influence chronic obstructive pulmonary disease (COPD) susceptibility and progression. Although past studies relate antioxidants to gene expression, there are no data in patients with COPD. This study investigated the hypothesis that antioxidant status is compromised in patients with COPD, and antioxidant-responsive genes differentially express in a similar pattern. Lung tissue samples from patients with COPD were assayed for vitamin E and gene expression. Selenium and vitamin E were assayed in corresponding plasma samples. Discovery based genome-wide expression analysis compared moderate, severe, and very severe COPD (GOLD II-IV) patients to mild and at-risk/normal (GOLD 0-I). Hypotheses-driven analyses assessed differential gene expression by disease severity for vitamin E-responsive and selenium-responsive genes. GOLD II-IV COPD patients had 30% lower lung tissue vitamin E levels compared to GOLD 0-I participants (p = 0.0082). No statistically significant genome-wide differences in expression by disease severity were identified. Hypothesis-driven analyses of 109 genes found 16 genes differentially expressed (padjusted < 0.05) by disease severity including 6 selenium-responsive genes (range in fold-change -1.39 to 2.25), 6 vitamin E-responsive genes (fold-change -2.30 to 1.51), and 4 COPD-associated genes. Lung tissue vitamin E in patients with COPD was associated with disease severity and vitamin E-responsive genes were differentially expressed by disease severity. Although nutritional status is hypothesized to contribute to COPD risk, and is of therapeutic interest, evidence to date is mainly observational. The findings reported herein are novel, and support a role of vitamin E in COPD progression.
Collapse
Affiliation(s)
- Anne H Agler
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA
| | | | | | | | | | | | | |
Collapse
|
18
|
Reardon BJ, Hansen JG, Crystal RG, Houston DK, Kritchevsky SB, Harris T, Lohman K, Liu Y, O'Connor GT, Wilk JB, Mezey J, Gao C, Cassano PA. Vitamin D-responsive SGPP2 variants associated with lung cell expression and lung function. BMC MEDICAL GENETICS 2013; 14:122. [PMID: 24274704 PMCID: PMC3907038 DOI: 10.1186/1471-2350-14-122] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/21/2013] [Accepted: 11/08/2013] [Indexed: 12/05/2022]
Abstract
Background Vitamin D is associated with lung health in epidemiologic studies, but mechanisms mediating observed associations are poorly understood. This study explores mechanisms for an effect of vitamin D in lung through an in vivo gene expression study, an expression quantitative trait loci (eQTL) analysis in lung tissue, and a population-based cohort study of sequence variants. Methods Microarray analysis investigated the association of gene expression in small airway epithelial cells with serum 25(OH)D in adult non-smokers. Sequence variants in candidate genes identified by the microarray were investigated in a lung tissue eQTL database, and also in relation to cross-sectional pulmonary function in the Health, Aging, and Body Composition (Health ABC) study, stratified by race, with replication in the Framingham Heart Study (FHS). Results 13 candidate genes had significant differences in expression by serum 25(OH)D (nominal p < 0.05), and a genome-wide significant eQTL association was detected for SGPP2. In Health ABC, SGPP2 SNPs were associated with FEV1 in both European- and African-Americans, and the gene-level association was replicated in European-American FHS participants. SNPs in 5 additional candidate genes (DAPK1, FSTL1, KAL1, KCNS3, and RSAD2) were associated with FEV1 in Health ABC participants. Conclusions SGPP2, a sphingosine-1-phosphate phosphatase, is a novel vitamin D-responsive gene associated with lung function. The identified associations will need to be followed up in further studies.
Collapse
Affiliation(s)
- Brian J Reardon
- Division of Nutritional Sciences, Cornell University, 209 Savage Hall, Ithaca, NY 14853, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|