1
|
Bonder MJ, Luijk R, Zhernakova DV, Moed M, Deelen P, Vermaat M, van Iterson M, van Dijk F, van Galen M, Bot J, Slieker RC, Jhamai PM, Verbiest M, Suchiman HED, Verkerk M, van der Breggen R, van Rooij J, Lakenberg N, Arindrarto W, Kielbasa SM, Jonkers I, van 't Hof P, Nooren I, Beekman M, Deelen J, van Heemst D, Zhernakova A, Tigchelaar EF, Swertz MA, Hofman A, Uitterlinden AG, Pool R, van Dongen J, Hottenga JJ, Stehouwer CDA, van der Kallen CJH, Schalkwijk CG, van den Berg LH, van Zwet EW, Mei H, Li Y, Lemire M, Hudson TJ, Slagboom PE, Wijmenga C, Veldink JH, van Greevenbroek MMJ, van Duijn CM, Boomsma DI, Isaacs A, Jansen R, van Meurs JBJ, 't Hoen PAC, Franke L, Heijmans BT. Disease variants alter transcription factor levels and methylation of their binding sites. Nat Genet 2016; 49:131-138. [PMID: 27918535 DOI: 10.1038/ng.3721] [Citation(s) in RCA: 289] [Impact Index Per Article: 36.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2015] [Accepted: 10/18/2016] [Indexed: 12/15/2022]
Abstract
Most disease-associated genetic variants are noncoding, making it challenging to design experiments to understand their functional consequences. Identification of expression quantitative trait loci (eQTLs) has been a powerful approach to infer the downstream effects of disease-associated variants, but most of these variants remain unexplained. The analysis of DNA methylation, a key component of the epigenome, offers highly complementary data on the regulatory potential of genomic regions. Here we show that disease-associated variants have widespread effects on DNA methylation in trans that likely reflect differential occupancy of trans binding sites by cis-regulated transcription factors. Using multiple omics data sets from 3,841 Dutch individuals, we identified 1,907 established trait-associated SNPs that affect the methylation levels of 10,141 different CpG sites in trans (false discovery rate (FDR) < 0.05). These included SNPs that affect both the expression of a nearby transcription factor (such as NFKB1, CTCF and NKX2-3) and methylation of its respective binding site across the genome. Trans methylation QTLs effectively expose the downstream effects of disease-associated variants.
Collapse
Affiliation(s)
- Marc Jan Bonder
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
| | - René Luijk
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Daria V Zhernakova
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
| | - Matthijs Moed
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Patrick Deelen
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands.,University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands
| | - Martijn Vermaat
- Department of Human Genetics, Leiden University Medical Center, Leiden, the Netherlands
| | - Maarten van Iterson
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Freerk van Dijk
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands.,University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands
| | - Michiel van Galen
- University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands
| | - Jan Bot
- SURFsara, Amsterdam, the Netherlands
| | - Roderick C Slieker
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - P Mila Jhamai
- Department of Internal Medicine, ErasmusMC, Rotterdam, the Netherlands
| | - Michael Verbiest
- University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands
| | - H Eka D Suchiman
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Marijn Verkerk
- Department of Internal Medicine, ErasmusMC, Rotterdam, the Netherlands
| | - Ruud van der Breggen
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Jeroen van Rooij
- Department of Internal Medicine, ErasmusMC, Rotterdam, the Netherlands
| | - Nico Lakenberg
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Wibowo Arindrarto
- Medical Statistics Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Szymon M Kielbasa
- Sequence Analysis Support Core, Leiden University Medical Center, Leiden, the Netherlands
| | - Iris Jonkers
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
| | - Peter van 't Hof
- Sequence Analysis Support Core, Leiden University Medical Center, Leiden, the Netherlands
| | | | - Marian Beekman
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Joris Deelen
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Diana van Heemst
- Department of Gerontology and Geriatrics, Leiden University Medical Center, Leiden, the Netherlands
| | - Alexandra Zhernakova
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
| | - Ettje F Tigchelaar
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
| | - Morris A Swertz
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands.,University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands
| | - Albert Hofman
- Department of Epidemiology, ErasmusMC, Rotterdam, the Netherlands
| | | | - René Pool
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Neuroscience Campus Amsterdam, Amsterdam, the Netherlands
| | - Jenny van Dongen
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Neuroscience Campus Amsterdam, Amsterdam, the Netherlands
| | - Jouke J Hottenga
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Neuroscience Campus Amsterdam, Amsterdam, the Netherlands
| | - Coen D A Stehouwer
- Department of Internal Medicine, Maastricht University Medical Center, Maastricht, the Netherlands.,School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, Maastricht, the Netherlands
| | - Carla J H van der Kallen
- Department of Internal Medicine, Maastricht University Medical Center, Maastricht, the Netherlands.,School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, Maastricht, the Netherlands
| | - Casper G Schalkwijk
- Department of Internal Medicine, Maastricht University Medical Center, Maastricht, the Netherlands.,School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, Maastricht, the Netherlands
| | - Leonard H van den Berg
- Department of Neurology, Brain Center Rudolf Magnus, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Erik W van Zwet
- Medical Statistics Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Hailiang Mei
- Sequence Analysis Support Core, Leiden University Medical Center, Leiden, the Netherlands
| | - Yang Li
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
| | - Mathieu Lemire
- Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | - Thomas J Hudson
- Ontario Institute for Cancer Research, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | | | - P Eline Slagboom
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Cisca Wijmenga
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
| | - Jan H Veldink
- Department of Neurology, Brain Center Rudolf Magnus, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Marleen M J van Greevenbroek
- Department of Internal Medicine, Maastricht University Medical Center, Maastricht, the Netherlands.,School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, Maastricht, the Netherlands
| | - Cornelia M van Duijn
- Genetic Epidemiology Unit, Department of Epidemiology, ErasmusMC, Rotterdam, the Netherlands
| | - Dorret I Boomsma
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Neuroscience Campus Amsterdam, Amsterdam, the Netherlands
| | - Aaron Isaacs
- School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, Maastricht, the Netherlands.,Genetic Epidemiology Unit, Department of Epidemiology, ErasmusMC, Rotterdam, the Netherlands.,Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, the Netherlands
| | - Rick Jansen
- Department of Psychiatry, VU University Medical Center, Neuroscience Campus Amsterdam, Amsterdam, the Netherlands
| | | | - Peter A C 't Hoen
- Department of Human Genetics, Leiden University Medical Center, Leiden, the Netherlands
| | - Lude Franke
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
| | - Bastiaan T Heijmans
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| |
Collapse
|
2
|
Zhernakova DV, Deelen P, Vermaat M, van Iterson M, van Galen M, Arindrarto W, van 't Hof P, Mei H, van Dijk F, Westra HJ, Bonder MJ, van Rooij J, Verkerk M, Jhamai PM, Moed M, Kielbasa SM, Bot J, Nooren I, Pool R, van Dongen J, Hottenga JJ, Stehouwer CDA, van der Kallen CJH, Schalkwijk CG, Zhernakova A, Li Y, Tigchelaar EF, de Klein N, Beekman M, Deelen J, van Heemst D, van den Berg LH, Hofman A, Uitterlinden AG, van Greevenbroek MMJ, Veldink JH, Boomsma DI, van Duijn CM, Wijmenga C, Slagboom PE, Swertz MA, Isaacs A, van Meurs JBJ, Jansen R, Heijmans BT, 't Hoen PAC, Franke L. Identification of context-dependent expression quantitative trait loci in whole blood. Nat Genet 2016; 49:139-145. [PMID: 27918533 DOI: 10.1038/ng.3737] [Citation(s) in RCA: 255] [Impact Index Per Article: 31.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2015] [Accepted: 11/02/2016] [Indexed: 02/07/2023]
Abstract
Genetic risk factors often localize to noncoding regions of the genome with unknown effects on disease etiology. Expression quantitative trait loci (eQTLs) help to explain the regulatory mechanisms underlying these genetic associations. Knowledge of the context that determines the nature and strength of eQTLs may help identify cell types relevant to pathophysiology and the regulatory networks underlying disease. Here we generated peripheral blood RNA-seq data from 2,116 unrelated individuals and systematically identified context-dependent eQTLs using a hypothesis-free strategy that does not require previous knowledge of the identity of the modifiers. Of the 23,060 significant cis-regulated genes (false discovery rate (FDR) ≤ 0.05), 2,743 (12%) showed context-dependent eQTL effects. The majority of these effects were influenced by cell type composition. A set of 145 cis-eQTLs depended on type I interferon signaling. Others were modulated by specific transcription factors binding to the eQTL SNPs.
Collapse
Affiliation(s)
- Daria V Zhernakova
- University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands
| | - Patrick Deelen
- University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands.,University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
| | - Martijn Vermaat
- Department of Human Genetics, Leiden University Medical Center, Leiden, the Netherlands
| | - Maarten van Iterson
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Michiel van Galen
- Department of Human Genetics, Leiden University Medical Center, Leiden, the Netherlands
| | - Wibowo Arindrarto
- Sequence Analysis Support Core, Leiden University Medical Center, Leiden, the Netherlands
| | - Peter van 't Hof
- Sequence Analysis Support Core, Leiden University Medical Center, Leiden, the Netherlands
| | - Hailiang Mei
- Sequence Analysis Support Core, Leiden University Medical Center, Leiden, the Netherlands
| | - Freerk van Dijk
- University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands.,University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
| | - Harm-Jan Westra
- Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA.,Partners Center for Personalized Genetic Medicine, Boston, Massachusetts, USA.,Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Marc Jan Bonder
- University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands
| | - Jeroen van Rooij
- Department of Internal Medicine, ErasmusMC, Rotterdam, the Netherlands
| | - Marijn Verkerk
- Department of Internal Medicine, ErasmusMC, Rotterdam, the Netherlands
| | - P Mila Jhamai
- Department of Internal Medicine, ErasmusMC, Rotterdam, the Netherlands
| | - Matthijs Moed
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Szymon M Kielbasa
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Jan Bot
- SURFsara, Amsterdam, the Netherlands
| | | | - René Pool
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Neuroscience Campus Amsterdam, Amsterdam, the Netherlands
| | - Jenny van Dongen
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Neuroscience Campus Amsterdam, Amsterdam, the Netherlands
| | - Jouke J Hottenga
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Neuroscience Campus Amsterdam, Amsterdam, the Netherlands
| | - Coen D A Stehouwer
- Department of Internal Medicine, Maastricht University Medical Center, Maastricht, the Netherlands.,School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, Maastricht, the Netherlands
| | - Carla J H van der Kallen
- Department of Internal Medicine, Maastricht University Medical Center, Maastricht, the Netherlands.,School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, Maastricht, the Netherlands
| | - Casper G Schalkwijk
- Department of Internal Medicine, Maastricht University Medical Center, Maastricht, the Netherlands.,School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, Maastricht, the Netherlands
| | - Alexandra Zhernakova
- University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands
| | - Yang Li
- University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands
| | - Ettje F Tigchelaar
- University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands
| | - Niek de Klein
- University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands
| | - Marian Beekman
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Joris Deelen
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Diana van Heemst
- Department of Gerontology and Geriatrics, Leiden University Medical Center, Leiden, the Netherlands
| | - Leonard H van den Berg
- Department of Neurology, Brain Center Rudolf Magnus, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Albert Hofman
- Department of Epidemiology, ErasmusMC, Rotterdam, the Netherlands
| | | | - Marleen M J van Greevenbroek
- Department of Internal Medicine, Maastricht University Medical Center, Maastricht, the Netherlands.,School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, Maastricht, the Netherlands
| | - Jan H Veldink
- Department of Neurology, Brain Center Rudolf Magnus, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Dorret I Boomsma
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Neuroscience Campus Amsterdam, Amsterdam, the Netherlands
| | - Cornelia M van Duijn
- Genetic Epidemiology Unit, Department of Epidemiology, ErasmusMC, Rotterdam, the Netherlands
| | - Cisca Wijmenga
- University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands
| | - P Eline Slagboom
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Morris A Swertz
- University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands.,University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
| | - Aaron Isaacs
- School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, Maastricht, the Netherlands.,Genetic Epidemiology Unit, Department of Epidemiology, ErasmusMC, Rotterdam, the Netherlands.,Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, the Netherlands
| | | | - Rick Jansen
- Department of Psychiatry, VU University Medical Center, Neuroscience Campus Amsterdam, Amsterdam, the Netherlands
| | - Bastiaan T Heijmans
- Molecular Epidemiology Section, Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands
| | - Peter A C 't Hoen
- Department of Human Genetics, Leiden University Medical Center, Leiden, the Netherlands
| | - Lude Franke
- University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands
| |
Collapse
|
3
|
Slieker RC, van Iterson M, Luijk R, Beekman M, Zhernakova DV, Moed MH, Mei H, van Galen M, Deelen P, Bonder MJ, Zhernakova A, Uitterlinden AG, Tigchelaar EF, Stehouwer CDA, Schalkwijk CG, van der Kallen CJH, Hofman A, van Heemst D, de Geus EJ, van Dongen J, Deelen J, van den Berg LH, van Meurs J, Jansen R, 't Hoen PAC, Franke L, Wijmenga C, Veldink JH, Swertz MA, van Greevenbroek MMJ, van Duijn CM, Boomsma DI, Slagboom PE, Heijmans BT. Age-related accrual of methylomic variability is linked to fundamental ageing mechanisms. Genome Biol 2016; 17:191. [PMID: 27654999 PMCID: PMC5032245 DOI: 10.1186/s13059-016-1053-6] [Citation(s) in RCA: 91] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2016] [Accepted: 09/01/2016] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Epigenetic change is a hallmark of ageing but its link to ageing mechanisms in humans remains poorly understood. While DNA methylation at many CpG sites closely tracks chronological age, DNA methylation changes relevant to biological age are expected to gradually dissociate from chronological age, mirroring the increased heterogeneity in health status at older ages. RESULTS Here, we report on the large-scale identification of 6366 age-related variably methylated positions (aVMPs) identified in 3295 whole blood DNA methylation profiles, 2044 of which have a matching RNA-seq gene expression profile. aVMPs are enriched at polycomb repressed regions and, accordingly, methylation at those positions is associated with the expression of genes encoding components of polycomb repressive complex 2 (PRC2) in trans. Further analysis revealed trans-associations for 1816 aVMPs with an additional 854 genes. These trans-associated aVMPs are characterized by either an age-related gain of methylation at CpG islands marked by PRC2 or a loss of methylation at enhancers. This distinct pattern extends to other tissues and multiple cancer types. Finally, genes associated with aVMPs in trans whose expression is variably upregulated with age (733 genes) play a key role in DNA repair and apoptosis, whereas downregulated aVMP-associated genes (121 genes) are mapped to defined pathways in cellular metabolism. CONCLUSIONS Our results link age-related changes in DNA methylation to fundamental mechanisms that are thought to drive human ageing.
Collapse
Affiliation(s)
- Roderick C Slieker
- Molecular Epidemiology section, Leiden University Medical Center, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands
| | - Maarten van Iterson
- Molecular Epidemiology section, Leiden University Medical Center, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands
| | - René Luijk
- Molecular Epidemiology section, Leiden University Medical Center, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands
| | - Marian Beekman
- Molecular Epidemiology section, Leiden University Medical Center, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands
| | - Daria V Zhernakova
- Department of Genetics, University Medical Centre Groningen, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
| | - Matthijs H Moed
- Molecular Epidemiology section, Leiden University Medical Center, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands
| | - Hailiang Mei
- Sequence Analysis Support Core, Leiden University Medical Center, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands
| | - Michiel van Galen
- Department of Human Genetics, Leiden University Medical Center, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands
| | - Patrick Deelen
- Department of Genetics, University Medical Centre Groningen, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
| | - Marc Jan Bonder
- Department of Genetics, University Medical Centre Groningen, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
| | - Alexandra Zhernakova
- Department of Genetics, University Medical Centre Groningen, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
| | - André G Uitterlinden
- Department of Internal Medicine, Erasmus University Medical Center, Wytemaweg 80, 3015 CN, Rotterdam, The Netherlands
| | - Ettje F Tigchelaar
- Department of Genetics, University Medical Centre Groningen, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
| | - Coen D A Stehouwer
- Department of Internal Medicine and School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, Universiteitssingel 50, 6229 ER, Maastricht, The Netherlands
| | - Casper G Schalkwijk
- Department of Internal Medicine and School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, Universiteitssingel 50, 6229 ER, Maastricht, The Netherlands
| | - Carla J H van der Kallen
- Department of Internal Medicine and School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, Universiteitssingel 50, 6229 ER, Maastricht, The Netherlands
| | - Albert Hofman
- Department of Epidemiology, Erasmus University Medical Center, Dr. Molewaterplein 50, 3015 GE, Rotterdam, The Netherlands
| | - Diana van Heemst
- Department of Gerontology and Geriatrics, Leiden University Medical Center, Albinusdreef 2, 2333 ZA, Leiden, The Netherlands
| | - Eco J de Geus
- Department of Biological Psychology, VU University Amsterdam, Neuroscience Campus Amsterdam, Van der Boechorststraat 1, 1081 BT, Amsterdam, The Netherlands
| | - Jenny van Dongen
- Department of Biological Psychology, VU University Amsterdam, Neuroscience Campus Amsterdam, Van der Boechorststraat 1, 1081 BT, Amsterdam, The Netherlands
| | - Joris Deelen
- Molecular Epidemiology section, Leiden University Medical Center, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands
| | - Leonard H van den Berg
- Department of Neurology, Brain Center Rudolf Magnus, University Medical Center Utrecht, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands
| | - Joyce van Meurs
- Department of Internal Medicine, Erasmus University Medical Center, Wytemaweg 80, 3015 CN, Rotterdam, The Netherlands
| | - Rick Jansen
- Department of Psychiatry, VU University Medical Center, Neuroscience Campus Amsterdam, A.J. Ernststraat 1187, 1081 HL, Amsterdam, The Netherlands
| | - Peter A C 't Hoen
- Department of Human Genetics, Leiden University Medical Center, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands
| | - Lude Franke
- Department of Genetics, University Medical Centre Groningen, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
| | - Cisca Wijmenga
- Department of Genetics, University Medical Centre Groningen, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
| | - Jan H Veldink
- Department of Neurology, Brain Center Rudolf Magnus, University Medical Center Utrecht, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands
| | - Morris A Swertz
- Genomics Coordination Center, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9713 GZ, Groningen, The Netherlands
| | - Marleen M J van Greevenbroek
- Department of Internal Medicine and School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, Universiteitssingel 50, 6229 ER, Maastricht, The Netherlands
| | - Cornelia M van Duijn
- Department of Genetic Epidemiology, Erasmus University Medical Center, Dr. Molewaterplein 50, 3015 GE, Rotterdam, The Netherlands
| | - Dorret I Boomsma
- Department of Biological Psychology, VU University Amsterdam, Neuroscience Campus Amsterdam, Van der Boechorststraat 1, 1081 BT, Amsterdam, The Netherlands
| | | | - P Eline Slagboom
- Molecular Epidemiology section, Leiden University Medical Center, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands
| | - Bastiaan T Heijmans
- Molecular Epidemiology section, Leiden University Medical Center, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands.
| |
Collapse
|
4
|
Dekkers KF, van Iterson M, Slieker RC, Moed MH, Bonder MJ, van Galen M, Mei H, Zhernakova DV, van den Berg LH, Deelen J, van Dongen J, van Heemst D, Hofman A, Hottenga JJ, van der Kallen CJH, Schalkwijk CG, Stehouwer CDA, Tigchelaar EF, Uitterlinden AG, Willemsen G, Zhernakova A, Franke L, 't Hoen PAC, Jansen R, van Meurs J, Boomsma DI, van Duijn CM, van Greevenbroek MMJ, Veldink JH, Wijmenga C, van Zwet EW, Slagboom PE, Jukema JW, Heijmans BT. Blood lipids influence DNA methylation in circulating cells. Genome Biol 2016; 17:138. [PMID: 27350042 PMCID: PMC4922056 DOI: 10.1186/s13059-016-1000-6] [Citation(s) in RCA: 126] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2016] [Accepted: 06/06/2016] [Indexed: 01/19/2023] Open
Abstract
Background Cells can be primed by external stimuli to obtain a long-term epigenetic memory. We hypothesize that long-term exposure to elevated blood lipids can prime circulating immune cells through changes in DNA methylation, a process that may contribute to the development of atherosclerosis. To interrogate the causal relationship between triglyceride, low-density lipoprotein (LDL) cholesterol, and high-density lipoprotein (HDL) cholesterol levels and genome-wide DNA methylation while excluding confounding and pleiotropy, we perform a stepwise Mendelian randomization analysis in whole blood of 3296 individuals. Results This analysis shows that differential methylation is the consequence of inter-individual variation in blood lipid levels and not vice versa. Specifically, we observe an effect of triglycerides on DNA methylation at three CpGs, of LDL cholesterol at one CpG, and of HDL cholesterol at two CpGs using multivariable Mendelian randomization. Using RNA-seq data available for a large subset of individuals (N = 2044), DNA methylation of these six CpGs is associated with the expression of CPT1A and SREBF1 (for triglycerides), DHCR24 (for LDL cholesterol) and ABCG1 (for HDL cholesterol), which are all key regulators of lipid metabolism. Conclusions Our analysis suggests a role for epigenetic priming in end-product feedback control of lipid metabolism and highlights Mendelian randomization as an effective tool to infer causal relationships in integrative genomics data. Electronic supplementary material The online version of this article (doi:10.1186/s13059-016-1000-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Koen F Dekkers
- Molecular Epidemiology section, Leiden University Medical Center, Einthovenweg 20, Leiden, The Netherlands
| | - Maarten van Iterson
- Molecular Epidemiology section, Leiden University Medical Center, Einthovenweg 20, Leiden, The Netherlands
| | - Roderick C Slieker
- Molecular Epidemiology section, Leiden University Medical Center, Einthovenweg 20, Leiden, The Netherlands
| | - Matthijs H Moed
- Molecular Epidemiology section, Leiden University Medical Center, Einthovenweg 20, Leiden, The Netherlands
| | - Marc Jan Bonder
- Department of Genetics, University of Groningen, University Medical Centre Groningen, Broerstraat 5, Groningen, The Netherlands
| | - Michiel van Galen
- Department of Human Genetics, Leiden University Medical Center, Einthovenweg 20, Leiden, The Netherlands
| | - Hailiang Mei
- Sequence Analysis Support Core, Leiden University Medical Center, Einthovenweg 20, Leiden, The Netherlands
| | - Daria V Zhernakova
- Department of Genetics, University of Groningen, University Medical Centre Groningen, Broerstraat 5, Groningen, The Netherlands
| | - Leonard H van den Berg
- Department of Neurology, Brain Center Rudolf Magnus, University Medical Center Utrecht, Heidelberglaan 100, Utrecht, The Netherlands
| | - Joris Deelen
- Molecular Epidemiology section, Leiden University Medical Center, Einthovenweg 20, Leiden, The Netherlands
| | - Jenny van Dongen
- Department of Biological Psychology, VU University Amsterdam, Neuroscience Campus Amsterdam, De Boelelaan 1117, Amsterdam, The Netherlands
| | - Diana van Heemst
- Department of Gerontology and Geriatrics, Leiden University Medical Center, Einthovenweg 20, Leiden, The Netherlands
| | - Albert Hofman
- Department of Genetic Epidemiology, ErasmusMC, 's-Gravendijkwal 230, Rotterdam, The Netherlands
| | - Jouke J Hottenga
- Department of Biological Psychology, VU University Amsterdam, Neuroscience Campus Amsterdam, De Boelelaan 1117, Amsterdam, The Netherlands
| | - Carla J H van der Kallen
- Department of Internal Medicine and School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, P. Debyelaan 25, Maastricht, The Netherlands
| | - Casper G Schalkwijk
- Department of Internal Medicine and School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, P. Debyelaan 25, Maastricht, The Netherlands
| | - Coen D A Stehouwer
- Department of Internal Medicine and School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, P. Debyelaan 25, Maastricht, The Netherlands
| | - Ettje F Tigchelaar
- Department of Genetics, University of Groningen, University Medical Centre Groningen, Broerstraat 5, Groningen, The Netherlands
| | - André G Uitterlinden
- Department of Internal Medicine, ErasmusMC, 's-Gravendijkwal 230, Rotterdam, The Netherlands
| | - Gonneke Willemsen
- Department of Biological Psychology, VU University Amsterdam, Neuroscience Campus Amsterdam, De Boelelaan 1117, Amsterdam, The Netherlands
| | - Alexandra Zhernakova
- Department of Genetics, University of Groningen, University Medical Centre Groningen, Broerstraat 5, Groningen, The Netherlands
| | - Lude Franke
- Department of Genetics, University of Groningen, University Medical Centre Groningen, Broerstraat 5, Groningen, The Netherlands
| | - Peter A C 't Hoen
- Department of Human Genetics, Leiden University Medical Center, Einthovenweg 20, Leiden, The Netherlands
| | - Rick Jansen
- Department of Psychiatry, VU University Medical Center, Neuroscience Campus Amsterdam, De Boelelaan 1117, Amsterdam, The Netherlands
| | - Joyce van Meurs
- Department of Internal Medicine, ErasmusMC, 's-Gravendijkwal 230, Rotterdam, The Netherlands
| | - Dorret I Boomsma
- Department of Biological Psychology, VU University Amsterdam, Neuroscience Campus Amsterdam, De Boelelaan 1117, Amsterdam, The Netherlands
| | - Cornelia M van Duijn
- Department of Genetic Epidemiology, ErasmusMC, 's-Gravendijkwal 230, Rotterdam, The Netherlands
| | - Marleen M J van Greevenbroek
- Department of Internal Medicine and School for Cardiovascular Diseases (CARIM), Maastricht University Medical Center, P. Debyelaan 25, Maastricht, The Netherlands
| | - Jan H Veldink
- Department of Neurology, Brain Center Rudolf Magnus, University Medical Center Utrecht, Heidelberglaan 100, Utrecht, The Netherlands
| | - Cisca Wijmenga
- Department of Genetics, University of Groningen, University Medical Centre Groningen, Broerstraat 5, Groningen, The Netherlands
| | | | - Erik W van Zwet
- Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Einthovenweg 20, Leiden, The Netherlands
| | - P Eline Slagboom
- Molecular Epidemiology section, Leiden University Medical Center, Einthovenweg 20, Leiden, The Netherlands
| | - J Wouter Jukema
- Department of Cardiology, Leiden University Medical Center, Einthovenweg 20, Leiden, The Netherlands
| | - Bastiaan T Heijmans
- Molecular Epidemiology section, Leiden University Medical Center, Einthovenweg 20, Leiden, The Netherlands.
| |
Collapse
|
5
|
Anvar SY, Khachatryan L, Vermaat M, van Galen M, Pulyakhina I, Ariyurek Y, Kraaijeveld K, den Dunnen JT, de Knijff P, ’t Hoen PAC, Laros JFJ. Determining the quality and complexity of next-generation sequencing data without a reference genome. Genome Biol 2014; 15:555. [PMID: 25514851 PMCID: PMC4298064 DOI: 10.1186/s13059-014-0555-3] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2014] [Accepted: 11/27/2014] [Indexed: 01/22/2023] Open
Abstract
We describe an open-source kPAL package that facilitates an alignment-free assessment of the quality and comparability of sequencing datasets by analyzing k-mer frequencies. We show that kPAL can detect technical artefacts such as high duplication rates, library chimeras, contamination and differences in library preparation protocols. kPAL also successfully captures the complexity and diversity of microbiomes and provides a powerful means to study changes in microbial communities. Together, these features make kPAL an attractive and broadly applicable tool to determine the quality and comparability of sequence libraries even in the absence of a reference sequence. kPAL is freely available at https://github.com/LUMC/kPAL webcite.
Collapse
Affiliation(s)
- Seyed Yahya Anvar
- />Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
- />Leiden Genome Technology Center, Leiden University Medical Center, Leiden, The Netherlands
| | - Lusine Khachatryan
- />Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Martijn Vermaat
- />Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Michiel van Galen
- />Leiden Genome Technology Center, Leiden University Medical Center, Leiden, The Netherlands
| | - Irina Pulyakhina
- />Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Yavuz Ariyurek
- />Leiden Genome Technology Center, Leiden University Medical Center, Leiden, The Netherlands
| | - Ken Kraaijeveld
- />Leiden Genome Technology Center, Leiden University Medical Center, Leiden, The Netherlands
- />Department of Ecological Science, VU University Amsterdam, Amsterdam, The Netherlands
| | - Johan T den Dunnen
- />Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
- />Leiden Genome Technology Center, Leiden University Medical Center, Leiden, The Netherlands
- />Department of Clinical Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Peter de Knijff
- />Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Peter AC ’t Hoen
- />Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Jeroen FJ Laros
- />Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
- />Leiden Genome Technology Center, Leiden University Medical Center, Leiden, The Netherlands
| |
Collapse
|
6
|
Hilbers FS, Meijers CM, Laros JFJ, van Galen M, Hoogerbrugge N, Vasen HFA, Nederlof PM, Wijnen JT, van Asperen CJ, Devilee P. Exome sequencing of germline DNA from non-BRCA1/2 familial breast cancer cases selected on the basis of aCGH tumor profiling. PLoS One 2013; 8:e55734. [PMID: 23383274 PMCID: PMC3561352 DOI: 10.1371/journal.pone.0055734] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2012] [Accepted: 12/30/2012] [Indexed: 12/17/2022] Open
Abstract
The bulk of familial breast cancer risk (∼70%) cannot be explained by mutations in the known predisposition genes, primarily BRCA1 and BRCA2. Underlying genetic heterogeneity in these cases is the probable explanation for the failure of all attempts to identify further high-risk alleles. While exome sequencing of non-BRCA1/2 breast cancer cases is a promising strategy to detect new high-risk genes, rational approaches to the rigorous pre-selection of cases are needed to reduce heterogeneity. We selected six families in which the tumours of multiple cases showed a specific genomic profile on array comparative genomic hybridization (aCGH). Linkage analysis in these families revealed a region on chromosome 4 with a LOD score of 2.49 under homogeneity. We then analysed the germline DNA of two patients from each family using exome sequencing. Initially focusing on the linkage region, no potentially pathogenic variants could be identified in more than one family. Variants outside the linkage region were then analysed, and we detected multiple possibly pathogenic variants in genes that encode DNA integrity maintenance proteins. However, further analysis led to the rejection of all variants due to poor co-segregation or a relatively high allele frequency in a control population. We concluded that using CGH results to focus on a sub-set of families for sequencing analysis did not enable us to identify a common genetic change responsible for the aggregation of breast cancer in these families. Our data also support the emerging view that non-BRCA1/2 hereditary breast cancer families have a very heterogeneous genetic basis.
Collapse
Affiliation(s)
- Florentine S Hilbers
- Department of Human Genetics, Leiden University Medical Centre, Leiden, The Netherlands.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Uil TG, Vellinga J, de Vrij J, van den Hengel SK, Rabelink MJWE, Cramer SJ, Eekels JJM, Ariyurek Y, van Galen M, Hoeben RC. Directed adenovirus evolution using engineered mutator viral polymerases. Nucleic Acids Res 2010; 39:e30. [PMID: 21138963 PMCID: PMC3061072 DOI: 10.1093/nar/gkq1258] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Adenoviruses (Ads) are the most frequently used viruses for oncolytic and gene therapy purposes. Most Ad-based vectors have been generated through rational design. Although this led to significant vector improvements, it is often hampered by an insufficient understanding of Ad’s intricate functions and interactions. Here, to evade this issue, we adopted a novel, mutator Ad polymerase-based, ‘accelerated-evolution’ approach that can serve as general method to generate or optimize adenoviral vectors. First, we site specifically substituted Ad polymerase residues located in either the nucleotide binding pocket or the exonuclease domain. This yielded several polymerase mutants that, while fully supportive of viral replication, increased Ad’s intrinsic mutation rate. Mutator activities of these mutants were revealed by performing deep sequencing on pools of replicated viruses. The strongest identified mutators carried replacements of residues implicated in ssDNA binding at the exonuclease active site. Next, we exploited these mutators to generate the genetic diversity required for directed Ad evolution. Using this new forward genetics approach, we isolated viral mutants with improved cytolytic activity. These mutants revealed a common mutation in a splice acceptor site preceding the gene for the adenovirus death protein (ADP). Accordingly, the isolated viruses showed high and untimely expression of ADP, correlating with a severe deregulation of E3 transcript splicing.
Collapse
Affiliation(s)
- Taco G Uil
- Department of Molecular Cell Biology, Leiden University Medical Center, Leiden, 2300 RC, The Netherlands
| | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Ramos YFM, Hestand MS, Verlaan M, Krabbendam E, Ariyurek Y, van Galen M, van Dam H, van Ommen GJB, den Dunnen JT, Zantema A, 't Hoen PAC. Genome-wide assessment of differential roles for p300 and CBP in transcription regulation. Nucleic Acids Res 2010; 38:5396-408. [PMID: 20435671 PMCID: PMC2938195 DOI: 10.1093/nar/gkq184] [Citation(s) in RCA: 110] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2009] [Revised: 02/26/2010] [Accepted: 03/04/2010] [Indexed: 12/20/2022] Open
Abstract
Despite high levels of homology, transcription coactivators p300 and CREB binding protein (CBP) are both indispensable during embryogenesis. They are largely known to regulate the same genes. To identify genes preferentially regulated by p300 or CBP, we performed an extensive genome-wide survey using the ChIP-seq on cell-cycle synchronized cells. We found that 57% of the tags were within genes or proximal promoters, with an overall preference for binding to transcription start and end sites. The heterogeneous binding patterns possibly reflect the divergent roles of CBP and p300 in transcriptional regulation. Most of the 16 103 genes were bound by both CBP and p300. However, after stimulation 89 and 1944 genes were preferentially bound by CBP or p300, respectively. Target genes were found to be primarily involved in the regulation of metabolic and developmental processes, and transcription, with CBP showing a stronger preference than p300 for genes active in negative regulation of transcription. Analysis of transcription factor binding sites suggest that CBP and p300 have many partners in common, but AP-1 and Serum Response Factor (SRF) appear to be more prominent in CBP-specific sequences, whereas AP-2 and SP1 are enriched in p300-specific targets. Taken together, our findings further elucidate the distinct roles of coactivators p300 and CBP in transcriptional regulation.
Collapse
Affiliation(s)
- Yolande F M Ramos
- Department of Molecular Cell Biology, Leiden University Medical Center, Postzone S4-0P, PO Box 9600, 2300 RC Leiden, The Netherlands.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Out AA, van Minderhout IJHM, Goeman JJ, Ariyurek Y, Ossowski S, Schneeberger K, Weigel D, van Galen M, Taschner PEM, Tops CMJ, Breuning MH, van Ommen GJB, den Dunnen JT, Devilee P, Hes FJ. Deep sequencing to reveal new variants in pooled DNA samples. Hum Mutat 2010; 30:1703-12. [PMID: 19842214 DOI: 10.1002/humu.21122] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
We evaluated massive parallel sequencing and long-range PCR (LRP) for rare variant detection and allele frequency estimation in pooled DNA samples. Exons 2 to 16 of the MUTYH gene were analyzed in breast cancer patients with Illumina's (Solexa) technology. From a pool of 287 genomic DNA samples we generated a single LRP product, while the same LRP was performed on 88 individual samples and the resulting products then pooled. Concentrations of constituent samples were measured with fluorimetry for genomic DNA and high-resolution melting curve analysis (HR-MCA) for LRP products. Illumina sequencing results were compared to Sanger sequencing data of individual samples. Correlation between allele frequencies detected by both methods was poor in the first pool, presumably because the genomic samples amplified unequally in the LRP, due to DNA quality variability. In contrast, allele frequencies correlated well in the second pool, in which all expected alleles at a frequency of 1% and higher were reliably detected, plus the majority of singletons (0.6% allele frequency). We describe custom bioinformatics and statistics to optimize detection of rare variants and to estimate required sequencing depth. Our results provide directions for designing high-throughput analyses of candidate genes.
Collapse
Affiliation(s)
- Astrid A Out
- Department of Clinical Genetics, Leiden University Medical Center, Leiden, The Netherlands.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Hestand MS, van Galen M, Villerius MP, van Ommen GJB, den Dunnen JT, 't Hoen PAC. CORE_TF: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes. BMC Bioinformatics 2008; 9:495. [PMID: 19036135 PMCID: PMC2613159 DOI: 10.1186/1471-2105-9-495] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2008] [Accepted: 11/26/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The identification of transcription factor binding sites is difficult since they are only a small number of nucleotides in size, resulting in large numbers of false positives and false negatives in current approaches. Computational methods to reduce false positives are to look for over-representation of transcription factor binding sites in a set of similarly regulated promoters or to look for conservation in orthologous promoter alignments. RESULTS We have developed a novel tool, "CORE_TF" (Conserved and Over-REpresented Transcription Factor binding sites) that identifies common transcription factor binding sites in promoters of co-regulated genes. To improve upon existing binding site predictions, the tool searches for position weight matrices from the TRANSFAC R database that are over-represented in an experimental set compared to a random set of promoters and identifies cross-species conservation of the predicted transcription factor binding sites. The algorithm has been evaluated with expression and chromatin-immunoprecipitation on microarray data. We also implement and demonstrate the importance of matching the random set of promoters to the experimental promoters by GC content, which is a unique feature of our tool. CONCLUSION The program CORE_TF is accessible in a user friendly web interface at http://www.LGTC.nl/CORE_TF. It provides a table of over-represented transcription factor binding sites in the users input genes' promoters and a graphical view of evolutionary conserved transcription factor binding sites. In our test data sets it successfully predicts target transcription factors and their binding sites.
Collapse
Affiliation(s)
- Matthew S Hestand
- The Center for Human and Clinical Genetics, Leiden University Medical Center, Postzone S4-0P, PO Box 9600, 2300 RC Leiden, The Netherlands.
| | | | | | | | | | | |
Collapse
|