1
|
Lancaster EE, Vladimirov VI, Riley BP, Landry JW, Roberson-Nay R, York TP. LARGE-SCALE INTEGRATION OF DNA METHYLATION AND GENE EXPRESSION ARRAY PLATFORMS IDENTIFIES BOTH cis AND trans RELATIONSHIPS. Epigenetics 2022; 17:1753-1773. [PMID: 35608069 PMCID: PMC9621057 DOI: 10.1080/15592294.2022.2079293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Although epigenome-wide association studies (EWAS) have been successful in identifying DNA methylation (DNAm) patterns associated with disease states, any further characterization of etiologic mechanisms underlying disease remains elusive. This knowledge gap does not originate from a lack of DNAm–trait associations, but rather stems from study design issues that affect the interpretability of EWAS results. Despite known limitations in predicting the function of a particular CpG site, most EWAS maintain the broad assumption that altered DNAm results in a concomitant change of transcription at the most proximal gene. This study integrated DNAm and gene expression (GE) measurements in two cohorts, the Adolescent and Young Adult Twin Study (AYATS) and the Pregnancy, Race, Environment, Genes (PREG) study, to improve the understanding of epigenomic regulatory mechanisms. CpG sites associated with GE in cis were enriched in areas of transcription factor binding and areas of intermediate-to-low CpG density. CpG sites associated with trans GE were also enriched in areas of known regulatory significance, including enhancer regions. These results highlight issues with restricting DNAm-transcript annotations to small genomic intervals and question the validity of assuming a cis DNAm–GE pathway. Based on these findings, the interpretation of EWAS results is limited in studies without multi-omic support and further research should identify genomic regions in which GE-associated DNAm is overrepresented. An in-depth characterization of GE-associated CpG sites could improve predictions of the downstream functional impact of altered DNAm and inform best practices for interpreting DNAm–trait associations generated by EWAS.
Collapse
Affiliation(s)
- Eva E Lancaster
- Department of Psychiatry, Virginia Commonwealth University, Richmond, VA 23220
| | | | - Brien P Riley
- Department of Psychiatry, Virginia Commonwealth University, Richmond, VA 23220.,Department of Human and Molecular Genetics, Virginia Commonwealth University, Richmond, VA 23220
| | - Joseph W Landry
- Department of Human and Molecular Genetics, Virginia Commonwealth University, Richmond, VA 23220
| | - Roxann Roberson-Nay
- Department of Psychiatry, Virginia Commonwealth University, Richmond, VA 23220.,Department of Psychology, Virginia Commonwealth University, Richmond, VA 23220
| | - Timothy P York
- Department of Human and Molecular Genetics, Virginia Commonwealth University, Richmond, VA 23220.,Department of Obstetrics and Gynecology, Virginia Commonwealth University, Richmond, VA 23220
| |
Collapse
|
2
|
Wang T, Qiao J, Zhang S, Wei Y, Zeng P. Simultaneous test and estimation of total genetic effect in eQTL integrative analysis through mixed models. Brief Bioinform 2022; 23:6535679. [PMID: 35212359 DOI: 10.1093/bib/bbac038] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Revised: 01/22/2022] [Accepted: 02/07/2021] [Indexed: 11/14/2022] Open
Abstract
Integration of expression quantitative trait loci (eQTL) into genome-wide association studies (GWASs) is a promising manner to reveal functional roles of associated single-nucleotide polymorphisms (SNPs) in complex phenotypes and has become an active research field in post-GWAS era. However, how to efficiently incorporate eQTL mapping study into GWAS for prioritization of causal genes remains elusive. We herein proposed a novel method termed as Mixed transcriptome-wide association studies (TWAS) and mediated Variance estimation (MTV) by modeling the effects of cis-SNPs of a gene as a function of eQTL. MTV formulates the integrative method and TWAS within a unified framework via mixed models and therefore includes many prior methods/tests as special cases. We further justified MTV from another two statistical perspectives of mediation analysis and two-stage Mendelian randomization. Relative to existing methods, MTV is superior for pronounced features including the processing of direct effects of cis-SNPs on phenotypes, the powerful likelihood ratio test for assessment of joint effects of cis-SNPs and genetically regulated gene expression (GReX), two useful quantities to measure relative genetic contributions of GReX and cis-SNPs to phenotypic variance, and the computationally efferent parameter expansion expectation maximum algorithm. With extensive simulations, we identified that MTV correctly controlled the type I error in joint evaluation of the total genetic effect and proved more powerful to discover true association signals across various scenarios compared to existing methods. We finally applied MTV to 41 complex traits/diseases available from three GWASs and discovered many new associated genes that had otherwise been missed by existing methods. We also revealed that a small but substantial fraction of phenotypic variation was mediated by GReX. Overall, MTV constructs a robust and realistic modeling foundation for integrative omics analysis and has the advantage of offering more attractive biological interpretations of GWAS results.
Collapse
Affiliation(s)
- Ting Wang
- Department of Biostatistics at Xuzhou Medical University, China
| | - Jiahao Qiao
- Department of Biostatistics at Xuzhou Medical University, China
| | - Shuo Zhang
- Department of Biostatistics at Xuzhou Medical University, China
| | - Yongyue Wei
- Department of Biostatistics at Nanjing Medical University, China
| | - Ping Zeng
- Department of Biostatistics, Center for Medical Statistics and Data Analysis and Key Laboratory of Human Genetics and Environmental Medicine at Xuzhou Medical University, China
| |
Collapse
|
3
|
Zhong W, Spracklen CN, Mohlke KL, Zheng X, Fine J, Li Y. Multi-SNP mediation intersection-union test. Bioinformatics 2020; 35:4724-4729. [PMID: 31099385 DOI: 10.1093/bioinformatics/btz285] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Revised: 03/19/2019] [Accepted: 04/16/2019] [Indexed: 12/27/2022] Open
Abstract
SUMMARY Tens of thousands of reproducibly identified GWAS (Genome-Wide Association Studies) variants, with the vast majority falling in non-coding regions resulting in no eventual protein products, call urgently for mechanistic interpretations. Although numerous methods exist, there are few, if any methods, for simultaneously testing the mediation effects of multiple correlated SNPs via some mediator (e.g. the expression of a gene in the neighborhood) on phenotypic outcome. We propose multi-SNP mediation intersection-union test (SMUT) to fill in this methodological gap. Our extensive simulations demonstrate the validity of SMUT as well as substantial, up to 92%, power gains over alternative methods. In addition, SMUT confirmed known mediators in a real dataset of Finns for plasma adiponectin level, which were missed by many alternative methods. We believe SMUT will become a useful tool to generate mechanistic hypotheses underlying GWAS variants, facilitating functional follow-up. AVAILABILITY AND IMPLEMENTATION The R package SMUT is publicly available from CRAN at https://CRAN.R-project.org/package=SMUT. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Wujuan Zhong
- Department of Biostatistics, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Cassandra N Spracklen
- Department of Genetics, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Karen L Mohlke
- Department of Genetics, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Xiaojing Zheng
- Department of Pediatrics, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jason Fine
- Department of Biostatistics, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.,Department of Statistics and Operations Research, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Yun Li
- Department of Biostatistics, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.,Department of Genetics, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.,Department of Computer Science, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
4
|
Peng C, Wang J, Asante I, Louie S, Jin R, Chatzi L, Casey G, Thomas DC, Conti DV. A latent unknown clustering integrating multi-omics data (LUCID) with phenotypic traits. Bioinformatics 2019; 36:842-850. [PMID: 31504184 PMCID: PMC7986585 DOI: 10.1093/bioinformatics/btz667] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2019] [Revised: 08/04/2019] [Accepted: 08/21/2019] [Indexed: 01/31/2023] Open
Abstract
MOTIVATION Epidemiologic, clinical and translational studies are increasingly generating multiplatform omics data. Methods that can integrate across multiple high-dimensional data types while accounting for differential patterns are critical for uncovering novel associations and underlying relevant subgroups. RESULTS We propose an integrative model to estimate latent unknown clusters (LUCID) aiming to both distinguish unique genomic, exposure and informative biomarkers/omic effects while jointly estimating subgroups relevant to the outcome of interest. Simulation studies indicate that we can obtain consistent estimates reflective of the true simulated values, accurately estimate subgroups and recapitulate subgroup-specific effects. We also demonstrate the use of the integrated model for future prediction of risk subgroups and phenotypes. We apply this approach to two real data applications to highlight the integration of genomic, exposure and metabolomic data. AVAILABILITY AND IMPLEMENTATION The LUCID method is implemented through the LUCIDus R package available on CRAN (https://CRAN.R-project.org/package=LUCIDus). SUPPLEMENTARY INFORMATION Supplementary materials are available at Bioinformatics online.
Collapse
Affiliation(s)
- Cheng Peng
- Department of Preventive Medicine, Keck School of Medicine, Los Angeles, CA 90089, USA
| | - Jun Wang
- Department of Preventive Medicine, Keck School of Medicine, Los Angeles, CA 90089, USA
| | - Isaac Asante
- Department of Clinical Pharmacy, School of Pharmacy, University of Southern California, Los Angeles, CA 90089, USA
| | - Stan Louie
- Department of Clinical Pharmacy, School of Pharmacy, University of Southern California, Los Angeles, CA 90089, USA
| | - Ran Jin
- Department of Preventive Medicine, Keck School of Medicine, Los Angeles, CA 90089, USA
| | - Lida Chatzi
- Department of Preventive Medicine, Keck School of Medicine, Los Angeles, CA 90089, USA
| | - Graham Casey
- Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia, Charlottesville, VA 22908, USA
| | - Duncan C Thomas
- Department of Preventive Medicine, Keck School of Medicine, Los Angeles, CA 90089, USA
| | | |
Collapse
|
5
|
Shan N, Wang Z, Hou L. Identification of trans-eQTLs using mediation analysis with multiple mediators. BMC Bioinformatics 2019; 20:126. [PMID: 30925861 PMCID: PMC6440281 DOI: 10.1186/s12859-019-2651-6] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Background Mapping expression quantitative trait loci (eQTLs) has provided insight into gene regulation. Compared to cis-eQTLs, the regulatory mechanisms of trans-eQTLs are less known. Previous studies suggest that trans-eQTLs may regulate expression of remote genes by altering the expression of nearby genes. Trans-association has been studied in the mediation analysis with a single mediator. However, prior applications with one mediator are prone to model misspecification due to correlations between genes. Motivated from the observation that trans-eQTLs are more likely to associate with more than one cis-gene than randomly selected SNPs in the GTEx dataset, we developed a computational method to identify trans-eQTLs that are mediated by multiple mediators. Results We proposed two hypothesis tests for testing the total mediation effect (TME) and the component-wise mediation effects (CME), respectively. We demonstrated in simulation studies that the type I error rates were controlled in both tests despite model misspecification. The TME test was more powerful than the CME test when the two mediation effects are in the same direction, while the CME test was more powerful than the TME test when the two mediation effects are in opposite direction. Multiple mediator analysis had increased power to detect mediated trans-eQTLs, especially in large samples. In the HapMap3 data, we identified 11 mediated trans-eQTLs that were not detected by the single mediator analysis in the combined samples of African populations. Moreover, the mediated trans-eQTLs in the HapMap3 samples are more likely to be trait-associated SNPs. In terms of computation, although there is no limit in the number of mediators in our model, analysis takes more time when adding additional mediators. In the analysis of the HapMap3 samples, we included at most 5 cis-gene mediators. Majority of the trios we considered have one or two mediators. Conclusions Trans-eQTLs are more likely to associate with multiple cis-genes than randomly selected SNPs. Mediation analysis with multiple mediators improves power of identification of mediated trans-eQTLs, especially in large samples. Electronic supplementary material The online version of this article (10.1186/s12859-019-2651-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Nayang Shan
- Center for Statistical Science, Tsinghua University, Beijing, 100084, China.,Department of Industrial Engineering, Tsinghua University, Beijing, 100084, China
| | - Zuoheng Wang
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, 06510, USA.
| | - Lin Hou
- Center for Statistical Science, Tsinghua University, Beijing, 100084, China. .,Department of Industrial Engineering, Tsinghua University, Beijing, 100084, China. .,MOE Key Laboratory of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing, 100084, China.
| |
Collapse
|
6
|
Huang YT. Genome-wide analyses of sparse mediation effects under composite null hypotheses. Ann Appl Stat 2019. [DOI: 10.1214/18-aoas1181] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
7
|
Cheng Y, Wang C, Zhu M, Dai J, Wang Y, Geng L, Li Z, Zhang J, Ma H, Jin G, Lin D, Hu Z, Shen H. Targeted sequencing of chromosome 15q25 identified novel variants associated with risk of lung cancer and smoking behavior in Chinese. Carcinogenesis 2017; 38:552-558. [DOI: 10.1093/carcin/bgx025] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2016] [Accepted: 03/25/2017] [Indexed: 01/15/2023] Open
|
8
|
Huang YT, Cai T, Kim E. Integrative genomic testing of cancer survival using semiparametric linear transformation models. Stat Med 2016; 35:2831-44. [PMID: 26887583 PMCID: PMC10392002 DOI: 10.1002/sim.6900] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2015] [Revised: 01/11/2016] [Accepted: 01/19/2016] [Indexed: 01/12/2023]
Abstract
The wide availability of multi-dimensional genomic data has spurred increasing interests in integrating multi-platform genomic data. Integrative analysis of cancer genome landscape can potentially lead to deeper understanding of the biological process of cancer. We integrate epigenetics (DNA methylation and microRNA expression) and gene expression data in tumor genome to delineate the association between different aspects of the biological processes and brain tumor survival. To model the association, we employ a flexible semiparametric linear transformation model that incorporates both the main effects of these genomic measures as well as the possible interactions among them. We develop variance component tests to examine different coordinated effects by testing various subsets of model coefficients for the genomic markers. A Monte Carlo perturbation procedure is constructed to approximate the null distribution of the proposed test statistics. We further propose omnibus testing procedures to synthesize information from fitting various parsimonious sub-models to improve power. Simulation results suggest that our proposed testing procedures maintain proper size under the null and outperform standard score tests. We further illustrate the utility of our procedure in two genomic analyses for survival of glioblastoma multiforme patients. Copyright © 2016 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Yen-Tsung Huang
- Departments of Epidemiology and Biostatistics, Brown University, 121 South Main St.Box G-S121-2 Providence, 02912, RI, U.S.A
| | - Tianxi Cai
- Department of Biostatistics, School of Public Health, Harvard University, 655 Huntington Ave., Boston, 02115, MA, U.S.A
| | - Eunhee Kim
- Office of Biostatistics National Institute of Neurological Disorders and Stroke, National Institutes of Health, 10 Center Drive, Building 10/Rm 5N230, Bethesda, 20892, MD
| |
Collapse
|
9
|
Sex dimorphism in a mediatory role of the posterior midcingulate cortex in the association between anxiety and pain sensitivity. Exp Brain Res 2016; 234:3119-3131. [DOI: 10.1007/s00221-016-4710-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2016] [Accepted: 06/19/2016] [Indexed: 10/21/2022]
|
10
|
Sun YV, Hu YJ. Integrative Analysis of Multi-omics Data for Discovery and Functional Studies of Complex Human Diseases. ADVANCES IN GENETICS 2016; 93:147-90. [PMID: 26915271 DOI: 10.1016/bs.adgen.2015.11.004] [Citation(s) in RCA: 256] [Impact Index Per Article: 28.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Complex and dynamic networks of molecules are involved in human diseases. High-throughput technologies enable omics studies interrogating thousands to millions of makers with similar biochemical properties (eg, transcriptomics for RNA transcripts). However, a single layer of "omics" can only provide limited insights into the biological mechanisms of a disease. In the case of genome-wide association studies, although thousands of single nucleotide polymorphisms have been identified for complex diseases and traits, the functional implications and mechanisms of the associated loci are largely unknown. Additionally, the genomic variants alone are not able to explain the changing disease risk across the life span. DNA, RNA, protein, and metabolite often have complementary roles to jointly perform a certain biological function. Such complementary effects and synergistic interactions between omic layers in the life course can only be captured by integrative study of multiple molecular layers. Building upon the success in single-omics discovery research, population studies started adopting the multi-omics approach to better understanding the molecular function and disease etiology. Multi-omics approaches integrate data obtained from different omic levels to understand their interrelation and combined influence on the disease processes. Here, we summarize major omics approaches available in population research, and review integrative approaches and methodologies interrogating multiple omic layers, which enhance the gene discovery and functional analysis of human diseases. We seek to provide analytical recommendations for different types of multi-omics data and study designs to guide the emerging multi-omic research, and to suggest improvement of the existing analytical methods.
Collapse
Affiliation(s)
- Yan V Sun
- Department of Epidemiology, Rollins School of Public Health, Atlanta, GA, United States; Department of Biomedical Informatics, School of Medicine, Atlanta, GA, United States
| | - Yi-Juan Hu
- Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA, United States
| |
Collapse
|
11
|
Huang YT, Cai T. Mediation analysis for survival data using semiparametric probit models. Biometrics 2015; 72:563-74. [PMID: 26618735 DOI: 10.1111/biom.12445] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2015] [Revised: 07/01/2015] [Accepted: 09/01/2015] [Indexed: 01/09/2023]
Abstract
Causal mediation modeling has become a popular approach for studying the effect of an exposure on an outcome through mediators. Currently, the literature on mediation analyses with survival outcomes largely focused on settings with a single mediator and quantified the mediation effects on the hazard, log hazard and log survival time (Lange and Hansen 2011; VanderWeele 2011). In this article, we propose a multi-mediator model for survival data by employing a flexible semiparametric probit model. We characterize path-specific effects (PSEs) of the exposure on the outcome mediated through specific mediators. We derive closed form expressions for PSEs on a transformed survival time and the survival probabilities. Statistical inference on the PSEs is developed using a nonparametric maximum likelihood estimator under the semiparametric probit model and the functional Delta method. Results from simulation studies suggest that our proposed methods perform well in finite sample. We illustrate the utility of our method in a genomic study of glioblastoma multiforme survival.
Collapse
Affiliation(s)
- Yen-Tsung Huang
- Departments of Epidemiology and Biostatistics, Brown University, 121 South Main Street, Providence, Rhode Island 02912, U.S.A
| | - Tianxi Cai
- Department of Biostatistics, Harvard School of Public Health, 655 Huntington Avenue, Boston, Massachusetts 02115, U.S.A
| |
Collapse
|
12
|
Huang YT, Pan WC. Hypothesis test of mediation effect in causal mediation model with high-dimensional continuous mediators. Biometrics 2015; 72:402-13. [DOI: 10.1111/biom.12421] [Citation(s) in RCA: 76] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2014] [Revised: 07/01/2015] [Accepted: 08/01/2015] [Indexed: 11/30/2022]
Affiliation(s)
- Yen-Tsung Huang
- Departments of Epidemiology and Biostatistics; Brown University; 121 South Main Street Providence Rhode Island 02912 U.S.A
| | - Wen-Chi Pan
- Institute of Environmental and Occupational Health Sciences; National Yang-Ming University; No. 155, Section 2 Linong Street Beitou District Taipei City 112 Taiwan
| |
Collapse
|
13
|
Richmond RC, Timpson NJ, Sørensen TIA. Exploring possible epigenetic mediation of early-life environmental exposures on adiposity and obesity development. Int J Epidemiol 2015; 44:1191-8. [PMID: 25953782 DOI: 10.1093/ije/dyv066] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/23/2015] [Indexed: 12/17/2022] Open
Affiliation(s)
| | | | - Thorkild I A Sørensen
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK, Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Medical and Health Sciences, University of Copenhagen, Copenhagen, Denmark and Institute of Preventive Medicine, Bispebjerg and Frederiksberg Hospitals, Copenhagen, Denmark
| |
Collapse
|
14
|
Huen K, Yousefi P, Street K, Eskenazi B, Holland N. PON1 as a model for integration of genetic, epigenetic, and expression data on candidate susceptibility genes. ENVIRONMENTAL EPIGENETICS 2015; 1:dvv003. [PMID: 26913202 PMCID: PMC4762373 DOI: 10.1093/eep/dvv003] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/15/2015] [Revised: 06/30/2015] [Accepted: 07/14/2015] [Indexed: 05/27/2023]
Abstract
Recent genome- and epigenome-wide studies demonstrate that the DNA methylation is controlled in part by genetics, highlighting the importance of integrating genetic and epigenetic data. To better understand molecular mechanisms affecting gene expression, we used the candidate susceptibility gene paraoxonase 1 (PON1) as a model to assess associations of PON1 genetic polymorphisms with DNA methylation and arylesterase activity, a marker of PON1 expression. PON1 has been associated with susceptibility to obesity, cardiovascular disease, and pesticide exposure. In this study, we assessed DNA methylation in 18 CpG sites located along PON1 shores, shelves, and its CpG island in blood specimens collected from newborns and 9-year-old children participating (n = 449) in the CHAMACOS birth cohort study. The promoter polymorphism, PON1-108 , was strongly associated with methylation, particularly for CpG sites located near the CpG island (P << 0.0005). Among newborns, these relationships were even more pronounced after adjusting for blood cell composition. We also observed significant decreases in arylesterase activity with increased methylation at the same nine CpG sites at both ages. Using causal mediation analysis, we found statistically significant indirect effects of methylation (β(95% confidence interval): 6.9(1.5, 12.4)) providing evidence that DNA methylation mediates the relationship between PON1-108 genotype and PON1 expression. Our findings show that integration of genetic, epigenetic, and expression data can shed light on the functional mechanisms involving genetic and epigenetic regulation of candidate susceptibility genes like PON1.
Collapse
Affiliation(s)
- Karen Huen
- School of Public Health, University of California, Berkeley, 50 University Hall #7360, Berkeley, CA 94720-7360, USA
| | - Paul Yousefi
- School of Public Health, University of California, Berkeley, 50 University Hall #7360, Berkeley, CA 94720-7360, USA
| | - Kelly Street
- School of Public Health, University of California, Berkeley, 50 University Hall #7360, Berkeley, CA 94720-7360, USA
| | - Brenda Eskenazi
- School of Public Health, University of California, Berkeley, 50 University Hall #7360, Berkeley, CA 94720-7360, USA
| | - Nina Holland
- School of Public Health, University of California, Berkeley, 50 University Hall #7360, Berkeley, CA 94720-7360, USA
| |
Collapse
|