1
|
Chepelev I, Harley IT, Harley JB. Modeling of horizontal pleiotropy identifies possible causal gene expression in systemic lupus erythematosus. FRONTIERS IN LUPUS 2023; 1:1234578. [PMID: 37799268 PMCID: PMC10554754 DOI: 10.3389/flupu.2023.1234578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/07/2023]
Abstract
Background Systemic lupus erythematosus (SLE) is a chronic autoimmune condition with complex causes involving genetic and environmental factors. While genome-wide association studies (GWASs) have identified genetic loci associated with SLE, the functional genomic elements responsible for disease development remain largely unknown. Mendelian Randomization (MR) is an instrumental variable approach to causal inference based on data from observational studies, where genetic variants are employed as instrumental variables (IVs). Methods This study utilized a two-step strategy to identify causal genes for SLE. In the first step, the classical MR method was employed, assuming the absence of horizontal pleiotropy, to estimate the causal effect of gene expression on SLE. In the second step, advanced probabilistic MR methods (PMR-Egger, MRAID, and MR-MtRobin) were applied to the genes identified in the first step, considering horizontal pleiotropy, to filter out false positives. PMR-Egger and MRAID analyses utilized whole blood expression quantitative trait loci (eQTL) and SLE GWAS summary data, while MR-MtRobin analysis used an independent eQTL dataset from multiple immune cell types along with the same SLE GWAS data. Results The initial MR analysis identified 142 genes, including 43 outside of chromosome 6. Subsequently, applying the advanced MR methods reduced the number of genes with significant causal effects on SLE to 66. PMR-Egger, MRAID, and MR-MtRobin, respectively, identified 13, 7, and 16 non-chromosome 6 genes with significant causal effects. All methods identified expression of PHRF1 gene as causal for SLE. A comprehensive literature review was conducted to enhance understanding of the functional roles and mechanisms of the identified genes in SLE development. Conclusions The findings from the three MR methods exhibited overlapping genes with causal effects on SLE, demonstrating consistent results. However, each method also uncovered unique genes due to different modelling assumptions and technical factors, highlighting the complementary nature of the approaches. Importantly, MRAID demonstrated a reduced percentage of causal genes from the Major Histocompatibility complex (MHC) region on chromosome 6, indicating its potential in minimizing false positive findings. This study contributes to unraveling the mechanisms underlying SLE by employing advanced probabilistic MR methods to identify causal genes, thereby enhancing our understanding of SLE pathogenesis.
Collapse
Affiliation(s)
- Iouri Chepelev
- Research Service, US Department of Veterans Affairs Medical Center, Cincinnati, OH, United States
- Cincinnati Education and Research for Veterans Foundation, Cincinnati, OH, United States
| | - Isaac T.W. Harley
- US Department of Veterans Affairs Medical Center, Aurora, CO, United States
- Department of Immunology and Microbiology, University of Colorado School of Medicine, Aurora, CO, United States
- Division of Rheumatology, Department of Medicine, University of Colorado School of Medicine, Aurora, CO, United States
| | - John B. Harley
- Research Service, US Department of Veterans Affairs Medical Center, Cincinnati, OH, United States
- Cincinnati Education and Research for Veterans Foundation, Cincinnati, OH, United States
| |
Collapse
|
2
|
Shi D, Wang Y, Zhang Z, Cao Y, Hu YQ. MR-BOIL: Causal inference in one-sample Mendelian randomization for binary outcome with integrated likelihood method. Genet Epidemiol 2023; 47:332-357. [PMID: 36808763 DOI: 10.1002/gepi.22520] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Revised: 12/14/2022] [Accepted: 02/01/2023] [Indexed: 02/21/2023]
Abstract
Mendelian randomization is a statistical method for inferring the causal relationship between exposures and outcomes using an economics-derived instrumental variable approach. The research results are relatively complete when both exposures and outcomes are continuous variables. However, due to the noncollapsing nature of the logistic model, the existing methods inherited from the linear model for exploring binary outcome cannot take the effect of confounding factors into account, which leads to biased estimate of the causal effect. In this article, we propose an integrated likelihood method MR-BOIL to investigate causal relationships for binary outcomes by treating confounders as latent variables in one-sample Mendelian randomization. Under the assumption of a joint normal distribution of the confounders, we use expectation maximization algorithm to estimate the causal effect. Extensive simulations demonstrate that the estimator of MR-BOIL is asymptotically unbiased and that our method improves statistical power without inflating type I error rate. We then apply this method to analyze the data from Atherosclerosis Risk in Communications Study. The results show that MR-BOIL can better identify plausible causal relationships with high reliability, compared with the unreliable results of existing methods. MR-BOIL is implemented in R and the corresponding R code is provided for free download.
Collapse
Affiliation(s)
- Dapeng Shi
- Shanghai Center for Mathematical Sciences, Fudan University, Shanghai, China
| | - Yuquan Wang
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Institute of Biostatistics, School of Life Sciences, Fudan University, Shanghai, China
| | - Ziyong Zhang
- Department of Statistics and Data Science, School of Management, Fudan University, Shanghai, China
| | - Yunlong Cao
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Institute of Biostatistics, School of Life Sciences, Fudan University, Shanghai, China
| | - Yue-Qing Hu
- Shanghai Center for Mathematical Sciences, Fudan University, Shanghai, China.,State Key Laboratory of Genetic Engineering, Human Phenome Institute, Institute of Biostatistics, School of Life Sciences, Fudan University, Shanghai, China
| |
Collapse
|
3
|
Boehm FJ, Zhou X. Statistical methods for Mendelian randomization in genome-wide association studies: A review. Comput Struct Biotechnol J 2022; 20:2338-2351. [PMID: 35615025 PMCID: PMC9123217 DOI: 10.1016/j.csbj.2022.05.015] [Citation(s) in RCA: 49] [Impact Index Per Article: 24.5] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 05/08/2022] [Accepted: 05/09/2022] [Indexed: 11/15/2022] Open
Affiliation(s)
- Frederick J. Boehm
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Xiang Zhou
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
- Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA
- Corresponding author at: Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA.
| |
Collapse
|
4
|
Yuan Z, Liu L, Guo P, Yan R, Xue F, Zhou X. Likelihood-based Mendelian randomization analysis with automated instrument selection and horizontal pleiotropic modeling. SCIENCE ADVANCES 2022; 8:eabl5744. [PMID: 35235357 PMCID: PMC8890724 DOI: 10.1126/sciadv.abl5744] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/24/2021] [Accepted: 01/05/2022] [Indexed: 05/03/2023]
Abstract
Mendelian randomization (MR) is a common tool for identifying causal risk factors underlying diseases. Here, we present a method, MR with automated instrument determination (MRAID), for effective MR analysis. MRAID borrows ideas from fine-mapping analysis to model an initial set of candidate single-nucleotide polymorphisms that are in potentially high linkage disequilibrium with each other and automatically selects among them the suitable instruments for causal inference. MRAID also explicitly models both uncorrelated and correlated horizontal pleiotropic effects that are widespread for complex trait analysis. MRAID achieves both tasks through a joint likelihood framework and relies on a scalable sampling-based algorithm to compute calibrated P values. Comprehensive and realistic simulations show that MRAID can provide calibrated type I error control and reduce false positives while being more powerful than existing approaches. We illustrate the benefits of MRAID for an MR screening analysis across 645 trait pairs in U.K. Biobank, identifying multiple lifestyle causal risk factors of cardiovascular disease-related traits.
Collapse
Affiliation(s)
- Zhongshang Yuan
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, Shandong 250012, China
- Institute for Medical Dataology, Cheeloo College of Medicine, Shandong University, Jinan, Shandong 250012, China
| | - Lu Liu
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, Shandong 250012, China
- Institute for Medical Dataology, Cheeloo College of Medicine, Shandong University, Jinan, Shandong 250012, China
| | - Ping Guo
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, Shandong 250012, China
- Institute for Medical Dataology, Cheeloo College of Medicine, Shandong University, Jinan, Shandong 250012, China
| | - Ran Yan
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, Shandong 250012, China
- Institute for Medical Dataology, Cheeloo College of Medicine, Shandong University, Jinan, Shandong 250012, China
| | - Fuzhong Xue
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, Shandong 250012, China
- Institute for Medical Dataology, Cheeloo College of Medicine, Shandong University, Jinan, Shandong 250012, China
| | - Xiang Zhou
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
- Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
5
|
Allman PH, Aban I, Long DM, Patki A, MacKenzie T, Irvin MR, Lange LA, Lange E, Cutter G, Tiwari HK. Mendelian randomization in the multivariate general linear model framework. Genet Epidemiol 2021; 46:17-31. [PMID: 34672390 DOI: 10.1002/gepi.22435] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Revised: 10/10/2021] [Accepted: 10/11/2021] [Indexed: 11/12/2022]
Abstract
Mendelian randomization (MR) is an application of instrumental variable (IV) methods to observational data in which the IV is a genetic variant. MR methods applicable to the general exponential family of distributions are currently not well characterized. We adapt a general linear model framework to the IV setting and propose a general MR method applicable to any full-rank distribution from the exponential family. Empirical bias and coverage are estimated via simulations. The proposed method is compared to several existing MR methods. Real data analyses are performed using data from the REGARDS study to estimate the potential causal effect of smoking frequency on stroke risk in African Americans. In simulations with binary variates and very weak instruments the proposed method had the lowest median [Q1 , Q3 ] bias (0.10 [-3.68 to 3.62]); compared with 2SPS (0.27 [-3.74 to 4.26]) and the Wald method (-0.69 [-1.72 to 0.35]). Low bias was observed throughout other simulation scenarios; as well as more than 90% coverage for the proposed method. In simulations with count variates, the proposed method performed comparably to 2SPS; the Wald method maintained the most consistent low bias; and 2SRI was biased towards the null. Real data analyses find no evidence for a causal effect of smoking frequency on stroke risk. The proposed MR method has low bias and acceptable coverage across a wide range of distributional scenarios and instrument strengths; and provides a more parsimonious framework for asymptotic hypothesis testing compared to existing two-stage procedures.
Collapse
Affiliation(s)
- Phillip H Allman
- Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Inmaculada Aban
- Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Dustin M Long
- Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Amit Patki
- Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Todd MacKenzie
- Department of Biomedical Data Science, Dartmouth College, Hanover, New Hampshire, USA
| | - Marguerite R Irvin
- Department of Epidemiology, University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Leslie A Lange
- Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA
| | - Ethan Lange
- Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA
| | - Gary Cutter
- Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Hemant K Tiwari
- Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama, USA
| |
Collapse
|