1
|
Ruffieux H, Davison AC, Hager J, Irincheeva I. Efficient inference for genetic association studies with multiple outcomes. Biostatistics 2017; 18:618-636. [DOI: 10.1093/biostatistics/kxx007] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2016] [Accepted: 02/06/2017] [Indexed: 02/04/2023] Open
Abstract
SUMMARY
Combined inference for heterogeneous high-dimensional data is critical in modern biology, where clinical and various kinds of molecular data may be available from a single study. Classical genetic association studies regress a single clinical outcome on many genetic variants one by one, but there is an increasing demand for joint analysis of many molecular outcomes and genetic variants in order to unravel functional interactions. Unfortunately, most existing approaches to joint modeling are either too simplistic to be powerful or are impracticable for computational reasons. Inspired by Richardson and others (2010, Bayesian Statistics 9), we consider a sparse multivariate regression model that allows simultaneous selection of predictors and associated responses. As Markov chain Monte Carlo (MCMC) inference on such models can be prohibitively slow when the number of genetic variants exceeds a few thousand, we propose a variational inference approach which produces posterior information very close to that of MCMC inference, at a much reduced computational cost. Extensive numerical experiments show that our approach outperforms popular variable selection methods and tailored Bayesian procedures, dealing within hours with problems involving hundreds of thousands of genetic variants and tens to hundreds of clinical or molecular outcomes.
Collapse
Affiliation(s)
- Helene Ruffieux
- Nestlé Institute of Health Sciences SA, EPFL Innovation Park, 1015 Lausanne, Switzerland Ecole Polytechnique Fédérale de Lausanne, EPFL SB MATH STAT, Station 8, 1015 Lausanne, Switzerland
| | - Anthony C. Davison
- Ecole Polytechnique Fédérale de Lausanne, EPFL SB MATH STAT, Station 8, 1015 Lausanne, Switzerland
| | - Jorg Hager
- Nestlé Institute of Health Sciences SA, EPFL Innovation Park, 1015 Lausanne, Switzerland
| | - Irina Irincheeva
- Nestlé Institute of Health Sciences SA, EPFL Innovation Park, 1015 Lausanne, Switzerland
| |
Collapse
|