1
|
Kuksa PP, Ionita M, Carter L, Cifello J, Clark K, Valladares O, Leung YY, Wang LS. BTS: scalable Bayesian Tissue Score for prioritizing GWAS variants and their functional contexts across omics data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.10.30.621077. [PMID: 39975395 PMCID: PMC11838512 DOI: 10.1101/2024.10.30.621077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]
Abstract
Motivation statistics from genome-wide association studies (GWAS) are widely used in fine-mapping and colocalization analyses to identify causal variants and their enrichment in functional contexts, such as affected cell types and genomic features. With the expansion of functional genomic (FG) datasets, which now include hundreds of thousands of tracks across various cell and tissue types, it is critical to establish scalable algorithms integrating thousands of diverse FG annotations with GWAS results. Results We propose BTS (Bayesian Tissue Score), a novel, highly efficient algorithm uniquely designed for 1) identifying affected cell types and functional elements (context-mapping) and 2) fine-mapping potentially causal variants in a context-specific manner using large collections of cell type-specific FG annotation tracks. BTS leverages GWAS summary statistics and annotation-specific Bayesian models to analyze genome-wide annotation tracks, including enhancers, open chromatin, and histone marks. We evaluated BTS on GWAS summary statistics for immune and cardiovascular traits, such as Inflammatory Bowel Disease (IBD), Rheumatoid Arthritis (RA), Systemic Lupus Erythematosus (SLE), and Coronary Artery Disease (CAD). Our results demonstrate that BTS is over 100x more efficient in estimating functional annotation effects and context-specific variant fine-mapping compared to existing methods. Importantly, this large-scale Bayesian approach prioritizes both known and novel annotations, cell types, genomic regions, and variants and provides valuable biological insights into the functional contexts of these diseases. Availability and implementation Docker image is available at https://hub.docker.com/r/wanglab/bts with pre-installed BTS R package (https://bitbucket.org/wanglab-upenn/BTS-R) and BTS GWAS summary statistics analysis pipeline (https://bitbucket.org/wanglab-upenn/bts-pipeline).
Collapse
Affiliation(s)
- Pavel P. Kuksa
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Matei Ionita
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Luke Carter
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Jeffrey Cifello
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Kaylyn Clark
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Otto Valladares
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Yuk Yee Leung
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Li-San Wang
- Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
2
|
Reales G, Amos CI, Benveniste O, Chinoy H, De Bleecker J, De Paepe B, Doria A, Gregersen PK, Lamb JA, Limaye V, Lundberg IE, Machado PM, Maurer B, Miller FW, Molberg Ø, Pachman LM, Padyukov L, Radstake TR, Reed AM, Rider LG, Rothwell S, Selva-O'Callaghan A, Vencovský J, Wedderburn LR, Wallace C. Discovery of new myositis genetic associations through leveraging other immune-mediated diseases. HGG ADVANCES 2024; 5:100336. [PMID: 39044428 PMCID: PMC11350499 DOI: 10.1016/j.xhgg.2024.100336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Revised: 07/16/2024] [Accepted: 07/16/2024] [Indexed: 07/25/2024] Open
Abstract
Genome-wide association studies (GWASs) have been successful at finding associations between genetic variants and human traits, including the immune-mediated diseases (IMDs). However, the requirement of large sample sizes for discovery poses a challenge for learning about less common diseases, where increasing volunteer numbers might not be feasible. An example of this is myositis (or idiopathic inflammatory myopathies [IIM]s), a group of rare, heterogeneous autoimmune diseases affecting skeletal muscle and other organs, severely impairing life quality. Here, we applied a feature engineering method to borrow information from larger IMD GWASs to find new genetic associations with IIM and its subgroups. Combining this approach with two clustering methods, we found 17 IMDs genetically close to IIM, including some common comorbid conditions, such as systemic sclerosis and Sjögren's syndrome, as well as hypo- and hyperthyroidism. All IIM subtypes were genetically similar within this framework. Next, we colocalized IIM signals that overlapped IMD signals, and found seven potentially novel myositis associations mapped to immune-related genes, including BLK, IRF5/TNPO3, and ITK/HAVCR2, implicating a role for both B and T cells in IIM. This work proposes a new paradigm of genetic discovery in rarer diseases by leveraging information from more common IMD, and can be expanded to other conditions and traits beyond IMD.
Collapse
Affiliation(s)
- Guillermo Reales
- Cambridge Institute of Therapeutic Immunology and Infectious Disease (CITIID), University of Cambridge, Cambridge, UK; Department of Medicine, University of Cambridge, Cambridge, UK.
| | | | - Olivier Benveniste
- Department of Internal Medicine and Clinical Immunology, Pitié-Salpêtrière Hospital, Paris, France
| | - Hector Chinoy
- Department of Rheumatology, Salford Royal Hospital, Northern Care Alliance NHS Foundation Trust, Manchester Academic Health Science Centre, Salford, UK; Division of Musculoskeletal and Dermatological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, UK
| | - Jan De Bleecker
- Department of Neurology, Ghent University, Ghent, Belgium; Neuromuscular Reference Center, Ghent University Hospital, Ghent, Belgium
| | - Boel De Paepe
- Department of Neurology, Ghent University, Ghent, Belgium; Neuromuscular Reference Center, Ghent University Hospital, Ghent, Belgium
| | - Andrea Doria
- Rheumatology Unit, Department of Medicine, University of Padova, Padova, Italy
| | - Peter K Gregersen
- The Robert S. Boas Center for Genomics and Human Genetics, The Feinstein Institute, Manhasset, NY, USA
| | - Janine A Lamb
- Epidemiology and Public Health Group, Division of Population Health, Health Services Research & Primary Care, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Vidya Limaye
- Rheumatology Unit, Royal Adelaide Hospital, Adelaide, South Australia, Australia; Discipline of Medicine, Adelaide University, Adelaide, South Australia, Australia
| | - Ingrid E Lundberg
- Division of Rheumatology, Department of Medicine, Solna, Karolinska Institutet, Karolinska University Hospital, Stockholm, Sweden
| | - Pedro M Machado
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology. London, UK; Centre for Rheumatology, UCL Division of Medicine, University College London, London, UK
| | - Britta Maurer
- Department of Rheumatology and Immunology, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
| | - Frederick W Miller
- Environmental Autoimmunity Group, National Institute of Environmental Health Sciences, NIH, Bethesda, MD, USA
| | - Øyvind Molberg
- Department of Rheumatology, Oslo University Hospital, Oslo, Norway
| | - Lauren M Pachman
- Children's Hospital of Chicago, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Leonid Padyukov
- Division of Rheumatology, Department of Medicine, Solna, Karolinska Institutet, Karolinska University Hospital, Stockholm, Sweden
| | - Timothy R Radstake
- Department of Rheumatology and Clinical Immunology, University Medical Center, Utrecht, the Netherlands
| | - Ann M Reed
- Department of Pediatrics, Duke University, Durham, NC, USA
| | - Lisa G Rider
- Environmental Autoimmunity Group, National Institute of Environmental Health Sciences, NIH, Bethesda, MD, USA
| | - Simon Rothwell
- Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Albert Selva-O'Callaghan
- Internal Medicine Department, Vall d'Hebron General Hospital, Universitat Autonoma de Barcelona, Barcelona, Spain
| | - Jiri Vencovský
- Institute of Rheumatology and Department of Rheumatology, First Medical Faculty, Charles University, Prague, Czech Republic
| | - Lucy R Wedderburn
- Centre for Adolescent Rheumatology Versus Arthritis, UCL Great Ormond Street Institute of Child Health, University College London, London, UK; NIHR Biomedical Research Centre at Great Ormond Street Hospital, London, UK
| | - Chris Wallace
- Cambridge Institute of Therapeutic Immunology and Infectious Disease (CITIID), University of Cambridge, Cambridge, UK; Department of Medicine, University of Cambridge, Cambridge, UK; MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| |
Collapse
|
3
|
Poch T, Bahn J, Casar C, Krause J, Evangelakos I, Gilladi H, Kunzmann LK, Laschtowitz A, Iuso N, Schäfer AM, Liebig LA, Steinmann S, Sebode M, Folseraas T, Engesæter LK, Karlsen TH, Franke A, Hubner N, Schlein C, Galun E, Huber S, Lohse AW, Gagliani N, Schwinge D, Schramm C. Intergenic risk variant rs56258221 skews the fate of naive CD4 + T cells via miR4464-BACH2 interplay in primary sclerosing cholangitis. Cell Rep Med 2024; 5:101620. [PMID: 38901430 PMCID: PMC11293351 DOI: 10.1016/j.xcrm.2024.101620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Revised: 04/16/2024] [Accepted: 05/27/2024] [Indexed: 06/22/2024]
Abstract
Primary sclerosing cholangitis (PSC) is an immune-mediated liver disease of unknown pathogenesis, with a high risk to develop cirrhosis and malignancies. Functional dysregulation of T cells and association with genetic polymorphisms in T cell-related genes were previously reported for PSC. Here, we genotyped a representative PSC cohort for several disease-associated risk loci and identified rs56258221 (BACH2/MIR4464) to correlate with not only the peripheral blood T cell immunophenotype but also the functional capacities of naive CD4+ T (CD4+ TN) cells in people with PSC. Mechanistically, rs56258221 leads to an increased expression of miR4464, in turn causing attenuated translation of BACH2, a major gatekeeper of T cell quiescence. Thereby, the fate of CD4+ TN is skewed toward polarization into pro-inflammatory subsets. Clinically, people with PSC carrying rs56258221 show signs of accelerated disease progression. The data presented here highlight the importance of assigning functional outcomes to disease-associated genetic polymorphisms as potential drivers of diseases.
Collapse
Affiliation(s)
- Tobias Poch
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany
| | - Jonas Bahn
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany
| | - Christian Casar
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany; Bioinformatics Core, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany
| | - Jenny Krause
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany; European Reference Network for Hepatological Diseases (ERN RARE-LIVER), 20246 Hamburg, Germany
| | - Ioannis Evangelakos
- Institute of Human Genetics, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany
| | - Hilla Gilladi
- The Goldyne-Savad Institute of Gene Therapy, Hadassah Hebrew University Hospital, Jerusalem 91120, Israel
| | - Lilly K Kunzmann
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany
| | - Alena Laschtowitz
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany; Department of Hepatology and Gastroenterology, Charité Universitätsmedizin Berlin, 13353 Berlin, Germany
| | - Nicola Iuso
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany
| | - Anne-Marie Schäfer
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany
| | - Laura A Liebig
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany; Cardiovascular and Metabolic Sciences, Max Delbrück Centre for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Silja Steinmann
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany; European Reference Network for Hepatological Diseases (ERN RARE-LIVER), 20246 Hamburg, Germany
| | - Marcial Sebode
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany; European Reference Network for Hepatological Diseases (ERN RARE-LIVER), 20246 Hamburg, Germany
| | - Trine Folseraas
- European Reference Network for Hepatological Diseases (ERN RARE-LIVER), 20246 Hamburg, Germany; Norwegian PSC Research Centre, Department of Transplantation Medicine, Oslo University Hospital Rikshospitalet, 0372 Oslo, Norway; Research Institute of Internal Medicine, Oslo University Hospital Rikshospitalet, 0372 Oslo, Norway
| | - Lise K Engesæter
- European Reference Network for Hepatological Diseases (ERN RARE-LIVER), 20246 Hamburg, Germany; Norwegian PSC Research Centre, Department of Transplantation Medicine, Oslo University Hospital Rikshospitalet, 0372 Oslo, Norway; Research Institute of Internal Medicine, Oslo University Hospital Rikshospitalet, 0372 Oslo, Norway
| | - Tom H Karlsen
- European Reference Network for Hepatological Diseases (ERN RARE-LIVER), 20246 Hamburg, Germany; Norwegian PSC Research Centre, Department of Transplantation Medicine, Oslo University Hospital Rikshospitalet, 0372 Oslo, Norway; Research Institute of Internal Medicine, Oslo University Hospital Rikshospitalet, 0372 Oslo, Norway
| | - Andre Franke
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, 24105 Kiel, Germany
| | - Norbert Hubner
- Cardiovascular and Metabolic Sciences, Max Delbrück Centre for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 10785 Berlin, Germany; Charité-Universitätsmedizin Berlin, 10117 Berlin, Germany
| | - Christian Schlein
- Institute of Human Genetics, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany
| | - Eithan Galun
- The Goldyne-Savad Institute of Gene Therapy, Hadassah Hebrew University Hospital, Jerusalem 91120, Israel
| | - Samuel Huber
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany; European Reference Network for Hepatological Diseases (ERN RARE-LIVER), 20246 Hamburg, Germany; Hamburg Centre for Translational Immunology, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany
| | - Ansgar W Lohse
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany; European Reference Network for Hepatological Diseases (ERN RARE-LIVER), 20246 Hamburg, Germany; Hamburg Centre for Translational Immunology, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany
| | - Nicola Gagliani
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany; Hamburg Centre for Translational Immunology, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany; Department for General, Visceral and Thoracic Surgery, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany; Immunology and Allergy Unit, Department of Medicine Solna, Karolinska Institute, 17177 Solna, Sweden
| | - Dorothee Schwinge
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany
| | - Christoph Schramm
- I. Department of Medicine, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany; European Reference Network for Hepatological Diseases (ERN RARE-LIVER), 20246 Hamburg, Germany; Hamburg Centre for Translational Immunology, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany; Martin Zeitz Centre for Rare Diseases, University Medical Centre Hamburg-Eppendorf, 20246 Hamburg, Germany.
| |
Collapse
|
4
|
Zou Y, Carbonetto P, Xie D, Wang G, Stephens M. Fast and flexible joint fine-mapping of multiple traits via the Sum of Single Effects model. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.04.14.536893. [PMID: 37425935 PMCID: PMC10327118 DOI: 10.1101/2023.04.14.536893] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
We introduce mvSuSiE, a multi-trait fine-mapping method for identifying putative causal variants from genetic association data (individual-level or summary data). mvSuSiE learns patterns of shared genetic effects from data, and exploits these patterns to improve power to identify causal SNPs. Comparisons on simulated data show that mvSuSiE is competitive in speed, power and precision with existing multi-trait methods, and uniformly improves on single-trait fine-mapping (SuSiE) in each trait separately. We applied mvSuSiE to jointly fine-map 16 blood cell traits using data from the UK Biobank. By jointly analyzing the traits and modeling heterogeneous effect sharing patterns, we discovered a much larger number of causal SNPs (>3,000) compared with single-trait fine-mapping, and with narrower credible sets. mvSuSiE also more comprehensively characterized the ways in which the genetic variants affect one or more blood cell traits; 68% of causal SNPs showed significant effects in more than one blood cell type.
Collapse
Affiliation(s)
- Yuxin Zou
- Department of Statistics, University of Chicago, Chicago, IL, USA
- Regeneron Genetics Center, Regeneron Pharmaceuticals, Inc., Tarrytown, NY, USA
| | - Peter Carbonetto
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | - Dongyue Xie
- Department of Statistics, University of Chicago, Chicago, IL, USA
| | - Gao Wang
- Gertrude. H. Sergievsky Center, Department of Neurology, Columbia University, New York, NY, USA
| | - Matthew Stephens
- Department of Statistics, University of Chicago, Chicago, IL, USA
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| |
Collapse
|
5
|
Gkatzionis A, Burgess S, Newcombe PJ. Statistical methods for cis-Mendelian randomization with two-sample summary-level data. Genet Epidemiol 2023; 47:3-25. [PMID: 36273411 PMCID: PMC7614127 DOI: 10.1002/gepi.22506] [Citation(s) in RCA: 46] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 08/28/2022] [Accepted: 08/29/2022] [Indexed: 02/03/2023]
Abstract
Mendelian randomization (MR) is the use of genetic variants to assess the existence of a causal relationship between a risk factor and an outcome of interest. Here, we focus on two-sample summary-data MR analyses with many correlated variants from a single gene region, particularly on cis-MR studies which use protein expression as a risk factor. Such studies must rely on a small, curated set of variants from the studied region; using all variants in the region requires inverting an ill-conditioned genetic correlation matrix and results in numerically unstable causal effect estimates. We review methods for variable selection and estimation in cis-MR with summary-level data, ranging from stepwise pruning and conditional analysis to principal components analysis, factor analysis, and Bayesian variable selection. In a simulation study, we show that the various methods have comparable performance in analyses with large sample sizes and strong genetic instruments. However, when weak instrument bias is suspected, factor analysis and Bayesian variable selection produce more reliable inferences than simple pruning approaches, which are often used in practice. We conclude by examining two case studies, assessing the effects of low-density lipoprotein-cholesterol and serum testosterone on coronary heart disease risk using variants in the HMGCR and SHBG gene regions, respectively.
Collapse
Affiliation(s)
- Apostolos Gkatzionis
- MRC Biostatistics Unit, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- MRC Integrative Epidemiology Unit, Department of Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Stephen Burgess
- MRC Biostatistics Unit, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Department of Public Health and Primary Care, School of Clinical Medicine, University of Cambridge, Cambridge, UK
| | - Paul J. Newcombe
- MRC Biostatistics Unit, School of Clinical Medicine, University of Cambridge, Cambridge, UK
| |
Collapse
|
6
|
Zuber V, Grinberg NF, Gill D, Manipur I, Slob EAW, Patel A, Wallace C, Burgess S. Combining evidence from Mendelian randomization and colocalization: Review and comparison of approaches. Am J Hum Genet 2022; 109:767-782. [PMID: 35452592 PMCID: PMC7612737 DOI: 10.1016/j.ajhg.2022.04.001] [Citation(s) in RCA: 241] [Impact Index Per Article: 80.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Mendelian randomization and colocalization are two statistical approaches that can be applied to summarized data from genome-wide association studies (GWASs) to understand relationships between traits and diseases. However, despite similarities in scope, they are different in their objectives, implementation, and interpretation, in part because they were developed to serve different scientific communities. Mendelian randomization assesses whether genetic predictors of an exposure are associated with the outcome and interprets an association as evidence that the exposure has a causal effect on the outcome, whereas colocalization assesses whether two traits are affected by the same or distinct causal variants. When considering genetic variants in a single genetic region, both approaches can be performed. While a positive colocalization finding typically implies a non-zero Mendelian randomization estimate, the reverse is not generally true: there are several scenarios which would lead to a non-zero Mendelian randomization estimate but lack evidence for colocalization. These include the existence of distinct but correlated causal variants for the exposure and outcome, which would violate the Mendelian randomization assumptions, and a lack of strong associations with the outcome. As colocalization was developed in the GWAS tradition, typically evidence for colocalization is concluded only when there is strong evidence for associations with both traits. In contrast, a non-zero estimate from Mendelian randomization can be obtained despite only nominally significant genetic associations with the outcome at the locus. In this review, we discuss how the two approaches can provide complementary information on potential therapeutic targets.
Collapse
Affiliation(s)
- Verena Zuber
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK; MRC Centre for Environment and Health, School of Public Health, Imperial College London, London, UK; UK Dementia Research Institute at Imperial College, Imperial College London, London, UK
| | | | - Dipender Gill
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK; Clinical Pharmacology and Therapeutics Section, Institute of Medical and Biomedical Education and Institute for Infection and Immunity, St George's, University of London, London, UK; Clinical Pharmacology Group, Pharmacy and Medicines Directorate, St George's University Hospitals NHS Foundation Trust, London, UK; Genetics Department, Novo Nordisk Research Centre Oxford, Oxford, UK
| | - Ichcha Manipur
- Cambridge Institute of Therapeutic Immunology & Infectious Disease, University of Cambridge, Cambridge, UK; Department of Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
| | - Eric A W Slob
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| | - Ashish Patel
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| | - Chris Wallace
- Cambridge Institute of Therapeutic Immunology & Infectious Disease, University of Cambridge, Cambridge, UK; Department of Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK; MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| | - Stephen Burgess
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK; Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK.
| |
Collapse
|
7
|
Kundu K, Tardaguila M, Mann AL, Watt S, Ponstingl H, Vasquez L, Von Schiller D, Morrell NW, Stegle O, Pastinen T, Sawcer SJ, Anderson CA, Walter K, Soranzo N. Genetic associations at regulatory phenotypes improve fine-mapping of causal variants for 12 immune-mediated diseases. Nat Genet 2022; 54:251-262. [PMID: 35288711 DOI: 10.1038/s41588-022-01025-y] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Accepted: 01/31/2022] [Indexed: 12/11/2022]
Abstract
The resolution of causal genetic variants informs understanding of disease biology. We used regulatory quantitative trait loci (QTLs) from the BLUEPRINT, GTEx and eQTLGen projects to fine-map putative causal variants for 12 immune-mediated diseases. We identify 340 unique loci that colocalize with high posterior probability (≥98%) with regulatory QTLs and apply Bayesian frameworks to fine-map associations at each locus. We show that fine-mapping credible sets derived from regulatory QTLs are smaller compared to disease summary statistics. Further, they are enriched for more functionally interpretable candidate causal variants and for putatively causal insertion/deletion (INDEL) polymorphisms. Finally, we use massively parallel reporter assays to evaluate candidate causal variants at the ITGA4 locus associated with inflammatory bowel disease. Overall, our findings suggest that fine-mapping applied to disease-colocalizing regulatory QTLs can enhance the discovery of putative causal disease variants and enhance insights into the underlying causal genes and molecular mechanisms.
Collapse
Affiliation(s)
- Kousik Kundu
- Human Genetics, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK.,Department of Haematology, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK
| | - Manuel Tardaguila
- Human Genetics, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Alice L Mann
- Human Genetics, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Stephen Watt
- Human Genetics, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Hannes Ponstingl
- Human Genetics, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Louella Vasquez
- Human Genetics, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Dominique Von Schiller
- Human Genetics, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Nicholas W Morrell
- Division of Respiratory Medicine, Department of Medicine, University of Cambridge School of Clinical Medicine, Addenbrooke's and Papworth Hospitals, Cambridge, UK
| | - Oliver Stegle
- Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany.,Division of Computational Genomics and Systems Genetics, German Cancer Research Center, Heidelberg, Germany.,Cellular Genetics, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Tomi Pastinen
- Genomic Medicine Center, Children's Mercy Kansas City and Children's Mercy Research Institute, Kansas City, MO, USA
| | - Stephen J Sawcer
- Department of Clinical Neurosciences, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK
| | - Carl A Anderson
- Human Genetics, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Klaudia Walter
- Human Genetics, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Nicole Soranzo
- Human Genetics, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK. .,Department of Haematology, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK. .,British Heart Foundation Centre of Research Excellence, University of Cambridge, Cambridge, UK. .,National Institute for Health Research Blood and Transplant Research Unit in Donor Health and Genomics, University of Cambridge, Cambridge, UK. .,Genomics Research Centre, Human Technopole, Milan, Italy.
| |
Collapse
|
8
|
Wang QS, Huang H. Methods for statistical fine-mapping and their applications to auto-immune diseases. Semin Immunopathol 2022; 44:101-113. [PMID: 35041074 PMCID: PMC8837575 DOI: 10.1007/s00281-021-00902-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Accepted: 10/22/2021] [Indexed: 01/07/2023]
Abstract
Although genome-wide association studies (GWAS) have identified thousands of loci in the human genome that are associated with different traits, understanding the biological mechanisms underlying the association signals identified in GWAS remains challenging. Statistical fine-mapping is a method aiming to refine GWAS signals by evaluating which variant(s) are truly causal to the phenotype. Here, we review the types of statistical fine-mapping methods that have been widely used to date, with a focus on recently developed functionally informed fine-mapping (FIFM) methods that utilize functional annotations. We then systematically review the applications of statistical fine-mapping in autoimmune disease studies to highlight the value of statistical fine-mapping in biological contexts.
Collapse
Affiliation(s)
- Qingbo S Wang
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Osaka, Japan.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Hailiang Huang
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Department of Medicine, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
9
|
Hernández N, Soenksen J, Newcombe P, Sandhu M, Barroso I, Wallace C, Asimit JL. The flashfm approach for fine-mapping multiple quantitative traits. Nat Commun 2021; 12:6147. [PMID: 34686674 PMCID: PMC8536717 DOI: 10.1038/s41467-021-26364-y] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 10/04/2021] [Indexed: 11/08/2022] Open
Abstract
Joint fine-mapping that leverages information between quantitative traits could improve accuracy and resolution over single-trait fine-mapping. Using summary statistics, flashfm (flexible and shared information fine-mapping) fine-maps signals for multiple traits, allowing for missing trait measurements and use of related individuals. In a Bayesian framework, prior model probabilities are formulated to favour model combinations that share causal variants to capitalise on information between traits. Simulation studies demonstrate that both approaches produce broadly equivalent results when traits have no shared causal variants. When traits share at least one causal variant, flashfm reduces the number of potential causal variants by 30% compared with single-trait fine-mapping. In a Ugandan cohort with 33 cardiometabolic traits, flashfm gave a 20% reduction in the total number of potential causal variants from single-trait fine-mapping. Here we show flashfm is computationally efficient and can easily be deployed across publicly available summary statistics for signals in up to six traits.
Collapse
Affiliation(s)
- N Hernández
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| | - J Soenksen
- Exeter Centre of Excellence for Diabetes Research (EXCEED), University of Exeter Medical School, Exeter, UK
- School of Life Sciences, University of Glasgow, Glasgow, UK
| | - P Newcombe
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| | - M Sandhu
- Dept of Epidemiology & Biostatistics, School of Public Health, Imperial College London, London, UK
| | - I Barroso
- Exeter Centre of Excellence for Diabetes Research (EXCEED), University of Exeter Medical School, Exeter, UK
| | - C Wallace
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
- Cambridge Institute of Therapeutic Immunology & Infectious Disease (CITIID), University of Cambridge, Cambridge, UK
| | - J L Asimit
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK.
| |
Collapse
|
10
|
Gkatzionis A, Burgess S, Conti DV, Newcombe PJ. Bayesian variable selection with a pleiotropic loss function in Mendelian randomization. Stat Med 2021; 40:5025-5045. [PMID: 34155684 PMCID: PMC8446304 DOI: 10.1002/sim.9109] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Revised: 04/17/2021] [Accepted: 06/07/2021] [Indexed: 01/04/2023]
Abstract
Mendelian randomization is the use of genetic variants as instruments to assess the existence of a causal relationship between a risk factor and an outcome. A Mendelian randomization analysis requires a set of genetic variants that are strongly associated with the risk factor and only associated with the outcome through their effect on the risk factor. We describe a novel variable selection algorithm for Mendelian randomization that can identify sets of genetic variants which are suitable in both these respects. Our algorithm is applicable in the context of two-sample summary-data Mendelian randomization and employs a recently proposed theoretical extension of the traditional Bayesian statistics framework, including a loss function to penalize genetic variants that exhibit pleiotropic effects. The algorithm offers robust inference through the use of model averaging, as we illustrate by running it on a range of simulation scenarios and comparing it against established pleiotropy-robust Mendelian randomization methods. In a real-data application, we study the effect of systolic and diastolic blood pressure on the risk of suffering from coronary heart disease (CHD). Based on a recent large-scale GWAS for blood pressure, we use 395 genetic variants for systolic and 391 variants for diastolic blood pressure. Both traits are shown to have significant risk-increasing effects on CHD risk.
Collapse
Affiliation(s)
- Apostolos Gkatzionis
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
| | - Stephen Burgess
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
- Department of Public Health and Primary Care, School of Clinical Medicine, University of Cambridge, Cambridge, UK
| | - David V. Conti
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California
| | | |
Collapse
|
11
|
LaPierre N, Taraszka K, Huang H, He R, Hormozdiari F, Eskin E. Identifying causal variants by fine mapping across multiple studies. PLoS Genet 2021; 17:e1009733. [PMID: 34543273 PMCID: PMC8491908 DOI: 10.1371/journal.pgen.1009733] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Revised: 10/05/2021] [Accepted: 07/21/2021] [Indexed: 11/18/2022] Open
Abstract
Increasingly large Genome-Wide Association Studies (GWAS) have yielded numerous variants associated with many complex traits, motivating the development of "fine mapping" methods to identify which of the associated variants are causal. Additionally, GWAS of the same trait for different populations are increasingly available, raising the possibility of refining fine mapping results further by leveraging different linkage disequilibrium (LD) structures across studies. Here, we introduce multiple study causal variants identification in associated regions (MsCAVIAR), a method that extends the popular CAVIAR fine mapping framework to a multiple study setting using a random effects model. MsCAVIAR only requires summary statistics and LD as input, accounts for uncertainty in association statistics using a multivariate normal model, allows for multiple causal variants at a locus, and explicitly models the possibility of different SNP effect sizes in different populations. We demonstrate the efficacy of MsCAVIAR in both a simulation study and a trans-ethnic, trans-biobank fine mapping analysis of High Density Lipoprotein (HDL).
Collapse
Affiliation(s)
- Nathan LaPierre
- Department of Computer Science, University of California, Los Angeles, California, United States
| | - Kodi Taraszka
- Department of Computer Science, University of California, Los Angeles, California, United States
| | - Helen Huang
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, California, United States
| | - Rosemary He
- Department of Mathematics, University of California, Los Angeles, California, United States
| | - Farhad Hormozdiari
- Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States
| | - Eleazar Eskin
- Department of Computer Science, University of California, Los Angeles, California, United States
- Department of Human Genetics, University of California, Los Angeles, California, United States
- Department of Computational Medicine, University of California, Los Angeles, California, United States
| |
Collapse
|
12
|
Wallace C. A more accurate method for colocalisation analysis allowing for multiple causal variants. PLoS Genet 2021; 17:e1009440. [PMID: 34587156 PMCID: PMC8504726 DOI: 10.1371/journal.pgen.1009440] [Citation(s) in RCA: 237] [Impact Index Per Article: 59.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Revised: 10/11/2021] [Accepted: 09/12/2021] [Indexed: 01/05/2023] Open
Abstract
In genome-wide association studies (GWAS) it is now common to search for, and find, multiple causal variants located in close proximity. It has also become standard to ask whether different traits share the same causal variants, but one of the popular methods to answer this question, coloc, makes the simplifying assumption that only a single causal variant exists for any given trait in any genomic region. Here, we examine the potential of the recently proposed Sum of Single Effects (SuSiE) regression framework, which can be used for fine-mapping genetic signals, for use with coloc. SuSiE is a novel approach that allows evidence for association at multiple causal variants to be evaluated simultaneously, whilst separating the statistical support for each variant conditional on the causal signal being considered. We show this results in more accurate coloc inference than other proposals to adapt coloc for multiple causal variants based on conditioning. We therefore recommend that coloc be used in combination with SuSiE to optimise accuracy of colocalisation analyses when multiple causal variants exist.
Collapse
Affiliation(s)
- Chris Wallace
- Cambridge Institute of Therapeutic Immunology and Infectious Disease, University of Cambridge, Cambridge, United Kingdom
- MRC Biostatistics Unit, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
13
|
Robertson CC, Inshaw JRJ, Onengut-Gumuscu S, Chen WM, Santa Cruz DF, Yang H, Cutler AJ, Crouch DJM, Farber E, Bridges SL, Edberg JC, Kimberly RP, Buckner JH, Deloukas P, Divers J, Dabelea D, Lawrence JM, Marcovina S, Shah AS, Greenbaum CJ, Atkinson MA, Gregersen PK, Oksenberg JR, Pociot F, Rewers MJ, Steck AK, Dunger DB, Wicker LS, Concannon P, Todd JA, Rich SS. Fine-mapping, trans-ancestral and genomic analyses identify causal variants, cells, genes and drug targets for type 1 diabetes. Nat Genet 2021; 53:962-971. [PMID: 34127860 PMCID: PMC8273124 DOI: 10.1038/s41588-021-00880-5] [Citation(s) in RCA: 153] [Impact Index Per Article: 38.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Accepted: 05/05/2021] [Indexed: 12/13/2022]
Abstract
We report the largest and most diverse genetic study of type 1 diabetes (T1D) to date (61,427 participants), yielding 78 genome-wide-significant (P < 5 × 10-8) regions, including 36 that are new. We define credible sets of T1D-associated variants and show that they are enriched in immune-cell accessible chromatin, particularly CD4+ effector T cells. Using chromatin-accessibility profiling of CD4+ T cells from 115 individuals, we map chromatin-accessibility quantitative trait loci and identify five regions where T1D risk variants co-localize with chromatin-accessibility quantitative trait loci. We highlight rs72928038 in BACH2 as a candidate causal T1D variant leading to decreased enhancer accessibility and BACH2 expression in T cells. Finally, we prioritize potential drug targets by integrating genetic evidence, functional genomic maps and immune protein-protein interactions, identifying 12 genes implicated in T1D that have been targeted in clinical trials for autoimmune diseases. These findings provide an expanded genomic landscape for T1D.
Collapse
Affiliation(s)
- Catherine C Robertson
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA
| | - Jamie R J Inshaw
- JDRF/Wellcome Diabetes and Inflammation Laboratory, Wellcome Centre for Human Genetics, Nuffield Department of Medicine, NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK
| | - Suna Onengut-Gumuscu
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
| | - Wei-Min Chen
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
| | - David Flores Santa Cruz
- JDRF/Wellcome Diabetes and Inflammation Laboratory, Wellcome Centre for Human Genetics, Nuffield Department of Medicine, NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK
| | - Hanzhi Yang
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - Antony J Cutler
- JDRF/Wellcome Diabetes and Inflammation Laboratory, Wellcome Centre for Human Genetics, Nuffield Department of Medicine, NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK
| | - Daniel J M Crouch
- JDRF/Wellcome Diabetes and Inflammation Laboratory, Wellcome Centre for Human Genetics, Nuffield Department of Medicine, NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK
| | - Emily Farber
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - S Louis Bridges
- Division of Rheumatology, Department of Medicine, Hospital for Special Surgery, New York, NY, USA
- Division of Rheumatology, Department of Medicine, Weill Cornell Medical College, New York, NY, USA
| | - Jeffrey C Edberg
- Division of Clinical Immunology and Rheumatology, Department of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Robert P Kimberly
- Division of Clinical Immunology and Rheumatology, Department of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Jane H Buckner
- Center for Translational Immunology, Benaroya Research Institute, Seattle, WA, USA
| | - Panos Deloukas
- Clinical Pharmacology, William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, UK
- Princess Al-Jawhara Al-Brahim Centre of Excellence in Research of Hereditary Disorders (PACER-HD), King Abdulaziz University, Jeddah, Saudi Arabia
| | - Jasmin Divers
- Division of Health Services Research, Department of Foundations of Medicine, New York University Long Island School of Medicine, Mineola, NY, USA
| | - Dana Dabelea
- Colorado School of Public Health and Lifecourse Epidemiology of Adiposity and Diabetes (LEAD) Center, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Jean M Lawrence
- Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, CA, USA
| | - Santica Marcovina
- Northwest Lipid Metabolism and Diabetes Research Laboratories, University of Washington, Seattle, WA, USA
- Medpace Reference Laboratories, Cincinnati, OH, USA
| | - Amy S Shah
- Cincinnati Children's Hospital Medical Center and the University of Cincinnati, Cincinnati, OH, USA
| | - Carla J Greenbaum
- Center for Interventional Immunology, Benaroya Research Institute, Seattle, WA, USA
- Diabetes Program, Benaroya Research Institute, Seattle, WA, USA
| | - Mark A Atkinson
- Department of Pathology, Immunology, and Laboratory Medicine, University of Florida, Gainesville, FL, USA
| | - Peter K Gregersen
- Robert S. Boas Center for Genomics and Human Genetics, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA
| | - Jorge R Oksenberg
- Department of Neurology and Weill Institute for Neurosciences, University of California at San Francisco, San Francisco, CA, USA
| | - Flemming Pociot
- Department of Pediatrics, Herlev University Hospital, Copenhagen, Denmark
- Institute of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
- Type 1 Diabetes Biology, Department of Clinical Research, Steno Diabetes Center Copenhagen, Gentofte, Denmark
| | - Marian J Rewers
- Barbara Davis Center for Diabetes, School of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Andrea K Steck
- Barbara Davis Center for Diabetes, School of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - David B Dunger
- Department of Paediatrics, University of Cambridge, Cambridge, UK
- Wellcome Trust Medical Research Council Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | - Linda S Wicker
- JDRF/Wellcome Diabetes and Inflammation Laboratory, Wellcome Centre for Human Genetics, Nuffield Department of Medicine, NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK
| | - Patrick Concannon
- Department of Pathology, Immunology, and Laboratory Medicine, University of Florida, Gainesville, FL, USA
- Genetics Institute, University of Florida, Gainesville, FL, USA
| | - John A Todd
- JDRF/Wellcome Diabetes and Inflammation Laboratory, Wellcome Centre for Human Genetics, Nuffield Department of Medicine, NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK.
| | - Stephen S Rich
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
| |
Collapse
|
14
|
Abdallah AM, Abu-Madi M. The Genetic Control of the Rheumatic Heart: Closing the Genotype-Phenotype Gap. Front Med (Lausanne) 2021; 8:611036. [PMID: 33842495 PMCID: PMC8024521 DOI: 10.3389/fmed.2021.611036] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Accepted: 01/07/2021] [Indexed: 12/20/2022] Open
Abstract
Rheumatic heart disease (RHD) is a heritable inflammatory condition characterized by carditis, arthritis, and systemic disease. Although remaining neglected, the last 3 years has seen some promising advances in RHD research. Whilst it is clear that RHD can be triggered by recurrent group A streptococcal infections, the mechanisms driving clinical progression are still poorly understood. This review summarizes our current understanding of the genetics implicated in this process and the genetic determinants that predispose some people to RHD. The evidence demonstrating the importance of individual cell types and cellular states in delineating causal genetic variants is discussed, highlighting phenotype/genotype correlations where possible. Genetic fine mapping and functional studies in extreme phenotypes, together with large-scale omics studies including genomics, transcriptomics, epigenomics, and metabolomics, are expected to provide new information not only on RHD but also on the mechanisms of other autoimmune diseases and facilitate future clinical translation.
Collapse
Affiliation(s)
- Atiyeh M Abdallah
- Biomedical and Pharmaceutical Research Unit, Department of Biomedical Sciences, College of Health Sciences, QU Health, Qatar University, Doha, Qatar
| | - Marawan Abu-Madi
- Biomedical and Pharmaceutical Research Unit, Department of Biomedical Sciences, College of Health Sciences, QU Health, Qatar University, Doha, Qatar
| |
Collapse
|
15
|
GWAS for autoimmune Addison's disease identifies multiple risk loci and highlights AIRE in disease susceptibility. Nat Commun 2021; 12:959. [PMID: 33574239 PMCID: PMC7878795 DOI: 10.1038/s41467-021-21015-8] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Accepted: 12/17/2020] [Indexed: 02/07/2023] Open
Abstract
Autoimmune Addison's disease (AAD) is characterized by the autoimmune destruction of the adrenal cortex. Low prevalence and complex inheritance have long hindered successful genetic studies. We here report the first genome-wide association study on AAD, which identifies nine independent risk loci (P < 5 × 10-8). In addition to loci implicated in lymphocyte function and development shared with other autoimmune diseases such as HLA, BACH2, PTPN22 and CTLA4, we associate two protein-coding alterations in Autoimmune Regulator (AIRE) with AAD. The strongest, p.R471C (rs74203920, OR = 3.4 (2.7-4.3), P = 9.0 × 10-25) introduces an additional cysteine residue in the zinc-finger motif of the second PHD domain of the AIRE protein. This unbiased elucidation of the genetic contribution to development of AAD points to the importance of central immunological tolerance, and explains 35-41% of heritability (h2).
Collapse
|
16
|
Hutchinson A, Asimit J, Wallace C. Fine-mapping genetic associations. Hum Mol Genet 2020; 29:R81-R88. [PMID: 32744321 PMCID: PMC7733401 DOI: 10.1093/hmg/ddaa148] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Revised: 06/04/2020] [Accepted: 07/09/2020] [Indexed: 02/07/2023] Open
Abstract
Whilst thousands of genetic variants have been associated with human traits, identifying the subset of those variants that are causal requires a further 'fine-mapping' step. We review the basic fine-mapping approach, which is computationally fast and requires only summary data, but depends on an assumption of a single causal variant per associated region which is recognized as biologically unrealistic. We discuss different ways that the approach has been built upon to accommodate multiple causal variants in a region and to incorporate additional layers of functional annotation data. We further review methods for simultaneous fine-mapping of multiple datasets, either exploiting different linkage disequilibrium (LD) structures across ancestries or borrowing information between distinct but related traits. Finally, we look to the future and the opportunities that will be offered by increasingly accurate maps of causal variants for a multitude of human traits.
Collapse
Affiliation(s)
- Anna Hutchinson
- MRC Biostatistics Unit, Cambridge Biomedical Campus, Cambridge Institute of Public Health, Cambridge CB2 0SR, UK
| | - Jennifer Asimit
- MRC Biostatistics Unit, Cambridge Biomedical Campus, Cambridge Institute of Public Health, Cambridge CB2 0SR, UK
| | - Chris Wallace
- MRC Biostatistics Unit, Cambridge Biomedical Campus, Cambridge Institute of Public Health, Cambridge CB2 0SR, UK
- Cambridge Institute of Therapeutic Immunology & Infectious Disease (CITIID), Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, University of Cambridge, Cambridge, CB2 0AW, UK
- Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, CB2 2QQ, UK
| |
Collapse
|
17
|
Shi H, Burch KS, Johnson R, Freund MK, Kichaev G, Mancuso N, Manuel AM, Dong N, Pasaniuc B. Localizing Components of Shared Transethnic Genetic Architecture of Complex Traits from GWAS Summary Data. Am J Hum Genet 2020; 106:805-817. [PMID: 32442408 PMCID: PMC7273527 DOI: 10.1016/j.ajhg.2020.04.012] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Accepted: 04/20/2020] [Indexed: 12/19/2022] Open
Abstract
Despite strong transethnic genetic correlations reported in the literature for many complex traits, the non-transferability of polygenic risk scores across populations suggests the presence of population-specific components of genetic architecture. We propose an approach that models GWAS summary data for one trait in two populations to estimate genome-wide proportions of population-specific/shared causal SNPs. In simulations across various genetic architectures, we show that our approach yields approximately unbiased estimates with in-sample LD and slight upward-bias with out-of-sample LD. We analyze nine complex traits in individuals of East Asian and European ancestry, restricting to common SNPs (MAF > 5%), and find that most common causal SNPs are shared by both populations. Using the genome-wide estimates as priors in an empirical Bayes framework, we perform fine-mapping and observe that high-posterior SNPs (for both the population-specific and shared causal configurations) have highly correlated effects in East Asians and Europeans. In population-specific GWAS risk regions, we observe a 2.8× enrichment of shared high-posterior SNPs, suggesting that population-specific GWAS risk regions harbor shared causal SNPs that are undetected in the other GWASs due to differences in LD, allele frequencies, and/or sample size. Finally, we report enrichments of shared high-posterior SNPs in 53 tissue-specific functional categories and find evidence that SNP-heritability enrichments are driven largely by many low-effect common SNPs.
Collapse
Affiliation(s)
- Huwenbo Shi
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
| | - Kathryn S Burch
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA.
| | - Ruth Johnson
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Malika K Freund
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Gleb Kichaev
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Nicholas Mancuso
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Astrid M Manuel
- Department of Biological Sciences, Florida International University, Miami, FL 33199, USA
| | - Natalie Dong
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
| | - Bogdan Pasaniuc
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
| |
Collapse
|
18
|
Hutchinson A, Watson H, Wallace C. Improving the coverage of credible sets in Bayesian genetic fine-mapping. PLoS Comput Biol 2020; 16:e1007829. [PMID: 32282791 PMCID: PMC7179948 DOI: 10.1371/journal.pcbi.1007829] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Revised: 04/23/2020] [Accepted: 03/27/2020] [Indexed: 12/19/2022] Open
Abstract
Genome Wide Association Studies (GWAS) have successfully identified thousands of loci associated with human diseases. Bayesian genetic fine-mapping studies aim to identify the specific causal variants within GWAS loci responsible for each association, reporting credible sets of plausible causal variants, which are interpreted as containing the causal variant with some "coverage probability". Here, we use simulations to demonstrate that the coverage probabilities are over-conservative in most fine-mapping situations. We show that this is because fine-mapping data sets are not randomly selected from amongst all causal variants, but from amongst causal variants with larger effect sizes. We present a method to re-estimate the coverage of credible sets using rapid simulations based on the observed, or estimated, SNP correlation structure, we call this the "adjusted coverage estimate". This is extended to find "adjusted credible sets", which are the smallest set of variants such that their adjusted coverage estimate meets the target coverage. We use our method to improve the resolution of a fine-mapping study of type 1 diabetes. We found that in 27 out of 39 associated genomic regions our method could reduce the number of potentially causal variants to consider for follow-up, and found that none of the 95% or 99% credible sets required the inclusion of more variants-a pattern matched in simulations of well powered GWAS. Crucially, our method requires only GWAS summary statistics and remains accurate when SNP correlations are estimated from a large reference panel. Using our method to improve the resolution of fine-mapping studies will enable more efficient expenditure of resources in the follow-up process of annotating the variants in the credible set to determine the implicated genes and pathways in human diseases.
Collapse
Affiliation(s)
- Anna Hutchinson
- MRC Biostatistics Unit, Cambridge Institute of Public Health, Cambridge, United Kingdom
| | - Hope Watson
- MRC Biostatistics Unit, Cambridge Institute of Public Health, Cambridge, United Kingdom
| | - Chris Wallace
- MRC Biostatistics Unit, Cambridge Institute of Public Health, Cambridge, United Kingdom
- Cambridge Institute for Therapeutic Immunology and Infectious Disease, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|