1
|
Zajac GJM, Gagliano Taliun SA, Sidore C, Graham SE, Åsvold BO, Brumpton B, Nielsen JB, Zhou W, Gabrielsen M, Skogholt AH, Fritsche LG, Schlessinger D, Cucca F, Hveem K, Willer CJ, Abecasis GR. A fast linkage method for population GWAS cohorts with related individuals. Genet Epidemiol 2023; 47:231-248. [PMID: 36739617 PMCID: PMC10027464 DOI: 10.1002/gepi.22516] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Revised: 10/27/2022] [Accepted: 01/24/2023] [Indexed: 02/07/2023]
Abstract
Linkage analysis, a class of methods for detecting co-segregation of genomic segments and traits in families, was used to map disease-causing genes for decades before genotyping arrays and dense SNP genotyping enabled genome-wide association studies in population samples. Population samples often contain related individuals, but the segregation of alleles within families is rarely used because traditional linkage methods are computationally inefficient for larger datasets. Here, we describe Population Linkage, a novel application of Haseman-Elston regression as a method of moments estimator of variance components and their standard errors. We achieve additional computational efficiency by using modern methods for detection of IBD segments and variance component estimation, efficient preprocessing of input data, and minimizing redundant numerical calculations. We also refined variance component models to account for the biases in population-scale methods for IBD segment detection. We ran Population Linkage on four blood lipid traits in over 70,000 individuals from the HUNT and SardiNIA studies, successfully detecting 25 known genetic signals. One notable linkage signal that appeared in both was for low-density lipoprotein (LDL) cholesterol levels in the region near the gene APOE (LOD = 29.3, variance explained = 4.1%). This is the region where the missense variants rs7412 and rs429358, which together make up the ε2, ε3, and ε4 alleles each account for 2.4% and 0.8% of variation in circulating LDL cholesterol. Our results show the potential for linkage analysis and other large-scale applications of method of moments variance components estimation.
Collapse
Affiliation(s)
- Gregory JM Zajac
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI
| | - Sarah A Gagliano Taliun
- Department of Medicine and Department of Neurosciences, Université de Montréal, Montréal, QC H3T 1J4, Canada
- Montréal Heart Institute, Montréal, QC H1T 1C8, Canada
| | - Carlo Sidore
- Istituto di Ricerca Genetica e Biomedica - CNR, Cagliari, Italy
- Dipartimento di Scienze Biomediche, Università di Sassari, Sassari, Italy
| | - Sarah E Graham
- Department of Internal Medicine, Division of Cardiology, University of Michigan, Ann Arbor, MI
| | - Bjørn Olav Åsvold
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, Norway
- Department of Endocrinology, Clinic of Medicine, St. Olavs hospital, Trondheim University Hospital, Trondheim, Norway
- HUNT Research Centre, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Levanger 7600, Norway
| | - Ben Brumpton
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, Norway
- HUNT Research Centre, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Levanger 7600, Norway
- Clinic of Medicine, St. Olavs Hospital, Trondheim University Hospital, Trondheim, Norway
| | - Jonas B Nielsen
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, Norway
| | - Wei Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA
| | - Maiken Gabrielsen
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, Norway
| | - Anne Heidi Skogholt
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, Norway
| | - Lars G Fritsche
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI
| | | | - Francesco Cucca
- Istituto di Ricerca Genetica e Biomedica - CNR, Cagliari, Italy
- Dipartimento di Scienze Biomediche, Università di Sassari, Sassari, Italy
| | - Kristian Hveem
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, Norway
- HUNT Research Centre, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Levanger 7600, Norway
- Department of Medicine, Levanger Hospital, Nord-Trøndelag Hospital Trust, Levanger 7600, Norway
| | - Cristen J Willer
- Department of Internal Medicine, Division of Cardiology, University of Michigan, Ann Arbor, MI
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI
- Department of Human Genetics, University of Michigan, Ann Arbor, MI
| | - Gonçalo R Abecasis
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI
| |
Collapse
|
2
|
Abstract
Metabolic syndrome (MetS) is a highly heritable disease and a major public health burden worldwide. MetS diagnosis criteria are met by the simultaneous presence of any three of the following: high triglycerides, low HDL/high LDL cholesterol, insulin resistance, hypertension, and central obesity. These diseases act synergistically in people suffering from MetS and dramatically increase risk of morbidity and mortality due to stroke and cardiovascular disease, as well as certain cancers. Each of these component features is itself a complex disease, as is MetS. As a genetically complex disease, genetic risk factors for MetS are numerous, but not very powerful individually, often requiring specific environmental stressors for the disease to manifest. When taken together, all sequence variants that contribute to MetS disease risk explain only a fraction of the heritable variance, suggesting additional, novel loci have yet to be discovered. In this article, we will give a brief overview on the genetic concepts needed to interpret genome-wide association studies (GWAS) and quantitative trait locus (QTL) data, summarize the state of the field of MetS physiological genomics, and to introduce tools and resources that can be used by the physiologist to integrate genomics into their own research on MetS and any of its component features. There is a wealth of phenotypic and molecular data in animal models and humans that can be leveraged as outlined in this article. Integrating these multi-omic QTL data for complex diseases such as MetS provides a means to unravel the pathways and mechanisms leading to complex disease and promise for novel treatments. © 2022 American Physiological Society. Compr Physiol 12:1-40, 2022.
Collapse
Affiliation(s)
- Karen C Clark
- Department of Physiology, Medical College of Wisconsin, Milwaukee, Wisconsin, USA
| | - Anne E Kwitek
- Department of Physiology, Medical College of Wisconsin, Milwaukee, Wisconsin, USA
| |
Collapse
|
3
|
Al-Sarraj Y, Al-Dous E, Taha RZ, Ahram D, Alshaban F, Tolfat M, El-Shanti H, Albagha OM. Family-Based Genome-Wide Association Study of Autism Spectrum Disorder in Middle Eastern Families. Genes (Basel) 2021; 12:761. [PMID: 34069769 PMCID: PMC8157263 DOI: 10.3390/genes12050761] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Revised: 05/13/2021] [Accepted: 05/13/2021] [Indexed: 12/20/2022] Open
Abstract
Autism spectrum disorder (ASD) is a neurodevelopmental disease characterized by abnormalities in language and social communication with substantial clinical heterogeneity. Genetic factors play an important role in ASD with heritability estimated between 70% to 80%. Genome-wide association studies (GWAS) have identified multiple loci associated with ASD. However, most studies were performed on European populations and little is known about the genetic architecture of ASD in Middle Eastern populations. Here, we report the first GWAS of ASD in the Middle eastern population of Qatar. We analyzed 171 families with ASD, using linear mixed models adjusting for relatedness and other confounders. Results showed that common single nucleotide polymorphisms (SNP) in seven loci are associated with ASD (p < 1 × 10-5). Although the identified loci did not reach genome-wide significance, many of the top associated SNPs are located within or near genes that have been implicated in ASD or related neurodevelopmental disorders. These include GORASP2, GABBR2, ANKS6, THSD4, ERCC6L, ARHGEF6, and HDAC8. Additionally, three of the top associated SNPs were significantly associated with gene expression. We also found evidence of association signals in two previously reported ASD-susceptibility loci (rs10099100 and rs4299400). Our results warrant further functional studies and replication to provide further insights into the genetic architecture of ASD.
Collapse
Affiliation(s)
- Yasser Al-Sarraj
- College of Health and Life Sciences, Hamad Bin Khalifa University, Doha 34110, Qatar; (Y.A.-S.); (E.A.-D.)
- Qatar Biomedical Research Institute (QBRI), Hamad Bin Khalifa University, Doha 34110, Qatar; (R.Z.T.); (D.A.); (F.A.); (H.E.-S.)
| | - Eman Al-Dous
- College of Health and Life Sciences, Hamad Bin Khalifa University, Doha 34110, Qatar; (Y.A.-S.); (E.A.-D.)
- Qatar Biomedical Research Institute (QBRI), Hamad Bin Khalifa University, Doha 34110, Qatar; (R.Z.T.); (D.A.); (F.A.); (H.E.-S.)
| | - Rowaida Z. Taha
- Qatar Biomedical Research Institute (QBRI), Hamad Bin Khalifa University, Doha 34110, Qatar; (R.Z.T.); (D.A.); (F.A.); (H.E.-S.)
| | - Dina Ahram
- Qatar Biomedical Research Institute (QBRI), Hamad Bin Khalifa University, Doha 34110, Qatar; (R.Z.T.); (D.A.); (F.A.); (H.E.-S.)
- Division of Nephrology, Columbia University Medical Center, New York, NY 10032, USA
| | - Fouad Alshaban
- Qatar Biomedical Research Institute (QBRI), Hamad Bin Khalifa University, Doha 34110, Qatar; (R.Z.T.); (D.A.); (F.A.); (H.E.-S.)
| | - Mohammed Tolfat
- The Shafallah Center for Children with Special Needs, Doha 33123, Qatar;
| | - Hatem El-Shanti
- Qatar Biomedical Research Institute (QBRI), Hamad Bin Khalifa University, Doha 34110, Qatar; (R.Z.T.); (D.A.); (F.A.); (H.E.-S.)
- Department of Pediatrics, Carver College of Medicine, University of Iowa, Iowa City, IA 52242, USA
| | - Omar M.E. Albagha
- College of Health and Life Sciences, Hamad Bin Khalifa University, Doha 34110, Qatar; (Y.A.-S.); (E.A.-D.)
- Qatar Biomedical Research Institute (QBRI), Hamad Bin Khalifa University, Doha 34110, Qatar; (R.Z.T.); (D.A.); (F.A.); (H.E.-S.)
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh EH4 2XU, UK
| |
Collapse
|
4
|
Labadie JD, Elvers I, Feigelson HS, Magzamen S, Yoshimoto J, Dossey J, Burnett R, Avery AC. Genome-wide association analysis of canine T zone lymphoma identifies link to hypothyroidism and a shared association with mast-cell tumors. BMC Genomics 2020; 21:464. [PMID: 32631225 PMCID: PMC7339439 DOI: 10.1186/s12864-020-06872-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Accepted: 06/26/2020] [Indexed: 01/23/2023] Open
Abstract
Background T zone lymphoma (TZL), a histologic variant of peripheral T cell lymphoma, represents about 12% of all canine lymphomas. Golden Retrievers appear predisposed, representing over 40% of TZL cases. Prior research found that asymptomatic aged Golden Retrievers frequently have populations of T zone-like cells (phenotypically identical to TZL) of undetermined significance (TZUS), potentially representing a pre-clinical state. These findings suggest a genetic risk factor for this disease and caused us to investigate potential genes of interest using a genome-wide association study of privately-owned U.S. Golden Retrievers. Results Dogs were categorized as TZL (n = 95), TZUS (n = 142), or control (n = 101) using flow cytometry and genotyped using the Illumina CanineHD BeadChip. Using a mixed linear model adjusting for population stratification, we found association with genome-wide significance in regions on chromosomes 8 and 14. The chromosome 14 peak included four SNPs (Odds Ratio = 1.18–1.19, p = .3 × 10− 5–5.1 × 10− 5) near three hyaluronidase genes (SPAM1, HYAL4, and HYALP1). Targeted resequencing of this region using a custom sequence capture array identified missense mutations in all three genes; the variant in SPAM1 was predicted to be damaging. These mutations were also associated with risk for mast cell tumors among Golden Retrievers in an unrelated study. The chromosome 8 peak contained 7 SNPs (Odds Ratio = 1.24–1.42, p = 2.7 × 10− 7–7.5 × 10− 5) near genes involved in thyroid hormone regulation (DIO2 and TSHR). A prior study from our laboratory found hypothyroidism is inversely associated with TZL risk. No coding mutations were found with targeted resequencing but identified variants may play a regulatory role for all or some of the genes. Conclusions The pathogenesis of canine TZL may be related to hyaluronan breakdown and subsequent production of pro-inflammatory and pro-oncogenic byproducts. The association on chromosome 8 may indicate thyroid hormone is involved in TZL development, consistent with findings from a previous study evaluating epidemiologic risk factors for TZL. Future work is needed to elucidate these mechanisms.
Collapse
Affiliation(s)
- Julia D Labadie
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA. .,Department of Environmental and Radiological Health Sciences, College of Veterinary Medicine and Biomedical Sciences, Colorado State University, Fort Collins, CO, USA.
| | - Ingegerd Elvers
- Department of Medical Biochemistry and Microbiology, Uppsala University, Broad Institute of MIT and Harvard, Cambridge, Massachusetts and Science for Life Laboratory, Uppsala, Sweden
| | | | - Sheryl Magzamen
- Department of Environmental and Radiological Health Sciences, College of Veterinary Medicine and Biomedical Sciences, Colorado State University, Fort Collins, CO, USA
| | - Janna Yoshimoto
- Department of Microbiology, Immunology and Pathology, College of Veterinary Medicine and Biomedical Sciences, Colorado State University, Fort Collins, CO, USA
| | - Jeremy Dossey
- Department of Microbiology, Immunology and Pathology, College of Veterinary Medicine and Biomedical Sciences, Colorado State University, Fort Collins, CO, USA
| | - Robert Burnett
- Department of Microbiology, Immunology and Pathology, College of Veterinary Medicine and Biomedical Sciences, Colorado State University, Fort Collins, CO, USA
| | - Anne C Avery
- Department of Microbiology, Immunology and Pathology, College of Veterinary Medicine and Biomedical Sciences, Colorado State University, Fort Collins, CO, USA
| |
Collapse
|
5
|
Thomson RJ, McMorran B, Hoy W, Jose M, Whittock L, Thornton T, Burgio G, Mathews JD, Foote S. New Genetic Loci Associated With Chronic Kidney Disease in an Indigenous Australian Population. Front Genet 2019; 10:330. [PMID: 31040861 PMCID: PMC6476903 DOI: 10.3389/fgene.2019.00330] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2018] [Accepted: 03/28/2019] [Indexed: 12/11/2022] Open
Abstract
The common occurrence of renal disease in Australian Aboriginal populations such as Tiwi Islanders may be determined by environmental and genetic factors. To explore genetic contributions, we performed a genome-wide association study (GWAS) of urinary albumin creatinine ratio (ACR) in a sample of 249 Tiwi individuals with genotype data from a 370K Affymetrix single nucleotide polymorphism (SNP) array. A principal component analysis (PCA) of the 249 individual Tiwi cohort and samples from 11 populations included in phase III of the HapMap Project indicated that Tiwi Islanders are a relatively distinct and unique population with no close genetic relationships to the other ethnic groups. After adjusting for age and sex, the proportion of ACR variance explained by the 370K SNPs was estimated to be 37% (using the software GCTA.31; likelihood ratio = 8.06, p-value = 0.002). The GWAS identified eight SNPs that were nominally significantly associated with ACR (p < 0.0005). A replication study of these SNPs was performed in an independent cohort of 497 individuals on the eight SNPs. Four of these SNPs were significantly associated with ACR in the replication sample (p < 0.05), rs4016189 located near the CRIM1 gene (p = 0.000751), rs443816 located in the gene encoding UGT2B11 (p = 0.022), rs6461901 located near the NFE2L3 gene, and rs1535656 located in the RAB14 gene. The SNP rs4016189 was still significant after adjusting for multiple testing. A structural equation model (SEM) demonstrated that the rs4016189 SNP was not associated with other phenotypes such as estimated glomerular filtration rate (eGFR), diabetes, and blood pressure.
Collapse
Affiliation(s)
- Russell J Thomson
- Centre for Research in Mathematics, School of Computing, Engineering and Mathematics, Western Sydney University, Sydney, NSW, Australia
| | - Brendan McMorran
- John Curtin School of Medical Research, Australian National University, Canberra, ACT, Australia
| | - Wendy Hoy
- Centre for Chronic Disease, Faculty of Health, The University of Queensland, Brisbane, QLD, Australia
| | - Matthew Jose
- Menzies Institute of Medical Research, College of Health and Medicine, University of Tasmania, Hobart, TAS, Australia.,School of Medicine, College of Health and Medicine, University of Tasmania, Hobart, TAS, Australia
| | - Lucy Whittock
- Institute for Marine and Antarctic Studies, College of Sciences and Engineering, University of Tasmania, Hobart, TAS, Australia
| | - Tim Thornton
- Department of Biostatistics, School of Public Health, University of Washington, Seattle, WA, United States
| | - Gaétan Burgio
- John Curtin School of Medical Research, Australian National University, Canberra, ACT, Australia
| | - John Duncan Mathews
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, VIC, Australia
| | - Simon Foote
- John Curtin School of Medical Research, Australian National University, Canberra, ACT, Australia
| |
Collapse
|