1
|
Lo YC, Tian H, Chan TF, Jeon S, Alatorre K, Dinh BL, Maskarinec G, Taparra K, Nakatsuka N, Yu M, Chen CY, Lin YF, Wilkens LR, Le Marchand L, Haiman CA, Chiang CWK. The accuracy of polygenic score models for BMI and Type II diabetes in the Native Hawaiian population. Commun Biol 2025; 8:651. [PMID: 40269120 PMCID: PMC12018950 DOI: 10.1038/s42003-025-08050-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2024] [Accepted: 04/07/2025] [Indexed: 04/25/2025] Open
Abstract
Polygenic scores (PGS) are promising in stratifying individuals based on the genetic susceptibility to complex diseases or traits. However, the accuracy of PGS models, typically trained in European- or East Asian-ancestry populations, tend to perform poorly in other ethnic minority populations and their accuracies have not been evaluated for Native Hawaiians. In particular, for body mass index (BMI) and type-2 diabetes (T2D), Polynesian-ancestry individuals such as Native Hawaiians or Samoans exhibit varied distribution from other continental populations, but are understudied, particularly in the context of PGS. Using BMI and T2D as examples of metabolic traits of importance to Polynesian populations (along with height as a comparison of a similarly highly polygenic trait), here we examine the prediction accuracies of PGS models in a large Native Hawaiian sample from the Multiethnic Cohort with up to 5300 individuals. We find evidence of lowered prediction accuracies for the PGS models in some cases, particularly for height. We also find that using the Native Hawaiian samples as an optimization cohort during training does not consistently improve PGS performance. Moreover, even the best-performing PGS models among Native Hawaiians have lowered prediction accuracy among the subset of individuals most enriched with Polynesian ancestry. Our findings indicate that factors such as admixture histories, sample size, and diversity in GWAS can influence PGS performance for complex traits among Native Hawaiian samples. This study provides an initial survey of PGS performance among Native Hawaiians and exposes the current gaps and challenges associated with improving polygenic prediction models for underrepresented minority populations.
Collapse
Affiliation(s)
- Ying-Chu Lo
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - He Tian
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Tsz Fung Chan
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Soyoung Jeon
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Kimberli Alatorre
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Bryan L Dinh
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Gertraud Maskarinec
- Epidemiology Program, University of Hawai'i Cancer Center, University of Hawai'i, Manoa, Honolulu, HI, USA
| | - Kekoa Taparra
- Standard Health Care, Department of Radiation Oncology, Palo Alto, CA, USA
| | | | - Mingrui Yu
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Yen-Feng Lin
- Center for Neuropsychiatric Research, National Health Research Institutes, Miaoli, Taiwan
- Department of Public Health & Medical Humanities, School of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
- Institute of Behavioral Medicine, College of Medicine, National Cheng Kung University, Tainan, Taiwan
| | - Lynne R Wilkens
- Epidemiology Program, University of Hawai'i Cancer Center, University of Hawai'i, Manoa, Honolulu, HI, USA
| | - Loic Le Marchand
- Epidemiology Program, University of Hawai'i Cancer Center, University of Hawai'i, Manoa, Honolulu, HI, USA
| | - Christopher A Haiman
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Cancer Epidemiology Program, Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, USA
| | - Charleston W K Chiang
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
- Cancer Epidemiology Program, Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, USA.
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
| |
Collapse
|
2
|
Fan C, Cahoon JL, Dinh BL, Ortega-Del Vecchyo D, Huber CD, Edge MD, Mancuso N, Chiang CWK. A likelihood-based framework for demographic inference from genealogical trees. Nat Genet 2025; 57:865-874. [PMID: 40113903 DOI: 10.1038/s41588-025-02129-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Accepted: 02/14/2025] [Indexed: 03/22/2025]
Abstract
The demographic history of a population underlies patterns of genetic variation and is encoded in the gene-genealogical trees of the sampled haplotypes. Here we propose a demographic inference framework called the genealogical likelihood (gLike). Our method uses a graph-based structure to summarize the relationships among all lineages in a gene-genealogical tree with all possible trajectories of population memberships through time and derives the full likelihood across trees under a parameterized demographic model. We show through simulations and empirical applications that for populations that have experienced multiple admixtures, gLike can accurately estimate dozens of demographic parameters, including ancestral population sizes, admixture timing and admixture proportions, and it outperforms conventional demographic inference methods using the site frequency spectrum. Taken together, our proposed gLike framework harnesses underused genealogical information to offer high sensitivity and accuracy in inferring complex demographies for humans and other species.
Collapse
Affiliation(s)
- Caoqi Fan
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
| | - Jordan L Cahoon
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
- Department of Computer Science, University of Southern California, Los Angeles, CA, USA
| | - Bryan L Dinh
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Diego Ortega-Del Vecchyo
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Querétaro, México
| | - Christian D Huber
- Department of Biology, Penn State University, University Park, PA, USA
| | - Michael D Edge
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Nicholas Mancuso
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Charleston W K Chiang
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
| |
Collapse
|
3
|
Ha EK, Shriner D, Callier SL, Riley L, Adeyemo AA, Rotimi CN, Bentley AR. Native Hawaiian and Pacific Islander populations in genomic research. NPJ Genom Med 2024; 9:45. [PMID: 39349931 PMCID: PMC11442686 DOI: 10.1038/s41525-024-00428-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 09/06/2024] [Indexed: 10/04/2024] Open
Abstract
The role of genomic research and medicine in improving health continues to grow significantly, highlighting the need for increased equitable inclusion of diverse populations in genomics. Native Hawaiian and Pacific Islander (NHPI) communities are often missing from these efforts to ensure that the benefits of genomics are accessible to all individuals. In this article, we analyze the qualities of NHPI populations relevant to their inclusion in genomic research and investigate their current representation using data from the genome-wide association studies (GWAS) catalog. A discussion of the barriers NHPI experience regarding participating in research and recommendations to improve NHPI representation in genomic research are also included.
Collapse
Affiliation(s)
- Edra K Ha
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- University of Hawai'i at Mānoa, Honolulu, HI, USA
- University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Daniel Shriner
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Shawneequa L Callier
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Department of Clinical Research and Leadership, The George Washington University School of Medicine and Health Sciences, Washington, DC, USA
| | | | - Adebowale A Adeyemo
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Charles N Rotimi
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Amy R Bentley
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.
| |
Collapse
|
4
|
Rivara AC, Russell EM, Carlson JC, Pomer A, Naseri T, Reupena MS, Manna SL, Viali S, Minster RL, Weeks DE, DeLany JP, Kershaw EE, McGarvey ST, Hawley NL. Associations between fasting glucose rate-of-change and the missense variant, rs373863828, in an adult Samoan cohort. PLoS One 2024; 19:e0302643. [PMID: 38829901 PMCID: PMC11146712 DOI: 10.1371/journal.pone.0302643] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Accepted: 04/09/2024] [Indexed: 06/05/2024] Open
Abstract
BACKGROUND The A allele of rs373863828 in CREB3 regulatory factor is associated with high Body Mass Index, but lower odds of type 2 diabetes. These associations have been replicated elsewhere, but to date all studies have been cross-sectional. Our aims were (1) to describe the development of type 2 diabetes and change in fasting glucose between 2010 and 2018 among a longitudinal cohort of adult Samoans without type 2 diabetes or who were not using diabetes medications at baseline, and (2) to examine associations between fasting glucose rate-of-change (mmol/L per year) and the A allele of rs373863828. METHODS We describe and test differences in fasting glucose, the development of type 2 diabetes, body mass index, age, smoking status, physical activity, urbanicity of residence, and household asset scores between 2010 and 2018 among a cohort of n = 401 adult Samoans, selected to have a ~2:2:1 ratio of GG:AG: AA rs373863828 genotypes. Multivariate linear regression was used to test whether fasting glucose rate-of-change was associated with rs373863828 genotype, and other baseline variables. RESULTS By 2018, fasting glucose and BMI significantly increased among all genotype groups, and a substantial portion of the sample developed type 2 diabetes mellitus. The A allele was associated with a lower fasting glucose rate-of-change (β = -0.05 mmol/L/year per allele, p = 0.058 among women; β = -0.004 mmol/L/year per allele, p = 0.863 among men), after accounting for baseline variables. Mean fasting glucose and mean BMI increased over an eight-year period and a substantial number of individuals developed type 2 diabetes by 2018. However, fasting glucose rate-of-change, and type 2 diabetes development was lower among females with AG and AA genotypes. CONCLUSIONS Further research is needed to understand the effect of the A allele on fasting glucose and type 2 diabetes development. Based on our observations that other risk factors increased over time, we advocate for the continued promotion for diabetes prevention and treatment programming, and the reduction of modifiable risk factors, in this setting.
Collapse
Affiliation(s)
- Anna C. Rivara
- Department of Chronic Disease Epidemiology, Yale School of Public Health, New Haven, Connecticut, United States of America
| | - Emily M. Russell
- Department of Human Genetics, School of Public Health, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Jenna C. Carlson
- Department of Human Genetics, School of Public Health, University of Pittsburgh, Pittsburgh, PA, United States of America
- Department of Biostatistics, School of Public Health, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Alysa Pomer
- Center of Surgery and Public Health, Brigham and Women’s Hospital, Boston, MA, United States of America
| | - Take Naseri
- Family Health Clinic, Apia, Samoa
- Naseri & Associates Health Consultancy Firm, Apia, Samoa
| | | | - Samantha L. Manna
- Department of Human Genetics, School of Public Health, University of Pittsburgh, Pittsburgh, PA, United States of America
- Center for Craniofacial and Dental Genetics, Department of Oral and Craniofacial Sciences, School of Dental Medicine, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Satupaitea Viali
- Department of Chronic Disease Epidemiology, Yale School of Public Health, New Haven, Connecticut, United States of America
- Oceania University of Medicine, Apia, Samoa
| | - Ryan L. Minster
- Department of Human Genetics, School of Public Health, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Daniel E. Weeks
- Department of Human Genetics, School of Public Health, University of Pittsburgh, Pittsburgh, PA, United States of America
- Department of Biostatistics, School of Public Health, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - James P. DeLany
- Advent Health Orlando, Translational Research Institute, Orlando, FL, United States of America
| | - Erin E. Kershaw
- Division of Endocrinology, Department of Medicine, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Stephen T. McGarvey
- Department of Epidemiology, International Health Institute, School of Public Health, Brown University, Providence, RI, United States of America
- Department of Anthropology, Brown University, Providence, RI, United States of America
| | - Nicola L. Hawley
- Department of Chronic Disease Epidemiology, Yale School of Public Health, New Haven, Connecticut, United States of America
| |
Collapse
|
5
|
Cahoon JL, Rui X, Tang E, Simons C, Langie J, Chen M, Lo YC, Chiang CWK. Imputation accuracy across global human populations. Am J Hum Genet 2024; 111:979-989. [PMID: 38604166 PMCID: PMC11080279 DOI: 10.1016/j.ajhg.2024.03.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 03/14/2024] [Accepted: 03/15/2024] [Indexed: 04/13/2024] Open
Abstract
Genotype imputation is now fundamental for genome-wide association studies but lacks fairness due to the underrepresentation of references from non-European ancestries. The state-of-the-art imputation reference panel released by the Trans-Omics for Precision Medicine (TOPMed) initiative improved the imputation of admixed African-ancestry and Hispanic/Latino samples, but imputation for populations primarily residing outside of North America may still fall short in performance due to persisting underrepresentation. To illustrate this point, we imputed the genotypes of over 43,000 individuals across 123 populations around the world and identified numerous populations where imputation accuracy paled in comparison to that of European-ancestry populations. For instance, the mean imputation r-squared (Rsq) for variants with minor allele frequencies between 1% and 5% in Saudi Arabians (n = 1,061), Vietnamese (n = 1,264), Thai (n = 2,435), and Papua New Guineans (n = 776) were 0.79, 0.78, 0.76, and 0.62, respectively, compared to 0.90-0.93 for comparable European populations matched in sample size and SNP array content. Outside of Africa and Latin America, Rsq appeared to decrease as genetic distances to European-ancestry reference increased, as predicted. Using sequencing data as ground truth, we also showed that Rsq may over-estimate imputation accuracy for non-European populations more than European populations, suggesting further disparity in accuracy between populations. Using 1,496 sequenced individuals from Taiwan Biobank as a second reference panel to TOPMed, we also assessed a strategy to improve imputation for non-European populations with meta-imputation, but this design did not improve accuracy across frequency spectra. Taken together, our analyses suggest that we must ultimately strive to increase diversity and size to promote equity within genetics research.
Collapse
Affiliation(s)
- Jordan L Cahoon
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, Los Angeles, CA 90033, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, Los Angeles, CA 90089, USA; Department of Computer Science, University of Southern California, Los Angeles, Los Angeles, CA 90089, USA
| | - Xinyue Rui
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, Los Angeles, CA 90033, USA
| | - Echo Tang
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, Los Angeles, CA 90089, USA
| | - Christopher Simons
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, Los Angeles, CA 90089, USA
| | - Jalen Langie
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, Los Angeles, CA 90033, USA
| | - Minhui Chen
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, Los Angeles, CA 90033, USA
| | - Ying-Chu Lo
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, Los Angeles, CA 90033, USA
| | - Charleston W K Chiang
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, Los Angeles, CA 90033, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, Los Angeles, CA 90089, USA; Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, Los Angeles, CA 90033, USA.
| |
Collapse
|
6
|
Dinh BL, Tang E, Taparra K, Nakatsuka N, Chen F, Chiang CWK. Recombination map tailored to Native Hawaiians may improve robustness of genomic scans for positive selection. Hum Genet 2024; 143:85-99. [PMID: 38157018 PMCID: PMC10794367 DOI: 10.1007/s00439-023-02625-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2023] [Accepted: 11/25/2023] [Indexed: 01/03/2024]
Abstract
Recombination events establish the patterns of haplotypic structure in a population and estimates of recombination rates are used in several downstream population and statistical genetic analyses. Using suboptimal maps from distantly related populations may reduce the efficacy of genomic analyses, particularly for underrepresented populations such as the Native Hawaiians. To overcome this challenge, we constructed recombination maps using genome-wide array data from two study samples of Native Hawaiians: one reflecting the current admixed state of Native Hawaiians (NH map) and one based on individuals of enriched Polynesian ancestries (PNS map) with the potential to be used for less admixed Polynesian populations such as the Samoans. We found the recombination landscape to be less correlated with those from other continental populations (e.g. Spearman's rho = 0.79 between PNS and CEU (Utah residents with Northern and Western European ancestry) compared to 0.92 between YRI (Yoruba in Ibadan, Nigeria) and CEU at 50 kb resolution), likely driven by the unique demographic history of the Native Hawaiians. PNS also shared the fewest recombination hotspots with other populations (e.g. 8% of hotspots shared between PNS and CEU compared to 27% of hotspots shared between YRI and CEU). We found that downstream analyses in the Native Hawaiian population, such as local ancestry inference, imputation, and IBD segment and relatedness detections, would achieve similar efficacy when using the NH map compared to an omnibus map. However, for genome scans of adaptive loci using integrated haplotype scores, we found several loci with apparent genome-wide significant signals (|Z-score|> 4) in Native Hawaiians that would not have been significant when analyzed using NH-specific maps. Population-specific recombination maps may therefore improve the robustness of haplotype-based statistics and help us better characterize the evolutionary history that may underlie Native Hawaiian-specific health conditions that persist today.
Collapse
Affiliation(s)
- Bryan L Dinh
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Echo Tang
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Kekoa Taparra
- Department of Radiation Oncology, Stanford University, Palo Alto, CA, USA
| | | | - Fei Chen
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Charleston W K Chiang
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
| |
Collapse
|
7
|
Lo YC, Chan TF, Jeon S, Maskarinec G, Taparra K, Nakatsuka N, Yu M, Chen CY, Lin YF, Wilkens LR, Le Marchand L, Haiman CA, Chiang CWK. The accuracy of polygenic score models for anthropometric traits and Type II Diabetes in the Native Hawaiian Population. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.12.25.23300499. [PMID: 38234828 PMCID: PMC10793530 DOI: 10.1101/2023.12.25.23300499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/19/2024]
Abstract
Polygenic scores (PGS) are promising in stratifying individuals based on the genetic susceptibility to complex diseases or traits. However, the accuracy of PGS models, typically trained in European- or East Asian-ancestry populations, tend to perform poorly in other ethnic minority populations, and their accuracies have not been evaluated for Native Hawaiians. Using body mass index, height, and type-2 diabetes as examples of highly polygenic traits, we evaluated the prediction accuracies of PGS models in a large Native Hawaiian sample from the Multiethnic Cohort with up to 5,300 individuals. We evaluated both publicly available PGS models or genome-wide PGS models trained in this study using the largest available GWAS. We found evidence of lowered prediction accuracies for the PGS models in some cases, particularly for height. We also found that using the Native Hawaiian samples as an optimization cohort during training did not consistently improve PGS performance. Moreover, even the best performing PGS models among Native Hawaiians would have lowered prediction accuracy among the subset of individuals most enriched with Polynesian ancestry. Our findings indicate that factors such as admixture histories, sample size and diversity in GWAS can influence PGS performance for complex traits among Native Hawaiian samples. This study provides an initial survey of PGS performance among Native Hawaiians and exposes the current gaps and challenges associated with improving polygenic prediction models for underrepresented minority populations.
Collapse
Affiliation(s)
- Ying-Chu Lo
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Tsz Fung Chan
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Soyoung Jeon
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Gertraud Maskarinec
- Epidemiology Program, University of Hawai'i Cancer Center, University of Hawai'i, Manoa, Honolulu, HI, USA
| | - Kekoa Taparra
- Standard Health Care, Department of Radiation Oncology, Palo Alto, CA, USA
| | | | - Mingrui Yu
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Neuropsychiatric Research, National Health Research Institutes, Miaoli, Taiwan
| | - Chia-Yen Chen
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Neuropsychiatric Research, National Health Research Institutes, Miaoli, Taiwan
- Biogen, Cambridge, MA, USA
- Psychiatric and Neurodevelopmental Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Yen-Feng Lin
- Center for Neuropsychiatric Research, National Health Research Institutes, Miaoli, Taiwan
- Department of Public Health & Medical Humanities, School of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
- Institute of Behavioral Medicine, College of Medicine, National Cheng Kung University, Tainan, Taiwan
| | - Lynne R Wilkens
- Epidemiology Program, University of Hawai'i Cancer Center, University of Hawai'i, Manoa, Honolulu, HI, USA
| | - Loic Le Marchand
- Epidemiology Program, University of Hawai'i Cancer Center, University of Hawai'i, Manoa, Honolulu, HI, USA
| | - Christopher A Haiman
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Cancer Epidemiology Program, Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, USA
| | - Charleston W K Chiang
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Cancer Epidemiology Program, Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| |
Collapse
|
8
|
Link V, Schraiber JG, Fan C, Dinh B, Mancuso N, Chiang CWK, Edge MD. Tree-based QTL mapping with expected local genetic relatedness matrices. Am J Hum Genet 2023; 110:2077-2091. [PMID: 38065072 PMCID: PMC10716520 DOI: 10.1016/j.ajhg.2023.10.017] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2023] [Revised: 10/26/2023] [Accepted: 10/27/2023] [Indexed: 12/18/2023] Open
Abstract
Understanding the genetic basis of complex phenotypes is a central pursuit of genetics. Genome-wide association studies (GWASs) are a powerful way to find genetic loci associated with phenotypes. GWASs are widely and successfully used, but they face challenges related to the fact that variants are tested for association with a phenotype independently, whereas in reality variants at different sites are correlated because of their shared evolutionary history. One way to model this shared history is through the ancestral recombination graph (ARG), which encodes a series of local coalescent trees. Recent computational and methodological breakthroughs have made it feasible to estimate approximate ARGs from large-scale samples. Here, we explore the potential of an ARG-based approach to quantitative-trait locus (QTL) mapping, echoing existing variance-components approaches. We propose a framework that relies on the conditional expectation of a local genetic relatedness matrix (local eGRM) given the ARG. Simulations show that our method is especially beneficial for finding QTLs in the presence of allelic heterogeneity. By framing QTL mapping in terms of the estimated ARG, we can also facilitate the detection of QTLs in understudied populations. We use local eGRM to analyze two chromosomes containing known body size loci in a sample of Native Hawaiians. Our investigations can provide intuition about the benefits of using estimated ARGs in population- and statistical-genetic methods in general.
Collapse
Affiliation(s)
- Vivian Link
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Joshua G Schraiber
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Caoqi Fan
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA; Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Bryan Dinh
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA; Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Nicholas Mancuso
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA; Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Charleston W K Chiang
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA; Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Michael D Edge
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
| |
Collapse
|
9
|
Paik KE, Hicklen R, Kaggwa F, Puyat CV, Nakayama LF, Ong BA, Shropshire JNI, Villanueva C. Digital Determinants of Health: Health data poverty amplifies existing health disparities-A scoping review. PLOS DIGITAL HEALTH 2023; 2:e0000313. [PMID: 37824445 PMCID: PMC10569513 DOI: 10.1371/journal.pdig.0000313] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/15/2023] [Accepted: 07/02/2023] [Indexed: 10/14/2023]
Abstract
Artificial intelligence (AI) and machine learning (ML) have an immense potential to transform healthcare as already demonstrated in various medical specialties. This scoping review focuses on the factors that influence health data poverty, by conducting a literature review, analysis, and appraisal of results. Health data poverty is often an unseen factor which leads to perpetuating or exacerbating health disparities. Improvements or failures in addressing health data poverty will directly impact the effectiveness of AI/ML systems. The potential causes are complex and may enter anywhere along the development process. The initial results highlighted studies with common themes of health disparities (72%), AL/ML bias (28%) and biases in input data (18%). To properly evaluate disparities that exist we recommend a strengthened effort to generate unbiased equitable data, improved understanding of the limitations of AI/ML tools, and rigorous regulation with continuous monitoring of the clinical outcomes of deployed tools.
Collapse
Affiliation(s)
- Kenneth Eugene Paik
- MIT Critical Data, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Rachel Hicklen
- Research Medical Library, MD Anderson Cancer Center, Houston, Texas, United States of America
| | - Fred Kaggwa
- Department of Computer Science, Mbarara University of Science & Technology, Mbarara, Uganda
| | | | - Luis Filipe Nakayama
- MIT Critical Data, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Department of Ophthalmology, São Paulo Federal University, São Paulo, Brazil
| | - Bradley Ashley Ong
- Department of Neurology, Neurological Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
| | | | - Cleva Villanueva
- Instituto Politécnico Nacional, Escuela Superior de Medicina, Mexico City, Mexico
| |
Collapse
|
10
|
Dinh BL, Tang E, Taparra K, Nakatsuka N, Chen F, Chiang CWK. Recombination map tailored to Native Hawaiians improves robustness of genomic scans for positive selection. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.12.548735. [PMID: 37503129 PMCID: PMC10370006 DOI: 10.1101/2023.07.12.548735] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Recombination events establish the patterns of haplotypic structure in a population and estimates of recombination rates are used in several downstream population and statistical genetic analyses. Using suboptimal maps from distantly related populations may reduce the efficacy of genomic analyses, particularly for underrepresented populations such as the Native Hawaiians. To overcome this challenge, we constructed recombination maps using genome-wide array data from two study samples of Native Hawaiians: one reflecting the current admixed state of Native Hawaiians (NH map), and one based on individuals of enriched Polynesian ancestries (PNS map) with the potential to be used for less admixed Polynesian populations such as the Samoans. We found the recombination landscape to be less correlated with those from other continental populations (e.g. Spearman's rho = 0.79 between PNS and CEU (Utah residents with Northern and Western European ancestry) compared to 0.92 between YRI (Yoruba in Ibadan, Nigeria) and CEU at 50 kb resolution), likely driven by the unique demographic history of the Native Hawaiians. PNS also shared the fewest recombination hotspots with other populations (e.g. 8% of hotspots shared between PNS and CEU compared to 27% of hotspots shared between YRI and CEU). We found that downstream analyses in the Native Hawaiian population, such as local ancestry inference, imputation, and IBD segment and relatedness detections, would achieve similar efficacy when using the NH map compared to an omnibus map. However, for genome scans of adaptive loci using integrated haplotype scores, we found several loci with apparent genome-wide significant signals (|Z-score| > 4) in Native Hawaiians that would not have been significant when analyzed using NH-specific maps. Population-specific recombination maps may therefore improve the robustness of haplotype-based statistics and help us better characterize the evolutionary history that may underlie Native Hawaiian-specific health conditions that persist today.
Collapse
Affiliation(s)
- Bryan L Dinh
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Echo Tang
- Department of Quantitative and Computational Biology, University of Southern California
| | - Kekoa Taparra
- Department of Radiation Oncology, Stanford University, Palo Alto, California
| | | | - Fei Chen
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Charleston W K Chiang
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| |
Collapse
|
11
|
Link V, Schraiber JG, Fan C, Dinh B, Mancuso N, Chiang CW, Edge MD. Tree-based QTL mapping with expected local genetic relatedness matrices. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.07.536093. [PMID: 37066144 PMCID: PMC10104234 DOI: 10.1101/2023.04.07.536093] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/18/2023]
Abstract
Understanding the genetic basis of complex phenotypes is a central pursuit of genetics. Genome-wide Association Studies (GWAS) are a powerful way to find genetic loci associated with phenotypes. GWAS are widely and successfully used, but they face challenges related to the fact that variants are tested for association with a phenotype independently, whereas in reality variants at different sites are correlated because of their shared evolutionary history. One way to model this shared history is through the ancestral recombination graph (ARG), which encodes a series of local coalescent trees. Recent computational and methodological breakthroughs have made it feasible to estimate approximate ARGs from large-scale samples. Here, we explore the potential of an ARG-based approach to quantitative-trait locus (QTL) mapping, echoing existing variance-components approaches. We propose a framework that relies on the conditional expectation of a local genetic relatedness matrix given the ARG (local eGRM). Simulations show that our method is especially beneficial for finding QTLs in the presence of allelic heterogeneity. By framing QTL mapping in terms of the estimated ARG, we can also facilitate the detection of QTLs in understudied populations. We use local eGRM to identify a large-effect BMI locus, the CREBRF gene, in a sample of Native Hawaiians in which it was not previously detectable by GWAS because of a lack of population-specific imputation resources. Our investigations can provide intuition about the benefits of using estimated ARGs in population- and statistical-genetic methods in general.
Collapse
Affiliation(s)
- Vivian Link
- Department of Quantitative and Computational Biology, University of Southern California
| | - Joshua G. Schraiber
- Department of Quantitative and Computational Biology, University of Southern California
| | - Caoqi Fan
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Bryan Dinh
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Nicholas Mancuso
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Charleston W.K. Chiang
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Michael D. Edge
- Department of Quantitative and Computational Biology, University of Southern California
| |
Collapse
|
12
|
Zhang JZ, Heinsberg LW, Krishnan M, Hawley NL, Major TJ, Carlson JC, Hindmarsh JH, Watson H, Qasim M, Stamp LK, Dalbeth N, Murphy R, Sun G, Cheng H, Naseri T, Reupena MS, Kershaw EE, Deka R, McGarvey ST, Minster RL, Merriman TR, Weeks DE. Multivariate analysis of a missense variant in CREBRF reveals associations with measures of adiposity in people of Polynesian ancestries. Genet Epidemiol 2023; 47:105-118. [PMID: 36352773 PMCID: PMC9892232 DOI: 10.1002/gepi.22508] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 09/02/2022] [Accepted: 10/05/2022] [Indexed: 11/11/2022]
Abstract
The minor allele of rs373863828, a missense variant in CREB3 Regulatory Factor, is associated with several cardiometabolic phenotypes in Polynesian peoples. To better understand the variant, we tested the association of rs373863828 with a panel of correlated phenotypes (body mass index [BMI], weight, height, HDL cholesterol, triglycerides, and total cholesterol) using multivariate Bayesian association and network analyses in a Samoa cohort (n = 1632), Aotearoa New Zealand cohort (n = 1419), and combined cohort (n = 2976). An expanded set of phenotypes (adding estimated fat and fat-free mass, abdominal circumference, hip circumference, and abdominal-hip ratio) was tested in the Samoa cohort (n = 1496). In the Samoa cohort, we observed significant associations (log10 Bayes Factor [BF] ≥ 5.0) between rs373863828 and the overall phenotype panel (8.81), weight (8.30), and BMI (6.42). In the Aotearoa New Zealand cohort, we observed suggestive associations (1.5 < log10 BF < 5) between rs373863828 and the overall phenotype panel (4.60), weight (3.27), and BMI (1.80). In the combined cohort, we observed concordant signals with larger log10 BFs. In the Samoa-specific expanded phenotype analyses, we also observed significant associations between rs373863828 and fat mass (5.65), abdominal circumference (5.34), and hip circumference (5.09). Bayesian networks provided evidence for a direct association of rs373863828 with weight and indirect associations with height and BMI.
Collapse
Affiliation(s)
- Jerry Z. Zhang
- Department of Biostatistics, School of Public Health, University of Pittsburgh, Pittsburgh, PA
| | - Lacey W. Heinsberg
- Department of Human Genetics, School of Public Health, University of Pittsburgh, Pittsburgh, PA
| | - Mohanraj Krishnan
- Department of Human Genetics, School of Public Health, University of Pittsburgh, Pittsburgh, PA
| | - Nicola L. Hawley
- Department of Chronic Disease Epidemiology, Yale University School of Public Health, New Haven, CT
| | - Tanya J. Major
- Department of Biochemistry, University of Otago, Dunedin, New Zealand
| | - Jenna C. Carlson
- Department of Biostatistics, School of Public Health, University of Pittsburgh, Pittsburgh, PA
- Department of Human Genetics, School of Public Health, University of Pittsburgh, Pittsburgh, PA
| | | | - Huti Watson
- Ngāti Porou Hauora Charitable Trust, Te Puia Springs, Tairāwhiti, New Zealand
| | - Muhammad Qasim
- Ngāti Porou Hauora Charitable Trust, Te Puia Springs, Tairāwhiti, New Zealand
| | - Lisa K. Stamp
- Department of Medicine, University of Otago, Christchurch, New Zealand
| | - Nicola Dalbeth
- Department of Medicine, University of Auckland, Auckland, New Zealand
| | - Rinki Murphy
- Department of Medicine, University of Auckland, Auckland, New Zealand
- Maurice Wilkins Centre for Molecular Biodiscovery, Auckland, New Zealand
| | - Guangyun Sun
- Department of Environmental and Public Health Sciences, College of Medicine, University of Cincinnati, Cincinnati, OH
| | - Hong Cheng
- Department of Environmental and Public Health Sciences, College of Medicine, University of Cincinnati, Cincinnati, OH
| | - Take Naseri
- Ministry of Health, Government of Samoa, Apia, Samoa
| | | | - Erin E. Kershaw
- Division of Endocrinology and Metabolism, Department of Medicine, University of Pittsburgh, Pittsburgh, PA
| | - Ranjan Deka
- Department of Environmental and Public Health Sciences, College of Medicine, University of Cincinnati, Cincinnati, OH
| | - Stephen T. McGarvey
- International Health Institute, Department of Epidemiology, School of Public Health, Brown University, Providence, RI
- Department of Anthropology, Brown University, Providence, RI
| | - Ryan L. Minster
- Department of Human Genetics, School of Public Health, University of Pittsburgh, Pittsburgh, PA
| | - Tony R. Merriman
- Department of Biochemistry, University of Otago, Dunedin, New Zealand
- Maurice Wilkins Centre for Molecular Biodiscovery, Auckland, New Zealand
- Division of Clinical Immunology and Rheumatology, University of Alabama Birmingham, Birmingham, AL
| | - Daniel E. Weeks
- Department of Biostatistics, School of Public Health, University of Pittsburgh, Pittsburgh, PA
- Department of Human Genetics, School of Public Health, University of Pittsburgh, Pittsburgh, PA
| |
Collapse
|
13
|
Pascal LE, Frahm KA, Skalitzky KO, DeFranco DB, Rigatti LH, Lu R, Liu TT. Genetic alterations in CREBRF influence prostate cancer survival and impact prostate tissue homeostasis in mice. AMERICAN JOURNAL OF CLINICAL AND EXPERIMENTAL UROLOGY 2023; 11:27-39. [PMID: 36923723 PMCID: PMC10009309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 11/06/2022] [Accepted: 02/16/2023] [Indexed: 03/18/2023]
Abstract
BACKGROUND Risk factors for prostate cancer include age, environment, race and ethnicity. Genetic variants in cyclic-adenosine-monophosphate-response-element-binding protein 3 regulatory factor (CREBRF) gene are frequently observed in Pacific Islanders, a population with elevated prostate cancer incidence. CREBRF has been shown to play a role in other cancers, however its function in prostate homeostasis and tumorigenesis has not been previously explored. We determined the incidence of CREBRF alterations in publicly available databases and examined the impact of CREBRF deletion on the murine prostate in order to determine whether CREBRF impacts prostate physiology or pathophysiology. METHODS Alterations in CREBRF were identified in prostate cancer patients via in silico analysis of several publicly available datasets through cBioPortal. Male Crebrf knockout and wild-type littermate mice were generated and examined for prostate defects at 4 months of age. Immunohistochemical staining of murine prostate sections was used to determine the impact of Crebrf knockout on proliferation, apoptosis, inflammation and blood vessel density in the prostate. Serum adipokine levels were measured using a Luminex Multiplex Assay. RESULTS CREBRF alterations were identified in up to 4.05% of prostate tumors and the mutations identified were categorized as likely damaging. Median survival of prostate cancer patients with genetic alterations in CREBRF was 41.23 months, compared to 131 months for patients without these changes. In the murine model, the prostates of Crebrf knockout mice had reduced epithelial proliferation and increased TUNEL+ apoptotic cells. Circulating adipokines PAI-1 and MCP-1 were also altered in Crebrf knockout mice compared to age-matched controls. CONCLUSIONS Prostate cancer patients with genetic alterations in CREBRF had a significantly decreased overall survival suggesting that wild type CREBRF may play a role in limiting prostate tumorigenesis and progression. The murine knockout model demonstrated that CREBRF could modulate proliferation and apoptosis and macrophage density in the prostate. Serum levels of adipokines PAI-1 and MCP-1 were also altered and may contribute to the phenotypic changes observed in the prostates of Crebrf knockout mice. Future studies focused on populations susceptible to CREBRF mutations and mechanistic studies will be required to fully elucidate the potential role of CREBRF in prostate tumorigenesis.
Collapse
Affiliation(s)
- Laura E Pascal
- Department of Pharmacology and Chemical Biology, University of Pittsburgh School of Medicine Pittsburgh, PA, USA.,UPMC Hillman Cancer Center, University of Pittsburgh School of Medicine Pittsburgh, PA, USA
| | - Krystle A Frahm
- Department of Pharmacology and Chemical Biology, University of Pittsburgh School of Medicine Pittsburgh, PA, USA
| | | | - Donald B DeFranco
- Department of Pharmacology and Chemical Biology, University of Pittsburgh School of Medicine Pittsburgh, PA, USA.,Pittsburgh Institute for Neurodegenerative Diseases, University of Pittsburgh School of Medicine Pittsburgh, PA, USA
| | - Lora H Rigatti
- Division of Laboratory Animal Resources, University of Pittsburgh School of Medicine Pittsburgh, PA, USA
| | - Ray Lu
- Department of Molecular and Cellular Biology, University of Guelph Guelph, ON, Canada
| | - Teresa T Liu
- Department of Urology, University of Wisconsin Madison, WI, USA
| |
Collapse
|
14
|
Developing CIRdb as a catalog of natural genetic variation in the Canary Islanders. Sci Rep 2022; 12:16132. [PMID: 36168029 PMCID: PMC9514705 DOI: 10.1038/s41598-022-20442-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 09/13/2022] [Indexed: 11/29/2022] Open
Abstract
The current inhabitants of the Canary Islands have a unique genetic makeup in the European diversity landscape due to the existence of African footprints from recent admixture events, especially of North African components (> 20%). The underrepresentation of non-Europeans in genetic studies and the sizable North African ancestry, which is nearly absent from all existing catalogs of worldwide genetic diversity, justify the need to develop CIRdb, a population-specific reference catalog of natural genetic variation in the Canary Islanders. Based on array genotyping of the selected unrelated donors and comparisons against available datasets from European, sub-Saharan, and North African populations, we illustrate the intermediate genetic differentiation of Canary Islanders between Europeans and North Africans and the existence of within-population differences that are likely driven by genetic isolation. Here we describe the overall design and the methods that are being implemented to further develop CIRdb. This resource will help to strengthen the implementation of Precision Medicine in this population by contributing to increase the diversity in genetic studies. Among others, this will translate into improved ability to fine map disease genes and simplify the identification of causal variants and estimate the prevalence of unattended Mendelian diseases.
Collapse
|
15
|
Fan C, Mancuso N, Chiang CWK. A genealogical estimate of genetic relationships. Am J Hum Genet 2022; 109:812-824. [PMID: 35417677 PMCID: PMC9118131 DOI: 10.1016/j.ajhg.2022.03.016] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2021] [Accepted: 03/25/2022] [Indexed: 12/23/2022] Open
Abstract
The application of genetic relationships among individuals, characterized by a genetic relationship matrix (GRM), has far-reaching effects in human genetics. However, the current standard to calculate the GRM treats linked markers as independent and does not explicitly model the underlying genealogical history of the study sample. Here, we propose a coalescent-informed framework, namely the expected GRM (eGRM), to infer the expected relatedness between pairs of individuals given an ancestral recombination graph (ARG) of the sample. Through extensive simulations, we show that the eGRM is an unbiased estimate of latent pairwise genome-wide relatedness and is robust when computed with ARG inferred from incomplete genetic data. As a result, the eGRM better captures the structure of a population than the canonical GRM, even when using the same genetic information. More importantly, our framework allows a principled approach to estimate the eGRM at different time depths of the ARG, thereby revealing the time-varying nature of population structure in a sample. When applied to SNP array genotypes from a population sample from Northern and Eastern Finland, we find that clustering analysis with the eGRM reveals population structure driven by subpopulations that would not be apparent via the canonical GRM and that temporally the population model is consistent with recent divergence and expansion. Taken together, our proposed eGRM provides a robust tree-centric estimate of relatedness with wide application to genetic studies.
Collapse
Affiliation(s)
- Caoqi Fan
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
| | - Nicholas Mancuso
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA; Norris Comprehensive Cancer Center, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Charleston W K Chiang
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
| |
Collapse
|
16
|
Limited Metabolic Effect of the CREBRF R457Q Obesity Variant in Mice. Cells 2022; 11:cells11030497. [PMID: 35159305 PMCID: PMC8833978 DOI: 10.3390/cells11030497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 01/21/2022] [Accepted: 01/25/2022] [Indexed: 11/17/2022] Open
Abstract
The Arg457Gln missense variant in the CREBRF gene has previously been identified as driving excess body weight in Pacific/Oceanic populations. Intriguingly, Arg457Gln variant carriers also demonstrate paradoxical reductions in diabetes risk, indicating that the gene has a critical role in whole-body metabolism. To study the function of this variant in more detail, we generated mice on an FVB/N background with the Crebrf Arg458Gln variant knocked in to replace the endogenous Crebrf. The whole-body metabolic phenotype was characterized for male and female mice on a regular chow diet or an 8-week high-fat challenge. Regular assessment of body composition found that the Crebrf variant had no influence on total body weight or fat mass at any time point. Glucose tolerance tests demonstrated no obvious genotype effect on glucose homeostasis, with indirect calorimetry measures of whole-body energy expenditure likewise unaffected. Male chow-fed variant carriers displayed a trend towards increased lean mass and significantly reduced sensitivity to insulin administration. Overall, this novel mouse model showed only limited phenotypic effects associated with the Crebrf missense variant. The inability to recapitulate results of human association studies may invite reconsideration of the precise mechanistic link between CREBRF function and the risks of obesity and diabetes in variant allele carriers.
Collapse
|
17
|
Xu ZM, Rüeger S, Zwyer M, Brites D, Hiza H, Reinhard M, Rutaihwa L, Borrell S, Isihaka F, Temba H, Maroa T, Naftari R, Hella J, Sasamalo M, Reither K, Portevin D, Gagneux S, Fellay J. Using population-specific add-on polymorphisms to improve genotype imputation in underrepresented populations. PLoS Comput Biol 2022; 18:e1009628. [PMID: 35025869 PMCID: PMC8791479 DOI: 10.1371/journal.pcbi.1009628] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2021] [Revised: 01/26/2022] [Accepted: 11/10/2021] [Indexed: 12/13/2022] Open
Abstract
Genome-wide association studies rely on the statistical inference of untyped variants, called imputation, to increase the coverage of genotyping arrays. However, the results are often suboptimal in populations underrepresented in existing reference panels and array designs, since the selected single nucleotide polymorphisms (SNPs) may fail to capture population-specific haplotype structures, hence the full extent of common genetic variation. Here, we propose to sequence the full genomes of a small subset of an underrepresented study cohort to inform the selection of population-specific add-on tag SNPs and to generate an internal population-specific imputation reference panel, such that the remaining array-genotyped cohort could be more accurately imputed. Using a Tanzania-based cohort as a proof-of-concept, we demonstrate the validity of our approach by showing improvements in imputation accuracy after the addition of our designed add-on tags to the base H3Africa array. Genome-wide association studies, which study the association between genetic variants and various phenotypes, typically rely on genotyping arrays. Only a small proportion of genetic variants within the genome are typed on genotyping arrays. Untyped variants are statistically inferred through a process known as genotype imputation, where correlations between variants (haplotypes) observed in external reference panels are leveraged to infer untyped variants in the study population. However, for study populations that are underrepresented in existing reference panels, the quality of imputation is often sub-optimal. This is because typed variants incorporated on existing genotyping arrays can be unsuitable for the study population, and haplotype structures can be different between the reference and the study population. Here, we illustrate an approach to select a custom set of population-specific typed variants to improve genotype imputation in such underrepresented populations.
Collapse
Affiliation(s)
- Zhi Ming Xu
- School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Sina Rüeger
- School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Michaela Zwyer
- Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Basel, Switzerland
| | - Daniela Brites
- Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Basel, Switzerland
| | - Hellen Hiza
- Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Basel, Switzerland
- Ifakara Health Institute, Dar es Salaam, Tanzania
| | - Miriam Reinhard
- Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Basel, Switzerland
| | - Liliana Rutaihwa
- Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Basel, Switzerland
| | - Sonia Borrell
- Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Basel, Switzerland
| | | | | | - Thomas Maroa
- Ifakara Health Institute, Dar es Salaam, Tanzania
| | | | - Jerry Hella
- Ifakara Health Institute, Dar es Salaam, Tanzania
| | | | - Klaus Reither
- Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Basel, Switzerland
| | - Damien Portevin
- Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Basel, Switzerland
| | - Sebastien Gagneux
- Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Basel, Switzerland
| | - Jacques Fellay
- School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Precision Medicine Unit, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
- * E-mail:
| |
Collapse
|
18
|
Burden HJ, Adams S, Kulatea B, Wright-McNaughton M, Sword D, Ormsbee JJ, Watene-O'Sullivan C, Merriman TR, Knopp JL, Chase JG, Krebs JD, Hall RM, Plank LD, Murphy R, Shepherd PR, Merry TL. The CREBRF diabetes-protective rs373863828-A allele is associated with enhanced early insulin release in men of Māori and Pacific ancestry. Diabetologia 2021; 64:2779-2789. [PMID: 34417843 DOI: 10.1007/s00125-021-05552-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Accepted: 06/29/2021] [Indexed: 10/20/2022]
Abstract
AIMS/HYPOTHESIS The minor A allele of rs373863828 (CREBRF p.Arg457Gln) is associated with increased BMI, but reduced risk of type 2 and gestational diabetes in Polynesian (Pacific peoples and Aotearoa New Zealand Māori) populations. This study investigates the effect of the A allele on insulin release and sensitivity in overweight/obese men without diabetes. METHODS A mixed meal tolerance test was completed by 172 men (56 with the A allele) of Māori or Pacific ancestry, and 44 (24 with the A allele) had a frequently sampled IVGTT and hyperinsulinaemic-euglycaemic clamp. Mixed linear models with covariates age, ancestry and BMI were used to analyse the association between the A allele of rs373863828 and markers of insulin release and blood glucose regulation. RESULTS The A allele of rs373863828 is associated with a greater increase in plasma insulin 30 min following a meal challenge without affecting the elevation in plasma glucose or incretins glucagon-like polypeptide-1 or gastric inhibitory polypeptide. Consistent with this point, following an i.v. infusion of a glucose bolus, participants with an A allele had higher early (p < 0.05 at 2 and 4 min) plasma insulin and C-peptide concentrations for a similar elevation in blood glucose as those homozygous for the major (G) allele. Despite increased plasma insulin, rs373863828 genotype was not associated with a significant difference (p > 0.05) in insulin sensitivity index or glucose disposal during hyperinsulinaemic-euglycaemic clamp. CONCLUSIONS/INTERPRETATION rs373863828-A allele associates with increased glucose-stimulated insulin release without affecting insulin sensitivity, suggesting that CREBRF p.Arg457Gln may increase insulin release to reduce the risk of type 2 diabetes.
Collapse
Affiliation(s)
- Hannah J Burden
- Department of Molecular Medicine and Pathology, School of Medical Sciences, The University of Auckland, Auckland, New Zealand
- Maurice Wilkins Centre for Molecular Biodiscovery, The University of Auckland, Auckland, New Zealand
| | - Shannon Adams
- Discipline of Nutrition, School of Medical Sciences, The University of Auckland, Auckland, New Zealand
| | - Braydon Kulatea
- Discipline of Nutrition, School of Medical Sciences, The University of Auckland, Auckland, New Zealand
| | | | - Danielle Sword
- Department of Medicine, University of Otago Wellington, Wellington, New Zealand
| | - Jennifer J Ormsbee
- Centre for Bioengineering, Department of Mechanical Bioengineering, University of Canterbury, Christchurch, New Zealand
| | - Conor Watene-O'Sullivan
- Maurice Wilkins Centre for Molecular Biodiscovery, The University of Auckland, Auckland, New Zealand
- Moko Foundation, Kaitaia, New Zealand
- Waharoa Ki Te Toi Research Centre, Kaitaia, New Zealand
| | - Tony R Merriman
- Maurice Wilkins Centre for Molecular Biodiscovery, The University of Auckland, Auckland, New Zealand
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
- Division of Clinical Immunology and Rheumatology, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Jennifer L Knopp
- Centre for Bioengineering, Department of Mechanical Bioengineering, University of Canterbury, Christchurch, New Zealand
| | - J Geoffrey Chase
- Centre for Bioengineering, Department of Mechanical Bioengineering, University of Canterbury, Christchurch, New Zealand
| | - Jeremy D Krebs
- Maurice Wilkins Centre for Molecular Biodiscovery, The University of Auckland, Auckland, New Zealand
- Department of Medicine, University of Otago Wellington, Wellington, New Zealand
| | - Rosemary M Hall
- Maurice Wilkins Centre for Molecular Biodiscovery, The University of Auckland, Auckland, New Zealand
- Department of Medicine, University of Otago Wellington, Wellington, New Zealand
| | - Lindsay D Plank
- Department of Surgery, School of Medicine, The University of Auckland, Auckland, New Zealand
| | - Rinki Murphy
- Maurice Wilkins Centre for Molecular Biodiscovery, The University of Auckland, Auckland, New Zealand
- Department of Medicine, School of Medicine, The University of Auckland, Auckland, New Zealand
| | - Peter R Shepherd
- Department of Molecular Medicine and Pathology, School of Medical Sciences, The University of Auckland, Auckland, New Zealand
- Maurice Wilkins Centre for Molecular Biodiscovery, The University of Auckland, Auckland, New Zealand
| | - Troy L Merry
- Maurice Wilkins Centre for Molecular Biodiscovery, The University of Auckland, Auckland, New Zealand.
- Discipline of Nutrition, School of Medical Sciences, The University of Auckland, Auckland, New Zealand.
| |
Collapse
|
19
|
Chiang CWK. The Opportunities and Challenges of Integrating Population Histories Into Genetic Studies for Diverse Populations: A Motivating Example From Native Hawaiians. Front Genet 2021; 12:643883. [PMID: 34646295 PMCID: PMC8503554 DOI: 10.3389/fgene.2021.643883] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2020] [Accepted: 08/19/2021] [Indexed: 11/25/2022] Open
Abstract
There is a well-recognized need to include diverse populations in genetic studies, but several obstacles continue to be prohibitive, including (but are not limited to) the difficulty of recruiting individuals from diverse populations in large numbers and the lack of representation in available genomic references. These obstacles notwithstanding, studying multiple diverse populations would provide informative, population-specific insights. Using Native Hawaiians as an example of an understudied population with a unique evolutionary history, I will argue that by developing key genomic resources and integrating evolutionary thinking into genetic epidemiology, we will have the opportunity to efficiently advance our knowledge of the genetic risk factors, ameliorate health disparity, and improve healthcare in this underserved population.
Collapse
Affiliation(s)
- Charleston W K Chiang
- Department of Population and Public Health Sciences, Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, United States.,Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, United States
| |
Collapse
|
20
|
Kanshana JS, Mattila PE, Ewing MC, Wood AN, Schoiswohl G, Meyer AC, Kowalski A, Rosenthal SL, Gingras S, Kaufman BA, Lu R, Weeks DE, McGarvey ST, Minster RL, Hawley NL, Kershaw EE. A murine model of the human CREBRFR457Q obesity-risk variant does not influence energy or glucose homeostasis in response to nutritional stress. PLoS One 2021; 16:e0251895. [PMID: 34520472 PMCID: PMC8439463 DOI: 10.1371/journal.pone.0251895] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Accepted: 08/09/2021] [Indexed: 01/02/2023] Open
Abstract
Obesity and diabetes have strong heritable components, yet the genetic contributions to these diseases remain largely unexplained. In humans, a missense variant in Creb3 regulatory factor (CREBRF) [rs373863828 (p.Arg457Gln); CREBRFR457Q] is strongly associated with increased odds of obesity but decreased odds of diabetes. Although virtually nothing is known about CREBRF's mechanism of action, emerging evidence implicates it in the adaptive transcriptional response to nutritional stress downstream of TORC1. The objectives of this study were to generate a murine model with knockin of the orthologous variant in mice (CREBRFR458Q) and to test the hypothesis that this CREBRF variant promotes obesity and protects against diabetes by regulating energy and glucose homeostasis downstream of TORC1. To test this hypothesis, we performed extensive phenotypic analysis of CREBRFR458Q knockin mice at baseline and in response to acute (fasting/refeeding), chronic (low- and high-fat diet feeding), and extreme (prolonged fasting) nutritional stress as well as with pharmacological TORC1 inhibition, and aging to 52 weeks. The results demonstrate that the murine CREBRFR458Q model of the human CREBRFR457Q variant does not influence energy/glucose homeostasis in response to these interventions, with the exception of possible greater loss of fat relative to lean mass with age. Alternative preclinical models and/or studies in humans will be required to decipher the mechanisms linking this variant to human health and disease.
Collapse
Affiliation(s)
- Jitendra S. Kanshana
- Division of Endocrinology and Metabolism, Department of Medicine, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Polly E. Mattila
- Division of Endocrinology and Metabolism, Department of Medicine, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Michael C. Ewing
- Division of Endocrinology and Metabolism, Department of Medicine, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Ashlee N. Wood
- Division of Endocrinology and Metabolism, Department of Medicine, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Gabriele Schoiswohl
- Department of Pharmacology and Toxicology, University of Graz, Graz, Austria
| | - Anna C. Meyer
- Division of Endocrinology and Metabolism, Department of Medicine, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Aneta Kowalski
- Division of Endocrinology and Metabolism, Department of Medicine, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Samantha L. Rosenthal
- Center for Craniofacial and Dental Genetics, Department of Oral and Craniofacial Sciences, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Sebastien Gingras
- Department of Immunology, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Brett A. Kaufman
- Division of Cardiology, Department of Medicine, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Ray Lu
- Department of Molecular and Cellular Biology, College of Biological Science, University of Guelph, Guelph, ON, Canada
| | - Daniel E. Weeks
- Department of Human Genetics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
- Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Stephen T. McGarvey
- International Health Institute, Department of Epidemiology, Brown University School of Public Health, Providence, Rhode Island, United States of America
- Department of Anthropology, Brown University, Providence, Rhode Island, United States of America
| | - Ryan L. Minster
- Department of Human Genetics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Nicola L. Hawley
- Department of Chronic Disease Epidemiology, Yale School of Public Health, New Haven, Connecticut, United States of America
| | - Erin E. Kershaw
- Division of Endocrinology and Metabolism, Department of Medicine, University of Pittsburgh, Pittsburgh, PA, United States of America
| |
Collapse
|
21
|
Müller SJ, Schurz H, Tromp G, van der Spuy GD, Hoal EG, van Helden PD, Owusu-Dabo E, Meyer CG, Muntau B, Thye T, Niemann S, Warren RM, Streicher E, Möller M, Kinnear C. A multi-phenotype genome-wide association study of clades causing tuberculosis in a Ghanaian- and South African cohort. Genomics 2021; 113:1802-1815. [PMID: 33862184 DOI: 10.1016/j.ygeno.2021.04.024] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2020] [Revised: 03/26/2021] [Accepted: 04/11/2021] [Indexed: 01/31/2023]
Abstract
Despite decades of research and advancements in diagnostics and treatment, tuberculosis remains a major public health concern. New computational methods are needed to interrogate the intersection of host- and bacterial genomes. Paired host genotype datum and infecting bacterial isolate information were analysed for associations using a multinomial logistic regression framework implemented in SNPTest. A cohort of 853 admixed South African participants and a Ghanaian cohort of 1359 participants were included. Two directly genotyped variants, namely rs529920 and rs41472447, were identified in the Ghanaian cohort as being statistically significantly associated with risk for infection with strains of different members of the MTBC. Thus, a multinomial logistic regression using paired host-pathogen data may prove valuable for investigating the complex relationships driving infectious disease.
Collapse
Affiliation(s)
- Stephanie J Müller
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa; South African Tuberculosis Bioinformatics Initiative (SATBBI), Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa.
| | - Haiko Schurz
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa; South African Tuberculosis Bioinformatics Initiative (SATBBI), Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Gerard Tromp
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa; South African Tuberculosis Bioinformatics Initiative (SATBBI), Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Gian D van der Spuy
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa; South African Tuberculosis Bioinformatics Initiative (SATBBI), Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Eileen G Hoal
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Paul D van Helden
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Ellis Owusu-Dabo
- School of Public Health, College of Health Sciences, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| | - Christian G Meyer
- Institute of Tropical Medicine, Eberhard-Karls University, Tübingen, Germany; Faculty of Medicine, Duy Tan University, Da Nang, Vietnam
| | - Birgit Muntau
- National Reference Centre for Tropical Pathogens, Bernhard Nocht Institute for Tropical Medicine, Hamburg, Germany
| | - Thorsten Thye
- Bernhard Nocht Institute for Tropical Medicine, Hamburg, Germany
| | - Stefan Niemann
- German Centre for Infection Research (DZIF), Partner site Hamburg-Lübeck-Borstel, Borstel, Germany
| | - Robin M Warren
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Elizabeth Streicher
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Marlo Möller
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Craig Kinnear
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| |
Collapse
|
22
|
Sun H, Lin M, Russell EM, Minster RL, Chan TF, Dinh BL, Naseri T, Reupena MS, Lum-Jones A, Cheng I, Wilkens LR, Le Marchand L, Haiman CA, Chiang CWK. The impact of global and local Polynesian genetic ancestry on complex traits in Native Hawaiians. PLoS Genet 2021; 17:e1009273. [PMID: 33571193 PMCID: PMC7877570 DOI: 10.1371/journal.pgen.1009273] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Accepted: 11/18/2020] [Indexed: 12/17/2022] Open
Abstract
Epidemiological studies of obesity, Type-2 diabetes (T2D), cardiovascular diseases and several common cancers have revealed an increased risk in Native Hawaiians compared to European- or Asian-Americans living in the Hawaiian islands. However, there remains a gap in our understanding of the genetic factors that affect the health of Native Hawaiians. To fill this gap, we studied the genetic risk factors at both the chromosomal and sub-chromosomal scales using genome-wide SNP array data on ~4,000 Native Hawaiians from the Multiethnic Cohort. We estimated the genomic proportion of Native Hawaiian ancestry ("global ancestry," which we presumed to be Polynesian in origin), as well as this ancestral component along each chromosome ("local ancestry") and tested their respective association with binary and quantitative cardiometabolic traits. After attempting to adjust for non-genetic covariates evaluated through questionnaires, we found that per 10% increase in global Polynesian genetic ancestry, there is a respective 8.6%, and 11.0% increase in the odds of being diabetic (P = 1.65×10-4) and having heart failure (P = 2.18×10-4), as well as a 0.059 s.d. increase in BMI (P = 1.04×10-10). When testing the association of local Polynesian ancestry with risk of disease or biomarkers, we identified a chr6 region associated with T2D. This association was driven by an uniquely prevalent variant in Polynesian ancestry individuals. However, we could not replicate this finding in an independent Polynesian cohort from Samoa due to the small sample size of the replication cohort. In conclusion, we showed that Polynesian ancestry, which likely capture both genetic and lifestyle risk factors, is associated with an increased risk of obesity, Type-2 diabetes, and heart failure, and that larger cohorts of Polynesian ancestry individuals will be needed to replicate the putative association on chr6 with T2D.
Collapse
Affiliation(s)
- Hanxiao Sun
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Meng Lin
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Emily M. Russell
- Department of Human Genetics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Ryan L. Minster
- Department of Human Genetics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Tsz Fung Chan
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Bryan L. Dinh
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California, United States of America
| | - Take Naseri
- Ministry of Health, Government of Samoa, Apia, Samoa
| | | | - Annette Lum-Jones
- Epidemiology Program, University of Hawai‘i Cancer Center, University of Hawai‘i, Manoa, Honolulu, Hawaii, United States of America
| | | | - Iona Cheng
- Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, California, United States of America
| | - Lynne R. Wilkens
- Epidemiology Program, University of Hawai‘i Cancer Center, University of Hawai‘i, Manoa, Honolulu, Hawaii, United States of America
| | - Loïc Le Marchand
- Epidemiology Program, University of Hawai‘i Cancer Center, University of Hawai‘i, Manoa, Honolulu, Hawaii, United States of America
| | - Christopher A. Haiman
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Charleston W. K. Chiang
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California, United States of America
| |
Collapse
|
23
|
Atkinson EG, Maihofer AX, Kanai M, Martin AR, Karczewski KJ, Santoro ML, Ulirsch JC, Kamatani Y, Okada Y, Finucane HK, Koenen KC, Nievergelt CM, Daly MJ, Neale BM. Tractor uses local ancestry to enable the inclusion of admixed individuals in GWAS and to boost power. Nat Genet 2021; 53:195-204. [PMID: 33462486 PMCID: PMC7867648 DOI: 10.1038/s41588-020-00766-y] [Citation(s) in RCA: 123] [Impact Index Per Article: 30.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2020] [Accepted: 12/15/2020] [Indexed: 12/26/2022]
Abstract
Admixed populations are routinely excluded from genomic studies due to concerns over population structure. Here, we present a statistical framework and software package, Tractor, to facilitate the inclusion of admixed individuals in association studies by leveraging local ancestry. We test Tractor with simulated and empirical two-way admixed African-European cohorts. Tractor generates accurate ancestry-specific effect-size estimates and P values, can boost genome-wide association study (GWAS) power and improves the resolution of association signals. Using a local ancestry-aware regression model, we replicate known hits for blood lipids, discover novel hits missed by standard GWAS and localize signals closer to putative causal variants.
Collapse
Affiliation(s)
- Elizabeth G Atkinson
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA.
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Adam X Maihofer
- Department of Psychiatry, University of California, San Diego, La Jolla, CA, USA
| | - Masahiro Kanai
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Program in Bioinformatics and Integrative Genomics, Harvard Medical School, Boston, MA, USA
- Department of Statistical Genetics, Graduate School of Medicine, Osaka University, Suita, Japan
| | - Alicia R Martin
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Konrad J Karczewski
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Marcos L Santoro
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Departamento de Psiquiatria, Universidade Federal de São Paulo, São Paulo, Brazil
- Departamento de Morfologia e Genética, Universidade Federal de São Paulo, São Paulo, Brazil
| | - Jacob C Ulirsch
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA
| | - Yoichiro Kamatani
- Laboratory of Complex Trait Genomics, Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan
| | - Yukinori Okada
- Department of Statistical Genetics, Graduate School of Medicine, Osaka University, Suita, Japan
- Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Osaka University, Suita, Japan
- Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Suita, Japan
| | - Hilary K Finucane
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Karestan C Koenen
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | | | - Mark J Daly
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
| | - Benjamin M Neale
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| |
Collapse
|
24
|
Taparra K, Miller RC, Deville C. Navigating Native Hawaiian and Pacific Islander Cancer Disparities From a Cultural and Historical Perspective. JCO Oncol Pract 2021; 17:130-134. [PMID: 33497251 DOI: 10.1200/op.20.00831] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Affiliation(s)
- Kekoa Taparra
- Department of Radiation Oncology, Mayo Clinic, Rochester, MN.,Transitional Year Program, Gundersen Health System, La Crosse, WI
| | - Robert C Miller
- Department of Radiation Oncology, University of Maryland School of Medicine, Baltimore, MD
| | - Curtiland Deville
- Department of Radiation Oncology and Molecular Radiation Sciences, Johns Hopkins University, Baltimore, MD
| |
Collapse
|