1
|
Thomas CE, Peters U. Genomic landscape of cancer in racially and ethnically diverse populations. Nat Rev Genet 2025; 26:336-349. [PMID: 39609636 DOI: 10.1038/s41576-024-00796-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/14/2024] [Indexed: 11/30/2024]
Abstract
Cancer incidence and mortality rates can vary widely among different racial and ethnic groups, attributed to a complex interplay of genetic, environmental and social factors. Recently, substantial progress has been made in investigating hereditary genetic risk factors and in characterizing tumour genomes. However, most research has been conducted in individuals of European ancestries and, increasingly, in individuals of Asian ancestries. The study of germline and somatic genetics in cancer across racial and ethnic groups using omics technologies offers opportunities to identify similarities and differences in both heritable traits and the molecular features of cancer genomes. An improved understanding of population-specific cancer genomics, as well as translation of those findings across populations, will help reduce cancer disparities and ensure that personalized medicine and public health approaches are equitable across racial and ethnic groups.
Collapse
Affiliation(s)
- Claire E Thomas
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Ulrike Peters
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA.
- Department of Epidemiology, University of Washington, Seattle, WA, USA.
| |
Collapse
|
2
|
Dias JA, Chen T, Xing H, Wang X, Rodriguez AA, Madduri RK, Kraft P, Zhang H. Evaluating Multi-Ancestry Genome-Wide Association Methods: Statistical Power, Population Structure, and Practical Implications. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2025:2025.03.11.25323772. [PMID: 40162269 PMCID: PMC11952640 DOI: 10.1101/2025.03.11.25323772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
The increasing availability of diverse biobanks has enabled multi-ancestry genome-wide association studies (GWAS), enhancing the discovery of genetic variants across traits and diseases. However, the choice of an optimal method remains debated due to challenges in statistical power differences across ancestral groups and approaches to account for population structure. Two primary strategies exist: (1) Pooled analysis, which combines individuals from all genetic backgrounds into a single dataset while adjusting for population stratification using principal components, increasing the sample size and statistical power but requiring careful control of population stratification. (2) Meta-analysis, which performs ancestry-group-specific GWAS and subsequently combines summary statistics, potentially capturing fine-scale population structure, but facing limitations in handling admixed individuals. Using large-scale simulations with varying sample sizes and ancestry compositions, we compare these methods alongside real data analyses of eight continuous and five binary traits from the UK Biobank ( N ≈ 324,000 ) and All of Us Research Program ( N ≈ 207,000 ). Our results demonstrate that pooled analysis generally exhibits better statistical power while effectively adjusting for population stratification. We further present a theoretical framework linking power differences to allele frequency variations across populations. These findings, validated across both biobanks, highlight pooled analysis as a robust and scalable strategy for multi-ancestry GWAS, improving genetic discovery while maintaining rigorous population structure control.
Collapse
Affiliation(s)
- Julie-Alexia Dias
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Tony Chen
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Hua Xing
- Division of Cancer Epidemiology & Genetics, National Cancer Institute, National Instituters of Health, Bethesda, MD, USA
- Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Leidos Biomedical Research Inc, Rockville, MD, USA
| | - Xiaoyu Wang
- Division of Cancer Epidemiology & Genetics, National Cancer Institute, National Instituters of Health, Bethesda, MD, USA
- Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Leidos Biomedical Research Inc, Rockville, MD, USA
| | - Alex A. Rodriguez
- Data Science and Learning division, Argonne National Laboratory, Lemont, IL, USA
| | - Ravi K. Madduri
- Data Science and Learning division, Argonne National Laboratory, Lemont, IL, USA
| | - Peter Kraft
- Division of Cancer Epidemiology & Genetics, National Cancer Institute, National Instituters of Health, Bethesda, MD, USA
| | - Haoyu Zhang
- Division of Cancer Epidemiology & Genetics, National Cancer Institute, National Instituters of Health, Bethesda, MD, USA
| |
Collapse
|
3
|
Hu S, Ferreira LAF, Shi S, Hellenthal G, Marchini J, Lawson DJ, Myers SR. Fine-scale population structure and widespread conservation of genetic effect sizes between human groups across traits. Nat Genet 2025; 57:379-389. [PMID: 39901012 PMCID: PMC11821542 DOI: 10.1038/s41588-024-02035-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 11/18/2024] [Indexed: 02/05/2025]
Abstract
Understanding genetic differences between populations is essential for avoiding confounding in genome-wide association studies and improving polygenic score (PGS) portability. We developed a statistical pipeline to infer fine-scale Ancestry Components and applied it to UK Biobank data. Ancestry Components identify population structure not captured by widely used principal components, improving stratification correction for geographically correlated traits. To estimate the similarity of genetic effect sizes between groups, we developed ANCHOR, which estimates changes in the predictive power of an existing PGS in distinct local ancestry segments. ANCHOR infers highly similar (estimated correlation 0.98 ± 0.07) effect sizes between UK Biobank participants of African and European ancestry for 47 of 53 quantitative phenotypes, suggesting that gene-environment and gene-gene interactions do not play major roles in poor cross-ancestry PGS transferability for these traits in the United Kingdom, and providing optimism that shared causal mutations operate similarly in different populations.
Collapse
Affiliation(s)
- Sile Hu
- Department of Statistics, University of Oxford, Oxford, UK.
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK.
- Human Genetics Centre of Excellence, Novo Nordisk Research Centre Oxford, Oxford, UK.
| | - Lino A F Ferreira
- Department of Statistics, University of Oxford, Oxford, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Sinan Shi
- Department of Statistics, University of Oxford, Oxford, UK
| | - Garrett Hellenthal
- Department of Genetics, Evolution and Environment, University College London, London, UK
- UCL Genetics Institute, University College London, London, UK
| | | | - Daniel J Lawson
- Department of Statistical Science, School of Mathematics, University of Bristol, Bristol, UK.
- MRC Integrative Epidemiology Unit, Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK.
| | - Simon R Myers
- Department of Statistics, University of Oxford, Oxford, UK.
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK.
| |
Collapse
|
4
|
Schaid DJ, McDonnell SK, Akhtari FS, Sinnwell JP, Batzler A, Cobran EK, Motsinger-Reif A. Polygenic scores and social determinants of health: Their correlations and potential biases. HGG ADVANCES 2025; 6:100389. [PMID: 39604263 PMCID: PMC11699593 DOI: 10.1016/j.xhgg.2024.100389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Revised: 11/25/2024] [Accepted: 11/25/2024] [Indexed: 11/29/2024] Open
Abstract
The use of polygenic scores (PGS) for personalized medicine has gained momentum, along with caution to avoid accentuating health disparities. Greater ancestral diversity in genetic studies is needed, as well as close attention to the social determinants of health (SDoH).We measured the correlations between 3,030 PGS from the PGS Catalog and SDoH among participants in the Personalized Environment and Genes Study (PEGS). Correlations mainly ranged from -0.05 to 0.05, yet there was a heterogeneity of correlations across SDoH themes, with the largest amount of heterogeneity for PGS predicting body measures and smoking, as well as some common diseases. We also quantify the expected bias of PGS effect size on disease risk when strong predictors, such as SDoH, are omitted from models, emphasizing the importance of including SDoH with PGS to avoid biased estimates of PGS risk and to achieve equitable precision medicine.
Collapse
Affiliation(s)
- Daniel J Schaid
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN 55905, USA.
| | - Shannon K McDonnell
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN 55905, USA
| | - Farida S Akhtari
- Biostatistics & Computational Biology Branch, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Jason P Sinnwell
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN 55905, USA
| | - Anthony Batzler
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN 55905, USA
| | - Ewan K Cobran
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN 55905, USA
| | - Alison Motsinger-Reif
- Biostatistics & Computational Biology Branch, National Institute of Environmental Health Sciences, Durham, NC, USA
| |
Collapse
|
5
|
Pagnuco I, Eyre S, Rattray M, Morris AP. Transferability of Single- and Cross-Tissue Transcriptome Imputation Models Across Ancestry Groups. Genet Epidemiol 2025; 49:e22611. [PMID: 39812501 PMCID: PMC11734644 DOI: 10.1002/gepi.22611] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Revised: 10/02/2024] [Accepted: 12/19/2024] [Indexed: 01/16/2025]
Abstract
Transcriptome-wide association studies (TWAS) investigate the links between genetically regulated gene expression and complex traits. TWAS involves imputing gene expression using expression quantitative trait loci (eQTL) as predictors and testing the association between the imputed expression and the trait. The effectiveness of TWAS depends on the accuracy of these imputation models, which require genotype and gene expression data from the same samples. However, publicly accessible resources, such as the Genotype Tissue Expression (GTEx) Project, are biased toward individuals of European ancestry, potentially reducing prediction accuracy into other ancestry groups. This study explored eQTL transferability across ancestry groups by comparing two imputation models: PrediXcan (tissue-specific) and UTMOST (cross-tissue). Both models were trained on tissues from the GTEx Project using European ancestry individuals and then tested on data sets of European ancestry and African American individuals. Results showed that both models performed best when the training and testing data sets were from the same ancestry group, with the cross-tissue approach generally outperforming the tissue-specific approach. This study underscores that eQTL detection is influenced by ancestry and tissue context. Developing ancestry-specific reference panels across tissues can improve prediction accuracy, enhancing TWAS analysis and our understanding of the biological processes contributing to complex traits.
Collapse
Affiliation(s)
- Inti Pagnuco
- Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, Division of Musculoskeletal and Dermatological SciencesThe University of ManchesterManchesterUK
| | - Stephen Eyre
- Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, Division of Musculoskeletal and Dermatological SciencesThe University of ManchesterManchesterUK
| | - Magnus Rattray
- Division of Informatics, Imaging and Data SciencesThe University of ManchesterManchesterUK
| | - Andrew P. Morris
- Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, Division of Musculoskeletal and Dermatological SciencesThe University of ManchesterManchesterUK
| |
Collapse
|
6
|
Martinez KL, Klein A, Martin JR, Sampson CU, Giles JB, Beck ML, Bhakta K, Quatraro G, Farol J, Karnes JH. Disparities in ABO blood type determination across diverse ancestries: a systematic review and validation in the All of Us Research Program. J Am Med Inform Assoc 2024; 31:3022-3031. [PMID: 38917427 PMCID: PMC11631141 DOI: 10.1093/jamia/ocae161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Revised: 05/02/2024] [Accepted: 06/19/2024] [Indexed: 06/27/2024] Open
Abstract
OBJECTIVES ABO blood types have widespread clinical use and robust associations with disease. The purpose of this study is to evaluate the portability and suitability of tag single-nucleotide polymorphisms (tSNPs) used to determine ABO alleles and blood types across diverse populations in published literature. MATERIALS AND METHODS Bibliographic databases were searched for studies using tSNPs to determine ABO alleles. We calculated linkage between tSNPs and functional variants across inferred continental ancestry groups from 1000 Genomes. We compared r2 across ancestry and assessed real-world consequences by comparing tSNP-derived blood types to serology in a diverse population from the All of Us Research Program. RESULTS Linkage between functional variants and O allele tSNPs was significantly lower in African (median r2 = 0.443) compared to East Asian (r2 = 0.946, P = 1.1 × 10-5) and European (r2 = 0.869, P = .023) populations. In All of Us, discordance between tSNP-derived blood types and serology was high across all SNPs in African ancestry individuals and linkage was strongly correlated with discordance across all ancestries (ρ = -0.90, P = 3.08 × 10-23). DISCUSSION Many studies determine ABO blood types using tSNPs. However, tSNPs with low linkage disequilibrium promote misinference of ABO blood types, particularly in diverse populations. We observe common use of inappropriate tSNPs to determine ABO blood type, particularly for O alleles and with some tSNPs mistyping up to 58% of individuals. CONCLUSION Our results highlight the lack of transferability of tSNPs across ancestries and potential exacerbation of disparities in genomic research for underrepresented populations. This is especially relevant as more diverse cohorts are made publicly available.
Collapse
Affiliation(s)
- Kiana L Martinez
- Department of Pharmacy Practice and Science, The University of Arizona R. Ken Coit College of Pharmacy, Tucson, AZ 85721, United States
| | - Andrew Klein
- Department of Pharmacy Practice and Science, The University of Arizona R. Ken Coit College of Pharmacy, Tucson, AZ 85721, United States
| | - Jennifer R Martin
- Department of Pharmacy Practice and Science, The University of Arizona R. Ken Coit College of Pharmacy, Tucson, AZ 85721, United States
- Department of the University of Arizona Health Sciences Library, The University of Arizona, Tucson, AZ 85721, United States
| | - Chinwuwanuju U Sampson
- Department of Pharmacy Practice and Science, The University of Arizona R. Ken Coit College of Pharmacy, Tucson, AZ 85721, United States
| | - Jason B Giles
- Department of Pharmacy Practice and Science, The University of Arizona R. Ken Coit College of Pharmacy, Tucson, AZ 85721, United States
| | - Madison L Beck
- Department of Pharmacy Practice and Science, The University of Arizona R. Ken Coit College of Pharmacy, Tucson, AZ 85721, United States
| | - Krupa Bhakta
- Department of Pharmacy Practice and Science, The University of Arizona R. Ken Coit College of Pharmacy, Tucson, AZ 85721, United States
| | - Gino Quatraro
- Department of Pharmacy Practice and Science, The University of Arizona R. Ken Coit College of Pharmacy, Tucson, AZ 85721, United States
| | - Juvie Farol
- Department of Clinical and Translational Science, The University of Arizona College of Medicine, Tucson, AZ 85721, United States
| | - Jason H Karnes
- Department of Pharmacy Practice and Science, The University of Arizona R. Ken Coit College of Pharmacy, Tucson, AZ 85721, United States
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232, United States
| |
Collapse
|
7
|
Diany R, Gagliano Taliun SA. Systematic Review and Phenome-Wide Scans of Genetic Associations with Vascular Cognitive Impairment. Adv Biol (Weinh) 2024; 8:e2300692. [PMID: 38935518 DOI: 10.1002/adbi.202300692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Revised: 03/12/2024] [Indexed: 06/29/2024]
Abstract
Vascular cognitive impairment (VCI) is a heterogenous form of cognitive impairment that results from cerebrovascular disease. It is a result of both genetic and non-genetic factors. Although much research has been conducted on the genetic contributors to other forms of cognitive impairment (e.g. Alzheimer's disease), knowledge is lacking on the genetic factors associated with VCI. A better understanding of the genetics of VCI will be critical for prevention and treatment. To begin to fill this gap, the genetic contributors are reviewed with VCI from the literature. Phenome-wide scans of the identified genes are conducted and genetic variants identified in the review in large-scale resources displaying genetic variant-trait association information. Gene set are also carried out enrichment analysis using the genes identified from the review. Thirty one articles are identified meeting the search criteria and filters, from which 107 unique protein-coding genes are noted related to VCI. The phenome-wide scans and gene set enrichment analysis identify pathways associated with a diverse set of biological systems. This results indicate that genes with evidence of involvement in VCI are involved in a diverse set of biological functions. This information can facilitate downstream research to better dissect possible shared biological mechanisms for future therapies.
Collapse
Affiliation(s)
- Rime Diany
- Faculty of Medicine, Université de Montréal, 2900 Boulevard Edouard-Montpetit, Montréal, Québec, H3C 3J7, Canada
| | - Sarah A Gagliano Taliun
- Department of Medicine & Department of Neurosciences, Faculty of Medicine, Université de Montréal, 2900 Boulevard Edouard-Montpetit, Montréal, Québec, H3C 3J7, Canada
- Montreal Heart Institute, 5000 rue Bélanger, Montréal, Québec, H1T 1C8, Canada
| |
Collapse
|
8
|
de Souza AES, da Silva CHS, de Oliveira RDCS, Guimarães APA, da Silva ANLM, Diniz IG, de Oliveira HSS, de Sousa DS, Figueiredo FADPL, Costa GDLC, Guerreiro JF. Investigation of genetic markers associated to type 2 diabetes mellitus in Santarém-Pará. Genet Mol Biol 2024; 47:e20230107. [PMID: 39133695 PMCID: PMC11308377 DOI: 10.1590/1678-4685-gmb-2023-0107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 05/22/2024] [Indexed: 08/15/2024] Open
Abstract
Genetic, epigenetic and environmental factors play an important role in the genesis of Type 2 Diabetes Mellitus (T2D). In the genetic context, one of the strategies used to investigate possible associations with diabetes is the search for Single Nucleotide Polymorphisms (SNPs), involving the comparison of alelle frequencies, the phenotypic variations and other relevant factors, such as environmental influences and lifestyle choices, Thus, the aim of this study was to find the relationship of risk variants for T2D in SNPs (rs4994) in the ADRB3 gene; (rs1799854) in the ABCC8 gene; (rs7901695 and rs12255372) in the TCF7L2 gene; and (rs8050136) in the FTO gene in a sample of the population of the municipality of Santarém (PA), Brazilian Amazon, in the northern region of Brazil. ABCC8 (rs1799854 C>T) showed a statistically significant association with T2D. Each chosen gene and SNP has been previously implicated in T2D risk according to existing scientific literature, owing to their roles in glucose regulation and body fat.
Collapse
Affiliation(s)
- Adjanny Estela Santos de Souza
- Universidade do Estado do Pará, Faculdade de Medicina, Departamento
de Morfologia e Ciências Fisiológicas, Belém, PA, Brazil
| | - Caio Henrique Silva da Silva
- Universidade do Estado do Pará, Faculdade de Medicina, Departamento
de Morfologia e Ciências Fisiológicas, Belém, PA, Brazil
| | | | - Ana Paula Araújo Guimarães
- Universidade do Estado do Pará, Faculdade de Medicina, Departamento
de Morfologia e Ciências Fisiológicas, Belém, PA, Brazil
| | | | - Isabela Guerreiro Diniz
- Universidade Federal do Pará, Instituto de Ciências Biológicas,
Laboratório de Genética Humana e Médica, Belém, PA, Brazil
| | - Haiala Soter Silva de Oliveira
- Universidade Federal do Pará, Instituto de Ciências Biológicas,
Laboratório de Genética Humana e Médica, Belém, PA, Brazil
| | - Diego Sarmento de Sousa
- Universidade Federal do Oeste do Pará (UFOPA), Santarém, PA,
Brazil
- Centro Universitário da Amazônia (UNAMA), Santarém, PA,
Brazil
| | | | - Greice de Lemos Cardoso Costa
- Universidade Federal do Pará, Instituto de Ciências Biológicas,
Laboratório de Genética Humana e Médica, Belém, PA, Brazil
| | - João Farias Guerreiro
- Universidade Federal do Pará, Instituto de Ciências Biológicas,
Laboratório de Genética Humana e Médica, Belém, PA, Brazil
| |
Collapse
|
9
|
Bharti N, Banerjee R, Achalare A, Kasibhatla SM, Joshi R. Estimation of genetic variation in vitiligo associated genes: Population genomics perspective. BMC Genom Data 2024; 25:72. [PMID: 39060965 PMCID: PMC11282599 DOI: 10.1186/s12863-024-01254-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2024] [Accepted: 07/16/2024] [Indexed: 07/28/2024] Open
Abstract
BACKGROUND Vitiligo is an auto-immune progressive depigmentation disorder of the skin due to loss of melanocytes. Genetic risk is one of the important factors for development of vitiligo. Preponderance of vitiligo in certain ethnicities is known which can be analysed by understanding the distribution of allele frequencies across normal populations. Earlier GWAS identified 108 risk alleles for vitiligo in Europeans and East Asians. In this study, 64 of these risk alleles were used for analysing their enrichment and depletion across populations (1000 Genomes Project and IndiGen) with reference to 1000 Genomes dataset. Genetic risk scores were calculated and Fisher's exact test was performed to understand statistical significance of their variation in each population with respect to 1000 Genomes dataset as reference. In addition to SNPs reported in GWAS, significant variation in allele frequencies of 1079 vitiligo-related genes were also analysed. Two-tailed Chi-square test and Bonferroni's multiple adjustment values along with fixation index (≥ 0.5) and minimum allele frequency (≥ 0.05) were calculated and used to prioritise the variants based on pairwise comparison across populations. RESULTS Risk alleles rs1043101 and rs10768122 belong to 3 prime UTR of glutamate receptor gene SLC1A2 are found to be highly enriched in the South Asian population when compared with the 'global normal' population. Intron variant rs4766578 (ATXN2) was found to be deleted in SAS, EAS and AFR and enriched in EUR and AMR1. This risk allele is found to be under positive selection in SAS, AMR1 and EUR. From the ancillary vitiligo gene list, nonsynonymous variant rs16891982 was found to be enriched in the European and the Admixed American populations and depleted in all others. rs2279238 and rs11039155 belonging to the LXR-α gene involved in regulation of metalloproteinase 2 and 9 (melanocyte precursors) were found to be associated with vitiligo in the North Indian population (in earlier study). CONCLUSION The differential enrichment/depletion profile of the risk alleles provides insight into the underlying inter-population variations. This would provide clues towards prioritisation of SNPs associated with vitiligo thereby elucidating its preponderance in different ethnic groups.
Collapse
Affiliation(s)
- Neeraj Bharti
- HPC-Medical and Bioinformatics Applications Group, Centre for Development of Advanced Computing, Innovation Park, Pashan, Pune, 411008, Maharashtra, India
| | - Ruma Banerjee
- HPC-Medical and Bioinformatics Applications Group, Centre for Development of Advanced Computing, Innovation Park, Pashan, Pune, 411008, Maharashtra, India
| | - Archana Achalare
- HPC-Medical and Bioinformatics Applications Group, Centre for Development of Advanced Computing, Innovation Park, Pashan, Pune, 411008, Maharashtra, India
| | - Sunitha Manjari Kasibhatla
- HPC-Medical and Bioinformatics Applications Group, Centre for Development of Advanced Computing, Innovation Park, Pashan, Pune, 411008, Maharashtra, India
| | - Rajendra Joshi
- HPC-Medical and Bioinformatics Applications Group, Centre for Development of Advanced Computing, Innovation Park, Pashan, Pune, 411008, Maharashtra, India.
| |
Collapse
|
10
|
Topriceanu CC, Chaturvedi N, Mathur R, Garfield V. Validity of European-centric cardiometabolic polygenic scores in multi-ancestry populations. Eur J Hum Genet 2024; 32:697-707. [PMID: 38182743 PMCID: PMC11153583 DOI: 10.1038/s41431-023-01517-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2023] [Revised: 10/29/2023] [Accepted: 11/28/2023] [Indexed: 01/07/2024] Open
Abstract
Polygenic scores (PGSs) provide an individual level estimate of genetic risk for any given disease. Since most PGSs have been derived from genome wide association studies (GWASs) conducted in populations of White European ancestry, their validity in other ancestry groups remains unconfirmed. This is especially relevant for cardiometabolic diseases which are known to disproportionately affect people of non-European ancestry. Thus, we aimed to evaluate the performance of PGSs for glycaemic traits (glycated haemoglobin, and type 1 and type 2 diabetes mellitus), cardiometabolic risk factors (body mass index, hypertension, high- and low-density lipoproteins, and total cholesterol and triglycerides) and cardiovascular diseases (including stroke and coronary artery disease) in people of White European, South Asian, and African Caribbean ethnicity in the UK Biobank. Whilst PGSs incorporated some GWAS data from multi-ethnic populations, the vast majority originated from White Europeans. For most outcomes, PGSs derived mostly from European populations had an overall better performance in White Europeans compared to South Asians and African Caribbeans. Thus, multi-ancestry GWAS data are needed to derive ancestry stratified PGSs to tackle health inequalities.
Collapse
Affiliation(s)
- Constantin-Cristian Topriceanu
- Department of Population Science and Experimental Medicine, Institute of Cardiovascular Science, University College London, Gower Street, London, WC1E 6BT, UK.
- MRC Unit for Lifelong Health and Ageing, University College London, 1-19 Torrington Place, London, WC1E 7HB, UK.
| | - Nish Chaturvedi
- Department of Population Science and Experimental Medicine, Institute of Cardiovascular Science, University College London, Gower Street, London, WC1E 6BT, UK
- MRC Unit for Lifelong Health and Ageing, University College London, 1-19 Torrington Place, London, WC1E 7HB, UK
| | - Rohini Mathur
- Centre for Primary Care, Wolfson Institute of Population Health, Queen Mary University of London, London, UK
| | - Victoria Garfield
- Department of Population Science and Experimental Medicine, Institute of Cardiovascular Science, University College London, Gower Street, London, WC1E 6BT, UK
- MRC Unit for Lifelong Health and Ageing, University College London, 1-19 Torrington Place, London, WC1E 7HB, UK
| |
Collapse
|
11
|
Troubat L, Fettahoglu D, Henches L, Aschard H, Julienne H. Multi-trait GWAS for diverse ancestries: mapping the knowledge gap. BMC Genomics 2024; 25:375. [PMID: 38627641 PMCID: PMC11022331 DOI: 10.1186/s12864-024-10293-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 04/09/2024] [Indexed: 04/19/2024] Open
Abstract
BACKGROUND Approximately 95% of samples analyzed in univariate genome-wide association studies (GWAS) are of European ancestry. This bias toward European ancestry populations in association screening also exists for other analyses and methods that are often developed and tested on European ancestry only. However, existing data in non-European populations, which are often of modest sample size, could benefit from innovative approaches as recently illustrated in the context of polygenic risk scores. METHODS Here, we extend and assess the potential limitations and gains of our multi-trait GWAS pipeline, JASS (Joint Analysis of Summary Statistics), for the analysis of non-European ancestries. To this end, we conducted the joint GWAS of 19 hematological traits and glycemic traits across five ancestries (European (EUR), admixed American (AMR), African (AFR), East Asian (EAS), and South-East Asian (SAS)). RESULTS We detected 367 new genome-wide significant associations in non-European populations (15 in Admixed American (AMR), 72 in African (AFR) and 280 in East Asian (EAS)). New associations detected represent 5%, 17% and 13% of associations in the AFR, AMR and EAS populations, respectively. Overall, multi-trait testing increases the replication of European associated loci in non-European ancestry by 15%. Pleiotropic effects were highly similar at significant loci across ancestries (e.g. the mean correlation between multi-trait genetic effects of EUR and EAS ancestries was 0.88). For hematological traits, strong discrepancies in multi-trait genetic effects are tied to known evolutionary divergences: the ARKC1 loci, which is adaptive to overcome p.vivax induced malaria. CONCLUSIONS Multi-trait GWAS can be a valuable tool to narrow the genetic knowledge gap between European and non-European populations.
Collapse
Affiliation(s)
- Lucie Troubat
- Department of Computational Biology, Institut Pasteur, Université Paris Cité, Paris, F-75015, France
| | - Deniz Fettahoglu
- Department of Computational Biology, Institut Pasteur, Université Paris Cité, Paris, F-75015, France
| | - Léo Henches
- Department of Computational Biology, Institut Pasteur, Université Paris Cité, Paris, F-75015, France
| | - Hugues Aschard
- Department of Computational Biology, Institut Pasteur, Université Paris Cité, Paris, F-75015, France
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA
| | - Hanna Julienne
- Department of Computational Biology, Institut Pasteur, Université Paris Cité, Paris, F-75015, France.
- Institut Pasteur, Université Paris Cité, Bioinformatics and Biostatistics Hub, Paris, F-75015, France.
| |
Collapse
|
12
|
Garro-Núñez D, Picado-Martínez MJ, Espinoza-Campos E, Ugalde-Araya D, Macaya G, Raventós H, Chavarría-Soley G. Systematic exploration of a decade of publications on psychiatric genetics in Latin America. Am J Med Genet B Neuropsychiatr Genet 2024; 195:e32960. [PMID: 37860990 DOI: 10.1002/ajmg.b.32960] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 08/08/2023] [Accepted: 09/29/2023] [Indexed: 10/21/2023]
Abstract
Psychiatric disorders have a great impact in terms of mortality, morbidity, and disability across the lifespan. Considerable effort has been devoted to understanding their complex and heterogeneous genetic architecture, including diverse ancestry populations. Our aim was to review the psychiatric genetics research published with Latin American populations from 2010 to 2019, and classify it according to country of origin, type of analysis, source of funding, and other variables. We found that most publications came from Brazil, Mexico, and Colombia. Also, local funds are generally not large enough for genome-wide studies in Latin America, with the exception of Brazil and Mexico; larger studies are often done in collaboration with international partners, mostly funded by US agencies. In most of the larger studies, the participants are individuals of Latin American ancestry living in the United States, which limits the potential for exploring the complex gene-environment interaction. Family studies, traditionally strong in Latin America, represent about 30% of the total research publications. Scarce local resources for research in Latin America have probably been an important limitation for conducting bigger and more complex studies, contributing to the reduced representation of these populations in global psychiatric genetics studies. Increasing diversity must be a goal to improve generalizability and applicability in clinical settings.
Collapse
Affiliation(s)
| | | | | | - Daniela Ugalde-Araya
- Center for Research in Cellular and Molecular Biology, Universidad de Costa Rica, San José, Costa Rica
| | - Gabriel Macaya
- Center for Research in Cellular and Molecular Biology, Universidad de Costa Rica, San José, Costa Rica
| | - Henriette Raventós
- Biology School, Universidad de Costa Rica, San José, Costa Rica
- Center for Research in Cellular and Molecular Biology, Universidad de Costa Rica, San José, Costa Rica
| | - Gabriela Chavarría-Soley
- Biology School, Universidad de Costa Rica, San José, Costa Rica
- Center for Research in Cellular and Molecular Biology, Universidad de Costa Rica, San José, Costa Rica
| |
Collapse
|
13
|
Zhang S, Jiang Z, Zeng P. Incorporating genetic similarity of auxiliary samples into eGene identification under the transfer learning framework. J Transl Med 2024; 22:258. [PMID: 38461317 PMCID: PMC10924384 DOI: 10.1186/s12967-024-05053-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Accepted: 03/01/2024] [Indexed: 03/11/2024] Open
Abstract
BACKGROUND The term eGene has been applied to define a gene whose expression level is affected by at least one independent expression quantitative trait locus (eQTL). It is both theoretically and empirically important to identify eQTLs and eGenes in genomic studies. However, standard eGene detection methods generally focus on individual cis-variants and cannot efficiently leverage useful knowledge acquired from auxiliary samples into target studies. METHODS We propose a multilocus-based eGene identification method called TLegene by integrating shared genetic similarity information available from auxiliary studies under the statistical framework of transfer learning. We apply TLegene to eGene identification in ten TCGA cancers which have an explicit relevant tissue in the GTEx project, and learn genetic effect of variant in TCGA from GTEx. We also adopt TLegene to the Geuvadis project to evaluate its usefulness in non-cancer studies. RESULTS We observed substantial genetic effect correlation of cis-variants between TCGA and GTEx for a larger number of genes. Furthermore, consistent with the results of our simulations, we found that TLegene was more powerful than existing methods and thus identified 169 distinct candidate eGenes, which was much larger than the approach that did not consider knowledge transfer across target and auxiliary studies. Previous studies and functional enrichment analyses provided empirical evidence supporting the associations of discovered eGenes, and it also showed evidence of allelic heterogeneity of gene expression. Furthermore, TLegene identified more eGenes in Geuvadis and revealed that these eGenes were mainly enriched in cells EBV transformed lymphocytes tissue. CONCLUSION Overall, TLegene represents a flexible and powerful statistical method for eGene identification through transfer learning of genetic similarity shared across auxiliary and target studies.
Collapse
Affiliation(s)
- Shuo Zhang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Zhou Jiang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Ping Zeng
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
- Center for Medical Statistics and Data Analysis, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
- Key Laboratory of Human Genetics and Environmental Medicine, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
- Key Laboratory of Environment and Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
- Xuzhou Engineering Research Innovation Center of Biological Data Mining and Healthcare Transformation, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
- Jiangsu Engineering Research Center of Biological Data Mining and Healthcare Transformation, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
| |
Collapse
|
14
|
Lappalainen T, Li YI, Ramachandran S, Gusev A. Genetic and molecular architecture of complex traits. Cell 2024; 187:1059-1075. [PMID: 38428388 PMCID: PMC10977002 DOI: 10.1016/j.cell.2024.01.023] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 12/20/2023] [Accepted: 01/16/2024] [Indexed: 03/03/2024]
Abstract
Human genetics has emerged as one of the most dynamic areas of biology, with a broadening societal impact. In this review, we discuss recent achievements, ongoing efforts, and future challenges in the field. Advances in technology, statistical methods, and the growing scale of research efforts have all provided many insights into the processes that have given rise to the current patterns of genetic variation. Vast maps of genetic associations with human traits and diseases have allowed characterization of their genetic architecture. Finally, studies of molecular and cellular effects of genetic variants have provided insights into biological processes underlying disease. Many outstanding questions remain, but the field is well poised for groundbreaking discoveries as it increases the use of genetic data to understand both the history of our species and its applications to improve human health.
Collapse
Affiliation(s)
- Tuuli Lappalainen
- New York Genome Center, New York, NY, USA; Science for Life Laboratory, Department of Gene Technology, KTH Royal Institute of Technology, Stockholm, Sweden.
| | - Yang I Li
- Section of Genetic Medicine, University of Chicago, Chicago, IL, USA; Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | - Sohini Ramachandran
- Ecology, Evolution and Organismal Biology, Center for Computational Molecular Biology, and the Data Science Institute, Brown University, Providence, RI 029129, USA
| | - Alexander Gusev
- Harvard Medical School and Dana-Farber Cancer Institute, Boston, MA, USA
| |
Collapse
|
15
|
Jia K, Shen J. Transcriptome-wide association studies associated with Crohn's disease: challenges and perspectives. Cell Biosci 2024; 14:29. [PMID: 38403629 PMCID: PMC10895848 DOI: 10.1186/s13578-024-01204-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Accepted: 02/04/2024] [Indexed: 02/27/2024] Open
Abstract
Crohn's disease (CD) is regarded as a lifelong progressive disease affecting all segments of the intestinal tract and multiple organs. Based on genome-wide association studies (GWAS) and gene expression data, transcriptome-wide association studies (TWAS) can help identify susceptibility genes associated with pathogenesis and disease behavior. In this review, we overview seven reported TWASs of CD, summarize their study designs, and discuss the key methods and steps used in TWAS, which affect the prioritization of susceptibility genes. This article summarized the screening of tissue-specific susceptibility genes for CD, and discussed the reported potential pathological mechanisms of overlapping susceptibility genes related to CD in a certain tissue type. We observed that ileal lipid-related metabolism and colonic extracellular vesicles may be involved in the pathogenesis of CD by performing GO pathway enrichment analysis for susceptibility genes. We further pointed the low reproducibility of TWAS associated with CD and discussed the reasons for these issues, strategies for solving them. In the future, more TWAS are needed to be designed into large-scale, unified cohorts, unified analysis pipelines, and fully classified databases of expression trait loci.
Collapse
Affiliation(s)
- Keyu Jia
- Laboratory of Medicine, Baoshan Branch, Ren Ji Hospital, School of Medicine, Nephrology department, Shanghai Jiao Tong University, 1058 Huanzhen Northroad, Shanghai, 200444, China
| | - Jun Shen
- Laboratory of Medicine, Baoshan Branch, Ren Ji Hospital, School of Medicine, Nephrology department, Shanghai Jiao Tong University, 1058 Huanzhen Northroad, Shanghai, 200444, China.
- Division of Gastroenterology and Hepatology, Key Laboratory of Gastroenterology and Hepatology, Ministry of Health, Inflammatory Bowel Research Center, Ren Ji Hospital, School of Medicine, Shanghai Institute of Digestive Disease, Shanghai Jiao Tong University, Shanghai, China.
- NHC Key Laboratory of Digestive Diseases, Renji Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China.
- Division of Gastroenterology and Hepatology, Baoshan Branch, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China.
| |
Collapse
|
16
|
Jeng XJ, Hu Y, Venkat V, Lu TP, Tzeng JY. Transfer learning with false negative control improves polygenic risk prediction. PLoS Genet 2023; 19:e1010597. [PMID: 38011285 PMCID: PMC10723713 DOI: 10.1371/journal.pgen.1010597] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2023] [Revised: 12/15/2023] [Accepted: 11/09/2023] [Indexed: 11/29/2023] Open
Abstract
Polygenic risk score (PRS) is a quantity that aggregates the effects of variants across the genome and estimates an individual's genetic predisposition for a given trait. PRS analysis typically contains two input data sets: base data for effect size estimation and target data for individual-level prediction. Given the availability of large-scale base data, it becomes more common that the ancestral background of base and target data do not perfectly match. In this paper, we treat the GWAS summary information obtained in the base data as knowledge learned from a pre-trained model, and adopt a transfer learning framework to effectively leverage the knowledge learned from the base data that may or may not have similar ancestral background as the target samples to build prediction models for target individuals. Our proposed transfer learning framework consists of two main steps: (1) conducting false negative control (FNC) marginal screening to extract useful knowledge from the base data; and (2) performing joint model training to integrate the knowledge extracted from base data with the target training data for accurate trans-data prediction. This new approach can significantly enhance the computational and statistical efficiency of joint-model training, alleviate over-fitting, and facilitate more accurate trans-data prediction when heterogeneity level between target and base data sets is small or high.
Collapse
Affiliation(s)
- Xinge Jessie Jeng
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Yifei Hu
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Vaishnavi Venkat
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Tzu-Pin Lu
- Institute of Health Data Analytics and Statistics, National Taiwan University, Taipei, Taiwan
- Department of Public Health, National Taiwan University, Taipei, Taiwan
| | - Jung-Ying Tzeng
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, United States of America
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, United States of America
- Institute of Health Data Analytics and Statistics, National Taiwan University, Taipei, Taiwan
| |
Collapse
|
17
|
Zambrano AK, Cadena-Ullauri S, Guevara-Ramírez P, Ruiz-Pozo VA, Tamayo-Trujillo R, Paz-Cruz E, Ibarra-Rodríguez AA, Doménech N. Genetic diet interactions of ACE: the increased hypertension predisposition in the Latin American population. Front Nutr 2023; 10:1241017. [PMID: 37964928 PMCID: PMC10640988 DOI: 10.3389/fnut.2023.1241017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 10/12/2023] [Indexed: 11/16/2023] Open
Abstract
Hypertension is one of the primary risk factors associated with cardiovascular diseases (CVDs). It is a condition that affects people worldwide, and its prevalence is increasing due to several factors, such as lack of physical activity, population aging, and unhealthy diets. Notably, this increase has primarily occurred in low and middle-income countries (LMICs). In Latin America, approximately 40% of adults have been diagnosed with hypertension. Moreover, reports have shown that the Latin American genetic composition is highly diverse, and this genetic background can influence various biological processes, including disease predisposition and treatment effectiveness. Research has shown that Western dietary patterns, which include increased consumption of red meat, refined grains, sugar, and ultra-processed food, have spread across the globe, including Latin America, due to globalization processes. Furthermore, a higher than recommended sodium consumption, which has been associated with hypertension, has been identified across different regions, including Asia, Europe, America, Oceania, and Africa. In conclusion, hypertension is a multifactorial disease involving environmental and genetic factors. In Latin America, hypertension prevalence is increasing due to various factors, including age, the adoption of a "Westernized" diet, and potential genetic predisposition factors involving the ACE gene. Furthermore, identifying the genetic and molecular mechanisms of the disease, its association with diet, and how they interact is essential for the development of personalized treatments to increase its efficacy and reduce side effects.
Collapse
Affiliation(s)
- Ana Karina Zambrano
- Centro de Investigación Genética y Genómica, Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| | - Santiago Cadena-Ullauri
- Centro de Investigación Genética y Genómica, Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| | - Patricia Guevara-Ramírez
- Centro de Investigación Genética y Genómica, Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| | - Viviana A. Ruiz-Pozo
- Centro de Investigación Genética y Genómica, Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| | - Rafael Tamayo-Trujillo
- Centro de Investigación Genética y Genómica, Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| | - Elius Paz-Cruz
- Centro de Investigación Genética y Genómica, Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| | | | - Nieves Doménech
- Instituto de Investigación Biomédica de A Coruña (INIBIC)-CIBERCV, Complexo Hospitalario Universitario de A Coruña (CHUAC), Sergas, Universidad da Coruña (UDC), La Coruña, Spain
| |
Collapse
|
18
|
Gyawali PK, Le Guen Y, Liu X, Belloy ME, Tang H, Zou J, He Z. Improving genetic risk prediction across diverse population by disentangling ancestry representations. Commun Biol 2023; 6:964. [PMID: 37736834 PMCID: PMC10517023 DOI: 10.1038/s42003-023-05352-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2022] [Accepted: 09/12/2023] [Indexed: 09/23/2023] Open
Abstract
Risk prediction models using genetic data have seen increasing traction in genomics. However, most of the polygenic risk models were developed using data from participants with similar (mostly European) ancestry. This can lead to biases in the risk predictors resulting in poor generalization when applied to minority populations and admixed individuals such as African Americans. To address this issue, largely due to the prediction models being biased by the underlying population structure, we propose a deep-learning framework that leverages data from diverse population and disentangles ancestry from the phenotype-relevant information in its representation. The ancestry disentangled representation can be used to build risk predictors that perform better across minority populations. We applied the proposed method to the analysis of Alzheimer's disease genetics. Comparing with standard linear and nonlinear risk prediction methods, the proposed method substantially improves risk prediction in minority populations, including admixed individuals, without needing self-reported ancestry information.
Collapse
Affiliation(s)
- Prashnna K Gyawali
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, USA.
| | - Yann Le Guen
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, USA
- Institut du Cerveau-Paris Brain Institute-ICM, Paris, France
| | - Xiaoxia Liu
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, USA
| | - Michael E Belloy
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, USA
| | - Hua Tang
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - James Zou
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA.
| | - Zihuai He
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, USA.
- Quantitative Sciences Unit, Department of Medicine (Biomedical Informatics Research), Stanford University, Stanford, CA, USA.
| |
Collapse
|
19
|
Abstract
Admixed populations constitute a large portion of global human genetic diversity, yet they are often left out of genomics analyses. This exclusion is problematic, as it leads to disparities in the understanding of the genetic structure and history of diverse cohorts and the performance of genomic medicine across populations. Admixed populations have particular statistical challenges, as they inherit genomic segments from multiple source populations-the primary reason they have historically been excluded from genetic studies. In recent years, however, an increasing number of statistical methods and software tools have been developed to account for and leverage admixture in the context of genomics analyses. Here, we provide a survey of such computational strategies for the informed consideration of admixture to allow for the well-calibrated inclusion of mixed ancestry populations in large-scale genomics studies, and we detail persisting gaps in existing tools.
Collapse
Affiliation(s)
- Taotao Tan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA;
| | - Elizabeth G Atkinson
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA;
| |
Collapse
|
20
|
Cruz LA, Cooke Bailey JN, Crawford DC. Importance of Diversity in Precision Medicine: Generalizability of Genetic Associations Across Ancestry Groups Toward Better Identification of Disease Susceptibility Variants. Annu Rev Biomed Data Sci 2023; 6:339-356. [PMID: 37196357 PMCID: PMC10720270 DOI: 10.1146/annurev-biodatasci-122220-113250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
Genome-wide association studies (GWAS) revolutionized our understanding of common genetic variation and its impact on common human disease and traits. Developed and adopted in the mid-2000s, GWAS led to searchable genotype-phenotype catalogs and genome-wide datasets available for further data mining and analysis for the eventual development of translational applications. The GWAS revolution was swift and specific, including almost exclusively populations of European descent, to the neglect of the majority of the world's genetic diversity. In this narrative review, we recount the GWAS landscape of the early years that established a genotype-phenotype catalog that is now universally understood to be inadequate for a complete understanding of complex human genetics. We then describe approaches taken to augment the genotype-phenotype catalog, including the study populations, collaborative consortia, and study design approaches aimed to generalize and then ultimately discover genome-wide associations in non-European descent populations. The collaborations and data resources established in the efforts to diversify genomic findings undoubtedly provide the foundations of the next chapters of genetic association studies with the advent of budget-friendly whole-genome sequencing.
Collapse
Affiliation(s)
- Lauren A Cruz
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA;
- Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, Ohio, USA
| | - Jessica N Cooke Bailey
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA;
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, Ohio, USA
- Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, Ohio, USA
| | - Dana C Crawford
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA;
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, Ohio, USA
- Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, Ohio, USA
| |
Collapse
|
21
|
Lehmann B, Mackintosh M, McVean G, Holmes C. Optimal strategies for learning multi-ancestry polygenic scores vary across traits. Nat Commun 2023; 14:4023. [PMID: 37419925 PMCID: PMC10328935 DOI: 10.1038/s41467-023-38930-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Accepted: 05/22/2023] [Indexed: 07/09/2023] Open
Abstract
Polygenic scores (PGSs) are individual-level measures that aggregate the genome-wide genetic predisposition to a given trait. As PGS have predominantly been developed using European-ancestry samples, trait prediction using such European ancestry-derived PGS is less accurate in non-European ancestry individuals. Although there has been recent progress in combining multiple PGS trained on distinct populations, the problem of how to maximize performance given a multiple-ancestry cohort is largely unexplored. Here, we investigate the effect of sample size and ancestry composition on PGS performance for fifteen traits in UK Biobank. For some traits, PGS estimated using a relatively small African-ancestry training set outperformed, on an African-ancestry test set, PGS estimated using a much larger European-ancestry only training set. We observe similar, but not identical, results when considering other minority-ancestry groups within UK Biobank. Our results emphasise the importance of targeted data collection from underrepresented groups in order to address existing disparities in PGS performance.
Collapse
Affiliation(s)
- Brieuc Lehmann
- Department of Statistical Science, University College London, London, UK.
| | | | - Gil McVean
- Big Data Institute, University of Oxford, Oxford, UK
| | - Chris Holmes
- The Alan Turing Institute, London, UK
- Big Data Institute, University of Oxford, Oxford, UK
- Department of Statistics, University of Oxford, Oxford, UK
| |
Collapse
|
22
|
Bhatraju PK, Stanaway IB, Palmer MR, Menon R, Schaub JA, Menez S, Srivastava A, Wilson FP, Kiryluk K, Palevsky PM, Naik AS, Sakr SS, Jarvik GP, Parikh CR, Ware LB, Ikizler TA, Siew ED, Chinchilli VM, Coca SG, Garg AX, Go AS, Kaufman JS, Kimmel PL, Himmelfarb J, Wurfel MM. Genome-wide Association Study for AKI. KIDNEY360 2023; 4:870-880. [PMID: 37273234 PMCID: PMC10371295 DOI: 10.34067/kid.0000000000000175] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 04/03/2023] [Indexed: 06/06/2023]
Abstract
Key Points Two genetic variants in the DISP1-TLR5 gene locus were associated with risk of AKI. DISP1 and TLR5 were differentially regulated in kidney biopsy tissue from patients with AKI compared with no AKI. Background Although common genetic risks for CKD are well established, genetic factors influencing risk for AKI in hospitalized patients are poorly understood. Methods We conducted a genome-wide association study in 1369 participants in the Assessment, Serial Evaluation, and Subsequent Sequelae of AKI Study; a multiethnic population of hospitalized participants with and without AKI matched on demographics, comorbidities, and kidney function before hospitalization. We then completed functional annotation of top-performing variants for AKI using single-cell RNA sequencing data from kidney biopsies in 12 patients with AKI and 18 healthy living donors from the Kidney Precision Medicine Project. Results No genome-wide significant associations with AKI risk were found in Assessment, Serial Evaluation, and Subsequent Sequelae of AKI (P < 5×10 −8 ). The top two variants with the strongest association with AKI mapped to the dispatched resistance-nodulation-division (RND) transporter family member 1 (DISP1) gene and toll-like receptor 5 (TLR5) gene locus, rs17538288 (odds ratio, 1.55; 95% confidence interval, 1.32 to 182; P = 9.47×10 −8 ) and rs7546189 (odds ratio, 1.53; 95% confidence interval, 1.30 to 1.81; P = 4.60×10 −7 ). In comparison with kidney tissue from healthy living donors, kidney biopsies in patients with AKI showed differential DISP1 expression in proximal tubular epithelial cells (adjusted P = 3.9× 10−2) and thick ascending limb of the loop of Henle (adjusted P = 8.7× 10−3) and differential TLR5 gene expression in thick ascending limb of the loop of Henle (adjusted P = 4.9× 10−30). Conclusions AKI is a heterogeneous clinical syndrome with various underlying risk factors, etiologies, and pathophysiology that may limit the identification of genetic variants. Although no variants reached genome-wide significance, we report two variants in the intergenic region between DISP1 and TLR5 , suggesting this region as a novel risk for AKI susceptibility.
Collapse
Affiliation(s)
- Pavan K Bhatraju
- Division of Pulmonary, Critical Care and Sleep Medicine, Department of Medicine, University of Washington, Seattle, Washington
- Kidney Research Institute, Division of Nephrology, Department of Medicine, University of Washington, Seattle, Washington
| | - Ian B Stanaway
- Kidney Research Institute, Division of Nephrology, Department of Medicine, University of Washington, Seattle, Washington
| | - Melody R Palmer
- Departments of Medicine (Medical Genetics) and Genome Sciences, University of Washington School of Medicine, Seattle, Washington
| | - Rajasree Menon
- Division of Nephrology, Department of Medicine, Michigan Medicine, Ann Arbor, Michigan
| | - Jennifer A Schaub
- Division of Nephrology, Department of Medicine, Michigan Medicine, Ann Arbor, Michigan
| | - Steven Menez
- Division of Nephrology, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Anand Srivastava
- Department of Medicine, Division of Nephrology and Hypertension, Northwestern University School of Medicine, Chicago, Illinois
| | - F Perry Wilson
- Program of Applied Translational Research, Yale School of Medicine, New Haven, Connecticut
| | - Krzysztof Kiryluk
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York City, New York
| | - Paul M Palevsky
- Kidney Medicine Section, VA Pittsburgh Healthcare System, Pittsburgh, Pennsylvania
- Renal-Electrolyte Division, Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania
| | - Abhijit S Naik
- Division of Nephrology, University of Michigan, Ann Arbor, Michigan
| | - Sana S Sakr
- Division of Pulmonary, Critical Care and Sleep Medicine, Department of Medicine, University of Washington, Seattle, Washington
| | - Gail P Jarvik
- Departments of Medicine (Medical Genetics) and Genome Sciences, University of Washington School of Medicine, Seattle, Washington
| | - Chirag R Parikh
- Division of Nephrology, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Lorraine B Ware
- Division of Allergy, Pulmonary and Critical Care, Vanderbilt University Medical Center, Nashville, Tennessee
| | - T Alp Ikizler
- Division of Nephrology and Hypertension, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Edward D Siew
- Division of Nephrology and Hypertension, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Vernon M Chinchilli
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, Pennsylvania
| | - Steve G Coca
- Section of Nephrology, Department of Internal Medicine, Mount Sinai School of Medicine, New York, New York
| | - Amit X Garg
- Division of Nephrology, Department of Medicine, Western University, London, Ontario, Canada
| | - Alan S Go
- Division of Nephrology, Department of Medicine, University of California, San Francisco, California
- Division of Research, Kaiser Permanente Northern California, Oakland, California
- Department of Epidemiology and Biostatistics, University of California, San Francisco, California
| | - James S Kaufman
- Division of Nephrology, New York University School of Medicine, New York, New York
- Division of Nephrology, VA New York Harbor Healthcare System, New York, New York
| | - Paul L Kimmel
- Division of Renal Diseases and Hypertension, Department of Medicine, George Washington University Medical Center, Washington, DC
| | - Jonathan Himmelfarb
- Kidney Research Institute, Division of Nephrology, Department of Medicine, University of Washington, Seattle, Washington
| | - Mark M Wurfel
- Division of Pulmonary, Critical Care and Sleep Medicine, Department of Medicine, University of Washington, Seattle, Washington
- Kidney Research Institute, Division of Nephrology, Department of Medicine, University of Washington, Seattle, Washington
| |
Collapse
|
23
|
Qiao J, Wu Y, Zhang S, Xu Y, Zhang J, Zeng P, Wang T. Evaluating significance of European-associated index SNPs in the East Asian population for 31 complex phenotypes. BMC Genomics 2023; 24:324. [PMID: 37312035 DOI: 10.1186/s12864-023-09425-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Accepted: 06/01/2023] [Indexed: 06/15/2023] Open
Abstract
BACKGROUND Genome-wide association studies (GWASs) have identified many single-nucleotide polymorphisms (SNPs) associated with complex phenotypes in the European (EUR) population; however, the extent to which EUR-associated SNPs can be generalized to other populations such as East Asian (EAS) is not clear. RESULTS By leveraging summary statistics of 31 phenotypes in the EUR and EAS populations, we first evaluated the difference in heritability between the two populations and calculated the trans-ethnic genetic correlation. We observed the heritability estimates of some phenotypes varied substantially across populations and 53.3% of trans-ethnic genetic correlations were significantly smaller than one. Next, we examined whether EUR-associated SNPs of these phenotypes could be identified in EAS using the trans-ethnic false discovery rate method while accounting for winner's curse for SNP effect in EUR and difference of sample sizes in EAS. We found on average 54.5% of EUR-associated SNPs were also significant in EAS. Furthermore, we discovered non-significant SNPs had higher effect heterogeneity, and significant SNPs showed more consistent linkage disequilibrium and allele frequency patterns between the two populations. We also demonstrated non-significant SNPs were more likely to undergo natural selection. CONCLUSIONS Our study revealed the extent to which EUR-associated SNPs could be significant in the EAS population and offered deep insights into the similarity and diversity of genetic architectures underlying phenotypes in distinct ancestral groups.
Collapse
Affiliation(s)
- Jiahao Qiao
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Yuxuan Wu
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Shuo Zhang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Yue Xu
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Jinhui Zhang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Ping Zeng
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
- Center for Medical Statistics and Data Analysis, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
- Key Laboratory of Human Genetics and Environmental Medicine, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
- Key Laboratory of Environment and Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
- Engineering Research Innovation Center of Biological Data Mining and Healthcare Transformation, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
| | - Ting Wang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
| |
Collapse
|
24
|
Zhang J, Zhang S, Qiao J, Wang T, Zeng P. Similarity and diversity of genetic architecture for complex traits between East Asian and European populations. BMC Genomics 2023; 24:314. [PMID: 37308816 DOI: 10.1186/s12864-023-09434-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Accepted: 06/07/2023] [Indexed: 06/14/2023] Open
Abstract
BACKGROUND Genome-wide association studies have detected a large number of single-nucleotide polymorphisms (SNPs) associated with complex traits in diverse ancestral groups. However, the trans-ethnic similarity and diversity of genetic architecture is not well understood currently. RESULTS By leveraging summary statistics of 37 traits from East Asian (Nmax=254,373) or European (Nmax=693,529) populations, we first evaluated the trans-ethnic genetic correlation (ρg) and found substantial evidence of shared genetic overlap underlying these traits between the two populations, with [Formula: see text] ranging from 0.53 (se = 0.11) for adult-onset asthma to 0.98 (se = 0.17) for hemoglobin A1c. However, 88.9% of the genetic correlation estimates were significantly less than one, indicating potential heterogeneity in genetic effect across populations. We next identified common associated SNPs using the conjunction conditional false discovery rate method and observed 21.7% of trait-associated SNPs can be identified simultaneously in both populations. Among these shared associated SNPs, 20.8% showed heterogeneous influence on traits between the two ancestral populations. Moreover, we demonstrated that population-common associated SNPs often exhibited more consistent linkage disequilibrium and allele frequency pattern across ancestral groups compared to population-specific or null ones. We also revealed population-specific associated SNPs were much likely to undergo natural selection compared to population-common associated SNPs. CONCLUSIONS Our study provides an in-depth understanding of similarity and diversity regarding genetic architecture for complex traits across diverse populations, and can assist in trans-ethnic association analysis, genetic risk prediction, and causal variant fine mapping.
Collapse
Affiliation(s)
- Jinhui Zhang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China
| | - Shuo Zhang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China
| | - Jiahao Qiao
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China
| | - Ting Wang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China.
| | - Ping Zeng
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China.
- Center for Medical Statistics and Data Analysis, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China.
- Key Laboratory of Human Genetics and Environmental Medicine, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China.
- Key Laboratory of Environment and Health, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China.
- Engineering Research Innovation Center of Biological Data Mining and Healthcare Transformation, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China.
| |
Collapse
|
25
|
Loughnan RJ, Palmer CE, Thompson WK, Dale AM, Jernigan TL, Chieh Fan C. Intelligence Polygenic Score Is More Predictive of Crystallized Measures: Evidence From the Adolescent Brain Cognitive Development (ABCD) Study. Psychol Sci 2023; 34:714-725. [PMID: 37146216 PMCID: PMC10404902 DOI: 10.1177/09567976231160702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Accepted: 02/03/2023] [Indexed: 05/07/2023] Open
Abstract
Findings in adults have shown that crystallized measures of intelligence, which are more culturally sensitive than fluid intelligence measures, have greater heritability; however, these results have not been found in children. The present study used data from 8,518 participants between 9 and 11 years old from the Adolescent Brain Cognitive Development (ABCD) Study. We found that polygenic predictors of intelligence test performance (based on genome-wide association meta-analyses of data from 269,867 individuals) and of educational attainment (based on data from 1.1 million individuals) predicted neurocognitive performance. We found that crystallized measures were more strongly associated with both polygenic predictors than were fluid measures. This mirrored heritability differences reported previously in adults and suggests similar associations in children. This may be consistent with a prominent role of gene-environment correlation in cognitive development measured by crystallized intelligence tests. Environmental and experiential mediators may represent malleable targets for improving cognitive outcomes.
Collapse
Affiliation(s)
- Robert J. Loughnan
- Department of Cognitive Science, University of California San Diego
- Population Neuroscience and Genetics, University of California San Diego
| | - Clare E. Palmer
- Center for Human Development, University of California San Diego
| | - Wesley K. Thompson
- Center for Population Neuroscience and Genetics, Laureate Institute for Brain Research
| | - Anders M. Dale
- Department of Cognitive Science, University of California San Diego
- Center for Multimodal Imaging and Genetics, University of California San Diego School of Medicine
- Department of Radiology, University of California San Diego School of Medicine
- Department of Neuroscience, University of California San Diego School of Medicine
| | - Terry L. Jernigan
- Department of Cognitive Science, University of California San Diego
- Center for Human Development, University of California San Diego
- Department of Radiology, University of California San Diego School of Medicine
- Department of Neuroscience, University of California San Diego School of Medicine
| | - Chun Chieh Fan
- Center for Population Neuroscience and Genetics, Laureate Institute for Brain Research
- Department of Radiology, University of California San Diego School of Medicine
| |
Collapse
|
26
|
Drouet DE, Liu S, Crawford DC. Assessment of multi-population polygenic risk scores for lipid traits in African Americans. PeerJ 2023; 11:e14910. [PMID: 37214096 PMCID: PMC10198155 DOI: 10.7717/peerj.14910] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Accepted: 01/25/2023] [Indexed: 05/24/2023] Open
Abstract
Polygenic risk scores (PRS) based on genome-wide discoveries are promising predictors or classifiers of disease development, severity, and/or progression for common clinical outcomes. A major limitation of most risk scores is the paucity of genome-wide discoveries in diverse populations, prompting an emphasis to generate these needed data for trans-population and population-specific PRS construction. Given diverse genome-wide discoveries are just now being completed, there has been little opportunity for PRS to be evaluated in diverse populations independent from the discovery efforts. To fill this gap, we leverage here summary data from a recent genome-wide discovery study of lipid traits (HDL-C, LDL-C, triglycerides, and total cholesterol) conducted in diverse populations represented by African Americans, Hispanics, Asians, Native Hawaiians, Native Americans, and others by the Population Architecture using Genomics and Epidemiology (PAGE) Study. We constructed lipid trait PRS using PAGE Study published genetic variants and weights in an independent African American adult patient population linked to de-identified electronic health records and genotypes from the Illumina Metabochip (n = 3,254). Using multi-population lipid trait PRS, we assessed levels of association for their respective lipid traits, clinical outcomes (cardiovascular disease and type 2 diabetes), and common clinical labs. While none of the multi-population PRS were strongly associated with the tested trait or outcome, PRSLDL-Cwas nominally associated with cardiovascular disease. These data demonstrate the complexity in applying PRS to real-world clinical data even when data from multiple populations are available.
Collapse
Affiliation(s)
- Domenica E. Drouet
- Department of Medicine, Case Western Reserve University, Cleveland, OH, United States of America
| | - Shiying Liu
- Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, United States of America
| | - Dana C. Crawford
- Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, United States of America
- Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, United States of America
- Genetics and Genome Sciences, Case Western Reserve University, Cleveland, OH, United States of America
| |
Collapse
|
27
|
Atkinson EG. Estimation of cross-ancestry genetic correlations within ancestry tracts of admixed samples. Nat Genet 2023; 55:527-529. [PMID: 36941440 DOI: 10.1038/s41588-023-01325-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/23/2023]
Affiliation(s)
- Elizabeth G Atkinson
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
| |
Collapse
|
28
|
Momin MM, Shin J, Lee S, Truong B, Benyamin B, Lee SH. A method for an unbiased estimate of cross-ancestry genetic correlation using individual-level data. Nat Commun 2023; 14:722. [PMID: 36759513 PMCID: PMC9911789 DOI: 10.1038/s41467-023-36281-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 01/24/2023] [Indexed: 02/11/2023] Open
Abstract
Cross-ancestry genetic correlation is an important parameter to understand the genetic relationship between two ancestry groups. However, existing methods cannot properly account for ancestry-specific genetic architecture, which is diverse across ancestries, producing biased estimates of cross-ancestry genetic correlation. Here, we present a method to construct a genomic relationship matrix (GRM) that can correctly account for the relationship between ancestry-specific allele frequencies and ancestry-specific allelic effects. Through comprehensive simulations, we show that the proposed method outperforms existing methods in the estimations of SNP-based heritability and cross-ancestry genetic correlation. The proposed method is further applied to anthropometric and other complex traits from the UK Biobank data across ancestry groups. For obesity, the estimated genetic correlation between African and European ancestry cohorts is significantly different from unity, suggesting that obesity is genetically heterogenous between these two ancestries.
Collapse
Affiliation(s)
- Md Moksedul Momin
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia
- UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia
- Department of Genetics and Animal Breeding, Faculty of Veterinary Medicine, Chattogram Veterinary and Animal Sciences University (CVASU), Khulshi, Chattogram, 4225, Bangladesh
- South Australian Health and Medical Research Institute (SAHMRI), University of South Australia, Adelaide, SA, 5000, Australia
| | - Jisu Shin
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia
- UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, 22908, USA
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA
| | - Soohyun Lee
- Division of Animal Breeding and Genetics, National Institute of Animal Science (NIAS), Cheonan, South Korea
| | - Buu Truong
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia
| | - Beben Benyamin
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia
- UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia
- South Australian Health and Medical Research Institute (SAHMRI), University of South Australia, Adelaide, SA, 5000, Australia
| | - S Hong Lee
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia.
- UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia.
- South Australian Health and Medical Research Institute (SAHMRI), University of South Australia, Adelaide, SA, 5000, Australia.
| |
Collapse
|
29
|
Lee D, Koo T, Park J, Shin HT. Comparison of Risk Allele Frequencies of Psoriasis-Associated Single-Nucleotide Polymorphisms in Different Population Groups. Ann Dermatol 2023; 35:32-37. [PMID: 36750456 PMCID: PMC9905865 DOI: 10.5021/ad.22.110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Revised: 08/18/2022] [Accepted: 10/12/2022] [Indexed: 01/31/2023] Open
Abstract
BACKGROUND The prevalence of psoriasis differs by population, and it appears to be more common among Europeans than in East Asians. Recent genome-wide association studies (GWAS) have identified alleles that increase the risk of psoriasis, and these alleles may present different frequencies in different geographic regions. OBJECTIVE We aimed to gain insights into the causes of differences in disease frequencies according to populations and the factors affecting prevalence and pattern differences. METHODS We collected a total of 147 psoriasis-associated single-nucleotide polymorphisms (SNPs) from the GWAS catalog and compared the allele frequency differences in 27 populations using public population frequency in the 1000 Genomes Project phase 3 (n=2,504) and the Korean Reference Genome Database (n=1,722). Additionally, we calculated the composited genetic risk scores across the population groups. RESULTS There were distinct patterns of allele frequencies in different population groups. In many cases, East Asians exhibited allele frequencies opposite to that of Europeans. The genetic risk score was higher in Europeans (average: 0.487) and Americans (average: 0.492) than in East Asians (average: 0.471). The prevalence of psoriasis correlated with the average genetic risk score of the population. CONCLUSION We observed a difference in the allele frequencies of psoriasis-associated SNPs between the studied populations. This result suggests that the difference in the prevalence of psoriasis between population groups can be interpreted to some extent by the genotype.
Collapse
Affiliation(s)
- Donghoon Lee
- Department of Dermatology, Veterans Health Service Medical Center, Seoul, Korea
| | - Taehan Koo
- Department of Dermatology, Veterans Health Service Medical Center, Seoul, Korea
| | - Jiho Park
- Oaro Dermatology Clinic, Seoul, Korea
| | - Hyun-Tae Shin
- Department of Dermatology, Inha University College of Medicine, Incheon, Korea.,Research Center for Controlling Intercellular Communication (RCIC), Inha University College of Medicine, Incheon, Korea.
| |
Collapse
|
30
|
Chernoff MB, Delgado D, Tong L, Chen L, Oliva M, Tamayo LI, Best LG, Cole S, Jasmine F, Kibriya MG, Nelson H, Huang L, Haack K, Kent J, Umans JG, Graziano J, Navas-Acien A, Karagas MR, Ahsan H, Pierce BL. Sequencing-based fine-mapping and in silico functional characterization of the 10q24.32 arsenic metabolism efficiency locus across multiple arsenic-exposed populations. PLoS Genet 2023; 19:e1010588. [PMID: 36668670 PMCID: PMC9891528 DOI: 10.1371/journal.pgen.1010588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Revised: 02/01/2023] [Accepted: 12/20/2022] [Indexed: 01/22/2023] Open
Abstract
Inorganic arsenic is highly toxic and carcinogenic to humans. Exposed individuals vary in their ability to metabolize arsenic, and variability in arsenic metabolism efficiency (AME) is associated with risks of arsenic-related toxicities. Inherited genetic variation in the 10q24.32 region, near the arsenic methyltransferase (AS3MT) gene, is associated with urine-based measures of AME in multiple arsenic-exposed populations. To identify potential causal variants in this region, we applied fine mapping approaches to targeted sequencing data generated for exposed individuals from Bangladeshi, American Indian, and European American populations (n = 2,357, 557, and 648 respectively). We identified three independent association signals for Bangladeshis, two for American Indians, and one for European Americans. The size of the confidence sets for each signal varied from 4 to 85 variants. There was one signal shared across all three populations, represented by the same SNP in American Indians and European Americans (rs191177668) and in strong linkage disequilibrium (LD) with a lead SNP in Bangladesh (rs145537350). Beyond this shared signal, differences in LD patterns, minor allele frequency (MAF) (e.g., rs12573221 ~13% in Bangladesh ~0.2% among American Indians), and/or heterogeneity in effect sizes across populations likely contributed to the apparent population specificity of the additional identified signals. One of our potential causal variants influences AS3MT expression and nearby DNA methylation in numerous GTEx tissue types (with rs4919690 as a likely causal variant). Several SNPs in our confidence sets overlap transcription factor binding sites and cis-regulatory elements (from ENCODE). Taken together, our analyses reveal multiple potential causal variants in the 10q24.32 region influencing AME, including a variant shared across populations, and elucidate potential biological mechanisms underlying the impact of genetic variation on AME.
Collapse
Affiliation(s)
- Meytal Batya Chernoff
- Department of Public Health Sciences, University of Chicago, Chicago, Illinois, United States of America
- Interdisciplinary Scientist Training Program, University of Chicago, Chicago, Illinois, United States of America
- University of Chicago Pritzker School of Medicine, Chicago, Illinois, United States of America
| | - Dayana Delgado
- Department of Public Health Sciences, University of Chicago, Chicago, Illinois, United States of America
| | - Lin Tong
- Department of Public Health Sciences, University of Chicago, Chicago, Illinois, United States of America
| | - Lin Chen
- Department of Public Health Sciences, University of Chicago, Chicago, Illinois, United States of America
| | - Meritxell Oliva
- Department of Public Health Sciences, University of Chicago, Chicago, Illinois, United States of America
| | - Lizeth I. Tamayo
- Department of Public Health Sciences, University of Chicago, Chicago, Illinois, United States of America
| | - Lyle G. Best
- Missouri Breaks Industries Research Inc, Eagle Butte, South Dakota, United States of America
| | - Shelley Cole
- Texas Biomedical Research Institute, San Antonio, Texas, United States of America
| | - Farzana Jasmine
- Department of Public Health Sciences, University of Chicago, Chicago, Illinois, United States of America
| | - Muhammad G. Kibriya
- Department of Public Health Sciences, University of Chicago, Chicago, Illinois, United States of America
| | - Heather Nelson
- School of Public Health, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Lei Huang
- Center for Research Informatics, University of Chicago, Chicago, Illinois, United States of America
| | - Karin Haack
- Texas Biomedical Research Institute, San Antonio, Texas, United States of America
| | - Jack Kent
- Texas Biomedical Research Institute, San Antonio, Texas, United States of America
| | - Jason G. Umans
- MedStar Health Research Institute, Hyattsville, Maryland, United States of America
- Georgetown-Howard Universities Center for Clinical and Translational Science, Georgetown University, Washington, District of Columbia, United States of America
| | - Joseph Graziano
- Texas Biomedical Research Institute, San Antonio, Texas, United States of America
- Department of Pharmacology, Columbia University, New York City, New York, United States of America
| | - Ana Navas-Acien
- Mailman School of Public Health, Columbia University, New York City, New York, United States of America
| | - Margaret R. Karagas
- Department of Epidemiology, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, United States of America
| | - Habib Ahsan
- Department of Public Health Sciences, University of Chicago, Chicago, Illinois, United States of America
- Comprehensive Cancer Center, University of Chicago, Chicago, Illinois, United States of America
- Department of Medicine, University of Chicago, Chicago, Illinois, United States of America
| | - Brandon L. Pierce
- Department of Public Health Sciences, University of Chicago, Chicago, Illinois, United States of America
- Comprehensive Cancer Center, University of Chicago, Chicago, Illinois, United States of America
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
| |
Collapse
|
31
|
Guevara-Ramírez P, Cadena-Ullauri S, Ruiz-Pozo VA, Tamayo-Trujillo R, Paz-Cruz E, Simancas-Racines D, Zambrano AK. Genetics, genomics, and diet interactions in obesity in the Latin American environment. Front Nutr 2022; 9:1063286. [PMID: 36532520 PMCID: PMC9751379 DOI: 10.3389/fnut.2022.1063286] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Accepted: 10/25/2022] [Indexed: 08/25/2023] Open
Abstract
Obesity is a chronic disease characterized by abnormal or excessive fat accumulation that could impact an individual's health; moreover, the World Health Organization (WHO) has declared obesity a global epidemic since 1997. In Latin America, in 2016, reports indicated that 24.2% of the adult population was obese. The environmental factor or specific behaviors like dietary intake or physical activity have a vital role in the development of a condition like obesity, but the interaction of genes could contribute to that predisposition. Hence, it is vital to understand the relationship between genes and disease. Indeed, genetics in nutrition studies the genetic variations and their effect on dietary response; while genomics in nutrition studies the role of nutrients in gene expression. The present review represents a compendium of the dietary behaviors in the Latin American environment and the interactions of genes with their single nucleotide polymorphisms (SNPs) associated with obesity, including the risk allele frequencies in the Latin American population. Additionally, a bibliographical selection of several studies has been included; these studies examined the impact that dietary patterns in Latin American environments have on the expression of numerous genes involved in obesity-associated metabolic pathways.
Collapse
Affiliation(s)
- Patricia Guevara-Ramírez
- Centro de Investigación Genética y Genómica, Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| | - Santiago Cadena-Ullauri
- Centro de Investigación Genética y Genómica, Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| | - Viviana A. Ruiz-Pozo
- Centro de Investigación Genética y Genómica, Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| | - Rafael Tamayo-Trujillo
- Centro de Investigación Genética y Genómica, Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| | - Elius Paz-Cruz
- Centro de Investigación Genética y Genómica, Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| | - Daniel Simancas-Racines
- Centro de Investigación en Salud Pública y Epidemiología Clínica (CISPEC), Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| | - Ana Karina Zambrano
- Centro de Investigación Genética y Genómica, Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| |
Collapse
|
32
|
Novembre J, Stein C, Asgari S, Gonzaga-Jauregui C, Landstrom A, Lemke A, Li J, Mighton C, Taylor M, Tishkoff S. Addressing the challenges of polygenic scores in human genetic research. Am J Hum Genet 2022; 109:2095-2100. [PMID: 36459976 PMCID: PMC9808501 DOI: 10.1016/j.ajhg.2022.10.012] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
The genotyping of millions of human samples has made it possible to evaluate variants across the human genome for their possible association with risks for numerous diseases and other traits by using genome-wide association studies (GWASs). The associations between phenotype and genotype found in GWASs make possible the construction of polygenic scores (PGSs), which aim to predict a trait or disease outcome in an individual on the basis of their genotype (in the disease case, the term polygenic risk score [PRS] is often used). PGSs have shown promise for studying the biology of complex traits and as a tool for evaluating individual disease risks in clinical settings. Although the quantity and quality of data to compute PGSs are increasing, challenges remain in the technical aspects of developing PGSs and in the ethical and social issues that might arise from their use. This ASHG Guidance emphasizes three major themes for researchers working with or interested in the application of PGSs in their own research: (1) developing diverse research cohorts; (2) fostering robustness in the development, application, and interpretation of PGSs; and (3) improving the communication of PGS results and their implications to broad audiences.
Collapse
Affiliation(s)
- John Novembre
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Department of Human Genetics, University of Chicago, Chicago, IL, USA,Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA,Corresponding author
| | - Catherine Stein
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, USA,Corresponding author
| | - Samira Asgari
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Claudia Gonzaga-Jauregui
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,International Laboratory for Human Genome Research, Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Juriquilla, Querétaro, México
| | - Andrew Landstrom
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Department of Pediatrics, Division of Cardiology, Duke University School of Medicine, Durham, NC, USA
| | - Amy Lemke
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Norton Children’s Research Institute, affiliated with the University of Louisville School of Medicine, Louisville, KY, USA
| | - Jun Li
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Chloe Mighton
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Genomics Health Services Research Program, St. Michael’s Hospital, Unity Health Toronto, Toronto, ON, Canada,Institute of Health Policy, Management and Evaluation, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Matthew Taylor
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Adult Medical Genetics Program, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Sarah Tishkoff
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Department of Genetics, Center for Global Genomics and Health Equity, University of Pennsylvania, Philadelphia, PA, USA,Department of Biology, Center for Global Genomics and Health Equity, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
33
|
Handelman SK, Puentes YM, Kuppa A, Chen Y, Du X, Feitosa MF, Palmer ND, Speliotes EK. Population-based meta-analysis and gene-set enrichment identifies FXR/RXR pathway as common to fatty liver disease and serum lipids. Hepatol Commun 2022; 6:3120-3131. [PMID: 36098472 PMCID: PMC9592792 DOI: 10.1002/hep4.2066] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 07/15/2022] [Accepted: 07/19/2022] [Indexed: 02/03/2023] Open
Abstract
Nonalcoholic fatty liver disease (NAFLD) is prevalent worldwide. NAFLD is associated with elevated serum triglycerides (TG), low-density lipoprotein cholesterol (LDL), and reduced high-density lipoprotein cholesterol (HDL). Both NAFLD and blood lipid levels are genetically influenced and may share a common genetic etiology. We used genome-wide association studies (GWAS)-ranked genes and gene-set enrichment analysis to identify pathways that affect serum lipids and NAFLD. We identified credible genes in these pathways and characterized missense variants in these for effects on serum traits. We used MAGENTA to identify 58 enriched pathways from publicly available TG, LDL, and HDL GWAS (n = 99,000). Three of these pathways were also enriched for associations with European-ancestry NAFLD GWAS (n = 7176). One pathway, farnesoid X receptor (FXR)/retinoid X receptor (RXR) activation, was replicated for association in an African-ancestry NAFLD GWAS (n = 3214) and plays a role in serum lipids and NAFLD. Credible genes (proteins) in FXR/RXR activation include those associated with cholesterol/bile/bilirubin transport/absorption (ABCC2 (MRP2) [ATP binding cassette subfamily C member (multidrug resistance-associated protein 2)], ABCG5, ABCG8 [ATP-binding cassette (ABC) transporters G5 and G8], APOB (APOB) [apolipoprotein B], FABP6 (ILBP) [fatty acid binding protein 6 (ileal lipid-binding protein)], MTTP (MTP) [microsomal triglyceride transfer protein], SLC4A2 (AE2) [solute carrier family 4 member 2 (anion exchange protein 2)]), nuclear hormone-mediated control of metabolism (NR0B2 (SHP) [nuclear receptor subfamily 0 group B member 2 (small heterodimer partner)], NR1H4 (FXR) [nuclear receptor subfamily 1 group H member 4 (FXR)], PPARA (PPAR) [peroxisome proliferator activated receptor alpha], FOXO1 (FOXO1A) [forkhead box O1]), or other pathways (FETUB (FETUB) [fetuin B]). Missense variants in ABCC2 (MRP2), ABCG5 (ABCG5), ABCG8 (ABCG8), APOB (APOB), MTTP (MTP), NR0B2 (SHP), NR1H4 (FXR), and PPARA (PPAR) that associate with serum LDL levels also associate with serum liver function tests in UK Biobank. Conclusion: Genetic variants in NR1H4 (FXR) that protect against liver steatosis increase serum LDL cholesterol while variants in other members of the family have congruent effects on these traits. Human genetic pathway enrichment analysis can help guide therapeutic development by identifying effective targets for NAFLD/serum lipid manipulation while minimizing side effects. In addition, missense variants could be used in companion diagnostics to determine their influence on drug effectiveness.
Collapse
Affiliation(s)
- Samuel K. Handelman
- Division of Gastroenterology and HepatologyUniversity of Michigan Health SystemAnn ArborMichiganUSA
| | - Yindra M. Puentes
- Division of Gastroenterology and HepatologyUniversity of Michigan Health SystemAnn ArborMichiganUSA
- Department of Computational Medicine and BioinformaticsUniversity of Michigan Medical SchoolAnn ArborMichiganUSA
| | - Annapurna Kuppa
- Division of Gastroenterology and HepatologyUniversity of Michigan Health SystemAnn ArborMichiganUSA
| | - Yanhua Chen
- Division of Gastroenterology and HepatologyUniversity of Michigan Health SystemAnn ArborMichiganUSA
| | - Xiaomeng Du
- Division of Gastroenterology and HepatologyUniversity of Michigan Health SystemAnn ArborMichiganUSA
| | - Mary F. Feitosa
- Division of Statistical Genomics, Department of GeneticsWashington UniversitySt. LouisMissouriUSA
| | - Nicholette D. Palmer
- Department of BiochemistryWake Forest School of MedicineWinston‐SalemNorth CarolinaUSA
| | - Elizabeth K. Speliotes
- Division of Gastroenterology and HepatologyUniversity of Michigan Health SystemAnn ArborMichiganUSA
- Department of Computational Medicine and BioinformaticsUniversity of Michigan Medical SchoolAnn ArborMichiganUSA
| |
Collapse
|
34
|
Qiao J, Shao Z, Wu Y, Zeng P, Wang T. Detecting associated genes for complex traits shared across East Asian and European populations under the framework of composite null hypothesis testing. Lab Invest 2022; 20:424. [PMID: 36138484 PMCID: PMC9503281 DOI: 10.1186/s12967-022-03637-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2022] [Accepted: 09/12/2022] [Indexed: 11/21/2022]
Abstract
Background Detecting trans-ethnic common associated genetic loci can offer important insights into shared genetic components underlying complex diseases/traits across diverse continental populations. However, effective statistical methods for such a goal are currently lacking. Methods By leveraging summary statistics available from global-scale genome-wide association studies, we herein proposed a novel genetic overlap detection method called CONTO (COmposite Null hypothesis test for Trans-ethnic genetic Overlap) from the perspective of high-dimensional composite null hypothesis testing. Unlike previous studies which generally analyzed individual genetic variants, CONTO is a gene-centric method which focuses on a set of genetic variants located within a gene simultaneously and assesses their joint significance with the trait of interest. By borrowing the similar principle of joint significance test (JST), CONTO takes the maximum P value of multiple associations as the significance measurement. Results Compared to JST which is often overly conservative, CONTO is improved in two aspects, including the construction of three-component mixture null distribution and the adjustment of trans-ethnic genetic correlation. Consequently, CONTO corrects the conservativeness of JST with well-calibrated P values and is much more powerful validated by extensive simulation studies. We applied CONTO to discover common associated genes for 31 complex diseases/traits between the East Asian and European populations, and identified many shared trait-associated genes that had otherwise been missed by JST. We further revealed that population-common genes were generally more evolutionarily conserved than population-specific or null ones. Conclusion Overall, CONTO represents a powerful method for detecting common associated genes across diverse ancestral groups; our results provide important implications on the transferability of GWAS discoveries in one population to others. Supplementary Information The online version contains supplementary material available at 10.1186/s12967-022-03637-8.
Collapse
Affiliation(s)
- Jiahao Qiao
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Zhonghe Shao
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Yuxuan Wu
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Ping Zeng
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China. .,Center for Medical Statistics and Data Analysis, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China. .,Key Laboratory of Human Genetics and Environmental Medicine, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China. .,Key Laboratory of Environment and Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China. .,Engineering Research Innovation Center of Biological Data Mining and Healthcare Transformation, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
| | - Ting Wang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
| |
Collapse
|
35
|
Kim D, Justice AE, Chittoor G, Blanco E, Burrows R, Graff M, Howard AG, Wang Y, Rohde R, Buchanan VL, Voruganti VS, Almeida M, Peralta J, Lehman DM, Curran JE, Comuzzie AG, Duggirala R, Blangero J, Albala C, Santos JL, Angel B, Lozoff B, Gahagan S, North KE. Genetic determinants of metabolic biomarkers and their associations with cardiometabolic traits in Hispanic/Latino adolescents. Pediatr Res 2022; 92:563-571. [PMID: 34645953 PMCID: PMC9005573 DOI: 10.1038/s41390-021-01729-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Revised: 06/08/2021] [Accepted: 08/17/2021] [Indexed: 01/05/2023]
Abstract
BACKGROUND Metabolic regulation plays a significant role in energy homeostasis, and adolescence is a crucial life stage for the development of cardiometabolic disease (CMD). This study aims to investigate the genetic determinants of metabolic biomarkers-adiponectin, leptin, ghrelin, and orexin-and their associations with CMD risk factors. METHODS We characterized the genetic determinants of the biomarkers among Hispanic/Latino adolescents of the Santiago Longitudinal Study (SLS) and identified the cumulative effects of genetic variants on adiponectin and leptin using biomarker polygenic risk scores (PRS). We further investigated the direct and indirect effect of the biomarker PRS on downstream body fat percent (BF%) and glycemic traits using structural equation modeling. RESULTS We identified putatively novel genetic variants associated with the metabolic biomarkers. A substantial amount of biomarker variance was explained by SLS-specific PRS, and the prediction was improved by including the putatively novel loci. Fasting blood insulin and insulin resistance were associated with PRS for adiponectin, leptin, and ghrelin, and BF% was associated with PRS for adiponectin and leptin. We found evidence of substantial mediation of these associations by the biomarker levels. CONCLUSIONS The genetic underpinnings of metabolic biomarkers can affect the early development of CMD, partly mediated by the biomarkers. IMPACT This study characterized the genetic underpinnings of four metabolic hormones and investigated their potential influence on adiposity and insulin biology among Hispanic/Latino adolescents. Fasting blood insulin and insulin resistance were associated with polygenic risk score (PRS) for adiponectin, leptin, and ghrelin, with evidence of some degree of mediation by the biomarker levels. Body fat percent (BF%) was also associated with PRS for adiponectin and leptin. This provides important insight on biological mechanisms underlying early metabolic dysfunction and reveals candidates for prevention efforts. Our findings also highlight the importance of ancestrally diverse populations to facilitate valid studies of the genetic architecture of metabolic biomarker levels.
Collapse
Affiliation(s)
- Daeeun Kim
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Anne E Justice
- Department of Population Health Sciences, Geisinger, Danville, PA, USA
| | - Geetha Chittoor
- Department of Population Health Sciences, Geisinger, Danville, PA, USA
| | - Estela Blanco
- Division of Academic General Pediatrics, Child Development and Community Health at the Center for Community Health, University of California at San Diego, San Diego, CA, USA
- Department of Public Health, School of Medicine, Pontificia Universidad Católica de Chile, Santiago, Chile
| | - Raquel Burrows
- Department of Public Health Nutrition, Institute of Nutrition and Food Technology (INTA), University of Chile, Santiago, Chile
| | - Mariaelisa Graff
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Annie Green Howard
- Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Yujie Wang
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Rebecca Rohde
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Victoria L Buchanan
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - V Saroja Voruganti
- Department of Nutrition and Nutrition Research Institute, University of North Carolina at Chapel Hill, Kannapolis, NC, USA
| | - Marcio Almeida
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley, Brownsville, TX, USA
| | - Juan Peralta
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley, Brownsville, TX, USA
| | - Donna M Lehman
- Departments of Medicine and Epidemiology and Biostatistics, University of Texas Health San Antonio, San Antonio, TX, USA
| | - Joanne E Curran
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley, Brownsville, TX, USA
| | | | - Ravindranath Duggirala
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley, Brownsville, TX, USA
| | - John Blangero
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley, Brownsville, TX, USA
| | - Cecilia Albala
- Department of Public Health Nutrition, Institute of Nutrition and Food Technology (INTA), University of Chile, Santiago, Chile
| | - José L Santos
- Department of Nutrition, Diabetes and Metabolism, School of Medicine, Pontificia Universidad Católica de Chile, Santiago, Chile
| | - Bárbara Angel
- Department of Public Health Nutrition, Institute of Nutrition and Food Technology (INTA), University of Chile, Santiago, Chile
| | - Betsy Lozoff
- Department of Pediatrics, University of Michigan, Ann Arbor, MI, USA
| | - Sheila Gahagan
- Division of Academic General Pediatrics, Child Development and Community Health at the Center for Community Health, University of California at San Diego, San Diego, CA, USA
| | - Kari E North
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
| |
Collapse
|
36
|
Markowitz RHG, LaBella AL, Shi M, Rokas A, Capra JA, Ferguson JF, Mosley JD, Bordenstein SR. Microbiome-associated human genetic variants impact phenome-wide disease risk. Proc Natl Acad Sci U S A 2022; 119:e2200551119. [PMID: 35749358 PMCID: PMC9245617 DOI: 10.1073/pnas.2200551119] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Accepted: 04/29/2022] [Indexed: 12/26/2022] Open
Abstract
Human genetic variation associates with the composition of the gut microbiome, yet its influence on clinical traits remains largely unknown. We analyzed the consequences of nearly a thousand gut microbiome-associated variants (MAVs) on phenotypes reported in electronic health records from tens of thousands of individuals. We discovered and replicated associations of MAVs with neurological, metabolic, digestive, and circulatory diseases. Five significant MAVs in these categories correlate with the relative abundance of microbes down to the strain level. We also demonstrate that these relationships are independently observed and concordant with microbe by disease associations reported in case-control studies. Moreover, a selective sweep and population differentiation impacted some disease-linked MAVs. Combined, these findings establish triad relationships among the human genome, microbiome, and disease. Consequently, human genetic influences may offer opportunities for precision diagnostics of microbiome-associated diseases but also highlight the relevance of genetic background for microbiome modulation and therapeutics.
Collapse
Affiliation(s)
- Robert H. George Markowitz
- Vanderbilt Microbiome Innovation Center, Vanderbilt University, Nashville, TN 37232
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37232
| | | | - Mingjian Shi
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37232
| | - John A. Capra
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA 94143
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA 94143
| | - Jane F. Ferguson
- Vanderbilt Microbiome Innovation Center, Vanderbilt University, Nashville, TN 37232
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232
| | - Jonathan D. Mosley
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232
| | - Seth R. Bordenstein
- Vanderbilt Microbiome Innovation Center, Vanderbilt University, Nashville, TN 37232
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37232
- Vanderbilt Institute for Infection, Immunology and Inflammation, Vanderbilt University Medical Center, Nashville, TN 37232
- Department of Pathology, Microbiology, and Immunology, School of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232
| |
Collapse
|
37
|
Diniz IG, Noce RRD, Pereira AP, Silva ANLMD, Sacuena ERP, Lemes RB, Cardoso-Costa GDL, Araújo GS, Machado JLP, Figueiredo FADPL, Hümemeier T, Guerreiro JF. Common BMI and diabetes-related genetic variants: A pilot study among indigenous people in the Brazilian Amazon. Genet Mol Biol 2022; 45:e20210153. [PMID: 35560161 PMCID: PMC9104643 DOI: 10.1590/1678-4685-gmb-2021-0153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Accepted: 12/27/2021] [Indexed: 11/23/2022] Open
Abstract
This study was carried out to investigate the frequency of genetic variants related to body mass index (BMI) and type 2 diabetes (T2D) and evaluating the potential impact of risk alleles on susceptibility to these disorders in six indigenous peoples from Brazilian Amazon region. The majority of Fst values for pairwise population comparisons among the indigenous groups are low or moderate. The indigenous people show high values of differentiation with Africans, Europeans and Southeast Asians and moderate values with East Asian and American populations, as expected. The allelic frequencies among indigenous indicate that the majority of associations observed with T2D in continental populations can be replicated in native Amazonians. The genetic risk scores calculated for T2D in indigenous are high and similar to those calculated for Americans and East Asians, while the estimates obtained for obesity are low, probably due to the low frequencies of the risk allele of the FTO gene found in our samples. ADRB3-rs4994 and ABCC8-rs1799854 genes showed a significant association with BMI and waist circumference, and the KCNJ11-rs5219 gene with hyperglycemia. These results emphasize the importance of knowing the genetic variability underlying complex genetic diseases in indigenous peoples and the search for particular or rare variants.
Collapse
Affiliation(s)
- Isabela Guerreiro Diniz
- Universidade Federal do Pará, Instituto de Ciências Biológicas, Laboratório de Genética Humana e Médica, Belém, PA, Brazil
| | - Rosilene Reis Della Noce
- Universidade Federal do Pará, Instituto de Ciências da Saúde, Faculdade de Nutrição, Belém, PA, Brazil
| | - Ana Paula Pereira
- Universidade Federal do Pará, Instituto de Ciências da Saúde, Faculdade de Nutrição, Belém, PA, Brazil
| | | | | | - Renan Barbosa Lemes
- Universidade de São Paulo, Departamento de Genética e Biologia Evolutiva, São Paulo, SP, Brazil
| | - Greice de Lemos Cardoso-Costa
- Universidade Federal do Pará, Instituto de Ciências Biológicas, Laboratório de Genética Humana e Médica, Belém, PA, Brazil
| | - Gilderlânio Santana Araújo
- Universidade Federal do Pará, Instituto de Ciências Biológicas, Laboratório de Genética Humana e Médica, Belém, PA, Brazil
| | - Jéssica Lígia Picanço Machado
- Universidade Federal do Pará, Instituto de Ciências Biológicas, Laboratório de Genética Humana e Médica, Belém, PA, Brazil
| | | | - Tábita Hümemeier
- Universidade de São Paulo, Departamento de Genética e Biologia Evolutiva, São Paulo, SP, Brazil
| | - João Farias Guerreiro
- Universidade Federal do Pará, Instituto de Ciências Biológicas, Laboratório de Genética Humana e Médica, Belém, PA, Brazil
| |
Collapse
|
38
|
Smith SP, Shahamatdar S, Cheng W, Zhang S, Paik J, Graff M, Haiman C, Matise TC, North KE, Peters U, Kenny E, Gignoux C, Wojcik G, Crawford L, Ramachandran S. Enrichment analyses identify shared associations for 25 quantitative traits in over 600,000 individuals from seven diverse ancestries. Am J Hum Genet 2022; 109:871-884. [PMID: 35349783 PMCID: PMC9118115 DOI: 10.1016/j.ajhg.2022.03.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Accepted: 03/02/2022] [Indexed: 12/12/2022] Open
Abstract
Since 2005, genome-wide association (GWA) datasets have been largely biased toward sampling European ancestry individuals, and recent studies have shown that GWA results estimated from self-identified European individuals are not transferable to non-European individuals because of various confounding challenges. Here, we demonstrate that enrichment analyses that aggregate SNP-level association statistics at multiple genomic scales-from genes to genomic regions and pathways-have been underutilized in the GWA era and can generate biologically interpretable hypotheses regarding the genetic basis of complex trait architecture. We illustrate examples of the robust associations generated by enrichment analyses while studying 25 continuous traits assayed in 566,786 individuals from seven diverse self-identified human ancestries in the UK Biobank and the Biobank Japan as well as 44,348 admixed individuals from the PAGE consortium including cohorts of African American, Hispanic and Latin American, Native Hawaiian, and American Indian/Alaska Native individuals. We identify 1,000 gene-level associations that are genome-wide significant in at least two ancestry cohorts across these 25 traits as well as highly conserved pathway associations with triglyceride levels in European, East Asian, and Native Hawaiian cohorts.
Collapse
Affiliation(s)
- Samuel Pattillo Smith
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA; Department of Ecology, Evolution, and Organismal Biology, Brown University, Providence, RI 02912, USA
| | - Sahar Shahamatdar
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA; Department of Ecology, Evolution, and Organismal Biology, Brown University, Providence, RI 02912, USA
| | - Wei Cheng
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA; Department of Ecology, Evolution, and Organismal Biology, Brown University, Providence, RI 02912, USA
| | - Selena Zhang
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA
| | - Joseph Paik
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA
| | - Misa Graff
- Department of Epidemiology, University of North Carolina, Chapel Hill, Chapel Hill, NC 27599, USA
| | - Christopher Haiman
- Department of Preventative Medicine, University of Southern California, Los Angeles, CA 90089, USA
| | - T C Matise
- Department of Genetics, Rutgers University, Piscataway, NJ 08854, USA
| | - Kari E North
- Department of Epidemiology, University of North Carolina, Chapel Hill, Chapel Hill, NC 27599, USA
| | - Ulrike Peters
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Eimear Kenny
- The Center for Genomic Health, Icahn School of Medicine at Mount Sinai, New York City, NY 10029, USA; The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York City, NY 10029, USA; Department of Medicine, Icahn School of Medicine at Mount Sinai, New York City, NY 10029, USA; Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York City, NY 10029, USA
| | - Chris Gignoux
- Division of Biomedical Informatics and Personalized Medicine, University of Colorado, Denver, CO 80204, USA
| | - Genevieve Wojcik
- Department of Epidemiology, Johns Hopkins University, Baltimore, MD 21287, USA
| | - Lorin Crawford
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA; Department of Biostatistics, Brown University, Providence, RI 02906, USA; Microsoft Research New England, Cambridge, MA 02142, USA
| | - Sohini Ramachandran
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA; Department of Ecology, Evolution, and Organismal Biology, Brown University, Providence, RI 02912, USA; Data Science Initiative, Brown University, Providence, RI 02912, USA.
| |
Collapse
|
39
|
Carlson MO, Rice DP, Berg JJ, Steinrücken M. Polygenic score accuracy in ancient samples: Quantifying the effects of allelic turnover. PLoS Genet 2022; 18:e1010170. [PMID: 35522704 PMCID: PMC9116686 DOI: 10.1371/journal.pgen.1010170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Revised: 05/18/2022] [Accepted: 03/26/2022] [Indexed: 11/19/2022] Open
Abstract
Polygenic scores link the genotypes of ancient individuals to their phenotypes, which are often unobservable, offering a tantalizing opportunity to reconstruct complex trait evolution. In practice, however, interpretation of ancient polygenic scores is subject to numerous assumptions. For one, the genome-wide association (GWA) studies from which polygenic scores are derived, can only estimate effect sizes for loci segregating in contemporary populations. Therefore, a GWA study may not correctly identify all loci relevant to trait variation in the ancient population. In addition, the frequencies of trait-associated loci may have changed in the intervening years. Here, we devise a theoretical framework to quantify the effect of this allelic turnover on the statistical properties of polygenic scores as functions of population genetic dynamics, trait architecture, power to detect significant loci, and the age of the ancient sample. We model the allele frequencies of loci underlying trait variation using the Wright-Fisher diffusion, and employ the spectral representation of its transition density to find analytical expressions for several error metrics, including the expected sample correlation between the polygenic scores of ancient individuals and their true phenotypes, referred to as polygenic score accuracy. Our theory also applies to a two-population scenario and demonstrates that allelic turnover alone may explain a substantial percentage of the reduced accuracy observed in cross-population predictions, akin to those performed in human genetics. Finally, we use simulations to explore the effects of recent directional selection, a bias-inducing process, on the statistics of interest. We find that even in the presence of bias, weak selection induces minimal deviations from our neutral expectations for the decay of polygenic score accuracy. By quantifying the limitations of polygenic scores in an explicit evolutionary context, our work lays the foundation for the development of more sophisticated statistical procedures to analyze both temporally and geographically resolved polygenic scores.
Collapse
Affiliation(s)
- Maryn O. Carlson
- Committee on Genetics, Genomics, & Systems Biology, University of Chicago, Chicago, Illinois, United States of America
| | - Daniel P. Rice
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
| | - Jeremy J. Berg
- Committee on Genetics, Genomics, & Systems Biology, University of Chicago, Chicago, Illinois, United States of America
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
| | - Matthias Steinrücken
- Committee on Genetics, Genomics, & Systems Biology, University of Chicago, Chicago, Illinois, United States of America
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
- Department of Ecology & Evolution, University of Chicago, Chicago, Illinois, United States of America
| |
Collapse
|
40
|
Fernández-Rhodes L, Graff M, Buchanan VL, Justice AE, Highland HM, Guo X, Zhu W, Chen HH, Young KL, Adhikari K, Palmer ND, Below JE, Bradfield J, Pereira AC, Glover L, Kim D, Lilly AG, Shrestha P, Thomas AG, Zhang X, Chen M, Chiang CW, Pulit S, Horimoto A, Krieger JE, Guindo-Martínez M, Preuss M, Schumann C, Smit RA, Torres-Mejía G, Acuña-Alonzo V, Bedoya G, Bortolini MC, Canizales-Quinteros S, Gallo C, González-José R, Poletti G, Rothhammer F, Hakonarson H, Igo R, Adler SG, Iyengar SK, Nicholas SB, Gogarten SM, Isasi CR, Papnicolaou G, Stilp AM, Qi Q, Kho M, Smith JA, Langefeld CD, Wagenknecht L, Mckean-Cowdin R, Gao XR, Nousome D, Conti DV, Feng Y, Allison MA, Arzumanyan Z, Buchanan TA, Ida Chen YD, Genter PM, Goodarzi MO, Hai Y, Hsueh W, Ipp E, Kandeel FR, Lam K, Li X, Nadler JL, Raffel LJ, Roll K, Sandow K, Tan J, Taylor KD, Xiang AH, Yao J, Audirac-Chalifour A, de Jesus Peralta Romero J, Hartwig F, Horta B, Blangero J, Curran JE, Duggirala R, Lehman DE, Puppala S, Fejerman L, John EM, Aguilar-Salinas C, Burtt NP, Florez JC, García-Ortíz H, González-Villalpando C, Mercader J, Orozco L, Tusié-Luna T, Blanco E, Gahagan S, Cox NJ, Hanis C, et alFernández-Rhodes L, Graff M, Buchanan VL, Justice AE, Highland HM, Guo X, Zhu W, Chen HH, Young KL, Adhikari K, Palmer ND, Below JE, Bradfield J, Pereira AC, Glover L, Kim D, Lilly AG, Shrestha P, Thomas AG, Zhang X, Chen M, Chiang CW, Pulit S, Horimoto A, Krieger JE, Guindo-Martínez M, Preuss M, Schumann C, Smit RA, Torres-Mejía G, Acuña-Alonzo V, Bedoya G, Bortolini MC, Canizales-Quinteros S, Gallo C, González-José R, Poletti G, Rothhammer F, Hakonarson H, Igo R, Adler SG, Iyengar SK, Nicholas SB, Gogarten SM, Isasi CR, Papnicolaou G, Stilp AM, Qi Q, Kho M, Smith JA, Langefeld CD, Wagenknecht L, Mckean-Cowdin R, Gao XR, Nousome D, Conti DV, Feng Y, Allison MA, Arzumanyan Z, Buchanan TA, Ida Chen YD, Genter PM, Goodarzi MO, Hai Y, Hsueh W, Ipp E, Kandeel FR, Lam K, Li X, Nadler JL, Raffel LJ, Roll K, Sandow K, Tan J, Taylor KD, Xiang AH, Yao J, Audirac-Chalifour A, de Jesus Peralta Romero J, Hartwig F, Horta B, Blangero J, Curran JE, Duggirala R, Lehman DE, Puppala S, Fejerman L, John EM, Aguilar-Salinas C, Burtt NP, Florez JC, García-Ortíz H, González-Villalpando C, Mercader J, Orozco L, Tusié-Luna T, Blanco E, Gahagan S, Cox NJ, Hanis C, Butte NF, Cole SA, Comuzzie AG, Voruganti VS, Rohde R, Wang Y, Sofer T, Ziv E, Grant SF, Ruiz-Linares A, Rotter JI, Haiman CA, Parra EJ, Cruz M, Loos RJ, North KE. Ancestral diversity improves discovery and fine-mapping of genetic loci for anthropometric traits-The Hispanic/Latino Anthropometry Consortium. HGG ADVANCES 2022; 3:100099. [PMID: 35399580 PMCID: PMC8990175 DOI: 10.1016/j.xhgg.2022.100099] [Show More Authors] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2021] [Accepted: 03/06/2022] [Indexed: 02/05/2023] Open
Abstract
Hispanic/Latinos have been underrepresented in genome-wide association studies (GWAS) for anthropometric traits despite their notable anthropometric variability, ancestry proportions, and high burden of growth stunting and overweight/obesity. To address this knowledge gap, we analyzed densely imputed genetic data in a sample of Hispanic/Latino adults to identify and fine-map genetic variants associated with body mass index (BMI), height, and BMI-adjusted waist-to-hip ratio (WHRadjBMI). We conducted a GWAS of 18 studies/consortia as part of the Hispanic/Latino Anthropometry (HISLA) Consortium (stage 1, n = 59,771) and generalized our findings in 9 additional studies (stage 2, n = 10,538). We conducted a trans-ancestral GWAS with summary statistics from HISLA stage 1 and existing consortia of European and African ancestries. In our HISLA stage 1 + 2 analyses, we discovered one BMI locus, as well as two BMI signals and another height signal each within established anthropometric loci. In our trans-ancestral meta-analysis, we discovered three BMI loci, one height locus, and one WHRadjBMI locus. We also identified 3 secondary signals for BMI, 28 for height, and 2 for WHRadjBMI in established loci. We show that 336 known BMI, 1,177 known height, and 143 known WHRadjBMI (combined) SNPs demonstrated suggestive transferability (nominal significance and effect estimate directional consistency) in Hispanic/Latino adults. Of these, 36 BMI, 124 height, and 11 WHRadjBMI SNPs were significant after trait-specific Bonferroni correction. Trans-ancestral meta-analysis of the three ancestries showed a small-to-moderate impact of uncorrected population stratification on the resulting effect size estimates. Our findings demonstrate that future studies may also benefit from leveraging diverse ancestries and differences in linkage disequilibrium patterns to discover novel loci and additional signals with less residual population stratification.
Collapse
Affiliation(s)
- Lindsay Fernández-Rhodes
- Department of Biobehavioral Health, Pennsylvania State University, 219 Biobehavioral Health Building, University Park, PA 16802, USA
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Mariaelisa Graff
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Victoria L. Buchanan
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Anne E. Justice
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Department of Biomedical and Translational Informatics, Geisinger Health System, Danville, PA 17822, USA
| | - Heather M. Highland
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Xiuqing Guo
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502 USA
| | - Wanying Zhu
- Vanderbilt Genetics Institute, Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Hung-Hsin Chen
- Vanderbilt Genetics Institute, Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Kristin L. Young
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Kaustubh Adhikari
- School of Mathematics and Statistics, Faculty of Science, Technology, Engineering and Mathematics, The Open University, MK7 6AA Milton Keynes, UK
| | - Nicholette D. Palmer
- Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC 27101, USA
| | - Jennifer E. Below
- Vanderbilt Genetics Institute, Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Jonathan Bradfield
- Center for Applied Genomics, Division of Human Genetics, Department of Pediatrics, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Alexandre C. Pereira
- Laboratory of Genetics and Molecular Cardiology, Heart Institute, University of São Paulo, São Paulo 05508-220, Brazil
| | - LáShauntá Glover
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Daeeun Kim
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Adam G. Lilly
- Department of Sociology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Carolina Population Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Poojan Shrestha
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Division of Pediatric and Public Health, Adams School of Dentistry, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Alvin G. Thomas
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Xinruo Zhang
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Minhui Chen
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Charleston W.K. Chiang
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90007, USA
| | - Sara Pulit
- Vertex Pharmaceuticals, W2 6BD Oxford, UK
| | - Andrea Horimoto
- Laboratory of Genetics and Molecular Cardiology, Heart Institute, University of São Paulo, São Paulo 05508-220, Brazil
| | - Jose E. Krieger
- Laboratory of Genetics and Molecular Cardiology, Heart Institute, University of São Paulo, São Paulo 05508-220, Brazil
| | - Marta Guindo-Martínez
- The Charles Bronfman Institutes for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- The Novo Nordisk Center for Basic Metabolic Research, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Michael Preuss
- The Charles Bronfman Institutes for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Claudia Schumann
- Hasso Plattner Institute, University of Potsdam, Digital Health Center, 14482 Potsdam, Germany
| | - Roelof A.J. Smit
- The Charles Bronfman Institutes for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Gabriela Torres-Mejía
- Department of Research in Cardiovascular Diseases, Diabetes Mellitus, and Cancer, Population Health Research Center, National Institute of Public Health, Cuernavaca, Morelos 62100, Mexico
| | | | - Gabriel Bedoya
- Molecular Genetics Investigation Group, University of Antioquia, Medellín 1226, Colombia
| | - Maria-Cátira Bortolini
- Department of Genetics, Federal University of Rio Grande do Sul, Porto Alegre 90040-060, Brazil
| | - Samuel Canizales-Quinteros
- Population Genomics Applied to Health Unit, The National Institute of Genomic Medicine and the Faculty of Chemistry at the National Autonomous University of Mexico, Mexico City 04510, Mexico
| | - Carla Gallo
- Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima 15102, Peru
| | - Rolando González-José
- Patagonian Institute of the Social and Human Sciences, Patagonian National Center, Puerto Madryn U9120, Argentina
| | - Giovanni Poletti
- Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima 15102, Peru
| | | | - Hakon Hakonarson
- Center for Applied Genomics, Division of Human Genetics, Department of Pediatrics, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Robert Igo
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Sharon G. Adler
- Division of Nephrology and Hypertension, Harbor-University of California Los Angeles Medical Center, Torrance, CA 90502, USA
| | - Sudha K. Iyengar
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Susanne B. Nicholas
- Department of Medicine, David Geffen School of Medicine at University of California, Los Angeles, CA 90095, USA
| | | | - Carmen R. Isasi
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | | | - Adrienne M. Stilp
- Department of Biostatistics, University of Washington, Seattle, WA 98195, USA
| | - Qibin Qi
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | - Minjung Kho
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI 48109, USA
| | - Jennifer A. Smith
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI 48109, USA
| | - Carl D. Langefeld
- Department of Biostatistics and Data Science, Wake Forest School of Medicine, Winston-Salem, NC 27101, USA
| | - Lynne Wagenknecht
- Division of Public Health Sciences, Wake Forest School of Medicine, Winston-Salem, NC 27101, USA
| | - Roberta Mckean-Cowdin
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90032, USA
| | - Xiaoyi Raymond Gao
- Department of Ophthalmology and Visual Sciences, Department of Biomedical Informatics, Division of Human Genetics, The Ohio State University, Columbus, OH 43210, USA
| | - Darryl Nousome
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90032, USA
| | - David V. Conti
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Ye Feng
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90032, USA
| | - Matthew A. Allison
- Department of Family Medicine, University of California, San Diego, CA 92161, USA
| | - Zorayr Arzumanyan
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502 USA
| | - Thomas A. Buchanan
- Department of Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Yii-Der Ida Chen
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502 USA
| | - Pauline M. Genter
- Department of Medicine, Division of Endocrinology, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502, USA
| | - Mark O. Goodarzi
- Division of Endocrinology, Diabetes, and Metabolism, Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA
| | - Yang Hai
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502 USA
| | - Willa Hsueh
- Department of Internal Medicine, The Ohio State University Wexner Medical Center, Columbus, OH 43210, USA
| | - Eli Ipp
- Department of Medicine, David Geffen School of Medicine at University of California, Los Angeles, CA 90095, USA
- Department of Medicine, Division of Endocrinology, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502, USA
| | - Fouad R. Kandeel
- Department of Translational Research & Cellular Therapeutics, Beckman Research Institute of City of Hope, Duarte, CA 91010, USA
| | - Kelvin Lam
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502 USA
| | - Xiaohui Li
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502 USA
| | - Jerry L. Nadler
- Department of Pharmacology at New York Medical College School of Medicine, Valhalla, NY 10595, USA
| | - Leslie J. Raffel
- Division of Genetic and Genomic Medicine, Department of Pediatrics, University of California, Irvine, CA 92697, USA
| | - Kathryn Roll
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502 USA
| | - Kevin Sandow
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502 USA
| | - Jingyi Tan
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502 USA
| | - Kent D. Taylor
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502 USA
| | - Anny H. Xiang
- Research and Evaluation Branch, Kaiser Permanente of Southern California, Pasadena, CA 91101, USA
| | - Jie Yao
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502 USA
| | - Astride Audirac-Chalifour
- Medical Research Unit in Biochemistry, Specialty Hospital, National Medical Center of the Twenty-First Century, Mexican Institute of Social Security, Mexico City 06725, Mexico
| | - Jose de Jesus Peralta Romero
- Medical Research Unit in Biochemistry, Specialty Hospital, National Medical Center of the Twenty-First Century, Mexican Institute of Social Security, Mexico City 06725, Mexico
| | - Fernando Hartwig
- Postgraduate Program in Epidemiology, Federal University of Pelotas, Pelotas 96010-610, Brazil
| | - Bernando Horta
- Postgraduate Program in Epidemiology, Federal University of Pelotas, Pelotas 96010-610, Brazil
| | - John Blangero
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, School of Medicine, University of Texas Rio Grande Valley, Brownsville and Edinburg, TX 78520 and 78539, USA
| | - Joanne E. Curran
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, School of Medicine, University of Texas Rio Grande Valley, Brownsville and Edinburg, TX 78520 and 78539, USA
| | - Ravindranath Duggirala
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, School of Medicine, University of Texas Rio Grande Valley, Brownsville and Edinburg, TX 78520 and 78539, USA
| | - Donna E. Lehman
- Department of Medicine, School of Medicine, University of Texas Health San Antonio, San Antonio, TX 78229, USA
| | - Sobha Puppala
- Department of Internal Medicine, Section of Molecular Medicine, Wake Forest School of Medicine, Winston-Salem, NC 27109, USA
| | - Laura Fejerman
- Department of Public Health Sciences, School of Medicine, and the Comprehensive Cancer Center, University of California Davis, Davis, CA 95616, USA
| | - Esther M. John
- Departments of Epidemiology & Population Health and Medicine-Oncology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Carlos Aguilar-Salinas
- Division of Nutrition, Salvador Zubirán National Institute of Health Sciences and Nutrition, Mexico City 14080, Mexico
| | - Noël P. Burtt
- Programs in Metabolism and Medical and Population Genetics, Broad Institute of the Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - Jose C. Florez
- Programs in Metabolism and Medical and Population Genetics, Broad Institute of the Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
- Department of Medicine, Harvard Medical School, Boston, MA 02115, USA
- Diabetes Unit and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Humberto García-Ortíz
- Laboratory of Immunogenomics and Metabolic Diseases, National Institute of Genomic Medicine, Mexico City 14610, Mexico
| | - Clicerio González-Villalpando
- Center for Diabetes Studies, Research Unit for Diabetes and Cardiovascular Risk, Center for Population Health Studies, National Institute of Public Health, Mexico City 14080, Mexico
| | - Josep Mercader
- Programs in Metabolism and Medical and Population Genetics, Broad Institute of the Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
- Department of Medicine, Harvard Medical School, Boston, MA 02115, USA
- Diabetes Unit and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Lorena Orozco
- Laboratory of Immunogenomics and Metabolic Diseases, National Institute of Genomic Medicine, Mexico City 14610, Mexico
| | - Teresa Tusié-Luna
- Molecular Biology and Medical Genomics Unity, Institute of Biomedical Research, The National Autonomous University of Mexico and the Salvador Zubirán National Institute of Health Sciences and Nutrition, Mexico City 14080, Mexico
| | - Estela Blanco
- Center for Community Health, Division of Academic General Pediatrics, University of California at San Diego, San Diego, CA 92093, USA
| | - Sheila Gahagan
- Center for Community Health, Division of Academic General Pediatrics, University of California at San Diego, San Diego, CA 92093, USA
| | - Nancy J. Cox
- Vanderbilt Genetics Institute, Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Craig Hanis
- University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Nancy F. Butte
- United States Department of Agriculture, Agricultural Research Service, The Children’s Nutrition Research Center, and the Department Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Shelley A. Cole
- Population Health Program, Texas Biomedical Research Institute, San Antonio, TX 78227, USA
| | | | - V. Saroja Voruganti
- Department of Nutrition and Nutrition Research Institute, University of North Carolina at Chapel Hill, Kannapolis, NC 28081, USA
| | - Rebecca Rohde
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Yujie Wang
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Tamar Sofer
- Diabetes Unit and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, MA 02115, USA
| | - Elad Ziv
- Division of General Internal Medicine, Department of Medicine, Helen Diller Family Comprehensive Cancer Center, Institute for Human Genetics, University of California, San Francisco, San Francisco, CA 94115, USA
| | - Struan F.A. Grant
- Center for Applied Genomics, Division of Human Genetics, Department of Pediatrics, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Andres Ruiz-Linares
- Ministry of Education Key Laboratory of Contemporary Anthropology and Collaborative Innovation Center of Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai 200438, China
- Department of Genetics, Evolution and Environment, and Genetics Institute of the University College London, London WC1E 6BT, UK
- Laboratory of Biocultural Anthropology, Law, Ethics, and Health, Aix-Marseille University, Marseille 13385, France
| | - Jerome I. Rotter
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502 USA
| | - Christopher A. Haiman
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Esteban J. Parra
- Department of Anthropology, University of Toronto- Mississauga, Mississauga, ON L5L 1C6, Canada
| | - Miguel Cruz
- Medical Research Unit in Biochemistry, Specialty Hospital, National Medical Center of the Twenty-First Century, Mexican Institute of Social Security, Mexico City 06725, Mexico
| | - Ruth J.F. Loos
- The Charles Bronfman Institutes for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Kari E. North
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, NC 27514, USA
| |
Collapse
|
41
|
Sapoval N, Aghazadeh A, Nute MG, Antunes DA, Balaji A, Baraniuk R, Barberan CJ, Dannenfelser R, Dun C, Edrisi M, Elworth RAL, Kille B, Kyrillidis A, Nakhleh L, Wolfe CR, Yan Z, Yao V, Treangen TJ. Current progress and open challenges for applying deep learning across the biosciences. Nat Commun 2022; 13:1728. [PMID: 35365602 PMCID: PMC8976012 DOI: 10.1038/s41467-022-29268-7] [Citation(s) in RCA: 112] [Impact Index Per Article: 37.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Accepted: 03/09/2022] [Indexed: 11/19/2022] Open
Abstract
Deep Learning (DL) has recently enabled unprecedented advances in one of the grand challenges in computational biology: the half-century-old problem of protein structure prediction. In this paper we discuss recent advances, limitations, and future perspectives of DL on five broad areas: protein structure prediction, protein function prediction, genome engineering, systems biology and data integration, and phylogenetic inference. We discuss each application area and cover the main bottlenecks of DL approaches, such as training data, problem scope, and the ability to leverage existing DL architectures in new contexts. To conclude, we provide a summary of the subject-specific and general challenges for DL across the biosciences.
Collapse
Affiliation(s)
- Nicolae Sapoval
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Amirali Aghazadeh
- Department of Electrical Engineering and Computer Sciences, University of California Berkeley, Berkeley, CA, USA
| | - Michael G Nute
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Dinler A Antunes
- Department of Biology and Biochemistry, University of Houston, Houston, TX, USA
| | - Advait Balaji
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Richard Baraniuk
- Department of Electrical and Computer Engineering, Rice University, Houston, TX, USA
| | - C J Barberan
- Department of Electrical and Computer Engineering, Rice University, Houston, TX, USA
| | | | - Chen Dun
- Department of Computer Science, Rice University, Houston, TX, USA
| | | | - R A Leo Elworth
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Bryce Kille
- Department of Computer Science, Rice University, Houston, TX, USA
| | | | - Luay Nakhleh
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Cameron R Wolfe
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Zhi Yan
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Vicky Yao
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Todd J Treangen
- Department of Computer Science, Rice University, Houston, TX, USA.
- Department of Bioengineering, Rice University, Houston, TX, USA.
| |
Collapse
|
42
|
Haas CB, Hsu L, Lampe JW, Wernli KJ, Lindström S. Cross-ancestry Genome-wide Association Studies of Sex Hormone Concentrations in Pre- and Postmenopausal Women. Endocrinology 2022; 163:bqac020. [PMID: 35192695 PMCID: PMC8962449 DOI: 10.1210/endocr/bqac020] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Indexed: 11/19/2022]
Abstract
OBJECTIVE Concentrations of circulating sex hormones have been associated with a variety of diseases in women and are strongly influenced by menopausal status. We investigated the genetic architectures of circulating concentrations of estradiol, testosterone, and SHBG by menopausal status in women of European and African ancestry. METHODS Using data on 229 966 women from the UK Biobank, we conducted genome-wide association studies (GWASs) of circulating concentrations of estradiol, testosterone, and SHBG in premenopausal and postmenopausal women. We tested for evidence of heterogeneity of genetic effects by menopausal status and genetic ancestry. We conducted gene-based enrichment analyses to identify tissues in which genes with GWAS-enriched signals were expressed. RESULTS We identified 4 loci (5q35.2, 12q14.3, 19q13.42, 20p12.3) that were associated with detectable concentrations of estradiol in both pre- and postmenopausal women of European ancestry. Heterogeneity analysis identified 1 locus for testosterone (7q22.1) in the CYP3A7 gene and 1 locus that was strongly associated with concentrations of SHBG in premenopausal women only (10q15.1) near the AKR1C4 gene. Gene-based analysis of testosterone revealed evidence of enrichment of GWAS signals in genes expressed in adipose tissue for postmenopausal women. We did not find any evidence of ancestry-specific genetic effects for concentrations of estradiol, testosterone, or SHBG. CONCLUSIONS We identified specific loci that showed genome-wide significant evidence of heterogeneity by menopausal status for testosterone and SHBG. We also observed support for a more prominent role of genetic variants located near genes expressed in adipose tissue in determining testosterone concentrations among postmenopausal women.
Collapse
Affiliation(s)
- Cameron B Haas
- Department of Epidemiology, University of Washington, Seattle, WA 98195, USA
| | - Li Hsu
- Department of Biostatistics, University of Washington, Seattle, WA 98195, USA
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Johanna W Lampe
- Department of Epidemiology, University of Washington, Seattle, WA 98195, USA
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Karen J Wernli
- Kaiser Permanente Washington Health Research Institute, Seattle, WA 98101, USA
| | - Sara Lindström
- Department of Epidemiology, University of Washington, Seattle, WA 98195, USA
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| |
Collapse
|
43
|
Raben TG, Lello L, Widen E, Hsu SDH. From Genotype to Phenotype: Polygenic Prediction of Complex Human Traits. Methods Mol Biol 2022; 2467:421-446. [PMID: 35451785 DOI: 10.1007/978-1-0716-2205-6_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Decoding the genome confers the capability to predict characteristics of the organism (phenotype) from DNA (genotype). We describe the present status and future prospects of genomic prediction of complex traits in humans. Some highly heritable complex phenotypes such as height and other quantitative traits can already be predicted with reasonable accuracy from DNA alone. For many diseases, including important common conditions such as coronary artery disease, breast cancer, type I and II diabetes, individuals with outlier polygenic scores (e.g., top few percent) have been shown to have 5 or even 10 times higher risk than average. Several psychiatric conditions such as schizophrenia and autism also fall into this category. We discuss related topics such as the genetic architecture of complex traits, sibling validation of polygenic scores, and applications to adult health, in vitro fertilization (embryo selection), and genetic engineering.
Collapse
Affiliation(s)
| | - Louis Lello
- Michigan State University, East Lansing, MI, USA
- Genomic Prediction, North Brunswick, NJ, USA
| | - Erik Widen
- Michigan State University, East Lansing, MI, USA
| | - Stephen D H Hsu
- Michigan State University, East Lansing, MI, USA.
- Genomic Prediction, North Brunswick, NJ, USA.
| |
Collapse
|
44
|
Wedekind LE, Mitchell CM, Andersen CC, Knowler WC, Hanson RL. Epidemiology of Type 2 Diabetes in Indigenous Communities in the United States. Curr Diab Rep 2021; 21:47. [PMID: 34807308 PMCID: PMC8665733 DOI: 10.1007/s11892-021-01406-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/21/2021] [Indexed: 11/24/2022]
Abstract
PURPOSE OF REVIEW The present review focuses on the epidemiology of type 2 diabetes (T2D) in Indigenous communities in the continental United States (U.S.)-including disease prevention and management-and discusses special considerations in conducting research with Indigenous communities. RECENT FINDINGS Previous studies have reported the disparately high prevalence of diabetes, especially T2D, among Indigenous peoples in the U.S. The high prevalence and incidence of early-onset T2D in Indigenous youth relative to that of all youth in the U.S. population pose challenges to the prevention of complications of diabetes. Behavioral, dietary, lifestyle, and genetic factors associated with T2D in Indigenous communities are often investigated. More limited is the discussion of the historical and ongoing consequences of colonization and displacement that impact the aforementioned risk factors. Future research is necessary to assess community-specific needs with respect to diabetes prevention and management across the diversity of Indigenous communities in the U.S.
Collapse
Affiliation(s)
- Lauren E Wedekind
- Phoenix Epidemiology and Clinical Research Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, 1550 East Indian School Road, Phoenix, AZ, 85014, USA
- Nuffield Department of Medicine, University of Oxford, Old Road Campus, Oxford, OX3 7LF, UK
| | - Cassie M Mitchell
- Phoenix Epidemiology and Clinical Research Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, 1550 East Indian School Road, Phoenix, AZ, 85014, USA
| | - Coley C Andersen
- Phoenix Epidemiology and Clinical Research Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, 1550 East Indian School Road, Phoenix, AZ, 85014, USA
| | - William C Knowler
- Phoenix Epidemiology and Clinical Research Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, 1550 East Indian School Road, Phoenix, AZ, 85014, USA
| | - Robert L Hanson
- Phoenix Epidemiology and Clinical Research Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, 1550 East Indian School Road, Phoenix, AZ, 85014, USA.
| |
Collapse
|
45
|
Onyishi CU, May RC. Human immune polymorphisms associated with the risk of cryptococcal disease. Immunology 2021; 165:143-157. [PMID: 34716931 PMCID: PMC9426616 DOI: 10.1111/imm.13425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Revised: 10/07/2021] [Accepted: 10/16/2021] [Indexed: 11/29/2022] Open
Abstract
Cryptococcus neoformans is an opportunistic fungal pathogen that can cause lethal cryptococcal meningitis in immunocompromised individuals such as those with HIV/AIDS. In addition, cryptococcal infections occasionally arise in immunocompetent individuals or those with previously undiagnosed immunodeficiencies. The course of cryptococcosis is highly variable in both patient groups, and there is rapidly growing evidence that genetic polymorphisms may have a significant impact on the trajectory of disease. Here, we review what is currently known about the nature of these polymorphisms and their impact on host response to C. neoformans infection. Thus far, polymorphisms in Fc gamma receptors, mannose‐binding lectin, Dectin‐2, Toll‐like receptors and macrophage colony‐stimulating factor have been associated with susceptibility to cryptococcal disease. Notably, however, in some cases the impact of these polymorphisms depends on the genetic background of the population; for example, the FCGR3A 158 F/V polymorphism was associated with an increased risk of cryptococcal disease in both HIV‐positive and HIV‐negative white populations, but not in Han Chinese patients. In most cases, the precise mechanism by which the identified polymorphisms influence disease progression remains unclear, although impaired fungal recognition and phagocytosis by innate immune cells appears to play a major role. Finally, we highlight outstanding questions in the field and emphasize the need for future research to include more diverse populations in their genetic association studies.
Collapse
Affiliation(s)
- Chinaemerem U Onyishi
- Institute of Microbiology & Infection, School of Biosciences, University of Birmingham, Birmingham, UK
| | - Robin C May
- Institute of Microbiology & Infection, School of Biosciences, University of Birmingham, Birmingham, UK
| |
Collapse
|
46
|
van der Laan CM, Morosoli-García JJ, van de Weijer SGA, Colodro-Conde L, Lupton MK, Mitchell BL, McAloney K, Parker R, Burns JM, Hickie IB, Pool R, Hottenga JJ, Martin NG, Medland SE, Nivard MG, Boomsma DI. Continuity of Genetic Risk for Aggressive Behavior Across the Life-Course. Behav Genet 2021; 51:592-606. [PMID: 34390460 PMCID: PMC8390412 DOI: 10.1007/s10519-021-10076-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2021] [Accepted: 06/23/2021] [Indexed: 11/24/2022]
Abstract
We test whether genetic influences that explain individual differences in aggression in early life also explain individual differences across the life-course. In two cohorts from The Netherlands (N = 13,471) and Australia (N = 5628), polygenic scores (PGSs) were computed based on a genome-wide meta-analysis of childhood/adolescence aggression. In a novel analytic approach, we ran a mixed effects model for each age (Netherlands: 12-70 years, Australia: 16-73 years), with observations at the focus age weighted as 1, and decaying weights for ages further away. We call this approach a 'rolling weights' model. In The Netherlands, the estimated effect of the PGS was relatively similar from age 12 to age 41, and decreased from age 41-70. In Australia, there was a peak in the effect of the PGS around age 40 years. These results are a first indication from a molecular genetics perspective that genetic influences on aggressive behavior that are expressed in childhood continue to play a role later in life.
Collapse
Affiliation(s)
- Camiel M van der Laan
- Biological Psychology, Vrije Universiteit, Van der Boechorststraat 7, 1081 BT, Amsterdam, The Netherlands.
- The Netherlands Institute for the Study of Crime and Law Enforcement, Amsterdam, The Netherlands.
| | | | - Steve G A van de Weijer
- The Netherlands Institute for the Study of Crime and Law Enforcement, Amsterdam, The Netherlands
| | | | | | | | - Kerrie McAloney
- QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
| | - Richard Parker
- QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
| | - Jane M Burns
- Faculty of Health Sciences, The University of Sydney, Sydney, Australia
| | - Ian B Hickie
- Brain and Mind Centre, University of Sydney, Camperdown, Australia
| | - René Pool
- Biological Psychology, Vrije Universiteit, Van der Boechorststraat 7, 1081 BT, Amsterdam, The Netherlands
| | - Jouke-Jan Hottenga
- Biological Psychology, Vrije Universiteit, Van der Boechorststraat 7, 1081 BT, Amsterdam, The Netherlands
| | | | - Sarah E Medland
- QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
| | - Michel G Nivard
- Biological Psychology, Vrije Universiteit, Van der Boechorststraat 7, 1081 BT, Amsterdam, The Netherlands
| | - Dorret I Boomsma
- Biological Psychology, Vrije Universiteit, Van der Boechorststraat 7, 1081 BT, Amsterdam, The Netherlands
| |
Collapse
|
47
|
Abstract
The reference human genome sequence is inarguably the most important and widely used resource in the fields of human genetics and genomics. It has transformed the conduct of biomedical sciences and brought invaluable benefits to the understanding and improvement of human health. However, the commonly used reference sequence has profound limitations, because across much of its span, it represents the sequence of just one human haplotype. This single, monoploid reference structure presents a critical barrier to representing the broad genomic diversity in the human population. In this review, we discuss the modernization of the reference human genome sequence to a more complete reference of human genomic diversity, known as a human pangenome.
Collapse
Affiliation(s)
- Karen H Miga
- UC Santa Cruz Genomics Institute and Department of Biomedical Engineering, University of California, Santa Cruz, California 95064, USA;
| | - Ting Wang
- Department of Genetics, Edison Family Center for Genome Sciences and Systems Biology, and McDonnell Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63110, USA;
| |
Collapse
|
48
|
Levy AM, Paschou P, Tümer Z. Candidate Genes and Pathways Associated with Gilles de la Tourette Syndrome-Where Are We? Genes (Basel) 2021; 12:1321. [PMID: 34573303 PMCID: PMC8468358 DOI: 10.3390/genes12091321] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Revised: 08/05/2021] [Accepted: 08/25/2021] [Indexed: 12/21/2022] Open
Abstract
Gilles de la Tourette syndrome (GTS) is a childhood-onset neurodevelopmental and -psychiatric tic-disorder of complex etiology which is often comorbid with obsessive-compulsive disorder (OCD) and/or attention deficit hyperactivity disorder (ADHD). Twin and family studies of GTS individuals have shown a high level of heritability suggesting, that genetic risk factors play an important role in disease etiology. However, the identification of major GTS susceptibility genes has been challenging, presumably due to the complex interplay between several genetic factors and environmental influences, low penetrance of each individual factor, genetic diversity in populations, and the presence of comorbid disorders. To understand the genetic components of GTS etiopathology, we conducted an extensive review of the literature, compiling the candidate susceptibility genes identified through various genetic approaches. Even though several strong candidate genes have hitherto been identified, none of these have turned out to be major susceptibility genes yet.
Collapse
Affiliation(s)
- Amanda M. Levy
- Kennedy Center, Department of Clinical Genetics, Copenhagen University Hospital, Rigshospitalet, 2600 Glostrup, Denmark;
| | - Peristera Paschou
- Department of Molecular Biology and Genetics, Democritus University of Thrace, 68100 Alexandroupolis, Greece;
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
| | - Zeynep Tümer
- Kennedy Center, Department of Clinical Genetics, Copenhagen University Hospital, Rigshospitalet, 2600 Glostrup, Denmark;
- Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen, Denmark
| |
Collapse
|
49
|
Widen E, Raben TG, Lello L, Hsu SDH. Machine Learning Prediction of Biomarkers from SNPs and of Disease Risk from Biomarkers in the UK Biobank. Genes (Basel) 2021; 12:991. [PMID: 34209487 PMCID: PMC8308062 DOI: 10.3390/genes12070991] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 06/22/2021] [Accepted: 06/23/2021] [Indexed: 12/29/2022] Open
Abstract
We use UK Biobank data to train predictors for 65 blood and urine markers such as HDL, LDL, lipoprotein A, glycated haemoglobin, etc. from SNP genotype. For example, our Polygenic Score (PGS) predictor correlates ∼0.76 with lipoprotein A level, which is highly heritable and an independent risk factor for heart disease. This may be the most accurate genomic prediction of a quantitative trait that has yet been produced (specifically, for European ancestry groups). We also train predictors of common disease risk using blood and urine biomarkers alone (no DNA information); we call these predictors biomarker risk scores, BMRS. Individuals who are at high risk (e.g., odds ratio of >5× population average) can be identified for conditions such as coronary artery disease (AUC∼0.75), diabetes (AUC∼0.95), hypertension, liver and kidney problems, and cancer using biomarkers alone. Our atherosclerotic cardiovascular disease (ASCVD) predictor uses ∼10 biomarkers and performs in UKB evaluation as well as or better than the American College of Cardiology ASCVD Risk Estimator, which uses quite different inputs (age, diagnostic history, BMI, smoking status, statin usage, etc.). We compare polygenic risk scores (risk conditional on genotype: PRS) for common diseases to the risk predictors which result from the concatenation of learned functions BMRS and PGS, i.e., applying the BMRS predictors to the PGS output.
Collapse
Affiliation(s)
- Erik Widen
- Department of Physics and Astronomy, Michigan State University, 567 Wilson Rd, East Lansing, MI 48824, USA; (T.G.R.); (S.D.H.H.)
| | - Timothy G. Raben
- Department of Physics and Astronomy, Michigan State University, 567 Wilson Rd, East Lansing, MI 48824, USA; (T.G.R.); (S.D.H.H.)
| | - Louis Lello
- Department of Physics and Astronomy, Michigan State University, 567 Wilson Rd, East Lansing, MI 48824, USA; (T.G.R.); (S.D.H.H.)
- Genomic Prediction, Inc., 675 US Highway One, North Brunswick, NJ 08902, USA
| | - Stephen D. H. Hsu
- Department of Physics and Astronomy, Michigan State University, 567 Wilson Rd, East Lansing, MI 48824, USA; (T.G.R.); (S.D.H.H.)
- Genomic Prediction, Inc., 675 US Highway One, North Brunswick, NJ 08902, USA
| |
Collapse
|
50
|
Kim HR, Jung SH, Kim J, Jang H, Kang SH, Hwangbo S, Kim JP, Kim SY, Kim B, Kim S, Jeong JH, Yoon SJ, Park KW, Kim EJ, Yoon B, Jang JW, Hong JY, Choi SH, Noh Y, Kim KW, Kim SE, Lee JS, Jung NY, Lee J, Kim BC, Son SJ, Hong CH, Na DL, Seo SW, Won HH, Kim HJ. Identifying novel genetic variants for brain amyloid deposition: a genome-wide association study in the Korean population. Alzheimers Res Ther 2021; 13:117. [PMID: 34154648 PMCID: PMC8215820 DOI: 10.1186/s13195-021-00854-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2021] [Accepted: 06/02/2021] [Indexed: 12/12/2022]
Abstract
BACKGROUND Genome-wide association studies (GWAS) have identified a number of genetic variants for Alzheimer's disease (AD). However, most GWAS were conducted in individuals of European ancestry, and non-European populations are still underrepresented in genetic discovery efforts. Here, we performed GWAS to identify single nucleotide polymorphisms (SNPs) associated with amyloid β (Aβ) positivity using a large sample of Korean population. METHODS One thousand four hundred seventy-four participants of Korean ancestry were recruited from multicenters in South Korea. Discovery dataset consisted of 1190 participants (383 with cognitively unimpaired [CU], 330 with amnestic mild cognitive impairment [aMCI], and 477 with AD dementia [ADD]) and replication dataset consisted of 284 participants (46 with CU, 167 with aMCI, and 71 with ADD). GWAS was conducted to identify SNPs associated with Aβ positivity (measured by amyloid positron emission tomography). Aβ prediction models were developed using the identified SNPs. Furthermore, bioinformatics analysis was conducted for the identified SNPs. RESULTS In addition to APOE, we identified nine SNPs on chromosome 7, which were associated with a decreased risk of Aβ positivity at a genome-wide suggestive level. Of these nine SNPs, four novel SNPs (rs73375428, rs2903923, rs3828947, and rs11983537) were associated with a decreased risk of Aβ positivity (p < 0.05) in the replication dataset. In a meta-analysis, two SNPs (rs7337542 and rs2903923) reached a genome-wide significant level (p < 5.0 × 10-8). Prediction performance for Aβ positivity increased when rs73375428 were incorporated (area under curve = 0.75; 95% CI = 0.74-0.76) in addition to clinical factors and APOE genotype. Cis-eQTL analysis demonstrated that the rs73375428 was associated with decreased expression levels of FGL2 in the brain. CONCLUSION The novel genetic variants associated with FGL2 decreased risk of Aβ positivity in the Korean population. This finding may provide a candidate therapeutic target for AD, highlighting the importance of genetic studies in diverse populations.
Collapse
Affiliation(s)
- Hang-Rai Kim
- Department of Neurology, Dongguk University Ilsan Hospital, Dongguk University College of Medicine, Goyang, Republic of Korea
- Department of Neurology, Samsung Medical Center, Sungkyunkwan University School of Medicine, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea
- Alzheimer's Disease Convergence Research Center, Samsung Medical Center, Seoul, Republic of Korea
| | - Sang-Hyuk Jung
- Department of Digital Health, SAIHST, Sungkyunkwan University, Samsung Medical Center, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, USA
| | - Jaeho Kim
- Department of Neurology, Dongtan Sacred Heart Hospital, Hallym University College of Medicine, Hwaseong, Republic of Korea
| | - Hyemin Jang
- Department of Neurology, Samsung Medical Center, Sungkyunkwan University School of Medicine, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea
- Alzheimer's Disease Convergence Research Center, Samsung Medical Center, Seoul, Republic of Korea
| | - Sung Hoon Kang
- Department of Neurology, Samsung Medical Center, Sungkyunkwan University School of Medicine, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea
- Alzheimer's Disease Convergence Research Center, Samsung Medical Center, Seoul, Republic of Korea
- Department of Neurology, Korea University Guro Hospital, Korea University College of Medicine, Seoul, Korea
| | - Song Hwangbo
- Department of Neurology, Samsung Medical Center, Sungkyunkwan University School of Medicine, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea
- Alzheimer's Disease Convergence Research Center, Samsung Medical Center, Seoul, Republic of Korea
| | - Jun Pyo Kim
- Center for Neuroimaging, Radiology and Imaging Sciences, Indiana University School of Medicine, Indianapolis, IN, USA
| | - So Yeon Kim
- Department of Digital Health, SAIHST, Sungkyunkwan University, Samsung Medical Center, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea
- Samsung Genome Institute, Samsung Medical Center, Seoul, Republic of Korea
| | - Beomsu Kim
- Department of Digital Health, SAIHST, Sungkyunkwan University, Samsung Medical Center, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea
| | - Soyeon Kim
- Department of Digital Health, SAIHST, Sungkyunkwan University, Samsung Medical Center, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea
| | - Jee Hyang Jeong
- Department of Neurology, Ewha Womans University Seoul Hospital, Ewha Womans University School of Medicine, Seoul, Republic of Korea
| | - Soo Jin Yoon
- Department of Neurology, Eulji University Hospital, Eulji University School of Medicine, Daejeon, Republic of Korea
| | - Kyung Won Park
- Department of Neurology, Dong-A University College of Medicine, Department of Translational Biomedical Sciences, Graduate School of Dong-A University, Busan, Republic of Korea
| | - Eun-Joo Kim
- Department of Neurology, Pusan National University Hospital, Pusan National University School of Medicine and Medical Research Institute, Busan, Republic of Korea
| | - Bora Yoon
- Department of Neurology, Konyang University College of Medicine, Daejeon, Republic of Korea
| | - Jae-Won Jang
- Department of Neurology, Kangwon National University Hospital, Kangwon National University College of Medicine, Chuncheon, Republic of Korea
| | - Jin Yong Hong
- Department of Neurology, Yonsei University Wonju College of Medicine, Wonju, Republic of Korea
| | - Seong Hye Choi
- Department of Neurology, Inha University School of Medicine, Incheon, Republic of Korea
| | - Young Noh
- Department of Neurology, Gachon University College of Medicine, Gil Medical Center, Incheon, Republic of Korea
| | - Ko Woon Kim
- Department of Neurology, School of Medicine, Jeonbuk National University Hospital, Jeonju, Republic of Korea
| | - Si Eun Kim
- Department of Neurology, Inje University College of Medicine, Haeundae Paik Hospital, Busan, Republic of Korea
| | - Jin San Lee
- Department of Neurology, Kyung Hee University College of Medicine, Kyung Hee University Hospital, Seoul, Republic of Korea
| | - Na-Yeon Jung
- Department of Neurology, Pusan National University Yangsan Hospital, Pusan National University School of Medicine and Medical Research Institute, Busan, Republic of Korea
| | - Juyoun Lee
- Department of Neurology, Chungnam National University Hospital, Daejeon, Republic of Korea
| | - Byeong C Kim
- Departmet of Neurology, Chonnam National University School of Medicine, Gwangju, Republic of Korea
| | - Sang Joon Son
- Department of Psychiatry, Ajou University School of Medicine, Suwon, Republic of Korea
| | - Chang Hyung Hong
- Department of Psychiatry, Ajou University School of Medicine, Suwon, Republic of Korea
| | - Duk L Na
- Department of Neurology, Samsung Medical Center, Sungkyunkwan University School of Medicine, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea
- Alzheimer's Disease Convergence Research Center, Samsung Medical Center, Seoul, Republic of Korea
- Department of Health Sciences and Technology, SAIHST, Sungkyunkwan University, Seoul, Republic of Korea
| | - Sang Won Seo
- Department of Neurology, Samsung Medical Center, Sungkyunkwan University School of Medicine, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea
- Alzheimer's Disease Convergence Research Center, Samsung Medical Center, Seoul, Republic of Korea
- Department of Health Sciences and Technology, SAIHST, Sungkyunkwan University, Seoul, Republic of Korea
- Department of Intelligent Precision Healthcare Convergence, Sungkyunkwan University, Seoul, Republic of Korea
| | - Hong-Hee Won
- Department of Digital Health, SAIHST, Sungkyunkwan University, Samsung Medical Center, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea.
- Department of Health Sciences and Technology, SAIHST, Sungkyunkwan University, Seoul, Republic of Korea.
| | - Hee Jin Kim
- Department of Neurology, Samsung Medical Center, Sungkyunkwan University School of Medicine, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea.
- Alzheimer's Disease Convergence Research Center, Samsung Medical Center, Seoul, Republic of Korea.
- Department of Digital Health, SAIHST, Sungkyunkwan University, Samsung Medical Center, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea.
- Department of Health Sciences and Technology, SAIHST, Sungkyunkwan University, Seoul, Republic of Korea.
| |
Collapse
|