1
|
Xiao J, Zhou Y, He S, Ren WL. An Efficient Score Test Integrated with Empirical Bayes for Genome-Wide Association Studies. Front Genet 2021; 12:742752. [PMID: 34659362 PMCID: PMC8517403 DOI: 10.3389/fgene.2021.742752] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 09/13/2021] [Indexed: 11/30/2022] Open
Abstract
Many methods used in multi-locus genome-wide association studies (GWAS) have been developed to improve statistical power. However, most existing multi-locus methods are not quicker than single-locus methods. To address this concern, we proposed a fast score test integrated with Empirical Bayes (ScoreEB) for multi-locus GWAS. Firstly, a score test was conducted for each single nucleotide polymorphism (SNP) under a linear mixed model (LMM) framework, taking into account the genetic relatedness and population structure. Then, all of the potentially associated SNPs were selected with a less stringent criterion. Finally, Empirical Bayes in a multi-locus model was performed for all of the selected SNPs to identify the true quantitative trait nucleotide (QTN). Our new method ScoreEB adopts the similar strategy of multi-locus random-SNP-effect mixed linear model (mrMLM) and fast multi-locus random-SNP-effect EMMA (FASTmrEMMA), and the only difference is that we use the score test to select all the potentially associated markers. Monte Carlo simulation studies demonstrate that ScoreEB significantly improved the computational efficiency compared with the popular methods mrMLM, FASTmrEMMA, iterative modified-sure independence screening EM-Bayesian lasso (ISIS EM-BLASSO), hybrid of restricted and penalized maximum likelihood (HRePML) and genome-wide efficient mixed model association (GEMMA). In addition, ScoreEB remained accurate in QTN effect estimation and effectively controlled false positive rate. Subsequently, ScoreEB was applied to re-analyze quantitative traits in plants and animals. The results show that ScoreEB not only can detect previously reported genes, but also can mine new genes.
Collapse
Affiliation(s)
- Jing Xiao
- Department of Epidemiology and Medical Statistics, School of Public Health, Nantong University, Nantong, China
| | - Yang Zhou
- Department of Epidemiology and Medical Statistics, School of Public Health, Nantong University, Nantong, China
| | - Shu He
- Department of Epidemiology and Medical Statistics, School of Public Health, Nantong University, Nantong, China
| | - Wen-Long Ren
- Department of Epidemiology and Medical Statistics, School of Public Health, Nantong University, Nantong, China
| |
Collapse
|
2
|
Magwa RA, Zhao H, Xing Y. Genome-wide association mapping revealed a diverse genetic basis of seed dormancy across subpopulations in rice (Oryza sativa L.). BMC Genet 2016; 17:28. [PMID: 26810156 PMCID: PMC4727300 DOI: 10.1186/s12863-016-0340-2] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2015] [Accepted: 01/21/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Seed dormancy is an adaptive trait employed by flowering plants to avoid harsh environmental conditions for the continuity of their next generations. In cereal crops, moderate seed dormancy could help prevent pre-harvest sprouting and improve grain yield and quality. We performed a genome wide association study (GWAS) for dormancy, based on seed germination percentage (GP) in freshly harvested seeds (FHS) and after-ripened seeds (ARS) in 350 worldwide accessions that were characterized with strong population structure of indica, japonica and Aus subpopulations. RESULTS The germination tests revealed that Aus and indica rice had stronger seed dormancy than japonica rice in FHS. Association analysis revealed 16 loci significantly associated with GP in FHS and 38 in ARS. Three out of the 38 loci detected in ARS were also detected in FHS and 13 of the ARS loci were detected near previously mapped dormancy QTL. In FHS, three of the association loci were located within 100 kb around previously cloned GA/IAA inactivation genes such as GA2ox3, EUI1 and GH3-2 and one near dormancy gene, Sdr4. In ARS, an association signal was detected near ABA signaling gene ABI5. No association peaks were commonly detected among the sub-populations in FHS and only one association peak was detected in both indica and japonica populations in ARS. Sdr4 and GA2OX3 haplotype analysis showed that Aus and indica II (IndII) varieties had stronger dormancy alleles whereas indica I (IndI) and japonica had weak or non-dormancy alleles. CONCLUSION The association study and haplotype analysis together, indicate an involvement of independent genes and alleles contributing towards regulation and natural variation of seed dormancy among the rice sub-populations.
Collapse
Affiliation(s)
- Risper Auma Magwa
- National Key Laboratory of Crop Genetic Improvement and National Center of Plant, Gene Research (Wuhan), Huazhong Agricultural University, Wuhan, 430070, China.
| | - Hu Zhao
- National Key Laboratory of Crop Genetic Improvement and National Center of Plant, Gene Research (Wuhan), Huazhong Agricultural University, Wuhan, 430070, China.
| | - Yongzhong Xing
- National Key Laboratory of Crop Genetic Improvement and National Center of Plant, Gene Research (Wuhan), Huazhong Agricultural University, Wuhan, 430070, China. .,Hubei Collaborative Innovation Center for Grain Industry, Hubei, China.
| |
Collapse
|
3
|
Yashin AI, Arbeev KG, Arbeeva LS, Wu D, Akushevich I, Kovtun M, Yashkin A, Kulminski A, Culminskaya I, Stallard E, Li M, Ukraintseva SV. How the effects of aging and stresses of life are integrated in mortality rates: insights for genetic studies of human health and longevity. Biogerontology 2015; 17:89-107. [PMID: 26280653 DOI: 10.1007/s10522-015-9594-8] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2015] [Accepted: 07/25/2015] [Indexed: 12/21/2022]
Abstract
Increasing proportions of elderly individuals in developed countries combined with substantial increases in related medical expenditures make the improvement of the health of the elderly a high priority today. If the process of aging by individuals is a major cause of age related health declines then postponing aging could be an efficient strategy for improving the health of the elderly. Implementing this strategy requires a better understanding of genetic and non-genetic connections among aging, health, and longevity. We review progress and problems in research areas whose development may contribute to analyses of such connections. These include genetic studies of human aging and longevity, the heterogeneity of populations with respect to their susceptibility to disease and death, forces that shape age patterns of human mortality, secular trends in mortality decline, and integrative mortality modeling using longitudinal data. The dynamic involvement of genetic factors in (i) morbidity/mortality risks, (ii) responses to stresses of life, (iii) multi-morbidities of many elderly individuals, (iv) trade-offs for diseases, (v) genetic heterogeneity, and (vi) other relevant aging-related health declines, underscores the need for a comprehensive, integrated approach to analyze the genetic connections for all of the above aspects of aging-related changes. The dynamic relationships among aging, health, and longevity traits would be better understood if one linked several research fields within one conceptual framework that allowed for efficient analyses of available longitudinal data using the wealth of available knowledge about aging, health, and longevity already accumulated in the research field.
Collapse
Affiliation(s)
- Anatoliy I Yashin
- The Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, NC, USA. .,The Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, 2024 W. Main Street, Room A102E, Durham, NC, 27705, USA.
| | - Konstantin G Arbeev
- The Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, NC, USA
| | - Liubov S Arbeeva
- The Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, NC, USA
| | - Deqing Wu
- The Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, NC, USA
| | - Igor Akushevich
- The Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, NC, USA
| | - Mikhail Kovtun
- The Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, NC, USA
| | - Arseniy Yashkin
- The Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, NC, USA
| | - Alexander Kulminski
- The Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, NC, USA
| | - Irina Culminskaya
- The Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, NC, USA
| | - Eric Stallard
- The Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, NC, USA
| | - Miaozhu Li
- The Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, NC, USA
| | - Svetlana V Ukraintseva
- The Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, NC, USA.,The Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, 2024 W. Main Street, Room A105, Durham, NC, 27705, USA
| |
Collapse
|
5
|
Korte A, Farlow A. The advantages and limitations of trait analysis with GWAS: a review. PLANT METHODS 2013; 9:29. [PMID: 23876160 PMCID: PMC3750305 DOI: 10.1186/1746-4811-9-29] [Citation(s) in RCA: 864] [Impact Index Per Article: 72.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/12/2013] [Accepted: 06/13/2013] [Indexed: 05/17/2023]
Abstract
Over the last 10 years, high-density SNP arrays and DNA re-sequencing have illuminated the majority of the genotypic space for a number of organisms, including humans, maize, rice and Arabidopsis. For any researcher willing to define and score a phenotype across many individuals, Genome Wide Association Studies (GWAS) present a powerful tool to reconnect this trait back to its underlying genetics. In this review we discuss the biological and statistical considerations that underpin a successful analysis or otherwise. The relevance of biological factors including effect size, sample size, genetic heterogeneity, genomic confounding, linkage disequilibrium and spurious association, and statistical tools to account for these are presented. GWAS can offer a valuable first insight into trait architecture or candidate loci for subsequent validation.
Collapse
Affiliation(s)
- Arthur Korte
- Gregor Mendel Institute of Molecular Plant Biology, Vienna, Austria
| | - Ashley Farlow
- Gregor Mendel Institute of Molecular Plant Biology, Vienna, Austria
| |
Collapse
|