1
|
Naserkheil M, Mehrban H, Lee D, Park MN. Genome-wide Association Study for Carcass Primal Cut Yields Using Single-step Bayesian Approach in Hanwoo Cattle. Front Genet 2021; 12:752424. [PMID: 34899840 PMCID: PMC8662546 DOI: 10.3389/fgene.2021.752424] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Accepted: 11/02/2021] [Indexed: 12/30/2022] Open
Abstract
The importance of meat and carcass quality is growing in beef cattle production to meet both producer and consumer demands. Primal cut yields, which reflect the body compositions of carcass, could determine the carcass grade and, consequently, command premium prices. Despite its importance, there have been few genome-wide association studies on these traits. This study aimed to identify genomic regions and putative candidate genes related to 10 primal cut traits, including tenderloin, sirloin, striploin, chuck, brisket, top round, bottom round, shank, flank, and rib in Hanwoo cattle using a single-step Bayesian regression (ssBR) approach. After genomic data quality control, 43,987 SNPs from 3,745 genotyped animals were available, of which 3,467 had phenotypic records for the analyzed traits. A total of 16 significant genomic regions (1-Mb window) were identified, of which five large-effect quantitative trait loci (QTLs) located on chromosomes 6 at 38–39 Mb, 11 at 21–22 Mb, 14 at 6–7 Mb and 26–27 Mb, and 19 at 26–27 Mb were associated with more than one trait, while the remaining 11 QTLs were trait-specific. These significant regions were harbored by 154 genes, among which TOX, FAM184B, SPP1, IBSP, PKD2, SDCBP, PIGY, LCORL, NCAPG, and ABCG2 were noteworthy. Enrichment analysis revealed biological processes and functional terms involved in growth and lipid metabolism, such as growth (GO:0040007), muscle structure development (GO:0061061), skeletal system development (GO:0001501), animal organ development (GO:0048513), lipid metabolic process (GO:0006629), response to lipid (GO:0033993), metabolic pathways (bta01100), focal adhesion (bta04510), ECM–receptor interaction (bta04512), fat digestion and absorption (bta04975), and Rap1 signaling pathway (bta04015) being the most significant for the carcass primal cut traits. Thus, identification of quantitative trait loci regions and plausible candidate genes will aid in a better understanding of the genetic and biological mechanisms regulating carcass primal cut yields.
Collapse
Affiliation(s)
- Masoumeh Naserkheil
- Animal Breeding and Genetics Division, National Institute of Animal Science, Cheonan-si, South Korea
| | - Hossein Mehrban
- Department of Animal Science, Shahrekord University, Shahrekord, Iran
| | - Deukmin Lee
- Department of Animal Life and Environment Sciences, Hankyong National University, Anseong-si, South Korea
| | - Mi Na Park
- Animal Breeding and Genetics Division, National Institute of Animal Science, Cheonan-si, South Korea
| |
Collapse
|
2
|
Kulwal PL. Trait Mapping Approaches Through Linkage Mapping in Plants. PLANT GENETICS AND MOLECULAR BIOLOGY 2018; 164:53-82. [DOI: 10.1007/10_2017_49] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]
|
3
|
Howard R, Carriquiry AL, Beavis WD. Parametric and nonparametric statistical methods for genomic selection of traits with additive and epistatic genetic architectures. G3 (BETHESDA, MD.) 2014; 4:1027-46. [PMID: 24727289 PMCID: PMC4065247 DOI: 10.1534/g3.114.010298] [Citation(s) in RCA: 96] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/08/2014] [Accepted: 03/18/2014] [Indexed: 01/12/2023]
Abstract
Parametric and nonparametric methods have been developed for purposes of predicting phenotypes. These methods are based on retrospective analyses of empirical data consisting of genotypic and phenotypic scores. Recent reports have indicated that parametric methods are unable to predict phenotypes of traits with known epistatic genetic architectures. Herein, we review parametric methods including least squares regression, ridge regression, Bayesian ridge regression, least absolute shrinkage and selection operator (LASSO), Bayesian LASSO, best linear unbiased prediction (BLUP), Bayes A, Bayes B, Bayes C, and Bayes Cπ. We also review nonparametric methods including Nadaraya-Watson estimator, reproducing kernel Hilbert space, support vector machine regression, and neural networks. We assess the relative merits of these 14 methods in terms of accuracy and mean squared error (MSE) using simulated genetic architectures consisting of completely additive or two-way epistatic interactions in an F2 population derived from crosses of inbred lines. Each simulated genetic architecture explained either 30% or 70% of the phenotypic variability. The greatest impact on estimates of accuracy and MSE was due to genetic architecture. Parametric methods were unable to predict phenotypic values when the underlying genetic architecture was based entirely on epistasis. Parametric methods were slightly better than nonparametric methods for additive genetic architectures. Distinctions among parametric methods for additive genetic architectures were incremental. Heritability, i.e., proportion of phenotypic variability, had the second greatest impact on estimates of accuracy and MSE.
Collapse
Affiliation(s)
- Réka Howard
- Department of Statistics, Iowa State University, Ames, Iowa 50011 Department of Agronomy, Iowa State University, Ames, Iowa 50011
| | | | | |
Collapse
|
4
|
Genome-wide association study of temperament and tenderness using different Bayesian approaches in a Nellore–Angus crossbred population. Livest Sci 2014. [DOI: 10.1016/j.livsci.2013.12.012] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
5
|
RWGAIM: an efficient high-dimensional random whole genome average (QTL) interval mapping approach. Genet Res (Camb) 2013; 94:291-306. [PMID: 23374240 DOI: 10.1017/s0016672312000493] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
Mapping of quantitative trait loci (QTLs) underlying variation in quantitative traits continues to be a powerful tool in genetic study of plants and other organisms. Whole genome average interval mapping (WGAIM), a mixed model QTL mapping approach using all intervals or markers simultaneously, has been demonstrated to outperform composite interval mapping, a common approach for QTL analysis. However, the advent of high-throughput high-dimensional marker platforms provides a challenge. To overcome this, a dimension reduction technique is proposed for WGAIM for efficient analysis of a large number of markers. This approach results in reduced computing time as it is dependent on the number of genetic lines (or individuals) rather than the number of intervals (or markers). The approach allows for the full set of potential QTL effects to be recovered. A proposed random effects version of WGAIM aims to reduce bias in the estimated size of QTL effects. Lastly, the two-stage outlier procedure used in WGAIM is replaced by a single stage approach to reduce possible bias in the selection of putative QTL in both WGAIM and the random effects version. Simulation is used to demonstrate the efficiency of the dimension reduction approach as well as demonstrate that while the approaches are very similar, the random WGAIM performs better than the original and modified fixed WGAIM by reducing bias and in terms of mean square error of prediction of estimated QTL effects. Finally, an analysis of a doubled haploid population is used to illustrate the three approaches.
Collapse
|
6
|
van den Berg I, Fritz S, Boichard D. QTL fine mapping with Bayes C(π): a simulation study. Genet Sel Evol 2013; 45:19. [PMID: 23782975 PMCID: PMC3700753 DOI: 10.1186/1297-9686-45-19] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2012] [Accepted: 06/07/2013] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND Accurate QTL mapping is a prerequisite in the search for causative mutations. Bayesian genomic selection models that analyse many markers simultaneously should provide more accurate QTL detection results than single-marker models. Our objectives were to (a) evaluate by simulation the influence of heritability, number of QTL and number of records on the accuracy of QTL mapping with Bayes Cπ and Bayes C; (b) estimate the QTL status (homozygous vs. heterozygous) of the individuals analysed. This study focussed on the ten largest detected QTL, assuming they are candidates for further characterization. METHODS Our simulations were based on a true dairy cattle population genotyped for 38,277 phased markers. Some of these markers were considered biallelic QTL and used to generate corresponding phenotypes. Different numbers of records (4387 and 1500), heritability values (0.1, 0.4 and 0.7) and numbers of QTL (10, 100 and 1000) were studied. QTL detection was based on the posterior inclusion probability for individual markers, or on the sum of the posterior inclusion probabilities for consecutive markers, estimated using Bayes C or Bayes Cπ. The QTL status of the individuals was derived from the contrast between the sums of the SNP allelic effects of their chromosomal segments. RESULTS The proportion of markers with null effect (π) frequently did not reach convergence, leading to poor results for Bayes Cπ in QTL detection. Fixing π led to better results. Detection of the largest QTL was most accurate for medium to high heritability, for low to moderate numbers of QTL, and with a large number of records. The QTL status was accurately inferred when the distribution of the contrast between chromosomal segment effects was bimodal. CONCLUSIONS QTL detection is feasible with Bayes C. For QTL detection, it is recommended to use a large dataset and to focus on highly heritable traits and on the largest QTL. QTL statuses were inferred based on the distribution of the contrast between chromosomal segment effects.
Collapse
Affiliation(s)
- Irene van den Berg
- INRA, UMR1313 Génétique animale et biologie intégrative, Domaine de Vilvert, 78350 Jouy-en-Josas, France.
| | | | | |
Collapse
|
7
|
Bhadra A, Mallick BK. Joint High‐Dimensional Bayesian Variable and Covariance Selection with an Application to eQTL Analysis. Biometrics 2013; 69:447-57. [DOI: 10.1111/biom.12021] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2011] [Revised: 10/01/2012] [Accepted: 12/01/2012] [Indexed: 01/29/2023]
Affiliation(s)
- Anindya Bhadra
- Department of StatisticsPurdue University, West Lafayette Indiana 47907‐2066, U.S.A
| | - Bani K. Mallick
- Department of StatisticsTexas A&M University, College Station Texas 77843‐3143, U.S.A
| |
Collapse
|
8
|
QTL Mapping for Meat Color Traits Using the F 2 Intercross between the Oh-Shamo (Japanese Large Game) and White Leghorn Chickens. J Poult Sci 2013. [DOI: 10.2141/jpsa.0120189] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
|
9
|
Scott-Boyer MP, Imholte GC, Tayeb A, Labbe A, Deschepper CF, Gottardo R. An integrated hierarchical Bayesian model for multivariate eQTL mapping. Stat Appl Genet Mol Biol 2012; 11:/j/sagmb.2012.11.issue-4/1544-6115.1760/1544-6115.1760.xml. [PMID: 22850063 PMCID: PMC4627701 DOI: 10.1515/1544-6115.1760] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Recently, expression quantitative loci (eQTL) mapping studies, where expression levels of thousands of genes are viewed as quantitative traits, have been used to provide greater insight into the biology of gene regulation. Originally, eQTLs were detected by applying standard QTL detection tools (using a "one gene at-a-time" approach), but this method ignores many possible interactions between genes. Several other methods have proposed to overcome these limitations, but each of them has some specific disadvantages. In this paper, we present an integrated hierarchical Bayesian model that jointly models all genes and SNPs to detect eQTLs. We propose a model (named iBMQ) that is specifically designed to handle a large number G of gene expressions, a large number S of regressors (genetic markers) and a small number n of individuals in what we call a ``large G, large S, small n'' paradigm. This method incorporates genotypic and gene expression data into a single model while 1) specifically coping with the high dimensionality of eQTL data (large number of genes), 2) borrowing strength from all gene expression data for the mapping procedures, and 3) controlling the number of false positives to a desirable level. To validate our model, we have performed simulation studies and showed that it outperforms other popular methods for eQTL detection, including QTLBIM, R-QTL, remMap and M-SPLS. Finally, we used our model to analyze a real expression dataset obtained in a panel of mice BXD Recombinant Inbred (RI) strains. Analysis of these data with iBMQ revealed the presence of multiple hotspots showing significant enrichment in genes belonging to one or more annotation categories.
Collapse
|
10
|
Abstract
In this chapter, we consider the problem of jointly analyzing multiple (correlated) complex traits in the context of identifying quantitative trait loci (QTL). The advantages of joint analysis (as opposed independent analysis) is the detection of pleiotropy and improved precision of estimates. The multivariate model is introduced along with a brief description of the setup. The model is evaluated in a Bayesian framework to perform model selection (strategy to identify QTL for each trait). A detailed vignette of a statistical software (R/qtlbim) which uses a Markov Chain Monte Carlo (MCMC) approach to draw samples from the posterior distribution is presented. Strategies of checking MCMC convergence, visualization of posterior samples, model building, and testing pleiotropy with the software are described. Relevant code to perform the analysis on an example (simulated) dataset is also provided.
Collapse
|
11
|
Abstract
This chapter covers the procedure of mapping quantitative trait loci (QTLs) in an F(2) breeding design. I describe genetic design, general methods and software, and several commonly used approaches. The genetic design section includes F(2) population construction. Widely used methods and software are introduced in the section of general methods and software. Finally, composite interval mapping, penalized maximum likelihood, and empirical Bayes are described in detail. Some issues related to the F(2) design are discussed.
Collapse
|
12
|
Bottolo L, Petretto E, Blankenberg S, Cambien F, Cook SA, Tiret L, Richardson S. Bayesian detection of expression quantitative trait loci hot spots. Genetics 2011; 189:1449-59. [PMID: 21926303 PMCID: PMC3241411 DOI: 10.1534/genetics.111.131425] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2011] [Accepted: 08/23/2011] [Indexed: 12/21/2022] Open
Abstract
High-throughput genomics allows genome-wide quantification of gene expression levels in tissues and cell types and, when combined with sequence variation data, permits the identification of genetic control points of expression (expression QTL or eQTL). Clusters of eQTL influenced by single genetic polymorphisms can inform on hotspots of regulation of pathways and networks, although very few hotspots have been robustly detected, replicated, or experimentally verified. Here we present a novel modeling strategy to estimate the propensity of a genetic marker to influence several expression traits at the same time, based on a hierarchical formulation of related regressions. We implement this hierarchical regression model in a Bayesian framework using a stochastic search algorithm, HESS, that efficiently probes sparse subsets of genetic markers in a high-dimensional data matrix to identify hotspots and to pinpoint the individual genetic effects (eQTL). Simulating complex regulatory scenarios, we demonstrate that our method outperforms current state-of-the-art approaches, in particular when the number of transcripts is large. We also illustrate the applicability of HESS to diverse real-case data sets, in mouse and human genetic settings, and show that it provides new insights into regulatory hotspots that were not detected by conventional methods. The results suggest that the combination of our modeling strategy and algorithmic implementation provides significant advantages for the identification of functional eQTL hotspots, revealing key regulators underlying pathways.
Collapse
Affiliation(s)
- Leonardo Bottolo
- MRC Clinical Sciences Centre, Imperial College, London W12 0NN United Kingdom
- Department of Epidemiology and Biostatistics, Imperial College, London W2 1PG, United Kingdom
| | - Enrico Petretto
- MRC Clinical Sciences Centre, Imperial College, London W12 0NN United Kingdom
- Department of Epidemiology and Biostatistics, Imperial College, London W2 1PG, United Kingdom
| | | | - François Cambien
- INSERM UMRS 937, Pierre and Marie Curie University, 75013 Paris, France
| | - Stuart A. Cook
- MRC Clinical Sciences Centre, Imperial College, London W12 0NN United Kingdom
- National Heart and Lung Institute, Imperial College, London W2 1PG, United Kingdom
| | - Laurence Tiret
- INSERM UMRS 937, Pierre and Marie Curie University, 75013 Paris, France
| | - Sylvia Richardson
- Department of Epidemiology and Biostatistics, Imperial College, London W2 1PG, United Kingdom
- MRC–HPA Centre for Environment and Health, Imperial College, London-Harefield Hospital, Harefield, Middlesex UB9 6JH, United Kingdom
| |
Collapse
|
13
|
Abstract
Many common human diseases and complex traits are highly heritable and influenced by multiple genetic and environmental factors. Although genome-wide association studies (GWAS) have successfully identified many disease-associated variants, these genetic variants explain only a small proportion of the heritability of most complex diseases. Genetic interactions (gene-gene and gene-environment) substantially contribute to complex traits and diseases and could be one of the main sources of the missing heritability. This paper provides an overview of the available statistical methods and related computer software for identifying genetic interactions in animal and plant experimental crosses and human genetic association studies. The main discussion falls under the three broad issues in statistical analysis of genetic interactions: the definition, detection and interpretation of genetic interactions. Recently developed methods based on modern techniques for high-dimensional data are reviewed, including penalized likelihood approaches and hierarchical models; the relationships between these methods are also discussed. I conclude this review by highlighting some areas of future research.
Collapse
|
14
|
Guo B, Beavis WD. In silico genotyping of the maize nested association mapping population. MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2011; 27:107-113. [PMID: 21289856 PMCID: PMC3015163 DOI: 10.1007/s11032-010-9503-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2010] [Accepted: 09/04/2010] [Indexed: 05/30/2023]
Abstract
Nested Association Mapping (NAM) has been proposed as a means to combine the power of linkage mapping with the resolution of association mapping. It is enabled through sequencing or array genotyping of parental inbred lines while using low-cost, low-density genotyping technologies for their segregating progenies. For purposes of data analyses of NAM populations, parental genotypes at a large number of Single Nucleotide Polymorphic (SNP) loci need to be projected to their segregating progeny. Herein we demonstrate how approximately 0.5 million SNPs that have been genotyped in 26 parental lines of the publicly available maize NAM population can be projected onto their segregating progeny using only 1,106 SNP loci that have been genotyped in both the parents and their 5,000 progeny. The challenge is to estimate both the genotype and genetic location of the parental SNP genotypes in segregating progeny. Both challenges were met by estimating their expected genotypic values conditional on observed flanking markers through the use of both physical and linkage maps. About 90%, of 500,000 genotyped SNPs from the maize HapMap project, were assigned linkage map positions using linear interpolation between the maize Accessioned Gold Path (AGP) and NAM linkage maps. Of these, almost 70% provided high probability estimates of genotypes in almost 5,000 recombinant inbred lines.
Collapse
Affiliation(s)
- Baohong Guo
- Department of Agronomy, Iowa State University, 1208 Agronomy Hall, Ames, IA 50011 USA
- Present Address: Syngenta Seeds, Inc, Slater, IA 50244 USA
| | - William D. Beavis
- Department of Agronomy, Iowa State University, 1208 Agronomy Hall, Ames, IA 50011 USA
| |
Collapse
|
15
|
Abstract
Quantitative trait loci (QTLs) mapping often results in data on a number of traits that have well-established causal relationships. Many multi-trait QTL mapping methods that account for correlation among the multiple traits have been developed to improve the statistical power and the precision of QTL parameter estimation. However, none of these methods are capable of incorporating the causal structure among the traits. Consequently, genetic functions of the QTL may not be fully understood. In this paper, we developed a Bayesian multiple QTL mapping method for causally related traits using a mixture structural equation model (SEM), which allows researchers to decompose QTL effects into direct, indirect and total effects. Parameters are estimated based on their marginal posterior distribution. The posterior distributions of parameters are estimated using Markov Chain Monte Carlo methods such as the Gibbs sampler and the Metropolis-Hasting algorithm. The number of QTLs affecting traits is determined by the Bayes factor. The performance of the proposed method is evaluated by simulation study and applied to data from a wheat experiment. Compared with single trait Bayesian analysis, our proposed method not only improved the statistical power of QTL detection, accuracy and precision of parameter estimates but also provided important insight into how genes regulate traits directly and indirectly by fitting a more biologically sensible model.
Collapse
|
16
|
Abstract
Identification of functional markers (FMs) provides information about the genetic architecture underlying complex traits. An approach that combines the strengths of linkage and association mapping, referred to as nested association mapping (NAM), has been proposed to identify FMs in many plant species. The ability to identify and resolve FMs for complex traits depends upon a number of factors including frequency of FM alleles, magnitudes of their genetic effects, disequilibrium among functional and nonfunctional markers, statistical analysis methods, and mating design. The statistical characteristics of power, accuracy, and precision to identify FMs with a NAM population were investigated using three simulation studies. The simulated data sets utilized publicly available genetic sequences and simulated FMs were identified using least-squares variable selection methods. Results indicate that FMs with simple additive genetic effects that contribute at least 5% to the phenotypic variability in at least five segregating families of a NAM population consisting of recombinant inbred progeny derived from 28 matings with a single reference inbred will have adequate power to accurately and precisely identify FMs. This resolution and power are possible even for genetic architectures consisting of disequilibrium among multiple functional and nonfunctional markers in the same genomic region, although the resolution of FMs will deteriorate rapidly if more than two FMs are tightly linked within the same amplicon. Finally, nested mating designs involving several reference parents will have a greater likelihood of resolving FMs than single reference designs.
Collapse
|
17
|
E Silva LDC, Zeng ZB. Current Progress on Statistical Methods for Mapping Quantitative Trait Loci from Inbred Line Crosses. J Biopharm Stat 2010; 20:454-81. [DOI: 10.1080/10543400903572845] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Affiliation(s)
- Luciano Da Costa E Silva
- a Department of Statistics, Bioinformatics Research Center , North Carolina State University , Raleigh, North Carolina, USA
| | - Zhao-Bang Zeng
- a Department of Statistics, Bioinformatics Research Center , North Carolina State University , Raleigh, North Carolina, USA
- b Department of Genetics , North Carolina State University , Raleigh, North Carolina, USA
| |
Collapse
|
18
|
Ankra-Badu GA, Shriner D, Le Bihan-Duval E, Mignon-Grasteau S, Pitel F, Beaumont C, Duclos MJ, Simon J, Porter TE, Vignal A, Cogburn LA, Allison DB, Yi N, Aggrey SE. Mapping main, epistatic and sex-specific QTL for body composition in a chicken population divergently selected for low or high growth rate. BMC Genomics 2010; 11:107. [PMID: 20149241 PMCID: PMC2830984 DOI: 10.1186/1471-2164-11-107] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2009] [Accepted: 02/11/2010] [Indexed: 11/30/2022] Open
Abstract
Background Delineating the genetic basis of body composition is important to agriculture and medicine. In addition, the incorporation of gene-gene interactions in the statistical model provides further insight into the genetic factors that underlie body composition traits. We used Bayesian model selection to comprehensively map main, epistatic and sex-specific QTL in an F2 reciprocal intercross between two chicken lines divergently selected for high or low growth rate. Results We identified 17 QTL with main effects across 13 chromosomes and several sex-specific and sex-antagonistic QTL for breast meat yield, thigh + drumstick yield and abdominal fatness. Different sets of QTL were found for both breast muscles [Pectoralis (P) major and P. minor], which suggests that they could be controlled by different regulatory mechanisms. Significant interactions of QTL by sex allowed detection of sex-specific and sex-antagonistic QTL for body composition and abdominal fat. We found several female-specific P. major QTL and sex-antagonistic P. minor and abdominal fatness QTL. Also, several QTL on different chromosomes interact with each other to affect body composition and abdominal fatness. Conclusions The detection of main effects, epistasis and sex-dimorphic QTL suggest complex genetic regulation of somatic growth. An understanding of such regulatory mechanisms is key to mapping specific genes that underlie QTL controlling somatic growth in an avian model.
Collapse
Affiliation(s)
- Georgina A Ankra-Badu
- Department of Poultry Science/Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Li J, Reynolds RJ, Pomp D, Allison DB, Yi N. Mapping interacting QTL for count phenotypes using hierarchical Poisson and binomial models: an application to reproductive traits in mice. Genet Res (Camb) 2010; 92:13-23. [PMID: 20199696 PMCID: PMC2938180 DOI: 10.1017/s0016672310000029] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
We proposed hierarchical Poisson and binomial models for mapping multiple interacting quantitative trait loci (QTLs) for count traits in experimental crosses. We applied our methods to two counted reproductive traits, live fetuses (LF) and dead fetuses (DF) at 17 days gestation, in an F2 female mouse population. We treated observed number of corpora lutea (ovulation rate) as the baseline and the total trials in our Poisson and binomial models, respectively. We detected more than 10 QTLs for LF and DF, most having epistatic and pleiotropic effects. The epistatic effects were larger, involved more QTLs, and explained a larger proportion of phenotypic variance than the main effects. Our analyses revealed a complex network of multiple interacting QTLs for the reproductive traits, and increase our understanding of the genetic architecture of reproductive characters. The proposed statistical models and methods provide valuable tools for detecting multiple interacting QTLs for complex count phenotypes.
Collapse
Affiliation(s)
- Jun Li
- Department of Biostatistics, Section on Statistical Genetics, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| | - Richard J. Reynolds
- Department of Biostatistics, Section on Statistical Genetics, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| | - Daniel Pomp
- Departments of Genetics, Nutrition, Cell and Molecular Physiology, University of North Carolina, Chapel Hill, NC 27599, USA
| | - David B. Allison
- Department of Biostatistics, Section on Statistical Genetics, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
- Clinical Nutrition Research Center, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | - Nengjun Yi
- Department of Biostatistics, Section on Statistical Genetics, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
- Clinical Nutrition Research Center, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| |
Collapse
|
20
|
Logsdon BA, Hoffman GE, Mezey JG. A variational Bayes algorithm for fast and accurate multiple locus genome-wide association analysis. BMC Bioinformatics 2010; 11:58. [PMID: 20105321 PMCID: PMC2824680 DOI: 10.1186/1471-2105-11-58] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2009] [Accepted: 01/27/2010] [Indexed: 12/17/2022] Open
Abstract
Background The success achieved by genome-wide association (GWA) studies in the identification of candidate loci for complex diseases has been accompanied by an inability to explain the bulk of heritability. Here, we describe the algorithm V-Bay, a variational Bayes algorithm for multiple locus GWA analysis, which is designed to identify weaker associations that may contribute to this missing heritability. Results V-Bay provides a novel solution to the computational scaling constraints of most multiple locus methods and can complete a simultaneous analysis of a million genetic markers in a few hours, when using a desktop. Using a range of simulated genetic and GWA experimental scenarios, we demonstrate that V-Bay is highly accurate, and reliably identifies associations that are too weak to be discovered by single-marker testing approaches. V-Bay can also outperform a multiple locus analysis method based on the lasso, which has similar scaling properties for large numbers of genetic markers. For demonstration purposes, we also use V-Bay to confirm associations with gene expression in cell lines derived from the Phase II individuals of HapMap. Conclusions V-Bay is a versatile, fast, and accurate multiple locus GWA analysis tool for the practitioner interested in identifying weaker associations without high false positive rates.
Collapse
Affiliation(s)
- Benjamin A Logsdon
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY, USA
| | | | | |
Collapse
|
21
|
Abstract
In the past two decades, various statistical approaches have been developed to identify quantitative trait locus with experimental organisms. In this chapter, we introduce several commonly used QTL mapping methods for intercross and backcross populations. Important issues related to QTL mapping, such as threshold and confidence interval calculations are also discussed. We list and describe five public domain QTL software packages commonly used by biologists.
Collapse
|
22
|
Abstract
We describe a fast hierarchical Bayesian method for mapping quantitative trait loci by haplotype-based association, applicable when haplotypes are not observed directly but are inferred from multiple marker genotypes. The method avoids the use of a Monte Carlo Markov chain by employing priors for which the likelihood factorizes completely. It is parameterized by a single hyperparameter, the fraction of variance explained by the quantitative trait locus, compared to the frequentist fixed-effects model, which requires a parameter for the phenotypic effect of each combination of haplotypes; nevertheless it still provides estimates of haplotype effects. We use simulation to show that the method matches the power of the frequentist regression model and, when the haplotypes are inferred, exceeds it for small QTL effect sizes. The Bayesian estimates of the haplotype effects are more accurate than the frequentist estimates, for both known and inferred haplotypes, which indicates that this advantage is independent of the effect of uncertainty in haplotype inference and will hold in comparison with frequentist methods in general. We apply the method to data from a panel of recombinant inbred lines of Arabidopsis thaliana, descended from 19 inbred founders.
Collapse
|
23
|
Wei WH, Knott S, Haley CS, de Koning DJ. Controlling false positives in the mapping of epistatic QTL. Heredity (Edinb) 2009; 104:401-9. [PMID: 19789566 DOI: 10.1038/hdy.2009.129] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
This study addresses the poorly explored issue of the control of false positive rate (FPR) in the mapping of pair-wise epistatic quantitative trait loci (QTL). A nested test framework was developed to (1) allow pre-identified QTL to be used directly to detect epistasis in one-dimensional genome scans, (2) to detect novel epistatic QTL pairs in two-dimensional genome scans and (3) to derive genome-wide thresholds through permutation and handle multiple testing. We used large-scale simulations to evaluate the performance of both the one- and two-dimensional approaches in mapping different forms and levels of epistasis and to generate profiles of FPR, power and accuracy to inform epistasis mapping studies. We showed that the nested test framework and genome-wide thresholds were essential to control FPR at the 5% level. The one-dimensional approach was generally more powerful than the two-dimensional approach in detecting QTL-associated epistasis and identified nearly all epistatic pairs detected from the two-dimensional approach. However, only the two-dimensional approach could detect epistatic QTL with weak main effects. Combining the two approaches allowed effective mapping of different forms of epistasis, whereas using the nested test framework kept the FPR under control. This approach provides a good search engine for high-throughput epistasis analyses.
Collapse
Affiliation(s)
- W-H Wei
- Division of Genetics and Genomics, The Roslin Institute and R(D)SVS, University of Edinburgh, Roslin, Midlothian, Scotland EH25 9PS, UK
| | | | | | | |
Collapse
|
24
|
Huang BE, George AW. Look before you leap: a new approach to mapping QTL. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2009; 119:899-911. [PMID: 19585099 DOI: 10.1007/s00122-009-1098-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/21/2009] [Accepted: 06/21/2009] [Indexed: 05/28/2023]
Abstract
In this paper, we present an innovative and powerful approach for mapping quantitative trait loci (QTL) in experimental populations. This deviates from the traditional approach of (composite) interval mapping which uses a QTL profile to simultaneously determine the number and location of QTL. Instead, we look before we leap by employing separate detection and localization stages. In the detection stage, we use an iterative variable selection process coupled with permutation to identify the number and synteny of QTL. In the localization stage, we position the detected QTL through a series of one-dimensional interval mapping scans. Results from a detailed simulation study and real analysis of wheat data are presented. We achieve impressive increases in the power of QTL detection compared to composite interval mapping. We also accurately estimate the size and position of QTL. An R library, DLMap, implements the methods described here and is freely available from CRAN ( http://cran.r-project.org/ ).
Collapse
Affiliation(s)
- B Emma Huang
- CSIRO Mathematical and Information Sciences, Queensland Bioscience Precinct, Brisbane, QLD 4067, Australia
| | | |
Collapse
|
25
|
Shriner D. Mapping multiple quantitative trait loci under Bayes error control. Genet Res (Camb) 2009; 91:147-59. [PMID: 19589185 PMCID: PMC3205938 DOI: 10.1017/s001667230900010x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
In mapping of quantitative trait loci (QTLs), performing hypothesis tests of linkage to a phenotype of interest across an entire genome involves multiple comparisons. Furthermore, linkage among loci induces correlation among tests. Under many multiple comparison frameworks, these problems are exacerbated when mapping multiple QTLs. Traditionally, significance thresholds have been subjectively set to control the probability of detecting at least one false positive outcome, although such thresholds are known to result in excessively low power to detect true positive outcomes. Recently, false discovery rate (FDR)-controlling procedures have been developed that yield more power both by relaxing the stringency of the significance threshold and by retaining more power for a given significance threshold. However, these procedures have been shown to perform poorly for mapping QTLs, principally because they ignore recombination fractions between markers. Here, I describe a procedure that accounts for recombination fractions and extends FDR control to include simultaneous control of the false non-discovery rate, i.e. the overall error rate is controlled. This procedure is developed in the Bayesian framework using a direct posterior probability approach. Data-driven significance thresholds are determined by minimizing the expected loss. The procedure is equivalent to jointly maximizing positive and negative predictive values. In the context of mapping QTLs for experimental crosses, the procedure is applicable to mapping main effects, gene-gene interactions and gene-environment interactions.
Collapse
Affiliation(s)
- Daniel Shriner
- Center for Research on Genomics and Global Health, National Institutes of Health, Bethesda, MD 20892, USA.
| |
Collapse
|
26
|
Yi N, Banerjee S. Hierarchical generalized linear models for multiple quantitative trait locus mapping. Genetics 2009; 181:1101-13. [PMID: 19139143 PMCID: PMC2651046 DOI: 10.1534/genetics.108.099556] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2008] [Accepted: 01/06/2009] [Indexed: 11/18/2022] Open
Abstract
We develop hierarchical generalized linear models and computationally efficient algorithms for genomewide analysis of quantitative trait loci (QTL) for various types of phenotypes in experimental crosses. The proposed models can fit a large number of effects, including covariates, main effects of numerous loci, and gene-gene (epistasis) and gene-environment (G x E) interactions. The key to the approach is the use of continuous prior distribution on coefficients that favors sparseness in the fitted model and facilitates computation. We develop a fast expectation-maximization (EM) algorithm to fit models by estimating posterior modes of coefficients. We incorporate our algorithm into the iteratively weighted least squares for classical generalized linear models as implemented in the package R. We propose a model search strategy to build a parsimonious model. Our method takes advantage of the special correlation structure in QTL data. Simulation studies demonstrate reasonable power to detect true effects, while controlling the rate of false positives. We illustrate with three real data sets and compare our method to existing methods for multiple-QTL mapping. Our method has been implemented in our freely available package R/qtlbim (www.qtlbim.org), providing a valuable addition to our previous Markov chain Monte Carlo (MCMC) approach.
Collapse
Affiliation(s)
- Nengjun Yi
- Department of Biostatistics, Section on Statistical Genetics, University of Alabama, Birmingham, Alabama 35294-0022, USA.
| | | |
Collapse
|
27
|
Banerjee S, Yandell BS, Yi N. Bayesian quantitative trait loci mapping for multiple traits. Genetics 2008; 179:2275-89. [PMID: 18689903 PMCID: PMC2516097 DOI: 10.1534/genetics.108.088427] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2008] [Accepted: 06/15/2008] [Indexed: 11/18/2022] Open
Abstract
Most quantitative trait loci (QTL) mapping experiments typically collect phenotypic data on multiple correlated complex traits. However, there is a lack of a comprehensive genomewide mapping strategy for correlated traits in the literature. We develop Bayesian multiple-QTL mapping methods for correlated continuous traits using two multivariate models: one that assumes the same genetic model for all traits, the traditional multivariate model, and the other known as the seemingly unrelated regression (SUR) model that allows different genetic models for different traits. We develop computationally efficient Markov chain Monte Carlo (MCMC) algorithms for performing joint analysis. We conduct extensive simulation studies to assess the performance of the proposed methods and to compare with the conventional single-trait model. Our methods have been implemented in the freely available package R/qtlbim (http://www.qtlbim.org), which greatly facilitates the general usage of the Bayesian methodology for unraveling the genetic architecture of complex traits.
Collapse
Affiliation(s)
- Samprit Banerjee
- Departments of Biostatistics, Section on Statistical Genetics, University of Alabama, Birmingham, AL 35294, USA
| | | | | |
Collapse
|
28
|
Abstract
The mapping of quantitative trait loci (QTL) is to identify molecular markers or genomic loci that influence the variation of complex traits. The problem is complicated by the facts that QTL data usually contain a large number of markers across the entire genome and most of them have little or no effect on the phenotype. In this article, we propose several Bayesian hierarchical models for mapping multiple QTL that simultaneously fit and estimate all possible genetic effects associated with all markers. The proposed models use prior distributions for the genetic effects that are scale mixtures of normal distributions with mean zero and variances distributed to give each effect a high probability of being near zero. We consider two types of priors for the variances, exponential and scaled inverse-chi(2) distributions, which result in a Bayesian version of the popular least absolute shrinkage and selection operator (LASSO) model and the well-known Student's t model, respectively. Unlike most applications where fixed values are preset for hyperparameters in the priors, we treat all hyperparameters as unknowns and estimate them along with other parameters. Markov chain Monte Carlo (MCMC) algorithms are developed to simulate the parameters from the posteriors. The methods are illustrated using well-known barley data.
Collapse
|