1
|
Haplotype-Based Genome-Wide Association Study and Identification of Candidate Genes Associated with Carcass Traits in Hanwoo Cattle. Genes (Basel) 2020; 11:genes11050551. [PMID: 32423003 PMCID: PMC7290854 DOI: 10.3390/genes11050551] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2020] [Revised: 04/30/2020] [Accepted: 05/05/2020] [Indexed: 12/20/2022] Open
Abstract
Hanwoo, is the most popular native beef cattle in South Korea. Due to its extensive popularity, research is ongoing to enhance its carcass quality and marbling traits. In this study we conducted a haplotype-based genome-wide association study (GWAS) by constructing haplotype blocks by three methods: number of single nucleotide polymorphisms (SNPs) in a haplotype block (nsnp), length of genomic region in kb (Len) and linkage disequilibrium (LD). Significant haplotype blocks and genes associated with them were identified for carcass traits such as BFT (back fat thickness), EMA (eye Muscle area), CWT (carcass weight) and MS (marbling score). Gene-set enrichment analysis and functional annotation of genes in the significantly-associated loci revealed candidate genes, including PLCB1 and PLCB4 present on BTA13, coding for phospholipases, which might be important candidates for increasing fat deposition due to their role in lipid metabolism and adipogenesis. CEL (carboxyl ester lipase), a bile-salt activated lipase, responsible for lipid catabolic process was also identified within the significantly-associated haplotype block on BTA11. The results were validated in a different Hanwoo population. The genes and pathways identified in this study may serve as good candidates for improving carcass traits in Hanwoo cattle.
Collapse
|
2
|
Eltahawy MS, Ali N, Zaid IU, Li D, Abdulmajid D, Bux L, Wang H, Hong D. Association analysis between constructed SNPLDBs and GCA effects of 9 quality-related traits in parents of hybrid rice (Oryza sativa L.). BMC Genomics 2020; 21:31. [PMID: 31918652 PMCID: PMC6953305 DOI: 10.1186/s12864-019-6428-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2019] [Accepted: 12/24/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The general combining ability (GCA) of parents in hybrid rice affects not only heterotic level of grain yield and other important agronomic traits, but also performance of grain quality traits of F2 bulk population which is the commodity consumed by humans. In order to make GCA improvement for quality traits in parents of hybrid rice by molecular marker assisted selection feasible, genome-wide GCA loci for quality traits in parents were detected through association analysis between the effects of GCA and constructed single nucleotide polymorphism linkage disequilibrium blocks (SNPLDBs), by using unhusked rice grains harvested from F1 plants of 48 crosses of Indica rice and 78 crosses of Japonica rice. GCA-SNPLDBs association analysis. RESULTS Among the 8 CMS and 6 restorer lines of indica rice subspecies, CMS lines Zhenpin A, Zhenshan97 A, and 257A, and restorers Kanghui98, Minghui63 and Yanhui559 were recognized as good general combiners based on their GCA effect values for the 9 quality traits (brown rice rate, milled rice rate, head rice rate, percentage of chalky grains, chalky area size, chalkiness degree, gelatinization temperature, gel consistency and amylose content). Among the 13 CMS and 6 restorer lines of japonica rice subspecies, CMS 863A, 6427A and Xu 2A, and restorers C418, Ninghui8hao and Yunhui4hao showed elite GCA effect values for the 9 traits. GCA-SNPLDB association analysis revealed 39 significant SNPLDB loci associated with the GCA of the 9 quality-related traits, and the numbers of SNPLDB loci located on chromosome 1, 2, 3, 4, 5, 8, 9, 11 and 12 were 1, 4, 3, 9, 6, 5, 5, 4 and 2, respectively. Number of superior GCA alleles for the 9 traits among the 33 parents ranged from 1 to 26. CONCLUSIONS Thirty-nine significant SNPLDBs loci were identified associated with the GCA of 9 quality-related traits, and the superior SNPLDB alleles could be used to improve the GCA of parents for the traits in the future by molecular marker assisted selection. The genetic basis of trait GCA in parents is different from that of trait itself.
Collapse
Affiliation(s)
- Moaz S Eltahawy
- Nanjing Agricultural University, Nanjing, 210095, China.,State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China.,Agronomy Department, Faculty of Agriculture, Zagazig University, Sharqia, 44519, Egypt
| | - Nour Ali
- Nanjing Agricultural University, Nanjing, 210095, China.,State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China.,Laboratory of Crop Genetics and Germplasm Enhancement, Field Crops Research Department, Agricultural Faculty, Damascus University, Damascus, Syria
| | - Imdad U Zaid
- Nanjing Agricultural University, Nanjing, 210095, China.,State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China
| | - Dalu Li
- Nanjing Agricultural University, Nanjing, 210095, China.,State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China
| | - Dina Abdulmajid
- Nanjing Agricultural University, Nanjing, 210095, China.,State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China.,Rice Research and Training Centre, Field Crops Research Institute, Agricultural Research Centre, Kafr El-Sheikh, 33717, Egypt
| | - Lal Bux
- Nanjing Agricultural University, Nanjing, 210095, China.,State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China
| | - Hui Wang
- Nanjing Agricultural University, Nanjing, 210095, China.,State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China
| | - Delin Hong
- Nanjing Agricultural University, Nanjing, 210095, China. .,State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China.
| |
Collapse
|
3
|
Sehgal D, Dreisigacker S. Haplotypes-based genetic analysis: benefits and challenges. Vavilovskii Zhurnal Genet Selektsii 2019. [DOI: 10.18699/vj19.37-o] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
The increasing availability of Single Nucleotide Polymorphisms (SNPs) discovered by Next Generation Sequencing will enable a range of new genetic analyses in crops, which was not possible before. Concomitantly, researchers will face the challenge of handling large data sets at the whole-genome level. By grouping thousands of SNPs into a few hundred haplotype blocks, complexity of the data can be reduced with fewer statistical tests and a lower probability of spurious associations. Owing to the strong genome structure present in breeding lines of most crops, the deployment of haplotypes could be a powerful complement to improve efficiency of marker-assisted and genomic selection. This review describes in brief the commonly used approaches to construct haplotype blocks and some examples in animals and crops are cited where haplotype-based dissection of traits were proven beneficial. Some important considerations and facts while working with haplotypes in crops are reviewed at the end.
Collapse
Affiliation(s)
- D. Sehgal
- International Center for Maize and Wheat Improvement (CIMMYT)
| | - S. Dreisigacker
- International Center for Maize and Wheat Improvement (CIMMYT)
| |
Collapse
|
4
|
Ullah Zaid I, Tang W, He J, Ullah Khan S, Hong D. Association analysis uncovers the genetic basis of general combining ability of 11 yield-related traits in parents of hybrid rice. AOB PLANTS 2019; 11:ply077. [PMID: 30697406 PMCID: PMC6343818 DOI: 10.1093/aobpla/ply077] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/04/2018] [Revised: 11/20/2018] [Accepted: 12/17/2018] [Indexed: 05/12/2023]
Abstract
Association analysis between constructed single nucleotide polymorphism linkage disequilibrium blocks (SNPLDBs) and general combining ability (GCA) effects is a novel approach to uncover the genetic basis of GCA within the sequence genomes of parents of hybrid rice. Here, we calculated the GCA effect values of 33 parents of hybrid rice and sequenced them to identify genome-wide single nucleotide polymorphisms (SNPs). In total, 64.6 % of the uniquely mapped paired-end short reads revealed a final total of 291 959 SNPs between the 33 parental genomes and the Nipponbare reference genome. The identified SNPs were non-randomly distributed among all chromosomes of rice, whereas one-fourth of the SNPs were situated in the exonic regions with 16 % being non-synonymous. Further, the identified SNPs were merged and optimized for construction of 2612 SNPLDB markers, using linkage disequilibrium information. The single-factor analysis of variance-based association method between the constructed SNPLDB markers and GCA effects values detected 99 significant SNPLDBs for GCA of 11 yield-related traits. The associated SNPLDB markers explained 26.4 % of phenotypic variations with traits, on average. We mined 50 favourable GCA alleles at the associated SNPLDBs regions, distributed across the 33 parental genomes. The parental genomes possessed a small number of favourable GCA alleles for studied traits, with the exception of days to heading and plant height. Our results suggest that the identified GCA alleles could be used to improve the GCA performance of parents of hybrid rice through optimal crossing design. Moreover, favourable GCA alleles should be incorporated in the parental genomes through marker-assisted selection experiments, and the parental lines carrying more alleles could be utilized in breeding as superior parents for developing rice hybrids of desirable characteristics.
Collapse
Affiliation(s)
- Imdad Ullah Zaid
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
- Key Laboratory of Agricultural Water Resources, Centre for Agricultural Resources Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, No. 286 Huaizhong Rd, Shijiazhuang, China
| | - Weijie Tang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Jianbo He
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
- Soybean Research Institute, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - Sana Ullah Khan
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Australia
| | - Delin Hong
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| |
Collapse
|
5
|
Combined haplotype blocks regression and multi-locus mixed model analysis reveals novel candidate genes associated with milk traits in dairy sheep. Livest Sci 2019. [DOI: 10.1016/j.livsci.2018.11.020] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
6
|
Braz CU, Taylor JF, Bresolin T, Espigolan R, Feitosa FLB, Carvalheiro R, Baldi F, de Albuquerque LG, de Oliveira HN. Sliding window haplotype approaches overcome single SNP analysis limitations in identifying genes for meat tenderness in Nelore cattle. BMC Genet 2019; 20:8. [PMID: 30642245 PMCID: PMC6332854 DOI: 10.1186/s12863-019-0713-4] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2018] [Accepted: 01/02/2019] [Indexed: 12/30/2022] Open
Abstract
Background Traditional single nucleotide polymorphism (SNP) genome-wide association analysis (GWAA) can be inefficient because single SNPs provide limited genetic information about genomic regions. On the other hand, using haplotypes in the statistical analysis may increase the extent of linkage disequilibrium (LD) between haplotypes and causal variants and may also potentially capture epistastic interactions between variants within a haplotyped locus, providing an increase in the power and robustness of the association studies. We performed GWAA (413,355 SNP markers) using haplotypes based on variable-sized sliding windows and compared the results to a single-SNP GWAA using Warner-Bratzler shear force measured in the longissimus thorasis muscle of 3161 Nelore bulls to ascertain the optimal window size for identifying the genomic regions that influence meat tenderness. Results The GWAA using single SNPs identified eight variants influencing meat tenderness on BTA 3, 4, 9, 10 and 11. However, thirty-three putative meat tenderness QTL were detected on BTA 1, 3, 4, 5, 8, 9, 10, 11, 15, 17, 18, 24, 25, 26 and 29 using variable-sized sliding haplotype windows. Analyses using sliding window haplotypes of 3, 5, 7, 9 and 11 SNPs identified 57, 61, 42, 39, and 21% of all thirty-three putative QTL regions, respectively; however, the analyses using the 3 and 5 SNP haplotypes, cumulatively detected 88% of the putative QTL. The genes associated with variation in meat tenderness participate in myogenesis, neurogenesis, lipid and fatty acid metabolism and skeletal muscle structure or composition processes. Conclusions GWAA using haplotypes based on variable-sized sliding windows allowed the detection of more QTL than traditional single-SNP GWAA. Analyses using smaller haplotypes (3 and 5 SNPs) detected a higher proportion of the putative QTL. Electronic supplementary material The online version of this article (10.1186/s12863-019-0713-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Camila U Braz
- Animal Science Department, São Paulo State University (Unesp), Jaboticabal, SP, 144884-900, Brazil.
| | - Jeremy F Taylor
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA
| | - Tiago Bresolin
- Animal Science Department, São Paulo State University (Unesp), Jaboticabal, SP, 144884-900, Brazil
| | - Rafael Espigolan
- Animal Science Department, São Paulo State University (Unesp), Jaboticabal, SP, 144884-900, Brazil
| | - Fabieli L B Feitosa
- Animal Science Department, São Paulo State University (Unesp), Jaboticabal, SP, 144884-900, Brazil
| | - Roberto Carvalheiro
- Animal Science Department, São Paulo State University (Unesp), Jaboticabal, SP, 144884-900, Brazil
| | - Fernando Baldi
- Animal Science Department, São Paulo State University (Unesp), Jaboticabal, SP, 144884-900, Brazil
| | - Lucia G de Albuquerque
- Animal Science Department, São Paulo State University (Unesp), Jaboticabal, SP, 144884-900, Brazil
| | - Henrique N de Oliveira
- Animal Science Department, São Paulo State University (Unesp), Jaboticabal, SP, 144884-900, Brazil.
| |
Collapse
|
7
|
do Nascimento AV, Romero ÂRDS, Utsunomiya YT, Utsunomiya ATH, Cardoso DF, Neves HHR, Carvalheiro R, Garcia JF, Grisolia AB. Genome-wide association study using haplotype alleles for the evaluation of reproductive traits in Nelore cattle. PLoS One 2018; 13:e0201876. [PMID: 30089161 PMCID: PMC6082543 DOI: 10.1371/journal.pone.0201876] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2017] [Accepted: 07/24/2018] [Indexed: 12/18/2022] Open
Abstract
Zebu cattle (Bos taurus indicus) are highly adapted to tropical regions. However, females reach puberty after taurine heifers, which affects the economic efficiency of beef cattle breeding in the tropical regions. The aims of this study were to establish associations between the haplotype alleles of the bovine genome and age at first calving (AFC) in the Nelore cattle, and to identify the genes and quantitative trait loci (QTL) related to this phenotype. A total of 2,273 Nelore cattle (995 males and 1,278 females) genotyped using the Illumina BovineHD BeadChip were used in the current study. The association analysis included females with valid first calving records as well as open heifers. Linkage disequilibrium (LD) analysis among the markers was performed using blocks of 5, 10, and 15 markers, which were determined by sliding windows shifting one marker at a time. Then, the haplotype block size to be used in the association study was chosen based on the highest r2 average among the SNPs in the block. The five HapAlleles most strongly associated with the trait (top five) were considered as significant associations. The results of the analysis revealed four genomic regions related to AFC, which overlapped with 20 QTL of the reproductive traits reported previously. Furthermore, there were 19 genes related to reproduction in those regions. In conclusion, the use of haplotypes allowed the detection of chromosomal regions associated with AFC in Nelore cattle, and provided the basis for elucidating the mechanisms underlying this trait.
Collapse
Affiliation(s)
- André Vieira do Nascimento
- Faculdade de Ciências Biológicas e Ambientais, Universidade Federal da Grande Dourados, UFGD, Dourados, Mato Grosso do Sul, Brazil
| | | | - Yuri Tani Utsunomiya
- Departamento de Medicina Veterinária Preventiva e Reprodução Animal, Faculdade de Ciências Agrárias e Veterinárias, UNESP, Jaboticabal, São Paulo, Brazil
| | - Adam Taiti Harth Utsunomiya
- Departamento de Apoio, Produção e Saúde Animal, Faculdade de Medicina Veterinária de Araçatuba, UNESP, Araçatuba, São Paulo, Brazil
| | - Diercles Francisco Cardoso
- Departamento de Zootecnia, Faculdade de Ciências Agrárias e Veterinárias, UNESP, Jaboticabal, São Paulo, Brazil
| | | | - Roberto Carvalheiro
- Departamento de Zootecnia, Faculdade de Ciências Agrárias e Veterinárias, UNESP, Jaboticabal, São Paulo, Brazil
| | - José Fernando Garcia
- Departamento de Medicina Veterinária Preventiva e Reprodução Animal, Faculdade de Ciências Agrárias e Veterinárias, UNESP, Jaboticabal, São Paulo, Brazil
- Departamento de Apoio, Produção e Saúde Animal, Faculdade de Medicina Veterinária de Araçatuba, UNESP, Araçatuba, São Paulo, Brazil
- International Atomic Energy Agency (IAEA), Collaborating Centre on Animal Genomics and Bioinformatics, Araçatuba, São Paulo, Brazil
| | - Alexeia Barufatti Grisolia
- Faculdade de Ciências Biológicas e Ambientais, Universidade Federal da Grande Dourados, UFGD, Dourados, Mato Grosso do Sul, Brazil
- * E-mail:
| |
Collapse
|
8
|
N’Diaye A, Haile JK, Cory AT, Clarke FR, Clarke JM, Knox RE, Pozniak CJ. Single Marker and Haplotype-Based Association Analysis of Semolina and Pasta Colour in Elite Durum Wheat Breeding Lines Using a High-Density Consensus Map. PLoS One 2017; 12:e0170941. [PMID: 28135299 PMCID: PMC5279799 DOI: 10.1371/journal.pone.0170941] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2016] [Accepted: 01/12/2017] [Indexed: 12/30/2022] Open
Abstract
Association mapping is usually performed by testing the correlation between a single marker and phenotypes. However, because patterns of variation within genomes are inherited as blocks, clustering markers into haplotypes for genome-wide scans could be a worthwhile approach to improve statistical power to detect associations. The availability of high-density molecular data allows the possibility to assess the potential of both approaches to identify marker-trait associations in durum wheat. In the present study, we used single marker- and haplotype-based approaches to identify loci associated with semolina and pasta colour in durum wheat, the main objective being to evaluate the potential benefits of haplotype-based analysis for identifying quantitative trait loci. One hundred sixty-nine durum lines were genotyped using the Illumina 90K Infinium iSelect assay, and 12,234 polymorphic single nucleotide polymorphism (SNP) markers were generated and used to assess the population structure and the linkage disequilibrium (LD) patterns. A total of 8,581 SNPs previously localized to a high-density consensus map were clustered into 406 haplotype blocks based on the average LD distance of 5.3 cM. Combining multiple SNPs into haplotype blocks increased the average polymorphism information content (PIC) from 0.27 per SNP to 0.50 per haplotype. The haplotype-based analysis identified 12 loci associated with grain pigment colour traits, including the five loci identified by the single marker-based analysis. Furthermore, the haplotype-based analysis resulted in an increase of the phenotypic variance explained (50.4% on average) and the allelic effect (33.7% on average) when compared to single marker analysis. The presence of multiple allelic combinations within each haplotype locus offers potential for screening the most favorable haplotype series and may facilitate marker-assisted selection of grain pigment colour in durum wheat. These results suggest a benefit of haplotype-based analysis over single marker analysis to detect loci associated with colour traits in durum wheat.
Collapse
Affiliation(s)
- Amidou N’Diaye
- Department of Plant Sciences and Crop Development Centre, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Jemanesh K. Haile
- Department of Plant Sciences and Crop Development Centre, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Aron T. Cory
- Department of Plant Sciences and Crop Development Centre, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Fran R. Clarke
- Semiarid Prairie Agricultural Research Centre, Agriculture and Agri-Food Canada, Swift Current, Saskatchewan, Canada
| | - John M. Clarke
- Department of Plant Sciences and Crop Development Centre, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Ron E. Knox
- Semiarid Prairie Agricultural Research Centre, Agriculture and Agri-Food Canada, Swift Current, Saskatchewan, Canada
| | - Curtis J. Pozniak
- Department of Plant Sciences and Crop Development Centre, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| |
Collapse
|
9
|
Li Y, Kim JJ. Multiple Linkage Disequilibrium Mapping Methods to Validate Additive Quantitative Trait Loci in Korean Native Cattle (Hanwoo). ASIAN-AUSTRALASIAN JOURNAL OF ANIMAL SCIENCES 2015; 28:926-35. [PMID: 26104396 PMCID: PMC4478501 DOI: 10.5713/ajas.15.0077] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/27/2015] [Revised: 03/23/2015] [Accepted: 04/25/2015] [Indexed: 01/11/2023]
Abstract
The efficiency of genome-wide association analysis (GWAS) depends on power of detection for quantitative trait loci (QTL) and precision for QTL mapping. In this study, three different strategies for GWAS were applied to detect QTL for carcass quality traits in the Korean cattle, Hanwoo; a linkage disequilibrium single locus regression method (LDRM), a combined linkage and linkage disequilibrium analysis (LDLA) and a BayesCπ approach. The phenotypes of 486 steers were collected for weaning weight (WWT), yearling weight (YWT), carcass weight (CWT), backfat thickness (BFT), longissimus dorsi muscle area, and marbling score (Marb). Also the genotype data for the steers and their sires were scored with the Illumina bovine 50K single nucleotide polymorphism (SNP) chips. For the two former GWAS methods, threshold values were set at false discovery rate <0.01 on a chromosome-wide level, while a cut-off threshold value was set in the latter model, such that the top five windows, each of which comprised 10 adjacent SNPs, were chosen with significant variation for the phenotype. Four major additive QTL from these three methods had high concordance found in 64.1 to 64.9Mb for Bos taurus autosome (BTA) 7 for WWT, 24.3 to 25.4Mb for BTA14 for CWT, 0.5 to 1.5Mb for BTA6 for BFT and 26.3 to 33.4Mb for BTA29 for BFT. Several candidate genes (i.e. glutamate receptor, ionotropic, ampa 1 [GRIA1], family with sequence similarity 110, member B [FAM110B], and thymocyte selection-associated high mobility group box [TOX]) may be identified close to these QTL. Our result suggests that the use of different linkage disequilibrium mapping approaches can provide more reliable chromosome regions to further pinpoint DNA makers or causative genes in these regions.
Collapse
Affiliation(s)
- Yi Li
- School of Biotechnology, Yeungnam University, Gyeongsan 712-749, Korea
| | - Jong-Joo Kim
- School of Biotechnology, Yeungnam University, Gyeongsan 712-749, Korea
| |
Collapse
|
10
|
Kim ES, Sonstegard TS, da Silva MVGB, Gasbarre LC, Van Tassell CP. Genome-wide scan of gastrointestinal nematode resistance in closed Angus population selected for minimized influence of MHC. PLoS One 2015; 10:e0119380. [PMID: 25803687 PMCID: PMC4372334 DOI: 10.1371/journal.pone.0119380] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2014] [Accepted: 01/30/2015] [Indexed: 12/03/2022] Open
Abstract
Genetic markers associated with parasite indicator traits are ideal targets for study of marker assisted selection aimed at controlling infections that reduce herd use of anthelminthics. For this study, we collected gastrointestinal (GI) nematode fecal egg count (FEC) data from post-weaning animals of an Angus resource population challenged to a 26 week natural exposure on pasture. In all, data from 487 animals was collected over a 16 year period between 1992 and 2007, most of which were selected for a specific DRB1 allele to reduce the influence of potential allelic variant effects of the MHC locus. A genome-wide association study (GWAS) based on BovineSNP50 genotypes revealed six genomic regions located on bovine Chromosomes 3, 5, 8, 15 and 27; which were significantly associated (-log10 p=4.3) with Box-Cox transformed mean FEC (BC-MFEC). DAVID analysis of the genes within the significant genomic regions suggested a correlation between our results and annotation for genes involved in inflammatory response to infection. Furthermore, ROH and selection signature analyses provided strong evidence that the genomic regions associated BC-MFEC have not been affected by local autozygosity or recent experimental selection. These findings provide useful information for parasite resistance prediction for young grazing cattle and suggest new candidate gene targets for development of disease-modifying therapies or future studies of host response to GI parasite infection.
Collapse
Affiliation(s)
- Eui-Soo Kim
- Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agricultural Research Service, Beltsville, Maryland, United States of America
| | - Tad S. Sonstegard
- Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agricultural Research Service, Beltsville, Maryland, United States of America
- * E-mail:
| | | | - Louis C. Gasbarre
- Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agricultural Research Service, Beltsville, Maryland, United States of America
| | - Curtis P. Van Tassell
- Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agricultural Research Service, Beltsville, Maryland, United States of America
| |
Collapse
|
11
|
Wu Y, Fan H, Wang Y, Zhang L, Gao X, Chen Y, Li J, Ren H, Gao H. Genome-wide association studies using haplotypes and individual SNPs in Simmental cattle. PLoS One 2014; 9:e109330. [PMID: 25330174 PMCID: PMC4203724 DOI: 10.1371/journal.pone.0109330] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2014] [Accepted: 09/10/2014] [Indexed: 01/05/2023] Open
Abstract
Recent advances in high-throughput genotyping technologies have provided the opportunity to map genes using associations between complex traits and markers. Genome-wide association studies (GWAS) based on either a single marker or haplotype have identified genetic variants and underlying genetic mechanisms of quantitative traits. Prompted by the achievements of studies examining economic traits in cattle and to verify the consistency of these two methods using real data, the current study was conducted to construct the haplotype structure in the bovine genome and to detect relevant genes genuinely affecting a carcass trait and a meat quality trait. Using the Illumina BovineHD BeadChip, 942 young bulls with genotyping data were introduced as a reference population to identify the genes in the beef cattle genome significantly associated with foreshank weight and triglyceride levels. In total, 92,553 haplotype blocks were detected in the genome. The regions of high linkage disequilibrium extended up to approximately 200 kb, and the size of haplotype blocks ranged from 22 bp to 199,266 bp. Additionally, the individual SNP analysis and the haplotype-based analysis detected similar regions and common SNPs for these two representative traits. A total of 12 and 7 SNPs in the bovine genome were significantly associated with foreshank weight and triglyceride levels, respectively. By comparison, 4 and 5 haplotype blocks containing the majority of significant SNPs were strongly associated with foreshank weight and triglyceride levels, respectively. In addition, 36 SNPs with high linkage disequilibrium were detected in the GNAQ gene, a potential hotspot that may play a crucial role for regulating carcass trait components.
Collapse
Affiliation(s)
- Yang Wu
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Huizhong Fan
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Yanhui Wang
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Lupei Zhang
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Xue Gao
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Yan Chen
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Junya Li
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - HongYan Ren
- Department of life sciences, National Natural Science Foundation of China, Beijing, China
- * E-mail: (HG); (HR)
| | - Huijiang Gao
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
- * E-mail: (HG); (HR)
| |
Collapse
|
12
|
Bayesian model selection for multiple QTLs mapping combining linkage disequilibrium and linkage. Genet Res (Camb) 2014; 96:e10. [PMID: 25579473 DOI: 10.1017/s0016672314000135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
Linkage disequilibrium (LD) mapping is able to localize quantitative trait loci (QTL) within a rather small region (e.g. 2 cM), which is much narrower than linkage analysis (LA, usually 20 cM). The multilocus LD method utilizes haplotype information around putative mutation and takes historical recombination events into account, and thus provides a powerful method for further fine mapping. However, sometimes there are more than one QTLs in the region being studied. In this study, the Bayesian model selection implemented via the Markov chain Monte Carlo (MCMC) method is developed for fine mapping of multiple QTLs using haplotype information in a small region. The method combines LD as well as linkage information. A series of simulation experiments were conducted to investigate the behavior of the method. The results showed that this new multiple QTLs method was more efficient in separating closely linked QTLs than single-marker association studies.
Collapse
|
13
|
Jacquin L, Elsen JM, Gilbert H. Using haplotypes for the prediction of allelic identity to fine-map QTL: characterization and properties. Genet Sel Evol 2014; 46:45. [PMID: 25022866 PMCID: PMC4223544 DOI: 10.1186/1297-9686-46-45] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2013] [Accepted: 05/20/2014] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND Numerous methods have been developed over the last decade to predict allelic identity at unobserved loci between pairs of chromosome segments along the genome. These loci are often unobserved positions tested for the presence of quantitative trait loci (QTL). The main objective of this study was to understand from a theoretical standpoint the relation between linkage disequilibrium (LD) and allelic identity prediction when using haplotypes for fine mapping of QTL. In addition, six allelic identity predictors (AIP) were also compared in this study to determine which one performed best in theory and application. RESULTS A criterion based on a simple measure of matrix distance was used to study the relation between LD and allelic identity prediction when using haplotypes. The consistency of this criterion with the accuracy of QTL localization, another criterion commonly used to compare AIP, was evaluated on a set of real chromosomes. For this set of chromosomes, the criterion was consistent with the mapping accuracy of a simulated QTL with either low or high effect. As measured by the matrix distance, the best AIP for QTL mapping were those that best captured LD between a tested position and a QTL. Moreover the matrix distance between a tested position and a QTL was shown to decrease for some AIP when LD increased. However, the matrix distance for AIP with continuous predictions in the [0,1] interval was algebraically proven to decrease less rapidly up to a lower bound with increasing LD in the simplest situations, than the discrete predictor based on identity by state between haplotypes (IBS hap), for which there was no lower bound. The expected LD between haplotypes at a tested position and alleles at a QTL is a quantity that increases naturally when the tested position gets closer to the QTL. This behavior was demonstrated with pig and unrelated human chromosomes. CONCLUSIONS When the density of markers is high, and therefore LD between adjacent loci can be assumed to be high, the discrete predictor IBS hap is recommended since it predicts allele identity correctly when taking LD into account.
Collapse
Affiliation(s)
- Laval Jacquin
- INRA, GenPhySE (Génétique, Physiologie et Systèmes d'Elevage), F-31326, Castanet-Tolosan, France.
| | | | | |
Collapse
|
14
|
Sanchez MP, Tribout T, Iannuccelli N, Bouffaud M, Servin B, Tenghe A, Dehais P, Muller N, Del Schneider MP, Mercat MJ, Rogel-Gaillard C, Milan D, Bidanel JP, Gilbert H. A genome-wide association study of production traits in a commercial population of Large White pigs: evidence of haplotypes affecting meat quality. Genet Sel Evol 2014; 46:12. [PMID: 24528607 PMCID: PMC3975960 DOI: 10.1186/1297-9686-46-12] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2013] [Accepted: 12/13/2013] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Numerous quantitative trait loci (QTL) have been detected in pigs over the past 20 years using microsatellite markers. However, due to the low density of these markers, the accuracy of QTL location has generally been poor. Since 2009, the dense genome coverage provided by the Illumina PorcineSNP60 BeadChip has made it possible to more accurately map QTL using genome-wide association studies (GWAS). Our objective was to perform high-density GWAS in order to identify genomic regions and corresponding haplotypes associated with production traits in a French Large White population of pigs. METHODS Animals (385 Large White pigs from 106 sires) were genotyped using the PorcineSNP60 BeadChip and evaluated for 19 traits related to feed intake, growth, carcass composition and meat quality. Of the 64,432 SNPs on the chip, 44,412 were used for GWAS with an animal mixed model that included a regression coefficient for the tested SNPs and a genomic kinship matrix. SNP haplotype effects in QTL regions were then tested for association with phenotypes following phase reconstruction based on the Sscrofa10.2 pig genome assembly. RESULTS Twenty-three QTL regions were identified on autosomes and their effects ranged from 0.25 to 0.75 phenotypic standard deviation units for feed intake and feed efficiency (four QTL), carcass (12 QTL) and meat quality traits (seven QTL). The 10 most significant QTL regions had effects on carcass (chromosomes 7, 10, 16, 17 and 18) and meat quality traits (two regions on chromosome 1 and one region on chromosomes 8, 9 and 13). Thirteen of the 23 QTL regions had not been previously described. A haplotype block of 183 kb on chromosome 1 (six SNPs) was identified and displayed three distinct haplotypes with significant (0.0001 < P < 0.03) associations with all evaluated meat quality traits. CONCLUSIONS GWAS analyses with the PorcineSNP60 BeadChip enabled the detection of 23 QTL regions that affect feed consumption, carcass and meat quality traits in a LW population, of which 13 were novel QTL. The proportionally larger number of QTL found for meat quality traits suggests a specific opportunity for improving these traits in the pig by genomic selection.
Collapse
Affiliation(s)
- Marie-Pierre Sanchez
- INRA, UMR1313 Génétique Animale et Biologie Intégrative, F-78350 Jouy-en-Josas, France
- INRA, AgroParisTech, UMR1313 Génétique Animale et Biologie Intégrative, F-78350 Jouy-en-Josas, France
| | - Thierry Tribout
- INRA, UMR1313 Génétique Animale et Biologie Intégrative, F-78350 Jouy-en-Josas, France
- INRA, AgroParisTech, UMR1313 Génétique Animale et Biologie Intégrative, F-78350 Jouy-en-Josas, France
| | - Nathalie Iannuccelli
- INRA, UMR444 Laboratoire de Génétique Cellulaire, F-31326 Castanet-Tolosan, France
| | | | - Bertrand Servin
- INRA, UMR444 Laboratoire de Génétique Cellulaire, F-31326 Castanet-Tolosan, France
| | - Amabel Tenghe
- INRA, UMR1313 Génétique Animale et Biologie Intégrative, F-78350 Jouy-en-Josas, France
- INRA, AgroParisTech, UMR1313 Génétique Animale et Biologie Intégrative, F-78350 Jouy-en-Josas, France
| | - Patrice Dehais
- INRA, UMR444 Laboratoire de Génétique Cellulaire, F-31326 Castanet-Tolosan, France
| | - Nelly Muller
- INRA, UE450 Testage Porcs, F-35651 Le Rheu, France
| | - Maria Pilar Del Schneider
- INRA, UMR1313 Génétique Animale et Biologie Intégrative, F-78350 Jouy-en-Josas, France
- INRA, AgroParisTech, UMR1313 Génétique Animale et Biologie Intégrative, F-78350 Jouy-en-Josas, France
| | | | - Claire Rogel-Gaillard
- INRA, UMR1313 Génétique Animale et Biologie Intégrative, F-78350 Jouy-en-Josas, France
- INRA, AgroParisTech, UMR1313 Génétique Animale et Biologie Intégrative, F-78350 Jouy-en-Josas, France
| | - Denis Milan
- INRA, UMR444 Laboratoire de Génétique Cellulaire, F-31326 Castanet-Tolosan, France
| | - Jean-Pierre Bidanel
- INRA, UMR1313 Génétique Animale et Biologie Intégrative, F-78350 Jouy-en-Josas, France
- INRA, AgroParisTech, UMR1313 Génétique Animale et Biologie Intégrative, F-78350 Jouy-en-Josas, France
| | - Hélène Gilbert
- INRA, UMR444 Laboratoire de Génétique Cellulaire, F-31326 Castanet-Tolosan, France
| |
Collapse
|
15
|
Bardol N, Ventelon M, Mangin B, Jasson S, Loywick V, Couton F, Derue C, Blanchard P, Charcosset A, Moreau L. Combined linkage and linkage disequilibrium QTL mapping in multiple families of maize (Zea mays L.) line crosses highlights complementarities between models based on parental haplotype and single locus polymorphism. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2013; 126:2717-2736. [PMID: 23975245 DOI: 10.1007/s00122-013-2167-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2012] [Accepted: 07/12/2013] [Indexed: 06/02/2023]
Abstract
Advancements in genotyping are rapidly decreasing marker costs and increasing marker density. This opens new possibilities for mapping quantitative trait loci (QTL), in particular by combining linkage disequilibrium information and linkage analysis (LDLA). In this study, we compared different approaches to detect QTL for four traits of agronomical importance in two large multi-parental datasets of maize (Zea mays L.) of 895 and 928 testcross progenies composed of 7 and 21 biparental families, respectively, and genotyped with 491 markers. We compared to traditional linkage-based methods two LDLA models relying on the dense genotyping of parental lines with 17,728 SNP: one based on a clustering approach of parental line segments into ancestral alleles and one based on single marker information. The two LDLA models generally identified more QTL (60 and 52 QTL in total) than classical linkage models (49 and 44 QTL in total). However, they performed inconsistently over datasets and traits suggesting that a compromise must be found between the reduction of allele number for increasing statistical power and the adequacy of the model to potentially complex allelic variation. For some QTL, the model exclusively based on linkage analysis, which assumed that each parental line carried a different QTL allele, was able to capture remaining variation not explained by LDLA models. These complementarities between models clearly suggest that the different QTL mapping approaches must be considered to capture the different levels of allelic variation at QTL involved in complex traits.
Collapse
Affiliation(s)
- N Bardol
- UMR0320/UMR8120 de Génétique Végétale, INRA, Université Paris-Sud, CNRS, Ferme du Moulon, 91190, Gif-sur-Yvette, France
| | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Fine-mapping quantitative trait loci with a medium density marker panel: efficiency of population structures and comparison of linkage disequilibrium linkage analysis models. Genet Res (Camb) 2013; 94:223-34. [PMID: 22950902 PMCID: PMC3487687 DOI: 10.1017/s0016672312000407] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Recently, a Haley–Knott-type regression method using combined linkage disequilibrium and linkage analyses (LDLA) was proposed to map quantitative trait loci (QTLs). Chromosome of 5 and 25 cM with 0·25 and 0·05 cM, respectively, between markers were simulated. The differences between the LDLA approaches with regard to QTL position accuracy were very limited, with a significantly better mean square error (MSE) with the LDLA regression (LDLA_reg) in sparse map cases; the contrary was observed, but not significantly, in dense map situations. The computing time required for the LDLA variance components (LDLA_vc) model was much higher than the LDLA_reg model. The precision of QTL position estimation was compared for four numbers of half-sib families, four different family sizes and two experimental designs (half-sibs, and full- and half-sibs). Regarding the number of families, MSE values were lowest for 15 or 50 half-sib families, differences not being significant. We observed that the greater the number of progenies per sire, the more accurate the QTL position. However, for a fixed population size, reducing the number of families (e.g. using a small number of large full-sib families) could lead to less accuracy of estimated QTL position.
Collapse
|
17
|
Abstract
This chapter provides an overview of statistical methods for genome-wide association studies (GWAS) in animals, plants, and humans. The simplest form of GWAS, a marker-by-marker analysis, is illustrated with a simple example. The problem of selecting a significance threshold that accounts for the large amount of multiple testing that occurs in GWAS is discussed. Population structure causes false positive associations in GWAS if not accounted for, and methods to deal with this are presented. Methodology for more complex models for GWAS, including haplotype-based approaches, accounting for identical by descent versus identical by state, and fitting all markers simultaneously are described and illustrated with examples.
Collapse
Affiliation(s)
- Ben Hayes
- Biosciences Research Division, Department of Primary Industries, Bundoora, VIC, Australia
| |
Collapse
|
18
|
Teyssèdre S, Elsen JM, Ricard A. Statistical distributions of test statistics used for quantitative trait association mapping in structured populations. Genet Sel Evol 2012; 44:32. [PMID: 23146127 PMCID: PMC3817592 DOI: 10.1186/1297-9686-44-32] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2012] [Accepted: 10/31/2012] [Indexed: 11/25/2022] Open
Abstract
Background Spurious associations between single nucleotide polymorphisms and phenotypes are a major issue in genome-wide association studies and have led to underestimation of type 1 error rate and overestimation of the number of quantitative trait loci found. Many authors have investigated the influence of population structure on the robustness of methods by simulation. This paper is aimed at developing further the algebraic formalization of power and type 1 error rate for some of the classical statistical methods used: simple regression, two approximate methods of mixed models involving the effect of a single nucleotide polymorphism (SNP) and a random polygenic effect (GRAMMAR and FASTA) and the transmission/disequilibrium test for quantitative traits and nuclear families. Analytical formulae were derived using matrix algebra for the first and second moments of the statistical tests, assuming a true mixed model with a polygenic effect and SNP effects. Results The expectation and variance of the test statistics and their marginal expectations and variances according to the distribution of genotypes and estimators of variance components are given as a function of the relationship matrix and of the heritability of the polygenic effect. These formulae were used to compute type 1 error rate and power for any kind of relationship matrix between phenotyped and genotyped individuals for any level of heritability. For the regression method, type 1 error rate increased with the variability of relationships and with heritability, but decreased with the GRAMMAR method and was not affected with the FASTA and quantitative transmission/disequilibrium test methods. Conclusions The formulae can be easily used to provide the correct threshold of type 1 error rate and to calculate the power when designing experiments or data collection protocols. The results concerning the efficacy of each method agree with simulation results in the literature but were generalized in this work. The power of the GRAMMAR method was equal to the power of the FASTA method at the same type 1 error rate. The power of the quantitative transmission/disequilibrium test was low. In conclusion, the FASTA method, which is very close to the full mixed model, is recommended in association mapping studies.
Collapse
Affiliation(s)
- Simon Teyssèdre
- INRA, UR 631 Station d’Amélioration Génétique des Animaux, Castanet-Tolosan F-31326, France
| | | | | |
Collapse
|
19
|
Lee SH, Lim D, Jang GW, Cho YM, Choi BH, Kim SD, Oh SJ, Lee JH, Yoon D, Park EW, Lee HK, Hong SK, Yang BS. Genome Wide Association Study to Identity QTL for Growth Taits in Hanwoo. JOURNAL OF ANIMAL SCIENCE AND TECHNOLOGY 2012. [DOI: 10.5187/jast.2012.54.5.323] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
20
|
Ding X, Wang C, Zhang Q. Pedigree transmission disequilibrium test for quantitative traits in farm animals. CHINESE SCIENCE BULLETIN-CHINESE 2012. [DOI: 10.1007/s11434-012-5218-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
21
|
Pszczola M, Strabel T, Mulder HA, Calus MPL. Reliability of direct genomic values for animals with different relationships within and to the reference population. J Dairy Sci 2012; 95:389-400. [PMID: 22192218 DOI: 10.3168/jds.2011-4338] [Citation(s) in RCA: 208] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2011] [Accepted: 09/18/2011] [Indexed: 11/19/2022]
Abstract
Accuracy of genomic selection depends on the accuracy of prediction of single nucleotide polymorphism effects and the proportion of genetic variance explained by markers. Design of the reference population with respect to its family structure may influence the accuracy of genomic selection. The objective of this study was to investigate the effect of various relationship levels within the reference population and different level of relationship of evaluated animals to the reference population on the reliability of direct genomic breeding values (DGV). The DGV reliabilities, expressed as squared correlation between estimated and true breeding value, were calculated for evaluated animals at 3 heritability levels. To emulate a trait that is difficult or expensive to measure, such as methane emission, reference populations were kept small and consisted of females with own performance records. A population reflecting a dairy cattle population structure was simulated. Four chosen reference populations consisted of all females available in the first genotyped generation. They consisted of highly (HR), moderately (MR), or lowly (LR) related animals, by selecting paternal half-sib families of decreasing size, or consisted of randomly chosen animals (RND). Of those 4 reference populations, RND had the lowest average relationship. Three sets of evaluated animals were chosen from 3 consecutive generations of genotyped animals, starting from the same generation as the reference population. Reliabilities of DGV predictions were calculated deterministically using selection index theory. The randomly chosen reference population had the lowest average relationship within the reference population. Average reliabilities increased when average relationship within the reference population decreased and the highest average reliabilities were achieved for RND (e.g., from 0.53 in HR to 0.61 in RND for a heritability of 0.30). A higher relationship to the reference population resulted in higher reliability values. At the average squared relationship of evaluated animals to the reference population of 0.005, reliabilities were, on average, 0.49 (HR) and 0.63 (RND) for a heritability of 0.30; 0.20 (HR) and 0.27 (RND) for a heritability of 0.05; and 0.07 (HR) and 0.09 (RND) for a heritability of 0.01. Substantial decrease in the reliability was observed when the number of generations to the reference population increased [e.g., for heritability of 0.30, the decrease from evaluated set I (chosen from the same generation as the reference population) to II (one generation younger than the reference population) was 0.04 for HR, and 0.07 for RND]. In this study, the importance of the design of a reference population consisting of cows was shown and optimal designs of the reference population for genomic prediction were suggested.
Collapse
Affiliation(s)
- M Pszczola
- Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, Lelystad, the Netherlands.
| | | | | | | |
Collapse
|
22
|
Lee SH, van der Werf J, Lee SH, Lim DJ, Park EW, Gondro C, Yoon D, Oh SJ, Kim OH, Gibson J, Thompson J. Genome wide QTL mapping to identify candidate genes for carcass traits in Hanwoo (Korean Cattle). Genes Genomics 2012. [DOI: 10.1007/s13258-011-0081-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
|
23
|
Abstract
Hierarchical mixed effects models have been demonstrated to be powerful for predicting genomic merit of livestock and plants, on the basis of high-density single-nucleotide polymorphism (SNP) marker panels, and their use is being increasingly advocated for genomic predictions in human health. Two particularly popular approaches, labeled BayesA and BayesB, are based on specifying all SNP-associated effects to be independent of each other. BayesB extends BayesA by allowing a large proportion of SNP markers to be associated with null effects. We further extend these two models to specify SNP effects as being spatially correlated due to the chromosomally proximal effects of causal variants. These two models, that we respectively dub as ante-BayesA and ante-BayesB, are based on a first-order nonstationary antedependence specification between SNP effects. In a simulation study involving 20 replicate data sets, each analyzed at six different SNP marker densities with average LD levels ranging from r(2) = 0.15 to 0.31, the antedependence methods had significantly (P < 0.01) higher accuracies than their corresponding classical counterparts at higher LD levels (r(2) > 0. 24) with differences exceeding 3%. A cross-validation study was also conducted on the heterogeneous stock mice data resource (http://mus.well.ox.ac.uk/mouse/HS/) using 6-week body weights as the phenotype. The antedependence methods increased cross-validation prediction accuracies by up to 3.6% compared to their classical counterparts (P < 0.001). Finally, we applied our method to other benchmark data sets and demonstrated that the antedependence methods were more accurate than their classical counterparts for genomic predictions, even for individuals several generations beyond the training data.
Collapse
|
24
|
Powell JE, Kranis A, Floyd J, Dekkers JCM, Knott S, Haley CS. Optimal use of regression models in genome-wide association studies. Anim Genet 2011; 43:133-43. [PMID: 22404349 DOI: 10.1111/j.1365-2052.2011.02234.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The performance of linear regression models in genome-wide association studies is influenced by how marker information is parameterized in the model. Considering the impact of parameterization is especially important when using information from multiple markers to test for association. Properties of the population, such as linkage disequilibrium (LD) and allele frequencies, will also affect the ability of a model to provide statistical support for an underlying quantitative trait locus (QTL). Thus, for a given location in the genome, the relationship between population properties and model parameterization is expected to influence the performance of the model in providing evidence for the position of a QTL. As LD and allele frequencies vary throughout the genome and between populations, understanding the relationship between these properties and model parameterization is of considerable importance in order to make optimal use of available genomic data. Here, we evaluate the performance of regression-based association models using genotype and haplotype information across the full spectrum of allele frequency and LD scenarios. Genetic marker data from 200 broiler chickens were used to simulate genomic conditions by selecting individual markers to act as surrogate QTL (sQTL) and then investigating the ability of surrounding markers to estimate sQTL genotypes and provide statistical support for their location. The LD and allele frequencies of markers and sQTL are shown to have a strong effect on the performance of models relative to one another. Our results provide an indication of the best choice of model parameterization given certain scenarios of marker and QTL LD and allele frequencies. We demonstrate a clear advantage of haplotype-based models, which account for phase uncertainty over other models tested, particularly for QTL with low minor allele frequencies. We show that the greatest advantage of haplotype models over single-marker models occurs when LD between markers and the causal locus is low. Under these situations, haplotype models have a greater accuracy of predicting the location of the QTL than other models tested.
Collapse
Affiliation(s)
- J E Powell
- Department of Genetics and Genomics, The Roslin Institute, University of Edinburgh, Roslin, UK.
| | | | | | | | | | | |
Collapse
|
25
|
Abstract
Background Bayesian methods allow prediction of genomic breeding values (GEBVs) using high-density single nucleotide polymorphisms (SNPs) covering the whole genome with effective shrinkage of SNP effects using appropriate priors. In this study we applied a modification of the well-known BayesA and BayesB methods to estimate the proportion of SNPs with zero effects (π) and a common variance for non-zero effects. The method, termed BayesCπ, was used to predict the GEBVs of the last generation of the QTLMAS2010 data. The accuracy of GEBVs from various methods was estimated by the correlation with phenotypes in the last generation. The methods were BayesCPi and BayesB with different π values, both with and without polygenic effects, and best linear unbiased prediction using an animal model with a genomic or numerator relationship matrix. Positions of quantitative trait loci (QTLs) were identified based on the variances of GEBVs for windows of 10 consecutive SNPs. We also proposed a novel approach to set significance thresholds for claiming QTL in this specific case by using pedigree-based simulation of genotypes. All analyses were focused on detecting and evaluating QTL with additive effects. Results The accuracy of GEBVs was highest for BayesCπ, but the accuracy of BayesB with π equal to 0.99 was similar to that of BayesCπ. The accuracy of BayesB dropped with a decrease in π. Including polygenic effects into the model only had marginal effects on accuracy and bias of predictions. The number of QTL identified was 15 when based on a stringent 10% chromosome-wise threshold and increased to 21 when a 20% chromosome-wise threshold was used. Conclusions The BayesCπ method without polygenic effects was identified to be the best method for the QTLMAS2010 dataset, because it had highest accuracy and least bias. The significance criterion based on variance of 10-SNP windows allowed detection of more than half of the QTL, with few false positives.
Collapse
|
26
|
Abstract
Background Partial least square regression (PLSR) was used to analyze the data of the QTLMAS 2010 workshop to identify genomic regions affecting either one of the two traits and to estimate breeding values. PLSR was appropriate for these data because it enabled to simultaneously fit several traits to the markers. Results A preliminary analysis showed phenotypic and genetic correlations between the two traits. Consequently, the data were analyzed jointly in a PLSR model for each chromosome independently. Regression coefficients for the markers were used to calculate the variance of each marker and inference of quantitative trait loci (QTL) was based on local maxima of a smoothed line traced through these variances. In this way, 25 QTL for the continuous trait and 22 for the discrete trait were found. There was evidence for pleiotropic QTL on chromosome 1. The 2000 most important markers were fitted in a second PLSR model to calculate breeding values of the individuals. The accuracies of these estimated breeding values ranged between 0.56 and 0.92. Conclusions Results showed the viability of PLSR for QTL analysis and estimating breeding values using markers.
Collapse
Affiliation(s)
- Albart Coster
- Animal Breeding and Genomics Centre, Wageningen University, Wageningen, The Netherlands.
| | | |
Collapse
|
27
|
Weller JI, Ron M. Invited review: quantitative trait nucleotide determination in the era of genomic selection. J Dairy Sci 2011; 94:1082-90. [PMID: 21338774 DOI: 10.3168/jds.2010-3793] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2010] [Accepted: 11/09/2010] [Indexed: 12/24/2022]
Abstract
Genome-wide association studies based on tens of thousands of single nucleotide polymorphisms have been completed for several dairy cattle populations. Methods have been proposed to directly incorporate genome scan data into breeding programs, chiefly by selection of young sires based on their genotypes for the genetic markers and pedigree without progeny test. Thus, the rate of genetic gain is increased by reduction of the mean generation interval. The methods developed so far for application of genomic selection do not require identification of the actual quantitative trait nucleotides (QTN) responsible for the observed variation of quantitative trait loci (QTL). To date, 2 QTN affecting milk production traits have been detected in dairy cattle: DGAT1 and ABCG2. This review will attempt to address the following questions based on the current state of bovine genomics and statistics. What are the pros and cons for QTN determination? How can data obtained from high-density, genome-wide scans be used most efficiently for QTN determination? Can the genome scan results already available and next-generation sequencing data be used to determine QTN? Should QTN be treated differently than markers at linkage disequilibrium with QTL in genetic evaluation programs? Data obtained by genome-wide association studies can be used to deduce QTL genotypes of sires via application of the a posteriori granddaughter design for concordance testing of putative QTN. This, together with next-generation sequencing technology, will dramatically reduce costs for QTN determination. By complete genome sequencing of 21 sires with many artificial insemination sons, it should be possible to determine concordance for all potential QTN, thus establishing the field of QTNomics.
Collapse
Affiliation(s)
- J I Weller
- Institute of Animal Sciences, A.R.O., The Volcani Center, Bet Dagan 50250, Israel.
| | | |
Collapse
|
28
|
Performance of single nucleotide polymorphisms versus haplotypes for genome-wide association analysis in barley. PLoS One 2010; 5:e14079. [PMID: 21124933 PMCID: PMC2989918 DOI: 10.1371/journal.pone.0014079] [Citation(s) in RCA: 85] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2010] [Accepted: 10/21/2010] [Indexed: 12/30/2022] Open
Abstract
Genome-wide association studies (GWAS) may benefit from utilizing haplotype information for making marker-phenotype associations. Several rationales for grouping single nucleotide polymorphisms (SNPs) into haplotype blocks exist, but any advantage may depend on such factors as genetic architecture of traits, patterns of linkage disequilibrium in the study population, and marker density. The objective of this study was to explore the utility of haplotypes for GWAS in barley (Hordeum vulgare) to offer a first detailed look at this approach for identifying agronomically important genes in crops. To accomplish this, we used genotype and phenotype data from the Barley Coordinated Agricultural Project and constructed haplotypes using three different methods. Marker-trait associations were tested by the efficient mixed-model association algorithm (EMMA). When QTL were simulated using single SNPs dropped from the marker dataset, a simple sliding window performed as well or better than single SNPs or the more sophisticated methods of blocking SNPs into haplotypes. Moreover, the haplotype analyses performed better 1) when QTL were simulated as polymorphisms that arose subsequent to marker variants, and 2) in analysis of empirical heading date data. These results demonstrate that the information content of haplotypes is dependent on the particular mutational and recombinational history of the QTL and nearby markers. Analysis of the empirical data also confirmed our intuition that the distribution of QTL alleles in nature is often unlike the distribution of marker variants, and hence utilizing haplotype information could capture associations that would elude single SNPs. We recommend routine use of both single SNP and haplotype markers for GWAS to take advantage of the full information content of the genotype data.
Collapse
|
29
|
Jiang L, Liu J, Sun D, Ma P, Ding X, Yu Y, Zhang Q. Genome wide association studies for milk production traits in Chinese Holstein population. PLoS One 2010; 5:e13661. [PMID: 21048968 PMCID: PMC2965099 DOI: 10.1371/journal.pone.0013661] [Citation(s) in RCA: 176] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2010] [Accepted: 10/04/2010] [Indexed: 11/21/2022] Open
Abstract
Genome-wide association studies (GWAS) based on high throughput SNP genotyping technologies open a broad avenue for exploring genes associated with milk production traits in dairy cattle. Motivated by pinpointing novel quantitative trait nucleotide (QTN) across Bos Taurus genome, the present study is to perform GWAS to identify genes affecting milk production traits using current state-of-the-art SNP genotyping technology, i.e., the Illumina BovineSNP50 BeadChip. In the analyses, the five most commonly evaluated milk production traits are involved, including milk yield (MY), milk fat yield (FY), milk protein yield (PY), milk fat percentage (FP) and milk protein percentage (PP). Estimated breeding values (EBVs) of 2,093 daughters from 14 paternal half-sib families are considered as phenotypes within the framework of a daughter design. Association tests between each trait and the 54K SNPs are achieved via two different analysis approaches, a paternal transmission disequilibrium test (TDT)-based approach (L1-TDT) and a mixed model based regression analysis (MMRA). In total, 105 SNPs were detected to be significantly associated genome-wise with one or multiple milk production traits. Of the 105 SNPs, 38 were commonly detected by both methods, while four and 63 were solely detected by L1-TDT and MMRA, respectively. The majority (86 out of 105) of the significant SNPs is located within the reported QTL regions and some are within or close to the reported candidate genes. In particular, two SNPs, ARS-BFGL-NGS-4939 and BFGL-NGS-118998, are located close to the DGAT1 gene (160bp apart) and within the GHR gene, respectively. Our findings herein not only provide confirmatory evidences for previously findings, but also explore a suite of novel SNPs associated with milk production traits, and thus form a solid basis for eventually unraveling the causal mutations for milk production traits in dairy cattle.
Collapse
Affiliation(s)
- Li Jiang
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, People's Republic of China
| | - Jianfeng Liu
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, People's Republic of China
| | - Dongxiao Sun
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, People's Republic of China
| | - Peipei Ma
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, People's Republic of China
| | - Xiangdong Ding
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, People's Republic of China
| | - Ying Yu
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, People's Republic of China
| | - Qin Zhang
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, People's Republic of China
- * E-mail:
| |
Collapse
|
30
|
Cierco-Ayrolles C, Dejean S, Legarra A, Gilbert H, Druet T, Ytournel F, Estivals D, Oumouhou N, Mangin B. Does probabilistic modelling of linkage disequilibrium evolution improve the accuracy of QTL location in animal pedigree? Genet Sel Evol 2010; 42:38. [PMID: 20969751 PMCID: PMC2984385 DOI: 10.1186/1297-9686-42-38] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2010] [Accepted: 10/22/2010] [Indexed: 12/02/2022] Open
Abstract
Background Since 2001, the use of more and more dense maps has made researchers aware that combining linkage and linkage disequilibrium enhances the feasibility of fine-mapping genes of interest. So, various method types have been derived to include concepts of population genetics in the analyses. One major drawback of many of these methods is their computational cost, which is very significant when many markers are considered. Recent advances in technology, such as SNP genotyping, have made it possible to deal with huge amount of data. Thus the challenge that remains is to find accurate and efficient methods that are not too time consuming. The study reported here specifically focuses on the half-sib family animal design. Our objective was to determine whether modelling of linkage disequilibrium evolution improved the mapping accuracy of a quantitative trait locus of agricultural interest in these populations. We compared two methods of fine-mapping. The first one was an association analysis. In this method, we did not model linkage disequilibrium evolution. Therefore, the modelling of the evolution of linkage disequilibrium was a deterministic process; it was complete at time 0 and remained complete during the following generations. In the second method, the modelling of the evolution of population allele frequencies was derived from a Wright-Fisher model. We simulated a wide range of scenarios adapted to animal populations and compared these two methods for each scenario. Results Our results indicated that the improvement produced by probabilistic modelling of linkage disequilibrium evolution was not significant. Both methods led to similar results concerning the location accuracy of quantitative trait loci which appeared to be mainly improved by using four flanking markers instead of two. Conclusions Therefore, in animal half-sib designs, modelling linkage disequilibrium evolution using a Wright-Fisher model does not significantly improve the accuracy of the QTL location when compared to a simpler method assuming complete and constant linkage between the QTL and the marker alleles. Finally, given the high marker density available nowadays, the simpler method should be preferred as it gives accurate results in a reasonable computing time.
Collapse
|
31
|
Erbe M, Ytournel F, Pimentel E, Sharifi A, Simianer H. Power and robustness of three whole genome association mapping approaches in selected populations. J Anim Breed Genet 2010; 128:3-14. [DOI: 10.1111/j.1439-0388.2010.00885.x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
32
|
MacLeod IM, Hayes BJ, Savin KW, Chamberlain AJ, McPartlan HC, Goddard ME. Power of a genome scan to detect and locate quantitative trait loci in cattle using dense single nucleotide polymorphisms. J Anim Breed Genet 2010; 127:133-42. [PMID: 20433522 DOI: 10.1111/j.1439-0388.2009.00831.x] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
There is increasing use of dense single nucleotide polymorphisms (SNPs) for whole-genome association studies (WGAS) in livestock to map and identify quantitative trait loci (QTL). These studies rely on linkage disequilibrium (LD) to detect an association between SNP genotypes and phenotypes. The power and precision of these WGAS are unknown, and will depend on the extent of LD in the experimental population. One complication for WGAS in livestock populations is that they typically consist of many paternal half-sib families, and in some cases full-sib families; unless this subtle population stratification is accounted for, many spurious associations may be reported. Our aim was to investigate the power, precision and false discovery rates of WGAS for QTL discovery, with a commercial SNP array, given existing patterns of LD in cattle. We also tested the efficiency of selective genotyping animals. A total of 365 cattle were genotyped for 9232 SNPs. We simulated a QTL effect as well as polygenic and environmental effects for all animals. One QTL was simulated on a randomly chosen SNP and accounted for 5%, 10% or 18% of the total variance. The power to detect a moderate-sized additive QTL (5% of the phenotypic variance) with 365 animals genotyped was 37% (p < 0.001). Most importantly, if pedigree structure was not accounted for, the number of false positives significantly increased above those expected by chance alone. Selective genotyping also resulted in a significant increase in false positives, even when pedigree structure was accounted for.
Collapse
Affiliation(s)
- I M MacLeod
- Cooperative Research Centre for Beef Genetic Technologies, Armidale, NSW, Australia.
| | | | | | | | | | | |
Collapse
|
33
|
Pryce J, Bolormaa S, Chamberlain A, Bowman P, Savin K, Goddard M, Hayes B. A validated genome-wide association study in 2 dairy cattle breeds for milk production and fertility traits using variable length haplotypes. J Dairy Sci 2010; 93:3331-45. [DOI: 10.3168/jds.2009-2893] [Citation(s) in RCA: 118] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2009] [Accepted: 03/11/2010] [Indexed: 11/19/2022]
|
34
|
Baes C, Goertz I, Mayer M, Weimann C, Liu Z, Reinhardt F, Erhardt G, Reinsch N. Refined mapping of quantitative trait loci for somatic cell score on BTA02 in the German Holstein. J Anim Breed Genet 2010; 127:180-8. [PMID: 20536635 DOI: 10.1111/j.1439-0388.2009.00838.x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
The aim of this study was to more precisely map a previously reported quantitative trait locus (QTL) affecting somatic cell score on Bos taurus autosome 2 by increasing the number of markers fourfold, analysing more families and exploiting within-population linkage disequilibrium (LD). A granddaughter design of 10 German Holstein grandsire families with 1121 progeny tested sons was used. Twenty-six markers with an average marker spacing of 3.14 cM were genotyped along 81.6 cM. Linkage analysis (LA) was performed using variance-component methodology. The incorporation of LD was first done using variance-component methods followed by regression on marker alleles. LA revealed genome-wide significance (LOD > 3) at 15 contiguous marker-intervals, with the maximum test-statistic between DIK2862 and BMS778 and a 1-lod drop-off interval of 38 cM. While the variance-component methods could not detect any LD, two individual markers with a significant effect (ILSTS098, p < 0.05; BMS778, p < 0.01) were found by regression analysis. Compared with previous results QTL-localisation was substantially narrowed; further fine-mapping should focus on the close vicinity of BMS778.
Collapse
Affiliation(s)
- C Baes
- Research Institute for the Biology of Farm Animals, Research Unit: Genetics and Biometry, 18196 Dummerstorf, Germany
| | | | | | | | | | | | | | | |
Collapse
|
35
|
He W, Fernando RL, Dekkers JC, Gilbert H. A gene frequency model for QTL mapping using Bayesian inference. Genet Sel Evol 2010; 42:21. [PMID: 20540762 PMCID: PMC2901203 DOI: 10.1186/1297-9686-42-21] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2009] [Accepted: 06/11/2010] [Indexed: 01/16/2023] Open
Abstract
Background Information for mapping of quantitative trait loci (QTL) comes from two sources: linkage disequilibrium (non-random association of allele states) and cosegregation (non-random association of allele origin). Information from LD can be captured by modeling conditional means and variances at the QTL given marker information. Similarly, information from cosegregation can be captured by modeling conditional covariances. Here, we consider a Bayesian model based on gene frequency (BGF) where both conditional means and variances are modeled as a function of the conditional gene frequencies at the QTL. The parameters in this model include these gene frequencies, additive effect of the QTL, its location, and the residual variance. Bayesian methodology was used to estimate these parameters. The priors used were: logit-normal for gene frequencies, normal for the additive effect, uniform for location, and inverse chi-square for the residual variance. Computer simulation was used to compare the power to detect and accuracy to map QTL by this method with those from least squares analysis using a regression model (LSR). Results To simplify the analysis, data from unrelated individuals in a purebred population were simulated, where only LD information contributes to map the QTL. LD was simulated in a chromosomal segment of 1 cM with one QTL by random mating in a population of size 500 for 1000 generations and in a population of size 100 for 50 generations. The comparison was studied under a range of conditions, which included SNP density of 0.1, 0.05 or 0.02 cM, sample size of 500 or 1000, and phenotypic variance explained by QTL of 2 or 5%. Both 1 and 2-SNP models were considered. Power to detect the QTL for the BGF, ranged from 0.4 to 0.99, and close or equal to the power of the regression using least squares (LSR). Precision to map QTL position of BGF, quantified by the mean absolute error, ranged from 0.11 to 0.21 cM for BGF, and was better than the precision of LSR, which ranged from 0.12 to 0.25 cM. Conclusions In conclusion given a high SNP density, the gene frequency model can be used to map QTL with considerable accuracy even within a 1 cM region.
Collapse
Affiliation(s)
- Wei He
- Department of Animal Science, Iowa State University, Ames, IA, USA.
| | | | | | | |
Collapse
|
36
|
Estimating genetic diversity across the neutral genome with the use of dense marker maps. Genet Sel Evol 2010; 42:12. [PMID: 20459708 PMCID: PMC2882900 DOI: 10.1186/1297-9686-42-12] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2009] [Accepted: 05/10/2010] [Indexed: 11/23/2022] Open
Abstract
Background With the advent of high throughput DNA typing, dense marker maps have become available to investigate genetic diversity on specific regions of the genome. The aim of this paper was to compare two marker based estimates of the genetic diversity in specific genomic regions lying in between markers: IBD-based genetic diversity and heterozygosity. Methods A computer simulated population was set up with individuals containing a single 1-Morgan chromosome and 1665 SNP markers and from this one, an additional population was produced with a lower marker density i.e. 166 SNP markers. For each marker interval based on adjacent markers, the genetic diversity was estimated either by IBD probabilities or heterozygosity. Estimates were compared to each other and to the true genetic diversity. The latter was calculated for a marker in the middle of each marker interval that was not used to estimate genetic diversity. Results The simulated population had an average minor allele frequency of 0.28 and an LD (r2) of 0.26, comparable to those of real livestock populations. Genetic diversities estimated by IBD probabilities and by heterozygosity were positively correlated, and correlations with the true genetic diversity were quite similar for the simulated population with a high marker density, both for specific regions (r = 0.19-0.20) and large regions (r = 0.61-0.64) over the genome. For the population with a lower marker density, the correlation with the true genetic diversity turned out to be higher for the IBD-based genetic diversity. Conclusions Genetic diversities of ungenotyped regions of the genome (i.e. between markers) estimated by IBD-based methods and heterozygosity give similar results for the simulated population with a high marker density. However, for a population with a lower marker density, the IBD-based method gives a better prediction, since variation and recombination between markers are missed with heterozygosity.
Collapse
|
37
|
Sensitivity of methods for estimating breeding values using genetic markers to the number of QTL and distribution of QTL variance. Genet Sel Evol 2010; 42:9. [PMID: 20302681 PMCID: PMC2851578 DOI: 10.1186/1297-9686-42-9] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2009] [Accepted: 03/22/2010] [Indexed: 11/19/2022] Open
Abstract
The objective of this simulation study was to compare the effect of the number of QTL and distribution of QTL variance on the accuracy of breeding values estimated with genomewide markers (MEBV). Three distinct methods were used to calculate MEBV: a Bayesian Method (BM), Least Angle Regression (LARS) and Partial Least Square Regression (PLSR). The accuracy of MEBV calculated with BM and LARS decreased when the number of simulated QTL increased. The accuracy decreased more when QTL had different variance values than when all QTL had an equal variance. The accuracy of MEBV calculated with PLSR was affected neither by the number of QTL nor by the distribution of QTL variance. Additional simulations and analyses showed that these conclusions were not affected by the number of individuals in the training population, by the number of markers and by the heritability of the trait. Results of this study show that the effect of the number of QTL and distribution of QTL variance on the accuracy of MEBV depends on the method that is used to calculate MEBV.
Collapse
|
38
|
Mulder HA, Calus MPL, Veerkamp RF. Prediction of haplotypes for ungenotyped animals and its effect on marker-assisted breeding value estimation. Genet Sel Evol 2010; 42:10. [PMID: 20307301 PMCID: PMC2861017 DOI: 10.1186/1297-9686-42-10] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2009] [Accepted: 03/22/2010] [Indexed: 11/10/2022] Open
Abstract
Background In livestock populations, missing genotypes on a large proportion of animals are a major problem to implement the estimation of marker-assisted breeding values using haplotypes. The objective of this article is to develop a method to predict haplotypes of animals that are not genotyped using mixed model equations and to investigate the effect of using these predicted haplotypes on the accuracy of marker-assisted breeding value estimation. Methods For genotyped animals, haplotypes were determined and for each animal the number of haplotype copies (nhc) was counted, i.e. 0, 1 or 2 copies. In a mixed model framework, nhc for each haplotype were predicted for ungenotyped animals as well as for genotyped animals using the additive genetic relationship matrix. The heritability of nhc was assumed to be 0.99, allowing for minor genotyping and haplotyping errors. The predicted nhc were subsequently used in marker-assisted breeding value estimation by applying random regression on these covariables. To evaluate the method, a population was simulated with one additive QTL and an additive polygenic genetic effect. The QTL was located in the middle of a haplotype based on SNP-markers. Results The accuracy of predicted haplotype copies for ungenotyped animals ranged between 0.59 and 0.64 depending on haplotype length. Because powerful BLUP-software was used, the method was computationally very efficient. The accuracy of total EBV increased for genotyped animals when marker-assisted breeding value estimation was compared with conventional breeding value estimation, but for ungenotyped animals the increase was marginal unless the heritability was smaller than 0.1. Haplotypes based on four markers yielded the highest accuracies and when only the nearest left marker was used, it yielded the lowest accuracy. The accuracy increased with increasing marker density. Accuracy of the total EBV approached that of gene-assisted BLUP when 4-marker haplotypes were used with a distance of 0.1 cM between the markers. Conclusions The proposed method is computationally very efficient and suitable for marker-assisted breeding value estimation in large livestock populations including effects of a number of known QTL. Marker-assisted breeding value estimation using predicted haplotypes increases accuracy especially for traits with low heritability.
Collapse
Affiliation(s)
- Han A Mulder
- Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, PO Box 65, 8200 AB Lelystad, The Netherlands.
| | | | | |
Collapse
|
39
|
Mei G, Yin C, Ding X, Zhang Q. Fine mapping quantitative trait loci affecting milk production traits on bovine chromosome 6 in a Chinese Holstein population. J Genet Genomics 2009; 36:653-60. [PMID: 19932461 DOI: 10.1016/s1673-8527(08)60157-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2009] [Revised: 07/08/2009] [Accepted: 08/14/2009] [Indexed: 11/18/2022]
Abstract
To fine map the previously detected quantitative trait loci (QTLs) affecting milk production traits on bovine chromosome 6 (BTA6), 15 microsatellite markers situated within an interval of 14.3 cM spanning from BMS690 to BM4528 were selected and 918 daughters of 8 sires were genotyped. Two mapping approaches, haplotype sharing based LD mapping and single marker regression mapping, were used to analyze the data. Both approaches revealed a quantitative trait locus (QTL) with significant effects on milk yield, fat yield and protein yield located in the segment flanked by markers BMS483 and MNB209, which spans a genetic distance of 0.6 cM and a physical distance of 1.5 Mb. In addition, the single marker regression mapping also revealed a QTL affecting fat percentage and protein percentage at marker DIK2291. Our fine mapping work will facilitate the cloning of candidate genes underlying the QTLs for milk production traits.
Collapse
Affiliation(s)
- Gui Mei
- State Key Laboratory for Agrobiotechnology, Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | | | | | | |
Collapse
|
40
|
Kolbehdari D, Wang Z, Grant JR, Murdoch B, Prasad A, Xiu Z, Marques E, Stothard P, Moore SS. A whole genome scan to map QTL for milk production traits and somatic cell score in Canadian Holstein bulls. J Anim Breed Genet 2009; 126:216-27. [PMID: 19646150 DOI: 10.1111/j.1439-0388.2008.00793.x] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The detection and mapping of genetic markers linked to quantitative trait loci (QTL) can be utilized to enhance genetic improvement of livestock populations. With the completion of the bovine genome sequence assembly, single nucleotide polymorphisms (SNP) assays spanning the whole bovine genome and research work on large scale identification, validation and analysis of genotypic variation in cattle has become possible. The objective of the present study was to perform a whole genome scan to identify and map QTL affecting milk production traits and somatic cell scores using linkage disequilibrium (LD) regression and 1536 SNP markers. Three and 18 SNP were found to be associated with only milk yield (MY) at a genome and chromosome wise significance (p < 0.05) level respectively. Among the 21 significant SNP, 16 were in a region reported to have QTL for MY in other dairy cattle populations and while the rest five were new QTL finding. Four SNP out of 21 are significant for the milk production traits (MY, fat yield, protein yield (PY), and milk contents) in the present study. Six and nine SNP were associated with PY at a genome and chromosome wise significant (p < 0.05) level respectively. Three and 17 SNP were found to be associated with FY at a genome and chromosome wise significant (p < 0.05) level. Five and seven SNP were mapped with somatic cell score at a genome and chromosome wise significant (p < 0.05) level respectively. The results of this study have revealed QTL for MY, PY, protein percentage, FY, fat percentage, somatic cell score and persistency of milk in the Canadian dairy cattle population. The chromosome regions identified in this study should be further investigated to potentially identify the causative mutations underlying the QTL.
Collapse
Affiliation(s)
- D Kolbehdari
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, Alberta, Canada.
| | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Legarra A, Fernando RL. Linear models for joint association and linkage QTL mapping. Genet Sel Evol 2009; 41:43. [PMID: 19788745 PMCID: PMC2764561 DOI: 10.1186/1297-9686-41-43] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2009] [Accepted: 09/29/2009] [Indexed: 11/14/2022] Open
Abstract
Background Populational linkage disequilibrium and within-family linkage are commonly used for QTL mapping and marker assisted selection. The combination of both results in more robust and accurate locations of the QTL, but models proposed so far have been either single marker, complex in practice or well fit to a particular family structure. Results We herein present linear model theory to come up with additive effects of the QTL alleles in any member of a general pedigree, conditional to observed markers and pedigree, accounting for possible linkage disequilibrium among QTLs and markers. The model is based on association analysis in the founders; further, the additive effect of the QTLs transmitted to the descendants is a weighted (by the probabilities of transmission) average of the substitution effects of founders' haplotypes. The model allows for non-complete linkage disequilibrium QTL-markers in the founders. Two submodels are presented: a simple and easy to implement Haley-Knott type regression for half-sib families, and a general mixed (variance component) model for general pedigrees. The model can use information from all markers. The performance of the regression method is compared by simulation with a more complex IBD method by Meuwissen and Goddard. Numerical examples are provided. Conclusion The linear model theory provides a useful framework for QTL mapping with dense marker maps. Results show similar accuracies but a bias of the IBD method towards the center of the region. Computations for the linear regression model are extremely simple, in contrast with IBD methods. Extensions of the model to genomic selection and multi-QTL mapping are straightforward.
Collapse
|
42
|
Macciotta NPP, Gaspa G, Steri R, Pieramati C, Carnier P, Dimauro C. Pre-selection of most significant SNPS for the estimation of genomic breeding values. BMC Proc 2009; 3 Suppl 1:S14. [PMID: 19278540 PMCID: PMC2654495 DOI: 10.1186/1753-6561-3-s1-s14] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
The availability of a large amount of SNP markers throughout the genome of different livestock species offers the opportunity to estimate genomic breeding values (GEBVs). However, the estimation of many effects in a data set of limited size represent a severe statistical problem. A pre-selection of SNPS based on single regression may provide a reasonable compromise between accuracy of results, number of independent variables to be considered and computing requirements. A total of 595 and 618 SNPS were pre-selected using a simple linear regression for each SNP, based on phenotypes or polygenic EBVs, respectively, with an average distance of 9–10 cM between them. Chromosome four had the largest frequency of selected SNPS. Average correlations between GEBVs and TBVs were about 0.82 and 0.73 for the TRAINING generations when phenotypes or polygenic EBVs were considered as dependent variable, whereas they tend to decrease to 0.66 and 0.54 for the PREDICTION generations. The pre-selection of SNPs using the phenotypes as dependent variable together with a BLUP estimation of marker genotype effects using a variance contribution of each marker equal to σ2a/nsnps resulted in a remarkable accuracy of GEBV estimation (0.77) in the PREDICTION generations.
Collapse
|
43
|
Kim ES, Berger PJ, Kirkpatrick BW. Genome-wide scan for bovine twinning rate QTL using linkage disequilibrium. Anim Genet 2009; 40:300-7. [PMID: 19220232 DOI: 10.1111/j.1365-2052.2008.01832.x] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Twinning is a complex trait with negative impacts on health and reproduction, which cause economic loss in dairy production. Several twinning rate quantitative trait loci (QTL) have been detected in previous studies, but confidence intervals for QTL location are broad and many QTL are unreplicated. To identify genomic regions or genes associated with twinning rate, QTL analysis based on linkage combined with linkage disequilibrium (LLD) and individual marker associations was conducted across the genome using high-throughput single nucleotide polymorphism (SNP) genotypes. A total of 9919 SNP markers were genotyped with 200 sires and sons in 19 half-sib North American Holstein dairy cattle families. After SNPs were genotyped, informative markers were selected for genome-wide association tests and QTL searches. Evidence for twinning rate QTL was found throughout the genome. Thirteen markers significantly associated with twinning rate were detected on chromosomes 2, 5 and 14 (P < 2.3 x 10(-5)). Twenty-six regions on fourteen chromosomes were identified by LLD analysis at P < 0.0007. Seven previously reported ovulation or twinning rate QTL were supported by results of single marker association or LLD analyses. Single marker association analysis and LLD mapping were complementary tools for the identification of putative QTL in this genome scan.
Collapse
Affiliation(s)
- E-S Kim
- Department of Animal Sciences, University of Wisconsin, Madison, WI 53706, USA
| | | | | |
Collapse
|
44
|
Calus MPL, Meuwissen THE, Windig JJ, Knol EF, Schrooten C, Vereijken ALJ, Veerkamp RF. Effects of the number of markers per haplotype and clustering of haplotypes on the accuracy of QTL mapping and prediction of genomic breeding values. Genet Sel Evol 2009; 41:11. [PMID: 19284677 PMCID: PMC3225874 DOI: 10.1186/1297-9686-41-11] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2008] [Accepted: 01/15/2009] [Indexed: 11/26/2022] Open
Abstract
The aim of this paper was to compare the effect of haplotype definition on the precision of QTL-mapping and on the accuracy of predicted genomic breeding values. In a multiple QTL model using identity-by-descent (IBD) probabilities between haplotypes, various haplotype definitions were tested i.e. including 2, 6, 12 or 20 marker alleles and clustering base haplotypes related with an IBD probability of > 0.55, 0.75 or 0.95. Simulated data contained 1100 animals with known genotypes and phenotypes and 1000 animals with known genotypes and unknown phenotypes. Genomes comprising 3 Morgan were simulated and contained 74 polymorphic QTL and 383 polymorphic SNP markers with an average r2 value of 0.14 between adjacent markers. The total number of haplotypes decreased up to 50% when the window size was increased from two to 20 markers and decreased by at least 50% when haplotypes related with an IBD probability of > 0.55 instead of > 0.95 were clustered. An intermediate window size led to more precise QTL mapping. Window size and clustering had a limited effect on the accuracy of predicted total breeding values, ranging from 0.79 to 0.81. Our conclusion is that different optimal window sizes should be used in QTL-mapping versus genome-wide breeding value prediction.
Collapse
Affiliation(s)
- Mario P L Calus
- Animal Breeding and Genomics Centre, Animal Sciences Group, Wageningen University and Research Centre, Lelystad, The Netherlands.
| | | | | | | | | | | | | |
Collapse
|
45
|
Daetwyler HD, Schenkel FS, Sargolzaei M, Robinson JAB. A genome scan to detect quantitative trait loci for economically important traits in Holstein cattle using two methods and a dense single nucleotide polymorphism map. J Dairy Sci 2008; 91:3225-36. [PMID: 18650300 DOI: 10.3168/jds.2007-0333] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Genome scans for detection of bovine quantitative trait loci (QTL) were performed via variance component linkage analysis and linkage disequilibrium single-locus regression (LDRM). Four hundred eighty-four Holstein sires, of which 427 were from 10 grandsire families, were genotyped for 9,919 single nucleotide polymorphisms (SNP) using the Affymetrix MegAllele GeneChip Bovine Mapping 10K SNP array. A hybrid of the grand-daughter and selective genotyping designs was applied. Four thousand eight hundred fifty-six of the 9,919 SNP were located to chromosomes in base-pairs and formed the basis for the analyses. The mean polymorphism information content of the SNP was 0.25. The SNP centimorgan position was interpolated from their base-pair position using a microsatellite framework map. Estimated breeding values were used as observations, and the following traits were analyzed: 305-d lactation milk, fat, and protein yield; somatic cell score; herd life; interval of calving to first service; and age at first service. The variance component linkage analysis detected 102 potential QTL, whereas LDRM analysis found 144 significant SNP associations after accounting for a 5% false discovery rate. Twenty potential QTL and 49 significant SNP associations were in close proximity to QTL cited in the literature. Both methods found significant regions on Bos taurus autosome (BTA) 3, 5, and 16 for milk yield; BTA 14 and 19 for fat yield; BTA 1, 3, 16, and 28 for protein yield; BTA 2 and 13 for calving to first service; and BTA 14 for age at first service. Both approaches were effective in detecting potential QTL with a dense SNP map. The LDRM was well suited for a first genome scan due to its approximately 8 times lower computational demands. Further fine mapping should be applied on the chromosomal regions of interest found in this study.
Collapse
Affiliation(s)
- H D Daetwyler
- Centre for Genetic Improvement of Livestock, Department of Animal and Poultry Science, University of Guelph, Guelph, Ontario, Canada.
| | | | | | | |
Collapse
|
46
|
Kolbehdari D, Wang Z, Grant JR, Murdoch B, Prasad A, Xiu Z, Marques E, Stothard P, Moore SS. A whole-genome scan to map quantitative trait loci for conformation and functional traits in Canadian Holstein bulls. J Dairy Sci 2008; 91:2844-56. [PMID: 18565942 DOI: 10.3168/jds.2007-0585] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Genetic improvement of livestock populations can be achieved through detection and mapping of genetic markers linked to quantitative trait loci (QTL). With the completion of the bovine genome sequence assembly, single nucleotide polymorphism (SNP) assays spanning the whole bovine genome and research work on large-scale identification, validation, and analysis of genotypic variation in cattle has become possible. A total of 462 Canadian Holstein Bulls were used to test the association between SNP and QTL. Single locus linkage disequilibrium regression model was implemented to perform a whole genome scan to identify and map QTL affecting conformation and functional traits. One thousand five hundred thirty-six SNP markers from introns and exons of potential QTL regions for economically important traits across the bovine genome were selected for association analysis. A total of 45 and 151 SNP were found to be associated with 17 conformation and functional traits at a genome- and chromosome-wise significance level, respectively. Among the 196 significant SNP, 169 of them are newly detected in this study, whereas 27 of them have been reported in previous literature and 161 of these were located in genes and are worth further investigating to potentially identify the causative mutations underlying the QTL. The single locus linkage disequilibrium regression method using SNP marker genotypes has proven to be a successful methodology for detecting and mapping QTL in dairy cattle populations.
Collapse
Affiliation(s)
- D Kolbehdari
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, Alberta, Canada T6G 2P5
| | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Accuracy of marker-assisted selection with single markers and marker haplotypes in cattle. Genet Res (Camb) 2008; 89:215-20. [PMID: 18208627 DOI: 10.1017/s0016672307008865] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
A key question for the implementation of marker-assisted selection (MAS) using markers in linkage disequilibrium with quantitative trait loci (QTLs) is how many markers surrounding each QTL should be used to ensure the marker or marker haplotypes are in sufficient linkage disequilibrium (LD) with the QTL. In this paper we compare the accuracy of MAS using either single markers or marker haplotypes in an Angus cattle data set consisting of 9323 genome-wide single nucleotide polymorphisms (SNPs) genotyped in 379 Angus cattle. The extent of LD in the data set was such that the average marker-marker r2 was 0.2 at 200 kb. The accuracy of MAS increased as the number of markers in the haplotype surrounding the QTL increased, although only when the number of markers in the haplotype was 4 or greater did the accuracy exceed that achieved when the SNP in the highest LD with the QTL was used. A large number of phenotypic records (>1000) were required to accurately estimate the effects of the haplotypes.
Collapse
|
48
|
Abstract
Genomic selection uses total breeding values for juvenile animals, predicted from a large number of estimated marker haplotype effects across the whole genome. In this study the accuracy of predicting breeding values is compared for four different models including a large number of markers, at different marker densities for traits with heritabilities of 50 and 10%. The models estimated the effect of (1) each single-marker allele [single-nucleotide polymorphism (SNP)1], (2) haplotypes constructed from two adjacent marker alleles (SNP2), and (3) haplotypes constructed from 2 or 10 markers, including the covariance between haplotypes by combining linkage disequilibrium and linkage analysis (HAP_IBD2 and HAP_IBD10). Between 119 and 2343 polymorphic SNPs were simulated on a 3-M genome. For the trait with a heritability of 10%, the differences between models were small and none of them yielded the highest accuracies across all marker densities. For the trait with a heritability of 50%, the HAP_IBD10 model yielded the highest accuracies of estimated total breeding values for juvenile and phenotyped animals at all marker densities. It was concluded that genomic selection is considerably more accurate than traditional selection, especially for a low-heritability trait.
Collapse
|
49
|
Abstract
Genomic selection is a form of marker-assisted selection in which genetic markers covering the whole genome are used so that all quantitative trait loci (QTL) are in linkage disequilibrium with at least one marker. This approach has become feasible thanks to the large number of single nucleotide polymorphisms (SNP) discovered by genome sequencing and new methods to efficiently genotype large number of SNP. Simulation results and limited experimental results suggest that breeding values can be predicted with high accuracy using genetic markers alone but more validation is required especially in samples of the population different from that in which the effect of the markers was estimated. The ideal method to estimate the breeding value from genomic data is to calculate the conditional mean of the breeding value given the genotype of the animal at each QTL. This conditional mean can only be calculated by using a prior distribution of QTL effects so this should be part of the research carried out to implement genomic selection. In practice, this method of estimating breeding values is approximated by using the marker genotypes instead of the QTL genotypes but the ideal method is likely to be approached more closely as more sequence and SNP data is obtained. Implementation of genomic selection is likely to have major implications for genetic evaluation systems and for genetic improvement programmes generally and these are discussed.
Collapse
Affiliation(s)
- M E Goddard
- University of Melbourne, Attwood, Australia.
| | | |
Collapse
|
50
|
Application of genomic technologies to the improvement of meat quality of farm animals. Meat Sci 2007; 77:36-45. [DOI: 10.1016/j.meatsci.2007.03.026] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2007] [Revised: 03/27/2007] [Accepted: 03/27/2007] [Indexed: 11/21/2022]
|