1
|
Stegemiller MR, Redden RR, Notter DR, Taylor T, Taylor JB, Cockett NE, Heaton MP, Kalbfleisch TS, Murdoch BM. Using whole genome sequence to compare variant callers and breed differences of US sheep. Front Genet 2023; 13:1060882. [PMID: 36685812 PMCID: PMC9846548 DOI: 10.3389/fgene.2022.1060882] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 11/22/2022] [Indexed: 01/06/2023] Open
Abstract
As whole genome sequence (WGS) data sets have become abundant and widely available, so has the need for variant detection and scoring. The aim of this study was to compare the accuracy of commonly used variant calling programs, Freebayes and GATK HaplotypeCaller (GATK-HC), and to use U.S. sheep WGS data sets to identify novel breed-associated SNPs. Sequence data from 145 sheep consisting of 14 U.S. breeds were filtered and biallelic single nucleotide polymorphisms (SNPs) were retained for genotyping analyses. Genotypes from both programs were compared to each other and to genotypes from bead arrays. The SNPs from WGS were compared to the bead array data with breed heterozygosity, principal component analysis and identifying breed associated SNPs to analyze genetic diversity. The average sequence read depth was 2.78 reads greater with 6.11% more SNPs being identified in Freebayes compared to GATK-HC. The genotype concordance of the variant callers to bead array data was 96.0% and 95.5% for Freebayes and GATK-HC, respectively. Genotyping with WGS identified 10.5 million SNPs from all 145 sheep. This resulted in an 8% increase in measured heterozygosity and greater breed separation in the principal component analysis compared to the bead array analysis. There were 1,849 SNPs identified in only the Romanov sheep where all 10 rams were homozygous for one allele and the remaining 135 sheep from 13 breeds were homozygous for the opposite allele. Both variant calling programs had greater than 95% concordance of SNPs with bead array data, and either was suitably accurate for ovine WGS data sets. The use of WGS SNPs improved the resolution of PCA analysis and was critical for identifying Romanov breed-associated SNPs. Subsets of such SNPs could be used to estimate germplasm composition in animals without pedigree information.
Collapse
Affiliation(s)
- Morgan R. Stegemiller
- Department of Animal, Veterinary and Food Sciences, University of Idaho, Moscow, ID, United States
| | - Reid R. Redden
- Texas A&M AgriLife Research and Extension, Texas A&M University, San Angelo, TX, United States
| | - David R. Notter
- School of Animal Sciences, Virginia Tech, Blacksburg, VA, United States
| | - Todd Taylor
- Department of Animal and Dairy Sciences, University of Wisconsin-Madison, Madison, WI, United States
| | - J. Bret Taylor
- United States Sheep Experiment Station, United States Department of Agriculture, Agricultural Research Service, Dubois, ID, United States
| | - Noelle E. Cockett
- Department of Animal, Dairy and Veterinary Sciences, Utah State University, Logan, UT, United States
| | - Michael P. Heaton
- USDA, ARS, U.S. Meat Animal Research Center, Clay Center, NE, United States
| | - Theodore S. Kalbfleisch
- Gluck Equine Research Center, College of Agriculture, Food, and Environment, University of Kentucky, Lexington, KY, United States,*Correspondence: Theodore S. Kalbfleisch, ; Brenda M. Murdoch,
| | - Brenda M. Murdoch
- Department of Animal, Veterinary and Food Sciences, University of Idaho, Moscow, ID, United States,*Correspondence: Theodore S. Kalbfleisch, ; Brenda M. Murdoch,
| |
Collapse
|
2
|
Rodriguez Neira JD, Peripolli E, de Negreiros MPM, Espigolan R, López-Correa R, Aguilar I, Lobo RB, Baldi F. Prediction ability for growth and maternal traits using SNP arrays based on different marker densities in Nellore cattle using the ssGBLUP. J Appl Genet 2022; 63:389-400. [PMID: 35133621 DOI: 10.1007/s13353-022-00685-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Revised: 01/25/2022] [Accepted: 02/02/2022] [Indexed: 11/25/2022]
Abstract
This study aimed to investigate the prediction ability for growth and maternal traits using different low-density customized SNP arrays selected by informativeness and distribution of markers across the genome employing single-step genomic BLUP (ssGBLUP). Phenotypic records for adjusted weight at 210 and 450 days of age were utilized. A total of 945 animals were genotyped with high-density chip, and 267 individuals born after 2008 were selected as validation population. We evaluated 11 scenarios using five customized density arrays (40 k, 20 k, 10 k, 5 k and 2 k) and the HD array was used as desirable scenario. The GEBV predictions and BIF (Beef Improvement Federation) accuracy were obtained with BLUPF90 family programs. Linear regression was used to evaluate the prediction ability, inflation, and bias of GEBV of each customized array. An overestimation of partial GEBVs in contrast with complete GEBVs and increase of BIF accuracy with the density arrays diminished were observed. For all traits, the prediction ability was higher as the array density increased and it was similar with customized arrays higher than 10 k SNPs. Level of inflation was lower as the density array increased of and was higher for MW210 effect. The bias was susceptible to overestimation of GEBVs when the density customized arrays decreased. These results revealed that the BIF accuracy is sensible to overestimation using low-density customized arrays while the prediction ability with least 10,000 informative SNPs obtained from the Illumina BovineHD BeadChip shows accurate and less biased predictions. Low-density customized arrays under ssGBLUP method could be feasible and cost-effective in genomic selection.
Collapse
Affiliation(s)
- Juan Diego Rodriguez Neira
- Departamento de Zootecnia, Faculdade de Ciências Agrarias e Veterinárias, Universidade Estadual Paulista (Unesp), Jaboticabal, 14884-900, Brazil.
| | - Elisa Peripolli
- Departamento de Zootecnia, Faculdade de Ciências Agrarias e Veterinárias, Universidade Estadual Paulista (Unesp), Jaboticabal, 14884-900, Brazil
| | - Maria Paula Marinho de Negreiros
- Departamento de Medicina Veterinária, Faculdade de Zootecnia e Engenharia de Alimentos, Universidade de São Paulo (Usp), Pirassununga, 13535-900, Brazil
| | - Rafael Espigolan
- Departamento de Medicina Veterinária, Faculdade de Zootecnia e Engenharia de Alimentos, Universidade de São Paulo (Usp), Pirassununga, 13535-900, Brazil
| | - Rodrigo López-Correa
- Departamento de Genética y Mejoramiento Animal, Facultad de Veterinaria, Universidad de La República, Montevideo, Uruguay
| | - Ignacio Aguilar
- Instituto Nacional de Investigación Agropecuaria (INIA), Montevideo, Uruguay
| | - Raysildo B Lobo
- Associação Nacional de Criadores e Pesquisadores (ANCP), Ribeirão Preto, Brazil
| | - Fernando Baldi
- Departamento de Zootecnia, Faculdade de Ciências Agrarias e Veterinárias, Universidade Estadual Paulista (Unesp), Jaboticabal, 14884-900, Brazil
| |
Collapse
|