Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Lopez BIM, An N, Srikanth K, Lee S, Oh JD, Shin DH, Park W, Chai HH, Park JE, Lim D. Genomic Prediction Based on SNP Functional Annotation Using Imputed Whole-Genome Sequence Data in Korean Hanwoo Cattle. Front Genet 2021;11:603822. [PMID: 33552124 PMCID: PMC7859490 DOI: 10.3389/fgene.2020.603822] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 09/08/2020] [Accepted: 11/09/2020] [Indexed: 12/12/2022] Open

Cited by Other Article(s)

Lee J, Hong I, Lee C, Kim D, Kim S, Lee Y. SNPs in microRNA seed region and impact of miR-375 in concurrent regulation of multiple lipid accumulation-related genes. Sci Rep 2024;14:10924. [PMID: 38740866 DOI: 10.1038/s41598-024-61673-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/18/2024] [Accepted: 05/08/2024] [Indexed: 05/16/2024] Open

Liu Y, Zhang Y, Zhou F, Yao Z, Zhan Y, Fan Z, Meng X, Zhang Z, Liu L, Yang J, Wu Z, Cai G, Zheng E. Increased Accuracy of Genomic Prediction Using Preselected SNPs from GWAS with Imputed Whole-Genome Sequence Data in Pigs. Animals (Basel) 2023;13:3871. [PMID: 38136908 PMCID: PMC10740755 DOI: 10.3390/ani13243871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/02/2023] [Revised: 12/13/2023] [Accepted: 12/14/2023] [Indexed: 12/24/2023] Open

Affiliation(s)

Yiyi Liu, Yuling Zhang, Fuchen Zhou, Zekai Yao, Yuexin Zhan, Zhenfei Fan, Xianglun Meng, Zebin Zhang, Langqing Liu, Jie Yang, Zhenfang Wu, Gengyuan Cai, Enqin Zheng
National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China
Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China

Zhenfang Wu, Gengyuan Cai (additional affiliation)
Guangdong Zhongxin Breeding Technology Co., Ltd., Guangzhou 510642, China

Haque MA, Iqbal A, Bae H, Lee SE, Park S, Lee YM, Kim JJ. Assessment of genomic breeding values and their accuracies for carcass traits in Jeju Black cattle using whole-genome SNP chip panels. J Anim Breed Genet 2023;140:519-531. [PMID: 37102238 DOI: 10.1111/jbg.12776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/05/2023] [Revised: 04/04/2023] [Accepted: 04/06/2023] [Indexed: 04/28/2023]

Abstract

The objective of the present study was to evaluate the breeding values and the accuracy of genomic estimated breeding values (GEBVs) for carcass traits in Jeju Black cattle (JBC), using Hanwoo steers and JBC as reference populations under a single-trait animal model. The data comprised genotype and phenotype information on 19,154 Hanwoo steers and 1097 JBC, which acted as the reference population. The test population consisted of 418 genotyped JBC individuals with no phenotypic records for those carcass traits. To estimate the accuracy of GEBV, we divided the entire population into three groups. In the first group, Hanwoo and JBC with both genotypic and phenotypic records form the reference (training) population, and JBC lacking phenotypic information form the test (validation) population. The second group consists of JBC without phenotypes as the test population and Hanwoo with phenotypic and genotypic data as the reference population. The third group includes only JBC: animals with both genotypic and phenotypic data form the reference population, and animals without phenotypic data form the test population. The single-trait animal model was used in all three groups. Heritabilities estimated in the reference populations for carcass weight (CWT), eye muscle area (EMA), backfat thickness (BF), and marbling score (MS) were 0.30, 0.26, 0.26, and 0.34 for Hanwoo steers and 0.42, 0.27, 0.26, and 0.48 for JBC. The average accuracy for carcass traits in Group 1 was 0.80 for the Hanwoo and JBC reference population compared with 0.73 for the JBC test population. In Group 2, the average accuracy was 0.80 for the Hanwoo reference population but only 0.56 for the JBC test population. In Group 3, which excluded the Hanwoo reference population, the average accuracy for the JBC reference and test populations was 0.68 and 0.50, respectively. Groups 1 and 2 used Hanwoo as a reference population, which led to a better average accuracy, whereas Group 3 used only the JBC reference and test populations, which led to a lower average accuracy. This may be because Group 3 had a smaller reference population than the preceding groups and because the genetic makeup of the Hanwoo and JBC breeds differs. The GEBV accuracy for MS was higher than that of the other traits across all three analysis groups, followed by CWT, EMA, and BF, which may be partially explained by the higher heritability of MS. This study suggests that achieving higher accuracy requires a large, breed-specific reference population. Therefore, to increase the accuracy of GEBV prediction and the genetic benefit from genomic selection in JBC, large breed-specific reference populations are required.

Jang S, Ros-Freixedes R, Hickey JM, Chen CY, Holl J, Herring WO, Misztal I, Lourenco D. Using pre-selected variants from large-scale whole-genome sequence data for single-step genomic predictions in pigs. Genet Sel Evol 2023;55:55. [PMID: 37495982 PMCID: PMC10373252 DOI: 10.1186/s12711-023-00831-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 10/19/2022] [Accepted: 07/18/2023] [Indexed: 07/28/2023] Open

Abstract

BACKGROUND

Whole-genome sequence (WGS) data harbor causative variants that may not be present in standard single nucleotide polymorphism (SNP) chip data. The objective of this study was to investigate the impact of using preselected variants from WGS for single-step genomic predictions in maternal and terminal pig lines with up to 1.8k sequenced and 104k sequence imputed animals per line.

METHODS

Two maternal and four terminal lines were investigated for eight and seven traits, respectively. The number of sequenced animals ranged from 1365 to 1491 for the maternal lines and 381 to 1865 for the terminal lines. Imputation to sequence occurred within each line for 66k to 76k animals for the maternal lines and 29k to 104k animals for the terminal lines. Two preselected SNP sets were generated based on a genome-wide association study (GWAS). Top40k included the SNPs with the lowest p-value in each of the 40k genomic windows, and ChipPlusSign included significant variants integrated into the porcine SNP chip used for routine genotyping. We compared the performance of single-step genomic predictions between using preselected SNP sets assuming equal or different variances and the standard porcine SNP chip.
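The Top40k rule described above (one SNP per genomic window, chosen by lowest GWAS p-value) can be sketched as follows. This is only an illustration: the abstract does not say how the windows were defined, so fixed-width windows in base pairs, the record layout, and the function name `top_snp_per_window` are all assumptions.

```python
from typing import Dict, List, Tuple

# Hypothetical record layout: (snp_id, chromosome, position_bp, gwas_p_value).
Snp = Tuple[str, str, int, float]


def top_snp_per_window(snps: List[Snp], window_bp: int) -> List[str]:
    """Keep the SNP with the lowest p-value in each fixed-width window.

    Fixed-width windows are an assumption; the source only states that one
    SNP was taken from each of 40k genomic windows.
    """
    best: Dict[Tuple[str, int], Snp] = {}
    for snp in snps:
        _, chrom, pos, p_value = snp
        key = (chrom, pos // window_bp)  # window index within the chromosome
        if key not in best or p_value < best[key][3]:
            best[key] = snp
    # Return the selected SNP IDs in genomic order.
    ordered = sorted(best.values(), key=lambda s: (s[1], s[2]))
    return [s[0] for s in ordered]


# Toy data, invented for illustration only.
snps = [
    ("rs1", "1", 1_000, 0.20),
    ("rs2", "1", 40_000, 0.001),
    ("rs3", "1", 60_000, 0.03),
    ("rs4", "1", 90_000, 0.50),
    ("rs5", "2", 5_000, 0.04),
]
selected = top_snp_per_window(snps, window_bp=50_000)
print(selected)
```

With 50 kb windows, the toy data fall into three windows, so three SNPs are kept.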

RESULTS

In the maternal lines, ChipPlusSign and Top40k showed an average increase in accuracy of 0.6 and 4.9%, respectively, compared to the regular porcine SNP chip. The greatest increase was obtained with Top40k, particularly for fertility traits, for which the initial accuracy based on the standard SNP chip was low. However, in the terminal lines, Top40k resulted in an average loss of accuracy of 1%. ChipPlusSign provided a positive, although small, gain in accuracy (0.9%). Assigning different variances for the SNPs slightly improved accuracies when using variances obtained from BayesR. However, increases were inconsistent across the lines and traits.

CONCLUSIONS

The benefit of using sequence data depends on the line, the size of the genotyped population, and how the WGS variants are preselected. When WGS data are available on hundreds of thousands of animals, using sequence data presents an advantage but this remains limited in pigs.

Jang S, Tsuruta S, Leite NG, Misztal I, Lourenco D. Dimensionality of genomic information and its impact on genome-wide associations and variant selection for genomic prediction: a simulation study. Genet Sel Evol 2023;55:49. [PMID: 37460964 DOI: 10.1186/s12711-023-00823-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 04/12/2022] [Accepted: 07/03/2023] [Indexed: 07/20/2023] Open

Abstract

BACKGROUND

Identifying true positive variants in genome-wide associations (GWA) depends on several factors, including the number of genotyped individuals. The limited dimensionality of genomic information may give insights into the optimal number of individuals to be used in GWA. This study investigated different discovery set sizes based on the number of largest eigenvalues explaining a certain proportion of variance in the genomic relationship matrix (G). In addition, we investigated the impact on the prediction accuracy by adding variants, which were selected based on different set sizes, to the regular single nucleotide polymorphism (SNP) chips used for genomic prediction.

METHODS

We simulated sequence data that included 500k SNPs with 200 or 2000 quantitative trait nucleotides (QTN). A regular 50k panel included one in every ten simulated SNPs. Effective population size (Ne) was set to 20 or 200. GWA were performed using a number of genotyped animals equivalent to the number of largest eigenvalues of G (EIG) explaining 50, 60, 70, 80, 90, 95, 98, and 99% of the variance. In addition, the largest discovery set consisted of 30k genotyped animals. Limited or extensive phenotypic information was mimicked by changing the trait heritability. Significant and large-effect size SNPs were added to the 50k panel and used for single-step genomic best linear unbiased prediction (ssGBLUP).
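The EIG quantities used above (for example EIG98, the number of largest eigenvalues of G explaining 98% of the variance) reduce to a cumulative sum over sorted eigenvalues. The sketch below assumes the eigenvalues of G have already been computed elsewhere; the function name and the toy values are hypothetical and not taken from the study.

```python
from typing import Sequence


def n_largest_eigenvalues(eigenvalues: Sequence[float], proportion: float) -> int:
    """Count how many of the largest eigenvalues are needed to explain
    at least `proportion` of the total variance (the eigenvalue sum)."""
    if not 0.0 < proportion <= 1.0:
        raise ValueError("proportion must be in (0, 1]")
    ordered = sorted(eigenvalues, reverse=True)
    target = proportion * sum(ordered)
    running = 0.0
    for count, value in enumerate(ordered, start=1):
        running += value
        if running >= target:
            return count
    return len(ordered)


# Toy eigenvalues, invented for illustration; a real G has one per genotyped animal.
toy_eigenvalues = [6.0, 2.5, 1.0, 0.5]
eig50 = n_largest_eigenvalues(toy_eigenvalues, 0.50)
eig90 = n_largest_eigenvalues(toy_eigenvalues, 0.90)
eig98 = n_largest_eigenvalues(toy_eigenvalues, 0.98)
print(eig50, eig90, eig98)
```

In the study design, a count obtained this way sets the number of genotyped animals used in the GWA discovery set.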

RESULTS

Using a number of genotyped animals corresponding to at least EIG98 allowed the identification of QTN with the largest effect sizes when Ne was large. Populations with smaller Ne required more than EIG98. Furthermore, including genotyped animals with a higher reliability (i.e., a higher trait heritability) improved the identification of the most informative QTN. Prediction accuracy was highest when the significant or the large-effect SNPs representing twice the number of simulated QTN were added to the 50k panel.

CONCLUSIONS

Accurately identifying causative variants from sequence data depends on the effective population size and, therefore, on the dimensionality of genomic information. This dimensionality can help identify the most suitable sample size for GWA and could be considered for variant selection, especially when resources are restricted. Even when variants are accurately identified, their inclusion in prediction models has limited benefits.

Jang S, Ros-Freixedes R, Hickey JM, Chen CY, Herring WO, Holl J, Misztal I, Lourenco D. Multi-line ssGBLUP evaluation using preselected markers from whole-genome sequence data in pigs. Front Genet 2023;14:1163626. [PMID: 37252662 PMCID: PMC10213539 DOI: 10.3389/fgene.2023.1163626] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 02/11/2023] [Accepted: 05/03/2023] [Indexed: 05/31/2023] Open

Abstract

Genomic evaluations in pigs could benefit from using multi-line data along with whole-genome sequencing (WGS) if the data are large enough to represent the variability across populations. The objective of this study was to investigate strategies to combine large-scale data from different terminal pig lines in a multi-line genomic evaluation (MLE) through single-step GBLUP (ssGBLUP) models while including variants preselected from whole-genome sequence (WGS) data. We investigated single-line and multi-line evaluations for five traits recorded in three terminal lines. The number of sequenced animals in each line ranged from 731 to 1,865, with 60k to 104k imputed to WGS. Unknown parent groups (UPG) and metafounders (MF) were explored to account for genetic differences among the lines and improve the compatibility between pedigree and genomic relationships in the MLE. Sequence variants were preselected based on multi-line genome-wide association studies (GWAS) or linkage disequilibrium (LD) pruning. These preselected variant sets were used for ssGBLUP predictions without and with weights from BayesR, and the performances were compared to that of a commercial porcine single-nucleotide polymorphisms (SNP) chip. Using UPG and MF in MLE showed small to no gain in prediction accuracy (up to 0.02), depending on the lines and traits, compared to the single-line genomic evaluation (SLE). Likewise, adding selected variants from the GWAS to the commercial SNP chip resulted in a maximum increase of 0.02 in the prediction accuracy, only for average daily feed intake in the most numerous lines. In addition, no benefits were observed when using preselected sequence variants in multi-line genomic predictions. Weights from BayesR did not help improve the performance of ssGBLUP. This study revealed limited benefits of using preselected whole-genome sequence variants for multi-line genomic predictions, even when tens of thousands of animals had imputed sequence data. 
Correctly accounting for line differences with UPG or MF in MLE is essential to obtain predictions similar to SLE; however, the only observed benefit of an MLE is to have comparable predictions across lines. Further investigation into the amount of data and novel methods to preselect whole-genome causative variants in combined populations would be of significant interest.

Russell CA, Kuehn LA, Snelling WM, Kachman SD, Spangler ML. Variance component estimates for growth traits in beef cattle using selected variants from imputed low-pass sequence data. J Anim Sci 2023;101:skad274. [PMID: 37585275 PMCID: PMC10464510 DOI: 10.1093/jas/skad274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/19/2023] [Accepted: 08/11/2023] [Indexed: 08/18/2023] Open

Abstract

A beef cattle population (n = 2,343) was used to assess the impact of variants identified from the imputed low-pass sequence (LPS) on the estimation of variance components and genetic parameters of birth weight (BWT) and post-weaning gain (PWG). Variants were selected based on functional impact and were partitioned into four groups (low, modifier, moderate, high) based on predicted functional impact and re-partitioned based on the consequence of mutation, such as missense and untranslated region variants, into six groups (G1-G6). Each subset was used to construct a genomic relationship matrix (GRM) for univariate animal models. Multiple analyses were conducted to compare the proportion of additive genetic variation explained by the different subsets individually and collectively, and these estimates were benchmarked against all LPS variants in a single GRM and array (e.g., GeneSeek Genomic Profiler 100K) genotypes. When all variants were included in a single GRM, heritability estimates for BWT and PWG were 0.43 ± 0.05 and 0.38 ± 0.05, respectively. Heritability estimates for BWT ranged from 0.10 to 0.42 dependent on which variant subsets were included. Similarly, estimates for PWG ranged from 0.05 to 0.38. Results showed that variants in the subsets modifier and G1 (untranslated region) yielded the highest heritability estimates and were similar to the inclusion of all variants, while estimates from GRM containing only variants in the categories High, G4 (non-coding transcript exon), and G6 (start and stop loss/gain) were the lowest. All variants combined provided similar heritability estimates to chip genotypes and provided minimal to no additional information when combined with chip data. 
This suggests that the chip single nucleotide polymorphisms and the variants from LPS predicted to be less consequential are in relatively high linkage disequilibrium with the underlying causal variants as a whole and sufficiently spread throughout the genome to capture larger proportions of additive genetic variation.

Lopes FB, Baldi F, Brunes LC, Oliveira E Costa MF, da Costa Eifert E, Rosa GJM, Lobo RB, Magnabosco CU. Genomic prediction for meat and carcass traits in Nellore cattle using a Markov blanket algorithm. J Anim Breed Genet 2023;140:1-12. [PMID: 36239216 DOI: 10.1111/jbg.12740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/31/2022] [Accepted: 09/22/2022] [Indexed: 12/13/2022]

Sánchez-Roncancio C, García B, Gallardo-Hidalgo J, Yáñez JM. GWAS on Imputed Whole-Genome Sequence Variants Reveal Genes Associated with Resistance to Piscirickettsia salmonis in Rainbow Trout (Oncorhynchus mykiss). Genes (Basel) 2022;14:114. [PMID: 36672855 PMCID: PMC9859203 DOI: 10.3390/genes14010114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/14/2022] [Revised: 12/27/2022] [Accepted: 12/28/2022] [Indexed: 12/31/2022] Open

Brzáková M, Bauer J, Steyn Y, Šplíchal J, Fulínová D. The prediction accuracies of linear-type traits in Czech Holstein cattle when using ssGBLUP or wssGBLUP. J Anim Sci 2022;100:skac369. [PMID: 36334266 PMCID: PMC9746800 DOI: 10.1093/jas/skac369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/09/2022] [Accepted: 11/04/2022] [Indexed: 11/07/2022] Open

Ros-Freixedes R, Johnsson M, Whalen A, Chen CY, Valente BD, Herring WO, Gorjanc G, Hickey JM. Genomic prediction with whole-genome sequence data in intensely selected pig lines. Genet Sel Evol 2022;54:65. [PMID: 36153511 PMCID: PMC9509613 DOI: 10.1186/s12711-022-00756-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Received: 01/28/2022] [Accepted: 09/05/2022] [Indexed: 12/03/2022]

Abstract

Background

Early simulations indicated that whole-genome sequence data (WGS) could improve the accuracy of genomic predictions within and across breeds. However, empirical results have been ambiguous so far. Large datasets that capture most of the genomic diversity in a population must be assembled so that allele substitution effects are estimated with high accuracy. The objectives of this study were to use a large pig dataset from seven intensely selected lines to assess the benefits of using WGS for genomic prediction compared to using commercial marker arrays and to identify scenarios in which WGS provides the largest advantage.

Methods

We sequenced 6931 individuals from seven commercial pig lines with different numerical sizes. Genotypes of 32.8 million variants were imputed for 396,100 individuals (17,224 to 104,661 per line). We used BayesR to perform genomic prediction for eight complex traits. Genomic predictions were performed using either data from a standard marker array or variants preselected from WGS based on association tests.

Results

The accuracies of genomic predictions based on preselected WGS variants were not robust across traits and lines and the improvements in prediction accuracy that we achieved so far with WGS compared to standard marker arrays were generally small. The most favourable results for WGS were obtained when the largest training sets were available and standard marker arrays were augmented with preselected variants with statistically significant associations to the trait. With this method and training sets of around 80k individuals, the accuracy of within-line genomic predictions was on average improved by 0.025. With multi-line training sets, improvements of 0.04 compared to marker arrays could be expected.

Conclusions

Our results showed that WGS has limited potential to improve the accuracy of genomic predictions compared to marker arrays in intensely selected pig lines. Thus, although we expect that larger improvements in accuracy from the use of WGS are possible with a combination of larger training sets and optimised pipelines for generating and analysing such datasets, the use of WGS in the current implementations of genomic prediction should be carefully evaluated against the cost of large-scale WGS data on a case-by-case basis.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12711-022-00756-0.

Rare and population-specific functional variation across pig lines. Genet Sel Evol 2022;54:39. [PMID: 35659233 PMCID: PMC9164375 DOI: 10.1186/s12711-022-00732-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 01/31/2022] [Accepted: 05/17/2022] [Indexed: 01/09/2023] Open

Abstract

BACKGROUND

It is expected that functional, mainly missense and loss-of-function (LOF), and regulatory variants are responsible for most phenotypic differences between breeds and genetic lines of livestock species that have undergone diverse selection histories. However, there is still limited knowledge about the existing missense and LOF variation in commercial livestock populations, in particular regarding population-specific variation and how it can affect applications such as across-breed genomic prediction.

METHODS

We re-sequenced the whole genome of 7848 individuals from nine commercial pig lines (average sequencing coverage: 4.1×) and imputed whole-genome genotypes for 440,610 pedigree-related individuals. The called variants were categorized according to predicted functional annotation (from LOF to intergenic) and prevalence level (number of lines in which the variant segregated; from private to widespread). Variants in each category were examined in terms of their distribution along the genome, alternative allele frequency, per-site Wright's fixation index (F_ST), individual load, and association to production traits.
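The prevalence classification described above (number of lines in which a variant segregates, from private to widespread) can be illustrated with a small sketch. The abstract defines only the two end categories (private: called in one line; widespread: called in all lines); the label "intermediate", the function name, and the toy data are assumptions made for illustration.

```python
from typing import Dict, Set


def prevalence_category(lines_with_variant: Set[str], n_lines: int) -> str:
    """Classify a variant by the number of lines in which it segregates."""
    n = len(lines_with_variant)
    if n == 0 or n > n_lines:
        raise ValueError("variant must segregate in 1..n_lines lines")
    if n == 1:
        return "private"
    if n == n_lines:
        return "widespread"
    return "intermediate"  # placeholder label, not a term from the source


# Toy example with three hypothetical lines A, B, and C.
variants: Dict[str, Set[str]] = {
    "var1": {"A"},
    "var2": {"A", "B"},
    "var3": {"A", "B", "C"},
}
categories = {vid: prevalence_category(lines, n_lines=3)
              for vid, lines in variants.items()}
print(categories)
```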

RESULTS

Of the 46 million called variants, 28% were private (called in only one line) and 21% were widespread (called in all nine lines). Genomic regions with a low recombination rate were enriched with private variants. Low-prevalence variants (called in one or a few lines only) were enriched for lower allele frequencies, lower F_ST, and putatively functional and regulatory roles (including LOF and deleterious missense variants). On average, individuals carried fewer private deleterious missense alleles than expected compared to alleles with other predicted consequences. Only a small subset of the low-prevalence variants had intermediate allele frequencies and explained small fractions of phenotypic variance (up to 3.2%) of production traits. The significant low-prevalence variants had higher per-site F_ST than the non-significant ones. These associated low-prevalence variants were tagged by other more widespread variants in high linkage disequilibrium, including intergenic variants.

CONCLUSIONS

Most low-prevalence variants have low minor allele frequencies and only a small subset of low-prevalence variants contributed detectable fractions of phenotypic variance of production traits. Accounting for low-prevalence variants is therefore unlikely to noticeably benefit across-breed analyses, such as the prediction of genomic breeding values in a population using reference populations of a different genetic background.

Srivastava S, Lopez BI, Kumar H, Jang M, Chai HH, Park W, Park JE, Lim D. Prediction of Hanwoo Cattle Phenotypes from Genotypes Using Machine Learning Methods. Animals (Basel) 2021;11:2066. [PMID: 34359194 PMCID: PMC8300336 DOI: 10.3390/ani11072066] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 06/10/2021] [Revised: 07/06/2021] [Accepted: 07/09/2021] [Indexed: 11/16/2022] Open