1
|
Lee J, Hong I, Lee C, Kim D, Kim S, Lee Y. SNPs in microRNA seed region and impact of miR-375 in concurrent regulation of multiple lipid accumulation-related genes. Sci Rep 2024; 14:10924. [PMID: 38740866 DOI: 10.1038/s41598-024-61673-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2024] [Accepted: 05/08/2024] [Indexed: 05/16/2024] Open
Abstract
Bovine intramuscular fat (IMF), commonly referred to as marbling, is regulated by lipid metabolism, which includes adipogenesis, lipogenesis, glycerolipid synthesis, and lipolysis. In recent years, breeding researchers have identified single nucleotide polymorphisms (SNPs) as useful marker-assisted selection tools for improving marbling scores in national breeding programs. These included causal SNPs that induce phenotypic variation. MicroRNAs (miRNAs) are small highly conserved non-coding RNA molecules that bind to multiple non-coding regions. They are involved in post-transcriptional regulation. Multiple miRNAs may regulate a given target. Previously, three SNPs in the GPAM 3' UTR and four miRNAs were identified through in silico assays. The aim of this study is to verify the binding ability of the four miRNAs to the SNPs within the 3'UTR of GPAM, and to identify the regulatory function of miR-375 in the expression of genes related to lipid metabolism in mammalian adipocytes. It was verified that the four miRNAs bind to the GPAM 3'UTR, and identified that the miR-375 sequence is highly conserved. Furthermore, it was founded that miR-375 upregulated the GPAM gene, C/EBPα, PPARγ and lipid metabolism-related genes and promoted lipid droplet accumulation in 3T3-L1 cells. In conclusion, these results suggest that miR-375 is a multifunctional regulator of multiple lipid metabolism-related genes and may aid in obesity research as a biomarker.
Collapse
Affiliation(s)
- Jiyeon Lee
- School of Biotechnology, Hankyong National University, Anseong, Gyeonggi-do, South Korea
| | - Inpyo Hong
- School of Biotechnology, Hankyong National University, Anseong, Gyeonggi-do, South Korea
| | - Chanwoo Lee
- Nuonbio Inc., 906, A, 302 Galmachi-ro, Jungwon-gu, Seongnam-si, South Korea
| | - Daehyun Kim
- Department of Animal Science, Chonnam National University, Gwangju, South Korea
| | - Sunghak Kim
- Department of Animal Science, Chonnam National University, Gwangju, South Korea.
| | - Yoonseok Lee
- School of Biotechnology, Hankyong National University, Anseong, Gyeonggi-do, South Korea.
- Center for Genetic Information, Hankyong National University, Anseong, Gyeonggi-do, South Korea.
| |
Collapse
|
2
|
Liu Y, Zhang Y, Zhou F, Yao Z, Zhan Y, Fan Z, Meng X, Zhang Z, Liu L, Yang J, Wu Z, Cai G, Zheng E. Increased Accuracy of Genomic Prediction Using Preselected SNPs from GWAS with Imputed Whole-Genome Sequence Data in Pigs. Animals (Basel) 2023; 13:3871. [PMID: 38136908 PMCID: PMC10740755 DOI: 10.3390/ani13243871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 12/13/2023] [Accepted: 12/14/2023] [Indexed: 12/24/2023] Open
Abstract
Enhancing the accuracy of genomic prediction is a key goal in genomic selection (GS) research. Integrating prior biological information into GS methods using appropriate models can improve prediction accuracy for complex traits. Genome-wide association study (GWAS) is widely utilized to identify potential candidate loci associated with complex traits in livestock and poultry, offering essential genomic insights. In this study, a GWAS was conducted on 685 Duroc × Landrace × Yorkshire (DLY) pigs to extract significant single-nucleotide polymorphisms (SNPs) as genomic features. We compared two GS models, genomic best linear unbiased prediction (GBLUP) and genomic feature BLUP (GFBLUP), by using imputed whole-genome sequencing (WGS) data on 651 Yorkshire pigs. The results revealed that the GBLUP model achieved prediction accuracies of 0.499 for backfat thickness (BFT) and 0.423 for loin muscle area (LMA). By applying the GFBLUP model with GWAS-based SNP preselection, the average prediction accuracies for BFT and LMA traits reached 0.491 and 0.440, respectively. Specifically, the GFBLUP model displayed a 4.8% enhancement in predicting LMA compared to the GBLUP model. These findings suggest that, in certain scenarios, the GFBLUP model may offer superior genomic prediction accuracy when compared to the GBLUP model, underscoring the potential value of incorporating genomic features to refine GS models.
Collapse
Affiliation(s)
- Yiyi Liu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Yuling Zhang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Fuchen Zhou
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zekai Yao
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Yuexin Zhan
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zhenfei Fan
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Xianglun Meng
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zebin Zhang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Langqing Liu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Jie Yang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zhenfang Wu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
- Guangdong Zhongxin Breeding Technology Co., Ltd., Guangzhou 510642, China
| | - Gengyuan Cai
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
- Guangdong Zhongxin Breeding Technology Co., Ltd., Guangzhou 510642, China
| | - Enqin Zheng
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| |
Collapse
|
3
|
Haque MA, Iqbal A, Bae H, Lee SE, Park S, Lee YM, Kim JJ. Assessment of genomic breeding values and their accuracies for carcass traits in Jeju Black cattle using whole-genome SNP chip panels. J Anim Breed Genet 2023; 140:519-531. [PMID: 37102238 DOI: 10.1111/jbg.12776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Revised: 04/04/2023] [Accepted: 04/06/2023] [Indexed: 04/28/2023]
Abstract
The objective of the present study was to evaluate the breeding value and accuracy of genomic estimated breeding values (GEBVs) of carcass traits in Jeju Black cattle (JBC) using Hanwoo steers and JBC as a reference population using the single-trait animal model. Our research included genotype and phenotype information on 19,154 Hanwoo steers with 1097 JBC acting as the reference population. Likewise, the test population consisted of 418 genotyped JBC individuals with no phenotypic records for those carcass traits. For estimating the accuracy of GEBV, we divided the entire population into three groups. Hanwoo and JBC make up the first group; Hanwoo and JBC, who has both the genotype and phenotypic records, are referred to as the reference (training) population, and JBC, who lacks phenotypic information is referred to as the test (validation) population. The second group consists of the JBC (without phenotype) as the test population and Hanwoo as a reference population with phenotype and genotypic data. The only JBCs in the third group are those who have genotypic and phenotypic data on them as a reference population but no phenotypic data on them as a test population. The single-trait animal model was used in all three groups for statistical purposes. The reference populations estimated heritabilities for carcass weight (CWT), eye muscle area (EMA), backfat thickness (BF), and marbling score (MS) as 0.30, 0.26, 0.26, and 0.34 for the Hanwoo steer and 0.42, 0.27, 0.26, and 0.48 for JBC. The average accuracy for carcass traits in Group 1 was 0.80 for the Hanwoo and JBC reference population compared with 0.73 for the JBC test population. Although the average accuracy for carcass traits in Group 2 was 0.80, it was 0.80 for the Hanwoo reference population and only 0.56 for the JBC test population. The average accuracy for the JBC reference and test populations was 0.68 and 0.50, respectively, when they were included in the accuracy comparison without the Hanwoo reference population. Groups 1 and 2 used Hanwoo as reference population, which led to a better average accuracy; however, Group 3 only used the JBC reference and test population, which led to a lower average accuracy. This might be due to the fact that Group 3 used a smaller reference size than the group that came before it and that the genetic makeup of the Hanwoo and JBC breeds differed. The GEBV accuracy for MS was higher than that of other traits across all three analysis groups, followed by CWT, EMA, and BF, which may be partially explained by the MS traits' higher heritability. This study suggests that in order to achieve more accuracy, a large reference population particular to a breed should be established. Therefore, to increase the accuracy of GEBV prediction and the genetic benefit from genomic selection in JBC, individual reference breeds, and large populations are required.
Collapse
Affiliation(s)
- Md Azizul Haque
- Department of Biotechnology, Yeungnam University, Gyeongsan, Gyeongbuk, Korea
| | - Asif Iqbal
- Department of Biotechnology, Yeungnam University, Gyeongsan, Gyeongbuk, Korea
| | - Haechang Bae
- Department of Biotechnology, Yeungnam University, Gyeongsan, Gyeongbuk, Korea
| | - Seung Eun Lee
- Department of Biomedical Informatics, Jeju National University, Jeju, Korea
| | - Sepil Park
- Department of Biomedical Informatics, Jeju National University, Jeju, Korea
| | - Yun Mi Lee
- Department of Biotechnology, Yeungnam University, Gyeongsan, Gyeongbuk, Korea
| | - Jong Joo Kim
- Department of Biotechnology, Yeungnam University, Gyeongsan, Gyeongbuk, Korea
| |
Collapse
|
4
|
Jang S, Ros-Freixedes R, Hickey JM, Chen CY, Holl J, Herring WO, Misztal I, Lourenco D. Using pre-selected variants from large-scale whole-genome sequence data for single-step genomic predictions in pigs. Genet Sel Evol 2023; 55:55. [PMID: 37495982 PMCID: PMC10373252 DOI: 10.1186/s12711-023-00831-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Accepted: 07/18/2023] [Indexed: 07/28/2023] Open
Abstract
BACKGROUND Whole-genome sequence (WGS) data harbor causative variants that may not be present in standard single nucleotide polymorphism (SNP) chip data. The objective of this study was to investigate the impact of using preselected variants from WGS for single-step genomic predictions in maternal and terminal pig lines with up to 1.8k sequenced and 104k sequence imputed animals per line. METHODS Two maternal and four terminal lines were investigated for eight and seven traits, respectively. The number of sequenced animals ranged from 1365 to 1491 for the maternal lines and 381 to 1865 for the terminal lines. Imputation to sequence occurred within each line for 66k to 76k animals for the maternal lines and 29k to 104k animals for the terminal lines. Two preselected SNP sets were generated based on a genome-wide association study (GWAS). Top40k included the SNPs with the lowest p-value in each of the 40k genomic windows, and ChipPlusSign included significant variants integrated into the porcine SNP chip used for routine genotyping. We compared the performance of single-step genomic predictions between using preselected SNP sets assuming equal or different variances and the standard porcine SNP chip. RESULTS In the maternal lines, ChipPlusSign and Top40k showed an average increase in accuracy of 0.6 and 4.9%, respectively, compared to the regular porcine SNP chip. The greatest increase was obtained with Top40k, particularly for fertility traits, for which the initial accuracy based on the standard SNP chip was low. However, in the terminal lines, Top40k resulted in an average loss of accuracy of 1%. ChipPlusSign provided a positive, although small, gain in accuracy (0.9%). Assigning different variances for the SNPs slightly improved accuracies when using variances obtained from BayesR. However, increases were inconsistent across the lines and traits. CONCLUSIONS The benefit of using sequence data depends on the line, the size of the genotyped population, and how the WGS variants are preselected. When WGS data are available on hundreds of thousands of animals, using sequence data presents an advantage but this remains limited in pigs.
Collapse
Affiliation(s)
- Sungbong Jang
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA.
| | - Roger Ros-Freixedes
- Departament de Ciència Animal, Universitat de Lleida-Agrotecnio-CERCA Center, Lleida, Spain
| | - John M Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Ching-Yi Chen
- The Pig Improvement Company, Genus Plc, Hendersonville, TN, USA
| | - Justin Holl
- The Pig Improvement Company, Genus Plc, Hendersonville, TN, USA
| | | | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| |
Collapse
|
5
|
Jang S, Tsuruta S, Leite NG, Misztal I, Lourenco D. Dimensionality of genomic information and its impact on genome-wide associations and variant selection for genomic prediction: a simulation study. Genet Sel Evol 2023; 55:49. [PMID: 37460964 DOI: 10.1186/s12711-023-00823-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Accepted: 07/03/2023] [Indexed: 07/20/2023] Open
Abstract
BACKGROUND Identifying true positive variants in genome-wide associations (GWA) depends on several factors, including the number of genotyped individuals. The limited dimensionality of genomic information may give insights into the optimal number of individuals to be used in GWA. This study investigated different discovery set sizes based on the number of largest eigenvalues explaining a certain proportion of variance in the genomic relationship matrix (G). In addition, we investigated the impact on the prediction accuracy by adding variants, which were selected based on different set sizes, to the regular single nucleotide polymorphism (SNP) chips used for genomic prediction. METHODS We simulated sequence data that included 500k SNPs with 200 or 2000 quantitative trait nucleotides (QTN). A regular 50k panel included one in every ten simulated SNPs. Effective population size (Ne) was set to 20 or 200. GWA were performed using a number of genotyped animals equivalent to the number of largest eigenvalues of G (EIG) explaining 50, 60, 70, 80, 90, 95, 98, and 99% of the variance. In addition, the largest discovery set consisted of 30k genotyped animals. Limited or extensive phenotypic information was mimicked by changing the trait heritability. Significant and large-effect size SNPs were added to the 50k panel and used for single-step genomic best linear unbiased prediction (ssGBLUP). RESULTS Using a number of genotyped animals corresponding to at least EIG98 allowed the identification of QTN with the largest effect sizes when Ne was large. Populations with smaller Ne required more than EIG98. Furthermore, including genotyped animals with a higher reliability (i.e., a higher trait heritability) improved the identification of the most informative QTN. Prediction accuracy was highest when the significant or the large-effect SNPs representing twice the number of simulated QTN were added to the 50k panel. CONCLUSIONS Accurately identifying causative variants from sequence data depends on the effective population size and, therefore, on the dimensionality of genomic information. This dimensionality can help identify the most suitable sample size for GWA and could be considered for variant selection, especially when resources are restricted. Even when variants are accurately identified, their inclusion in prediction models has limited benefits.
Collapse
Affiliation(s)
- Sungbong Jang
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA.
| | - Shogo Tsuruta
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| | - Natalia Galoro Leite
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| |
Collapse
|
6
|
Jang S, Ros-Freixedes R, Hickey JM, Chen CY, Herring WO, Holl J, Misztal I, Lourenco D. Multi-line ssGBLUP evaluation using preselected markers from whole-genome sequence data in pigs. Front Genet 2023; 14:1163626. [PMID: 37252662 PMCID: PMC10213539 DOI: 10.3389/fgene.2023.1163626] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Accepted: 05/03/2023] [Indexed: 05/31/2023] Open
Abstract
Genomic evaluations in pigs could benefit from using multi-line data along with whole-genome sequencing (WGS) if the data are large enough to represent the variability across populations. The objective of this study was to investigate strategies to combine large-scale data from different terminal pig lines in a multi-line genomic evaluation (MLE) through single-step GBLUP (ssGBLUP) models while including variants preselected from whole-genome sequence (WGS) data. We investigated single-line and multi-line evaluations for five traits recorded in three terminal lines. The number of sequenced animals in each line ranged from 731 to 1,865, with 60k to 104k imputed to WGS. Unknown parent groups (UPG) and metafounders (MF) were explored to account for genetic differences among the lines and improve the compatibility between pedigree and genomic relationships in the MLE. Sequence variants were preselected based on multi-line genome-wide association studies (GWAS) or linkage disequilibrium (LD) pruning. These preselected variant sets were used for ssGBLUP predictions without and with weights from BayesR, and the performances were compared to that of a commercial porcine single-nucleotide polymorphisms (SNP) chip. Using UPG and MF in MLE showed small to no gain in prediction accuracy (up to 0.02), depending on the lines and traits, compared to the single-line genomic evaluation (SLE). Likewise, adding selected variants from the GWAS to the commercial SNP chip resulted in a maximum increase of 0.02 in the prediction accuracy, only for average daily feed intake in the most numerous lines. In addition, no benefits were observed when using preselected sequence variants in multi-line genomic predictions. Weights from BayesR did not help improve the performance of ssGBLUP. This study revealed limited benefits of using preselected whole-genome sequence variants for multi-line genomic predictions, even when tens of thousands of animals had imputed sequence data. Correctly accounting for line differences with UPG or MF in MLE is essential to obtain predictions similar to SLE; however, the only observed benefit of an MLE is to have comparable predictions across lines. Further investigation into the amount of data and novel methods to preselect whole-genome causative variants in combined populations would be of significant interest.
Collapse
Affiliation(s)
- Sungbong Jang
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, United States
| | - Roger Ros-Freixedes
- Departament de Ciència Animal, Universitat de Lleida-Agrotecnio-CERCA Center, Lleida, Spain
| | - John M. Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh, Scotland, United Kingdom
| | - Ching-Yi Chen
- The Pig Improvement Company, Genus plc, Hendersonville, TN, United States
| | - William O. Herring
- The Pig Improvement Company, Genus plc, Hendersonville, TN, United States
| | - Justin Holl
- The Pig Improvement Company, Genus plc, Hendersonville, TN, United States
| | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, United States
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, United States
| |
Collapse
|
7
|
Russell CA, Kuehn LA, Snelling WM, Kachman SD, Spangler ML. Variance component estimates for growth traits in beef cattle using selected variants from imputed low-pass sequence data. J Anim Sci 2023; 101:skad274. [PMID: 37585275 PMCID: PMC10464510 DOI: 10.1093/jas/skad274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Accepted: 08/11/2023] [Indexed: 08/18/2023] Open
Abstract
A beef cattle population (n = 2,343) was used to assess the impact of variants identified from the imputed low-pass sequence (LPS) on the estimation of variance components and genetic parameters of birth weight (BWT) and post-weaning gain (PWG). Variants were selected based on functional impact and were partitioned into four groups (low, modifier, moderate, high) based on predicted functional impact and re-partitioned based on the consequence of mutation, such as missense and untranslated region variants, into six groups (G1-G6). Each subset was used to construct a genomic relationship matrix (GRM) for univariate animal models. Multiple analyses were conducted to compare the proportion of additive genetic variation explained by the different subsets individually and collectively, and these estimates were benchmarked against all LPS variants in a single GRM and array (e.g., GeneSeek Genomic Profiler 100K) genotypes. When all variants were included in a single GRM, heritability estimates for BWT and PWG were 0.43 ± 0.05 and 0.38 ± 0.05, respectively. Heritability estimates for BWT ranged from 0.10 to 0.42 dependent on which variant subsets were included. Similarly, estimates for PWG ranged from 0.05 to 0.38. Results showed that variants in the subsets modifier and G1 (untranslated region) yielded the highest heritability estimates and were similar to the inclusion of all variants, while estimates from GRM containing only variants in the categories High, G4 (non-coding transcript exon), and G6 (start and stop loss/gain) were the lowest. All variants combined provided similar heritability estimates to chip genotypes and provided minimal to no additional information when combined with chip data. This suggests that the chip single nucleotide polymorphisms and the variants from LPS predicted to be less consequential are in relatively high linkage disequilibrium with the underlying causal variants as a whole and sufficiently spread throughout the genome to capture larger proportions of additive genetic variation.
Collapse
Affiliation(s)
- Chad A Russell
- Department of Animal Science, University of Nebraska, Lincoln, NE 68583, USA
| | - Larry A Kuehn
- USDA, ARS, Roman L. Hruska U.S. Meat Animal Research Center, Clay Center, NE 68933, USA
| | - Warren M Snelling
- USDA, ARS, Roman L. Hruska U.S. Meat Animal Research Center, Clay Center, NE 68933, USA
| | - Stephen D Kachman
- Department of Statistics, University of Nebraska, Lincoln, NE 68583, USA
| | - Matthew L Spangler
- Department of Animal Science, University of Nebraska, Lincoln, NE 68583, USA
| |
Collapse
|
8
|
Lopes FB, Baldi F, Brunes LC, Oliveira E Costa MF, da Costa Eifert E, Rosa GJM, Lobo RB, Magnabosco CU. Genomic prediction for meat and carcass traits in Nellore cattle using a Markov blanket algorithm. J Anim Breed Genet 2023; 140:1-12. [PMID: 36239216 DOI: 10.1111/jbg.12740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 09/22/2022] [Indexed: 12/13/2022]
Abstract
This study was carried out to evaluate the advantage of preselecting SNP markers using Markov blanket algorithm regarding the accuracy of genomic prediction for carcass and meat quality traits in Nellore cattle. This study considered 3675, 3680, 3660 and 524 records of rib eye area (REA), back fat thickness (BF), rump fat (RF), and Warner-Bratzler shear force (WBSF), respectively, from the Nellore Brazil Breeding Program. The animals have been genotyped using low-density SNP panel (30 k), and subsequently imputed for arrays with 777 k SNPs. Four Bayesian specifications of genomic regression models, namely Bayes A, Bayes B, Bayes Cπ and Bayesian Ridge Regression methods were compared in terms of prediction accuracy using a five folds cross-validation. Prediction accuracy for REA, BF and RF was all similar using the Bayesian Alphabet models, ranging from 0.75 to 0.95. For WBSF, the predictive ability was higher using Bayes B (0.47) than other methods (0.39 to 0.42). Although the prediction accuracies using Markov blanket of SNP markers were lower than those using all SNPs, for WBSF the relative gain was lower than 13%. With a subset of informative SNPs markers, identified using Markov blanket, probably, is possible to capture a large proportion of the genetic variance for WBSF. The development of low-density and customized arrays using Markov blanket might be cost-effective to perform a genomic selection for this trait, increasing the number of evaluated animals, improving the management decisions based on genomic information and applying genomic selection on a large scale.
Collapse
Affiliation(s)
- Fernando Brito Lopes
- São Paulo State University - Júlio de Mesquita Filho (UNESP), Department of Animal Science, Prof. Paulo Donato Castelane, Jaboticabal, Brazil.,Embrapa Cerrados, Brasilia, Brazil
| | - Fernando Baldi
- São Paulo State University - Júlio de Mesquita Filho (UNESP), Department of Animal Science, Prof. Paulo Donato Castelane, Jaboticabal, Brazil
| | | | | | | | - Guilherme Jordão Magalhães Rosa
- Department of Animal Sciences, University of Wisconsin-Madison, Madison, Wisconsin, USA.,Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | | | | |
Collapse
|
9
|
Sánchez-Roncancio C, García B, Gallardo-Hidalgo J, Yáñez JM. GWAS on Imputed Whole-Genome Sequence Variants Reveal Genes Associated with Resistance to Piscirickettsia salmonis in Rainbow Trout ( Oncorhynchus mykiss). Genes (Basel) 2022; 14:114. [PMID: 36672855 PMCID: PMC9859203 DOI: 10.3390/genes14010114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 12/27/2022] [Accepted: 12/28/2022] [Indexed: 12/31/2022] Open
Abstract
Genome-wide association studies (GWAS) allow the identification of associations between genetic variants and important phenotypes in domestic animals, including disease-resistance traits. Whole Genome Sequencing (WGS) data can help increase the resolution and statistical power of association mapping. Here, we conduced GWAS to asses he facultative intracellular bacterium Piscirickettsia salmonis, which affects farmed rainbow trout, Oncorhynchus mykiss, in Chile using imputed genotypes at the sequence level and searched for candidate genes located in genomic regions associated with the trait. A total of 2130 rainbow trout were intraperitoneally challenged with P. salmonis under controlled conditions and genotyped using a 57K single nucleotide polymorphism (SNP) panel. Genotype imputation was performed in all the genotyped animals using WGS data from 102 individuals. A total of 488,979 imputed WGS variants were available in the 2130 individuals after quality control. GWAS revealed genome-wide significant quantitative trait loci (QTL) in Omy02, Omy03, Omy25, Omy26 and Omy27 for time to death and in Omy26 for binary survival. Twenty-four (24) candidate genes associated with P. salmonis resistance were identified, which were mainly related to phagocytosis, innate immune response, inflammation, oxidative response, lipid metabolism and apoptotic process. Our results provide further knowledge on the genetic variants and genes associated with resistance to intracellular bacterial infection in rainbow trout.
Collapse
Affiliation(s)
- Charles Sánchez-Roncancio
- Doctorado en Acuicultura, Programa Cooperativo: Universidad de Chile. Universidad Católica del Norte. Pontificia Universidad Católica de Valparaíso, Chile
- Center for Research and Innovation in Aquaculture (CRIA), Universidad de Chile, Santiago 8820808, Chile
| | - Baltasar García
- Center for Research and Innovation in Aquaculture (CRIA), Universidad de Chile, Santiago 8820808, Chile
- Facultad de Ciencias Veterinarias y Pecuarias, Universidad de Chile, La Pintana, Santiago 8820808, Chile
| | - Jousepth Gallardo-Hidalgo
- Center for Research and Innovation in Aquaculture (CRIA), Universidad de Chile, Santiago 8820808, Chile
- Facultad de Ciencias Veterinarias y Pecuarias, Universidad de Chile, La Pintana, Santiago 8820808, Chile
| | - José M. Yáñez
- Center for Research and Innovation in Aquaculture (CRIA), Universidad de Chile, Santiago 8820808, Chile
- Facultad de Ciencias Veterinarias y Pecuarias, Universidad de Chile, La Pintana, Santiago 8820808, Chile
- Núcleo Milenio de Salmonidos Invasores Australes (INVASAL), Concepcion 4030000, Chile
| |
Collapse
|
10
|
Brzáková M, Bauer J, Steyn Y, Šplíchal J, Fulínová D. The prediction accuracies of linear-type traits in Czech Holstein cattle when using ssGBLUP or wssGBLUP. J Anim Sci 2022; 100:skac369. [PMID: 36334266 PMCID: PMC9746800 DOI: 10.1093/jas/skac369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 11/04/2022] [Indexed: 11/07/2022] Open
Abstract
The aim of this study was to assess the contribution of the weighted single-step genomic best linear unbiased prediction (wssGBLUP) method compared to the single-step genomic best linear unbiased prediction (ssGBLUP) method for genomic evaluation of 25 linear-type traits in the Czech Holstein cattle population. The nationwide database of linear-type traits with 6,99,681 records combined with deregressed proofs from Interbull (MACE method) was used as the input data. Genomic breeding values (GEBVs) were predicted based on these phenotypes using ssGBLUP and wssGBLUP methods using the BLUPF90 software. The bull validation test was employed which was based on comparing GEBVs of young bulls (N = 334) with no progeny in 2016. A minimum of 50 daughters with their own performance in 2020 was chosen to verify the contribution to the GEBV prediction, GEBV reliability, validation reliabilities (R2), and regression coefficients (b1). The results showed that the differences between the two methods were negligible. The low benefit of wssGBLUP may be due to the inclusion of a small number of SNPs; therefore, most predictions rely on polygenic relationships between animals. Nevertheless, the benefits of wssGBLUP analysis should be assessed with respect to specific population structures and given traits.
Collapse
Affiliation(s)
- Michaela Brzáková
- Department of Genetics and Breeding of Farm Animals, Institute of Animal Science, Prague-Uhříněves 104 00, Czech Republic
| | - Jiří Bauer
- Czech-Moravian Breeders’ Corporation, Hradištko 252 09, Czech Republic
| | - Yvette Steyn
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Jiří Šplíchal
- Czech-Moravian Breeders’ Corporation, Hradištko 252 09, Czech Republic
| | - Daniela Fulínová
- Czech-Moravian Breeders’ Corporation, Hradištko 252 09, Czech Republic
| |
Collapse
|
11
|
Ros-Freixedes R, Johnsson M, Whalen A, Chen CY, Valente BD, Herring WO, Gorjanc G, Hickey JM. Genomic prediction with whole-genome sequence data in intensely selected pig lines. GENETICS SELECTION EVOLUTION 2022; 54:65. [PMID: 36153511 PMCID: PMC9509613 DOI: 10.1186/s12711-022-00756-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 09/05/2022] [Indexed: 12/03/2022]
Abstract
Background Early simulations indicated that whole-genome sequence data (WGS) could improve the accuracy of genomic predictions within and across breeds. However, empirical results have been ambiguous so far. Large datasets that capture most of the genomic diversity in a population must be assembled so that allele substitution effects are estimated with high accuracy. The objectives of this study were to use a large pig dataset from seven intensely selected lines to assess the benefits of using WGS for genomic prediction compared to using commercial marker arrays and to identify scenarios in which WGS provides the largest advantage. Methods We sequenced 6931 individuals from seven commercial pig lines with different numerical sizes. Genotypes of 32.8 million variants were imputed for 396,100 individuals (17,224 to 104,661 per line). We used BayesR to perform genomic prediction for eight complex traits. Genomic predictions were performed using either data from a standard marker array or variants preselected from WGS based on association tests. Results The accuracies of genomic predictions based on preselected WGS variants were not robust across traits and lines and the improvements in prediction accuracy that we achieved so far with WGS compared to standard marker arrays were generally small. The most favourable results for WGS were obtained when the largest training sets were available and standard marker arrays were augmented with preselected variants with statistically significant associations to the trait. With this method and training sets of around 80k individuals, the accuracy of within-line genomic predictions was on average improved by 0.025. With multi-line training sets, improvements of 0.04 compared to marker arrays could be expected. Conclusions Our results showed that WGS has limited potential to improve the accuracy of genomic predictions compared to marker arrays in intensely selected pig lines. Thus, although we expect that larger improvements in accuracy from the use of WGS are possible with a combination of larger training sets and optimised pipelines for generating and analysing such datasets, the use of WGS in the current implementations of genomic prediction should be carefully evaluated against the cost of large-scale WGS data on a case-by-case basis. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-022-00756-0.
Collapse
|
12
|
Rare and population-specific functional variation across pig lines. Genet Sel Evol 2022; 54:39. [PMID: 35659233 PMCID: PMC9164375 DOI: 10.1186/s12711-022-00732-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 05/17/2022] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND It is expected that functional, mainly missense and loss-of-function (LOF), and regulatory variants are responsible for most phenotypic differences between breeds and genetic lines of livestock species that have undergone diverse selection histories. However, there is still limited knowledge about the existing missense and LOF variation in commercial livestock populations, in particular regarding population-specific variation and how it can affect applications such as across-breed genomic prediction. METHODS We re-sequenced the whole genome of 7848 individuals from nine commercial pig lines (average sequencing coverage: 4.1×) and imputed whole-genome genotypes for 440,610 pedigree-related individuals. The called variants were categorized according to predicted functional annotation (from LOF to intergenic) and prevalence level (number of lines in which the variant segregated; from private to widespread). Variants in each category were examined in terms of their distribution along the genome, alternative allele frequency, per-site Wright's fixation index (FST), individual load, and association to production traits. RESULTS Of the 46 million called variants, 28% were private (called in only one line) and 21% were widespread (called in all nine lines). Genomic regions with a low recombination rate were enriched with private variants. Low-prevalence variants (called in one or a few lines only) were enriched for lower allele frequencies, lower FST, and putatively functional and regulatory roles (including LOF and deleterious missense variants). On average, individuals carried fewer private deleterious missense alleles than expected compared to alleles with other predicted consequences. Only a small subset of the low-prevalence variants had intermediate allele frequencies and explained small fractions of phenotypic variance (up to 3.2%) of production traits. The significant low-prevalence variants had higher per-site FST than the non-significant ones. These associated low-prevalence variants were tagged by other more widespread variants in high linkage disequilibrium, including intergenic variants. CONCLUSIONS Most low-prevalence variants have low minor allele frequencies and only a small subset of low-prevalence variants contributed detectable fractions of phenotypic variance of production traits. Accounting for low-prevalence variants is therefore unlikely to noticeably benefit across-breed analyses, such as the prediction of genomic breeding values in a population using reference populations of a different genetic background.
Collapse
|
13
|
Srivastava S, Lopez BI, Kumar H, Jang M, Chai HH, Park W, Park JE, Lim D. Prediction of Hanwoo Cattle Phenotypes from Genotypes Using Machine Learning Methods. Animals (Basel) 2021; 11:ani11072066. [PMID: 34359194 PMCID: PMC8300336 DOI: 10.3390/ani11072066] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Revised: 07/06/2021] [Accepted: 07/09/2021] [Indexed: 11/16/2022] Open
Abstract
Hanwoo was originally raised for draft purposes, but the increase in local demand for red meat turned that purpose into full-scale meat-type cattle rearing; it is now considered one of the most economically important species and a vital food source for Koreans. The application of genomic selection in Hanwoo breeding programs in recent years was expected to lead to higher genetic progress. However, better statistical methods that can improve the genomic prediction accuracy are required. Hence, this study aimed to compare the predictive performance of three machine learning methods, namely, random forest (RF), extreme gradient boosting method (XGB), and support vector machine (SVM), when predicting the carcass weight (CWT), marbling score (MS), backfat thickness (BFT) and eye muscle area (EMA). Phenotypic and genotypic data (53,866 SNPs) from 7324 commercial Hanwoo cattle that were slaughtered at the age of around 30 months were used. The results showed that the boosting method XGB showed the highest predictive correlation for CWT and MS, followed by GBLUP, SVM, and RF. Meanwhile, the best predictive correlation for BFT and EMA was delivered by GBLUP, followed by SVM, RF, and XGB. Although XGB presented the highest predictive correlations for some traits, we did not find an advantage of XGB or any machine learning methods over GBLUP according to the mean squared error of prediction. Thus, we still recommend the use of GBLUP in the prediction of genomic breeding values for carcass traits in Hanwoo cattle.
Collapse
|