Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Villa-Angulo R, Matukumalli LK, Gill CA, Choi J, Van Tassell CP, Grefenstette JJ. High-resolution haplotype block structure in the cattle genome. BMC Genet 2009;10:19. [PMID: 19393054 PMCID: PMC2684545 DOI: 10.1186/1471-2156-10-19] [Citation(s) in RCA: 122] [Impact Index Per Article: 8.1] [Received: 06/05/2008] [Accepted: 04/24/2009] [Indexed: 02/06/2023] Open

Cited by Other Article(s)

Nisa FU, Kaul H, Asif M, Amin I, Mrode R, Mansoor S, Mukhtar Z. Genetic insights into crossbred dairy cattle of Pakistan: exploring allele frequency, linkage disequilibrium, and effective population size at a genome-wide scale. Mamm Genome 2023;34:602-614. [PMID: 37804434 DOI: 10.1007/s00335-023-10019-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/27/2023] [Accepted: 09/13/2023] [Indexed: 10/09/2023]

Barani S, Nejati-Javaremi A, Moradi MH, Moradi-Sharbabak M, Gholizadeh M, Esfandyari H. Genome-wide study of linkage disequilibrium, population structure, and inbreeding in Iranian indigenous sheep breeds. PLoS One 2023;18:e0286463. [PMID: 37267244 DOI: 10.1371/journal.pone.0286463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/20/2022] [Accepted: 05/16/2023] [Indexed: 06/04/2023] Open

Dash S, Singh A, Dixit SP, Kumar A, Behera R. Exploring haplotype block structure, runs of homozygosity, and effective population size among dairy cattle breeds of India. Trop Anim Health Prod 2023;55:129. [PMID: 36952060 DOI: 10.1007/s11250-023-03534-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/17/2022] [Accepted: 03/06/2023] [Indexed: 03/24/2023]

Ahmad SF, Singh A, Gangwar M, Kumar S, Dutt T, Kumar A. Haplotype-based association study of production and reproduction traits in multigenerational Vrindavani population. Gene 2023;867:147365. [PMID: 36918047 DOI: 10.1016/j.gene.2023.147365] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 01/18/2023] [Revised: 02/23/2023] [Accepted: 03/08/2023] [Indexed: 03/14/2023]

Abstract

Haplotype-based association analysis promises to reveal important information regarding the effect of genetic variants on economic traits of interest. The present study aimed to evaluate the haplotype structure of Vrindavani cattle and explore the association of haplotypes with (re)production traits of economic interest. Genotyping array data of medium density (Bovine50KSNP BeadChip) on 96 randomly selected Vrindavani cows were used in the present study. Genotypes were called in the GenomeStudio program while quality control was undertaken in PLINK using standard thresholds. The phenotypic traits used in the present study included age at first calving, dry days, lactation length, peak yield, total lactation milk yield, inter-calving period and service period. The haplotype structure of the Vrindavani population was assessed, using a sliding window of 20 SNPs with a shift of 5 SNPs at a time, in terms of the size of haplotype blocks regarding their length (in Kb) and frequency in chromosome-wise fashion. Haplotype blocks were assessed for possible association with important production and reproduction traits across three lactation cycles in the Vrindavani cattle population. The first ten principal components were included in the model for haplotype-based association analysis to correct for stratification effects of assessed individuals. Multiple haplotypes were found to be associated with age at first calving, total lactation milk yield, peak yield, dry days, inter-calving period and service period. Various candidate genes were found to overlap haplotypes that were significantly associated with age at first calving (CDH18, MARCHF11, MYO10, FBXL7), total lactation milk yield (TGF, PDE1A, and COL8A1), peak yield (PPARGC1A, RCAN1, KCNE1, SMIM34 and MRPS6), dry days (CPNE4, ACAD11 and MRAS), inter-calving period (ABCG5, ABCG8 and COX7A2L) and service period (FOXL2 and PIK3CB).
The putative candidate genes overlapping the significantly associated haplotypes revealed important pathways affecting the production and reproduction performance of animals. The identified genes and pathways may serve as good candidate markers to select animals for improved production and reproduction performance in future generations.
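
The sliding-window scan mentioned in the abstract above (windows of 20 SNPs shifted by 5 SNPs) can be pictured with a short sketch. This is only an illustration of the windowing arithmetic, not the authors' pipeline: the function name `sliding_windows` and the choice to drop a trailing partial window are assumptions.

```python
from typing import Iterator, Tuple


def sliding_windows(n_snps: int, window: int = 20, step: int = 5) -> Iterator[Tuple[int, int]]:
    """Yield half-open (start, end) SNP index ranges along one chromosome.

    Hypothetical helper: windows hold `window` consecutive SNPs and move
    forward by `step` SNPs. A trailing partial window is dropped, which is
    an assumption and not something the abstract specifies.
    """
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    start = 0
    while start + window <= n_snps:
        yield (start, start + window)
        start += step


# Example: a chromosome with 32 SNPs gives three overlapping windows.
windows = list(sliding_windows(32))
print(windows)
```

Consecutive windows share 15 SNPs, so a haplotype spanning a window boundary is still seen intact in at least one window.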

Calderón-Chagoya R, Vega-Murillo VE, García-Ruiz A, Ríos-Utrera Á, Martínez-Velázquez G, Montaño-Bermúdez M. Genome and chromosome wide association studies for growth traits in Simmental and Simbrah cattle. Anim Biosci 2023;36:19-28. [PMID: 35798032 PMCID: PMC9834659 DOI: 10.5713/ab.21.0517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/25/2021] [Accepted: 06/27/2022] [Indexed: 01/27/2023] Open

Hu X, Hao D, Yin J, Gong F, Wang X, Wang R, Liu B. Association between MIR31HG polymorphisms and the risk of Lumbar disc herniation in Chinese Han population. Cell Cycle 2022;21:2109-2120. [PMID: 35704669 DOI: 10.1080/15384101.2022.2087281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/03/2022] Open

Campos GS, Cardoso FF, Gomes CCG, Domingues R, de Almeida Regitano LC, de Sena Oliveira MC, de Oliveira HN, Carvalheiro R, Albuquerque LG, Miller S, Misztal I, Lourenco D. Development of genomic predictions for Angus cattle in Brazil incorporating genotypes from related American sires. J Anim Sci 2022;100:6507787. [PMID: 35031806 PMCID: PMC8867558 DOI: 10.1093/jas/skac009] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 07/06/2021] [Accepted: 01/12/2022] [Indexed: 11/24/2022] Open

Abstract

Genomic prediction has become the new standard for genetic improvement programs, and currently, there is a desire to implement this technology for the evaluation of Angus cattle in Brazil. Thus, the main objective of this study was to assess the feasibility of evaluating young Brazilian Angus (BA) bulls and heifers for 12 routinely recorded traits using single-step genomic BLUP (ssGBLUP) with and without genotypes from American Angus (AA) sires. The second objective was to obtain estimates of effective population size (N_e) and linkage disequilibrium (LD) in the Brazilian Angus population. The dataset contained phenotypic information for up to 277,661 animals belonging to the Promebo breeding program, pedigree for 362,900, of which 1,386 were genotyped for 50k, 77k, and 150k single nucleotide polymorphism (SNP) panels. After imputation and quality control, 61,666 SNPs were available for the analyses. In addition, genotypes from 332 American Angus (AA) sires widely used in Brazil were retrieved from the AA Association database to be used for genomic predictions. Bivariate animal models were used to estimate variance components, traditional EBV, and genomic EBV (GEBV). Validation was carried out with the linear regression method (LR) using young-genotyped animals born between 2013 and 2015 without phenotypes in the reduced dataset and with records in the complete dataset. Validation animals were further split into progeny of BA and AA sires to evaluate if their progenies would benefit by including genotypes from AA sires. The N_e was 254 based on pedigree and 197 based on LD, and the average LD (±SD) and distance between adjacent single nucleotide polymorphisms (SNPs) across all chromosomes were 0.27 (±0.27) and 40743.68 bp, respectively. Prediction accuracies with ssGBLUP outperformed BLUP for all traits, improving accuracies by, on average, 16% for BA young bulls and heifers. 
The GEBV prediction accuracies ranged from 0.37 (total maternal for weaning weight and tick count) to 0.54 (yearling precocity) across all traits, and dispersion (LR coefficients) fluctuated between 0.92 and 1.06. Inclusion of genotyped sires from the AA improved GEBV accuracies by 2%, on average, compared to using only the BA reference population. Our study indicated that genomic information could help us to improve GEBV accuracies and hence genetic progress in the Brazilian Angus population. The inclusion of genotypes from American Angus sires heavily used in Brazil just marginally increased the GEBV accuracies for selection candidates.

There was a desire to implement genomic selection for Angus cattle in Brazil since the technology has been proved to increase genetic gain in animal breeding programs. Single-step genomic best linear unbiased prediction (ssGBLUP), which simultaneously combines pedigree and genomic information, was used to estimate individuals’ genomic breeding values (GEBV) or genetic merit. Genomic selection can accelerate genetic progress by increasing accuracy, especially in young animals without progeny. The accuracy of GEBV can also be improved by combining data from other countries to increase the reference population (i.e., genotyped and phenotyped animals) in small, genotyped populations. Thus, the main objective of this study was to evaluate the accuracy of GEBV for young Brazilian Angus (BA) bulls and heifers with ssGBLUP, with or without the genotypes from American Angus sires. The accuracies with ssGBLUP were higher than those from traditional BLUP (EBV calculated from pedigree), improving accuracies by, on average, 16% for young bulls and heifers. Including genotypes from American Angus sires heavily used in Brazil just marginally increased the GEBV accuracies for selection candidates.

Rahimmadar S, Ghaffari M, Mokhber M, Williams JL. Linkage Disequilibrium and Effective Population Size of Buffalo Populations of Iran, Turkey, Pakistan, and Egypt Using a Medium Density SNP Array. Front Genet 2021;12:608186. [PMID: 34950186 PMCID: PMC8689148 DOI: 10.3389/fgene.2021.608186] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 09/19/2020] [Accepted: 11/03/2021] [Indexed: 11/21/2022] Open

Abstract

Linkage disequilibrium (LD) across the genome provides information to identify the genes and variations related to quantitative traits in genome-wide association studies (GWAS) and for the implementation of genomic selection (GS). LD can also be used to evaluate genetic diversity and population structure and reveal genomic regions affected by selection. LD structure and Ne were assessed in a set of 83 water buffaloes, comprising Azeri (AZI), Khuzestani (KHU), and Mazandarani (MAZ) breeds from Iran, Kundi (KUN) and Nili-Ravi (NIL) from Pakistan, Anatolian (ANA) buffalo from Turkey, and buffalo from Egypt (EGY). The values of corrected r² (defined as the correlation between two loci) of adjacent SNPs for three pooled Iranian breeds (IRI), ANA, EGY, and two pooled Pakistani breeds (PAK) populations were 0.24, 0.28, 0.27, and 0.22, respectively. The corrected r² between SNPs decreased with increasing physical distance from 100 Kb to 1 Mb. The LD values for IRI, ANA, EGY, and PAK populations were 0.16, 0.23, 0.24, and 0.21 for less than 100 Kb, respectively, which reduced rapidly to 0.018, 0.042, 0.059, and 0.024, for a distance of 1 Mb. In all the populations, the decay rate was low for distances greater than 2 Mb, up to the longest studied distance (15 Mb). The r² values for adjacent SNPs in unrelated samples indicated that the Affymetrix Axiom 90 K SNP genomic array was suitable for GWAS and GS in these populations. The persistency of LD phase (PLDP) between populations was assessed, and results showed that PLDP values between the populations were more than 0.9 for distances of less than 100 Kb. The Ne in the recent generations has declined to the extent that breeding plans are urgently required to ensure that these buffalo populations are not at risk of being lost. We found that results are affected by sample size, which could be partially corrected for; however, additional data should be obtained to be confident of the results.
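
As a rough illustration of the pairwise r² statistic referred to in the abstract above, the sketch below computes r² between two SNPs as the squared Pearson correlation of their genotype codes (0, 1 or 2 copies of an allele). This genotype-based form is an assumption made for the example; the abstract's sample-size correction ("corrected r²") is not reproduced, and the function names are hypothetical.

```python
from math import sqrt
from typing import Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation between two equally long numeric sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        # A monomorphic SNP has no variance, so r is undefined.
        raise ValueError("r is undefined for a monomorphic SNP")
    return sxy / sqrt(sxx * syy)


def ld_r2(genotypes_a: Sequence[int], genotypes_b: Sequence[int]) -> float:
    """Uncorrected r^2 between two SNPs from allele-count genotype codes."""
    return pearson_r(genotypes_a, genotypes_b) ** 2


# Two SNPs typed on the same four animals (made-up genotype codes).
print(ld_r2([0, 1, 2, 1], [0, 1, 2, 1]))  # identical genotype vectors
print(ld_r2([0, 2, 0, 2], [0, 0, 2, 2]))  # uncorrelated genotype vectors
```

Averaging such values over SNP pairs binned by physical distance gives an LD decay curve of the kind the abstract summarises.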

Eiríksson JH, Karaman E, Su G, Christensen OF. Breed of origin of alleles and genomic predictions for crossbred dairy cows. Genet Sel Evol 2021;53:84. [PMID: 34742238 PMCID: PMC8572482 DOI: 10.1186/s12711-021-00678-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 05/14/2021] [Accepted: 10/20/2021] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

In dairy cattle, genomic selection has been implemented successfully for purebred populations, but, to date, genomic estimated breeding values (GEBV) for crossbred cows are rarely available, although they are valuable for rotational crossbreeding schemes that are promoted as efficient strategies. An attractive approach to provide GEBV for crossbreds is to use estimated marker effects from the genetic evaluation of purebreds. The effects of each marker allele in crossbreds can depend on the breed of origin of the allele (BOA), thus applying marker effects based on BOA could result in more accurate GEBV than applying only proportional contribution of the purebreds. Application of BOA models in rotational crossbreeding requires methods for detecting BOA, but the existing methods have not been developed for rotational crossbreeding. Therefore, the aims of this study were to develop and test methods for detecting BOA in a rotational crossbreeding system, and to investigate methods for calculating GEBV for crossbred cows using estimated marker effects from purebreds.

RESULTS

For detecting BOA in crossbred cows from rotational crossbreeding for which pedigree is recorded, we developed the AllOr method based on the comparison of haplotypes in overlapping windows. To calculate the GEBV of crossbred cows, two models were compared: a BOA model where marker effects estimated from purebreds are combined based on the detected BOA; and a breed proportion model where marker effects are combined based on estimated breed proportions. The methods were tested on simulated data that mimic the first four generations of rotational crossbreeding between Holstein, Jersey and Red Dairy Cattle. The AllOr method detected BOA correctly for 99.6% of the marker alleles across the four crossbred generations. The reliability of GEBV was higher with the BOA model than with the breed proportion model for the four generations of crossbreeding, with the largest difference observed in the first generation.

CONCLUSIONS

In rotational crossbreeding for which pedigree is recorded, BOA can be accurately detected using the AllOr method. Combining marker effects estimated from purebreds to predict the breeding value of crossbreds based on BOA is a promising approach to provide GEBV for crossbred dairy cows.

Xiong J, Niu Y, Liu W, Zeng F, Cheng JF, Chen SQ, Zeng XZ. Effect of L3MBTL3/PTPN9 polymorphisms on risk to alcohol-induced ONFH in Chinese Han population. Neurol Sci 2021;43:2823-2830. [PMID: 34373992 DOI: 10.1007/s10072-021-05486-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/21/2021] [Accepted: 07/16/2021] [Indexed: 11/26/2022]

Kulski JK, Suzuki S, Shiina T. Haplotype Shuffling and Dimorphic Transposable Elements in the Human Extended Major Histocompatibility Complex Class II Region. Front Genet 2021;12:665899. [PMID: 34122517 PMCID: PMC8193847 DOI: 10.3389/fgene.2021.665899] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 02/09/2021] [Accepted: 04/12/2021] [Indexed: 12/26/2022] Open

Abstract

The major histocompatibility complex (MHC) on chromosome 6p21 is one of the most single-nucleotide polymorphism (SNP)-dense regions of the human genome and a prime model for the study and understanding of conserved sequence polymorphisms and structural diversity of ancestral haplotypes/conserved extended haplotypes. This study aimed to follow up on a previous analysis of the MHC class I region by using the same set of 95 MHC haplotype sequences downloaded from a publicly available BioProject database at the National Center for Biotechnology Information to identify and characterize the polymorphic human leukocyte antigen (HLA)-class II genes, the MTCO3P1 pseudogene alleles, the indels of transposable elements as haplotypic lineage markers, and SNP-density crossover (XO) loci at haplotype junctions in DNA sequence alignments of different haplotypes across the extended class II region (∼1 Mb) from the telomeric PRRT1 gene in class III to the COL11A2 gene at the centromeric end of class II. We identified 42 haplotypic indels (20 Alu, 7 SVA, 13 LTR or MERs, and 2 indels composed of a mosaic of different transposable elements) linked to particular HLA-class II alleles. Comparative sequence analyses of 136 haplotype pairs revealed 98 unique XO sites between SNP-poor and SNP-rich genomic segments with considerable haplotype shuffling located in the proximity of putative recombination hotspots. The majority of XO sites occurred across various regions including in the vicinity of MTCO3P1 between HLA-DQB1 and HLA-DQB3, between HLA-DQB2 and HLA-DOB, between DOB and TAP2, and between HLA-DOA and HLA-DPA1, where most XOs were within a HERVK22 sequence. We also determined the genomic positions of the PRDM9-recombination suppression sequence motif ATCCATG/CATGGAT and the PRDM9 recombination activation partial binding motif CCTCCCCT/AGGGGAG in the class II region of the human reference genome (NC_000006) relative to published meiotic recombination positions.
Both the recombination and anti-recombination PRDM9 binding motifs were widely distributed throughout the class II genomic regions with 50% or more found within repeat elements; the anti-recombination motifs were found mostly in L1 fragmented repeats. This study shows substantial haplotype shuffling between different polymorphic blocks and confirms the presence of numerous putative ancestral recombination sites across the class II region between various HLA class II genes.

Thakor PB, Hinsu AT, Bhatia DR, Shah TM, Nayee N, Sudhakar A, Rank DN, Joshi CG. High-throughput genotype-based population structure analysis of selected buffalo breeds. Transl Anim Sci 2021;5:txab033. [PMID: 33981962 PMCID: PMC8103726 DOI: 10.1093/tas/txab033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 06/15/2020] [Accepted: 05/01/2021] [Indexed: 11/29/2022] Open

Whalen A, Gorjanc G, Hickey JM. AlphaFamImpute: high-accuracy imputation in full-sib families from genotype-by-sequencing data. Bioinformatics 2020;36:4369-4371. [PMID: 32467963 PMCID: PMC7520044 DOI: 10.1093/bioinformatics/btaa499] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 12/10/2019] [Revised: 04/22/2020] [Accepted: 05/25/2020] [Indexed: 12/12/2022] Open

Zhan H, Zhang S, Zhang K, Peng X, Xie S, Li X, Zhao S, Ma Y. Genome-Wide Patterns of Homozygosity and Relevant Characterizations on the Population Structure in Piétrain Pigs. Genes (Basel) 2020;11:genes11050577. [PMID: 32455573 PMCID: PMC7291003 DOI: 10.3390/genes11050577] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 02/26/2020] [Revised: 05/11/2020] [Accepted: 05/11/2020] [Indexed: 01/06/2023] Open

Haplotype-Based Genome-Wide Association Study and Identification of Candidate Genes Associated with Carcass Traits in Hanwoo Cattle. Genes (Basel) 2020;11:genes11050551. [PMID: 32423003 PMCID: PMC7290854 DOI: 10.3390/genes11050551] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 03/05/2020] [Revised: 04/30/2020] [Accepted: 05/05/2020] [Indexed: 12/20/2022] Open

Alvarenga AB, Veroneze R, Oliveira HR, Marques DBD, Lopes PS, Silva FF, Brito LF. Comparing Alternative Single-Step GBLUP Approaches and Training Population Designs for Genomic Evaluation of Crossbred Animals. Front Genet 2020;11:263. [PMID: 32328083 PMCID: PMC7162606 DOI: 10.3389/fgene.2020.00263] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Received: 10/27/2019] [Accepted: 03/05/2020] [Indexed: 02/06/2023] Open

Abstract

As crossbreeding is extensively used in some livestock species, we aimed to evaluate the performance of single-step GBLUP (ssGBLUP) and weighted ssGBLUP (WssGBLUP) methods to predict Genomic Estimated Breeding Values (GEBVs) of crossbred animals. Different training population scenarios were evaluated: (SC1) ssGBLUP based on a single-trait model considering purebred and crossbred animals in a joint training population; (SC2) ssGBLUP based on a multiple-trait model to enable considering phenotypes recorded in purebred and crossbred training animals as different traits; (SC3) WssGBLUP based on a single-trait model considering purebred and crossbred animals jointly in the training population (both populations were used for SNP weights' estimation); (SC4) WssGBLUP based on a single-trait model considering only purebred animals in the training population (crossbred population only used for SNP weights' estimation); (SC5) WssGBLUP based on a single-trait model and the training population characterized by purebred animals (purebred population used for SNP weights' estimation). A complex trait was simulated assuming alternative genetic architectures. Different scaling factors to blend the inverse of the genomic ($G^{-1}$) and pedigree ($A_{22}^{-1}$) relationship matrices were also tested. The predictive performance of each scenario was evaluated based on the validation accuracy and regression coefficient. The genetic correlations across simulated populations in the different scenarios ranged from moderate to high (0.71-0.99). The scenario mimicking a completely polygenic trait ($h^2_{QTL} = 0$) yielded the lowest validation accuracy (0.12; for SC3 and SC4). The simulated scenarios assuming 4,500 QTLs affecting the trait and $h^2_{QTL} = h^2$ resulted in the greatest GEBV accuracies (0.47; for SC1 and SC2). The regression coefficients ranged from 0.28 (for SC3 assuming polygenic effect) to 1.27 (for SC2 considering 4,500 QTLs).
In general, SC3 and SC5 resulted in inflated GEBVs, whereas other scenarios yielded deflated GEBVs. The scaling factors used to combine $G^{-1}$ and $A_{22}^{-1}$ had a small influence on the validation accuracies, but a greater effect on the regression coefficients. Due to the complexity of multiple-trait models and WssGBLUP analyses, and a similar predictive performance across the methods evaluated, SC1 is recommended for genomic evaluation in crossbred populations with similar genetic structures [moderate-to-high (0.71-0.99) genetic correlations between purebred and crossbred populations].
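
For orientation, the blending of the two inverse relationship matrices mentioned in the abstract above is commonly written in the single-step literature as shown below. This is given as an assumed standard form, not as the exact expression used by the authors; the scaling-factor symbols τ and ω are labels chosen here and do not appear in the abstract.

```latex
H^{-1} = A^{-1} +
\begin{bmatrix}
0 & 0 \\
0 & \tau\, G^{-1} - \omega\, A_{22}^{-1}
\end{bmatrix}
```

Here A is the pedigree relationship matrix for all animals, A_22 is its block for the genotyped animals, and G is the genomic relationship matrix. Setting τ = ω = 1 gives the unscaled form.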

Pierce C, Speidel S, Coleman S, Enns R, Bailey D, Medrano J, Cánovas A, Meiman P, Howery L, Mandeville W, Thomas M. Genome-wide association studies of beef cow terrain-use traits using Bayesian multiple-SNP regression. Livest Sci 2020. [DOI: 10.1016/j.livsci.2019.103900] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Indexed: 01/11/2023]

Mouresan EF, González-Rodríguez A, Cañas-Álvarez JJ, Munilla S, Altarriba J, Díaz C, Baró JA, Molina A, Lopez-Buesa P, Piedrafita J, Varona L. Mapping Recombination Rate on the Autosomal Chromosomes Based on the Persistency of Linkage Disequilibrium Phase Among Autochthonous Beef Cattle Populations in Spain. Front Genet 2019;10:1170. [PMID: 31824571 PMCID: PMC6880760 DOI: 10.3389/fgene.2019.01170] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 11/16/2018] [Accepted: 10/23/2019] [Indexed: 01/14/2023] Open

Abstract

In organisms with sexual reproduction, genetic diversity and genome evolution are governed by meiotic recombination caused by crossing-over, which is known to vary within the genome. In this study, we propose a simple method to estimate the recombination rate that makes use of the persistency of linkage disequilibrium (LD) phase among closely related populations. The biological material comprised 171 triplets (sire/dam/offspring) from seven populations of autochthonous beef cattle in Spain (Asturiana de los Valles, Avileña-Negra Ibérica, Bruna dels Pirineus, Morucha, Pirenaica, Retinta, and Rubia Gallega), which were genotyped for 777,962 SNPs with the BovineHD BeadChip. After standard quality filtering, we reconstructed the haplotype phases in the parental individuals and calculated the LD by the correlation (r) between each pair of markers that had a genetic distance < 1 Mb. Subsequently, these correlations were used to calculate the persistency of LD phase between each pair of populations along the autosomal genome. Therefore, the distribution of the recombination rate along the genome can be inferred since the effect of the number of generations of divergence should be equivalent throughout the genome. In our study, the recombination rate was highest in the largest chromosomes and at the distal portion of the chromosomes. In addition, the persistency of LD phase was highly heterogeneous throughout the genome, with a ratio of 25.4 times between the estimates of the recombination rates from the genomic regions that had the highest (BTA18-7.1 Mb) and the lowest (BTA12-42.4 Mb) estimates. Finally, an overrepresentation enrichment analysis (ORA) showed differences in the enriched gene ontology (GO) terms between the genes located in the genomic regions with estimates of the recombination rate over (or below) the 95th (or 5th) percentile throughout the autosomal genome.
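
The persistency of LD phase used in the abstract above is usually computed as the correlation, across marker pairs, of the signed LD correlations (r) observed in two populations. The sketch below shows that calculation under this assumption; the function name, the input layout (one r value per marker pair, in the same order for both populations) and the example numbers are all hypothetical.

```python
from math import sqrt
from typing import Sequence


def persistency_of_phase(r_pop_a: Sequence[float], r_pop_b: Sequence[float]) -> float:
    """Pearson correlation of signed LD values (r) between two populations.

    Each position holds r for the same marker pair in both populations.
    Values near 1 mean LD phase is conserved; values near 0 mean it is not.
    """
    n = len(r_pop_a)
    if n != len(r_pop_b) or n < 2:
        raise ValueError("need the same marker pairs (at least 2) in both populations")
    mean_a = sum(r_pop_a) / n
    mean_b = sum(r_pop_b) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(r_pop_a, r_pop_b))
    var_a = sum((a - mean_a) ** 2 for a in r_pop_a)
    var_b = sum((b - mean_b) ** 2 for b in r_pop_b)
    if var_a == 0 or var_b == 0:
        raise ValueError("persistency is undefined when r does not vary")
    return cov / sqrt(var_a * var_b)


# Made-up r values for three marker pairs in one genomic window.
breed_1 = [0.50, -0.20, 0.30]
breed_2 = [0.45, -0.10, 0.35]
print(persistency_of_phase(breed_1, breed_2))
```

Computing this within windows along each chromosome gives a genome-wide profile; regions where phase persists less between populations are the ones the abstract interprets as having higher recombination.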

Thomasen JR, Liu H, Sørensen AC. Genotyping more cows increases genetic gain and reduces rate of true inbreeding in a dairy cattle breeding scheme using female reproductive technologies. J Dairy Sci 2019;103:597-606. [PMID: 31733861 DOI: 10.3168/jds.2019-16974] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Received: 05/16/2019] [Accepted: 09/23/2019] [Indexed: 12/26/2022]

Abstract

Both small dairy cattle populations and dairy cattle populations with a low level of linkage disequilibrium (LD) suffer from low reliability of genomic prediction. In this study, we investigated whether adding more genotyped cows to the reference population influences the rate of genetic gain and rate of inbreeding by affecting the reliability. A standard breeding program with a large reference population and high LD, which mimicked a breeding program for Danish Holstein population, was simulated as a reference. A Danish Jersey population with a small reference population and high LD and a Red Dairy Cattle population with a large reference population and low LD were also simulated. Two additional breeding programs were simulated for Danish Jersey and Red Dairy Cattle populations, where 2,000 additional genotyped cows were included in the population for genomic selection. All 5 simulated breeding programs were initiated by a founder population to generate LD resembling the real LD pattern, followed by a 20-yr conventional progeny-testing scheme with 1,000 or 10,000 genotyped progeny-tested bulls and a 10-yr genomic selection scheme with or without 2,000 additional genotyped cows. Evaluation criteria were annual monetary genetic gain and rate of true inbreeding. Our results showed that adding more genotyped cows to the reference in dairy cattle populations has the potential to increase genetic gain and reduce the rate of inbreeding, regardless of reference population size and level of LD. However, it is still not possible to reach the same genetic gain as in the simulated Danish Holstein population with either a small reference population or low LD. Our results also showed that in a small reference population with high LD, it is difficult to manage inbreeding because of lower accuracy compared with the simulated Danish Holstein population and a smaller number of relevant families to select from. 
Therefore, breeding strategies need to be chosen to match population size and structure. The rate of true inbreeding is always underestimated by pedigree inbreeding and even more in genomic breeding programs, indicating that some forms of genome-wide inbreeding, instead of pedigree-based inbreeding, should be used to monitor inbreeding when genomic selection is implemented.

Benjelloun B, Boyer F, Streeter I, Zamani W, Engelen S, Alberti A, Alberto FJ, BenBati M, Ibnelbachyr M, Chentouf M, Bechchari A, Rezaei HR, Naderi S, Stella A, Chikhi A, Clarke L, Kijas J, Flicek P, Taberlet P, Pompanon F. An evaluation of sequencing coverage and genotyping strategies to assess neutral and adaptive diversity. Mol Ecol Resour 2019;19:1497-1515. [PMID: 31359622 PMCID: PMC7115901 DOI: 10.1111/1755-0998.13070] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Received: 07/27/2018] [Revised: 06/30/2019] [Accepted: 07/08/2019] [Indexed: 12/12/2022]

Abstract

Whole genome sequences (WGS) greatly increase our ability to precisely infer population genetic parameters, demographic processes, and selection signatures. However, WGS may still not be affordable for a representative number of individuals/populations. In this context, our goal was to assess the efficiency of several SNP genotyping strategies by testing their ability to accurately estimate parameters describing neutral diversity and to detect signatures of selection. We analysed 110 WGS at 12× coverage for four different species, i.e., sheep, goats and their wild counterparts. From these data we generated 946 data sets corresponding to random panels of 1K to 5M variants, commercial SNP chips and exome capture, for sample sizes of five to 48 individuals. We also extracted low-coverage genome resequencing of 1×, 2× and 5× by randomly subsampling reads from the 12× resequencing data. Globally, 5K to 10K random variants were enough for an accurate estimation of genome diversity. Conversely, commercial panels and exome capture displayed strong ascertainment biases. Besides the characterization of neutral diversity, the detection of the signature of selection and the accurate estimation of linkage disequilibrium (LD) required high-density panels of at least 1M variants. Finally, genotype likelihoods increased the quality of variant calling from low coverage resequencing but proportions of incorrect genotypes remained substantial, especially for heterozygote sites. Whole genome resequencing coverage of at least 5× appeared to be necessary for accurate assessment of genomic variations. These results have implications for studies seeking to deploy low-density SNP collections or genome scans across genetically diverse populations/species showing similar genetic characteristics and patterns of LD decay for a wide variety of purposes.

Affiliation(s)

Badr Benjelloun Univ. Grenoble-Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, F-38000 Grenoble, France; National Institute of Agronomic Research (INRA Maroc), Regional Centre of Agronomic Research, 23000 Beni-Mellal, Morocco
Frédéric Boyer Univ. Grenoble-Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, F-38000 Grenoble, France
Ian Streeter European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD UK
Wahid Zamani Univ. Grenoble-Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, F-38000 Grenoble, France; Department of Environmental Sciences, Faculty of Natural Resources and Marine Sciences, Tarbiat Modares University, 46417-76489 Noor, Mazandaran, Iran
Stefan Engelen CEA - Institut de biologie François-Jacob, Genoscope, 2 Rue Gaston Cremieux 91057 Evry Cedex, France
Adriana Alberti CEA - Institut de biologie François-Jacob, Genoscope, 2 Rue Gaston Cremieux 91057 Evry Cedex, France
Florian J. Alberto Univ. Grenoble-Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, F-38000 Grenoble, France
Mohamed BenBati National Institute of Agronomic Research (INRA Maroc), Regional Centre of Agronomic Research, 23000 Beni-Mellal, Morocco
Mustapha Ibnelbachyr National Institute of Agronomic Research (INRA Maroc), CRRA Errachidia, 52000 Errachidia, Morocco
Mouad Chentouf National Institute of Agronomic Research (INRA Maroc), CRRA Tangier, 90010 Tangier, Morocco
Abdelmajid Bechchari National Institute of Agronomic Research (INRA Maroc), CRRA Oujda, 60000 Oujda, Morocco
Hamid R. Rezaei Department of Environmental Sci, Gorgan University of Agricultural Sciences & Natural Resources, 41996-13776 Gorgan, Iran
Saeid Naderi Environmental Sciences Department, Natural Resources Faculty, University of Guilan, 49138-15749 Guilan, Iran
Alessandra Stella PTP Science Park, Bioinformatics Unit, Via Einstein-Loc. Cascina Codazza, 26900 Lodi, Italy
Abdelkader Chikhi National Institute of Agronomic Research (INRA Maroc), CRRA Errachidia, 52000 Errachidia, Morocco
Laura Clarke European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD UK
James Kijas Commonwealth Scientific and Industrial Research Organisation Animal Food and Health Sciences, St Lucia, QLD 4067, Australia
Paul Flicek European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD UK
Pierre Taberlet Univ. Grenoble-Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, F-38000 Grenoble, France
François Pompanon Univ. Grenoble-Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, F-38000 Grenoble, France

Obšteter J, Jenko J, Hickey JM, Gorjanc G. Efficient use of genomic information for sustainable genetic improvement in small cattle populations. J Dairy Sci 2019;102:9971-9982. [PMID: 31477287 DOI: 10.3168/jds.2019-16853] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Accepted: 07/13/2019] [Indexed: 11/19/2022]

Abstract

In this study, we compared genetic gain, genetic variation, and the efficiency of converting variation into gain under different genomic selection scenarios with truncation or optimum contribution selection in a small dairy population by simulation. Breeding programs have to maximize genetic gain but also ensure sustainability by maintaining genetic variation. Numerous studies have shown that genomic selection increases genetic gain. Although genomic selection is a well-established method, small populations still struggle with choosing the most sustainable strategy to adopt this type of selection. We developed a simulator of a dairy population and simulated a model after the Slovenian Brown Swiss population with ∼10,500 cows. We compared different truncation selection scenarios by varying (1) the method of sire selection and their use on cows or bull-dams, and (2) selection intensity and the number of years a sire is in use. Furthermore, we compared different optimum contribution selection scenarios with optimization of sire selection and their usage. We compared scenarios in terms of genetic gain, selection accuracy, generation interval, genetic and genic variance, rate of coancestry, effective population size, and conversion efficiency. The results showed that early use of genomically tested sires increased genetic gain compared with progeny testing, as expected from changes in selection accuracy and generation interval. A faster turnover of sires from year to year and higher intensity increased the genetic gain even further but increased the loss of genetic variation per year. Although maximizing intensity gave the lowest conversion efficiency, faster turnover of sires gave an intermediate conversion efficiency. The largest conversion efficiency was achieved with the simultaneous use of genomically and progeny-tested sires that were used over several years. 
Compared with truncation selection, optimizing sire selection and their usage increased the conversion efficiency by achieving either comparable genetic gain for a smaller loss of genetic variation or higher genetic gain for a comparable loss of genetic variation. Our results will help breeding organizations implement sustainable genomic selection.

Fallahi MH, Shahrbabak HM, Shahrbabak MM, Arpanahi RA, Gholami S. Detection of Haplotypic Structure for Genome of Azerbaijani Buffalo Using High Density SNP Markers. RUSS J GENET+ 2019. [DOI: 10.1134/s1022795419080040] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Mokhber M, Shahrbabak MM, Sadeghi M, Shahrbabak HM, Stella A, Nicolzzi E, Williams JL. Study of whole genome linkage disequilibrium patterns of Iranian water buffalo breeds using the Axiom Buffalo Genotyping 90K Array. PLoS One 2019;14:e0217687. [PMID: 31150486 PMCID: PMC6544294 DOI: 10.1371/journal.pone.0217687] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2018] [Accepted: 05/16/2019] [Indexed: 01/21/2023] Open

Barría A, Christensen KA, Yoshida G, Jedlicki A, Leong JS, Rondeau EB, Lhorente JP, Koop BF, Davidson WS, Yáñez JM. Whole Genome Linkage Disequilibrium and Effective Population Size in a Coho Salmon (Oncorhynchus kisutch) Breeding Population Using a High-Density SNP Array. Front Genet 2019;10:498. [PMID: 31191613 PMCID: PMC6539196 DOI: 10.3389/fgene.2019.00498] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2018] [Accepted: 05/07/2019] [Indexed: 12/19/2022] Open

Abstract

The estimation of linkage disequilibrium between molecular markers within a population is critical when establishing the minimum number of markers required for association studies, genomic selection, and inferring historical events influencing different populations. This work aimed to evaluate the extent and decay of linkage disequilibrium in a coho salmon breeding population using a high-density SNP array. Linkage disequilibrium was estimated between a total of 93,502 SNPs found in 64 individuals (33 dams and 31 sires) from the breeding population. The markers encompass all 30 coho salmon chromosomes and comprise 1,684.62 Mb of the genome. The average density of markers per chromosome ranged from 48.31 to 66 per 1 Mb. The minor allele frequency averaged 0.26 (with a range from 0.22 to 0.27). The overall average linkage disequilibrium among SNP pairs measured as r² was 0.10. The average r² value decreased with increasing physical distance, with values ranging from 0.21 to 0.07 at a distance lower than 1 kb and up to 10 Mb, respectively. An r² threshold of 0.2 was reached at a distance of approximately 40 kb. Chromosomes Okis05, Okis15 and Okis28 showed high levels of linkage disequilibrium (>0.20 at distances lower than 1 Mb). Average r² values were lower than 0.15 for all chromosomes at distances greater than 4 Mb. An effective population size of 43 was estimated for the population 10 generations ago, and 325, for 139 generations ago. Based on the effective number of chromosome segments, we suggest that at least 74,000 SNPs would be necessary for an association mapping study and genomic predictions. Therefore, the SNP panel used allowed us to capture high-resolution information in the farmed coho salmon population. Furthermore, based on the contemporary N_e, a new mate allocation strategy is suggested to increase the effective population size.

Chhotaray S, Panigrahi M, Pal D, Ahmad SF, Bhanuprakash V, Kumar H, Parida S, Bhushan B, Gaur GK, Mishra BP, Singh RK. Genome-wide estimation of inbreeding coefficient, effective population size and haplotype blocks in Vrindavani crossbred cattle strain of India. BIOL RHYTHM RES 2019. [DOI: 10.1080/09291016.2019.1600266] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Schäler J, Wellmann R, Bennewitz J, Thaller G, Hinrichs D. Genetic diversity and historic introgression in German Angler and Red Dual Purpose cattle and possibilities to reverse introgression. ACTA AGR SCAND A-AN 2019. [DOI: 10.1080/09064702.2019.1600011] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Xu L, Zhu B, Wang Z, Xu L, Liu Y, Chen Y, Zhang L, Gao X, Gao H, Zhang S, Xu L, Li J. Evaluation of Linkage Disequilibrium, Effective Population Size and Haplotype Block Structure in Chinese Cattle. Animals (Basel) 2019;9:ani9030083. [PMID: 30845681 PMCID: PMC6466336 DOI: 10.3390/ani9030083] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Revised: 02/21/2019] [Accepted: 02/22/2019] [Indexed: 11/16/2022] Open

Abstract

Simple Summary

Evaluation of the population structure and linkage disequilibrium can offer important insights to fully understand the genetic diversity and population history of cattle, which can enable us to appropriately design and implement GWAS and GS in cattle. In this study, we characterized the extent of genome-wide LD and the haplotype block structure, and estimated the persistence of phase of Chinese indigenous cattle with Illumina BovineHD BeadChip. According to our study, 58K, 87K, 95K, 52K, and 52K markers would be necessary for SCHC, NCC, SWC, SIM, and WAG, respectively, in the implementation of GWAS and GS and combining a multipopulation with high persistence of phase is feasible for the implication of genomic selection for Chinese beef cattle.

Abstract

Understanding the linkage disequilibrium (LD) across the genome, haplotype structure, and persistence of phase between breeds can enable us to appropriately design and implement the genome-wide association (GWAS) and genomic selection (GS) in beef cattle. We estimated the extent of genome-wide LD, haplotype block structure, and the persistence of phase in 10 Chinese cattle populations using the high-density BovineHD BeadChip. The overall LD measured by r² between adjacent SNPs was 0.60, 0.67, 0.58, 0.73, and 0.71 for South Chinese cattle (SCHC), North Chinese cattle (NCC), Southwest Chinese cattle (SWC), Simmental (SIM), and Wagyu (WAG), respectively. The highest correlation (0.53) for persistence of phase across groups was observed for SCHC vs. SWC at distances of 0–50 kb, while the lowest correlation was 0.13 for SIM vs. SCHC at the same distances. In addition, the estimated current effective population sizes were 27, 14, 31, 34, and 43 for SCHC, NCC, SWC, SIM, and WAG, respectively. Our result showed that 58K, 87K, 95K, 52K, and 52K markers were required for implementation of GWAS and GS in SCHC, NCC, SWC, SIM, and WAG, respectively. Also, our findings suggested that the implication of genomic selection for multipopulation with high persistence of phase is feasible for Chinese cattle.

Jenko J, McClure MC, Matthews D, McClure J, Johnsson M, Gorjanc G, Hickey JM. Analysis of a large dataset reveals haplotypes carrying putatively recessive lethal and semi-lethal alleles with pleiotropic effects on economically important traits in beef cattle. Genet Sel Evol 2019;51:9. [PMID: 30836944 PMCID: PMC6402105 DOI: 10.1186/s12711-019-0452-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2018] [Accepted: 02/21/2019] [Indexed: 12/20/2022] Open

Abstract

BACKGROUND

In livestock, deleterious recessive alleles can result in reduced economic performance of homozygous individuals in multiple ways, e.g. early embryonic death, death soon after birth, or semi-lethality with incomplete penetrance causing reduced viability. While death is an easy phenotype to score, reduced viability is not as easy to identify. However, it can sometimes be observed as reduced conception rates, longer calving intervals, or lower survival for live born animals.

METHODS

In this paper, we searched for haplotypes that carry putatively recessive lethal or semi-lethal alleles in 132,725 genotyped Irish beef cattle from five breeds: Aberdeen Angus, Charolais, Hereford, Limousin, and Simmental. We phased the genotypes in sliding windows along the genome and used five tests to identify haplotypes with absence of or reduced homozygosity. Then, we associated the identified haplotypes with 44,351 insemination records that indicated early embryonic death, and postnatal survival records. Finally, we assessed haplotype pleiotropy by estimating substitution effects on estimates of breeding value for 15 economically important traits in beef production.

RESULTS

We found support for one haplotype that carries a putatively recessive lethal (chromosome 16 in Simmental) and two haplotypes that carry semi-lethal alleles (chromosome 14 in Aberdeen Angus and chromosome 19 in Charolais), with population frequencies of 8.8, 15.2, and 14.4%, respectively. These three haplotypes showed pleiotropic effects on economically important traits for beef production. Their allele substitution effects are €2.30, €3.42, and €1.47 for the terminal index and €1.03, - €3.11, and - €0.88 for the replacement index, where the standard deviations for the terminal index are €22.52, €18.65, and €22.70 and for the replacement index they are €31.35, €29.82, and €35.79. We identified ZFAT as the candidate gene for semi-lethality in Aberdeen Angus, several candidate genes for the lethal Simmental haplotype, and no candidate genes for the semi-lethal Charolais haplotype.

CONCLUSIONS

We analysed genotype, reproduction, survival, and production data to detect haplotypes that carry putatively recessive lethal or semi-lethal alleles in Irish beef cattle and identified one lethal and two semi-lethal haplotypes, which have pleiotropic effects on economically important traits in beef production.

Whalen A, Gorjanc G, Hickey JM. Parentage assignment with genotyping-by-sequencing data. J Anim Breed Genet 2019;136:102-112. [PMID: 30548685 PMCID: PMC6392119 DOI: 10.1111/jbg.12370] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2018] [Revised: 11/09/2018] [Accepted: 11/10/2018] [Indexed: 01/04/2023]

Jemaa SB, Thamri N, Mnara S, Rebours E, Rocha D, Boussaha M. Linkage disequilibrium and past effective population size in native Tunisian cattle. Genet Mol Biol 2019;42:52-61. [PMID: 30776288 PMCID: PMC6428135 DOI: 10.1590/1678-4685-gmb-2017-0342] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2017] [Accepted: 05/14/2018] [Indexed: 12/30/2022] Open

Bresolin T, Rosa GJDM, Valente BD, Espigolan R, Gordo DGM, Braz CU, Fernandes Júnior GA, Magalhães AFB, Garcia DA, Frezarim GB, Leão GFC, Carvalheiro R, Baldi F, Nunes de Oliveira H, Galvão de Albuquerque L. Effect of quality control, density and allele frequency of markers on the accuracy of genomic prediction for complex traits in Nellore cattle. ANIMAL PRODUCTION SCIENCE 2019. [DOI: 10.1071/an16821] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Abstract

This study was designed to test the impact of quality control, density and allele frequency of single nucleotide polymorphism (SNP) markers on the accuracy of genomic predictions, using three traits with different heritabilities and two methods of prediction in a Nellore cattle population genotyped with the Illumina Bovine HD Assay. A total of 1756, 3150 and 3119 records of age at first calving (AFC), weaning weight (WW) and yearling weight (YW), respectively, were used. Three scenarios with different exclusion thresholds for minor allele frequency (MAF), deviation from Hardy–Weinberg equilibrium (HWE) and correlation between SNP pairs (r²) were constructed for all traits: (1) high rigor (S1): call rate <0.98, MAF <0.05, HWE with P < 10⁻⁵, and r² >0.999; (2) moderate rigor (S2): call rate <0.85 and MAF <0.01; (3) low rigor (S3): only non-autosomal SNP and those mapped on the same position were excluded. Additionally, to assess the prediction accuracy from different marker densities, six panels (10K, 50K, 100K, 300K, 500K and 700K) were customised using the high-density genotyping assay as reference. Finally, from the markers available in the high-density genotyping assay, six groups (G) with different minor allele frequency bins were defined to estimate the accuracy of genomic prediction. The range of MAF bins was approximately equal for the traits studied: G1 (0.000–0.009), G2 (0.010–0.064), G3 (0.065–0.174), G4 (0.175–0.325), G5 (0.326–0.500) and G6 (0.000–0.500). The Genomic Best Linear Unbiased Predictor and BayesCπ methods were used to estimate the SNP marker effects. Five-fold cross-validation was used to measure the accuracy of genomic prediction for all scenarios. There were no effects of genotype quality control criteria on the accuracies of genomic predictions. For all traits, the higher density panel did not provide greater prediction accuracies than the low density one (10K panel).
The groups of SNP with low MAF (MAF ≤0.007 for AFC, MAF ≤0.009 for WW and MAF ≤0.008 for YW) provided lower prediction accuracies than the groups with higher allele frequencies.

Fonseca PADS, dos Santos FC, Lam S, Suárez-Vega A, Miglior F, Schenkel FS, Diniz LDAF, Id-Lahoucine S, Carvalho MRS, Cánovas A. Genetic mechanisms underlying spermatic and testicular traits within and among cattle breeds: systematic review and prioritization of GWAS results. J Anim Sci 2018;96:4978-4999. [PMID: 30304443 PMCID: PMC6276581 DOI: 10.1093/jas/sky382] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Accepted: 09/27/2018] [Indexed: 12/20/2022] Open

Abstract

Reduced bull fertility imposes economic losses in bovine herds. Specifically, testicular and spermatic traits are important indicators of reproductive efficiency. Several genome-wide association studies (GWAS) have identified genomic regions associated with these fertility traits. The aims of this study were as follows: 1) to perform a systematic review of GWAS results for spermatic and testicular traits in cattle and 2) to identify key functional candidate genes for these traits. The identification of functional candidate genes was performed using a systems biology approach, where genes shared between traits and studies were evaluated by a guilt by association gene prioritization (GUILDify and ToppGene software) in order to identify the best functional candidates. These candidate genes were integrated and analyzed in order to identify overlapping patterns among traits and breeds. Results showed that GWAS for testicular-related traits have been developed for beef breeds only, whereas the majority of GWAS for spermatic-related traits were conducted using dairy breeds. When comparing traits measured within the same study, the highest number of genes shared between different traits was observed, indicating a high impact of the population genetic structure and environmental effects. Several chromosomal regions were enriched for functional candidate genes associated with fertility traits. Moreover, multiple functional candidate genes were enriched for markers on a species-specific basis, taurine (Bos taurus) or indicine (Bos indicus). For the different candidate regions identified in the GWAS in the literature, functional candidate genes were detected as follows: B. taurus chromosome X (BTX) (TEX11, IRAK, CDK16, ATP7A, ATRX, HDAC6, FMR1, L1CAM, MECP2, etc.), BTA17 (TRPV4 and DYNLL1), and BTA14 (MOS, FABP5, ZFPM2).
These genes are responsible for regulating important metabolic pathways or biological processes associated with fertility, such as progression of spermatogenesis, control of ciliary activity, development of Sertoli cells, DNA integrity in spermatozoa, and homeostasis of testicular cells. This study represents the first systematic review on male fertility traits in cattle using a system biology approach to identify key candidate genes for these traits.

Fonseca PADS, Id-Lahoucine S, Reverter A, Medrano JF, Fortes MS, Casellas J, Miglior F, Brito L, Carvalho MRS, Schenkel FS, Nguyen LT, Porto-Neto LR, Thomas MG, Cánovas A. Combining multi-OMICs information to identify key-regulator genes for pleiotropic effect on fertility and production traits in beef cattle. PLoS One 2018;13:e0205295. [PMID: 30335783 PMCID: PMC6193631 DOI: 10.1371/journal.pone.0205295] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2018] [Accepted: 09/21/2018] [Indexed: 12/21/2022] Open

Abstract

The identification of biological processes related to the regulation of complex traits is a difficult task. Commonly, complex traits are regulated through a multitude of genes contributing each to a small part of the total genetic variance. Additionally, some loci can simultaneously regulate several complex traits, a phenomenon defined as pleiotropy. The lack of understanding on the biological processes responsible for the regulation of these traits results in the decrease of selection efficiency and the selection of undesirable hitchhiking effects. The identification of pleiotropic key-regulator genes can assist in developing important tools for investigating biological processes underlying complex traits. A multi-breed and multi-OMICs approach was applied to study the pleiotropic effects of key-regulator genes using three independent beef cattle populations evaluated for fertility traits. A pleiotropic map for 32 traits related to growth, feed efficiency, carcass and meat quality, and reproduction was used to identify genes shared among the different populations and breeds in pleiotropic regions. Furthermore, data-mining analyses were performed using the Cattle QTL database (CattleQTLdb) to identify the QTL category annotated in the regions around the genes shared among breeds. This approach allowed the identification of a main gene network (composed of 38 genes) shared among breeds. This gene network was significantly associated with thyroid activity, among other biological processes, and displayed a high regulatory potential. In addition, it was possible to identify genes with pleiotropic effects related to crucial biological processes that regulate economically relevant traits associated with fertility, production and health, such as MYC, PPARG, GSK3B, TG and IYD genes. 
These genes will be further investigated to better understand the biological processes involved in the expression of complex traits and assist in the identification of functional variants associated with undesirable phenotypes, such as decreased fertility, poor feed efficiency and negative energetic balance.

Affiliation(s)

Pablo Augusto de Souza Fonseca University of Guelph, Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, Guelph, Ontario, Canada; Universidade Federal de Minas Gerais, Departamento de Biologia Geral, Belo Horizonte, Minas Gerais, Brazil
Samir Id-Lahoucine University of Guelph, Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, Guelph, Ontario, Canada
Antonio Reverter CSIRO Agriculture and Food, Queensland Bioscience Precinct, Brisbane, Queensland, Australia
Juan F. Medrano University of California-Davis, Department of Animal Science, Davis, California, United States of America
Marina S. Fortes The University of Queensland, School of Chemistry and Molecular Biosciences, Brisbane, Queensland, Australia
Joaquim Casellas Universitat Autònoma de Barcelona, Departament de Ciència Animal i dels Aliments, Barcelona, Bellaterra, Barcelona, Spain
Filippo Miglior University of Guelph, Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, Guelph, Ontario, Canada; Canadian Dairy Network, Guelph, Ontario, Canada
Luiz Brito University of Guelph, Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, Guelph, Ontario, Canada
Maria Raquel S. Carvalho Universidade Federal de Minas Gerais, Departamento de Biologia Geral, Belo Horizonte, Minas Gerais, Brazil
Flávio S. Schenkel University of Guelph, Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, Guelph, Ontario, Canada
Loan T. Nguyen The University of Queensland, School of Chemistry and Molecular Biosciences, Brisbane, Queensland, Australia
Laercio R. Porto-Neto CSIRO Agriculture and Food, Queensland Bioscience Precinct, Brisbane, Queensland, Australia
Milton G. Thomas Colorado State University, Department of Animal Science, Fort Collins, Colorado, United States of America
Angela Cánovas University of Guelph, Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, Guelph, Ontario, Canada

Whalen A, Gorjanc G, Ros-Freixedes R, Hickey JM. Assessment of the performance of hidden Markov models for imputation in animal breeding. Genet Sel Evol 2018;50:44. [PMID: 30223768 PMCID: PMC6142395 DOI: 10.1186/s12711-018-0416-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2017] [Accepted: 09/05/2018] [Indexed: 12/31/2022] Open

Abstract

Background

In this paper, we review the performance of various hidden Markov model-based imputation methods in animal breeding populations. Traditionally, pedigree and heuristic-based imputation methods have been used for imputation in large animal populations due to their computational efficiency, scalability, and accuracy. Recent advances in the area of human genetics have increased the ability of probabilistic hidden Markov model methods to perform accurate phasing and imputation in large populations. These advances may enable these methods to be useful for routine use in large animal populations, particularly in populations where pedigree information is not readily available.

Methods

To test the performance of hidden Markov model-based imputation, we evaluated the accuracy and computational cost of several methods in a series of simulated populations and a real animal population without using a pedigree. First, we tested single-step (diploid) imputation, which performs both phasing and imputation. Second, we tested pre-phasing followed by haploid imputation. Overall, we used four available diploid imputation methods (fastPHASE, Beagle v4.0, IMPUTE2, and MaCH), three phasing methods, (SHAPEIT2, HAPI-UR, and Eagle2), and three haploid imputation methods (IMPUTE2, Beagle v4.1, and Minimac3).

Results

We found that performing pre-phasing and haploid imputation was faster and more accurate than diploid imputation. In particular, among all the methods tested, pre-phasing with Eagle2 or HAPI-UR and imputing with Minimac3 or IMPUTE2 gave the highest accuracies with both simulated and real data.

Conclusions

The results of this study suggest that hidden Markov model-based imputation algorithms are an accurate and computationally feasible approach for performing imputation without a pedigree when pre-phasing and haploid imputation are used. Of the algorithms tested, the combination of Eagle2 and Minimac3 gave the highest accuracy across the simulated and real datasets.

Nandolo W, Utsunomiya YT, Mészáros G, Wurzinger M, Khayadzadeh N, Torrecilha RBP, Mulindwa HA, Gondwe TN, Waldmann P, Ferenčaković M, Garcia JF, Rosen BD, Bickhart D, van Tassell CP, Curik I, Sölkner J. Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances. Genet Sel Evol 2018;50:43. [PMID: 30134820 PMCID: PMC6106898 DOI: 10.1186/s12711-018-0414-x] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Accepted: 07/30/2018] [Indexed: 12/22/2022] Open

Abstract

Background

Runs of homozygosity (ROH) islands are stretches of homozygous sequence in the genome of a large proportion of individuals in a population. Algorithms for the detection of ROH depend on the similarity of haplotypes. Coverage gaps and copy number variants (CNV) may result in incorrect identification of such similarity, leading to the detection of ROH islands where none exists. Misidentified hemizygous regions will also appear as homozygous based on sequence variation alone. Our aim was to identify ROH islands influenced by marker coverage gaps or CNV, using Illumina BovineHD BeadChip (777 K) single nucleotide polymorphism (SNP) data for Austrian Brown Swiss, Tyrol Grey and Pinzgauer cattle.

Methods

ROH were detected using clustering, and ROH islands were determined from population inbreeding levels for each marker. CNV were detected using a multivariate copy number analysis method and a hidden Markov model. SNP coverage gaps were defined as genomic regions with intermarker distances on average longer than 9.24 kb. ROH islands that overlapped CNV regions (CNVR) or SNP coverage gaps were considered as potential artefacts. Permutation tests were used to determine if overlaps between CNVR with copy losses and ROH islands were due to chance. Diversity of the haplotypes in the ROH islands was assessed by haplotype analyses.

Results

In Brown Swiss, Tyrol Grey and Pinzgauer, we identified 13, 22, and 24 ROH islands covering 26.6, 389.0 and 35.8 Mb, respectively, and we detected 30, 50 and 71 CNVR derived from CNV by using both algorithms, respectively. Overlaps between ROH islands, CNVR or coverage gaps occurred for 7, 14 and 16 ROH islands, respectively. About 37, 44 and 52% of the ROH islands coverage in Brown Swiss, Tyrol Grey and Pinzgauer, respectively, were affected by copy loss. Intersections between ROH islands and CNVR were small, but significantly larger compared to ROH islands at random locations across the genome, implying an association between ROH islands and CNVR. Haplotype diversity for reliable ROH islands was lower than for ROH islands that intersected with copy loss CNVR.

Conclusions

Our findings show that a significant proportion of the ROH islands in the bovine genome are artefacts due to CNV or SNP coverage gaps.

Electronic supplementary material

The online version of this article (10.1186/s12711-018-0414-x) contains supplementary material, which is available to authorized users.
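
The artefact screen described in the Methods above can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the abstract defines coverage gaps from average intermarker distance over a region, whereas this sketch simplifies to single adjacent-marker gaps longer than 9.24 kb. The function names and the assumption that artefact regions do not overlap one another are hypothetical.

```python
from typing import List, Tuple

# (start_bp, end_bp) on one chromosome; hypothetical representation
Interval = Tuple[int, int]


def coverage_gaps(positions: List[int], max_gap_bp: int = 9240) -> List[Interval]:
    """Return intervals between adjacent SNPs that are more than max_gap_bp apart.

    Simplification: the abstract uses an average intermarker distance per
    region; here every single adjacent pair is tested on its own.
    """
    ordered = sorted(positions)
    return [(a, b) for a, b in zip(ordered, ordered[1:]) if b - a > max_gap_bp]


def overlap_bp(a: Interval, b: Interval) -> int:
    """Number of base pairs shared by two intervals (0 if they do not overlap)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]))


def flag_islands(islands: List[Interval],
                 artefact_regions: List[Interval]) -> List[Tuple[Interval, int]]:
    """Return (island, overlapped_bp) for ROH islands touching any artefact region.

    Assumes artefact_regions (CNVR or coverage gaps) do not overlap each other,
    otherwise shared base pairs would be counted more than once.
    """
    flagged = []
    for island in islands:
        total = sum(overlap_bp(island, region) for region in artefact_regions)
        if total > 0:
            flagged.append((island, total))
    return flagged


if __name__ == "__main__":
    gaps = coverage_gaps([1000, 5000, 20000, 24000])
    print(gaps)
    print(flag_islands([(4000, 8000), (21000, 23000)], gaps))
```

Islands returned by `flag_islands` would be candidates for closer inspection rather than automatic exclusion; the permutation test mentioned in the abstract is not covered here.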

Affiliation(s)

Wilson Nandolo Division of Livestock Sciences (NUWI), University of Natural Resources and Life Sciences, Gregor-Mendel Strasse 33, 1180, Vienna, Austria; Lilongwe University of Agriculture and Natural Resources, P. O. Box 219, Lilongwe, Malawi
Yuri T Utsunomiya School of Agricultural and Veterinarian Sciences, Jaboticabal, Department of Preventive Veterinary Medicine and Animal Reproduction, São Paulo State University (UNESP), São Paulo, Brazil
Gábor Mészáros Division of Livestock Sciences (NUWI), University of Natural Resources and Life Sciences, Gregor-Mendel Strasse 33, 1180, Vienna, Austria
Maria Wurzinger Division of Livestock Sciences (NUWI), University of Natural Resources and Life Sciences, Gregor-Mendel Strasse 33, 1180, Vienna, Austria
Negar Khayatzadeh Division of Livestock Sciences (NUWI), University of Natural Resources and Life Sciences, Gregor-Mendel Strasse 33, 1180, Vienna, Austria
Rafaela B P Torrecilha School of Agricultural and Veterinarian Sciences, Jaboticabal, Department of Preventive Veterinary Medicine and Animal Reproduction, São Paulo State University (UNESP), São Paulo, Brazil
Henry A Mulindwa National Livestock Resources Research Institute, P.O Box 96, Tororo, Uganda
Timothy N Gondwe Lilongwe University of Agriculture and Natural Resources, P. O. Box 219, Lilongwe, Malawi
Patrik Waldmann Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Box 7023, 750 07, Uppsala, Sweden
Maja Ferenčaković Department of Animal Science, Faculty of Agriculture, University of Zagreb, Svetošimunska cesta 25, 10000, Zagreb, Croatia
José F Garcia School of Agricultural and Veterinarian Sciences, Jaboticabal, Department of Preventive Veterinary Medicine and Animal Reproduction, São Paulo State University (UNESP), São Paulo, Brazil; School of Veterinary Medicine, Araçatuba, Department of Support, Production and Animal Health, São Paulo State University (UNESP), São Paulo, Brazil
Benjamin D Rosen Animal Genomics and Improvement Laboratory, Beltsville, MD, 20705-2350, USA
Derek Bickhart Animal Genomics and Improvement Laboratory, Beltsville, MD, 20705-2350, USA
Curt P van Tassell Animal Genomics and Improvement Laboratory, Beltsville, MD, 20705-2350, USA
Ino Curik Department of Animal Science, Faculty of Agriculture, University of Zagreb, Svetošimunska cesta 25, 10000, Zagreb, Croatia
Johann Sölkner Division of Livestock Sciences (NUWI), University of Natural Resources and Life Sciences, Gregor-Mendel Strasse 33, 1180, Vienna, Austria

Goszczynski D, Molina A, Terán E, Morales-Durand H, Ross P, Cheng H, Giovambattista G, Demyda-Peyrás S. Runs of homozygosity in a selected cattle population with extremely inbred bulls: Descriptive and functional analyses revealed highly variable patterns. PLoS One 2018;13:e0200069. [PMID: 29985951 PMCID: PMC6037354 DOI: 10.1371/journal.pone.0200069] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Received: 09/18/2017] [Accepted: 06/19/2018] [Indexed: 01/05/2023] Open

Abstract

The analysis of runs of homozygosity (ROH), using high throughput genomic data, has become a valuable and frequently used methodology to characterize the genomic and inbreeding variation of livestock and wildlife animal populations. However, this methodology has been scarcely used in highly inbred domestic animals. Here, we analyzed and characterized the occurrence of ROH fragments in highly inbred (HI; average pedigree-based inbreeding coefficient F_PED = 0.164; 0.103 to 0.306) and outbred Retinta bulls (LI; average F_PED = 0.008; 0 to 0.025). We studied the length of the fragments, their abundance, and genome distribution using high-density microarray data. The number of ROH was significantly higher in the HI group, especially for long fragments (> 8 Mb). In the LI group, the number of ROH continuously decreased with fragment length. Genome-wide distribution of ROH was highly variable between samples. Some chromosomes presented a larger number of fragments (BTA1, BTA19, BTA29), others had longer fragments (BTA4, BTA12, BTA17), while other ones showed an increased ROH accumulation over specific loci (BTA2, BTA7, BTA23, BTA29). Similar differences were observed in the analysis of 12 individuals produced by a similar inbred event (F_PED3 = 0.125). The correlation between the fraction of the genome covered by ROH (F_ROH) and F_PED was high (0.79), suggesting that ROH-based estimations are indicative of inbreeding levels. On the other hand, the correlation between F_PED and the microsatellite-based inbreeding coefficient (F_MIC) was only moderate (r = 0.44), suggesting that STR-based inbreeding estimations should be avoided. Similarly, we found a very low correlation (r = -0.0132) between recombination rate and ROH abundance across the genome. Finally, we performed functional annotation analyses of genome regions with significantly enriched ROH abundance. Results revealed gene clusters related to pregnancy-associated proteins and immune reaction. The same analysis performed for regions enriched with recently formed ROH (> 8 Mb) showed gene clusters related to flagellum assembly. In both cases, the processes were related to male and female reproductive functions, which may partially explain the reduced fertility associated with inbred populations.
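
F_ROH, described above as the fraction of the genome covered by ROH, reduces to a simple ratio. The sketch below assumes ROH segments are already called and given as non-overlapping (start, end) pairs in base pairs; the function names and the genome length passed by the caller are illustrative, not values from the study.

```python
from typing import List, Tuple

# (start_bp, end_bp) of one ROH segment; hypothetical representation
Segment = Tuple[int, int]


def f_roh(segments: List[Segment], genome_length_bp: int) -> float:
    """Fraction of the analysed genome covered by ROH segments.

    Assumes the segments do not overlap and that genome_length_bp is the
    length of the genome actually covered by markers.
    """
    if genome_length_bp <= 0:
        raise ValueError("genome_length_bp must be positive")
    covered = sum(end - start for start, end in segments)
    return covered / genome_length_bp


def count_long_roh(segments: List[Segment], min_length_bp: int = 8_000_000) -> int:
    """Count segments longer than min_length_bp (8 Mb, the class the abstract calls long)."""
    return sum(1 for start, end in segments if end - start > min_length_bp)


if __name__ == "__main__":
    example = [(0, 10_000_000), (50_000_000, 52_000_000)]
    print(f_roh(example, 100_000_000), count_long_roh(example))
```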

Affiliation(s)

Daniel Goszczynski IGEVET–Instituto de Genética Veterinaria "Ing. Fernando N. Dulout" (UNLP-CONICET LA PLATA), Facultad de Ciencias Veterinarias UNLP, La Plata, Argentina
Antonio Molina Departamento de Genética, Universidad de Córdoba, Córdoba, España
Ester Terán IGEVET–Instituto de Genética Veterinaria "Ing. Fernando N. Dulout" (UNLP-CONICET LA PLATA), Facultad de Ciencias Veterinarias UNLP, La Plata, Argentina
Hernán Morales-Durand IGEVET–Instituto de Genética Veterinaria "Ing. Fernando N. Dulout" (UNLP-CONICET LA PLATA), Facultad de Ciencias Veterinarias UNLP, La Plata, Argentina
Pablo Ross Department of Animal Science, University of California, Davis, Davis, California, United States of America
Hao Cheng Department of Animal Science, University of California, Davis, Davis, California, United States of America
Guillermo Giovambattista IGEVET–Instituto de Genética Veterinaria "Ing. Fernando N. Dulout" (UNLP-CONICET LA PLATA), Facultad de Ciencias Veterinarias UNLP, La Plata, Argentina; Departamento de Genética, Universidad de Córdoba, Córdoba, España; Department of Animal Science, University of California, Davis, Davis, California, United States of America; Departamento de Producción Animal, Facultad de Ciencias Veterinarias, Universidad Nacional de La Plata, La Plata, Buenos Aires, Argentina
Sebastián Demyda-Peyrás IGEVET–Instituto de Genética Veterinaria "Ing. Fernando N. Dulout" (UNLP-CONICET LA PLATA), Facultad de Ciencias Veterinarias UNLP, La Plata, Argentina; Departamento de Genética, Universidad de Córdoba, Córdoba, España; Department of Animal Science, University of California, Davis, Davis, California, United States of America; Departamento de Producción Animal, Facultad de Ciencias Veterinarias, Universidad Nacional de La Plata, La Plata, Buenos Aires, Argentina

Bejarano D, Martínez R, Manrique C, Parra LM, Rocha JF, Gómez Y, Abuabara Y, Gallego J. Linkage disequilibrium levels and allele frequency distribution in Blanco Orejinegro and Romosinuano Creole cattle using medium density SNP chip data. Genet Mol Biol 2018;41:426-433. [PMID: 30088613 PMCID: PMC6082240 DOI: 10.1590/1678-4685-gmb-2016-0310] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Received: 12/20/2016] [Accepted: 09/20/2017] [Indexed: 11/22/2022] Open

Larmer SG, Sargolzaei M, Brito LF, Ventura RV, Schenkel FS. Novel methods for genotype imputation to whole-genome sequence and a simple linear model to predict imputation accuracy. BMC Genet 2017;18:120. [PMID: 29281958 PMCID: PMC5746022 DOI: 10.1186/s12863-017-0588-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Received: 08/07/2017] [Accepted: 12/15/2017] [Indexed: 11/10/2022] Open

Abstract

Background

Accurate imputation plays a major role in genomic studies of livestock industries, where the number of genotyped or sequenced animals is limited by costs. This study explored methods to create an ideal reference population for imputation to Next Generation Sequencing data in cattle.

Methods

Methods for clustering of animals for imputation were explored, using 1000 Bull Genomes Project sequence data on 1146 animals from a variety of beef and dairy breeds. Imputation from 50 K to 777 K was first carried out to choose an ideal clustering method, using ADMIXTURE or PLINK clustering algorithms with either genotypes or reconstructed haplotypes.

Results

Due to efficiency, accuracy and ease of use, clustering with PLINK using haplotypes as quasi-genotypes was chosen as the most advantageous grouping method. It was found that using a clustered population slightly decreased computing time, while maintaining accuracy across the population. Although overall accuracy remained the same, a slight increase in accuracy was observed for groups of animals in some breeds (primarily purebred beef cattle from breeds with fewer sequenced animals), and a slight decrease in accuracy was observed for other groups, primarily crossbred animals. However, it was noted that some animals in each breed were poorly imputed across all methods. When imputed sequences were included in the reference population to aid imputation of poorly imputed animals, a small increase in overall accuracy was observed for nearly every individual in the population. Two models were created to predict imputation accuracy: a complete model using all information available, including Euclidean distances from genotypes and haplotypes, pedigree information, and clustering groups, and a simple model using only breed and a Euclidean distance matrix as predictors. Both models were successful in predicting imputation accuracy, with correlations between predicted and true imputation accuracy, as measured by concordance rate, of 0.87 and 0.83, respectively.

Conclusions

A clustering methodology can be very useful to subgroup cattle for efficient genotype imputation. In addition, accuracy of genotype imputation from medium to high-density Single Nucleotide Polymorphisms (SNP) chip panels to whole-genome sequence can be predicted well using a simple linear model defined in this study.
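
Imputation accuracy in the abstract above is measured by concordance rate. As a minimal sketch, assuming genotypes are coded 0/1/2 and missing true genotypes are coded `None` (both coding choices are assumptions, not taken from the paper), the concordance rate is the share of compared genotypes that match:

```python
from typing import List, Optional


def concordance_rate(true_genotypes: List[Optional[int]],
                     imputed_genotypes: List[int]) -> float:
    """Proportion of imputed genotypes equal to the true genotype.

    Positions where the true genotype is missing (None) are skipped.
    """
    if len(true_genotypes) != len(imputed_genotypes):
        raise ValueError("genotype vectors must have the same length")
    compared = [(t, i) for t, i in zip(true_genotypes, imputed_genotypes)
                if t is not None]
    if not compared:
        raise ValueError("no genotypes available for comparison")
    matches = sum(1 for t, i in compared if t == i)
    return matches / len(compared)


if __name__ == "__main__":
    print(concordance_rate([0, 1, 2, 1, None], [0, 1, 1, 1, 2]))
```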

A method for allocating low-coverage sequencing resources by targeting haplotypes rather than individuals. Genet Sel Evol 2017;49:78. [PMID: 29070022 PMCID: PMC5655873 DOI: 10.1186/s12711-017-0353-y] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Received: 04/03/2017] [Accepted: 10/18/2017] [Indexed: 11/25/2022] Open

Abstract

Background

This paper describes a heuristic method for allocating low-coverage sequencing resources by targeting haplotypes rather than individuals. Low-coverage sequencing assembles high-coverage sequence information for every individual by accumulating data from the genome segments that they share with many other individuals into consensus haplotypes. Deriving the consensus haplotypes accurately is critical for achieving a high phasing and imputation accuracy. In order to enable accurate phasing and imputation of sequence information for the whole population, we allocate the available sequencing resources among individuals with existing phased genomic data by targeting the sequencing coverage of their haplotypes.

Results

Our method, called AlphaSeqOpt, prioritizes haplotypes using a score function that is based on the frequency of the haplotypes in the sequencing set relative to the target coverage. AlphaSeqOpt has two steps: (1) selection of an initial set of individuals by iteratively choosing the individuals that have the maximum score conditional on the current set, and (2) refinement of the set through several rounds of exchanges of individuals. AlphaSeqOpt is very effective for distributing a fixed amount of sequencing resources evenly across haplotypes, which results in a reduction of the proportion of haplotypes that are sequenced below the target coverage. AlphaSeqOpt can provide a greater proportion of haplotypes sequenced at the target coverage by sequencing fewer individuals, as compared with other methods that use a score function based on haplotype frequencies in the population. A refinement of the initially selected set can provide a larger, more diverse set with more unique individuals, which is beneficial in the context of low-coverage sequencing. We extend the method with an approach for filtering rare haplotypes based on their flanking haplotypes, so that only those that are likely to derive from a recombination event are targeted.

Conclusions

We present a method for allocating sequencing resources so that a greater proportion of haplotypes are sequenced at a coverage that is sufficiently high for population-based imputation with low-coverage sequencing. The haplotype score function, the refinement step, and the new approach for filtering rare haplotypes make AlphaSeqOpt more effective for that purpose than previously reported methods for reducing sequencing redundancy.

Electronic supplementary material

The online version of this article (doi:10.1186/s12711-017-0353-y) contains supplementary material, which is available to authorized users.
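
The two-step structure described in the Results above (greedy selection conditional on the current set, followed by rounds of exchanges) can be sketched generically in Python. This is not AlphaSeqOpt and does not reproduce its score function: the score used here, the number of a candidate's haplotypes still carried by fewer than `target` selected individuals, is a stand-in chosen for illustration, and all names are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Set


def shortfall(selected: List[str], carriers: Dict[str, Set[str]], target: int) -> int:
    """Number of haplotypes carried by fewer than `target` selected individuals."""
    counts = Counter(h for ind in selected for h in carriers[ind])
    all_haps = set().union(*carriers.values())
    return sum(1 for h in all_haps if counts[h] < target)


def greedy_select(carriers: Dict[str, Set[str]], n_select: int, target: int) -> List[str]:
    """Step 1: repeatedly add the individual with the highest score given the current set."""
    selected: List[str] = []
    counts: Counter = Counter()
    while len(selected) < n_select:
        candidates = [i for i in sorted(carriers) if i not in selected]
        if not candidates:
            break
        # Stand-in score: haplotypes of the candidate that are still below target.
        best = max(candidates,
                   key=lambda i: sum(1 for h in carriers[i] if counts[h] < target))
        selected.append(best)
        counts.update(carriers[best])
    return selected


def refine(selected: List[str], carriers: Dict[str, Set[str]],
           target: int, rounds: int = 3) -> List[str]:
    """Step 2: exchange selected for unselected individuals while the shortfall drops."""
    current = list(selected)
    for _ in range(rounds):
        improved = False
        for pos in range(len(current)):
            for cand in sorted(carriers):
                if cand in current:
                    continue
                trial = current[:pos] + [cand] + current[pos + 1:]
                if shortfall(trial, carriers, target) < shortfall(current, carriers, target):
                    current = trial
                    improved = True
        if not improved:
            break
    return current


if __name__ == "__main__":
    carriers = {"a": {"h1", "h2"}, "b": {"h1", "h2"}, "c": {"h3", "h4"}, "d": {"h3"}}
    chosen = refine(greedy_select(carriers, 2, 1), carriers, 1)
    print(chosen, shortfall(chosen, carriers, 1))
```

Ties in the greedy step are broken alphabetically so that the result is deterministic; a real implementation would also weight individuals by sequencing coverage rather than counting each as one unit.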

Cañas-Álvarez JJ, Mouresan EF, Varona L, Díaz C, Molina A, Baro JA, Altarriba J, Carabaño MJ, Casellas J, Piedrafita J. Linkage disequilibrium, persistence of phase, and effective population size in Spanish local beef cattle breeds assessed through a high-density single nucleotide polymorphism chip. J Anim Sci 2017;94:2779-88. [PMID: 27482665 DOI: 10.2527/jas.2016-0425] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Indexed: 01/31/2023] Open

Abstract

Linkage disequilibrium (LD) and persistence of phase are fundamental approaches for exploring the genetic basis of economically important traits in cattle, including the identification of QTL for genomic selection and the estimation of effective population size (N_e) to determine the size of the training populations. In this study, we have used the Illumina BovineHD chip in 168 trios of 7 Spanish beef cattle breeds to obtain an overview of the magnitude of LD and the persistence of LD phase through the physical distance between markers. Also, we estimated the time of divergence based on the persistence of the LD phase and calculated past N_e from LD estimates using different alternatives to define the recombination rate. Estimates of average r² (as a measure of LD) for adjacent markers were close to 0.52 in the 7 breeds and decreased with the distance between markers, although in long distances, some LD still remained (0.07 and 0.05 for markers 200 kb and 1 Mb apart, respectively). A panel with a lower boundary of 38,000 SNP would be necessary to launch a successful within-breed genomic selection program. Persistence of phase, measured as the pairwise correlations between estimates of r in 2 breeds at short distances (10 kb), was in the 0.89 to 0.94 range and decreased from a range of 0.33 to 0.52 to a range of 0.01 to 0.08 when marker distance increased from 200 kb to 1 Mb, respectively. The magnitude of the persistence of phase between the Spanish beef breeds was similar to those found in dairy breeds. For across-breed genomic selection, the size of the SNP panels must be in the range of 50,000 to 83,000 SNP. Estimates of past N_e showed values ranging from 26 to 31 for 1 generation ago in all breeds. The divergence among breeds occurred between 129 and 207 generations ago. The results of this study are relevant for the future implementation of within- and across-breed genomic selection programs in the Spanish beef cattle populations. Our results suggest that a reduced subset of the SNP panel would be enough to achieve an adequate precision of the genomic predictions.
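
The pairwise LD statistic used above can be written out for two biallelic markers. The sketch below uses the standard definitions D = p_AB - p_A p_B, r = D / sqrt(p_A (1 - p_A) p_B (1 - p_B)) and r² as its square, and assumes phased haplotypes coded 0/1; the coding and the function names are assumptions for illustration, not details of the study.

```python
import math
from typing import List


def signed_r(hap_a: List[int], hap_b: List[int]) -> float:
    """Signed LD correlation r between two markers from phased 0/1 haplotypes."""
    if len(hap_a) != len(hap_b) or not hap_a:
        raise ValueError("haplotype vectors must be non-empty and equally long")
    n = len(hap_a)
    p_a = sum(hap_a) / n                      # frequency of allele 1 at marker A
    p_b = sum(hap_b) / n                      # frequency of allele 1 at marker B
    p_ab = sum(1 for a, b in zip(hap_a, hap_b) if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b                      # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r is undefined when a marker is monomorphic")
    return d / math.sqrt(denom)


def r_squared(hap_a: List[int], hap_b: List[int]) -> float:
    """LD measure r² between two markers."""
    r = signed_r(hap_a, hap_b)
    return r * r


if __name__ == "__main__":
    print(r_squared([0, 0, 1, 1], [0, 0, 1, 1]))
    print(signed_r([0, 0, 1, 1], [1, 1, 0, 0]))
```

Persistence of phase, as described in the abstract, would then be the correlation across marker pairs of the signed `r` values computed separately in two breeds, which is why the sign is kept rather than only r².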

Gonen S, Ros-Freixedes R, Battagin M, Gorjanc G, Hickey JM. A method for the allocation of sequencing resources in genotyped livestock populations. Genet Sel Evol 2017;49:47. [PMID: 28521728 PMCID: PMC5437657 DOI: 10.1186/s12711-017-0322-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Received: 08/10/2016] [Accepted: 05/12/2017] [Indexed: 11/18/2022] Open

Abstract

Background

This paper describes a method, called AlphaSeqOpt, for the allocation of sequencing resources in livestock populations with existing phased genomic data to maximise the ability to phase and impute sequenced haplotypes into the whole population.

Methods

We present two algorithms. The first selects focal individuals that collectively represent the maximum possible portion of the haplotype diversity in the population. The second allocates a fixed sequencing budget among the families of focal individuals to enable phasing of their haplotypes at the sequence level. We tested the performance of the two algorithms in simulated pedigrees. For each pedigree, we evaluated the proportion of population haplotypes that are carried by the focal individuals and compared our results to a variant of the widely-used key ancestors approach and to two haplotype-based approaches. We calculated the expected phasing accuracy of the haplotypes of a focal individual at the sequence level given the proportion of the fixed sequencing budget allocated to its family.

Results

AlphaSeqOpt maximises the ability to capture and phase the most frequent haplotypes in a population in three ways. First, it selects focal individuals that collectively represent a larger portion of the population haplotype diversity than existing methods. Second, it selects focal individuals from across the pedigree whose haplotypes can be easily phased using family-based phasing and imputation algorithms, thus maximising the ability to impute sequence into the rest of the population. Third, it allocates more of the fixed sequencing budget to focal individuals whose haplotypes are more frequent in the population than to focal individuals whose haplotypes are less frequent. Unlike existing methods, we additionally present an algorithm to allocate part of the sequencing budget to the families (i.e. immediate ancestors) of focal individuals to ensure that their haplotypes can be phased at the sequence level, which is essential for enabling and maximising subsequent sequence imputation.

Conclusions

We present a new method for the allocation of a fixed sequencing budget to focal individuals and their families such that the final sequenced haplotypes, when phased at the sequence level, represent the maximum possible portion of the haplotype diversity in the population that can be sequenced and phased at that budget.

Electronic supplementary material

The online version of this article (doi:10.1186/s12711-017-0322-5) contains supplementary material, which is available to authorized users.

Brouard JS, Boyle B, Ibeagha-Awemu EM, Bissonnette N. Low-depth genotyping-by-sequencing (GBS) in a bovine population: strategies to maximize the selection of high quality genotypes and the accuracy of imputation. BMC Genet 2017;18:32. [PMID: 28381212 PMCID: PMC5382419 DOI: 10.1186/s12863-017-0501-y] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Received: 11/22/2016] [Accepted: 03/30/2017] [Indexed: 01/01/2023] Open

Abstract

Background

Genotyping-by-sequencing (GBS) has emerged as a powerful and cost-effective approach for discovering and genotyping single-nucleotide polymorphisms. The GBS technique was largely used in crop species where its low sequence coverage is not a drawback for calling genotypes because inbred lines are almost homozygous. In contrast, only a few studies used the GBS technique in animal populations (with sizeable heterozygosity rates) and many of those that have been published did not consider the quality of the genotypes produced by the bioinformatic pipelines. To improve the sequence coverage of the fragments, an alternative GBS preparation protocol that includes selective primers during the PCR amplification step has been recently proposed. In this study, we compared this modified protocol with the conventional two-enzyme GBS protocol. We also described various procedures to maximize the selection of high quality genotypes and to increase the accuracy of imputation.

Results

The in silico digestions of the bovine genome showed that the combination of PstI and MspI is more suitable for sequencing bovine GBS libraries than the use of single digestions with PstI or ApeKI. The sequencing output of the GBS libraries generated a total of 123,666 variants with the selective-primer approach and 272,103 variants with the conventional approach. Validating our data with genotypes obtained from mass spectrometry and Illumina’s bovine SNP50 array, we found that the genotypes produced by the conventional GBS method were concordant with those produced by these alternative genotyping methods, whereas the selective-primer method failed to call heterozygotes with confidence. Our results indicate that high accuracy in genotype calling (>97%) can be obtained using low read-depth thresholds (3 to 5 reads) provided that markers are simultaneously filtered for genotype quality scores. We also show that factors such as the minimum call rate and the minor allele frequency positively influence the accuracy of imputation of missing GBS data. The highest accuracies (around 85%) of imputed GBS markers were obtained with the FIMPUTE program when GBS and SNP50 array genotypes were combined (80,190 to 100,297 markers) before imputation.

Conclusions

We discovered that the conventional two-enzyme GBS protocol could produce a large number of high-quality genotypes provided that appropriate filtration criteria were used. In contrast, the selective-primer approach resulted in a substantial proportion of miscalled genotypes and should be avoided for livestock genotyping studies. Overall, our study demonstrates that carefully adjusting the different filtering parameters applied to the GBS data is critical to maximize the selection of high quality genotypes and to increase the accuracy of imputation of missing data. The strategies and results presented here provide a framework to maximize the output of the GBS technique in animal populations and qualify the PstI/MspI GBS assay as a low-cost high-density genotyping platform. The conclusions reported here regarding read-depth and genotype quality filtering could benefit many GBS applications, notably genome-wide association studies, where there is a need to increase the density of markers genotyped across the target population while preserving the quality of genotypes.
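
The joint filter on read depth and genotype quality discussed above can be sketched as follows. The minimum depth of 3 reads falls within the 3 to 5 range reported in the Results; the genotype quality cut-off of 20 and the `Call` record layout are assumptions made only for this illustration, since the abstract does not state them.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Call:
    """One genotype call for one animal at one marker (hypothetical layout)."""
    genotype: str   # e.g. "AA", "AB", "BB"
    depth: int      # number of reads supporting the call
    gq: int         # genotype quality score


def passes(call: Call, min_depth: int = 3, min_gq: int = 20) -> bool:
    """Keep a call only if it meets both the depth and the quality threshold."""
    return call.depth >= min_depth and call.gq >= min_gq


def filter_calls(calls: List[Call], min_depth: int = 3, min_gq: int = 20) -> List[Call]:
    """Return the calls that survive the joint filter."""
    return [c for c in calls if passes(c, min_depth, min_gq)]


def call_rate(calls: List[Call], min_depth: int = 3, min_gq: int = 20) -> float:
    """Fraction of calls at a marker that survive the filter."""
    if not calls:
        raise ValueError("no calls supplied")
    return len(filter_calls(calls, min_depth, min_gq)) / len(calls)


if __name__ == "__main__":
    marker = [Call("AB", 5, 40), Call("AA", 2, 50), Call("BB", 4, 10)]
    print(filter_calls(marker), call_rate(marker))
```

A marker-level minimum call rate, one of the factors the abstract says influences imputation accuracy, could then be applied by comparing `call_rate` against a chosen threshold.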

Antolín R, Nettelblad C, Gorjanc G, Money D, Hickey JM. A hybrid method for the imputation of genomic data in livestock populations. Genet Sel Evol 2017;49:30. [PMID: 28253858 PMCID: PMC5439152 DOI: 10.1186/s12711-017-0300-y] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Received: 09/05/2016] [Accepted: 02/13/2017] [Indexed: 11/24/2022] Open

Abstract

BACKGROUND

This paper describes a combined heuristic and hidden Markov model (HMM) method to accurately impute missing genotypes in livestock datasets. Genomic selection in breeding programs requires high-density genotyping of many individuals, making algorithms that economically generate this information crucial. There are two common classes of imputation methods, heuristic methods and probabilistic methods, the latter being largely based on hidden Markov models. Heuristic methods are robust, but fail to impute markers in regions where the thresholds of heuristic rules are not met, or the pedigree is inconsistent. Hidden Markov models are probabilistic methods which typically do not require specific family structures or pedigree information, making them very flexible, but they are computationally expensive and, in some cases, less accurate.

RESULTS

We implemented a new hybrid imputation method that combined heuristic and HMM methods, AlphaImpute and MaCH, and compared the computation time and imputation accuracy of the three methods. AlphaImpute was the fastest, followed by the hybrid method and then the HMM. The computation time of the hybrid method and the HMM increased linearly with the number of iterations used in the hidden Markov model; however, the computation time of the hybrid method increased almost linearly and that of the HMM quadratically with the number of template haplotypes. The hybrid method was the most accurate imputation method for low-density panels when pedigree information was missing, especially if minor allele frequency was also low. The accuracy of the hybrid method and the HMM increased with the number of template haplotypes. The imputation accuracy of all three methods increased with the marker density of the low-density panels. Excluding the pedigree information reduced imputation accuracy for the hybrid method and AlphaImpute. Finally, the imputation accuracy of the three methods decreased with decreasing minor allele frequency.

CONCLUSIONS

The hybrid heuristic and probabilistic imputation method is able to impute all markers for all individuals in a population, as the HMM does. The hybrid method is usually more accurate and never significantly less accurate than a purely heuristic method or a purely probabilistic method and is faster than a standard probabilistic method.

Bennewitz J, Edel C, Fries R, Meuwissen THE, Wellmann R. Application of a Bayesian dominance model improves power in quantitative trait genome-wide association analysis. Genet Sel Evol 2017;49:7. [PMID: 28088170 PMCID: PMC5237573 DOI: 10.1186/s12711-017-0284-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Received: 07/12/2016] [Accepted: 01/04/2017] [Indexed: 02/02/2023] Open

Abstract

BACKGROUND

Multi-marker methods, which fit all markers simultaneously, were originally tailored for genomic selection purposes, but have proven to be useful also in association analyses, especially the so-called BayesC Bayesian methods. In a recent study, BayesD extended BayesC towards accounting for dominance effects and improved prediction accuracy and persistence in genomic selection. The current study investigated the power and precision of BayesC and BayesD in genome-wide association studies by means of stochastic simulations and applied these methods to a dairy cattle dataset.

METHODS

The simulation protocol was designed to mimic the genetic architecture of quantitative traits as realistically as possible. Special emphasis was put on the joint distribution of the additive and dominance effects of causative mutations. Additive marker effects were estimated by BayesC and additive and dominance effects by BayesD. The dependencies between additive and dominance effects were modelled in BayesD by choosing appropriate priors. A sliding-window approach was used. For each window, the R. Fernando window posterior probability of association was calculated and used for inference. The power to map segregating causal effects and the mapping precision were assessed for various marker densities up to full sequence information and various window sizes.

RESULTS

Power to map a QTL increased with higher marker densities and larger window sizes. This held true for both methods. Method BayesD had improved power compared to BayesC. The increase in power was between -2 and 8% for causative genes that explained more than 2.5% of the genetic variance. In addition, inspection of the estimates of genomic window dominance variance allowed for inference about the magnitude of dominance at significant associations, which remains hidden in BayesC analysis. Mapping precision was not substantially improved by BayesD.

CONCLUSIONS

BayesD improved power, but precision only slightly. Application of BayesD needs large datasets with genotypes and own performance records as phenotypes. Given the current efforts to establish cow reference populations in dairy cattle genomic selection schemes, such datasets are expected to be soon available, which will enable the application of BayesD for association mapping and genomic prediction purposes.
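
The sliding-window inference mentioned in the Methods above can be illustrated under a simplifying assumption. The abstract does not define the window posterior probability of association; the sketch below uses one common formulation, the proportion of MCMC samples in which at least one marker in the window is included in the model with a non-zero effect. The input layout (one 0/1 inclusion vector per MCMC sample) and the function name are hypothetical, and this is not the BayesC or BayesD implementation used in the study.

```python
from typing import List


def window_ppa(inclusion: List[List[int]], window_size: int) -> List[float]:
    """Sliding-window posterior probability of association.

    inclusion[s][m] is 1 if marker m had a non-zero effect in MCMC sample s,
    else 0. Windows slide one marker at a time. The value for a window is the
    share of samples with at least one included marker inside that window.
    """
    if not inclusion or not inclusion[0]:
        raise ValueError("inclusion matrix must not be empty")
    n_markers = len(inclusion[0])
    if window_size < 1 or window_size > n_markers:
        raise ValueError("window_size must be between 1 and the number of markers")
    result = []
    for start in range(n_markers - window_size + 1):
        hits = sum(1 for sample in inclusion
                   if any(sample[start:start + window_size]))
        result.append(hits / len(inclusion))
    return result


if __name__ == "__main__":
    samples = [
        [1, 0, 0, 0],
        [0, 0, 0, 1],
        [0, 1, 0, 0],
        [0, 0, 0, 0],
    ]
    print(window_ppa(samples, 2))
```

Larger windows can only raise or maintain each window's probability under this formulation, which is consistent with the reported increase in power with window size, though at the cost of coarser localisation.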

Gonen S, Jenko J, Gorjanc G, Mileham AJ, Whitelaw CBA, Hickey JM. Potential of gene drives with genome editing to increase genetic gain in livestock breeding programs. Genet Sel Evol 2017;49:3. [PMID: 28093068 PMCID: PMC5240390 DOI: 10.1186/s12711-016-0280-3] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Received: 07/20/2016] [Accepted: 12/14/2016] [Indexed: 01/10/2023] Open

Abstract

BACKGROUND

This paper uses simulation to explore how gene drives can increase genetic gain in livestock breeding programs. Gene drives are naturally occurring phenomena that cause a mutation on one chromosome to copy itself onto its homologous chromosome.

METHODS

We simulated nine different breeding and editing scenarios with a common overall structure. Each scenario began with 21 generations of selection, followed by 20 generations of selection based on true breeding values where the breeder used selection alone, selection in combination with genome editing, or selection with genome editing and gene drives. In the scenarios that used gene drives, we varied the probability of successfully incorporating the gene drive. For each scenario, we evaluated genetic gain, genetic variance, rate of change in inbreeding, number of distinct quantitative trait nucleotides (QTN) edited, rate of increase in favourable allele frequencies of edited QTN and the time to fix favourable alleles.

RESULTS

Gene drives enhanced the benefits of genome editing in seven ways: (1) they amplified the increase in genetic gain brought about by genome editing; (2) they amplified the rate of increase in the frequency of favourable alleles and reduced the time it took to fix them; (3) they enabled more rapid targeting of QTN with lesser effect for genome editing; (4) they distributed fixed editing resources across a larger number of distinct QTN across generations; (5) they focussed editing on a smaller number of QTN within a given generation; (6) they reduced the level of inbreeding when editing a subset of the sires; and (7) they increased the efficiency of converting genetic variation into genetic gain.

CONCLUSIONS

Genome editing in livestock breeding results in short-, medium- and long-term increases in genetic gain. The increase in genetic gain occurs because editing increases the frequency of favourable alleles in the population. Gene drives accelerate the increase in allele frequency caused by editing, which results in even higher genetic gain over a shorter period of time with no impact on inbreeding.

Retelling the recent evolution of genetic diversity for Guzerá: Inferences from LD decay, runs of homozygosity and Ne over the generations. Livest Sci 2016. [DOI: 10.1016/j.livsci.2016.10.006] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Indexed: 11/22/2022]

Identification of selective sweeps reveals divergent selection between Chinese Holstein and Simmental cattle populations. Genet Sel Evol 2016;48:76. [PMID: 27716022 PMCID: PMC5054554 DOI: 10.1186/s12711-016-0254-5] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Received: 11/25/2015] [Accepted: 09/26/2016] [Indexed: 12/23/2022] Open

Abstract

Background

The identification of signals left by recent positive selection provides a feasible approach for targeting genomic variants that underlie complex traits and fitness. A better understanding of the selection mechanisms that occurred during the evolution of species can also be gained. In this study, we simultaneously detected the genome-wide footprints of recent positive selection that occurred within and between Chinese Holstein and Simmental populations, which have been subjected to artificial selection for distinct purposes. We conducted analyses using various complementary approaches, including LRH, XP-EHH and F_ST, based on the Illumina 770K high-density single nucleotide polymorphism (SNP) array, to enable more comprehensive detection.
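As a rough illustration of the F_ST statistic mentioned above, the sketch below computes a simple heterozygosity-based F_ST for one biallelic SNP in two populations. It assumes equal weighting of the two populations and known allele frequencies; the study may well have used a different estimator, and the function names are hypothetical.

```python
def heterozygosity(p):
    """Expected heterozygosity 2p(1 - p) for a biallelic locus."""
    return 2.0 * p * (1.0 - p)


def fst_two_populations(p1, p2):
    """F_ST = (H_T - H_S) / H_T for one SNP, two equally weighted populations.

    p1, p2: frequency of the same allele in each population.
    Returns None when the SNP is monomorphic overall (F_ST undefined).
    """
    p_bar = (p1 + p2) / 2.0
    h_total = heterozygosity(p_bar)
    if h_total == 0.0:
        return None
    h_within = (heterozygosity(p1) + heterozygosity(p2)) / 2.0
    return (h_total - h_within) / h_total


if __name__ == "__main__":
    print(fst_two_populations(0.2, 0.8))
```

Values near 0 indicate similar allele frequencies in both populations, and values near 1 indicate nearly fixed differences between them.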

Results

We successfully constructed profiles of selective signals in both cattle populations. To further annotate these regions, we identified a set of novel functional genes related to growth, reproduction, immune response and milk production. There were no overlapping candidate windows between the two breeds. Finally, we investigated the distribution of SNPs that had low F_ST values across five distinct functional regions in the genome. In the low-minor allele frequency bin, we found a higher proportion of low-F_ST SNPs in the exons of the bovine genome, which indicates strong purifying selection of the exons.

Conclusions

The selection signatures identified in these two populations demonstrated positive selection pressure on a set of important genes with potential functions that are involved in many biological processes. We also demonstrated that in the bovine genome, exons were under strong purifying selection. Our findings provide insight into the mechanisms of artificial selection and will facilitate follow-up functional studies of potential candidate genes that are related to various economically important traits in cattle.

Linkage disequilibrium and haplotype block structure in Limousin, Simmental and native Polish Red cattle. Livest Sci 2016. [DOI: 10.1016/j.livsci.2016.07.008] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Indexed: 11/21/2022]

Vandenplas J, Calus MPL, Sevillano CA, Windig JJ, Bastiaansen JWM. Assigning breed origin to alleles in crossbred animals. Genet Sel Evol 2016;48:61. [PMID: 27549177 PMCID: PMC4994281 DOI: 10.1186/s12711-016-0240-y] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Received: 12/21/2015] [Accepted: 08/10/2016] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

For some species, animal production systems are based on the use of crossbreeding to take advantage of the increased performance of crossbred compared to purebred animals. Effects of single nucleotide polymorphisms (SNPs) may differ between purebred and crossbred animals for several reasons: (1) differences in linkage disequilibrium between SNP alleles and a quantitative trait locus; (2) differences in genetic backgrounds (e.g., dominance and epistatic interactions); and (3) differences in environmental conditions, which result in genotype-by-environment interactions. Thus, SNP effects may be breed-specific, which has led to the development of genomic evaluations for crossbred performance that take such effects into account. However, to estimate breed-specific effects, it is necessary to know breed origin of alleles in crossbred animals. Therefore, our aim was to develop an approach for assigning breed origin to alleles of crossbred animals (termed BOA) without information on pedigree and to study its accuracy by considering various factors, including distance between breeds.

RESULTS

The BOA approach consists of: (1) phasing genotypes of purebred and crossbred animals; (2) assigning breed origin to phased haplotypes; and (3) assigning breed origin to alleles of crossbred animals based on a library of assigned haplotypes, the breed composition of crossbred animals, and their SNP genotypes. The accuracy of allele assignments was determined for simulated datasets that include crosses between closely-related, distantly-related and unrelated breeds. Across these scenarios, the percentage of alleles of a crossbred animal that were correctly assigned to their breed origin was greater than 90 %, and increased with increasing distance between breeds, while the percentage of incorrectly assigned alleles was always less than 2 %. For the remaining alleles, i.e. 0 to 10 % of all alleles of a crossbred animal, breed origin could not be assigned.
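Steps (2) and (3) above can be sketched in a heavily simplified form. The example assumes phasing has already been done, represents each haplotype as a tuple of 0/1 alleles, and assigns a breed only when a haplotype was observed in exactly one purebred population. It ignores breed composition and SNP genotypes, which the actual BOA approach also uses; all names are hypothetical.

```python
from collections import defaultdict


def build_library(purebred_haplotypes):
    """Map each phased haplotype to the set of breeds in which it was seen.

    purebred_haplotypes: dict of breed name -> list of haplotypes,
    where each haplotype is a sequence of alleles.
    """
    library = defaultdict(set)
    for breed, haplotypes in purebred_haplotypes.items():
        for haplotype in haplotypes:
            library[tuple(haplotype)].add(breed)
    return dict(library)


def assign_origin(haplotype, library):
    """Return the breed of origin, or None if unseen or ambiguous."""
    breeds = library.get(tuple(haplotype), set())
    if len(breeds) == 1:
        return next(iter(breeds))
    return None


if __name__ == "__main__":
    purebreds = {
        "A": [(0, 1, 1), (0, 0, 1)],
        "B": [(1, 1, 0), (0, 0, 1)],
    }
    lib = build_library(purebreds)
    for hap in [(0, 1, 1), (1, 1, 0), (0, 0, 1), (1, 1, 1)]:
        print(hap, assign_origin(hap, lib))
```

Haplotypes shared by both breeds, or absent from the library, are left unassigned here, which loosely mirrors the unassigned fraction of alleles reported in the abstract.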

CONCLUSIONS

The BOA approach accurately assigns breed origin to alleles of crossbred animals, even if their pedigree is not recorded.

Niu H, Zhu B, Guo P, Zhang W, Xue J, Chen Y, Zhang L, Gao H, Gao X, Xu L, Li J. Estimation of linkage disequilibrium levels and haplotype block structure in Chinese Simmental and Wagyu beef cattle using high-density genotypes. Livest Sci 2016. [DOI: 10.1016/j.livsci.2016.05.012] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 10/21/2022]