1
|
Nisa FU, Kaul H, Asif M, Amin I, Mrode R, Mansoor S, Mukhtar Z. Genetic insights into crossbred dairy cattle of Pakistan: exploring allele frequency, linkage disequilibrium, and effective population size at a genome-wide scale. Mamm Genome 2023; 34:602-614. [PMID: 37804434 DOI: 10.1007/s00335-023-10019-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 09/13/2023] [Indexed: 10/09/2023]
Abstract
Linkage disequilibrium (LD) affects genomic studies accuracy. High-density genotyping platforms identify SNPs across animal genomes, increasing LD evaluation resolution for accurate analysis. This study aimed to evaluate the decay and magnitude of LD in a cohort of 81 crossbred dairy cattle using the GGP_HDv3_C Bead Chip. After quality control, 116,710 Single Nucleotide Polymorphisms (SNPs) across 2520.241 Mb of autosomes were retained. LD extent was assessed between autosomal SNPs within a 10 Mb range using the r2 statistics. LD value declined as inter-marker distance increased. The average r2 value was 0.24 for SNP pairs < 10 kb apart, decreasing to 0.13 for 50-100 kb distances. Minor allele frequency (MAF) and sample size significantly impact LD. Lower MAF thresholds result in smaller r2 values, while higher thresholds show increased r2 values. Additionally, smaller sample sizes exhibit higher average r2 values, especially for larger physical distance intervals (> 50 kb) between SNP pairs. Effective population size and inbreeding coefficient were 150 and 0.028 for the present generation, indicating a decrease in genetic diversity over time. These findings imply that the utilization of high-density SNP panels and customized/breed-specific SNP panels represent a highly favorable approach for conducting genome-wide association studies (GWAS) and implementing genomic selection (GS) in the Bos indicus cattle breeds, whose genomes are still largely unexplored. Furthermore, it is imperative to devise a meticulous breeding strategy tailored to each herd, aiming to enhance desired traits while simultaneously preserving genetic diversity.
Collapse
Affiliation(s)
- Fakhar Un Nisa
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering College (NIBGE-C), Faisalabad, Pakistan
- Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
- Department of Animal Breeding and Genetics, University of Veterinary and Animal Sciences, Lahore, Pakistan
| | - Haiba Kaul
- Department of Animal Breeding and Genetics, University of Veterinary and Animal Sciences, Lahore, Pakistan
| | - Muhammad Asif
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering College (NIBGE-C), Faisalabad, Pakistan
- Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
| | - Imran Amin
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering College (NIBGE-C), Faisalabad, Pakistan
- Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
| | - Raphael Mrode
- Animal Biosciences, International Livestock Research Institute, Nairobi, Kenya
- Animal and Veterinary Sciences, Scotland's Rural College, Edinburgh, UK
| | - Shahid Mansoor
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering College (NIBGE-C), Faisalabad, Pakistan
- Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
- International Centre for Chemical and Biological Sciences, University of Karachi, Karachi, Pakistan
| | - Zahid Mukhtar
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering College (NIBGE-C), Faisalabad, Pakistan.
- Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan.
| |
Collapse
|
2
|
Barani S, Nejati-Javaremi A, Moradi MH, Moradi-Sharbabak M, Gholizadeh M, Esfandyari H. Genome-wide study of linkage disequilibrium, population structure, and inbreeding in Iranian indigenous sheep breeds. PLoS One 2023; 18:e0286463. [PMID: 37267244 DOI: 10.1371/journal.pone.0286463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Accepted: 05/16/2023] [Indexed: 06/04/2023] Open
Abstract
Knowledge of linkage disequilibrium (LD), genetic structure and genetic diversity are some key parameters to study the breeding history of indigenous small ruminants. In this study, the OvineSNP50 Bead Chip array was used to estimate and compare LD, genetic diversity, effective population size (Ne) and genomic inbreeding in 186 individuals, from three Iranian indigenous sheep breeds consisting of Baluchi (n = 96), Lori-Bakhtiari (n = 47) and Zel (n = 47). The results of principal component analysis (PCA) revealed that all animals were allocated to the groups that they sampled and the admixture analysis revealed that the structure within the populations is best explained when separated into three groups (K = 3). The average r2 values estimated between adjacent single nucleotide polymorphisms (SNPs) at distances up to 10Kb, were 0.388±0.324, 0.353±0.311, and 0.333±0.309 for Baluchi, Lori-Bakhtiari and Zel, respectively. Estimation of genetic diversity and effective population size (Ne) showed that the Zel breed had the highest heterozygosity and Ne, whereas the lowest value was found in Baluchi breed. Estimation of genomic inbreeding using FROH (based on the long stretches of consecutive homozygous genotypes) showed the highest inbreeding coefficient in Baluchi and the lowest in Zel breed that could be due to higher pressure of artificial selection on Baluchi breed. The results of genomic inbreeding and Ne showed an increase in sharing haplotypes in Baluchi, leading to the enlargement of LD and the consequences of linkage disequilibrium and haplotype blocks confirmed this point. Also, the persistence of the LD phase between Zel and Lori-Bakhtiari was highest indicating that these two breeds would be combined in a multi-breed training population in genomic selection studies.
Collapse
Affiliation(s)
- S Barani
- Department of Animal Science, University College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - A Nejati-Javaremi
- Department of Animal Science, University College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - M H Moradi
- Department of Animal Science, Faculty of Agriculture and Natural Resources, Arak University, Arak, Iran
| | - M Moradi-Sharbabak
- Department of Animal Science, University College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - M Gholizadeh
- Department of Animal Science, Sari Agricultural Sciences and Natural Resources University, Sari, Mazandaran, Iran
| | | |
Collapse
|
3
|
Dash S, Singh A, Dixit SP, Kumar A, Behera R. Exploring haplotype block structure, runs of homozygosity, and effective population size among dairy cattle breeds of India. Trop Anim Health Prod 2023; 55:129. [PMID: 36952060 DOI: 10.1007/s11250-023-03534-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Accepted: 03/06/2023] [Indexed: 03/24/2023]
Abstract
The present study aimed to explore haplotype structure, runs of homozygosity (ROH), effective population size and persistence of gametic phase among three indigenous dairy cattle breeds, viz., Sahiwal (n = 19), Tharparkar (n = 17), and Gir (n = 16) by using BovineHD single nucleotide polymorphism (SNP) genotyping assay. The filtered SNPs after quality control ranged from 44% in Sahiwal to 53% in Gir. The highest number of haplotype blocks was observed in Tharparkar (15,640) and the lowest in Sahiwal (8027) spanning 17.3% and 7.8% of genome, respectively. The average block length was found close to 26 kb which suggests that multiple recombination events fragmented the ancestral haplotypes into smaller sizes. Gir cattle had the largest number of runs of homozygosity (ROH) regions (1762) followed by Tharparkar (1528) and Sahiwal (1138). Without pedigree information, inbreeding coefficients estimated from ROH (FROH) revealed that Gir had the highest FROH (0.099) proposing more inbreeding rate in this population. Effective population size (Ne) decreased slowly over the last 60 generations and at 13 generations ago; Ne was estimated as 70 for all the three dairy breeds. The highest gametic phase correlation (r = 0.78) was observed for Sahiwal and Tharparkar breed pair suggesting formulation of multi-breed reference population for successful implementation of genomic selection among dairy breeds. The decline in effective population size among native Indian cattle breeds may help in formulating strategies for conservation and genetic improvement of native germplasm for future use.
Collapse
Affiliation(s)
- Soumya Dash
- Animal Genetics Division, ICAR-National Bureau of Animal Genetic Resources, Karnal, 132001, Haryana, India.
- School of Crop Health Policy Support Research, ICAR-National Institute of Biotic Stress Management, Raipur, 493225, Chhattisgarh, India.
| | - Avtar Singh
- Animal Genetics and Breeding Division, ICAR-National Bureau of Animal Genetic Resources, Karnal, 132001, Haryana, India
| | - S P Dixit
- Animal Genetics Division, ICAR-National Bureau of Animal Genetic Resources, Karnal, 132001, Haryana, India
| | - Avnish Kumar
- Animal Genetics Division, ICAR-National Bureau of Animal Genetic Resources, Karnal, 132001, Haryana, India
| | - Rajalaxmi Behera
- Regional Centre, ICAR-Directorate of Poultry Research, Bhubaneswar, 751003, Odisha, India
| |
Collapse
|
4
|
Ahmad SF, Singh A, Gangwar M, Kumar S, Dutt T, Kumar A. Haplotype-based association study of production and reproduction traits in multigenerational Vrindavani population. Gene 2023; 867:147365. [PMID: 36918047 DOI: 10.1016/j.gene.2023.147365] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 02/23/2023] [Accepted: 03/08/2023] [Indexed: 03/14/2023]
Abstract
Haplotype-based association analysis promises to reveal important information regarding the effect of genetic variants on economic traits of interest. The present study aimed to evaluate the haplotype structure of Vrindavani cattle and explore the association of haplotypes with (re)production traits of economic interest. Genotyping array data of medium density (Bovine50KSNP BeadChip) on 96 randomly selected Vrindavani cows was used in the present study. Genotypes were called in GenomeStudio program while quality control was undertaken in PLINK using standard thresholds. The phenotypic traits used in the present study included age at first calving, dry days, lactation length, peak yield, total lactation milk yield, inter-calving period and service period. The haplotype structure of Vrindavani population was assessed, using a sliding window of 20 SNP with a shift of 5 SNPs at a time, in terms of the size of haplotype blocks regarding their length (in Kb) and frequency in chromosome-wise fashion. Haplotype blocks were assessed for possible association with important production and reproduction traits across three lactation cycles in Vrindavani cattle population. The first ten principal components were included in the model for haplotype-based association analysis to correct for stratification effects of assessed individuals. Multiple haplotypes were found to be associated with age at first calving, total lactation milk yield, peak yield, dry days, inter-calving period and service period. Various candidate genes were found to overlap haplotypes that were significantly associated with age at first calving (CDH18, MARCHF11, MYO10, FBXL7), total lactation milk yield (TGF, PDE1A, and COL8A1), peak yield (PPARGC1A, RCAN1, KCNE1, SMIM34 and MRPS6), dry days (CPNE4, ACAD11 and MRAS), inter-calving period (ABCG5, ABCG8 and COX7A2L) and service period (FOXL2 and PIK3CB). The putative candidate genes overlapping the significantly associated haplotypes revealed important pathways affecting the production and reproduction performance of animals. The identified genes and pathways may serve as good candidate markers to select animals for improved production and reproduction performance in future generations.
Collapse
Affiliation(s)
- Sheikh Firdous Ahmad
- ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Akansha Singh
- ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Munish Gangwar
- ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Subodh Kumar
- ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Triveni Dutt
- ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Amit Kumar
- ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India.
| |
Collapse
|
5
|
Calderón-Chagoya R, Vega-Murillo VE, García-Ruiz A, Ríos-Utrera Á, Martínez-Velázquez G, Montaño-Bermúdez M. Genome and chromosome wide association studies for growth traits in Simmental and Simbrah cattle. Anim Biosci 2023; 36:19-28. [PMID: 35798032 PMCID: PMC9834659 DOI: 10.5713/ab.21.0517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Accepted: 06/27/2022] [Indexed: 01/27/2023] Open
Abstract
OBJECTIVE The objective of this study was to perform genome (genome wide association studies [GWAS]) and chromosome (CWAS) wide association analyses to identify single nucleotide polymorphisms (SNPs) associated with growth traits in registered Simmental and Simbrah cattle. METHODS The phenotypes were deregressed BLUP EBVs for birth weight, weaning weight direct, weaning weight maternal, and yearling weight. The genotyping was performed with the GGP Bovine 150k chip. After the quality control analysis, 105,129 autosomal SNP from 967 animals (473 Simmental and 494 Simbrah) were used to carry out genotype association tests. The two association analyses were performed per breed and using combined information of the two breeds. The SNP associated with growth traits were mapped to their corresponding genes at 100 kb on either side. RESULTS A difference in magnitude of posterior probabilities was found across breeds between genome and chromosome wide association analyses. A total of 110, 143, and 302 SNP were associated with GWAS and CWAS for growth traits in the Simmental-, Simbrah-and joint -data analyses, respectively. It stands out from the enrichment analysis of the pathways for RNA polymerase (POLR2G, POLR3E) and GABAergic synapse (GABRR1, GABRR3) for Simmental cattle and p53 signaling pathway (BID, SERPINB5) for Simbrah cattle. CONCLUSION Only 6,265% of the markers associated with growth traits were found using CWAS and GWAS. The associated markers using the CWAS analysis, which were not associated using the GWAS, represents information that due to the model and priors was not associated with the traits.
Collapse
Affiliation(s)
- René Calderón-Chagoya
- Facultad de Medicina Veterinaria y Zootecnia, Universidad Nacional Autónoma de México, Ciudad Universitaria, Ciudad de México 04510,
México,Centro Nacional de Investigación Disciplinaria en Fisiología y Mejoramiento Animal, Instituto Nacional de Investigaciones Forestales, Agrícolas y Pecuarias, Colón, Querétaro 76280,
México
| | | | - Adriana García-Ruiz
- Centro Nacional de Investigación Disciplinaria en Fisiología y Mejoramiento Animal, Instituto Nacional de Investigaciones Forestales, Agrícolas y Pecuarias, Colón, Querétaro 76280,
México
| | - Ángel Ríos-Utrera
- Campo Experimental La Posta, Instituto Nacional de Investigaciones Forestales, Agrícolas y Pecuarias, Medellín, Veracruz 94277,
México
| | - Guillermo Martínez-Velázquez
- Campo Experimental Santiago Ixcuintla, Instituto Nacional de Investigaciones Forestales, Agrícolas y Pecuarias, Santiago Ixcuintla, Nayarit 63570,
México
| | - Moisés Montaño-Bermúdez
- Centro Nacional de Investigación Disciplinaria en Fisiología y Mejoramiento Animal, Instituto Nacional de Investigaciones Forestales, Agrícolas y Pecuarias, Colón, Querétaro 76280,
México,Corresponding Author: Moisés Montaño-Bermúdez, Tel: +52-55-38-71-8700 Ext. 80220, E-mail:
| |
Collapse
|
6
|
Hu X, Hao D, Yin J, Gong F, Wang X, Wang R, Liu B. Association between MIR31HG polymorphisms and the risk of Lumbar disc herniation in Chinese Han population. Cell Cycle 2022; 21:2109-2120. [PMID: 35704669 DOI: 10.1080/15384101.2022.2087281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open
Abstract
Lumbar disc herniation (LDH) is a common spinal disease that endangers human health. Genetic factors play a vital role in the progression of LDH. This study aimed to explore the relationship of the MIR31HG polymorphism with LDH risk in the Chinese population. Seven candidate SNPs on MIR31HG in 504 patients with LDH and 503 healthy people were genotyped by Agena MassARRAY platform. Logistic regression was used to calculate the relationship between MIR31HG polymorphism and LDH risk under different genetic models. Multi-factor dimensionality reduction (MDR) analysis was performed to evaluate the SNP-SNP interaction. We found that rs10965059 was significantly associated with a decreased risk of LDH under the dominant (OR = 0.46, 95% CI: 0.34-0.62, P < 0.001), log-additive (OR = 0.59, 95%CI: 0.45-0.76, P < 0.001), and codominant (OR = 0.40, 95%CI: 0.29-0.55, P < 0.001) models in the overall analysis. In the subgroup analyses of age, male, and complications, we found that rs10965059 was associated with a reduced risk of LDH. However, there was no significant correlation between MiR-31HG polymorphisms and risk of LDH in females. In addition, the three SNPs (rs72703442-rs2025327-rs55683539) was mapped to a 26kb LD block with D' >0.96, suggesting a significant linkage disequilibrium presence among each pair SNPs. MDR analysis showed that the best single-locus and multi-locus models for the prediction of LDH risk were rs10965059 and seven-locus models, respectively, and both of them increased LDH risk. Our results shown that in the Chinese Han population, the MIR31HG polymorphism rs10965059 was involved in a risk to symptomatic LDH, which provides a scientific basis for early screening, prevention, diagnosis and treatment of local LDH high-risk populations.
Collapse
Affiliation(s)
- Xinglv Hu
- Health Science Center, Xi'an Jiaotong University, Xi'an, Shaanxi, China.,Department of Spinal Surgery, Xi 'An Hospital of Traditional Chinese Medicine, Xi'an, Shaanxi, China
| | - Dingjun Hao
- Health Science Center, Xi'an Jiaotong University, Xi'an, Shaanxi, China.,Department of Spinal Surgery, Honghui Hospital Affiliated to Xi 'An Jiaotong University, Xi'an, Shaanxi, China
| | - Jichao Yin
- Department of Spinal Surgery, Xi 'An Hospital of Traditional Chinese Medicine, Xi'an, Shaanxi, China
| | - Futai Gong
- Department of Spinal Surgery, Xi 'An Hospital of Traditional Chinese Medicine, Xi'an, Shaanxi, China
| | - Xiangyang Wang
- Department of Spinal Surgery, Xi 'An Hospital of Traditional Chinese Medicine, Xi'an, Shaanxi, China
| | - Ruoxi Wang
- Department of Spinal Surgery, Xi 'An Hospital of Traditional Chinese Medicine, Xi'an, Shaanxi, China
| | - Bo Liu
- Department of Spinal Surgery, Xi 'An Hospital of Traditional Chinese Medicine, Xi'an, Shaanxi, China
| |
Collapse
|
7
|
Campos GS, Cardoso FF, Gomes CCG, Domingues R, de Almeida Regitano LC, de Sena Oliveira MC, de Oliveira HN, Carvalheiro R, Albuquerque LG, Miller S, Misztal I, Lourenco D. Development of genomic predictions for Angus cattle in Brazil incorporating genotypes from related American sires. J Anim Sci 2022; 100:6507787. [PMID: 35031806 PMCID: PMC8867558 DOI: 10.1093/jas/skac009] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Accepted: 01/12/2022] [Indexed: 11/24/2022] Open
Abstract
Genomic prediction has become the new standard for genetic improvement programs, and currently, there is a desire to implement this technology for the evaluation of Angus cattle in Brazil. Thus, the main objective of this study was to assess the feasibility of evaluating young Brazilian Angus (BA) bulls and heifers for 12 routinely recorded traits using single-step genomic BLUP (ssGBLUP) with and without genotypes from American Angus (AA) sires. The second objective was to obtain estimates of effective population size (Ne) and linkage disequilibrium (LD) in the Brazilian Angus population. The dataset contained phenotypic information for up to 277,661 animals belonging to the Promebo breeding program, pedigree for 362,900, of which 1,386 were genotyped for 50k, 77k, and 150k single nucleotide polymorphism (SNP) panels. After imputation and quality control, 61,666 SNPs were available for the analyses. In addition, genotypes from 332 American Angus (AA) sires widely used in Brazil were retrieved from the AA Association database to be used for genomic predictions. Bivariate animal models were used to estimate variance components, traditional EBV, and genomic EBV (GEBV). Validation was carried out with the linear regression method (LR) using young-genotyped animals born between 2013 and 2015 without phenotypes in the reduced dataset and with records in the complete dataset. Validation animals were further split into progeny of BA and AA sires to evaluate if their progenies would benefit by including genotypes from AA sires. The Ne was 254 based on pedigree and 197 based on LD, and the average LD (±SD) and distance between adjacent single nucleotide polymorphisms (SNPs) across all chromosomes were 0.27 (±0.27) and 40743.68 bp, respectively. Prediction accuracies with ssGBLUP outperformed BLUP for all traits, improving accuracies by, on average, 16% for BA young bulls and heifers. The GEBV prediction accuracies ranged from 0.37 (total maternal for weaning weight and tick count) to 0.54 (yearling precocity) across all traits, and dispersion (LR coefficients) fluctuated between 0.92 and 1.06. Inclusion of genotyped sires from the AA improved GEBV accuracies by 2%, on average, compared to using only the BA reference population. Our study indicated that genomic information could help us to improve GEBV accuracies and hence genetic progress in the Brazilian Angus population. The inclusion of genotypes from American Angus sires heavily used in Brazil just marginally increased the GEBV accuracies for selection candidates. There was a desire to implement genomic selection for Angus cattle in Brazil since the technology has been proved to increase genetic gain in animal breeding programs. Single-step genomic best linear unbiased prediction (ssGBLUP), which simultaneously combines pedigree and genomic information, was used to estimate individuals’ genomic breeding values (GEBV) or genetic merit. Genomic selection can accelerate genetic progress by increasing accuracy, especially in young animals without progeny. The accuracy of GEBV can also be improved by combing data from other countries to increase the reference population (i.e., genotyped and phenotyped animals) in small, genotyped populations. Thus, the main objective of this study was to evaluate the accuracy of GEBV for young Brazilian Angus (BA) bulls and heifers with ssGBLUP, including or not the genotypes from American Angus sires. The accuracies with ssGBLUP were higher than those from traditional BLUP (EBV calculated from pedigree), improving accuracies by, on average, 16% for young bulls and heifers. Including genotypes from American Angus sires heavily used in Brazil just marginally increased the GEBV accuracies for selection candidates.
Collapse
Affiliation(s)
- Gabriel Soares Campos
- Department of Animal and Dairy Science, University of Georgia, 30602, Athens, GA, USA
| | | | | | | | | | | | - Henrique Nunes de Oliveira
- Departamento de Zootecnia, Faculdade de Ciências Agrárias e Veterinárias, Universidade Estadual Paulista, 14884-900, Jaboticabal, SP, Brazil
| | - Roberto Carvalheiro
- Departamento de Zootecnia, Faculdade de Ciências Agrárias e Veterinárias, Universidade Estadual Paulista, 14884-900, Jaboticabal, SP, Brazil
| | - Lucia Galvão Albuquerque
- Departamento de Zootecnia, Faculdade de Ciências Agrárias e Veterinárias, Universidade Estadual Paulista, 14884-900, Jaboticabal, SP, Brazil
| | | | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, 30602, Athens, GA, USA
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, 30602, Athens, GA, USA
| |
Collapse
|
8
|
Rahimmadar S, Ghaffari M, Mokhber M, Williams JL. Linkage Disequilibrium and Effective Population Size of Buffalo Populations of Iran, Turkey, Pakistan, and Egypt Using a Medium Density SNP Array. Front Genet 2021; 12:608186. [PMID: 34950186 PMCID: PMC8689148 DOI: 10.3389/fgene.2021.608186] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2020] [Accepted: 11/03/2021] [Indexed: 11/21/2022] Open
Abstract
Linkage disequilibrium (LD) across the genome provides information to identify the genes and variations related to quantitative traits in genome-wide association studies (GWAS) and for the implementation of genomic selection (GS). LD can also be used to evaluate genetic diversity and population structure and reveal genomic regions affected by selection. LD structure and Ne were assessed in a set of 83 water buffaloes, comprising Azeri (AZI), Khuzestani (KHU), and Mazandarani (MAZ) breeds from Iran, Kundi (KUN) and Nili-Ravi (NIL) from Pakistan, Anatolian (ANA) buffalo from Turkey, and buffalo from Egypt (EGY). The values of corrected r2 (defined as the correlation between two loci) of adjacent SNPs for three pooled Iranian breeds (IRI), ANA, EGY, and two pooled Pakistani breeds (PAK) populations were 0.24, 0.28, 0.27, and 0.22, respectively. The corrected r2 between SNPs decreased with increasing physical distance from 100 Kb to 1 Mb. The LD values for IRI, ANA, EGY, and PAK populations were 0.16, 0.23, 0.24, and 0.21 for less than 100Kb, respectively, which reduced rapidly to 0.018, 0.042, 0.059, and 0.024, for a distance of 1 Mb. In all the populations, the decay rate was low for distances greater than 2Mb, up to the longest studied distance (15 Mb). The r2 values for adjacent SNPs in unrelated samples indicated that the Affymetrix Axiom 90 K SNP genomic array was suitable for GWAS and GS in these populations. The persistency of LD phase (PLDP) between populations was assessed, and results showed that PLPD values between the populations were more than 0.9 for distances of less than 100 Kb. The Ne in the recent generations has declined to the extent that breeding plans are urgently required to ensure that these buffalo populations are not at risk of being lost. We found that results are affected by sample size, which could be partially corrected for; however, additional data should be obtained to be confident of the results.
Collapse
Affiliation(s)
- Shirin Rahimmadar
- Department of Animal Science, Faculty of Agricultural Science, Urmia University, Urmia, Iran
| | - Mokhtar Ghaffari
- Department of Animal Science, Faculty of Agricultural Science, Urmia University, Urmia, Iran
| | - Mahdi Mokhber
- Department of Animal Science, Faculty of Agricultural Science, Urmia University, Urmia, Iran
| | - John L Williams
- Davies Research Centre, School of Animal and Veterinary Sciences, University of Adelaide, Roseworthy, SA, Australia.,Department of Animal Science, Food and Nutrition, Università Cattolica Del Sacro Cuore, Piacenza, Italy
| |
Collapse
|
9
|
Eiríksson JH, Karaman E, Su G, Christensen OF. Breed of origin of alleles and genomic predictions for crossbred dairy cows. Genet Sel Evol 2021; 53:84. [PMID: 34742238 PMCID: PMC8572482 DOI: 10.1186/s12711-021-00678-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Accepted: 10/20/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In dairy cattle, genomic selection has been implemented successfully for purebred populations, but, to date, genomic estimated breeding values (GEBV) for crossbred cows are rarely available, although they are valuable for rotational crossbreeding schemes that are promoted as efficient strategies. An attractive approach to provide GEBV for crossbreds is to use estimated marker effects from the genetic evaluation of purebreds. The effects of each marker allele in crossbreds can depend on the breed of origin of the allele (BOA), thus applying marker effects based on BOA could result in more accurate GEBV than applying only proportional contribution of the purebreds. Application of BOA models in rotational crossbreeding requires methods for detecting BOA, but the existing methods have not been developed for rotational crossbreeding. Therefore, the aims of this study were to develop and test methods for detecting BOA in a rotational crossbreeding system, and to investigate methods for calculating GEBV for crossbred cows using estimated marker effects from purebreds. RESULTS For detecting BOA in crossbred cows from rotational crossbreeding for which pedigree is recorded, we developed the AllOr method based on the comparison of haplotypes in overlapping windows. To calculate the GEBV of crossbred cows, two models were compared: a BOA model where marker effects estimated from purebreds are combined based on the detected BOA; and a breed proportion model where marker effects are combined based on estimated breed proportions. The methods were tested on simulated data that mimic the first four generations of rotational crossbreeding between Holstein, Jersey and Red Dairy Cattle. The AllOr method detected BOA correctly for 99.6% of the marker alleles across the four crossbred generations. The reliability of GEBV was higher with the BOA model than with the breed proportion model for the four generations of crossbreeding, with the largest difference observed in the first generation. CONCLUSIONS In rotational crossbreeding for which pedigree is recorded, BOA can be accurately detected using the AllOr method. Combining marker effects estimated from purebreds to predict the breeding value of crossbreds based on BOA is a promising approach to provide GEBV for crossbred dairy cows.
Collapse
Affiliation(s)
- Jón H Eiríksson
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.
| | - Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Ole F Christensen
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| |
Collapse
|
10
|
Xiong J, Niu Y, Liu W, Zeng F, Cheng JF, Chen SQ, Zeng XZ. Effect of L3MBTL3/PTPN9 polymorphisms on risk to alcohol-induced ONFH in Chinese Han population. Neurol Sci 2021; 43:2823-2830. [PMID: 34373992 DOI: 10.1007/s10072-021-05486-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2021] [Accepted: 07/16/2021] [Indexed: 11/26/2022]
Abstract
PURPOSE Alcohol-induced osteonecrosis femoral head necrosis (ONFH) is a disease that seriously affects human health. Abnormal expression of L3MBTL3/PTPN9 gene can cause a variety of human diseases. The purpose of this study is to investigate the effect of L3MBTL3/PTPN9 gene polymorphism on the susceptibility of alcohol-induced ONFH in Chinese Han population. METHODS A total of 308 alcohol-induced ONFH patients and 425 healthy controls were enrolled in this case-control study. Alleles, genotypes, genetic models, haplotypes, and multifactor dimensionality reduction analyses (MDR) based on age-corrected by using odds ratio (OR) and 95% confidence interval (CI) were performed. RESULTS Our result revealed rs2068957 in the L3MBTL3 gene increased the risk of alcohol ONFH under the recessive model after correction. Besides, we also found that rs75393192 in the PTPN9 gene was a protective site in stratification over 40 years of age and stage. In stratified analysis of necrotic sites, we only found that rs2068957 was associated with increased susceptibility of alcohol-induced ONFH under the co-dominant model and recessive model. Haplotype "GC" in the block (rs76107647|rs10851882 in PTPN9 gene) significantly decreased the susceptibility of alcoholic ONFH. CONCLUSIONS Our results provide evidence that L3MBTL3/PTPN9 polymorphisms are associated with alcohol-induced ONFH risk in Chinese Han population.
Collapse
Affiliation(s)
- Jun Xiong
- Department of Orthopedic Trauma, the Hainan Affiliated Hospital of Hainan Medical University, No. 19, Xiuhua Road, Haikou, 570311, Hainan Province, China
| | - Yi Niu
- Department of Emergency and Critical Care Medicine, the Haikou Orthopedic and Diabetes Hospital of Shanghai Sixth People's Hospital, No. 3, Changxiu Road, Haikou, 570300, Hainan Province, China
| | - Wei Liu
- Department of Orthopedic Trauma, the Hainan Affiliated Hospital of Hainan Medical University, No. 19, Xiuhua Road, Haikou, 570311, Hainan Province, China
| | - Fan Zeng
- Department of Orthopedic Trauma, the Hainan Affiliated Hospital of Hainan Medical University, No. 19, Xiuhua Road, Haikou, 570311, Hainan Province, China
| | - Jian-Fei Cheng
- Department of Orthopedic Trauma, the Hainan Affiliated Hospital of Hainan Medical University, No. 19, Xiuhua Road, Haikou, 570311, Hainan Province, China
| | - Shi-Qiang Chen
- Department of Orthopedic Trauma, the Hainan Affiliated Hospital of Hainan Medical University, No. 19, Xiuhua Road, Haikou, 570311, Hainan Province, China
| | - Xiang-Zhou Zeng
- Department of Pharmacology, School of Basic Medicine and Life Science, the Hainan Medical University, No. 3, Xueyuan Road, Haikou, 571199, Hainan Province, China.
| |
Collapse
|
11
|
Kulski JK, Suzuki S, Shiina T. Haplotype Shuffling and Dimorphic Transposable Elements in the Human Extended Major Histocompatibility Complex Class II Region. Front Genet 2021; 12:665899. [PMID: 34122517 PMCID: PMC8193847 DOI: 10.3389/fgene.2021.665899] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 04/12/2021] [Indexed: 12/26/2022] Open
Abstract
The major histocompatibility complex (MHC) on chromosome 6p21 is one of the most single-nucleotide polymorphism (SNP)-dense regions of the human genome and a prime model for the study and understanding of conserved sequence polymorphisms and structural diversity of ancestral haplotypes/conserved extended haplotypes. This study aimed to follow up on a previous analysis of the MHC class I region by using the same set of 95 MHC haplotype sequences downloaded from a publicly available BioProject database at the National Center for Biotechnology Information to identify and characterize the polymorphic human leukocyte antigen (HLA)-class II genes, the MTCO3P1 pseudogene alleles, the indels of transposable elements as haplotypic lineage markers, and SNP-density crossover (XO) loci at haplotype junctions in DNA sequence alignments of different haplotypes across the extended class II region (∼1 Mb) from the telomeric PRRT1 gene in class III to the COL11A2 gene at the centromeric end of class II. We identified 42 haplotypic indels (20 Alu, 7 SVA, 13 LTR or MERs, and 2 indels composed of a mosaic of different transposable elements) linked to particular HLA-class II alleles. Comparative sequence analyses of 136 haplotype pairs revealed 98 unique XO sites between SNP-poor and SNP-rich genomic segments with considerable haplotype shuffling located in the proximity of putative recombination hotspots. The majority of XO sites occurred across various regions including in the vicinity of MTCO3P1 between HLA-DQB1 and HLA-DQB3, between HLA-DQB2 and HLA-DOB, between DOB and TAP2, and between HLA-DOA and HLA-DPA1, where most XOs were within a HERVK22 sequence. We also determined the genomic positions of the PRDM9-recombination suppression sequence motif ATCCATG/CATGGAT and the PRDM9 recombination activation partial binding motif CCTCCCCT/AGGGGAG in the class II region of the human reference genome (NC_ 000006) relative to published meiotic recombination positions. Both the recombination and anti-recombination PRDM9 binding motifs were widely distributed throughout the class II genomic regions with 50% or more found within repeat elements; the anti-recombination motifs were found mostly in L1 fragmented repeats. This study shows substantial haplotype shuffling between different polymorphic blocks and confirms the presence of numerous putative ancestral recombination sites across the class II region between various HLA class II genes.
Collapse
Affiliation(s)
- Jerzy K Kulski
- Faculty of Health and Medical Sciences, The University of Western Australia, Crawley, WA, Australia.,Department of Molecular Life Sciences, Division of Basic Medical Science and Molecular Medicine, Tokai University School of Medicine, Isehara, Japan
| | - Shingo Suzuki
- Department of Molecular Life Sciences, Division of Basic Medical Science and Molecular Medicine, Tokai University School of Medicine, Isehara, Japan
| | - Takashi Shiina
- Department of Molecular Life Sciences, Division of Basic Medical Science and Molecular Medicine, Tokai University School of Medicine, Isehara, Japan
| |
Collapse
|
12
|
Thakor PB, Hinsu AT, Bhatia DR, Shah TM, Nayee N, Sudhakar A, Rank DN, Joshi CG. High-throughput genotype-based population structure analysis of selected buffalo breeds. Transl Anim Sci 2021; 5:txab033. [PMID: 33981962 PMCID: PMC8103726 DOI: 10.1093/tas/txab033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Accepted: 05/01/2021] [Indexed: 11/29/2022] Open
Abstract
India is considered as the home tract of some of the best buffalo breeds. However, the genetic structure of the Indian river buffalo is poorly understood. Hence, there is a need to characterize the populations and understand the genetic structure of various buffalo breeds for selection and to design breeding strategies. In this study, we have analyzed genetic variability and population structure of seven buffalo breeds from their respective geographical regions using Axiom Buffalo Genotyping Array. Diversity, as measured by expected heterozygosity, ranged from 0.364 in Surti to 0.384 in Murrah breed, and pair-wise FST values revealed the lowest genetic distance between Murrah and Nili-Ravi (0.0022), while the highest between Surti and Pandharpuri (0.030). Principal component analysis and structure analysis unveiled the differentiation of Surti, Pandharpuri, and Jaffarabadi in first two principal components and at K = 4, respectively, while remaining breeds were grouped together as a separate single cluster and admixed. Murrah and Mehsana showed early linkage disequilibrium (LD) decay, while Surti breed showed late decay. In LD blocks to quantitative trait locis (QTLs) concordance analysis, 4.65% of concordance was observed with 873 LD blocks overlapped with 2,330 QTLs. Overall, total 4,090 markers were identified from all LD blocks for six types of traits. Results of this study indicated that these single-nucleotide polymorphism (SNP) markers could differentiate phenotypically distinct breeds like Surti, Pandharpuri, and Jaffarabadi but not others. So, there is a need to develop SNP chip based on SNP markers identified by sequence information of local breeds.
Collapse
Affiliation(s)
- Prakash B Thakor
- Department of Animal Genetics and Breeding, College of Veterinary Science and Animal Husbandry, Anand Agriculture University, Anand 388001, India
| | - Ankit T Hinsu
- Department of Animal Genetics and Breeding, College of Veterinary Science and Animal Husbandry, Anand Agriculture University, Anand 388001, India
| | - Dhruv R Bhatia
- Department of Animal Genetics and Breeding, College of Veterinary Science and Animal Husbandry, Anand Agriculture University, Anand 388001, India
| | - Tejas M Shah
- Department of Animal Biotechnology, College of Veterinary Science and Animal Husbandry, Anand Agriculture University, Anand 388001, India
| | - Nilesh Nayee
- National Dairy Development Board, Anand 388001, India
| | - A Sudhakar
- National Dairy Development Board, Anand 388001, India
| | - Dharamshibhai N Rank
- Department of Animal Genetics and Breeding, College of Veterinary Science and Animal Husbandry, Anand Agriculture University, Anand 388001, India
| | - Chaitanya G Joshi
- Department of Animal Biotechnology, College of Veterinary Science and Animal Husbandry, Anand Agriculture University, Anand 388001, India.,Gujarat Biotechnology Research Centre, Gandhinagar 382017, India
| |
Collapse
|
13
|
Whalen A, Gorjanc G, Hickey JM. AlphaFamImpute: high-accuracy imputation in full-sib families from genotype-by-sequencing data. Bioinformatics 2020; 36:4369-4371. [PMID: 32467963 PMCID: PMC7520044 DOI: 10.1093/bioinformatics/btaa499] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Revised: 04/22/2020] [Accepted: 05/25/2020] [Indexed: 12/12/2022] Open
Abstract
SUMMARY AlphaFamImpute is an imputation package for calling, phasing and imputing genome-wide genotypes in outbred full-sib families from single nucleotide polymorphism (SNP) array and genotype-by-sequencing (GBS) data. GBS data are increasingly being used to genotype individuals, especially when SNP arrays do not exist for a population of interest. Low-coverage GBS produces data with a large number of missing or incorrect naïve genotype calls, which can be improved by identifying shared haplotype segments between full-sib individuals. Here, we present AlphaFamImpute, an algorithm specifically designed to exploit the genetic structure of full-sib families. It performs imputation using a two-step approach. In the first step, it phases and imputes parental genotypes based on the segregation states of their offspring (i.e. which pair of parental haplotypes the offspring inherited). In the second step, it phases and imputes the offspring genotypes by detecting which haplotype segments the offspring inherited from their parents. With a series of simulations, we find that AlphaFamImpute obtains high-accuracy genotypes, even when the parents are not genotyped and individuals are sequenced at <1x coverage. AVAILABILITY AND IMPLEMENTATION AlphaFamImpute is available as a Python package from the AlphaGenes website http://www.AlphaGenes.roslin.ed.ac.uk/AlphaFamImpute. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Andrew Whalen
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian EH25 9RG, UK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian EH25 9RG, UK
| | - John M Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian EH25 9RG, UK
| |
Collapse
|
14
|
Zhan H, Zhang S, Zhang K, Peng X, Xie S, Li X, Zhao S, Ma Y. Genome-Wide Patterns of Homozygosity and Relevant Characterizations on the Population Structure in Piétrain Pigs. Genes (Basel) 2020; 11:genes11050577. [PMID: 32455573 PMCID: PMC7291003 DOI: 10.3390/genes11050577] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Revised: 05/11/2020] [Accepted: 05/11/2020] [Indexed: 01/06/2023] Open
Abstract
Investigating the patterns of homozygosity, linkage disequilibrium, effective population size and inbreeding coefficients in livestock contributes to our understanding of the genetic diversity and evolutionary history. Here we used Illumina PorcineSNP50 Bead Chip to identify the runs of homozygosity (ROH) and estimate the linkage disequilibrium (LD) across the whole genome, and then predict the effective population size. In addition, we calculated the inbreeding coefficients based on ROH in 305 Piétrain pigs and compared its effect with the other two types of inbreeding coefficients obtained by different calculation methods. A total of 23,434 ROHs were detected, and the average length of ROH per individual was about 507.27 Mb. There was no regularity on how those runs of homozygosity distributed in genome. The comparisons of different categories suggested that the formation of long ROH was probably related with recent inbreeding events. Although the density of genes located in ROH core regions is lower than that in the other genomic regions, most of them are related with Piétrain commercial traits like meat qualities. Overall, the results provide insight into the way in which ROH is produced and the identified ROH core regions can be used to map the genes associated with commercial traits in domestic animals.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Yunlong Ma
- Correspondence: ; Tel.: +86-027-87282091
| |
Collapse
|
15
|
Haplotype-Based Genome-Wide Association Study and Identification of Candidate Genes Associated with Carcass Traits in Hanwoo Cattle. Genes (Basel) 2020; 11:genes11050551. [PMID: 32423003 PMCID: PMC7290854 DOI: 10.3390/genes11050551] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2020] [Revised: 04/30/2020] [Accepted: 05/05/2020] [Indexed: 12/20/2022] Open
Abstract
Hanwoo, is the most popular native beef cattle in South Korea. Due to its extensive popularity, research is ongoing to enhance its carcass quality and marbling traits. In this study we conducted a haplotype-based genome-wide association study (GWAS) by constructing haplotype blocks by three methods: number of single nucleotide polymorphisms (SNPs) in a haplotype block (nsnp), length of genomic region in kb (Len) and linkage disequilibrium (LD). Significant haplotype blocks and genes associated with them were identified for carcass traits such as BFT (back fat thickness), EMA (eye Muscle area), CWT (carcass weight) and MS (marbling score). Gene-set enrichment analysis and functional annotation of genes in the significantly-associated loci revealed candidate genes, including PLCB1 and PLCB4 present on BTA13, coding for phospholipases, which might be important candidates for increasing fat deposition due to their role in lipid metabolism and adipogenesis. CEL (carboxyl ester lipase), a bile-salt activated lipase, responsible for lipid catabolic process was also identified within the significantly-associated haplotype block on BTA11. The results were validated in a different Hanwoo population. The genes and pathways identified in this study may serve as good candidates for improving carcass traits in Hanwoo cattle.
Collapse
|
16
|
Alvarenga AB, Veroneze R, Oliveira HR, Marques DBD, Lopes PS, Silva FF, Brito LF. Comparing Alternative Single-Step GBLUP Approaches and Training Population Designs for Genomic Evaluation of Crossbred Animals. Front Genet 2020; 11:263. [PMID: 32328083 PMCID: PMC7162606 DOI: 10.3389/fgene.2020.00263] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2019] [Accepted: 03/05/2020] [Indexed: 02/06/2023] Open
Abstract
As crossbreeding is extensively used in some livestock species, we aimed to evaluate the performance of single-step GBLUP (ssGBLUP) and weighted ssGBLUP (WssGBLUP) methods to predict Genomic Estimated Breeding Values (GEBVs) of crossbred animals. Different training population scenarios were evaluated: (SC1) ssGBLUP based on a single-trait model considering purebred and crossbred animals in a joint training population; (SC2) ssGBLUP based on a multiple-trait model to enable considering phenotypes recorded in purebred and crossbred training animals as different traits; (SC3) WssGBLUP based on a single-trait model considering purebred and crossbred animals jointly in the training population (both populations were used for SNP weights' estimation); (SC4) WssGBLUP based on a single-trait model considering only purebred animals in the training population (crossbred population only used for SNP weights' estimation); (SC5) WssGBLUP based on a single-trait model and the training population characterized by purebred animals (purebred population used for SNP weights' estimation). A complex trait was simulated assuming alternative genetic architectures. Different scaling factors to blend the inverse of the genomic (G -1) and pedigree (A 22 - 1 ) relationship matrices were also tested. The predictive performance of each scenario was evaluated based on the validation accuracy and regression coefficient. The genetic correlations across simulated populations in the different scenarios ranged from moderate to high (0.71-0.99). The scenario mimicking a completely polygenic trait (h Q T L 2 = 0) yielded the lowest validation accuracy (0.12; for SC3 and SC4). The simulated scenarios assuming 4,500 QTLs affecting the trait andh Q T L 2 = h 2 resulted in the greatest GEBV accuracies (0.47; for SC1 and SC2). The regression coefficients ranged from 0.28 (for SC3 assuming polygenic effect) to 1.27 (for SC2 considering 4,500 QTLs). In general, SC3 and SC5 resulted in inflated GEBVs, whereas other scenarios yielded deflated GEBVs. The scaling factors used to combine G -1 andA 22 - 1 had a small influence on the validation accuracies, but a greater effect on the regression coefficients. Due to the complexity of multiple-trait models and WssGBLUP analyses, and a similar predictive performance across the methods evaluated, SC1 is recommended for genomic evaluation in crossbred populations with similar genetic structures [moderate-to-high (0.71-0.99) genetic correlations between purebred and crossbred populations].
Collapse
Affiliation(s)
- Amanda B. Alvarenga
- Department of Animal Sciences, Purdue University, West Lafayette, IN, United States
- Department of Animal Science, Federal University of Viçosa, Viçosa, Brazil
| | - Renata Veroneze
- Department of Animal Science, Federal University of Viçosa, Viçosa, Brazil
| | - Hinayah R. Oliveira
- Department of Animal Sciences, Purdue University, West Lafayette, IN, United States
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, ON, Canada
| | | | - Paulo S. Lopes
- Department of Animal Science, Federal University of Viçosa, Viçosa, Brazil
| | - Fabyano F. Silva
- Department of Animal Science, Federal University of Viçosa, Viçosa, Brazil
| | - Luiz F. Brito
- Department of Animal Sciences, Purdue University, West Lafayette, IN, United States
| |
Collapse
|
17
|
Pierce C, Speidel S, Coleman S, Enns R, Bailey D, Medrano J, Cánovas A, Meiman P, Howery L, Mandeville W, Thomas M. Genome-wide association studies of beef cow terrain-use traits using Bayesian multiple-SNP regression. Livest Sci 2020. [DOI: 10.1016/j.livsci.2019.103900] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
|
18
|
Mouresan EF, González-Rodríguez A, Cañas-Álvarez JJ, Munilla S, Altarriba J, Díaz C, Baró JA, Molina A, Lopez-Buesa P, Piedrafita J, Varona L. Mapping Recombination Rate on the Autosomal Chromosomes Based on the Persistency of Linkage Disequilibrium Phase Among Autochthonous Beef Cattle Populations in Spain. Front Genet 2019; 10:1170. [PMID: 31824571 PMCID: PMC6880760 DOI: 10.3389/fgene.2019.01170] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Accepted: 10/23/2019] [Indexed: 01/14/2023] Open
Abstract
In organisms with sexual reproduction, genetic diversity, and genome evolution are governed by meiotic recombination caused by crossing-over, which is known to vary within the genome. In this study, we propose a simple method to estimate the recombination rate that makes use of the persistency of linkage disequilibrium (LD) phase among closely related populations. The biological material comprised 171 triplets (sire/dam/offspring) from seven populations of autochthonous beef cattle in Spain (Asturiana de los Valles, Avileña-Negra Ibérica, Bruna dels Pirineus, Morucha, Pirenaica, Retinta, and Rubia Gallega), which were genotyped for 777,962 SNPs with the BovineHD BeadChip. After standard quality filtering, we reconstructed the haplotype phases in the parental individuals and calculated the LD by the correlation -r- between each pair of markers that had a genetic distance < 1 Mb. Subsequently, these correlations were used to calculate the persistency of LD phase between each pair of populations along the autosomal genome. Therefore, the distribution of the recombination rate along the genome can be inferred since the effect of the number of generations of divergence should be equivalent throughout the genome. In our study, the recombination rate was highest in the largest chromosomes and at the distal portion of the chromosomes. In addition, the persistency of LD phase was highly heterogeneous throughout the genome, with a ratio of 25.4 times between the estimates of the recombination rates from the genomic regions that had the highest (BTA18-7.1 Mb) and the lowest (BTA12-42.4 Mb) estimates. Finally, an overrepresentation enrichment analysis (ORA) showed differences in the enriched gene ontology (GO) terms between the genes located in the genomic regions with estimates of the recombination rate over (or below) the 95th (or 5th) percentile throughout the autosomal genome.
Collapse
Affiliation(s)
- Elena Flavia Mouresan
- Departamento de Anatomía, Embriología y Genética Animal, Universidad de Zaragoza, Zaragoza, Spain
| | | | | | - Sebastián Munilla
- Departamento de Anatomía, Embriología y Genética Animal, Universidad de Zaragoza, Zaragoza, Spain.,Departamento de Producción Animal, Facultad de Agronomía, Universidad de Buenos Aires, CONICET, Buenos Aires, Argentina
| | - Juan Altarriba
- Departamento de Anatomía, Embriología y Genética Animal, Universidad de Zaragoza, Zaragoza, Spain
| | - Clara Díaz
- Departamento de Mejora Genética Animal, Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Madrid, Spain
| | - Jesús A Baró
- Instituto Agroalimentario de Aragón (IA2), Zaragoza, Spain
| | - Antonio Molina
- Departamento de Ciencias Agroforestales, Universidad de Valladolid, Valladolid, Spain
| | - Pascual Lopez-Buesa
- Departamento de Anatomía, Embriología y Genética Animal, Universidad de Zaragoza, Zaragoza, Spain
| | - Jesús Piedrafita
- Departament de Ciència Animal i dels Aliments, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Luis Varona
- Departamento de Anatomía, Embriología y Genética Animal, Universidad de Zaragoza, Zaragoza, Spain
| |
Collapse
|
19
|
Thomasen JR, Liu H, Sørensen AC. Genotyping more cows increases genetic gain and reduces rate of true inbreeding in a dairy cattle breeding scheme using female reproductive technologies. J Dairy Sci 2019; 103:597-606. [PMID: 31733861 DOI: 10.3168/jds.2019-16974] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2019] [Accepted: 09/23/2019] [Indexed: 12/26/2022]
Abstract
Both small dairy cattle populations and dairy cattle populations with a low level of linkage disequilibrium (LD) suffer from low reliability of genomic prediction. In this study, we investigated whether adding more genotyped cows to the reference population influences the rate of genetic gain and rate of inbreeding by affecting the reliability. A standard breeding program with a large reference population and high LD, which mimicked a breeding program for Danish Holstein population, was simulated as a reference. A Danish Jersey population with a small reference population and high LD and a Red Dairy Cattle population with a large reference population and low LD were also simulated. Two additional breeding programs were simulated for Danish Jersey and Red Dairy Cattle populations, where 2,000 additional genotyped cows were included in the population for genomic selection. All 5 simulated breeding programs were initiated by a founder population to generate LD resembling the real LD pattern, followed by a 20-yr conventional progeny-testing scheme with 1,000 or 10,000 genotyped progeny-tested bulls and a 10-yr genomic selection scheme with or without 2,000 additional genotyped cows. Evaluation criteria were annual monetary genetic gain and rate of true inbreeding. Our results showed that adding more genotyped cows to the reference in dairy cattle populations has the potential to increase genetic gain and reduce the rate of inbreeding, regardless of reference population size and level of LD. However, it is still not possible to reach the same genetic gain as in the simulated Danish Holstein population with either a small reference population or low LD. Our results also showed that in a small reference population with high LD, it is difficult to manage inbreeding because of lower accuracy compared with the simulated Danish Holstein population and a smaller number of relevant families to select from. Therefore, breeding strategies need to be chosen to match population size and structure. The rate of true inbreeding is always underestimated by pedigree inbreeding and even more in genomic breeding programs, indicating that some forms of genome-wide inbreeding, instead of pedigree-based inbreeding, should be used to monitor inbreeding when genomic selection is implemented.
Collapse
Affiliation(s)
| | - H Liu
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Faculty of Science and Technology, Aarhus University, DK-8830, Tjele, Denmark.
| | - A C Sørensen
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Faculty of Science and Technology, Aarhus University, DK-8830, Tjele, Denmark
| |
Collapse
|
20
|
Benjelloun B, Boyer F, Streeter I, Zamani W, Engelen S, Alberti A, Alberto FJ, BenBati M, Ibnelbachyr M, Chentouf M, Bechchari A, Rezaei HR, Naderi S, Stella A, Chikhi A, Clarke L, Kijas J, Flicek P, Taberlet P, Pompanon F. An evaluation of sequencing coverage and genotyping strategies to assess neutral and adaptive diversity. Mol Ecol Resour 2019; 19:1497-1515. [PMID: 31359622 PMCID: PMC7115901 DOI: 10.1111/1755-0998.13070] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2018] [Revised: 06/30/2019] [Accepted: 07/08/2019] [Indexed: 12/12/2022]
Abstract
Whole genome sequences (WGS) greatly increase our ability to precisely infer population genetic parameters, demographic processes, and selection signatures. However, WGS may still be not affordable for a representative number of individuals/populations. In this context, our goal was to assess the efficiency of several SNP genotyping strategies by testing their ability to accurately estimate parameters describing neutral diversity and to detect signatures of selection. We analysed 110 WGS at 12× coverage for four different species, i.e., sheep, goats and their wild counterparts. From these data we generated 946 data sets corresponding to random panels of 1K to 5M variants, commercial SNP chips and exome capture, for sample sizes of five to 48 individuals. We also extracted low-coverage genome resequencing of 1×, 2× and 5× by randomly subsampling reads from the 12× resequencing data. Globally, 5K to 10K random variants were enough for an accurate estimation of genome diversity. Conversely, commercial panels and exome capture displayed strong ascertainment biases. Besides the characterization of neutral diversity, the detection of the signature of selection and the accurate estimation of linkage disequilibrium (LD) required high-density panels of at least 1M variants. Finally, genotype likelihoods increased the quality of variant calling from low coverage resequencing but proportions of incorrect genotypes remained substantial, especially for heterozygote sites. Whole genome resequencing coverage of at least 5× appeared to be necessary for accurate assessment of genomic variations. These results have implications for studies seeking to deploy low-density SNP collections or genome scans across genetically diverse populations/species showing similar genetic characteristics and patterns of LD decay for a wide variety of purposes.
Collapse
Affiliation(s)
- Badr Benjelloun
- Univ. Grenoble-Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, F-38000 Grenoble, France
- National Institute of Agronomic Research (INRA Maroc), Regional Centre of Agronomic Research, 23000 Beni-Mellal, Morocco
| | - Frédéric Boyer
- Univ. Grenoble-Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, F-38000 Grenoble, France
| | - Ian Streeter
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD UK
| | - Wahid Zamani
- Univ. Grenoble-Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, F-38000 Grenoble, France
- Department of Environmental Sciences, Faculty of Natural Resources and Marine Sciences, Tarbiat Modares University, 46417-76489 Noor, Mazandaran, Iran
| | - Stefan Engelen
- CEA - Institut de biologie François-Jacob, Genoscope, 2 Rue Gaston Cremieux 91057 Evry Cedex, France
| | - Adriana Alberti
- CEA - Institut de biologie François-Jacob, Genoscope, 2 Rue Gaston Cremieux 91057 Evry Cedex, France
| | - Florian J. Alberto
- Univ. Grenoble-Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, F-38000 Grenoble, France
| | - Mohamed BenBati
- National Institute of Agronomic Research (INRA Maroc), Regional Centre of Agronomic Research, 23000 Beni-Mellal, Morocco
| | - Mustapha Ibnelbachyr
- National Institute of Agronomic Research (INRA Maroc), CRRA Errachidia, 52000 Errachidia, Morocco
| | - Mouad Chentouf
- National Institute of Agronomic Research (INRA Maroc), CRRA Tangier, 90010 Tangier, Morocco
| | - Abdelmajid Bechchari
- National Institute of Agronomic Research (INRA Maroc), CRRA Oujda, 60000 Oujda, Morocco
| | - Hamid R. Rezaei
- Department of Environmental Sci, Gorgan University of Agricultural Sciences & Natural Resources, 41996-13776 Gorgan, Iran
| | - Saeid Naderi
- Environmental Sciences Department, Natural Resources Faculty, University of Guilan, 49138-15749 Guilan, Iran
| | - Alessandra Stella
- PTP Science Park, Bioinformatics Unit, Via Einstein-Loc. Cascina Codazza, 26900 Lodi, Italy
| | - Abdelkader Chikhi
- National Institute of Agronomic Research (INRA Maroc), CRRA Errachidia, 52000 Errachidia, Morocco
| | - Laura Clarke
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD UK
| | - James Kijas
- Commonwealth Scientific and Industrial Research Organisation Animal Food and Health Sciences, St Lucia, QLD 4067, Australia
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD UK
| | - Pierre Taberlet
- Univ. Grenoble-Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, F-38000 Grenoble, France
| | - François Pompanon
- Univ. Grenoble-Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, F-38000 Grenoble, France
| |
Collapse
|
21
|
Obšteter J, Jenko J, Hickey JM, Gorjanc G. Efficient use of genomic information for sustainable genetic improvement in small cattle populations. J Dairy Sci 2019; 102:9971-9982. [PMID: 31477287 DOI: 10.3168/jds.2019-16853] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Accepted: 07/13/2019] [Indexed: 11/19/2022]
Abstract
In this study, we compared genetic gain, genetic variation, and the efficiency of converting variation into gain under different genomic selection scenarios with truncation or optimum contribution selection in a small dairy population by simulation. Breeding programs have to maximize genetic gain but also ensure sustainability by maintaining genetic variation. Numerous studies have shown that genomic selection increases genetic gain. Although genomic selection is a well-established method, small populations still struggle with choosing the most sustainable strategy to adopt this type of selection. We developed a simulator of a dairy population and simulated a model after the Slovenian Brown Swiss population with ∼10,500 cows. We compared different truncation selection scenarios by varying (1) the method of sire selection and their use on cows or bull-dams, and (2) selection intensity and the number of years a sire is in use. Furthermore, we compared different optimum contribution selection scenarios with optimization of sire selection and their usage. We compared scenarios in terms of genetic gain, selection accuracy, generation interval, genetic and genic variance, rate of coancestry, effective population size, and conversion efficiency. The results showed that early use of genomically tested sires increased genetic gain compared with progeny testing, as expected from changes in selection accuracy and generation interval. A faster turnover of sires from year to year and higher intensity increased the genetic gain even further but increased the loss of genetic variation per year. Although maximizing intensity gave the lowest conversion efficiency, faster turnover of sires gave an intermediate conversion efficiency. The largest conversion efficiency was achieved with the simultaneous use of genomically and progeny-tested sires that were used over several years. Compared with truncation selection, optimizing sire selection and their usage increased the conversion efficiency by achieving either comparable genetic gain for a smaller loss of genetic variation or higher genetic gain for a comparable loss of genetic variation. Our results will help breeding organizations implement sustainable genomic selection.
Collapse
Affiliation(s)
- J Obšteter
- Department of Animal Science, Agricultural Institute of Slovenia, Hacquetova ulica 17, 1000 Ljubljana, Slovenia.
| | - J Jenko
- Department of Animal Science, Agricultural Institute of Slovenia, Hacquetova ulica 17, 1000 Ljubljana, Slovenia; Geno Breeding and A.I. Association, Storhamargata 44, 2317 Hamar, Norway
| | - J M Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG, United Kingdom
| | - G Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG, United Kingdom; Biotechnical Faculty, University of Ljubljana, Jamnikarjeva 101, 1000 Ljubljana, Slovenia
| |
Collapse
|
22
|
Fallahi MH, Shahrbabak HM, Shahrbabak MM, Arpanahi RA, Gholami S. Detection of Haplotypic Structure for Genome of Azerbaijani Buffalo Using High Density SNP Markers. RUSS J GENET+ 2019. [DOI: 10.1134/s1022795419080040] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
23
|
Mokhber M, Shahrbabak MM, Sadeghi M, Shahrbabak HM, Stella A, Nicolzzi E, Williams JL. Study of whole genome linkage disequilibrium patterns of Iranian water buffalo breeds using the Axiom Buffalo Genotyping 90K Array. PLoS One 2019; 14:e0217687. [PMID: 31150486 PMCID: PMC6544294 DOI: 10.1371/journal.pone.0217687] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2018] [Accepted: 05/16/2019] [Indexed: 01/21/2023] Open
Abstract
Accuracy of genome-wide association studies, and the successful implementation of genomic selection depends on the level of linkage disequilibrium (LD) across the genome and also the persistence of LD phase between populations. In the present study LD between adjacent SNPs and LD decay between SNPs was calculated in three Iranian water buffalo populations. Persistence of LD phase was evaluated across these populations and effective population size (Ne) was estimated from corrected r2 information. A set of 404 individuals from three Iranian buffalo populations were genotyped with the Axiom Buffalo Genotyping 90K Array. Average r2 and |D'| between adjacent SNP pairs across all chromosomes was 0.27 and 0.66 for AZI, 0.29 and 0.68 for KHU, and 0.32 and 0.72 for MAZ. The LD between the SNPs decreased with increasing physical distance from 100Kb to 1Mb between markers, from 0.234 to 0.018 for AZI, 0.254 to 0.034 for KHU, and 0.297 to 0.119 for MAZ, respectively. These results indicate that a density of 90K SNP is sufficient for genomic analyses relying on long range LD (e.g. GWAS and genomic selection). The persistence of LD phase decreased with increasing marker distances across all the populations, but remained above 0.8 for AZI and KHU for marker distances up to 100Kb. For multi-breed genomic evaluation, the 90K SNP panel is suitable for AZI and KHU buffalo breeds. Estimated effective population sizes for AZI, KHU and MAZ were 477, 212 and 32, respectively, for recent generations. The estimated effective population sizes indicate that the MAZ is at risk and requires careful management.
Collapse
Affiliation(s)
- Mahdi Mokhber
- Department of Animal Science, Faculty of Agriculture, Urmia University, Urmia, Iran
- * E-mail:
| | - Mohammad Moradi Shahrbabak
- Department of Animal Science, Faculty of Agricultural Science and Engineering, University College of Agriculture and Natural Resources (UTCAN), University of Tehran, Karaj, Iran
| | - Mostafa Sadeghi
- Department of Animal Science, Faculty of Agricultural Science and Engineering, University College of Agriculture and Natural Resources (UTCAN), University of Tehran, Karaj, Iran
| | - Hossein Moradi Shahrbabak
- Department of Animal Science, Faculty of Agricultural Science and Engineering, University College of Agriculture and Natural Resources (UTCAN), University of Tehran, Karaj, Iran
| | | | | | - John L. Williams
- Davies Research Centre, School of Animal and Veterinary Sciences, University of Adelaide, Roseworthy, South Australia, Australia
| |
Collapse
|
24
|
Barría A, Christensen KA, Yoshida G, Jedlicki A, Leong JS, Rondeau EB, Lhorente JP, Koop BF, Davidson WS, Yáñez JM. Whole Genome Linkage Disequilibrium and Effective Population Size in a Coho Salmon ( Oncorhynchus kisutch) Breeding Population Using a High-Density SNP Array. Front Genet 2019; 10:498. [PMID: 31191613 PMCID: PMC6539196 DOI: 10.3389/fgene.2019.00498] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2018] [Accepted: 05/07/2019] [Indexed: 12/19/2022] Open
Abstract
The estimation of linkage disequilibrium between molecular markers within a population is critical when establishing the minimum number of markers required for association studies, genomic selection, and inferring historical events influencing different populations. This work aimed to evaluate the extent and decay of linkage disequilibrium in a coho salmon breeding population using a high-density SNP array. Linkage disequilibrium was estimated between a total of 93,502 SNPs found in 64 individuals (33 dams and 31 sires) from the breeding population. The markers encompass all 30 coho salmon chromosomes and comprise 1,684.62 Mb of the genome. The average density of markers per chromosome ranged from 48.31 to 66 per 1 Mb. The minor allele frequency averaged 0.26 (with a range from 0.22 to 0.27). The overall average linkage disequilibrium among SNPs pairs measured as r2 was 0.10. The Average r2 value decreased with increasing physical distance, with values ranging from 0.21 to 0.07 at a distance lower than 1 kb and up to 10 Mb, respectively. An r2 threshold of 0.2 was reached at distance of approximately 40 Kb. Chromosomes Okis05, Okis15 and Okis28 showed high levels of linkage disequilibrium (>0.20 at distances lower than 1 Mb). Average r2 values were lower than 0.15 for all chromosomes at distances greater than 4 Mb. An effective population size of 43 was estimated for the population 10 generations ago, and 325, for 139 generations ago. Based on the effective number of chromosome segments, we suggest that at least 74,000 SNPs would be necessary for an association mapping study and genomic predictions. Therefore, the SNP panel used allowed us to capture high-resolution information in the farmed coho salmon population. Furthermore, based on the contemporary Ne, a new mate allocation strategy is suggested to increase the effective population size.
Collapse
Affiliation(s)
- Agustín Barría
- Facultad de Ciencias Veterinarias y Pecuarias, Universidad de Chile, Santiago, Chile
| | - Kris A Christensen
- Department of Biology, Centre for Biomedical Research, University of Victoria, Victoria, BC, Canada
| | - Grazyella Yoshida
- Facultad de Ciencias Veterinarias y Pecuarias, Universidad de Chile, Santiago, Chile
| | - Ana Jedlicki
- Facultad de Ciencias Veterinarias y Pecuarias, Universidad de Chile, Santiago, Chile
| | - Jong S Leong
- Department of Biology, Centre for Biomedical Research, University of Victoria, Victoria, BC, Canada
| | - Eric B Rondeau
- Department of Biology, Centre for Biomedical Research, University of Victoria, Victoria, BC, Canada
| | | | - Ben F Koop
- Department of Biology, Centre for Biomedical Research, University of Victoria, Victoria, BC, Canada
| | - William S Davidson
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
| | - José M Yáñez
- Facultad de Ciencias Veterinarias y Pecuarias, Universidad de Chile, Santiago, Chile.,Nucleo Milenio INVASAL, Concepcion, Chile
| |
Collapse
|
25
|
Chhotaray S, Panigrahi M, Pal D, Ahmad SF, Bhanuprakash V, Kumar H, Parida S, Bhushan B, Gaur GK, Mishra BP, Singh RK. Genome-wide estimation of inbreeding coefficient, effective population size and haplotype blocks in Vrindavani crossbred cattle strain of India. BIOL RHYTHM RES 2019. [DOI: 10.1080/09291016.2019.1600266] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Supriya Chhotaray
- Division of Animal Genetics, Indian Veterinary Research Institute, Bareilly, India
| | - Manjit Panigrahi
- Division of Animal Genetics, Indian Veterinary Research Institute, Bareilly, India
| | - Dhan Pal
- Division of Animal Genetics, Indian Veterinary Research Institute, Bareilly, India
| | - Sheikh Firdous Ahmad
- Division of Animal Genetics, Indian Veterinary Research Institute, Bareilly, India
| | - V. Bhanuprakash
- Division of Animal Genetics, Indian Veterinary Research Institute, Bareilly, India
| | - Harshit Kumar
- Division of Animal Genetics, Indian Veterinary Research Institute, Bareilly, India
| | - Subhashree Parida
- Division of Pharmacology & Toxicology, Indian Veterinary Research Institute, Bareilly, India
| | - Bharat Bhushan
- Division of Animal Genetics, Indian Veterinary Research Institute, Bareilly, India
| | - G. K. Gaur
- Division of Animal Genetics, Indian Veterinary Research Institute, Bareilly, India
| | - B. P. Mishra
- Division of Animal Biotechnology, Indian Veterinary Research Institute, Bareilly, India
| | - R. K. Singh
- Division of Animal Biotechnology, Indian Veterinary Research Institute, Bareilly, India
| |
Collapse
|
26
|
Schäler J, Wellmann R, Bennewitz J, Thaller G, Hinrichs D. Genetic diversity and historic introgression in German Angler and Red Dual Purpose cattle and possibilities to reverse introgression. ACTA AGR SCAND A-AN 2019. [DOI: 10.1080/09064702.2019.1600011] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- J. Schäler
- Institute of Animal Breeding and Husbandry, Christian-Albrechts-University, Kiel, Germany
- Department of Animal Breeding, University of Kassel, Witzenhausen, Germany
| | - R. Wellmann
- Department of Animal Breeding and Genetics, University of Hohenheim, Stuttgart, Germany
| | - J. Bennewitz
- Department of Animal Breeding and Genetics, University of Hohenheim, Stuttgart, Germany
| | - G. Thaller
- Institute of Animal Breeding and Husbandry, Christian-Albrechts-University, Kiel, Germany
| | - D. Hinrichs
- Department of Animal Breeding, University of Kassel, Witzenhausen, Germany
| |
Collapse
|
27
|
Xu L, Zhu B, Wang Z, Xu L, Liu Y, Chen Y, Zhang L, Gao X, Gao H, Zhang S, Xu L, Li J. Evaluation of Linkage Disequilibrium, Effective Population Size and Haplotype Block Structure in Chinese Cattle. Animals (Basel) 2019; 9:ani9030083. [PMID: 30845681 PMCID: PMC6466336 DOI: 10.3390/ani9030083] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Revised: 02/21/2019] [Accepted: 02/22/2019] [Indexed: 11/16/2022] Open
Abstract
Simple Summary Evaluation of the population structure and linkage disequilibrium can offer important insights to fully understand the genetic diversity and population history of cattle, which can enable us to appropriately design and implement GWAS and GS in cattle. In this study, we characterized the extent of genome-wide LD and the haplotype block structure, and estimated the persistence of phase of Chinese indigenous cattle with Illumina BovineHD BeadChip. According to our study, 58K, 87K, 95K, 52K, and 52K markers would be necessary for SCHC, NCC, SWC, SIM, and WAG, respectively, in the implementation of GWAS and GS and combining a multipopulation with high persistence of phase is feasible for the implication of genomic selection for Chinese beef cattle. Abstract Understanding the linkage disequilibrium (LD) across the genome, haplotype structure, and persistence of phase between breeds can enable us to appropriately design and implement the genome-wide association (GWAS) and genomic selection (GS) in beef cattle. We estimated the extent of genome-wide LD, haplotype block structure, and the persistence of phase in 10 Chinese cattle population using high density BovinHD BeadChip. The overall LD measured by r2 between adjacent SNPs were 0.60, 0.67, 0.58, 0.73, and 0.71 for South Chinese cattle (SCHC), North Chinese cattle (NCC), Southwest Chinese cattle (SWC), Simmental (SIM), and Wagyu (WAG). The highest correlation (0.53) for persistence of phase across groups was observed for SCHC vs. SWC at distances of 0–50 kb, while the lowest correlation was 0.13 for SIM vs. SCHC at the same distances. In addition, the estimated current effective population sizes were 27, 14, 31, 34, and 43 for SCHC, NCC, SWC, SIM, and WAG, respectively. Our result showed that 58K, 87K, 95K, 52K, and 52K markers were required for implementation of GWAS and GS in SCHC, NCC, SWC, SIM, and WAG, respectively. Also, our findings suggested that the implication of genomic selection for multipopulation with high persistence of phase is feasible for Chinese cattle.
Collapse
Affiliation(s)
- Lei Xu
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.
- Institute of Animal Husbandry and Veterinary Research, Anhui Academy of Agricultural Sciences, Hefei, 230031, China.
| | - Bo Zhu
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| | - Zezhao Wang
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| | - Ling Xu
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| | - Ying Liu
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| | - Yan Chen
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| | - Lupei Zhang
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| | - Xue Gao
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| | - Huijiang Gao
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| | - Shengli Zhang
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.
| | - Lingyang Xu
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| | - Junya Li
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| |
Collapse
|
28
|
Jenko J, McClure MC, Matthews D, McClure J, Johnsson M, Gorjanc G, Hickey JM. Analysis of a large dataset reveals haplotypes carrying putatively recessive lethal and semi-lethal alleles with pleiotropic effects on economically important traits in beef cattle. Genet Sel Evol 2019; 51:9. [PMID: 30836944 PMCID: PMC6402105 DOI: 10.1186/s12711-019-0452-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2018] [Accepted: 02/21/2019] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND In livestock, deleterious recessive alleles can result in reduced economic performance of homozygous individuals in multiple ways, e.g. early embryonic death, death soon after birth, or semi-lethality with incomplete penetrance causing reduced viability. While death is an easy phenotype to score, reduced viability is not as easy to identify. However, it can sometimes be observed as reduced conception rates, longer calving intervals, or lower survival for live born animals. METHODS In this paper, we searched for haplotypes that carry putatively recessive lethal or semi-lethal alleles in 132,725 genotyped Irish beef cattle from five breeds: Aberdeen Angus, Charolais, Hereford, Limousin, and Simmental. We phased the genotypes in sliding windows along the genome and used five tests to identify haplotypes with absence of or reduced homozygosity. Then, we associated the identified haplotypes with 44,351 insemination records that indicated early embryonic death, and postnatal survival records. Finally, we assessed haplotype pleiotropy by estimating substitution effects on estimates of breeding value for 15 economically important traits in beef production. RESULTS We found support for one haplotype that carries a putatively recessive lethal (chromosome 16 in Simmental) and two haplotypes that carry semi-lethal alleles (chromosome 14 in Aberdeen Angus and chromosome 19 in Charolais), with population frequencies of 8.8, 15.2, and 14.4%, respectively. These three haplotypes showed pleiotropic effects on economically important traits for beef production. Their allele substitution effects are €2.30, €3.42, and €1.47 for the terminal index and €1.03, - €3.11, and - €0.88 for the replacement index, where the standard deviations for the terminal index are €22.52, €18.65, and €22.70 and for the replacement index they are €31.35, €29.82, and €35.79. We identified ZFAT as the candidate gene for semi-lethality in Aberdeen Angus, several candidate genes for the lethal Simmental haplotype, and no candidate genes for the semi-lethal Charolais haplotype. CONCLUSIONS We analysed genotype, reproduction, survival, and production data to detect haplotypes that carry putatively recessive lethal or semi-lethal alleles in Irish beef cattle and identified one lethal and two semi-lethal haplotypes, which have pleiotropic effects on economically important traits in beef production.
Collapse
Affiliation(s)
- Janez Jenko
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | | | | | | | - Martin Johnsson
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK.,Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Box 7023, 750 07, Uppsala, Sweden
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - John M Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK.
| |
Collapse
|
29
|
Whalen A, Gorjanc G, Hickey JM. Parentage assignment with genotyping-by-sequencing data. J Anim Breed Genet 2019; 136:102-112. [PMID: 30548685 PMCID: PMC6392119 DOI: 10.1111/jbg.12370] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2018] [Revised: 11/09/2018] [Accepted: 11/10/2018] [Indexed: 01/04/2023]
Abstract
In this paper, we evaluate using genotype-by-sequencing (GBS) data to perform parentage assignment in lieu of traditional array data. The use of GBS data raises two issues: First, for low-coverage (e.g., <2×) GBS data, it may not be possible to call the genotype at many loci, a critical first step for detecting opposing homozygous markers. Second, the amount of sequencing coverage may vary across individuals, making it challenging to directly compare the likelihood scores between putative parents. To address these issues, we extend the probabilistic framework of Huisman (Molecular Ecology Resources, 2017, 17, 1009) and evaluate putative parents by comparing their (potentially noisy) genotypes to a series of proposal distributions. These distributions describe the expected genotype probabilities for the relatives of an individual. We assign putative parents as a parent if they are classified as a parent (as opposed to e.g., an unrelated individual), and if the assignment score passes a threshold. We evaluated this method on simulated data and found that (a) high-coverage (>2×) GBS data performs similarly to array data and requires only a small number of markers to correctly assign parents and (b) low-coverage GBS data (as low as 0.1×) can also be used, provided that it is obtained across a large number of markers. When analysing the low-coverage GBS data, we also found a high number of false positives if the true parent is not contained within the list of candidate parents, but that this false positive rate can be greatly reduced by hand tuning the assignment threshold. We provide this parentage assignment method as a standalone program called AlphaAssign.
Collapse
Affiliation(s)
- Andrew Whalen
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesThe University of EdinburghMidlothianUK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesThe University of EdinburghMidlothianUK
| | - John M. Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesThe University of EdinburghMidlothianUK
| |
Collapse
|
30
|
Jemaa SB, Thamri N, Mnara S, Rebours E, Rocha D, Boussaha M. Linkage disequilibrium and past effective population size in native Tunisian cattle. Genet Mol Biol 2019; 42:52-61. [PMID: 30776288 PMCID: PMC6428135 DOI: 10.1590/1678-4685-gmb-2017-0342] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2017] [Accepted: 05/14/2018] [Indexed: 12/30/2022] Open
Abstract
To carry out effective genome-wide association studies, information about linkage disequilibrium (LD) is essential. Here, we used medium-density SNP chips to provide estimates of LD in native Tunisian cattle. The two measures of LD that were used, mean r2 and D', decreased from 0.26 to 0.05 and from 0.73 to 0.40, respectively, when the distance between markers increased from less than 20 Kb to 200 Kb. The decay in LD over physical distance occurred at a faster rate than that reported for European and other indigenous breeds, and reached background levels at less than 500 Kb distance. This is consistent with the absence of strong selective pressure within the Tunisian population and suggests that, in order to be effective, any potential genome-wide association mapping studies will need to use chips with higher marker density. An analysis of effective population size (Ne) based on LD data showed a decline in past Ne, with a sudden drop starting about eight generations ago. This finding, combined with the high levels of recent inbreeding revealed by runs of homozygosity (ROH) analysis, indicate that this population is endangered and may be in urgent need of a conservation plan that includes a well-designed genetic management program.
Collapse
Affiliation(s)
- Slim Ben Jemaa
- National Institute of Agronomic Research of Tunisia, Laboratoire des Productions Animales et Fourragères, Ariana, Tunisia
| | - Nejia Thamri
- Livestock and Pasture Office, Tunis Belvedere, Tunisia
| | | | - Emmanuelle Rebours
- GABI, INRA, AgroParisTech, Université Paris Saclay, Jouy-en-Josas, France
| | - Dominique Rocha
- GABI, INRA, AgroParisTech, Université Paris Saclay, Jouy-en-Josas, France
| | - Mekki Boussaha
- GABI, INRA, AgroParisTech, Université Paris Saclay, Jouy-en-Josas, France
| |
Collapse
|
31
|
Bresolin T, Rosa GJDM, Valente BD, Espigolan R, Gordo DGM, Braz CU, Fernandes Júnior GA, Magalhães AFB, Garcia DA, Frezarim GB, Leão GFC, Carvalheiro R, Baldi F, Nunes de Oliveira H, Galvão de Albuquerque L. Effect of quality control, density and allele frequency of markers on the accuracy of genomic prediction for complex traits in Nellore cattle. ANIMAL PRODUCTION SCIENCE 2019. [DOI: 10.1071/an16821] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
This study was designed to test the impact of quality control, density and allele frequency of single nucleotide polymorphisms (SNP) markers on the accuracy of genomic predictions, using three traits with different heritabilities and two methods of prediction in a Nellore cattle population genotyped with the Illumina Bovine HD Assay. A total of 1756; 3150 and 3119 records of age at first calving (AFC); weaning weight (WW) and yearling weight (YW), respectively, were used. Three scenarios with different exclusion thresholds for minor allele frequency (MAF), deviation from Hardy–Weinberg equilibrium (HWE) and correlation between SNP pairs (r2) were constructed for all traits: (1) high rigor (S1): call rate <0.98, MAF <0.05, HWE with P <10−5, and r2 >0.999; (2) Moderate rigor (S2): call rate <0.85 and MAF <0.01; (3) Low rigor (S3): only non-autosomal SNP and those mapped on the same position were excluded. Additionally, to assess the prediction accuracy from different markers density, six panels (10K, 50K, 100K, 300K, 500K and 700K) were customised using the high-density genotyping assay as reference. Finally, from the markers available in high-density genotyping assay, six groups (G) with different minor allele frequency bins were defined to estimate the accuracy of genomic prediction. The range of MAF bins was approximately equal for the traits studied: G1 (0.000–0.009), G2 (0.010–0.064), G3 (0.065–0.174), G4 (0.175–0.325), G5 (0.326–0.500) and G6 (0.000–0.500). The Genomic Best Linear Unbiased Predictor and BayesCπ methods were used to estimate the SNP marker effects. Five-fold cross-validation was used to measure the accuracy of genomic prediction for all scenarios. There were no effects of genotypes quality control criteria on the accuracies of genomic predictions. For all traits, the higher density panel did not provide greater prediction accuracies than the low density one (10K panel). The groups of SNP with low MAF (MAF ≤0.007 for AFC, MAF ≤0.009 for WW and MAF ≤0.008 for YW) provided lower prediction accuracies than the groups with higher allele frequencies.
Collapse
|
32
|
Fonseca PADS, dos Santos FC, Lam S, Suárez-Vega A, Miglior F, Schenkel FS, Diniz LDAF, Id-Lahoucine S, Carvalho MRS, Cánovas A. Genetic mechanisms underlying spermatic and testicular traits within and among cattle breeds: systematic review and prioritization of GWAS results. J Anim Sci 2018; 96:4978-4999. [PMID: 30304443 PMCID: PMC6276581 DOI: 10.1093/jas/sky382] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Accepted: 09/27/2018] [Indexed: 12/20/2022] Open
Abstract
Reduced bull fertility imposes economic losses in bovine herds. Specifically, testicular and spermatic traits are important indicators of reproductive efficiency. Several genome-wide association studies (GWAS) have identified genomic regions associated with these fertility traits. The aims of this study were as follows: 1) to perform a systematic review of GWAS results for spermatic and testicular traits in cattle and 2) to identify key functional candidate genes for these traits. The identification of functional candidate genes was performed using a systems biology approach, where genes shared between traits and studies were evaluated by a guilt by association gene prioritization (GUILDify and ToppGene software) in order to identify the best functional candidates. These candidate genes were integrated and analyzed in order to identify overlapping patterns among traits and breeds. Results showed that GWAS for testicular-related traits have been developed for beef breeds only, whereas the majority of GWAS for spermatic-related traits were conducted using dairy breeds. When comparing traits measured within the same study, the highest number of genes shared between different traits was observed, indicating a high impact of the population genetic structure and environmental effects. Several chromosomal regions were enriched for functional candidate genes associated with fertility traits. Moreover, multiple functional candidate genes were enriched for markers in a species-specific basis, taurine (Bos taurus) or indicine (Bos indicus). For the different candidate regions identified in the GWAS in the literature, functional candidate genes were detected as follows: B. Taurus chromosome X (BTX) (TEX11, IRAK, CDK16, ATP7A, ATRX, HDAC6, FMR1, L1CAM, MECP2, etc.), BTA17 (TRPV4 and DYNLL1), and BTA14 (MOS, FABP5, ZFPM2). These genes are responsible for regulating important metabolic pathways or biological processes associated with fertility, such as progression of spermatogenesis, control of ciliary activity, development of Sertoli cells, DNA integrity in spermatozoa, and homeostasis of testicular cells. This study represents the first systematic review on male fertility traits in cattle using a system biology approach to identify key candidate genes for these traits.
Collapse
Affiliation(s)
- Pablo Augusto de Souza Fonseca
- Departamento de Biologia Geral, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario, Canada
| | | | - Stephanie Lam
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario, Canada
| | - Aroa Suárez-Vega
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario, Canada
| | - Filippo Miglior
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario, Canada
| | - Flavio S Schenkel
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario, Canada
| | | | - Samir Id-Lahoucine
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario, Canada
| | | | - Angela Cánovas
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario, Canada
| |
Collapse
|
33
|
Fonseca PADS, Id-Lahoucine S, Reverter A, Medrano JF, Fortes MS, Casellas J, Miglior F, Brito L, Carvalho MRS, Schenkel FS, Nguyen LT, Porto-Neto LR, Thomas MG, Cánovas A. Combining multi-OMICs information to identify key-regulator genes for pleiotropic effect on fertility and production traits in beef cattle. PLoS One 2018; 13:e0205295. [PMID: 30335783 PMCID: PMC6193631 DOI: 10.1371/journal.pone.0205295] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2018] [Accepted: 09/21/2018] [Indexed: 12/21/2022] Open
Abstract
The identification of biological processes related to the regulation of complex traits is a difficult task. Commonly, complex traits are regulated through a multitude of genes contributing each to a small part of the total genetic variance. Additionally, some loci can simultaneously regulate several complex traits, a phenomenon defined as pleiotropy. The lack of understanding on the biological processes responsible for the regulation of these traits results in the decrease of selection efficiency and the selection of undesirable hitchhiking effects. The identification of pleiotropic key-regulator genes can assist in developing important tools for investigating biological processes underlying complex traits. A multi-breed and multi-OMICs approach was applied to study the pleiotropic effects of key-regulator genes using three independent beef cattle populations evaluated for fertility traits. A pleiotropic map for 32 traits related to growth, feed efficiency, carcass and meat quality, and reproduction was used to identify genes shared among the different populations and breeds in pleiotropic regions. Furthermore, data-mining analyses were performed using the Cattle QTL database (CattleQTLdb) to identify the QTL category annotated in the regions around the genes shared among breeds. This approach allowed the identification of a main gene network (composed of 38 genes) shared among breeds. This gene network was significantly associated with thyroid activity, among other biological processes, and displayed a high regulatory potential. In addition, it was possible to identify genes with pleiotropic effects related to crucial biological processes that regulate economically relevant traits associated with fertility, production and health, such as MYC, PPARG, GSK3B, TG and IYD genes. These genes will be further investigated to better understand the biological processes involved in the expression of complex traits and assist in the identification of functional variants associated with undesirable phenotypes, such as decreased fertility, poor feed efficiency and negative energetic balance.
Collapse
Affiliation(s)
- Pablo Augusto de Souza Fonseca
- University of Guelph, Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, Guelph, Ontario, Canada
- Universidade Federal de Minas Gerais, Departamento de Biologia Geral, Belo Horizonte, Minas Gerais, Brazil
| | - Samir Id-Lahoucine
- University of Guelph, Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, Guelph, Ontario, Canada
| | - Antonio Reverter
- CSIRO Agriculture and Food, Queensland Bioscience Precinct, Brisbane, Queensland, Australia
| | - Juan F. Medrano
- University of California-Davis, Department of Animal Science, Davis, California, United States of America
| | - Marina S. Fortes
- The University of Queensland, School of Chemistry and Molecular Biosciences, Brisbane, Queensland, Australia
| | - Joaquim Casellas
- Universitat Autònoma de Barcelona, Departament de Ciència Animal i dels Aliments, Barcelona, Bellaterra, Barcelona, Spain
| | - Filippo Miglior
- University of Guelph, Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, Guelph, Ontario, Canada
- Canadian Dairy Network, Guelph, Ontario, Canada
| | - Luiz Brito
- University of Guelph, Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, Guelph, Ontario, Canada
| | - Maria Raquel S. Carvalho
- Universidade Federal de Minas Gerais, Departamento de Biologia Geral, Belo Horizonte, Minas Gerais, Brazil
| | - Flávio S. Schenkel
- University of Guelph, Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, Guelph, Ontario, Canada
| | - Loan T. Nguyen
- The University of Queensland, School of Chemistry and Molecular Biosciences, Brisbane, Queensland, Australia
| | - Laercio R. Porto-Neto
- CSIRO Agriculture and Food, Queensland Bioscience Precinct, Brisbane, Queensland, Australia
| | - Milton G. Thomas
- Colorado State University, Department of Animal Science, Fort-Colins, Colorado, United States of America
| | - Angela Cánovas
- University of Guelph, Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, Guelph, Ontario, Canada
- * E-mail:
| |
Collapse
|
34
|
Whalen A, Gorjanc G, Ros-Freixedes R, Hickey JM. Assessment of the performance of hidden Markov models for imputation in animal breeding. Genet Sel Evol 2018; 50:44. [PMID: 30223768 PMCID: PMC6142395 DOI: 10.1186/s12711-018-0416-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2017] [Accepted: 09/05/2018] [Indexed: 12/31/2022] Open
Abstract
Background In this paper, we review the performance of various hidden Markov model-based imputation methods in animal breeding populations. Traditionally, pedigree and heuristic-based imputation methods have been used for imputation in large animal populations due to their computational efficiency, scalability, and accuracy. Recent advances in the area of human genetics have increased the ability of probabilistic hidden Markov model methods to perform accurate phasing and imputation in large populations. These advances may enable these methods to be useful for routine use in large animal populations, particularly in populations where pedigree information is not readily available. Methods To test the performance of hidden Markov model-based imputation, we evaluated the accuracy and computational cost of several methods in a series of simulated populations and a real animal population without using a pedigree. First, we tested single-step (diploid) imputation, which performs both phasing and imputation. Second, we tested pre-phasing followed by haploid imputation. Overall, we used four available diploid imputation methods (fastPHASE, Beagle v4.0, IMPUTE2, and MaCH), three phasing methods, (SHAPEIT2, HAPI-UR, and Eagle2), and three haploid imputation methods (IMPUTE2, Beagle v4.1, and Minimac3). Results We found that performing pre-phasing and haploid imputation was faster and more accurate than diploid imputation. In particular, among all the methods tested, pre-phasing with Eagle2 or HAPI-UR and imputing with Minimac3 or IMPUTE2 gave the highest accuracies with both simulated and real data. Conclusions The results of this study suggest that hidden Markov model-based imputation algorithms are an accurate and computationally feasible approach for performing imputation without a pedigree when pre-phasing and haploid imputation are used. Of the algorithms tested, the combination of Eagle2 and Minimac3 gave the highest accuracy across the simulated and real datasets.
Collapse
Affiliation(s)
- Andrew Whalen
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, Scotland, UK.
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, Scotland, UK
| | - Roger Ros-Freixedes
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, Scotland, UK
| | - John M Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, Scotland, UK
| |
Collapse
|
35
|
Nandolo W, Utsunomiya YT, Mészáros G, Wurzinger M, Khayadzadeh N, Torrecilha RBP, Mulindwa HA, Gondwe TN, Waldmann P, Ferenčaković M, Garcia JF, Rosen BD, Bickhart D, van Tassell CP, Curik I, Sölkner J. Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances. Genet Sel Evol 2018; 50:43. [PMID: 30134820 PMCID: PMC6106898 DOI: 10.1186/s12711-018-0414-x] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Accepted: 07/30/2018] [Indexed: 12/22/2022] Open
Abstract
Background Runs of homozygosity (ROH) islands are stretches of homozygous sequence in the genome of a large proportion of individuals in a population. Algorithms for the detection of ROH depend on the similarity of haplotypes. Coverage gaps and copy number variants (CNV) may result in incorrect identification of such similarity, leading to the detection of ROH islands where none exists. Misidentified hemizygous regions will also appear as homozygous based on sequence variation alone. Our aim was to identify ROH islands influenced by marker coverage gaps or CNV, using Illumina BovineHD BeadChip (777 K) single nucleotide polymorphism (SNP) data for Austrian Brown Swiss, Tyrol Grey and Pinzgauer cattle. Methods ROH were detected using clustering, and ROH islands were determined from population inbreeding levels for each marker. CNV were detected using a multivariate copy number analysis method and a hidden Markov model. SNP coverage gaps were defined as genomic regions with intermarker distances on average longer than 9.24 kb. ROH islands that overlapped CNV regions (CNVR) or SNP coverage gaps were considered as potential artefacts. Permutation tests were used to determine if overlaps between CNVR with copy losses and ROH islands were due to chance. Diversity of the haplotypes in the ROH islands was assessed by haplotype analyses. Results In Brown Swiss, Tyrol Grey and Pinzgauer, we identified 13, 22, and 24 ROH islands covering 26.6, 389.0 and 35.8 Mb, respectively, and we detected 30, 50 and 71 CNVR derived from CNV by using both algorithms, respectively. Overlaps between ROH islands, CNVR or coverage gaps occurred for 7, 14 and 16 ROH islands, respectively. About 37, 44 and 52% of the ROH islands coverage in Brown Swiss, Tyrol Grey and Pinzgauer, respectively, were affected by copy loss. Intersections between ROH islands and CNVR were small, but significantly larger compared to ROH islands at random locations across the genome, implying an association between ROH islands and CNVR. Haplotype diversity for reliable ROH islands was lower than for ROH islands that intersected with copy loss CNVR. Conclusions Our findings show that a significant proportion of the ROH islands in the bovine genome are artefacts due to CNV or SNP coverage gaps. Electronic supplementary material The online version of this article (10.1186/s12711-018-0414-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Wilson Nandolo
- Division of Livestock Sciences (NUWI), University of Natural Resources and Life Sciences, Gregor-Mendel Strasse 33, 1180, Vienna, Austria.,Lilongwe University of Agriculture and Natural Resources, P. O. Box 219, Lilongwe, Malawi
| | - Yuri T Utsunomiya
- School of Agricultural and Veterinarian Sciences, Jaboticabal, Department of Preventive Veterinary Medicine and Animal Reproduction, São Paulo State University (UNESP), São Paulo, Brazil
| | - Gábor Mészáros
- Division of Livestock Sciences (NUWI), University of Natural Resources and Life Sciences, Gregor-Mendel Strasse 33, 1180, Vienna, Austria.
| | - Maria Wurzinger
- Division of Livestock Sciences (NUWI), University of Natural Resources and Life Sciences, Gregor-Mendel Strasse 33, 1180, Vienna, Austria
| | - Negar Khayadzadeh
- Division of Livestock Sciences (NUWI), University of Natural Resources and Life Sciences, Gregor-Mendel Strasse 33, 1180, Vienna, Austria
| | - Rafaela B P Torrecilha
- School of Agricultural and Veterinarian Sciences, Jaboticabal, Department of Preventive Veterinary Medicine and Animal Reproduction, São Paulo State University (UNESP), São Paulo, Brazil
| | - Henry A Mulindwa
- National Livestock Resources Research Institute, P.O Box 96, Tororo, Uganda
| | - Timothy N Gondwe
- Lilongwe University of Agriculture and Natural Resources, P. O. Box 219, Lilongwe, Malawi
| | - Patrik Waldmann
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Box 7023, 750 07, Uppsala, Sweden
| | - Maja Ferenčaković
- Department of Animal Science, Faculty of Agriculture, University of Zagreb, Svetošimunska cesta 25, 10000, Zagreb, Croatia
| | - José F Garcia
- School of Agricultural and Veterinarian Sciences, Jaboticabal, Department of Preventive Veterinary Medicine and Animal Reproduction, São Paulo State University (UNESP), São Paulo, Brazil.,School of Veterinary Medicine, Araçatuba, Department of Support, Production and Animal Health, São Paulo State University (UNESP), São Paulo, Brazil
| | - Benjamin D Rosen
- Animal Genomics and Improvement Laboratory, Beltsville, MD, 20705-2350, USA
| | - Derek Bickhart
- Animal Genomics and Improvement Laboratory, Beltsville, MD, 20705-2350, USA
| | - Curt P van Tassell
- Animal Genomics and Improvement Laboratory, Beltsville, MD, 20705-2350, USA
| | - Ino Curik
- Department of Animal Science, Faculty of Agriculture, University of Zagreb, Svetošimunska cesta 25, 10000, Zagreb, Croatia
| | - Johann Sölkner
- Division of Livestock Sciences (NUWI), University of Natural Resources and Life Sciences, Gregor-Mendel Strasse 33, 1180, Vienna, Austria
| |
Collapse
|
36
|
Goszczynski D, Molina A, Terán E, Morales-Durand H, Ross P, Cheng H, Giovambattista G, Demyda-Peyrás S. Runs of homozygosity in a selected cattle population with extremely inbred bulls: Descriptive and functional analyses revealed highly variable patterns. PLoS One 2018; 13:e0200069. [PMID: 29985951 PMCID: PMC6037354 DOI: 10.1371/journal.pone.0200069] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2017] [Accepted: 06/19/2018] [Indexed: 01/05/2023] Open
Abstract
The analysis of runs of homozygosity (ROH), using high throughput genomic data, has become a valuable and frequently used methodology to characterize the genomic and inbreeding variation of livestock and wildlife animal populations. However, this methodology has been scarcely used in highly inbred domestic animals. Here, we analyzed and characterized the occurrence of ROH fragments in highly inbred (HI; average pedigree-based inbreeding coefficient FPED = 0.164; 0.103 to 0.306) and outbred Retinta bulls (LI; average FPED = 0.008; 0 to 0.025). We studied the length of the fragments, their abundance, and genome distribution using high-density microarray data. The number of ROH was significantly higher in the HI group, especially for long fragments (>8Mb). In the LI group, the number of ROH continuously decreased with fragment length. Genome-wide distribution of ROH was highly variable between samples. Some chromosomes presented a larger number of fragments (BTA1, BTA19, BTA29), others had longer fragments (BTA4, BTA12, BTA17), while other ones showed an increased ROH accumulation over specific loci (BTA2, BTA7, BTA23, BTA29). Similar differences were observed in the analysis of 12 individuals produced by a similar inbred event (FPED3 = 0.125). The correlation between the fraction of the genome covered by ROH (FROH) and FPED was high (0.79), suggesting that ROH-based estimations are indicative of inbreeding levels. On the other hand, the correlation between FPED and the microsatellite-based inbreeding coefficient (FMIC) was only moderate (r = 0.44), suggesting that STR-based inbreeding estimations should be avoided. Similarly, we found a very low correlation (r = -0.0132) between recombination rate and ROH abundance across the genome. Finally, we performed functional annotation analyses of genome regions with significantly enriched ROH abundance. Results revealed gene clusters related to pregnancy-associated proteins and immune reaction. The same analysis performed for regions enriched with recently formed ROH (> 8 Mb) showed gene clusters related to flagellum assembly. In both cases, the processes were related to male and female reproductive functions, which may partially explain the reduced fertility associated with inbred populations.
Collapse
Affiliation(s)
- Daniel Goszczynski
- IGEVET–Instituto de Genética Veterinaria "Ing. Fernando N. Dulout” (UNLP-CONICET LA PLATA), Facultad de Ciencias Veterinarias UNLP, La Plata, Argentina
| | - Antonio Molina
- Departamento de Genética, Universidad de Córdoba, Córdoba, España
| | - Ester Terán
- IGEVET–Instituto de Genética Veterinaria "Ing. Fernando N. Dulout” (UNLP-CONICET LA PLATA), Facultad de Ciencias Veterinarias UNLP, La Plata, Argentina
| | - Hernán Morales-Durand
- IGEVET–Instituto de Genética Veterinaria "Ing. Fernando N. Dulout” (UNLP-CONICET LA PLATA), Facultad de Ciencias Veterinarias UNLP, La Plata, Argentina
| | - Pablo Ross
- Department of Animal Science, University of California, Davis, Davis, California, United States of America
| | - Hao Cheng
- Department of Animal Science, University of California, Davis, Davis, California, United States of America
| | - Guillermo Giovambattista
- IGEVET–Instituto de Genética Veterinaria "Ing. Fernando N. Dulout” (UNLP-CONICET LA PLATA), Facultad de Ciencias Veterinarias UNLP, La Plata, Argentina
- Departamento de Genética, Universidad de Córdoba, Córdoba, España
- Department of Animal Science, University of California, Davis, Davis, California, United States of America
- Departamento de Producción Animal, Facultad de Ciencias Veterinarias, Universidad Nacional de La Plata, La Plata, Buenos Aires, Argentina
| | - Sebastián Demyda-Peyrás
- IGEVET–Instituto de Genética Veterinaria "Ing. Fernando N. Dulout” (UNLP-CONICET LA PLATA), Facultad de Ciencias Veterinarias UNLP, La Plata, Argentina
- Departamento de Genética, Universidad de Córdoba, Córdoba, España
- Department of Animal Science, University of California, Davis, Davis, California, United States of America
- Departamento de Producción Animal, Facultad de Ciencias Veterinarias, Universidad Nacional de La Plata, La Plata, Buenos Aires, Argentina
- * E-mail:
| |
Collapse
|
37
|
Bejarano D, Martínez R, Manrique C, Parra LM, Rocha JF, Gómez Y, Abuabara Y, Gallego J. Linkage disequilibrium levels and allele frequency distribution in Blanco Orejinegro and Romosinuano Creole cattle using medium density SNP chip data. Genet Mol Biol 2018; 41:426-433. [PMID: 30088613 PMCID: PMC6082240 DOI: 10.1590/1678-4685-gmb-2016-0310] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2016] [Accepted: 09/20/2017] [Indexed: 11/22/2022] Open
Abstract
The linkage disequilibrium (LD) between molecular markers affects the accuracy of
genome-wide association studies and genomic selection application. High-density
genotyping platforms allow identifying the genotype of thousands of single
nucleotide polymorphisms (SNPs) distributed throughout the animal genomes, which
increases the resolution of LD evaluations. This study evaluated the
distribution of minor allele frequencies (MAF) and the level of LD in the
Colombian Creole cattle breeds Blanco Orejinegro (BON) and Romosinuano (ROMO)
using a medium density SNP panel (BovineSNP50K_v2). The LD decay in these breeds
was lower than those reported for other taurine breeds, achieving optimal LD
values (r2 ≥ 0.3) up to a distance of 70 kb in BON and 100 kb in
ROMO, which is possibly associated with the conservation status of these cattle
populations and their effective population size. The average MAF for both breeds
was 0.27 ± 0.14 with a higher SNP proportion having high MAF values (≥ 0.3). The
LD levels and distribution of allele frequencies found in this study suggest
that it is possible to have adequate coverage throughout the genome of these
breeds using the BovineSNP50K_v2, capturing the effect of most QTL related with
productive traits, and ensuring an adequate prediction capacity in genomic
analysis.
Collapse
Affiliation(s)
- Diego Bejarano
- Corporación Colombiana de Investigación Agropecuaria - Corpoica. Centro de Investigación Tibaitatá, Cundinamarca, Colombia
| | - Rodrigo Martínez
- Corporación Colombiana de Investigación Agropecuaria - Corpoica. Centro de Investigación Tibaitatá, Cundinamarca, Colombia
| | | | - Luis Miguel Parra
- Corporación Colombiana de Investigación Agropecuaria - Corpoica. Centro de Investigación Tibaitatá, Cundinamarca, Colombia
| | - Juan Felipe Rocha
- Corporación Colombiana de Investigación Agropecuaria - Corpoica. Centro de Investigación Obonuco, Nariño, Colombia
| | - Yolanda Gómez
- Corporación Colombiana de Investigación Agropecuaria - Corpoica. Centro de Investigación Tibaitatá, Cundinamarca, Colombia
| | - Yesid Abuabara
- Corporación Colombiana de Investigación Agropecuaria - Corpoica. Centro de Investigación Turipaná, Córdoba, Colombia
| | - Jaime Gallego
- Corporación Colombiana de Investigación Agropecuaria - Corpoica. Centro de Investigación El Nus, Antioquia, Colombia
| |
Collapse
|
38
|
Larmer SG, Sargolzaei M, Brito LF, Ventura RV, Schenkel FS. Novel methods for genotype imputation to whole-genome sequence and a simple linear model to predict imputation accuracy. BMC Genet 2017; 18:120. [PMID: 29281958 PMCID: PMC5746022 DOI: 10.1186/s12863-017-0588-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2017] [Accepted: 12/15/2017] [Indexed: 11/10/2022] Open
Abstract
Background Accurate imputation plays a major role in genomic studies of livestock industries, where the number of genotyped or sequenced animals is limited by costs. This study explored methods to create an ideal reference population for imputation to Next Generation Sequencing data in cattle. Methods Methods for clustering of animals for imputation were explored, using 1000 Bull Genomes Project sequence data on 1146 animals from a variety of beef and dairy breeds. Imputation from 50 K to 777 K was first carried out to choose an ideal clustering method, using ADMIXTURE or PLINK clustering algorithms with either genotypes or reconstructed haplotypes. Results Due to efficiency, accuracy and ease of use, clustering with PLINK using haplotypes as quasi-genotypes was chosen as the most advantageous grouping method. It was found that using a clustered population slightly decreased computing time, while maintaining accuracy across the population. Although overall accuracy remained the same, a slight increase in accuracy was observed for groups of animals in some breeds (primarily purebred beef cattle from breeds with fewer sequenced animals) and for other groups, primarily crossbreed animals, a slight decrease in accuracy was observed. However, it was noted that some animals in each breed were poorly imputed across all methods. When imputed sequences were included in the reference population to aid imputation of poorly imputed animals, a small increase in overall accuracy was observed for nearly every individual in the population. Two models were created to predict imputation accuracy, a complete model using all information available including Euclidean distances from genotypes and haplotypes, pedigree information, and clustering groups and a simple model using only breed and an Euclidean distance matrix as predictors. Both models were successful in predicting imputation accuracy, with correlations between predicted and true imputation accuracy as measured by concordance rate of 0.87 and 0.83, respectively. Conclusions A clustering methodology can be very useful to subgroup cattle for efficient genotype imputation. In addition, accuracy of genotype imputation from medium to high-density Single Nucleotide Polymorphisms (SNP) chip panels to whole-genome sequence can be predicted well using a simple linear model defined in this study.
Collapse
Affiliation(s)
- Steven G Larmer
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, 50 Stone Road East, Guelph, ON, N1G 2W1, Canada. .,The Semex Alliance, 5653 Highway 6 North, Guelph, ON, N1H 6J2, Canada.
| | - Mehdi Sargolzaei
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, 50 Stone Road East, Guelph, ON, N1G 2W1, Canada.,The Semex Alliance, 5653 Highway 6 North, Guelph, ON, N1H 6J2, Canada
| | - Luiz F Brito
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, 50 Stone Road East, Guelph, ON, N1G 2W1, Canada
| | - Ricardo V Ventura
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, 50 Stone Road East, Guelph, ON, N1G 2W1, Canada.,Bringing Intelligence Opportunities, 294 Mill St. East, Elora, ON, N0B 1S0, Canada
| | - Flávio S Schenkel
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, 50 Stone Road East, Guelph, ON, N1G 2W1, Canada
| |
Collapse
|
39
|
A method for allocating low-coverage sequencing resources by targeting haplotypes rather than individuals. Genet Sel Evol 2017; 49:78. [PMID: 29070022 PMCID: PMC5655873 DOI: 10.1186/s12711-017-0353-y] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2017] [Accepted: 10/18/2017] [Indexed: 11/25/2022] Open
Abstract
Background This paper describes a heuristic method for allocating low-coverage sequencing resources by targeting haplotypes rather than individuals. Low-coverage sequencing assembles high-coverage sequence information for every individual by accumulating data from the genome segments that they share with many other individuals into consensus haplotypes. Deriving the consensus haplotypes accurately is critical for achieving a high phasing and imputation accuracy. In order to enable accurate phasing and imputation of sequence information for the whole population, we allocate the available sequencing resources among individuals with existing phased genomic data by targeting the sequencing coverage of their haplotypes. Results Our method, called AlphaSeqOpt, prioritizes haplotypes using a score function that is based on the frequency of the haplotypes in the sequencing set relative to the target coverage. AlphaSeqOpt has two steps: (1) selection of an initial set of individuals by iteratively choosing the individuals that have the maximum score conditional on the current set, and (2) refinement of the set through several rounds of exchanges of individuals. AlphaSeqOpt is very effective for distributing a fixed amount of sequencing resources evenly across haplotypes, which results in a reduction of the proportion of haplotypes that are sequenced below the target coverage. AlphaSeqOpt can provide a greater proportion of haplotypes sequenced at the target coverage by sequencing less individuals, as compared with other methods that use a score function based on haplotype frequencies in the population. A refinement of the initially selected set can provide a larger more diverse set with more unique individuals, which is beneficial in the context of low-coverage sequencing. We extend the method with an approach for filtering rare haplotypes based on their flanking haplotypes, so that only those that are likely to derive from a recombination event are targeted. Conclusions We present a method for allocating sequencing resources so that a greater proportion of haplotypes are sequenced at a coverage that is sufficiently high for population-based imputation with low-coverage sequencing. The haplotype score function, the refinement step, and the new approach for filtering rare haplotypes make AlphaSeqOpt more effective for that purpose than previously reported methods for reducing sequencing redundancy. Electronic supplementary material The online version of this article (doi:10.1186/s12711-017-0353-y) contains supplementary material, which is available to authorized users.
Collapse
|
40
|
Cañas-Álvarez JJ, Mouresan EF, Varona L, Díaz C, Molina A, Baro JA, Altarriba J, Carabaño MJ, Casellas J, Piedrafita J. Linkage disequilibrium, persistence of phase, and effective population size in Spanish local beef cattle breeds assessed through a high-density single nucleotide polymorphism chip. J Anim Sci 2017; 94:2779-88. [PMID: 27482665 DOI: 10.2527/jas.2016-0425] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
Linkage disequilibrium (LD) and persistence of phase are fundamental approaches for exploring the genetic basis of economically important traits in cattle, including the identification of QTL for genomic selection and the estimation of effective population size () to determine the size of the training populations. In this study, we have used the Illumina BovineHD chip in 168 trios of 7 Spanish beef cattle breeds to obtain an overview of the magnitude of LD and the persistence of LD phase through the physical distance between markers. Also, we estimated the time of divergence based on the persistence of the LD phase and calculated past from LD estimates using different alternatives to define the recombination rate. Estimates of average (as a measure of LD) for adjacent markers were close to 0.52 in the 7 breeds and decreased with the distance between markers, although in long distances, some LD still remained (0.07 and 0.05 for markers 200 kb and 1 Mb apart, respectively). A panel with a lower boundary of 38,000 SNP would be necessary to launch a successful within-breed genomic selection program. Persistence of phase, measured as the pairwise correlations between estimates of in 2 breeds at short distances (10 kb), was in the 0.89 to 0.94 range and decreased from 0.33 to 0.52 to a range of 0.01 to 0.08 when marker distance increased from 200 kb to 1 Mb, respectively. The magnitude of the persistence of phase between the Spanish beef breeds was similar to those found in dairy breeds. For across-breed genomic selection, the size of the SNP panels must be in the range of 50,000 to 83,000 SNP. Estimates of past showed values ranging from 26 to 31 for 1 generation ago in all breeds. The divergence among breeds occurred between 129 and 207 generations ago. The results of this study are relevant for the future implementation of within- and across-breed genomic selection programs in the Spanish beef cattle populations. Our results suggest that a reduced subset of the SNP panel would be enough to achieve an adequate precision of the genomic predictions.
Collapse
|
41
|
Gonen S, Ros-Freixedes R, Battagin M, Gorjanc G, Hickey JM. A method for the allocation of sequencing resources in genotyped livestock populations. Genet Sel Evol 2017; 49:47. [PMID: 28521728 PMCID: PMC5437657 DOI: 10.1186/s12711-017-0322-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2016] [Accepted: 05/12/2017] [Indexed: 11/18/2022] Open
Abstract
Background This paper describes a method, called AlphaSeqOpt, for the allocation of sequencing resources in livestock populations with existing phased genomic data to maximise the ability to phase and impute sequenced haplotypes into the whole population. Methods We present two algorithms. The first selects focal individuals that collectively represent the maximum possible portion of the haplotype diversity in the population. The second allocates a fixed sequencing budget among the families of focal individuals to enable phasing of their haplotypes at the sequence level. We tested the performance of the two algorithms in simulated pedigrees. For each pedigree, we evaluated the proportion of population haplotypes that are carried by the focal individuals and compared our results to a variant of the widely-used key ancestors approach and to two haplotype-based approaches. We calculated the expected phasing accuracy of the haplotypes of a focal individual at the sequence level given the proportion of the fixed sequencing budget allocated to its family. Results AlphaSeqOpt maximises the ability to capture and phase the most frequent haplotypes in a population in three ways. First, it selects focal individuals that collectively represent a larger portion of the population haplotype diversity than existing methods. Second, it selects focal individuals from across the pedigree whose haplotypes can be easily phased using family-based phasing and imputation algorithms, thus maximises the ability to impute sequence into the rest of the population. Third, it allocates more of the fixed sequencing budget to focal individuals whose haplotypes are more frequent in the population than to focal individuals whose haplotypes are less frequent. Unlike existing methods, we additionally present an algorithm to allocate part of the sequencing budget to the families (i.e. immediate ancestors) of focal individuals to ensure that their haplotypes can be phased at the sequence level, which is essential for enabling and maximising subsequent sequence imputation. Conclusions We present a new method for the allocation of a fixed sequencing budget to focal individuals and their families such that the final sequenced haplotypes, when phased at the sequence level, represent the maximum possible portion of the haplotype diversity in the population that can be sequenced and phased at that budget. Electronic supplementary material The online version of this article (doi:10.1186/s12711-017-0322-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Serap Gonen
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Roger Ros-Freixedes
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Mara Battagin
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - John M Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK.
| |
Collapse
|
42
|
Brouard JS, Boyle B, Ibeagha-Awemu EM, Bissonnette N. Low-depth genotyping-by-sequencing (GBS) in a bovine population: strategies to maximize the selection of high quality genotypes and the accuracy of imputation. BMC Genet 2017; 18:32. [PMID: 28381212 PMCID: PMC5382419 DOI: 10.1186/s12863-017-0501-y] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Accepted: 03/30/2017] [Indexed: 01/01/2023] Open
Abstract
Background Genotyping-by-sequencing (GBS) has emerged as a powerful and cost-effective approach for discovering and genotyping single-nucleotide polymorphisms. The GBS technique was largely used in crop species where its low sequence coverage is not a drawback for calling genotypes because inbred lines are almost homozygous. In contrast, only a few studies used the GBS technique in animal populations (with sizeable heterozygosity rates) and many of those that have been published did not consider the quality of the genotypes produced by the bioinformatic pipelines. To improve the sequence coverage of the fragments, an alternative GBS preparation protocol that includes selective primers during the PCR amplification step has been recently proposed. In this study, we compared this modified protocol with the conventional two-enzyme GBS protocol. We also described various procedures to maximize the selection of high quality genotypes and to increase the accuracy of imputation. Results The in silico digestions of the bovine genome showed that the combination of PstI and MspI is more suitable for sequencing bovine GBS libraries than the use of single digestions with PstI or ApeKI. The sequencing output of the GBS libraries generated a total of 123,666 variants with the selective-primer approach and 272,103 variants with the conventional approach. Validating our data with genotypes obtained from mass spectrometry and Illumina’s bovine SNP50 array, we found that the genotypes produced by the conventional GBS method were concordant with those produced by these alternative genotyping methods, whereas the selective-primer method failed to call heterozygotes with confidence. Our results indicate that high accuracy in genotype calling (>97%) can be obtained using low read-depth thresholds (3 to 5 reads) provided that markers are simultaneously filtered for genotype quality scores. We also show that factors such as the minimum call rate and the minor allele frequency positively influence the accuracy of imputation of missing GBS data. The highest accuracies (around 85%) of imputed GBS markers were obtained with the FIMPUTE program when GBS and SNP50 array genotypes were combined (80,190 to 100,297 markers) before imputation. Conclusions We discovered that the conventional two-enzyme GBS protocol could produce a large number of high-quality genotypes provided that appropriate filtration criteria were used. In contrast, the selective-primer approach resulted in a substantial proportion of miscalled genotypes and should be avoided for livestock genotyping studies. Overall, our study demonstrates that carefully adjusting the different filtering parameters applied to the GBS data is critical to maximize the selection of high quality genotypes and to increase the accuracy of imputation of missing data. The strategies and results presented here provide a framework to maximize the output of the GBS technique in animal populations and qualified the PstI/MspI GBS assay as a low-cost high-density genotyping platform. The conclusions reported here regarding read-depth and genotype quality filtering could benefit many GBS applications, notably genome-wide association studies, where there is a need to increase the density of markers genotyped across the target population while preserving the quality of genotypes.
Collapse
Affiliation(s)
- Jean-Simon Brouard
- Research and Developent Center of Sherbrooke, Agriculture and Agri-Food Canada, Sherbrooke, QC, Canada
| | - Brian Boyle
- Plateforme d'analyses génomiques, Institut de biologie intégrative et des systèmes, Université Laval, Quebec City, QC, Canada
| | - Eveline M Ibeagha-Awemu
- Research and Developent Center of Sherbrooke, Agriculture and Agri-Food Canada, Sherbrooke, QC, Canada
| | - Nathalie Bissonnette
- Research and Developent Center of Sherbrooke, Agriculture and Agri-Food Canada, Sherbrooke, QC, Canada.
| |
Collapse
|
43
|
Antolín R, Nettelblad C, Gorjanc G, Money D, Hickey JM. A hybrid method for the imputation of genomic data in livestock populations. Genet Sel Evol 2017; 49:30. [PMID: 28253858 PMCID: PMC5439152 DOI: 10.1186/s12711-017-0300-y] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2016] [Accepted: 02/13/2017] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND This paper describes a combined heuristic and hidden Markov model (HMM) method to accurately impute missing genotypes in livestock datasets. Genomic selection in breeding programs requires high-density genotyping of many individuals, making algorithms that economically generate this information crucial. There are two common classes of imputation methods, heuristic methods and probabilistic methods, the latter being largely based on hidden Markov models. Heuristic methods are robust, but fail to impute markers in regions where the thresholds of heuristic rules are not met, or the pedigree is inconsistent. Hidden Markov models are probabilistic methods which typically do not require specific family structures or pedigree information, making them very flexible, but they are computationally expensive and, in some cases, less accurate. RESULTS We implemented a new hybrid imputation method that combined heuristic and HMM methods, AlphaImpute and MaCH, and compared the computation time and imputation accuracy of the three methods. AlphaImpute was the fastest, followed by the hybrid method and then the HMM. The computation time of the hybrid method and the HMM increased linearly with the number of iterations used in the hidden Markov model, however, the computation time of the hybrid method increased almost linearly and that of the HMM quadratically with the number of template haplotypes. The hybrid method was the most accurate imputation method for low-density panels when pedigree information was missing, especially if minor allele frequency was also low. The accuracy of the hybrid method and the HMM increased with the number of template haplotypes. The imputation accuracy of all three methods increased with the marker density of the low-density panels. Excluding the pedigree information reduced imputation accuracy for the hybrid method and AlphaImpute. Finally, the imputation accuracy of the three methods decreased with decreasing minor allele frequency. CONCLUSIONS The hybrid heuristic and probabilistic imputation method is able to impute all markers for all individuals in a population, as the HMM. The hybrid method is usually more accurate and never significantly less accurate than a purely heuristic method or a purely probabilistic method and is faster than a standard probabilistic method.
Collapse
Affiliation(s)
- Roberto Antolín
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Research Centre, Midlothian, EH25 9RG Scotland, UK
| | - Carl Nettelblad
- Division of Scientific Computing, Department of Information Technology, Science for Life Laboratory, Uppsala University, Lägerhyddsvägen 2, Box 337, 751 05 Uppsala, Sweden
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Research Centre, Midlothian, EH25 9RG Scotland, UK
| | - Daniel Money
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Research Centre, Midlothian, EH25 9RG Scotland, UK
| | - John M. Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Research Centre, Midlothian, EH25 9RG Scotland, UK
| |
Collapse
|
44
|
Bennewitz J, Edel C, Fries R, Meuwissen THE, Wellmann R. Application of a Bayesian dominance model improves power in quantitative trait genome-wide association analysis. Genet Sel Evol 2017; 49:7. [PMID: 28088170 PMCID: PMC5237573 DOI: 10.1186/s12711-017-0284-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2016] [Accepted: 01/04/2017] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND Multi-marker methods, which fit all markers simultaneously, were originally tailored for genomic selection purposes, but have proven to be useful also in association analyses, especially the so-called BayesC Bayesian methods. In a recent study, BayesD extended BayesC towards accounting for dominance effects and improved prediction accuracy and persistence in genomic selection. The current study investigated the power and precision of BayesC and BayesD in genome-wide association studies by means of stochastic simulations and applied these methods to a dairy cattle dataset. METHODS The simulation protocol was designed to mimic the genetic architecture of quantitative traits as realistically as possible. Special emphasis was put on the joint distribution of the additive and dominance effects of causative mutations. Additive marker effects were estimated by BayesC and additive and dominance effects by BayesD. The dependencies between additive and dominance effects were modelled in BayesD by choosing appropriate priors. A sliding-window approach was used. For each window, the R. Fernando window posterior probability of association was calculated and this was used for inference purpose. The power to map segregating causal effects and the mapping precision were assessed for various marker densities up to full sequence information and various window sizes. RESULTS Power to map a QTL increased with higher marker densities and larger window sizes. This held true for both methods. Method BayesD had improved power compared to BayesC. The increase in power was between -2 and 8% for causative genes that explained more than 2.5% of the genetic variance. In addition, inspection of the estimates of genomic window dominance variance allowed for inference about the magnitude of dominance at significant associations, which remains hidden in BayesC analysis. Mapping precision was not substantially improved by BayesD. CONCLUSIONS BayesD improved power, but precision only slightly. Application of BayesD needs large datasets with genotypes and own performance records as phenotypes. Given the current efforts to establish cow reference populations in dairy cattle genomic selection schemes, such datasets are expected to be soon available, which will enable the application of BayesD for association mapping and genomic prediction purposes.
Collapse
Affiliation(s)
- Jörn Bennewitz
- Institute of Animal Science, University of Hohenheim, 70593 Stuttgart, Germany
| | - Christian Edel
- Institute of Animal Breeding, Bavarian State Research Center for Agriculture, 85580 Grub, Germany
| | - Ruedi Fries
- Chair of Animal Breeding, Technical University of Munich, Liesel-Beckmann-Strasse 1, 85354 Freising, Germany
| | - Theo H. E. Meuwissen
- Institute of Animal and Aquacultural Science, Norwegian University of Life Science, 1450 Aas, Norway
| | - Robin Wellmann
- Institute of Animal Science, University of Hohenheim, 70593 Stuttgart, Germany
| |
Collapse
|
45
|
Gonen S, Jenko J, Gorjanc G, Mileham AJ, Whitelaw CBA, Hickey JM. Potential of gene drives with genome editing to increase genetic gain in livestock breeding programs. Genet Sel Evol 2017; 49:3. [PMID: 28093068 PMCID: PMC5240390 DOI: 10.1186/s12711-016-0280-3] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Accepted: 12/14/2016] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND This paper uses simulation to explore how gene drives can increase genetic gain in livestock breeding programs. Gene drives are naturally occurring phenomena that cause a mutation on one chromosome to copy itself onto its homologous chromosome. METHODS We simulated nine different breeding and editing scenarios with a common overall structure. Each scenario began with 21 generations of selection, followed by 20 generations of selection based on true breeding values where the breeder used selection alone, selection in combination with genome editing, or selection with genome editing and gene drives. In the scenarios that used gene drives, we varied the probability of successfully incorporating the gene drive. For each scenario, we evaluated genetic gain, genetic variance [Formula: see text], rate of change in inbreeding ([Formula: see text]), number of distinct quantitative trait nucleotides (QTN) edited, rate of increase in favourable allele frequencies of edited QTN and the time to fix favourable alleles. RESULTS Gene drives enhanced the benefits of genome editing in seven ways: (1) they amplified the increase in genetic gain brought about by genome editing; (2) they amplified the rate of increase in the frequency of favourable alleles and reduced the time it took to fix them; (3) they enabled more rapid targeting of QTN with lesser effect for genome editing; (4) they distributed fixed editing resources across a larger number of distinct QTN across generations; (5) they focussed editing on a smaller number of QTN within a given generation; (6) they reduced the level of inbreeding when editing a subset of the sires; and (7) they increased the efficiency of converting genetic variation into genetic gain. CONCLUSIONS Genome editing in livestock breeding results in short-, medium- and long-term increases in genetic gain. The increase in genetic gain occurs because editing increases the frequency of favourable alleles in the population. Gene drives accelerate the increase in allele frequency caused by editing, which results in even higher genetic gain over a shorter period of time with no impact on inbreeding.
Collapse
Affiliation(s)
- Serap Gonen
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Janez Jenko
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | | | - C. Bruce A. Whitelaw
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - John M. Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| |
Collapse
|
46
|
Retelling the recent evolution of genetic diversity for Guzerá: Inferences from LD decay, runs of homozygosity and Ne over the generations. Livest Sci 2016. [DOI: 10.1016/j.livsci.2016.10.006] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
47
|
Identification of selective sweeps reveals divergent selection between Chinese Holstein and Simmental cattle populations. Genet Sel Evol 2016; 48:76. [PMID: 27716022 PMCID: PMC5054554 DOI: 10.1186/s12711-016-0254-5] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2015] [Accepted: 09/26/2016] [Indexed: 12/23/2022] Open
Abstract
Background The identification of signals left by recent positive selection provides a feasible approach for targeting genomic variants that underlie complex traits and fitness. A better understanding of the selection mechanisms that occurred during the evolution of species can also be gained. In this study, we simultaneously detected the genome-wide footprints of recent positive selection that occurred within and between Chinese Holstein and Simmental populations, which have been subjected to artificial selection for distinct purposes. We conducted analyses using various complementary approaches, including LRH, XP-EHH and FST, based on the Illumina 770K high-density single nucleotide polymorphism (SNP) array, to enable more comprehensive detection. Results We successfully constructed profiles of selective signals in both cattle populations. To further annotate these regions, we identified a set of novel functional genes related to growth, reproduction, immune response and milk production. There were no overlapping candidate windows between the two breeds. Finally, we investigated the distribution of SNPs that had low FST values across five distinct functional regions in the genome. In the low-minor allele frequency bin, we found a higher proportion of low-FST SNPs in the exons of the bovine genome, which indicates strong purifying selection of the exons. Conclusions The selection signatures identified in these two populations demonstrated positive selection pressure on a set of important genes with potential functions that are involved in many biological processes. We also demonstrated that in the bovine genome, exons were under strong purifying selection. Our findings provide insight into the mechanisms of artificial selection and will facilitate follow-up functional studies of potential candidate genes that are related to various economically important traits in cattle. Electronic supplementary material The online version of this article (doi:10.1186/s12711-016-0254-5) contains supplementary material, which is available to authorized users.
Collapse
|
48
|
Linkage disequilibrium and haplotype block structure in Limousin, Simmental and native Polish Red cattle. Livest Sci 2016. [DOI: 10.1016/j.livsci.2016.07.008] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
49
|
Vandenplas J, Calus MPL, Sevillano CA, Windig JJ, Bastiaansen JWM. Assigning breed origin to alleles in crossbred animals. Genet Sel Evol 2016; 48:61. [PMID: 27549177 PMCID: PMC4994281 DOI: 10.1186/s12711-016-0240-y] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2015] [Accepted: 08/10/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND For some species, animal production systems are based on the use of crossbreeding to take advantage of the increased performance of crossbred compared to purebred animals. Effects of single nucleotide polymorphisms (SNPs) may differ between purebred and crossbred animals for several reasons: (1) differences in linkage disequilibrium between SNP alleles and a quantitative trait locus; (2) differences in genetic backgrounds (e.g., dominance and epistatic interactions); and (3) differences in environmental conditions, which result in genotype-by-environment interactions. Thus, SNP effects may be breed-specific, which has led to the development of genomic evaluations for crossbred performance that take such effects into account. However, to estimate breed-specific effects, it is necessary to know breed origin of alleles in crossbred animals. Therefore, our aim was to develop an approach for assigning breed origin to alleles of crossbred animals (termed BOA) without information on pedigree and to study its accuracy by considering various factors, including distance between breeds. RESULTS The BOA approach consists of: (1) phasing genotypes of purebred and crossbred animals; (2) assigning breed origin to phased haplotypes; and (3) assigning breed origin to alleles of crossbred animals based on a library of assigned haplotypes, the breed composition of crossbred animals, and their SNP genotypes. The accuracy of allele assignments was determined for simulated datasets that include crosses between closely-related, distantly-related and unrelated breeds. Across these scenarios, the percentage of alleles of a crossbred animal that were correctly assigned to their breed origin was greater than 90 %, and increased with increasing distance between breeds, while the percentage of incorrectly assigned alleles was always less than 2 %. For the remaining alleles, i.e. 0 to 10 % of all alleles of a crossbred animal, breed origin could not be assigned. CONCLUSIONS The BOA approach accurately assigns breed origin to alleles of crossbred animals, even if their pedigree is not recorded.
Collapse
Affiliation(s)
- Jérémie Vandenplas
- Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, 6700 AH, Wageningen, The Netherlands.
| | - Mario P L Calus
- Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, 6700 AH, Wageningen, The Netherlands
| | - Claudia A Sevillano
- Topigs Norsvin Research Center B.V., 6640 AA, Beuningen, The Netherlands.,Animal Breeding and Genomics Centre, Wageningen University, 6700 AH, Wageningen, The Netherlands
| | - Jack J Windig
- Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, 6700 AH, Wageningen, The Netherlands
| | - John W M Bastiaansen
- Animal Breeding and Genomics Centre, Wageningen University, 6700 AH, Wageningen, The Netherlands
| |
Collapse
|
50
|
Niu H, Zhu B, Guo P, Zhang W, Xue J, Chen Y, Zhang L, Gao H, Gao X, Xu L, Li J. Estimation of linkage disequilibrium levels and haplotype block structure in Chinese Simmental and Wagyu beef cattle using high-density genotypes. Livest Sci 2016. [DOI: 10.1016/j.livsci.2016.05.012] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|