1
|
Durward-Akhurst SA, Marlowe JL, Schaefer RJ, Springer K, Grantham B, Carey WK, Bellone RR, Mickelson JR, McCue ME. Predicted genetic burden and frequency of phenotype-associated variants in the horse. Sci Rep 2024; 14:8396. [PMID: 38600096 PMCID: PMC11006912 DOI: 10.1038/s41598-024-57872-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 03/22/2024] [Indexed: 04/12/2024] Open
Abstract
Disease-causing variants have been identified for less than 20% of suspected equine genetic diseases. Whole genome sequencing (WGS) allows rapid identification of rare disease causal variants. However, interpreting the clinical variant consequence is confounded by the number of predicted deleterious variants that healthy individuals carry (predicted genetic burden). Estimation of the predicted genetic burden and baseline frequencies of known deleterious or phenotype associated variants within and across the major horse breeds have not been performed. We used WGS of 605 horses across 48 breeds to identify 32,818,945 variants, demonstrate a high predicted genetic burden (median 730 variants/horse, interquartile range: 613-829), show breed differences in predicted genetic burden across 12 target breeds, and estimate the high frequencies of some previously reported disease variants. This large-scale variant catalog for a major and highly athletic domestic animal species will enhance its ability to serve as a model for human phenotypes and improves our ability to discover the bases for important equine phenotypes.
Collapse
Affiliation(s)
- S A Durward-Akhurst
- Department of Veterinary Clinical Sciences, University of Minnesota, C339 VMC, 1353 Boyd Avenue, St. Paul, MN, 55108, USA.
| | - J L Marlowe
- Department of Veterinary Clinical Sciences, University of Minnesota, C339 VMC, 1353 Boyd Avenue, St. Paul, MN, 55108, USA
| | - R J Schaefer
- Department of Veterinary Population Medicine, University of Minnesota, 225 VMC, 1365 Gortner Avenue, St. Paul, MN, 55108, USA
| | - K Springer
- Department of Veterinary Population Medicine, University of Minnesota, 225 VMC, 1365 Gortner Avenue, St. Paul, MN, 55108, USA
| | - B Grantham
- Interval Bio LLC, 408 Stierline Road, Mountain View, CA, 94043, USA
| | - W K Carey
- Interval Bio LLC, 408 Stierline Road, Mountain View, CA, 94043, USA
| | - R R Bellone
- Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California-Davis, Davis, CA, USA
- Population Health and Reproduction and Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California, Davis, CA, USA
| | - J R Mickelson
- Department of Veterinary and Biomedical Sciences, University of Minnesota, 295F Animal Science Veterinary Medicine Building, 1988 Fitch Avenue, St. Paul, MN, 55108, USA
| | - M E McCue
- Department of Veterinary Population Medicine, University of Minnesota, 225 VMC, 1365 Gortner Avenue, St. Paul, MN, 55108, USA
| |
Collapse
|
2
|
Reich P, Falker-Gieske C, Pook T, Tetens J. Development and validation of a horse reference panel for genotype imputation. Genet Sel Evol 2022; 54:49. [PMID: 35787788 PMCID: PMC9252005 DOI: 10.1186/s12711-022-00740-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 06/23/2022] [Indexed: 11/10/2022] Open
Abstract
Background Genotype imputation is a cost-effective method to generate sequence-level genotypes for a large number of animals. Its application can improve the power of genomic studies, provided that the accuracy of imputation is sufficiently high. The purpose of this study was to develop an optimal strategy for genotype imputation from genotyping array data to sequence level in German warmblood horses, and to investigate the effect of different factors on the accuracy of imputation. Publicly available whole-genome sequence data from 317 horses of 46 breeds was used to conduct the analyses. Results Depending on the size and composition of the reference panel, the accuracy of imputation from medium marker density (60K) to sequence level using the software Beagle 5.1 ranged from 0.64 to 0.70 for horse chromosome 3. Generally, imputation accuracy increased as the size of the reference panel increased, but if genetically distant individuals were included in the panel, the accuracy dropped. Imputation was most precise when using a reference panel of multiple but related breeds and the software Beagle 5.1, which outperformed the other two tested computer programs, Impute 5 and Minimac 4. Genome-wide imputation for this scenario resulted in a mean accuracy of 0.66. Stepwise imputation from 60K to 670K markers and subsequently to sequence level did not improve the accuracy of imputation. However, imputation from higher density (670K) was considerably more accurate (about 0.90) than from medium density. Likewise, imputation in genomic regions with a low marker coverage resulted in a reduced accuracy of imputation. Conclusions The accuracy of imputation in horses was influenced by the size and composition of the reference panel, the marker density of the genotyping array, and the imputation software. Genotype imputation can be used to extend the limited amount of available sequence-level data from horses in order to boost the power of downstream analyses, such as genome-wide association studies, or the detection of embryonic lethal variants. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-022-00740-8.
Collapse
Affiliation(s)
- Paula Reich
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.
| | - Clemens Falker-Gieske
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.,Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, 37075, Göttingen, Germany
| | - Torsten Pook
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.,Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, 37075, Göttingen, Germany
| | - Jens Tetens
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.,Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, 37075, Göttingen, Germany
| |
Collapse
|
3
|
Gmel AI, Burger D, Neuditschko M. A Novel QTL and a Candidate Gene Are Associated with the Progressive Motility of Franches-Montagnes Stallion Spermatozoa after Thaw. Genes (Basel) 2021; 12:1501. [PMID: 34680896 PMCID: PMC8536120 DOI: 10.3390/genes12101501] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Revised: 09/20/2021] [Accepted: 09/22/2021] [Indexed: 12/12/2022] Open
Abstract
The use of frozen-thawed semen is an important reproduction tool to preserve the biodiversity of small, native horse breeds such as the Franches-Montagnes (FM). However, not all stallions produce cryotolerant semen with a progressive motility after thaw ≥ 35%. To improve our understanding of the genetic background of male fertility traits in both fresh and frozen-thawed semen, we performed genome-wide association studies (GWAS) on gel-free volume, sperm cell concentration, total sperm count, and progressive motility in fresh and frozen-thawed semen from 109 FM stallions using 335,494 genome-wide single nucleotide polymorphisms (SNPs). We identified one significant (p < 1.69 × 10-7) quantitative trait locus (QTL) on ECA6 within the SCN8A gene for progressive motility after thaw, which was previously associated with progressive motility in boars. Homozygous stallions showed a substantial drop in progressive motility after thaw. This QTL could be used to identify cryointolerant stallions, avoiding the costly cryopreservation process. Further studies are needed to confirm whether this QTL is also present in other horse breeds.
Collapse
Affiliation(s)
- Annik Imogen Gmel
- Animal GenoPhenomics, Agroscope, Route de la Tioleyre 4, 1725 Posieux, Switzerland;
- Equine Department, Vetsuisse Faculty, University of Zurich, Winterthurerstrasse 260, 8057 Zurich, Switzerland
| | - Dominik Burger
- Swiss Institute of Equine Medicine ISME, Agroscope and University of Bern, Les Longs Prés, 1580 Avenches, Switzerland;
| | - Markus Neuditschko
- Animal GenoPhenomics, Agroscope, Route de la Tioleyre 4, 1725 Posieux, Switzerland;
| |
Collapse
|
4
|
Gileta AF, Gao J, Chitre AS, Bimschleger HV, St Pierre CL, Gopalakrishnan S, Palmer AA. Adapting Genotyping-by-Sequencing and Variant Calling for Heterogeneous Stock Rats. G3 (BETHESDA, MD.) 2020; 10:2195-2205. [PMID: 32398234 PMCID: PMC7341140 DOI: 10.1534/g3.120.401325] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/23/2019] [Accepted: 05/01/2020] [Indexed: 02/06/2023]
Abstract
The heterogeneous stock (HS) is an outbred rat population derived from eight inbred rat strains. HS rats are ideally suited for genome wide association studies; however, only a few genotyping microarrays have ever been designed for rats and none of them are currently in production. To address the need for an efficient and cost effective method of genotyping HS rats, we have adapted genotype-by-sequencing (GBS) to obtain genotype information at large numbers of single nucleotide polymorphisms (SNPs). In this paper, we have outlined the laboratory and computational steps we took to optimize double digest genotype-by-sequencing (ddGBS) for use in rats. We evaluated multiple existing computational tools and explain the workflow we have used to call and impute over 3.7 million SNPs. We have also compared various rat genetic maps, which are necessary for imputation, including a recently developed map specific to the HS. Using our approach, we obtained concordance rates of 99% with data obtained using data from a genotyping array. The principles and computational pipeline that we describe could easily be adapted for use in other species for which reliable reference genome sets are available.
Collapse
Affiliation(s)
- Alexander F Gileta
- Department of Psychiatry
- Institute for Genomic Medicine, University of California San Diego, La Jolla, California, 92093
| | | | | | | | | | - Shyam Gopalakrishnan
- Department of Human Genetics, University of Chicago, Chicago, Illinois, 60637, and
| | - Abraham A Palmer
- Department of Psychiatry,
- Natural History Museum of Denmark, University of Copenhagen, 2200 København N, Denmark
| |
Collapse
|
5
|
Abstract
High-quality genomic tools have been integral in understanding genomic architecture and function in the modern-day horse. The equine genetics community has a long tradition of pooling resources to develop genomic tools. Since the equine genome was sequenced in 2006, several iterations of high throughput genotyping arrays have been developed and released, enabling rapid and cost-effective genotyping. This review highlights the design considerations of each iteration, focusing on data available during development and outlining considerations in selecting the genetic variants included on each array. Additionally, we outline recent applications of equine genotyping arrays as well as future prospects and applications.
Collapse
Affiliation(s)
- Robert J Schaefer
- Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, 1365 Gortner Avenue, St Paul, MN 55108, USA.
| | - Molly E McCue
- Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, 1365 Gortner Avenue, St Paul, MN 55108, USA. https://twitter.com/Molly_McCue_DVM
| |
Collapse
|
6
|
Genome-wide association study: Understanding the genetic basis of the gait type in Brazilian Mangalarga Marchador horses, a preliminary study. Livest Sci 2020. [DOI: 10.1016/j.livsci.2019.103867] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
7
|
Toro Ospina AM, Maiorano AM, Curi RA, Pereira GL, Zerlotti-Mercadante ME, Dos Santos Gonçalves Cyrillo JN, Aspilcueta-Borquis RR, de V Silva JAI. Linkage disequilibrium and effective population size in Gir cattle selected for yearling weight. Reprod Domest Anim 2019; 54:1524-1531. [PMID: 31471991 DOI: 10.1111/rda.13559] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2019] [Revised: 08/12/2019] [Accepted: 08/22/2019] [Indexed: 12/27/2022]
Abstract
Linkage disequilibrium (LD) plays an important role in genomic selection and mapping of quantitative trait loci (QTL). This study investigated the pattern of LD and effective population size (Ne ) in Gir cattle selected for yearling weight. For this purpose, 173 animals with imputed genotypes (from 18 animals genotyped with the Illumina BovineHD BeadChip and 155 animals genotyped with the Bovine LDv4 panel) were analysed. The LD was evaluated at distances of 25-50 kb, 50-100 kb, 100-500 kb and 0.5-1 Mb. The Ne was estimated based on 5 past generations. The r2 values (a measure of LD) were, respectively, .35, .29, .18 and .032 for the distances evaluated. The LD estimates decreased with increasing distance of SNP pairs and LD persisted up to a distance of 100 kb (r2 = .29). The Ne was greater in generations 4 and 5 (24 and 30 animals, respectively) and declined drastically after the last generation (12 animals). The results showed high levels of LD and low Ne , which were probably due to the loss of genetic variability as a consequence of the structure of the Gir population studied.
Collapse
Affiliation(s)
- Alejandra M Toro Ospina
- Faculty of Agricultural and Veterinary Sciences, Genetic Animal Breeding, Sao Paulo State University, Jaboticabal, Brazil
| | - Amanda Marchi Maiorano
- Faculty of Agricultural and Veterinary Sciences, Genetic Animal Breeding, Sao Paulo State University, Jaboticabal, Brazil
| | - Rogério A Curi
- Department of Breeding and Animal Nutrition, Faculty of Veterinary Medicine and Animal Science, Sao Paulo State University, Botucatu, Brazil
| | - Guilherme L Pereira
- Department of Breeding and Animal Nutrition, Faculty of Veterinary Medicine and Animal Science, Sao Paulo State University, Botucatu, Brazil
| | | | | | | | - Josineudson A Ii de V Silva
- Department of Breeding and Animal Nutrition, Faculty of Veterinary Medicine and Animal Science, Sao Paulo State University, Botucatu, Brazil
| |
Collapse
|
8
|
Gmel AI, Druml T, von Niederhäusern R, Leeb T, Neuditschko M. Genome-Wide Association Studies Based on Equine Joint Angle Measurements Reveal New QTL Affecting the Conformation of Horses. Genes (Basel) 2019; 10:genes10050370. [PMID: 31091839 PMCID: PMC6562990 DOI: 10.3390/genes10050370] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Revised: 04/30/2019] [Accepted: 05/03/2019] [Indexed: 12/30/2022] Open
Abstract
The evaluation of conformation traits is an important part of selection for breeding stallions and mares. Some of these judged conformation traits involve joint angles that are associated with performance, health, and longevity. To improve our understanding of the genetic background of joint angles in horses, we have objectively measured the angles of the poll, elbow, carpal, fetlock (front and hind), hip, stifle, and hock joints based on one photograph of each of the 300 Franches-Montagnes (FM) and 224 Lipizzan (LIP) horses. After quality control, genome-wide association studies (GWASs) for these traits were performed on 495 horses, using 374,070 genome-wide single nucleotide polymorphisms (SNPs) in a mixed-effect model. We identified two significant quantitative trait loci (QTL) for the poll angle on ECA28 (p = 1.36 × 10−7), 50 kb downstream of the ALX1 gene, involved in cranial morphology, and for the elbow joint on ECA29 (p = 1.69 × 10−7), 49 kb downstream of the RSU1 gene, and 75 kb upstream of the PTER gene. Both genes are associated with bone mineral density in humans. Furthermore, we identified other suggestive QTL associated with the stifle joint on ECA8 (p = 3.10 × 10−7); the poll on ECA1 (p = 6.83 × 10−7); the fetlock joint of the hind limb on ECA27 (p = 5.42 × 10−7); and the carpal joint angle on ECA3 (p = 6.24 × 10−7), ECA4 (p = 6.07 × 10−7), and ECA7 (p = 8.83 × 10−7). The application of angular measurements in genetic studies may increase our understanding of the underlying genetic effects of important traits in equine breeding.
Collapse
Affiliation(s)
- Annik Imogen Gmel
- Agroscope-Swiss National Stud Farm, Les Longs-Prés, 1580 Avenches, Switzerland.
- Institute of Genetics, Vetsuisse Faculty, University of Bern, Bremgartenstrasse 109a, 3012 Bern, Switzerland.
| | - Thomas Druml
- Institute of Animal Breeding and Genetics, Veterinary University Vienna, Veterinärplatz 1, A-1210 Vienna, Austria.
| | | | - Tosso Leeb
- Institute of Genetics, Vetsuisse Faculty, University of Bern, Bremgartenstrasse 109a, 3012 Bern, Switzerland.
| | - Markus Neuditschko
- Agroscope-Swiss National Stud Farm, Les Longs-Prés, 1580 Avenches, Switzerland.
| |
Collapse
|
9
|
Pégard M, Rogier O, Bérard A, Faivre-Rampant P, Paslier MCL, Bastien C, Jorge V, Sánchez L. Sequence imputation from low density single nucleotide polymorphism panel in a black poplar breeding population. BMC Genomics 2019; 20:302. [PMID: 30999856 PMCID: PMC6471894 DOI: 10.1186/s12864-019-5660-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2018] [Accepted: 03/29/2019] [Indexed: 12/30/2022] Open
Abstract
Background Genomic selection accuracy increases with the use of high SNP (single nucleotide polymorphism) coverage. However, such gains in coverage come at high costs, preventing their prompt operational implementation by breeders. Low density panels imputed to higher densities offer a cheaper alternative during the first stages of genomic resources development. Our study is the first to explore the imputation in a tree species: black poplar. About 1000 pure-breed Populus nigra trees from a breeding population were selected and genotyped with a 12K custom Infinium Bead-Chip. Forty-three of those individuals corresponding to nodal trees in the pedigree were fully sequenced (reference), while the remaining majority (target) was imputed from 8K to 1.4 million SNPs using FImpute. Each SNP and individual was evaluated for imputation errors by leave-one-out cross validation in the training sample of 43 sequenced trees. Some summary statistics such as Hardy-Weinberg Equilibrium exact test p-value, quality of sequencing, depth of sequencing per site and per individual, minor allele frequency, marker density ratio or SNP information redundancy were calculated. Principal component and Boruta analyses were used on all these parameters to rank the factors affecting the quality of imputation. Additionally, we characterize the impact of the relatedness between reference population and target population. Results During the imputation process, we used 7540 SNPs from the chip to impute 1,438,827 SNPs from sequences. At the individual level, imputation accuracy was high with a proportion of SNPs correctly imputed between 0.84 and 0.99. The variation in accuracies was mostly due to differences in relatedness between individuals. At a SNP level, the imputation quality depended on genotyped SNP density and on the original minor allele frequency. The imputation did not appear to result in an increase of linkage disequilibrium. The genotype densification not only brought a better distribution of markers all along the genome, but also we did not detect any substantial bias in annotation categories. Conclusions This study shows that it is possible to impute low-density marker panels to whole genome sequence with good accuracy under certain conditions that could be common to many breeding populations. Electronic supplementary material The online version of this article (10.1186/s12864-019-5660-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Marie Pégard
- BioForA, INRA, ONF, 45075, Orléans, France, 2163 Avenue de la Pomme de Pin CS 40001 ARDON, Orléans Cedex 2, 45075, France
| | - Odile Rogier
- BioForA, INRA, ONF, 45075, Orléans, France, 2163 Avenue de la Pomme de Pin CS 40001 ARDON, Orléans Cedex 2, 45075, France
| | - Aurélie Bérard
- Etude du Polymorphisme des Génomes Végétaux (EPGV), INRA, Université Paris-Saclay, 91000, 2 rue Gaston Crémieux, Evry, 9100, France
| | - Patricia Faivre-Rampant
- Etude du Polymorphisme des Génomes Végétaux (EPGV), INRA, Université Paris-Saclay, 91000, 2 rue Gaston Crémieux, Evry, 9100, France
| | - Marie-Christine Le Paslier
- Etude du Polymorphisme des Génomes Végétaux (EPGV), INRA, Université Paris-Saclay, 91000, 2 rue Gaston Crémieux, Evry, 9100, France
| | - Catherine Bastien
- BioForA, INRA, ONF, 45075, Orléans, France, 2163 Avenue de la Pomme de Pin CS 40001 ARDON, Orléans Cedex 2, 45075, France
| | - Véronique Jorge
- BioForA, INRA, ONF, 45075, Orléans, France, 2163 Avenue de la Pomme de Pin CS 40001 ARDON, Orléans Cedex 2, 45075, France
| | - Leopoldo Sánchez
- BioForA, INRA, ONF, 45075, Orléans, France, 2163 Avenue de la Pomme de Pin CS 40001 ARDON, Orléans Cedex 2, 45075, France.
| |
Collapse
|
10
|
Felkel S, Vogl C, Rigler D, Dobretsberger V, Chowdhary BP, Distl O, Fries R, Jagannathan V, Janečka JE, Leeb T, Lindgren G, McCue M, Metzger J, Neuditschko M, Rattei T, Raudsepp T, Rieder S, Rubin CJ, Schaefer R, Schlötterer C, Thaller G, Tetens J, Velie B, Brem G, Wallner B. The horse Y chromosome as an informative marker for tracing sire lines. Sci Rep 2019; 9:6095. [PMID: 30988347 PMCID: PMC6465346 DOI: 10.1038/s41598-019-42640-w] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2018] [Accepted: 04/04/2019] [Indexed: 12/31/2022] Open
Abstract
Analysis of the Y chromosome is the best-established way to reconstruct paternal family history in humans. Here, we applied fine-scaled Y-chromosomal haplotyping in horses with biallelic markers and demonstrate the potential of our approach to address the ancestry of sire lines. We de novo assembled a draft reference of the male-specific region of the Y chromosome from Illumina short reads and then screened 5.8 million basepairs for variants in 130 specimens from intensively selected and rural breeds and nine Przewalski's horses. Among domestic horses we confirmed the predominance of a young'crown haplogroup' in Central European and North American breeds. Within the crown, we distinguished 58 haplotypes based on 211 variants, forming three major haplogroups. In addition to two previously characterised haplogroups, one observed in Arabian/Coldblooded and the other in Turkoman/Thoroughbred horses, we uncovered a third haplogroup containing Iberian lines and a North African Barb Horse. In a genealogical showcase, we distinguished the patrilines of the three English Thoroughbred founder stallions and resolved a historic controversy over the parentage of the horse 'Galopin', born in 1872. We observed two nearly instantaneous radiations in the history of Central and Northern European Y-chromosomal lineages that both occurred after domestication 5,500 years ago.
Collapse
Affiliation(s)
- Sabine Felkel
- Institute of Animal Breeding and Genetics, University of Veterinary Medicine Vienna, Vienna, 1210, Austria
- Vienna Graduate School of Population Genetics, Vienna, Austria
| | - Claus Vogl
- Institute of Animal Breeding and Genetics, University of Veterinary Medicine Vienna, Vienna, 1210, Austria
| | - Doris Rigler
- Institute of Animal Breeding and Genetics, University of Veterinary Medicine Vienna, Vienna, 1210, Austria
| | - Viktoria Dobretsberger
- Institute of Animal Breeding and Genetics, University of Veterinary Medicine Vienna, Vienna, 1210, Austria
| | | | - Ottmar Distl
- Institute for Animal Breeding and Genetics, University of Veterinary Medicine Hannover, Hannover, 30559, Germany
| | - Ruedi Fries
- Lehrstuhl fuer Tierzucht, Technische Universitaet Muenchen, Freising, 85354, Germany
| | - Vidhya Jagannathan
- Institute of Genetics, Vetsuisse Faculty, University of Bern, Bern, 3001, Switzerland
| | - Jan E Janečka
- Department of Biological Sciences, Duquesne University, Pittsburgh, 15282, USA
| | - Tosso Leeb
- Institute of Genetics, Vetsuisse Faculty, University of Bern, Bern, 3001, Switzerland
| | - Gabriella Lindgren
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Uppsala, 75007, Sweden
- Department of Biosystems, KU Leuven, Leuven, 3001, Belgium
| | - Molly McCue
- Veterinary Population Medicine Department, University of Minnesota, St. Paul, MN, 55108, USA
| | - Julia Metzger
- Institute for Animal Breeding and Genetics, University of Veterinary Medicine Hannover, Hannover, 30559, Germany
| | | | - Thomas Rattei
- Department of Microbiology and Ecosystem Science, Division of Computational Systems Biology, University of Vienna, Althanstrasse 14, 1090, Vienna, Austria
| | - Terje Raudsepp
- Department of Veterinary Integrative Biosciences, College of Veterinary Medicine and Biomedical Sciences, Texas A&M University, College Station, TX, 77843-4458, USA
| | - Stefan Rieder
- Agroscope, Swiss National Stud Farm, Avenches, 1580, Switzerland
| | - Carl-Johan Rubin
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, Uppsala, 75123, Sweden
| | - Robert Schaefer
- Agroscope, Swiss National Stud Farm, Avenches, 1580, Switzerland
| | - Christian Schlötterer
- Institut fuer Populationsgenetik, University of Veterinary Medicine Vienna, Vienna, 1210, Austria
| | - Georg Thaller
- Institute of Animal Breeding and Husbandry, University of Kiel, Kiel, 24098, Germany
| | - Jens Tetens
- Institute of Animal Breeding and Husbandry, University of Kiel, Kiel, 24098, Germany
- Functional Breeding Group, Department of Animal Sciences, Georg-August-University Göttingen, Göttingen, 37077, Germany
| | - Brandon Velie
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Uppsala, 75007, Sweden
- School of Life and Environmental Sciences, University of Sydney, Sydney, 2006, Australia
| | - Gottfried Brem
- Institute of Animal Breeding and Genetics, University of Veterinary Medicine Vienna, Vienna, 1210, Austria
| | - Barbara Wallner
- Institute of Animal Breeding and Genetics, University of Veterinary Medicine Vienna, Vienna, 1210, Austria.
| |
Collapse
|
11
|
Comparison of genotype imputation strategies using a combined reference panel for chicken population. Animal 2018; 13:1119-1126. [PMID: 30370890 DOI: 10.1017/s1751731118002860] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Using whole-genome sequence (WGS) data are supposed to be optimal for genome-wide association studies and genomic predictions. However, sequencing thousands of individuals of interest is expensive. Imputation from single nucleotide polymorphisms panels to WGS data is an attractive approach to obtain highly reliable WGS data at low cost. Here, we conducted a genotype imputation study with a combined reference panel in yellow-feather dwarf broiler population. The combined reference panel was assembled by sequencing 24 key individuals of a yellow-feather dwarf broiler population (internal reference panel) and WGS data from 311 chickens in public databases (external reference panel). Three scenarios were investigated to determine how different factors affect the accuracy of imputation from 600 K array data to WGS data, including: genotype imputation with internal, external and combined reference panels; the number of internal reference individuals in the combined reference panel; and different reference sizes and selection strategies of an external reference panel. Results showed that imputation accuracy from 600 K to WGS data were 0.834±0.012, 0.920±0.007 and 0.982±0.003 for the internal, external and combined reference panels, respectively. Increasing the reference size from 50 to 250 improved the accuracy of genotype imputation from 0.848 to 0.974 for the combined reference panel and from 0.647 to 0.917 for the external reference panel. The selection strategies for the external reference panel had no impact on the accuracy of imputation using the combined reference panel. However, if only an external reference panel with reference size >50 was used, the selection strategy of minimizing the average distance to the closest leaf had the greatest imputation accuracy compared with other methods. Generally, using a combined reference panel provided greater imputation accuracy, especially for low-frequency variants. In conclusion, the optimal imputation strategy with a combined reference panel should comprehensively consider genetic diversity of the study population, availability and properties of external reference panels, sequencing and computing costs, and frequency of imputed variants. This work sheds light on how to design and execute genotype imputation with a combined external reference panel in a livestock population.
Collapse
|
12
|
Chassier M, Barrey E, Robert C, Duluard A, Danvy S, Ricard A. Genotype imputation accuracy in multiple equine breeds from medium- to high-density genotypes. J Anim Breed Genet 2018; 135:420-431. [DOI: 10.1111/jbg.12358] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2018] [Revised: 08/17/2018] [Accepted: 08/24/2018] [Indexed: 01/27/2023]
Affiliation(s)
- Marjorie Chassier
- Unité Mixte de Recherche 1313 Génétique Animale et Biologie Intégrative; Département Sciences du Vivant; Institut National de la Recherche Agronomique; AgroParisTech; Université Paris Saclay; Jouy-en-Josas France
| | - Eric Barrey
- Unité Mixte de Recherche 1313 Génétique Animale et Biologie Intégrative; Département Sciences du Vivant; Institut National de la Recherche Agronomique; AgroParisTech; Université Paris Saclay; Jouy-en-Josas France
| | - Céline Robert
- Unité Mixte de Recherche 1313 Génétique Animale et Biologie Intégrative; Département Sciences du Vivant; Institut National de la Recherche Agronomique; AgroParisTech; Université Paris Saclay; Jouy-en-Josas France
- Ecole Nationale Vétérinaire d'Alfort; Maisons Alfort France
| | - Arnaud Duluard
- Département élevage et santé animale; Le Trot; Paris France
| | - Sophie Danvy
- Institut Français du Cheval et de l'Equitation; Pôle développement; Innovation et Recherche; Exmes France
| | - Anne Ricard
- Unité Mixte de Recherche 1313 Génétique Animale et Biologie Intégrative; Département Sciences du Vivant; Institut National de la Recherche Agronomique; AgroParisTech; Université Paris Saclay; Jouy-en-Josas France
- Institut Français du Cheval et de l'Equitation; Pôle développement; Innovation et Recherche; Exmes France
| |
Collapse
|
13
|
Ye S, Yuan X, Lin X, Gao N, Luo Y, Chen Z, Li J, Zhang X, Zhang Z. Imputation from SNP chip to sequence: a case study in a Chinese indigenous chicken population. J Anim Sci Biotechnol 2018; 9:30. [PMID: 29581880 PMCID: PMC5861640 DOI: 10.1186/s40104-018-0241-5] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2017] [Accepted: 01/26/2018] [Indexed: 11/24/2022] Open
Abstract
Background Genome-wide association studies and genomic predictions are thought to be optimized by using whole-genome sequence (WGS) data. However, sequencing thousands of individuals of interest is expensive. Imputation from SNP panels to WGS data is an attractive and less expensive approach to obtain WGS data. The aims of this study were to investigate the accuracy of imputation and to provide insight into the design and execution of genotype imputation. Results We genotyped 450 chickens with a 600 K SNP array, and sequenced 24 key individuals by whole genome re-sequencing. Accuracy of imputation from putative 60 K and 600 K array data to WGS data was 0.620 and 0.812 for Beagle, and 0.810 and 0.914 for FImpute, respectively. By increasing the sequencing cost from 24X to 144X, the imputation accuracy increased from 0.525 to 0.698 for Beagle and from 0.654 to 0.823 for FImpute. With fixed sequence depth (12X), increasing the number of sequenced animals from 1 to 24, improved accuracy from 0.421 to 0.897 for FImpute and from 0.396 to 0.777 for Beagle. Using optimally selected key individuals resulted in a higher imputation accuracy compared with using randomly selected individuals as a reference population for re-sequencing. With fixed reference population size (24), imputation accuracy increased from 0.654 to 0.875 for FImpute and from 0.512 to 0.762 for Beagle as the sequencing depth increased from 1X to 12X. With a given total cost of genotyping, accuracy increased with the size of the reference population for FImpute, but the pattern was not valid for Beagle, which showed the highest accuracy at six fold coverage for the scenarios used in this study. Conclusions In conclusion, we comprehensively investigated the impacts of several key factors on genotype imputation. Generally, increasing sequencing cost gave a higher imputation accuracy. But with a fixed sequencing cost, the optimal imputation enhance the performance of WGP and GWAS. An optimal imputation strategy should take size of reference population, imputation algorithms, marker density, and population structure of the target population and methods to select key individuals into consideration comprehensively. This work sheds additional light on how to design and execute genotype imputation for livestock populations. Electronic supplementary material The online version of this article (10.1186/s40104-018-0241-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Shaopan Ye
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong China
| | - Xiaolong Yuan
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong China
| | - Xiran Lin
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong China
| | - Ning Gao
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong China
| | - Yuanyu Luo
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong China
| | - Zanmou Chen
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong China
| | - Jiaqi Li
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong China
| | - Xiquan Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong China
| | - Zhe Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong China
| |
Collapse
|
14
|
Frischknecht M, Pausch H, Bapst B, Signer-Hasler H, Flury C, Garrick D, Stricker C, Fries R, Gredler-Grandl B. Highly accurate sequence imputation enables precise QTL mapping in Brown Swiss cattle. BMC Genomics 2017; 18:999. [PMID: 29284405 PMCID: PMC5747239 DOI: 10.1186/s12864-017-4390-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Accepted: 12/15/2017] [Indexed: 01/06/2023] Open
Abstract
Background Within the last few years a large amount of genomic information has become available in cattle. Densities of genomic information vary from a few thousand variants up to whole genome sequence information. In order to combine genomic information from different sources and infer genotypes for a common set of variants, genotype imputation is required. Results In this study we evaluated the accuracy of imputation from high density chips to whole genome sequence data in Brown Swiss cattle. Using four popular imputation programs (Beagle, FImpute, Impute2, Minimac) and various compositions of reference panels, the accuracy of the imputed sequence variant genotypes was high and differences between the programs and scenarios were small. We imputed sequence variant genotypes for more than 1600 Brown Swiss bulls and performed genome-wide association studies for milk fat percentage at two stages of lactation. We found one and three quantitative trait loci for early and late lactation fat content, respectively. Known causal variants that were imputed from the sequenced reference panel were among the most significantly associated variants of the genome-wide association study. Conclusions Our study demonstrates that whole-genome sequence information can be imputed at high accuracy in cattle populations. Using imputed sequence variant genotypes in genome-wide association studies may facilitate causal variant detection. Electronic supplementary material The online version of this article (10.1186/s12864-017-4390-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Mirjam Frischknecht
- Qualitas AG, Chamerstrasse 56a, 6300, Zug, Switzerland. .,Bern University of Applied Sciences, School of Agricultural, Forest and Food Sciences HAFL, Länggasse 85, 3052, Zollikofen, Switzerland.
| | - Hubert Pausch
- Chair of Animal Breeding, Technische Universität München, Liesel-Beckmann-Str. 1, 85354, Freising, Germany.,Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia.,ETH Zurich, Tannenstrasse 1, 8092, Zurich, Switzerland
| | - Beat Bapst
- Qualitas AG, Chamerstrasse 56a, 6300, Zug, Switzerland
| | - Heidi Signer-Hasler
- Bern University of Applied Sciences, School of Agricultural, Forest and Food Sciences HAFL, Länggasse 85, 3052, Zollikofen, Switzerland
| | - Christine Flury
- Bern University of Applied Sciences, School of Agricultural, Forest and Food Sciences HAFL, Länggasse 85, 3052, Zollikofen, Switzerland
| | - Dorian Garrick
- Institute of Veterinary, Animal & Biomedical Sciences, Massey University, 4442, Palmerston North, New Zealand
| | | | - Ruedi Fries
- Chair of Animal Breeding, Technische Universität München, Liesel-Beckmann-Str. 1, 85354, Freising, Germany
| | | |
Collapse
|
15
|
Druml T, Neuditschko M, Grilz-Seger G, Horna M, Ricard A, Mesarič M, Cotman M, Pausch H, Brem G. Population Networks Associated with Runs of Homozygosity Reveal New Insights into the Breeding History of the Haflinger Horse. J Hered 2017; 109:384-392. [DOI: 10.1093/jhered/esx114] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2017] [Accepted: 12/23/2017] [Indexed: 02/04/2023] Open
Affiliation(s)
- Thomas Druml
- Institute of Animal Breeding and Genetics, University of Veterinary Sciences Vienna, Vienna, Austria
| | | | | | - Michaela Horna
- Department of Animal Husbandry, Slovak University of Agriculture in Nitra, Nitra-Chrenová, Slovak Republic
| | - Anne Ricard
- Institut National de la Recherche Agronomique, UMR 1313 Génétique Animale et Biologie Intégrative, Jouy-en-Josas, France
- Institut Français du Cheval et de l’Equitation, Recherche et Innovation, Exmes, France
| | - Matjaz Mesarič
- Clinic for Reproduction and Large Animals, Veterinary Faculty, University of Lubljana, Cesta v Mestni log, Ljubljana, Slovenia
| | - Marco Cotman
- Institute of Preclinical Sciences, Veterinary Faculty, University of Ljubljana, Cesta v Mestni log, Ljubljana, Slovenia
| | | | - Gottfried Brem
- Institute of Animal Breeding and Genetics, University of Veterinary Sciences Vienna, Vienna, Austria
| |
Collapse
|
16
|
Schaefer RJ, Schubert M, Bailey E, Bannasch DL, Barrey E, Bar-Gal GK, Brem G, Brooks SA, Distl O, Fries R, Finno CJ, Gerber V, Haase B, Jagannathan V, Kalbfleisch T, Leeb T, Lindgren G, Lopes MS, Mach N, da Câmara Machado A, MacLeod JN, McCoy A, Metzger J, Penedo C, Polani S, Rieder S, Tammen I, Tetens J, Thaller G, Verini-Supplizi A, Wade CM, Wallner B, Orlando L, Mickelson JR, McCue ME. Developing a 670k genotyping array to tag ~2M SNPs across 24 horse breeds. BMC Genomics 2017; 18:565. [PMID: 28750625 PMCID: PMC5530493 DOI: 10.1186/s12864-017-3943-8] [Citation(s) in RCA: 84] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2017] [Accepted: 07/13/2017] [Indexed: 12/30/2022] Open
Abstract
Background To date, genome-scale analyses in the domestic horse have been limited by suboptimal single nucleotide polymorphism (SNP) density and uneven genomic coverage of the current SNP genotyping arrays. The recent availability of whole genome sequences has created the opportunity to develop a next generation, high-density equine SNP array. Results Using whole genome sequence from 153 individuals representing 24 distinct breeds collated by the equine genomics community, we cataloged over 23 million de novo discovered genetic variants. Leveraging genotype data from individuals with both whole genome sequence, and genotypes from lower-density, legacy SNP arrays, a subset of ~5 million high-quality, high-density array candidate SNPs were selected based on breed representation and uniform spacing across the genome. Considering probe design recommendations from a commercial vendor (Affymetrix, now Thermo Fisher Scientific) a set of ~2 million SNPs were selected for a next-generation high-density SNP chip (MNEc2M). Genotype data were generated using the MNEc2M array from a cohort of 332 horses from 20 breeds and a lower-density array, consisting of ~670 thousand SNPs (MNEc670k), was designed for genotype imputation. Conclusions Here, we document the steps taken to design both the MNEc2M and MNEc670k arrays, report genomic and technical properties of these genotyping platforms, and demonstrate the imputation capabilities of these tools for the domestic horse. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3943-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Robert J Schaefer
- Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, St. Paul, MN, USA
| | - Mikkel Schubert
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Ernest Bailey
- Maxwell H. Gluck Equine Research Center, Department of Veterinary Science, University of Kentucky, Lexington, KY, USA
| | - Danika L Bannasch
- School of Veterinary Medicine, University of California-Davis, Davis, CA, 95616, USA
| | - Eric Barrey
- Unité de Génétique Animale et Biologie Intégrative- UMR1313, INRA, Université Paris-Saclay, AgroParisTech, 78350, Jouy-en-Josas, France
| | - Gila Kahila Bar-Gal
- The Robert H. Smith Faculty of Agriculture, Food and Environment, The Koret School of Veterinary Medicine, The Hebrew University, 76100, Rehovot, Israel
| | - Gottfried Brem
- Institute of Animal Breeding and Genetics, Department of Biomedical Sciences, University of Veterinary Medicine Vienna, Vienna, Austria
| | - Samantha A Brooks
- Department of Animal Science, University of Florida, Gainesville, FL, USA
| | - Ottmar Distl
- Institute for Animal Breeding and Genetics, University of Veterinary Medicine, Hannover, Germany
| | - Ruedi Fries
- Lehrstuhl für Tierzucht der Technischen Universität München, Liesel-Beckmann-Strasse 1, 85354, Freising, Germany
| | - Carrie J Finno
- School of Veterinary Medicine, University of California-Davis, Davis, CA, 95616, USA
| | - Vinzenz Gerber
- Swiss Institute of Equine Medicine, Department of Clinical Veterinary Medicine, Vetsuisse Faculty, University of Bern, and Agroscope, Länggassstrasse 124, 3001, Bern, Switzerland
| | - Bianca Haase
- School of Life and Environmental Sciences, Faculty of Veterinary Science, University of Sydney, Regimental Drive, B19-301 RMC Gunn, Sydney, NSW, 2006, Australia
| | | | - Ted Kalbfleisch
- Department of Biochemistry and Molecular Biology, School of Medicine, University of Louisville, Louisville, KY, 40202, USA
| | - Tosso Leeb
- Institute of Genetics, University of Bern, 3001, Bern, Switzerland
| | - Gabriella Lindgren
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Maria Susana Lopes
- Biotechnology Centre of Azores, University of Azores, Angra do heroísmo, Portugal
| | - Núria Mach
- Unité de Génétique Animale et Biologie Intégrative- UMR1313, INRA, Université Paris-Saclay, AgroParisTech, 78350, Jouy-en-Josas, France
| | | | - James N MacLeod
- Maxwell H. Gluck Equine Research Center, Department of Veterinary Science, University of Kentucky, Lexington, KY, USA
| | - Annette McCoy
- Department of Veterinary Clinical Medicine, College of Veterinary Medicine, University of Illinois at Urbana-Champaign, Champaign, IL, 61802, USA
| | - Julia Metzger
- Institute for Animal Breeding and Genetics, University of Veterinary Medicine, Hannover, Germany
| | - Cecilia Penedo
- Veterinary Genetics Laboratory, University of California Davis, Davis, CA, USA
| | - Sagi Polani
- The Robert H. Smith Faculty of Agriculture, Food and Environment, The Koret School of Veterinary Medicine, The Hebrew University, 76100, Rehovot, Israel
| | - Stefan Rieder
- Agroscope, Swiss National Stud Farm, 1580, Avenches, Switzerland
| | - Imke Tammen
- School of Life and Environmental Sciences, Faculty of Veterinary Science, University of Sydney, Regimental Drive, B19-301 RMC Gunn, Sydney, NSW, 2006, Australia
| | - Jens Tetens
- Institute of Animal Breeding and Husbandry, Christian-Albrechts-University Kiel, Hermann-Rodewald-Strasse 6, 24098, Kiel, Germany.,Department of Animal Sciences, Functional Breeding Group, Georg-August University Göttingen, Burckhardtweg 2, 37077, Göttingen, Germany
| | - Georg Thaller
- Institute of Animal Breeding and Husbandry, Christian-Albrechts-University Kiel, Hermann-Rodewald-Strasse 6, 24098, Kiel, Germany
| | - Andrea Verini-Supplizi
- Department of Veterinary Medicine - Sport Horse Research Centre, University of Perugia, Perugia, Italy
| | - Claire M Wade
- School of Life and Environmental Sciences, Faculty of Veterinary Science, University of Sydney, Regimental Drive, B19-301 RMC Gunn, Sydney, NSW, 2006, Australia
| | - Barbara Wallner
- Institute of Animal Breeding and Genetics, Department of Biomedical Sciences, University of Veterinary Medicine Vienna, Vienna, Austria
| | - Ludovic Orlando
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark.,Laboratoire d'Anthropobiologie Moléculaire et d'Imagerie de Synthèse, CNRS UMR 5288, Université de Toulouse, Université Paul Sabatier, 31000, Toulouse, France
| | - James R Mickelson
- Department of Veterinary and Biomedical Sciences, College of Veterinary Medicine, University of Minnesota, St. Paul, MN, USA
| | - Molly E McCue
- Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, St. Paul, MN, USA.
| |
Collapse
|
17
|
Neuditschko M, Raadsma HW, Khatkar MS, Jonas E, Steinig EJ, Flury C, Signer-Hasler H, Frischknecht M, von Niederhäusern R, Leeb T, Rieder S. Identification of key contributors in complex population structures. PLoS One 2017; 12:e0177638. [PMID: 28520805 PMCID: PMC5433729 DOI: 10.1371/journal.pone.0177638] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2016] [Accepted: 05/01/2017] [Indexed: 11/18/2022] Open
Abstract
Evaluating the genetic contribution of individuals to population structure is essential to select informative individuals for genome sequencing, genotype imputation and to ascertain complex population structures. Existing methods for the selection of informative individuals for genomic imputation solely focus on the identification of key ancestors, which can lead to a loss of phasing accuracy of the reference population. Currently many methods are independently applied to investigate complex population structures. Based on the Eigenvalue Decomposition (EVD) of a genomic relationship matrix we describe a novel approach to evaluate the genetic contribution of individuals to population structure. We combined the identification of key contributors with model-based clustering and population network visualization into an integrated three-step approach, which allows identification of high-resolution population structures and substructures around such key contributors. The approach was applied and validated in four disparate datasets including a simulated population (5,100 individuals and 10,000 SNPs), a highly structured experimental sheep population (1,421 individuals and 44,693 SNPs) and two large complex pedigree populations namely horse (1,077 individuals and 38,124 SNPs) and cattle (2,457 individuals and 45,765 SNPs). In the simulated and experimental sheep dataset, our method, which is unsupervised, successfully identified all known key contributors. Applying our three-step approach to the horse and cattle populations, we observed high-resolution population substructures including the absence of obvious important key contributors. Furthermore, we show that compared to commonly applied strategies to select informative individuals for genotype imputation including the computation of marginal gene contributions (Pedig) and the optimization of genetic relatedness (Rel), the selection of key contributors provided the highest phasing accuracies within the selected reference populations. The presented approach opens new perspectives in the characterization and informed management of populations in general, and in areas such as conservation genetics and selective animal breeding in particular, where assessing the genetic contribution of influential and admixed individuals is crucial for research and management applications. As such, this method provides a valuable complement to common applied tools to visualize complex population structures and to select individuals for re-sequencing.
Collapse
Affiliation(s)
- Markus Neuditschko
- Agroscope, Swiss National Stud Farm, Avenches, Switzerland
- Reprogen – Animal Bioscience Group, Faculty of Veterinary Science, University of Sydney, Camden, Australia
- * E-mail:
| | - Herman W. Raadsma
- Reprogen – Animal Bioscience Group, Faculty of Veterinary Science, University of Sydney, Camden, Australia
| | - Mehar S. Khatkar
- Reprogen – Animal Bioscience Group, Faculty of Veterinary Science, University of Sydney, Camden, Australia
| | - Elisabeth Jonas
- Reprogen – Animal Bioscience Group, Faculty of Veterinary Science, University of Sydney, Camden, Australia
- SLU, Department of Animal Breeding and Genetics, Uppsala, Sweden
| | - Eike J. Steinig
- College of Marine and Environmental Sciences, James Cook University, Townsville, Australia
| | - Christine Flury
- School of Agricultural Forest and Food Sciences, Bern University of Applied Sciences, Zollikofen, Switzerland
| | - Heidi Signer-Hasler
- School of Agricultural Forest and Food Sciences, Bern University of Applied Sciences, Zollikofen, Switzerland
| | - Mirjam Frischknecht
- Agroscope, Swiss National Stud Farm, Avenches, Switzerland
- Institute of Genetics, Vetsuisse Faculty, University of Bern, Bern, Switzerland
| | | | - Tosso Leeb
- Institute of Genetics, Vetsuisse Faculty, University of Bern, Bern, Switzerland
| | - Stefan Rieder
- Agroscope, Swiss National Stud Farm, Avenches, Switzerland
| |
Collapse
|
18
|
Pausch H, MacLeod IM, Fries R, Emmerling R, Bowman PJ, Daetwyler HD, Goddard ME. Evaluation of the accuracy of imputed sequence variant genotypes and their utility for causal variant detection in cattle. Genet Sel Evol 2017; 49:24. [PMID: 28222685 PMCID: PMC5320806 DOI: 10.1186/s12711-017-0301-x] [Citation(s) in RCA: 71] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2016] [Accepted: 02/14/2017] [Indexed: 12/11/2022] Open
Abstract
Background The availability of dense genotypes and whole-genome sequence variants from various sources offers the opportunity to compile large datasets consisting of tens of thousands of individuals with genotypes at millions of polymorphic sites that may enhance the power of genomic analyses. The imputation of missing genotypes ensures that all individuals have genotypes for a shared set of variants. Results We evaluated the accuracy of imputation from dense genotypes to whole-genome sequence variants in 249 Fleckvieh and 450 Holstein cattle using Minimac and FImpute. The sequence variants of a subset of the animals were reduced to the variants that were included on the Illumina BovineHD genotyping array and subsequently inferred in silico using either within- or multi-breed reference populations. The accuracy of imputation varied considerably across chromosomes and dropped at regions where the bovine genome contains segmental duplications. Depending on the imputation strategy, the correlation between imputed and true genotypes ranged from 0.898 to 0.952. The accuracy of imputation was higher with Minimac than FImpute particularly for variants with a low minor allele frequency. Using a multi-breed reference population increased the accuracy of imputation, particularly when FImpute was used to infer genotypes. When the sequence variants were imputed using Minimac, the true genotypes were more correlated to predicted allele dosages than best-guess genotypes. The computing costs to impute 23,256,743 sequence variants in 6958 animals were ten-fold higher with Minimac than FImpute. Association studies with imputed sequence variants revealed seven quantitative trait loci (QTL) for milk fat percentage. Two causal mutations in the DGAT1 and GHR genes were the most significantly associated variants at two QTL on chromosomes 14 and 20 when Minimac was used to infer genotypes. Conclusions The population-based imputation of millions of sequence variants in large cohorts is computationally feasible and provides accurate genotypes. However, the accuracy of imputation is low in regions where the genome contains large segmental duplications or the coverage with array-derived single nucleotide polymorphisms is poor. Using a reference population that includes individuals from many breeds increases the accuracy of imputation particularly at low-frequency variants. Considering allele dosages rather than best-guess genotypes as explanatory variables is advantageous to detect causal mutations in association studies with imputed sequence variants. Electronic supplementary material The online version of this article (doi:10.1186/s12711-017-0301-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Hubert Pausch
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, 3083, Australia.
| | - Iona M MacLeod
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, 3083, Australia
| | - Ruedi Fries
- Chair of Animal Breeding, Technische Universitaet Muenchen, 85354, Freising, Germany
| | - Reiner Emmerling
- Institute of Animal Breeding, Bavarian State Research Center for Agriculture, 85586, Grub, Germany
| | - Phil J Bowman
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, 3083, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3083, Australia
| | - Hans D Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, 3083, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3083, Australia
| | - Michael E Goddard
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, 3083, Australia.,Faculty of Veterinary and Agricultural Sciences, University of Melbourne, Melbourne, VIC, 3010, Australia
| |
Collapse
|
19
|
Brinkmann J, Jagannathan V, Drögemüller C, Rieder S, Leeb T, Thaller G, Tetens J. Genetic variability of the equine casein genes. J Dairy Sci 2016; 99:5486-5497. [DOI: 10.3168/jds.2015-10652] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2015] [Accepted: 03/09/2016] [Indexed: 12/23/2022]
|
20
|
Brinkmann J, Jagannathan V, Drögemüller C, Rieder S, Leeb T, Thaller G, Tetens J. DNA-based analysis of protein variants reveals different genetic variability of the paralogous equine ß-lactoglobulin genes LGB1 and LGB2. Livest Sci 2016. [DOI: 10.1016/j.livsci.2016.03.014] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
|
21
|
Friedenberg SG, Meurs KM. Genotype imputation in the domestic dog. Mamm Genome 2016; 27:485-94. [PMID: 27129452 DOI: 10.1007/s00335-016-9636-9] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2015] [Accepted: 04/11/2016] [Indexed: 01/08/2023]
Abstract
Application of imputation methods to accurately predict a dense array of SNP genotypes in the dog could provide an important supplement to current analyses of array-based genotyping data. Here, we developed a reference panel of 4,885,283 SNPs in 83 dogs across 15 breeds using whole genome sequencing. We used this panel to predict the genotypes of 268 dogs across three breeds with 84,193 SNP array-derived genotypes as inputs. We then (1) performed breed clustering of the actual and imputed data; (2) evaluated several reference panel breed combinations to determine an optimal reference panel composition; and (3) compared the accuracy of two commonly used software algorithms (Beagle and IMPUTE2). Breed clustering was well preserved in the imputation process across eigenvalues representing 75 % of the variation in the imputed data. Using Beagle with a target panel from a single breed, genotype concordance was highest using a multi-breed reference panel (92.4 %) compared to a breed-specific reference panel (87.0 %) or a reference panel containing no breeds overlapping with the target panel (74.9 %). This finding was confirmed using target panels derived from two other breeds. Additionally, using the multi-breed reference panel, genotype concordance was slightly higher with IMPUTE2 (94.1 %) compared to Beagle; Pearson correlation coefficients were slightly higher for both software packages (0.946 for Beagle, 0.961 for IMPUTE2). Our findings demonstrate that genotype imputation from SNP array-derived data to whole genome-level genotypes is both feasible and accurate in the dog with appropriate breed overlap between the target and reference panels.
Collapse
Affiliation(s)
- S G Friedenberg
- Department of Clinical Sciences and Comparative Medicine Institute, North Carolina State University College of Veterinary Medicine, 1060 William Moore Drive, Raleigh, NC, 27607, USA.
| | - K M Meurs
- Department of Clinical Sciences and Comparative Medicine Institute, North Carolina State University College of Veterinary Medicine, 1060 William Moore Drive, Raleigh, NC, 27607, USA
| |
Collapse
|
22
|
Frischknecht M, Signer-Hasler H, Leeb T, Rieder S, Neuditschko M. Genome-wide association studies based on sequence-derived genotypes reveal new QTL associated with conformation and performance traits in the Franches-Montagnes horse breed. Anim Genet 2016; 47:227-9. [DOI: 10.1111/age.12406] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/13/2015] [Indexed: 12/01/2022]
Affiliation(s)
- M. Frischknecht
- Agroscope; Swiss National Stud Farm; Les Longs-Prés 1580 Avenches Switzerland
- Institute of Genetics; Vetsuisse Faculty; University of Bern; Bremgartenstrasse 109a 3012 Bern Switzerland
- Swiss Competence Center of Animal Breeding and Genetics; University of Bern; Bern University of Applied Sciences HAFL & Agroscope; Bremgartenstrasse 109a 3001 Bern Switzerland
| | - H. Signer-Hasler
- Swiss Competence Center of Animal Breeding and Genetics; University of Bern; Bern University of Applied Sciences HAFL & Agroscope; Bremgartenstrasse 109a 3001 Bern Switzerland
- Bern University of Applied Sciences; School of Agricultural; Forest and Food Sciences; Länggasse 85 3052 Zollikofen Switzerland
| | - T. Leeb
- Agroscope; Swiss National Stud Farm; Les Longs-Prés 1580 Avenches Switzerland
- Swiss Competence Center of Animal Breeding and Genetics; University of Bern; Bern University of Applied Sciences HAFL & Agroscope; Bremgartenstrasse 109a 3001 Bern Switzerland
| | - S. Rieder
- Institute of Genetics; Vetsuisse Faculty; University of Bern; Bremgartenstrasse 109a 3012 Bern Switzerland
- Swiss Competence Center of Animal Breeding and Genetics; University of Bern; Bern University of Applied Sciences HAFL & Agroscope; Bremgartenstrasse 109a 3001 Bern Switzerland
| | - M. Neuditschko
- Institute of Genetics; Vetsuisse Faculty; University of Bern; Bremgartenstrasse 109a 3012 Bern Switzerland
- Swiss Competence Center of Animal Breeding and Genetics; University of Bern; Bern University of Applied Sciences HAFL & Agroscope; Bremgartenstrasse 109a 3001 Bern Switzerland
| |
Collapse
|
23
|
Nicolazzi EL, Biffani S, Biscarini F, Orozco Ter Wengel P, Caprera A, Nazzicari N, Stella A. Software solutions for the livestock genomics SNP array revolution. Anim Genet 2015; 46:343-53. [PMID: 25907889 DOI: 10.1111/age.12295] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/07/2015] [Indexed: 02/04/2023]
Abstract
Since the beginning of the genomic era, the number of available single nucleotide polymorphism (SNP) arrays has grown considerably. In the bovine species alone, 11 SNP chips not completely covered by intellectual property are currently available, and the number is growing. Genomic/genotype data are not standardized, and this hampers its exchange and integration. In addition, software used for the analyses of these data usually requires not standard (i.e. case specific) input files which, considering the large amount of data to be handled, require at least some programming skills in their production. In this work, we describe a software toolkit for SNP array data management, imputation, genome-wide association studies, population genetics and genomic selection. However, this toolkit does not solve the critical need for standardization of the genotypic data and software input files. It only highlights the chaotic situation each researcher has to face on a daily basis and gives some helpful advice on the currently available tools in order to navigate the SNP array data complexity.
Collapse
Affiliation(s)
- E L Nicolazzi
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - S Biffani
- Istituto di biologia e biotecnologia Agraria (IBBA-CNR), Consiglio Nazionale delle Ricerche, Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - F Biscarini
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - P Orozco Ter Wengel
- School of Biosciences, Cardiff University, Museum Avenue, Cardiff, CF10 3AX, UK
| | - A Caprera
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - N Nazzicari
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - A Stella
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy.,Istituto di biologia e biotecnologia Agraria (IBBA-CNR), Consiglio Nazionale delle Ricerche, Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| |
Collapse
|