1
|
Bredemeyer KR, vonHoldt BM, Foley NM, Childers I, Brzeski KE, Murphy WJ. The value of hybrid genomes: Building two highly contiguous reference genome assemblies to advance Canis genomic studies. J Hered 2024:esae013. [PMID: 38416051 DOI: 10.1093/jhered/esae013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Indexed: 02/29/2024] Open
Abstract
Previous studies of canid population and evolutionary genetics have relied on high-quality domestic dog reference genomes that have been produced primarily for biomedical and trait mapping studies in dog breeds. However, the absence of highly contiguous genomes from other Canis species like the gray wolf and coyote, that represent additional distinct demographic histories, may bias inferences regarding inter-specific genetic diversity and phylogenetic relationships. Here, we present single haplotype de novo genome assemblies for the gray wolf and coyote, generated by applying the trio-binning approach to long sequence reads generated from the genome of a female first-generation hybrid produced from a gray wolf and coyote mating. The assemblies were highly contiguous, with contig N50 sizes of 44.6 Mb and 42.0 Mb for the wolf and coyote, respectively. Genome scaffolding and alignments between the two Canis assemblies and published dog reference genomes showed near complete collinearity, with one exception: a coyote-specific chromosome fission of chromosome 13 and fusion of the proximal portion of that chromosome with chromosome 8, retaining the Canis-typical haploid chromosome number of 2n=78. We evaluated mapping quality for previous RAD-seq data from 334 canids and found nearly identical mapping quality and patterns among canid species and regional populations regardless of the genome used for alignment (dog, coyote, or gray wolf). These novel wolf and coyote genome reference assemblies will be important resources for proper and accurate inference of Canis demography, taxonomic evaluation, and conservation genetics.
Collapse
Affiliation(s)
- Kevin R Bredemeyer
- Veterinary Integrative Biosciences, Texas A&M University, College Station, TX
- Interdisciplinary Program in Genetics & Genomics, Texas A&M University, College Station, TX
| | | | - Nicole M Foley
- Veterinary Integrative Biosciences, Texas A&M University, College Station, TX
| | - Isabella Childers
- Veterinary Integrative Biosciences, Texas A&M University, College Station, TX
- Interdisciplinary Program in Genetics & Genomics, Texas A&M University, College Station, TX
| | - Kristin E Brzeski
- College of Forest Resources and Environment Science, Michigan Technological University, Houghton, MI
| | - William J Murphy
- Veterinary Integrative Biosciences, Texas A&M University, College Station, TX
- Interdisciplinary Program in Genetics & Genomics, Texas A&M University, College Station, TX
| |
Collapse
|
2
|
Delorean EE, Youngblood RC, Simpson SA, Schoonmaker AN, Scheffler BE, Rutter WB, Hulse-Kemp AM. Representing true plant genomes: haplotype-resolved hybrid pepper genome with trio-binning. Front Plant Sci 2023; 14:1184112. [PMID: 38034563 PMCID: PMC10687446 DOI: 10.3389/fpls.2023.1184112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Accepted: 10/17/2023] [Indexed: 12/02/2023]
Abstract
As sequencing costs decrease and availability of high fidelity long-read sequencing increases, generating experiment specific de novo genome assemblies becomes feasible. In many crop species, obtaining the genome of a hybrid or heterozygous individual is necessary for systems that do not tolerate inbreeding or for investigating important biological questions, such as hybrid vigor. However, most genome assembly methods that have been used in plants result in a merged single sequence representation that is not a true biologically accurate representation of either haplotype within a diploid individual. The resulting genome assembly is often fragmented and exhibits a mosaic of the two haplotypes, referred to as haplotype-switching. Important haplotype level information, such as causal mutations and structural variation is therefore lost causing difficulties in interpreting downstream analyses. To overcome this challenge, we have applied a method developed for animal genome assembly called trio-binning to an intra-specific hybrid of chili pepper (Capsicum annuum L. cv. HDA149 x Capsicum annuum L. cv. HDA330). We tested all currently available softwares for performing trio-binning, combined with multiple scaffolding technologies including Bionano to determine the optimal method of producing the best haplotype-resolved assembly. Ultimately, we produced highly contiguous biologically true haplotype-resolved genome assemblies for each parent, with scaffold N50s of 266.0 Mb and 281.3 Mb, with 99.6% and 99.8% positioned into chromosomes respectively. The assemblies captured 3.10 Gb and 3.12 Gb of the estimated 3.5 Gb chili pepper genome size. These assemblies represent the complete genome structure of the intraspecific hybrid, as well as the two parental genomes, and show measurable improvements over the currently available reference genomes. Our manuscript provides a valuable guide on how to apply trio-binning to other plant genomes.
Collapse
Affiliation(s)
- Emily E. Delorean
- Genomics and Bioinformatics Research Unit, USDA-ARS, Raleigh, NC, United States
- Crop and Soil Sciences Department, North Carolina State University, Raleigh, NC, United States
| | - Ramey C. Youngblood
- Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University, Starkville, MS, United States
| | - Sheron A. Simpson
- Genomics and Bioinformatics Research Unit, United States Department of Agriculture - Agriculture Research Service (USDA-ARS), Stoneville, MS, United States
| | - Ashley N. Schoonmaker
- Crop and Soil Sciences Department, North Carolina State University, Raleigh, NC, United States
| | - Brian E. Scheffler
- Genomics and Bioinformatics Research Unit, United States Department of Agriculture - Agriculture Research Service (USDA-ARS), Stoneville, MS, United States
| | - William B. Rutter
- US Vegetable Laboratory, United States Department of Agriculture - Agriculture Research Service (USDA-ARS), Charleston, SC, United States
| | - Amanda M. Hulse-Kemp
- Genomics and Bioinformatics Research Unit, USDA-ARS, Raleigh, NC, United States
- Crop and Soil Sciences Department, North Carolina State University, Raleigh, NC, United States
| |
Collapse
|
3
|
Cohen ZP, Schoville SD, Hawthorne DJ. The role of structural variants in pest adaptation and genome evolution of the Colorado potato beetle, Leptinotarsa decemlineata (Say). Mol Ecol 2023; 32:1425-1440. [PMID: 36591939 DOI: 10.1111/mec.16838] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 11/30/2022] [Accepted: 12/15/2022] [Indexed: 01/03/2023]
Abstract
Structural variation has been associated with genetic diversity and adaptation. Despite these observations, it is not clear what their relative importance is for evolution, especially in rapidly adapting species. Here, we examine the significance of structural polymorphisms in pesticide resistance evolution of the agricultural super-pest, the Colorado potato beetle, Leptinotarsa decemlineata. By employing a parent offspring trio sequencing procedure, we develop highly contiguous reference genomes to characterize structural variation. These updated assemblies represent >100-fold improvement of contiguity and include derived pest and ancestral nonpest individuals. We identify >200,000 structural variations, which appear to be nonrandomly distributed across the genome as they co-occur with transposable elements and genes. Structural variations intersect with exons in a large proportion of gene annotations (~20%) that are associated with insecticide resistance (including cytochrome P450s), development, and transcription. To understand the role structural variations play in adaptation, we measure their allele frequencies among an additional 57 individuals using whole genome resequencing data, which represents pest and nonpest populations of North America. Incorporating multiple independent tests to detect the signature of natural selection using SNP data, we identify 14 genes that are probably under positive selection, include structural variations, and SNPs of elevated frequency within the pest lineages. Among these, three are associated with insecticide resistance based on previous research. One of these genes, CYP4g15, is coinduced during insecticide exposure with glycosyltransferase-13, which is a duplicated gene enclosed within a structural variant adjacent to the CYP4g15 genic region. These results demonstrate the significance of structural variations as a genomic feature to describe species history, genetic diversity, and adaptation.
Collapse
Affiliation(s)
- Zachary P Cohen
- Department of Entomology, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Sean D Schoville
- Department of Entomology, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | | |
Collapse
|
4
|
Barros CP, Derks MFL, Mohr J, Wood BJ, Crooijmans RPMA, Megens HJ, Bink MCAM, Groenen MAM. A new haplotype-resolved turkey genome to enable turkey genetics and genomics research. Gigascience 2022; 12:giad051. [PMID: 37489751 PMCID: PMC10360393 DOI: 10.1093/gigascience/giad051] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Revised: 12/12/2022] [Accepted: 06/27/2023] [Indexed: 07/26/2023] Open
Abstract
BACKGROUND The domesticated turkey (Meleagris gallopavo) is a species of significant agricultural importance and is the second largest contributor, behind broiler chickens, to world poultry meat production. The previous genome is of draft quality and partly based on the chicken (Gallus gallus) genome. A high-quality reference genome of M. gallopavo is essential for turkey genomics and genetics research and the breeding industry. RESULTS By adopting the trio-binning approach, we were able to assemble a high-quality chromosome-level F1 assembly and 2 parental haplotype assemblies, leveraging long-read technologies and genome-wide chromatin interaction data (Hi-C). From a total of 40 chromosomes (2n = 80), we captured 35 chromosomes in a single scaffold, showing much improved genome completeness and continuity compared to the old assembly build. The 3 assemblies are of higher quality than the previous draft quality assembly and comparable to the chicken assemblies (GRCg7) shown by the largest contig N50 (26.6 Mb) and comparable BUSCO gene set completeness scores (96-97%). Comparative analyses confirm a previously identified large inversion of around 19 Mbp on the Z chromosome not found in other Galliformes. Structural variation between the parent haplotypes was identified, which poses potential new target genes for breeding. CONCLUSIONS We contribute a new high-quality turkey genome at the chromosome level, benefiting turkey genetics and other avian genomics research as well as the turkey breeding industry.
Collapse
Affiliation(s)
- Carolina P Barros
- Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | - Martijn F L Derks
- Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | - Jeff Mohr
- Hybrid Turkeys, 650 Riverbend Drive Suite C, Kitchener, ON N2K 3S2, Canada
| | - Benjamin J Wood
- Hybrid Turkeys, 650 Riverbend Drive Suite C, Kitchener, ON N2K 3S2, Canada
- School of Veterinary Science, University of Queensland, Gatton, QLD 4343, Australia
| | | | - Hendrik-Jan Megens
- Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | - Marco C A M Bink
- Hendrix Genetics Research, Technology & Services, Boxmeer, AC 5830, The Netherlands
| | - Martien A M Groenen
- Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| |
Collapse
|
5
|
Qi H, Cong R, Wang Y, Li L, Zhang G. Construction and analysis of the chromosome-level haplotype-resolved genomes of two Crassostrea oyster congeners: Crassostrea angulata and Crassostrea gigas. Gigascience 2022; 12:giad077. [PMID: 37787064 PMCID: PMC10546077 DOI: 10.1093/gigascience/giad077] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 07/24/2023] [Accepted: 08/30/2023] [Indexed: 10/04/2023] Open
Abstract
BACKGROUND The Portuguese oyster Crassostrea angulata and the Pacific oyster C. gigas are two major Crassostrea species that are naturally distributed along the Northwest Pacific coast and possess great ecological and economic value. Here, we report the construction and comparative analysis of the chromosome-level haplotype-resolved genomes of the two oyster congeners. FINDINGS Based on a trio-binning strategy, the PacBio high-fidelity and Illumina Hi-C reads of the offspring of the hybrid cross C. angulata (♂) × C. gigas (♀) were partitioned and independently assembled to construct two chromosome-level fully phased genomes. The assembly size (contig N50 size, BUSCO completeness) of the two genomes were 582.4 M (12.8 M, 99.1%) and 606.4 M (5.46 M, 98.9%) for C. angulata and C. gigas, respectively, ranking at the top of mollusk genomes with high contiguity and integrity. The general features of the two genomes were highly similar, and 15,475 highly conserved ortholog gene pairs shared identical gene structures and similar genomic locations. Highly similar sequences can be primarily identified in the coding regions, whereas most noncoding regions and introns of genes in the same ortholog group contain substantial small genomic and/or structural variations. Based on population resequencing analysis, a total of 2,756 species-specific single-nucleotide polymorphisms and 1,088 genes possibly under selection were identified. CONCLUSIONS This is the first report of trio-binned fully phased chromosome-level genomes in marine invertebrates. The study provides fundamental resources for the research on mollusk genetics, comparative genomics, and molecular evolution.
Collapse
Affiliation(s)
- Haigang Qi
- CAS and Shandong Province Key Laboratory of Experimental Marine Biology, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China
- Laboratory for Marine Biology and Biotechnology, Laoshan Laboratory, Qingdao 266237, China
- National and Local Joint Engineering Key Laboratory of Ecological Mariculture, Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China
- Shandong Technology Innovation Center of Oyster Seed Industry, Qingdao 266105, China
| | - Rihao Cong
- CAS and Shandong Province Key Laboratory of Experimental Marine Biology, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China
- Shandong Technology Innovation Center of Oyster Seed Industry, Qingdao 266105, China
- Key Laboratory of Breeding Biotechnology and Sustainable Aquaculture, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
- The Innovation of Seed Design, Chinese Academy of Sciences, Wuhan 430072, China
| | - Yanjun Wang
- Marine Science Data Center, Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China
| | - Li Li
- CAS and Shandong Province Key Laboratory of Experimental Marine Biology, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China
- Shandong Technology Innovation Center of Oyster Seed Industry, Qingdao 266105, China
- Key Laboratory of Breeding Biotechnology and Sustainable Aquaculture, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
- The Innovation of Seed Design, Chinese Academy of Sciences, Wuhan 430072, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Guofan Zhang
- CAS and Shandong Province Key Laboratory of Experimental Marine Biology, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China
- Laboratory for Marine Biology and Biotechnology, Laoshan Laboratory, Qingdao 266237, China
- National and Local Joint Engineering Key Laboratory of Ecological Mariculture, Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China
- Shandong Technology Innovation Center of Oyster Seed Industry, Qingdao 266105, China
| |
Collapse
|
6
|
Lötter A, Duong TA, Candotti J, Mizrachi E, Wegrzyn JL, Myburg AA. Haplogenome assembly reveals structural variation in Eucalyptus interspecific hybrids. Gigascience 2022; 12:giad064. [PMID: 37632754 PMCID: PMC10460159 DOI: 10.1093/gigascience/giad064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Revised: 02/15/2023] [Accepted: 07/27/2023] [Indexed: 08/28/2023] Open
Abstract
BACKGROUND De novo phased (haplo)genome assembly using long-read DNA sequencing data has improved the detection and characterization of structural variants (SVs) in plant and animal genomes. Able to span across haplotypes, long reads allow phased, haplogenome assembly in highly outbred organisms such as forest trees. Eucalyptus tree species and interspecific hybrids are the most widely planted hardwood trees with F1 hybrids of Eucalyptus grandis and E. urophylla forming the bulk of fast-growing pulpwood plantations in subtropical regions. The extent of structural variation and its effect on interspecific hybridization is unknown in these trees. As a first step towards elucidating the extent of structural variation between the genomes of E. grandis and E. urophylla, we sequenced and assembled the haplogenomes contained in an F1 hybrid of the two species. FINDINGS Using Nanopore sequencing and a trio-binning approach, we assembled the separate haplogenomes (566.7 Mb and 544.5 Mb) to 98.0% BUSCO completion. High-density SNP genetic linkage maps of both parents allowed scaffolding of 88.0% of the haplogenome contigs into 11 pseudo-chromosomes (scaffold N50 of 43.8 Mb and 42.5 Mb for the E. grandis and E. urophylla haplogenomes, respectively). We identify 48,729 SVs between the two haplogenomes providing the first detailed insight into genome structural rearrangement in these species. The two haplogenomes have similar gene content, 35,572 and 33,915 functionally annotated genes, of which 34.7% are contained in genome rearrangements. CONCLUSIONS Knowledge of SV and haplotype diversity in the two species will form the basis for understanding the genetic basis of hybrid superiority in these trees.
Collapse
Affiliation(s)
- Anneri Lötter
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Private bag X20, Pretoria 0028, South Africa
| | - Tuan A Duong
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Private bag X20, Pretoria 0028, South Africa
| | - Julia Candotti
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Private bag X20, Pretoria 0028, South Africa
| | - Eshchar Mizrachi
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Private bag X20, Pretoria 0028, South Africa
| | - Jill L Wegrzyn
- Department of Ecology and Evolutionary Biology, Institute for Systems Genomics: Computational Biology Core, University of Connecticut, Storrs, CT 06269, USA
| | - Alexander A Myburg
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Private bag X20, Pretoria 0028, South Africa
| |
Collapse
|
7
|
Bredemeyer KR, Harris AJ, Li G, Zhao L, Foley NM, Roelke-Parker M, O'Brien SJ, Lyons LA, Warren WC, Murphy WJ. Ultracontinuous Single Haplotype Genome Assemblies for the Domestic Cat (Felis catus) and Asian Leopard Cat (Prionailurus bengalensis). J Hered 2021; 112:165-173. [PMID: 33305796 PMCID: PMC8006817 DOI: 10.1093/jhered/esaa057] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Accepted: 12/08/2020] [Indexed: 12/11/2022] Open
Abstract
In addition to including one of the most popular companion animals, species from the cat family Felidae serve as a powerful system for genetic analysis of inherited and infectious disease, as well as for the study of phenotypic evolution and speciation. Previous diploid-based genome assemblies for the domestic cat have served as the primary reference for genomic studies within the cat family. However, these versions suffered from poor resolution of complex and highly repetitive regions, with substantial amounts of unplaced sequence that is polymorphic or copy number variable. We sequenced the genome of a female F1 Bengal hybrid cat, the offspring of a domestic cat (Felis catus) x Asian leopard cat (Prionailurus bengalensis) cross, with PacBio long sequence reads and used Illumina sequence reads from the parents to phase >99.9% of the reads into the 2 species' haplotypes. De novo assembly of the phased reads produced highly continuous haploid genome assemblies for the domestic cat and Asian leopard cat, with contig N50 statistics exceeding 83 Mb for both genomes. Whole-genome alignments reveal the Felis and Prionailurus genomes are colinear, and the cytogenetic differences between the homologous F1 and E4 chromosomes represent a case of centromere repositioning in the absence of a chromosomal inversion. Both assemblies offer significant improvements over the previous domestic cat reference genome, with a 100% increase in contiguity and the capture of the vast majority of chromosome arms in 1 or 2 large contigs. We further demonstrated that comparably accurate F1 haplotype phasing can be achieved with members of the same species when one or both parents of the trio are not available. These novel genome resources will empower studies of feline precision medicine, adaptation, and speciation.
Collapse
Affiliation(s)
- Kevin R Bredemeyer
- Veterinary Integrative Biosciences, Texas A&M University, College Station, TX.,Interdisciplinary Program in Genetics, Texas A&M University, College Station, TX
| | - Andrew J Harris
- Veterinary Integrative Biosciences, Texas A&M University, College Station, TX.,Interdisciplinary Program in Genetics, Texas A&M University, College Station, TX
| | - Gang Li
- College of Life Sciences, Shaanxi Normal University, Xi'an, Shaanxi, China
| | - Le Zhao
- College of Life Sciences, Shaanxi Normal University, Xi'an, Shaanxi, China
| | - Nicole M Foley
- Veterinary Integrative Biosciences, Texas A&M University, College Station, TX
| | - Melody Roelke-Parker
- Frederick National Laboratory of Cancer Research, Leidos Biomedical Research, Inc., Frederick, MD
| | - Stephen J O'Brien
- Laboratory of Genomic Diversity-Center for Computer Technologies, ITMO University, Saint Petersburg, Russian Federation.,Guy Harvey Oceanographic Center, Nova Southeastern University, Fort Lauderdale, FL
| | - Leslie A Lyons
- Department of Veterinary Medicine & Surgery, College of Veterinary Medicine, University of Missouri, Columbia, MO
| | - Wesley C Warren
- Bond Life Science Center, University of Missouri, Columbia, MO
| | - William J Murphy
- Veterinary Integrative Biosciences, Texas A&M University, College Station, TX.,Interdisciplinary Program in Genetics, Texas A&M University, College Station, TX
| |
Collapse
|