1
|
Hu H, Li R, Zhao J, Batley J, Edwards D. Technological Development and Advances for Constructing and Analyzing Plant Pangenomes. Genome Biol Evol 2024; 16:evae081. [PMID: 38669452 PMCID: PMC11058698 DOI: 10.1093/gbe/evae081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 04/09/2024] [Accepted: 04/11/2024] [Indexed: 04/28/2024] Open
Abstract
A pangenome captures the genomic diversity for a species, derived from a collection of genetic sequences of diverse populations. Advances in sequencing technologies have given rise to three primary methods for pangenome construction and analysis: de novo assembly and comparison, reference genome-based iterative assembly, and graph-based pangenome construction. Each method presents advantages and challenges in processing varying amounts and structures of DNA sequencing data. With the emergence of high-quality genome assemblies and advanced bioinformatic tools, the graph-based pangenome is emerging as an advanced reference for exploring the biological and functional implications of genetic variations.
Collapse
Affiliation(s)
- Haifei Hu
- Rice Research Institute, Guangdong Academy of Agricultural Sciences & Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs & Guangdong Key Laboratory of New Technology in Rice Breeding & Guangdong Rice Engineering Laboratory, Guangzhou 510640, China
| | - Risheng Li
- Rice Research Institute, Guangdong Academy of Agricultural Sciences & Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs & Guangdong Key Laboratory of New Technology in Rice Breeding & Guangdong Rice Engineering Laboratory, Guangzhou 510640, China
- College of Agriculture, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Junliang Zhao
- Rice Research Institute, Guangdong Academy of Agricultural Sciences & Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs & Guangdong Key Laboratory of New Technology in Rice Breeding & Guangdong Rice Engineering Laboratory, Guangzhou 510640, China
| | - Jacqueline Batley
- School of Biological Sciences, University of Western Australia, Perth, WA, Australia
| | - David Edwards
- School of Biological Sciences, University of Western Australia, Perth, WA, Australia
- Centre for Applied Bioinformatics, University of Western Australia, Perth, WA 6009, Australia
| |
Collapse
|
2
|
Mo C, Wang H, Wei M, Zeng Q, Zhang X, Fei Z, Zhang Y, Kong Q. Complete genome assembly provides a high-quality skeleton for pan-NLRome construction in melon. Plant J 2024. [PMID: 38430487 DOI: 10.1111/tpj.16705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Revised: 02/16/2024] [Accepted: 02/22/2024] [Indexed: 03/03/2024]
Abstract
Melon (Cucumis melo L.), being under intensive domestication and selective breeding, displays an abundant phenotypic diversity. Wild germplasm with tolerance to stress represents an untapped genetic resource for discovery of disease-resistance genes. To comprehensively characterize resistance genes in melon, we generate a telomere-to-telomere (T2T) and gap-free genome of wild melon accession PI511890 (C. melo var. chito) with a total length of 375.0 Mb and a contig N50 of 31.24 Mb. The complete genome allows us to dissect genome architecture and identify resistance gene analogs. We construct a pan-NLRome using seven melon genomes, which include 208 variable and 18 core nucleotide-binding leucine-rich repeat receptors (NLRs). Multiple disease-related transcriptome analyses indicate that most up-regulated NLRs induced by pathogens are shell or cloud NLRs. The T2T gap-free assembly and the pan-NLRome not only serve as essential resources for genomic studies and molecular breeding of melon but also provide insights into the genome architecture and NLR diversity.
Collapse
Affiliation(s)
- Changjuan Mo
- National Key Laboratory for Germplasm Innovation and Utilization of Horticultural Crops, College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
| | - Haiyan Wang
- National Key Laboratory for Germplasm Innovation and Utilization of Horticultural Crops, College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
| | - Minghua Wei
- National Key Laboratory for Germplasm Innovation and Utilization of Horticultural Crops, College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
| | - Qingguo Zeng
- National Key Laboratory for Germplasm Innovation and Utilization of Horticultural Crops, College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
| | - Xuejun Zhang
- Hami-melon Research Center, Xinjiang Academy of Agricultural Sciences, Urumqi, 830091, China
| | | | - Yongbing Zhang
- Hami-melon Research Center, Xinjiang Academy of Agricultural Sciences, Urumqi, 830091, China
| | - Qiusheng Kong
- National Key Laboratory for Germplasm Innovation and Utilization of Horticultural Crops, College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
| |
Collapse
|
3
|
Wu X, Simpson SA, Youngblood RC, Liu XF, Scheffler BE, Rinehart TA, Alexander LW, Hulse-Kemp AM. Two haplotype-resolved genomes reveal important flower traits in bigleaf hydrangea ( Hydrangea macrophylla) and insights into Asterid evolution. Hortic Res 2023; 10:uhad217. [PMID: 38130599 PMCID: PMC10734616 DOI: 10.1093/hr/uhad217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Accepted: 10/07/2023] [Indexed: 12/23/2023]
Abstract
The Hydrangea genus belongs to the Hydrangeaceae family, in the Cornales order of flowering plants, which early diverged among the Asterids, and includes several species that are commonly used ornamental plants. Of them, Hydrangea macrophylla is one of the most valuable species in the nursery trade, yet few genomic resources are available for this crop or closely related Asterid species. Two high-quality haplotype-resolved reference genomes of hydrangea cultivars 'Veitchii' and 'Endless Summer' [highest quality at 2.22 gigabase pairs (Gb), 396 contigs, N50 22.8 megabase pairs (Mb)] were assembled and scaffolded into the expected 18 pseudochromosomes. Utilizing the newly developed high-quality reference genomes along with high-quality genomes of other related flowering plants, nuclear data were found to support a single divergence point in the Asterids clade where both the Cornales and Ericales diverged from the euasterids. Genetic mapping with an F1 hybrid population demonstrated the power of linkage mapping combined with the new genomic resources to identify the gene for inflorescence shape, CYP78A5 located on chromosome 4, and a novel gene, BAM3 located on chromosome 17, for causing double flower. Resources developed in this study will not only help to accelerate hydrangea genetic improvement but also contribute to understanding the largest group of flowering plants, the Asterids.
Collapse
Affiliation(s)
- Xingbo Wu
- Department of Environmental Horticulture, Tropical Research and Education Center, University of Florida, Homestead, FL 33031, United States
- Genomics and Bioinformatics Research Unit, USDA-ARS, Raleigh, NC 27695, United States
| | - Sheron A Simpson
- Genomics and Bioinformatics Research Unit, USDA-ARS, Stoneville, MS 38776, United States
| | - Ramey C Youngblood
- Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University, Starkville, MS 39762, United States
| | - Xiaofen F Liu
- Genomics and Bioinformatics Research Unit, USDA-ARS, Stoneville, MS 38776, United States
| | - Brian E Scheffler
- Genomics and Bioinformatics Research Unit, USDA-ARS, Stoneville, MS 38776, United States
| | - Timothy A Rinehart
- Crop Production and Protection, USDA-ARS, Beltsville, MD 20705, United States
| | - Lisa W Alexander
- Floral and Nursery Plants Research Unit, U.S. National Arboretum, USDA-ARS, McMinnville, TN 37110, United States
| | - Amanda M Hulse-Kemp
- Genomics and Bioinformatics Research Unit, USDA-ARS, Raleigh, NC 27695, United States
- Department of Crop and Soil Sciences, North Carolina State University, Raleigh, NC 27695, United States
| |
Collapse
|
4
|
Delorean EE, Youngblood RC, Simpson SA, Schoonmaker AN, Scheffler BE, Rutter WB, Hulse-Kemp AM. Representing true plant genomes: haplotype-resolved hybrid pepper genome with trio-binning. Front Plant Sci 2023; 14:1184112. [PMID: 38034563 PMCID: PMC10687446 DOI: 10.3389/fpls.2023.1184112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Accepted: 10/17/2023] [Indexed: 12/02/2023]
Abstract
As sequencing costs decrease and availability of high fidelity long-read sequencing increases, generating experiment specific de novo genome assemblies becomes feasible. In many crop species, obtaining the genome of a hybrid or heterozygous individual is necessary for systems that do not tolerate inbreeding or for investigating important biological questions, such as hybrid vigor. However, most genome assembly methods that have been used in plants result in a merged single sequence representation that is not a true biologically accurate representation of either haplotype within a diploid individual. The resulting genome assembly is often fragmented and exhibits a mosaic of the two haplotypes, referred to as haplotype-switching. Important haplotype level information, such as causal mutations and structural variation is therefore lost causing difficulties in interpreting downstream analyses. To overcome this challenge, we have applied a method developed for animal genome assembly called trio-binning to an intra-specific hybrid of chili pepper (Capsicum annuum L. cv. HDA149 x Capsicum annuum L. cv. HDA330). We tested all currently available softwares for performing trio-binning, combined with multiple scaffolding technologies including Bionano to determine the optimal method of producing the best haplotype-resolved assembly. Ultimately, we produced highly contiguous biologically true haplotype-resolved genome assemblies for each parent, with scaffold N50s of 266.0 Mb and 281.3 Mb, with 99.6% and 99.8% positioned into chromosomes respectively. The assemblies captured 3.10 Gb and 3.12 Gb of the estimated 3.5 Gb chili pepper genome size. These assemblies represent the complete genome structure of the intraspecific hybrid, as well as the two parental genomes, and show measurable improvements over the currently available reference genomes. Our manuscript provides a valuable guide on how to apply trio-binning to other plant genomes.
Collapse
Affiliation(s)
- Emily E. Delorean
- Genomics and Bioinformatics Research Unit, USDA-ARS, Raleigh, NC, United States
- Crop and Soil Sciences Department, North Carolina State University, Raleigh, NC, United States
| | - Ramey C. Youngblood
- Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University, Starkville, MS, United States
| | - Sheron A. Simpson
- Genomics and Bioinformatics Research Unit, United States Department of Agriculture - Agriculture Research Service (USDA-ARS), Stoneville, MS, United States
| | - Ashley N. Schoonmaker
- Crop and Soil Sciences Department, North Carolina State University, Raleigh, NC, United States
| | - Brian E. Scheffler
- Genomics and Bioinformatics Research Unit, United States Department of Agriculture - Agriculture Research Service (USDA-ARS), Stoneville, MS, United States
| | - William B. Rutter
- US Vegetable Laboratory, United States Department of Agriculture - Agriculture Research Service (USDA-ARS), Charleston, SC, United States
| | - Amanda M. Hulse-Kemp
- Genomics and Bioinformatics Research Unit, USDA-ARS, Raleigh, NC, United States
- Crop and Soil Sciences Department, North Carolina State University, Raleigh, NC, United States
| |
Collapse
|
5
|
Naithani S, Deng CH, Sahu SK, Jaiswal P. Exploring Pan-Genomes: An Overview of Resources and Tools for Unraveling Structure, Function, and Evolution of Crop Genes and Genomes. Biomolecules 2023; 13:1403. [PMID: 37759803 PMCID: PMC10527062 DOI: 10.3390/biom13091403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 08/29/2023] [Accepted: 09/12/2023] [Indexed: 09/29/2023] Open
Abstract
The availability of multiple sequenced genomes from a single species made it possible to explore intra- and inter-specific genomic comparisons at higher resolution and build clade-specific pan-genomes of several crops. The pan-genomes of crops constructed from various cultivars, accessions, landraces, and wild ancestral species represent a compendium of genes and structural variations and allow researchers to search for the novel genes and alleles that were inadvertently lost in domesticated crops during the historical process of crop domestication or in the process of extensive plant breeding. Fortunately, many valuable genes and alleles associated with desirable traits like disease resistance, abiotic stress tolerance, plant architecture, and nutrition qualities exist in landraces, ancestral species, and crop wild relatives. The novel genes from the wild ancestors and landraces can be introduced back to high-yielding varieties of modern crops by implementing classical plant breeding, genomic selection, and transgenic/gene editing approaches. Thus, pan-genomic represents a great leap in plant research and offers new avenues for targeted breeding to mitigate the impact of global climate change. Here, we summarize the tools used for pan-genome assembly and annotations, web-portals hosting plant pan-genomes, etc. Furthermore, we highlight a few discoveries made in crops using the pan-genomic approach and future potential of this emerging field of study.
Collapse
Affiliation(s)
- Sushma Naithani
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA;
| | - Cecilia H. Deng
- Molecular & Digital Breeing Group, New Cultivar Innovation, The New Zealand Institute for Plant and Food Research Limited, Private Bag 92169, Auckland 1142, New Zealand;
| | - Sunil Kumar Sahu
- State Key Laboratory of Agricultural Genomics, Key Laboratory of Genomics, Ministry of Agriculture, BGI Research, Shenzhen 518083, China;
| | - Pankaj Jaiswal
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA;
| |
Collapse
|
6
|
Wang Z, Kim W, Wang YW, Yakubovich E, Dong C, Trail F, Townsend JP, Yarden O. The Sordariomycetes: an expanding resource with Big Data for mining in evolutionary genomics and transcriptomics. Front Fungal Biol 2023; 4:1214537. [PMID: 37746130 PMCID: PMC10512317 DOI: 10.3389/ffunb.2023.1214537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Accepted: 06/06/2023] [Indexed: 09/26/2023]
Abstract
Advances in genomics and transcriptomics accompanying the rapid accumulation of omics data have provided new tools that have transformed and expanded the traditional concepts of model fungi. Evolutionary genomics and transcriptomics have flourished with the use of classical and newer fungal models that facilitate the study of diverse topics encompassing fungal biology and development. Technological advances have also created the opportunity to obtain and mine large datasets. One such continuously growing dataset is that of the Sordariomycetes, which exhibit a richness of species, ecological diversity, economic importance, and a profound research history on amenable models. Currently, 3,574 species of this class have been sequenced, comprising nearly one-third of the available ascomycete genomes. Among these genomes, multiple representatives of the model genera Fusarium, Neurospora, and Trichoderma are present. In this review, we examine recently published studies and data on the Sordariomycetes that have contributed novel insights to the field of fungal evolution via integrative analyses of the genetic, pathogenic, and other biological characteristics of the fungi. Some of these studies applied ancestral state analysis of gene expression among divergent lineages to infer regulatory network models, identify key genetic elements in fungal sexual development, and investigate the regulation of conidial germination and secondary metabolism. Such multispecies investigations address challenges in the study of fungal evolutionary genomics derived from studies that are often based on limited model genomes and that primarily focus on the aspects of biology driven by knowledge drawn from a few model species. Rapidly accumulating information and expanding capabilities for systems biological analysis of Big Data are setting the stage for the expansion of the concept of model systems from unitary taxonomic species/genera to inclusive clusters of well-studied models that can facilitate both the in-depth study of specific lineages and also investigation of trait diversity across lineages. The Sordariomycetes class, in particular, offers abundant omics data and a large and active global research community. As such, the Sordariomycetes can form a core omics clade, providing a blueprint for the expansion of our knowledge of evolution at the genomic scale in the exciting era of Big Data and artificial intelligence, and serving as a reference for the future analysis of different taxonomic levels within the fungal kingdom.
Collapse
Affiliation(s)
- Zheng Wang
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, United States
| | - Wonyong Kim
- Korean Lichen Research Institute, Sunchon National University, Suncheon, Republic of Korea
| | - Yen-Wen Wang
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, United States
| | - Elizabeta Yakubovich
- Department of Plant Pathology and Microbiology, The Robert H. Smith Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Caihong Dong
- Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Frances Trail
- Department of Plant Biology, Michigan State University, East Lansing, MI, United States
- Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI, United States
| | - Jeffrey P. Townsend
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, United States
- Department of Ecology and Evolutionary Biology, Program in Microbiology, and Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, United States
| | - Oded Yarden
- Department of Plant Pathology and Microbiology, The Robert H. Smith Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
| |
Collapse
|