1
|
Du ZZ, He JB, Jiao WB. A comprehensive benchmark of graph-based genetic variant genotyping algorithms on plant genomes for creating an accurate ensemble pipeline. Genome Biol 2024; 25:91. [PMID: 38589937 PMCID: PMC11003132 DOI: 10.1186/s13059-024-03239-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Accepted: 04/04/2024] [Indexed: 04/10/2024] Open
Abstract
BACKGROUND Although sequencing technologies have boosted the measurement of the genomic diversity of plant crops, it remains challenging to accurately genotype millions of genetic variants, especially structural variations, with only short reads. In recent years, many graph-based variation genotyping methods have been developed to address this issue and tested for human genomes. However, their performance in plant genomes remains largely elusive. Furthermore, pipelines integrating the advantages of current genotyping methods might be required, considering the different complexity of plant genomes. RESULTS Here we comprehensively evaluate eight such genotypers in different scenarios in terms of variant type and size, sequencing parameters, genomic context, and complexity, as well as graph size, using both simulated and real data sets from representative plant genomes. Our evaluation reveals that there are still great challenges to applying existing methods to plants, such as excessive repeats and variants or high resource consumption. Therefore, we propose a pipeline called Ensemble Variant Genotyper (EVG) that can achieve better genotyping performance in almost all experimental scenarios and comparably higher genotyping recall and precision even using 5× reads. Furthermore, we demonstrate that EVG is more robust with an increasing number of graphed genomes, especially for insertions and deletions. CONCLUSIONS Our study will provide new insights into the development and application of graph-based genotyping algorithms. We conclude that EVG provides an accurate, unbiased, and cost-effective way for genotyping both small and large variations and will be potentially used in population-scale genotyping for large, repetitive, and heterozygous plant genomes.
Collapse
Affiliation(s)
- Ze-Zhen Du
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan, China
- Hubei Hongshan Laboratory, Wuhan, China
| | - Jia-Bao He
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan, China
- Hubei Hongshan Laboratory, Wuhan, China
| | - Wen-Biao Jiao
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan, China.
- Hubei Hongshan Laboratory, Wuhan, China.
| |
Collapse
|
2
|
Song B, Buckler ES, Stitzer MC. New whole-genome alignment tools are needed for tapping into plant diversity. Trends Plant Sci 2024; 29:355-369. [PMID: 37749022 DOI: 10.1016/j.tplants.2023.08.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 07/19/2023] [Accepted: 08/23/2023] [Indexed: 09/27/2023]
Abstract
Genome alignment is one of the most foundational methods for genome sequence studies. With rapid advances in sequencing and assembly technologies, these newly assembled genomes present challenges for alignment tools to meet the increased complexity and scale. Plant genome alignment is technologically challenging because of frequent whole-genome duplications (WGDs) as well as chromosome rearrangements and fractionation, high nucleotide diversity, widespread structural variation, and high transposable element (TE) activity causing large proportions of repeat elements. We summarize classical pairwise and multiple genome alignment (MGA) methods, and highlight techniques that are widely used or are being developed by the plant research community. We also outline the remaining challenges for precise genome alignment and the interpretation of alignment results in plants.
Collapse
Affiliation(s)
- Baoxing Song
- National Key Laboratory of Wheat Improvement, Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agriculture Sciences in Weifang, Weifang, Shandong 261325, China; Key Laboratory of Maize Biology and Genetic Breeding in Arid Area of Northwest Region of the Ministry of Agriculture, College of Agronomy, Northwest A&F University, Yangling, Shaanxi 712100, China.
| | - Edward S Buckler
- Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853, USA; Section of Plant Breeding and Genetics, Cornell University, Ithaca, NY 14853, USA; Agricultural Research Service, United States Department of Agriculture, Ithaca, NY 14853, USA
| | - Michelle C Stitzer
- Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853, USA; Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA.
| |
Collapse
|
3
|
Verdú-Navarro F, Moreno-Cid JA, Weiss J, Egea-Cortines M. The advent of plant cells in bioreactors. Front Plant Sci 2023; 14:1310405. [PMID: 38148861 PMCID: PMC10749943 DOI: 10.3389/fpls.2023.1310405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 12/01/2023] [Indexed: 12/28/2023]
Abstract
Ever since agriculture started, plants have been bred to obtain better yields, better fruits, or sustainable products under uncertain biotic and abiotic conditions. However, a new way to obtain products from plant cells emerged with the development of recombinant DNA technologies. This led to the possibility of producing exogenous molecules in plants. Furthermore, plant chemodiversity has been the main source of pharmacological molecules, opening a field of plant biotechnology directed to produce high quality plant metabolites. The need for different products by the pharma, cosmetics agriculture and food industry has pushed again to develop new procedures. These include cell production in bioreactors. While plant tissue and cell culture are an established technology, beginning over a hundred years ago, plant cell cultures have shown little impact in biotechnology projects, compared to bacterial, yeasts or animal cells. In this review we address the different types of bioreactors that are currently used for plant cell production and their usage for quality biomolecule production. We make an overview of Nicotiana tabacum, Nicotiana benthamiana, Oryza sativa, Daucus carota, Vitis vinifera and Physcomitrium patens as well-established models for plant cell culture, and some species used to obtain important metabolites, with an insight into the type of bioreactor and production protocols.
Collapse
Affiliation(s)
- Fuensanta Verdú-Navarro
- Bioprocessing R&D Department, Bionet, Parque Tecnológico Fuente Álamo, Fuente Álamo, Spain
- Genética Molecular, Instituto de Biotecnología Vegetal, Universidad Politécnica de Cartagena, Cartagena, Spain
| | - Juan A. Moreno-Cid
- Bioprocessing R&D Department, Bionet, Parque Tecnológico Fuente Álamo, Fuente Álamo, Spain
| | - Julia Weiss
- Genética Molecular, Instituto de Biotecnología Vegetal, Universidad Politécnica de Cartagena, Cartagena, Spain
| | - Marcos Egea-Cortines
- Genética Molecular, Instituto de Biotecnología Vegetal, Universidad Politécnica de Cartagena, Cartagena, Spain
| |
Collapse
|
4
|
Li M, Feng Y, Han Q, Yang Y, Shi Y, Zheng D, Zhang W. Genomic variations combined with epigenetic modifications rewire open chromatin in rice. Plant Physiol 2023; 193:1880-1896. [PMID: 37539937 DOI: 10.1093/plphys/kiad440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Revised: 07/14/2023] [Accepted: 07/19/2023] [Indexed: 08/05/2023]
Abstract
Cis-regulatory elements (CREs) fine-tune gene transcription in eukaryotes. CREs with sequence variations play vital roles in driving plant or crop domestication. However, how global sequence and structural variations (SVs) are responsible for multilevel changes between indica and japonica rice (Oryza sativa) is still not fully elucidated. To address this, we conducted multiomic studies using MNase hypersensitivity sequencing (MH-seq) in combination with RNA sequencing (RNA-seq), chromatin immunoprecipitation sequencing (ChIP-seq), and bisulfite sequencing (BS-seq) between the japonica rice variety Nipponbare (NIP) and indica rice variety 93-11. We found that differential MNase hypersensitive sites (MHSs) exhibited some distinct intrinsic genomic sequence features between NIP and 93-11. Notably, through MHS-genome-wide association studies (GWAS) integration, we found that key sequence variations may be associated with differences of agronomic traits between NIP and 93-11, which is partly achieved by MHSs harboring CREs. In addition, SV-derived differential MHSs caused by transposable element (TE) insertion, especially by noncommon TEs among rice varieties, were associated with genes with distinct functions, indicating that TE-driven gene neo- or subfunctionalization is mediated by changes of chromatin openness. This study thus provides insights into how sequence and genomic SVs control agronomic traits of NIP and 93-11; it also provides genome-editing targets for molecular breeding aiming at improving favorable agronomic properties.
Collapse
Affiliation(s)
- Mengqi Li
- State Key Laboratory of Crop Genetics and Germplasm Enhancement and Utilization, CIC-MCP, Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, China
| | - Yilong Feng
- State Key Laboratory of Crop Genetics and Germplasm Enhancement and Utilization, CIC-MCP, Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, China
| | - Qi Han
- State Key Laboratory of Crop Genetics and Germplasm Enhancement and Utilization, CIC-MCP, Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, China
| | - Ying Yang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement and Utilization, CIC-MCP, Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, China
| | - Yining Shi
- State Key Laboratory of Crop Genetics and Germplasm Enhancement and Utilization, CIC-MCP, Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, China
| | - Dongyang Zheng
- State Key Laboratory of Crop Genetics and Germplasm Enhancement and Utilization, CIC-MCP, Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, China
| | - Wenli Zhang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement and Utilization, CIC-MCP, Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, China
| |
Collapse
|
5
|
Weber SE, Chawla HS, Ehrig L, Hickey LT, Frisch M, Snowdon RJ. Accurate prediction of quantitative traits with failed SNP calls in canola and maize. Front Plant Sci 2023; 14:1221750. [PMID: 37936929 PMCID: PMC10627008 DOI: 10.3389/fpls.2023.1221750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 10/05/2023] [Indexed: 11/09/2023]
Abstract
In modern plant breeding, genomic selection is becoming the gold standard to select superior genotypes in large breeding populations that are only partially phenotyped. Many breeding programs commonly rely on single-nucleotide polymorphism (SNP) markers to capture genome-wide data for selection candidates. For this purpose, SNP arrays with moderate to high marker density represent a robust and cost-effective tool to generate reproducible, easy-to-handle, high-throughput genotype data from large-scale breeding populations. However, SNP arrays are prone to technical errors that lead to failed allele calls. To overcome this problem, failed calls are often imputed, based on the assumption that failed SNP calls are purely technical. However, this ignores the biological causes for failed calls-for example: deletions-and there is increasing evidence that gene presence-absence and other kinds of genome structural variants can play a role in phenotypic expression. Because deletions are frequently not in linkage disequilibrium with their flanking SNPs, permutation of missing SNP calls can potentially obscure valuable marker-trait associations. In this study, we analyze published datasets for canola and maize using four parametric and two machine learning models and demonstrate that failed allele calls in genomic prediction are highly predictive for important agronomic traits. We present two statistical pipelines, based on population structure and linkage disequilibrium, that enable the filtering of failed SNP calls that are likely caused by biological reasons. For the population and trait examined, prediction accuracy based on these filtered failed allele calls was competitive to standard SNP-based prediction, underlying the potential value of missing data in genomic prediction approaches. The combination of SNPs with all failed allele calls or the filtered allele calls did not outperform predictions with only SNP-based prediction due to redundancy in genomic relationship estimates.
Collapse
Affiliation(s)
- Sven E. Weber
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | | | - Lennard Ehrig
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | - Lee T. Hickey
- Centre for Crop Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia
| | - Matthias Frisch
- Department of Biometry and Population Genetics, Justus Liebig University, Giessen, Germany
| | - Rod J. Snowdon
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| |
Collapse
|
6
|
Zhang YX, Wang XF, Niu YQ, Wang YG, Zhang WJ, Song ZP, Yang J, Li LF. Evolutionary roles of polyploidization-derived structural variations in the phenotypic diversification of Panax species. Mol Ecol 2023; 32:4999-5012. [PMID: 37525516 DOI: 10.1111/mec.17088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 07/10/2023] [Accepted: 07/19/2023] [Indexed: 08/02/2023]
Abstract
Genomic structural variations (SVs) are widespread in plant and animal genomes and play important roles in phenotypic novelty and species adaptation. Frequent whole genome duplications followed by (re)diploidizations have resulted in high diversity of genome architecture among extant species. In this study, we identified abundant genomic SVs in the Panax genus that are hypothesized to have occurred through during the repeated polyploidizations/(re)diploidizations. Our genome-wide comparisons demonstrated that although these polyploidization-derived SVs have evolved at distinct evolutionary stages, a large number of SV-intersecting genes showed enrichment in functionally important pathways related to secondary metabolites, photosynthesis and basic cellular activities. In line with these observations, our metabolic analyses of these Panax species revealed high diversity of primary and secondary metabolites both at the tissue and interspecific levels. In particular, genomic SVs identified at ginsenoside biosynthesis genes, including copy number variation and large fragment deletion, appear to have played important roles in the evolution and diversification of ginsenosides. A further herbivore deterrence experiment demonstrated that, as major triterpenoidal saponins found exclusively in Panax, ginsenosides provide protection against insect herbivores. Our study provides new insights on how polyploidization-derived SVs have contributed to phenotypic novelty and plant adaptation.
Collapse
Affiliation(s)
- Yu-Xin Zhang
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Xin-Feng Wang
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Yu-Qian Niu
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Yu-Guo Wang
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Wen-Ju Zhang
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Zhi-Ping Song
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Ji Yang
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Lin-Feng Li
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai, China
| |
Collapse
|
7
|
Li B, Gschwend AR. Vitis labrusca genome assembly reveals diversification between wild and cultivated grapevine genomes. Front Plant Sci 2023; 14:1234130. [PMID: 37719220 PMCID: PMC10501149 DOI: 10.3389/fpls.2023.1234130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/03/2023] [Accepted: 08/03/2023] [Indexed: 09/19/2023]
Abstract
Wild grapevines are important genetic resources in breeding programs to confer adaptive fitness traits and unique fruit characteristics, but the genetics underlying these traits, and their evolutionary origins, are largely unknown. To determine the factors that contributed to grapevine genome diversification, we performed comprehensive intragenomic and intergenomic analyses with three cultivated European (including the PN40024 reference genome) and two wild North American grapevine genomes, including our newly released Vitis labrusca genome. We found the heterozygosity of the cultivated grapevine genomes was twice as high as the wild grapevine genomes studied. Approximately 30% of V. labrusca and 48% of V. vinifera Chardonnay genes were heterozygous or hemizygous and a considerable number of collinear genes between Chardonnay and V. labrusca had different gene zygosity. Our study revealed evidence that supports gene gain-loss events in parental genomes resulted in the inheritance of hemizygous genes in the Chardonnay genome. Thousands of segmental duplications supplied source material for genome-specific genes, further driving diversification of the genomes studied. We found an enrichment of recently duplicated, adaptive genes in similar functional pathways, but differential retention of environment-specific adaptive genes within each genome. For example, large expansions of NLR genes were discovered in the two wild grapevine genomes studied. Our findings support variation in transposable elements contributed to unique traits in grapevines. Our work revealed gene zygosity, segmental duplications, gene gain-and-loss variations, and transposable element polymorphisms can be key driving forces for grapevine genome diversification.
Collapse
Affiliation(s)
| | - Andrea R. Gschwend
- Department of Horticulture and Crop Science, The Ohio State University, Columbus, OH, United States
| |
Collapse
|
8
|
Li G, Jiang D, Wang J, Liao Y, Zhang T, Zhang H, Dai X, Ren H, Chen C, Zheng Y. A High-Continuity Genome Assembly of Chinese Flowering Cabbage ( Brassica rapa var. parachinensis) Provides New Insights into Brassica Genome Structure Evolution. Plants (Basel) 2023; 12:2498. [PMID: 37447059 DOI: 10.3390/plants12132498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 06/19/2023] [Accepted: 06/27/2023] [Indexed: 07/15/2023]
Abstract
Chinese flowering cabbage (Brassica rapa var. parachinensis) is a popular and widely cultivated leaf vegetable crop in Asia. Here, we performed a high quality de novo assembly of the 384 Mb genome of 10 chromosomes of a typical cultivar of Chinese flowering cabbage with an integrated approach using PacBio, Illumina, and Hi-C technology. We modeled 47,598 protein-coding genes in this analysis and annotated 52% (205.9/384) of its genome as repetitive sequences including 17% in DNA transposons and 22% in long terminal retrotransposons (LTRs). Phylogenetic analysis reveals the genome of the Chinese flowering cabbage has a closer evolutionary relationship with the AA diploid progenitor of the allotetraploid species, Brassica juncea. Comparative genomic analysis of Brassica species with different subgenome types (A, B and C) reveals that the pericentromeric regions on chromosome 5 and 6 of the AA genome have been significantly expanded compared to the orthologous genomic regions in the BB and CC genomes, largely driven by LTR-retrotransposon amplification. Furthermore, we identified a large number of structural variations (SVs) within the B. rapa lines that could impact coding genes, suggesting the functional significance of SVs on Brassica genome evolution. Overall, our high-quality genome assembly of the Chinese flowering cabbage provides a valuable genetic resource for deciphering the genome evolution of Brassica species and it can potentially serve as the reference genome guiding the molecular breeding practice of B. rapa crops.
Collapse
Affiliation(s)
- Guangguang Li
- Guangzhou Academy of Agricultural Sciences, Guangzhou 510335, China
| | - Ding Jiang
- Guangzhou Academy of Agricultural Sciences, Guangzhou 510335, China
| | - Juntao Wang
- College of Horticulture, South China Agricultural University, Guangzhou 510642, China
| | - Yi Liao
- College of Horticulture, South China Agricultural University, Guangzhou 510642, China
| | - Ting Zhang
- College of Horticulture, South China Agricultural University, Guangzhou 510642, China
| | - Hua Zhang
- Guangzhou Academy of Agricultural Sciences, Guangzhou 510335, China
| | - Xiuchun Dai
- Guangzhou Academy of Agricultural Sciences, Guangzhou 510335, China
| | - Hailong Ren
- Guangzhou Academy of Agricultural Sciences, Guangzhou 510335, China
| | - Changming Chen
- College of Horticulture, South China Agricultural University, Guangzhou 510642, China
| | - Yansong Zheng
- Guangzhou Academy of Agricultural Sciences, Guangzhou 510335, China
| |
Collapse
|
9
|
Deb SK, Edger PP, Pires JC, McKain MR. Patterns, mechanisms, and consequences of homoeologous exchange in allopolyploid angiosperms: a genomic and epigenomic perspective. New Phytol 2023; 238:2284-2304. [PMID: 37010081 DOI: 10.1111/nph.18927] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 03/16/2023] [Indexed: 05/19/2023]
Abstract
Allopolyploids result from hybridization between different evolutionary lineages coupled with genome doubling. Homoeologous chromosomes (chromosomes with common shared ancestry) may undergo recombination immediately after allopolyploid formation and continue over successive generations. The outcome of this meiotic pairing behavior is dynamic and complex. Homoeologous exchanges (HEs) may lead to the formation of unbalanced gametes, reduced fertility, and selective disadvantage. By contrast, HEs could act as sources of novel evolutionary substrates, shifting the relative dosage of parental gene copies, generating novel phenotypic diversity, and helping the establishment of neo-allopolyploids. However, HE patterns vary among lineages, across generations, and even within individual genomes and chromosomes. The causes and consequences of this variation are not fully understood, though interest in this evolutionary phenomenon has increased in the last decade. Recent technological advances show promise in uncovering the mechanistic basis of HEs. Here, we describe recent observations of the common patterns among allopolyploid angiosperm lineages, underlying genomic and epigenomic features, and consequences of HEs. We identify critical research gaps and discuss future directions with far-reaching implications in understanding allopolyploid evolution and applying them to the development of important phenotypic traits of polyploid crops.
Collapse
Affiliation(s)
- Sontosh K Deb
- Department of Biological Sciences, The University of Alabama, Tuscaloosa, AL, 35487, USA
- Department of Forestry and Environmental Science, Shahjalal University of Science and Technology, Sylhet, 3114, Bangladesh
| | - Patrick P Edger
- Department of Horticulture, Michigan State University, East Lansing, MI, 48823, USA
- Genetics and Genome Sciences Program, Michigan State University, East Lansing, MI, 48823, USA
| | - J Chris Pires
- Department of Soil and Crop Sciences, Colorado State University, Fort Collins, CO, 80523, USA
| | - Michael R McKain
- Department of Biological Sciences, The University of Alabama, Tuscaloosa, AL, 35487, USA
| |
Collapse
|
10
|
Hang Y, Yue L, Bingrui S, Qing L, Xingxue M, Liqun J, Shuwei L, Jing Z, Pingli C, Dajian P, Wenfeng C, Zhilan F, Chen L. Genetic Diversity and Breeding Signatures for Regional Indica Rice Improvement in Guangdong of Southern China. Rice (N Y) 2023; 16:25. [PMID: 37191779 DOI: 10.1186/s12284-023-00642-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 05/14/2023] [Indexed: 05/17/2023]
Abstract
As the pioneer of the Green Revolution in China, Guangdong province witnessed the improvement and spread of semi-dwarf Xian/Indica rice cultivars and possessed diverse rice germplasm of landrace and cultivars. A total of 517 accessions containing a core germplasm of 479 newly sequenced landraces and modern cultivars were used to reveal breeding signatures and key variations for regional genetic improvement of indica rice from Guangdong. Four subpopulations were identified in the collection, which including Ind IV as a novel subpopulation that not covered by previously released accessions. Modern cultivars of subpopulation Ind II were inferred to have less deleterious variations, especially in yield related genes. About 15 Mb genomic segments were identified as potential breeding signatures by cross-population likelihood method (XP-CLR) of modern cultivars and landraces. The selected regions spanning multiple yield related QTLs (quantitative trait locus) which identified by GWAS (genome-wide association studies) of the same population, and specific variations that fixed in modern cultivars of Ind II were characterized. This study highlights genetic differences between traditional landraces and modern cultivars, which revealed the potential molecular basis of regional genetic improvement for Guangdong indica rice from southern China.
Collapse
Affiliation(s)
- Yu Hang
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China
- Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangzhou, 510640, China
- Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou, 510640, China
- Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Liu Yue
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China
- Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangzhou, 510640, China
- Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou, 510640, China
- Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Sun Bingrui
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China
- Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangzhou, 510640, China
- Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou, 510640, China
- Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Liu Qing
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China
- Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangzhou, 510640, China
- Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou, 510640, China
- Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Mao Xingxue
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China
- Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangzhou, 510640, China
- Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou, 510640, China
- Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Jiang Liqun
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China
- Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangzhou, 510640, China
- Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou, 510640, China
- Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Lyu Shuwei
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China
- Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangzhou, 510640, China
- Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou, 510640, China
- Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Zhang Jing
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China
- Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangzhou, 510640, China
- Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou, 510640, China
- Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Chen Pingli
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China
- Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangzhou, 510640, China
- Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou, 510640, China
- Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Pan Dajian
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China
- Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangzhou, 510640, China
- Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou, 510640, China
- Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Chen Wenfeng
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China
- Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangzhou, 510640, China
- Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou, 510640, China
- Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Fan Zhilan
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China
- Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangzhou, 510640, China
- Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou, 510640, China
- Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Li Chen
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China.
- Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangzhou, 510640, China.
- Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou, 510640, China.
- Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China.
| |
Collapse
|
11
|
Huang B, Yan H, Sun M, Jin Y. Novel discovery in roles of structural variations and RWP-RK transcription factors in heat tolerance for pearl millet. Stress Biol 2023; 3:12. [PMID: 37676357 PMCID: PMC10442032 DOI: 10.1007/s44154-023-00092-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 05/03/2023] [Indexed: 09/08/2023]
Abstract
Global warming adversely affects crop production worldwide. Massive efforts have been undertaken to study mechanisms regulating heat tolerance in plants. However, the roles of structural variations (SVs) in heat stress tolerance remain unclear. In a recent article, Yan et al. (Nat Genet 1-12, 2023) constructed the first pan-genome of pearl millet (Pennisetum glaucum) and identified key SVs linked to genes involved in regulating plant tolerance to heat stress for an important crop with a superior ability to thrive in extremely hot and arid climates. Through multi-omics analyses integrating by pan-genomics, comparative genomics, transcriptomics, population genetics and and molecular biological technologies, they found RWP-RK transcription factors cooperating with endoplasmic reticulum-related genes play key roles in heat tolerance in pearl millet. The results in this paper provided novel insights to advance the understanding of the genetic and genomic basis of heat tolerance and an exceptional resource for molecular breeding to improve heat tolerance in pearl millet and other crops.
Collapse
Affiliation(s)
- Bingru Huang
- Department of Plant Biology, Rutgers University, New Brunswick, NJ, 08901, USA.
| | - Haidong Yan
- Department of Genetics, University of Georgia, Athens, GA, USA
- College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Min Sun
- College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Yarong Jin
- College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, China
| |
Collapse
|
12
|
Xu L, Wang Y, Dong J, Zhang W, Tang M, Zhang W, Wang K, Chen Y, Zhang X, He Q, Zhang X, Wang K, Wang L, Ma Y, Xia K, Liu L. A chromosome-level genome assembly of radish (Raphanus sativus L.) reveals insights into genome adaptation and differential bolting regulation. Plant Biotechnol J 2023; 21:990-1004. [PMID: 36648398 PMCID: PMC10106849 DOI: 10.1111/pbi.14011] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 11/29/2022] [Accepted: 01/03/2023] [Indexed: 05/04/2023]
Abstract
High-quality radish (Raphanus sativus) genome represents a valuable resource for agronomical trait improvements and understanding genome evolution among Brassicaceae species. However, existing radish genome assembly remains fragmentary, which greatly hampered functional genomics research and genome-assisted breeding. Here, using a NAU-LB radish inbred line, we generated a reference genome of 476.32 Mb with a scaffold N50 of 56.88 Mb by incorporating Illumina, PacBio and BioNano optical mapping techniques. Utilizing Hi-C data, 448.12 Mb (94.08%) of the assembled sequences were anchored to nine radish chromosomes with 40 306 protein-coding genes annotated. In total, 249.14 Mb (52.31%) comprised the repetitive sequences, among which long terminal repeats (LTRs, 30.31%) were the most abundant class. Beyond confirming the whole-genome triplication (WGT) event in R. sativus lineage, we found several tandem arrayed genes were involved in stress response process, which may account for the distinctive phenotype of high disease resistance in R. sativus. By comparing against the existing Xin-li-mei radish genome, a total of 2 108 573 SNPs, 7740 large insertions, 7757 deletions and 84 inversions were identified. Interestingly, a 647-bp insertion in the promoter of RsVRN1 gene can be directly bound by the DOF transcription repressor RsCDF3, resulting into its low promoter activity and late-bolting phenotype of NAU-LB cultivar. Importantly, introgression of this 647-bp insertion allele, RsVRN1In-536 , into early-bolting genotype could contribute to delayed bolting time, indicating that it is a potential genetic resource for radish late-bolting breeding. Together, this genome resource provides valuable information to facilitate comparative genomic analysis and accelerate genome-guided breeding and improvement in radish.
Collapse
Affiliation(s)
- Liang Xu
- National Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Horticultural Crop Biology and Genetic Improvement (East China) of MOAR, College of HorticultureNanjing Agricultural UniversityNanjingChina
| | - Yan Wang
- National Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Horticultural Crop Biology and Genetic Improvement (East China) of MOAR, College of HorticultureNanjing Agricultural UniversityNanjingChina
| | - Junhui Dong
- National Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Horticultural Crop Biology and Genetic Improvement (East China) of MOAR, College of HorticultureNanjing Agricultural UniversityNanjingChina
| | - Wei Zhang
- National Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Horticultural Crop Biology and Genetic Improvement (East China) of MOAR, College of HorticultureNanjing Agricultural UniversityNanjingChina
- College of Horticulture and Landscape ArchitectureYangzhou UniversityYangzhouChina
| | - Mingjia Tang
- National Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Horticultural Crop Biology and Genetic Improvement (East China) of MOAR, College of HorticultureNanjing Agricultural UniversityNanjingChina
| | - Weilan Zhang
- National Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Horticultural Crop Biology and Genetic Improvement (East China) of MOAR, College of HorticultureNanjing Agricultural UniversityNanjingChina
| | - Kai Wang
- School of Life SciencesNantong UniversityNantongChina
| | - Yinglong Chen
- The UWA Institute of Agriculture, and School of Agriculture and EnvironmentThe University of Western AustraliaPerthWAAustralia
| | - Xiaoli Zhang
- National Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Horticultural Crop Biology and Genetic Improvement (East China) of MOAR, College of HorticultureNanjing Agricultural UniversityNanjingChina
| | - Qing He
- National Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Horticultural Crop Biology and Genetic Improvement (East China) of MOAR, College of HorticultureNanjing Agricultural UniversityNanjingChina
| | - Xinyu Zhang
- National Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Horticultural Crop Biology and Genetic Improvement (East China) of MOAR, College of HorticultureNanjing Agricultural UniversityNanjingChina
| | - Kai Wang
- National Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Horticultural Crop Biology and Genetic Improvement (East China) of MOAR, College of HorticultureNanjing Agricultural UniversityNanjingChina
| | - Lun Wang
- College of Horticulture and Landscape ArchitectureYangzhou UniversityYangzhouChina
| | - Yinbo Ma
- College of Horticulture and Landscape ArchitectureYangzhou UniversityYangzhouChina
| | - Kai Xia
- National Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Horticultural Crop Biology and Genetic Improvement (East China) of MOAR, College of HorticultureNanjing Agricultural UniversityNanjingChina
| | - Liwang Liu
- National Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Horticultural Crop Biology and Genetic Improvement (East China) of MOAR, College of HorticultureNanjing Agricultural UniversityNanjingChina
- College of Horticulture and Landscape ArchitectureYangzhou UniversityYangzhouChina
| |
Collapse
|
13
|
Yildiz G, Zanini SF, Afsharyan NP, Obermeier C, Snowdon RJ, Golicz AA. Benchmarking Oxford Nanopore read alignment-based insertion and deletion detection in crop plant genomes. Plant Genome 2023:e20314. [PMID: 36988043 DOI: 10.1002/tpg2.20314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Accepted: 01/15/2023] [Indexed: 06/19/2023]
Abstract
Structural variations (SVs) are larger polymorphisms (> 50 bp in length), which consist of insertions, deletions, inversions, duplications, and translocations. They can have a strong impact on agronomical traits and play an important role in environmental adaptation. The development of long-read sequencing technologies, including Oxford Nanopore, allows for comprehensive SV discovery and characterization even in complex polyploid crop genomes. However, many of the SV discovery pipeline benchmarks do not include complex plant genome datasets. In this study, we benchmarked insertion and deletion detection by popular long-read alignment-based SV detection tools for crop plant genomes. We used real and simulated Oxford Nanopore reads for two crops, allotetraploid Brassica napus (oilseed rape) and diploid Solanum lycopersicum (tomato), and evaluated several read aligners and SV callers across 5×, 10×, and 20× coverages typically used in re-sequencing studies. We further validated our findings using maize and soybean datasets. Our benchmarks provide a useful guide for designing Oxford Nanopore re-sequencing projects and SV discovery pipelines for crop plants.
Collapse
Affiliation(s)
- Gözde Yildiz
- Department of Plant Breeding, Justus Liebig University Giessen, Giessen, Germany
| | - Silvia F Zanini
- Department of Plant Breeding, Justus Liebig University Giessen, Giessen, Germany
| | - Nazanin P Afsharyan
- Department of Plant Breeding, Justus Liebig University Giessen, Giessen, Germany
| | - Christian Obermeier
- Department of Plant Breeding, Justus Liebig University Giessen, Giessen, Germany
| | - Rod J Snowdon
- Department of Plant Breeding, Justus Liebig University Giessen, Giessen, Germany
| | - Agnieszka A Golicz
- Department of Plant Breeding, Justus Liebig University Giessen, Giessen, Germany
| |
Collapse
|
14
|
Zhou Y, Yu Z, Chebotarov D, Chougule K, Lu Z, Rivera LF, Kathiresan N, Al-Bader N, Mohammed N, Alsantely A, Mussurova S, Santos J, Thimma M, Troukhan M, Fornasiero A, Green CD, Copetti D, Kudrna D, Llaca V, Lorieux M, Zuccolo A, Ware D, McNally K, Zhang J, Wing RA. Pan-genome inversion index reveals evolutionary insights into the subpopulation structure of Asian rice. Nat Commun 2023; 14:1567. [PMID: 36944612 PMCID: PMC10030860 DOI: 10.1038/s41467-023-37004-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Accepted: 02/27/2023] [Indexed: 03/23/2023] Open
Abstract
Understanding and exploiting genetic diversity is a key factor for the productive and stable production of rice. Here, we utilize 73 high-quality genomes that encompass the subpopulation structure of Asian rice (Oryza sativa), plus the genomes of two wild relatives (O. rufipogon and O. punctata), to build a pan-genome inversion index of 1769 non-redundant inversions that span an average of ~29% of the O. sativa cv. Nipponbare reference genome sequence. Using this index, we estimate an inversion rate of ~700 inversions per million years in Asian rice, which is 16 to 50 times higher than previously estimated for plants. Detailed analyses of these inversions show evidence of their effects on gene expression, recombination rate, and linkage disequilibrium. Our study uncovers the prevalence and scale of large inversions (≥100 bp) across the pan-genome of Asian rice and hints at their largely unexplored role in functional biology and crop performance.
Collapse
Affiliation(s)
- Yong Zhou
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
- Arizona Genomics Institute (AGI), School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Zhichao Yu
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
| | - Dmytro Chebotarov
- International Rice Research Institute (IRRI), Los Baños, 4031, Laguna, Philippines
| | - Kapeel Chougule
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Zhenyuan Lu
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Luis F Rivera
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Nagarajan Kathiresan
- Supercomputing Core Lab, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Noor Al-Bader
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Nahed Mohammed
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Aseel Alsantely
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Saule Mussurova
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - João Santos
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Manjula Thimma
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | | | - Alice Fornasiero
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Carl D Green
- Information Technology Department, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Dario Copetti
- Arizona Genomics Institute (AGI), School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - David Kudrna
- Arizona Genomics Institute (AGI), School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Victor Llaca
- Research and Development, Corteva Agriscience, Johnston, IA, 50131, USA
| | - Mathias Lorieux
- DIADE, University of Montpellier, CIRAD, IRD, Montpellier, France
| | - Andrea Zuccolo
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.
- Crop Science Research Center (CSRC), Scuola Superiore Sant'Anna, Pisa, 56127, Italy.
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA.
- USDA ARS NEA Plant, Soil & Nutrition Laboratory Research Unit, Ithaca, NY, 14853, USA.
| | - Kenneth McNally
- International Rice Research Institute (IRRI), Los Baños, 4031, Laguna, Philippines.
| | - Jianwei Zhang
- Arizona Genomics Institute (AGI), School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA.
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China.
| | - Rod A Wing
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.
- Arizona Genomics Institute (AGI), School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA.
- International Rice Research Institute (IRRI), Los Baños, 4031, Laguna, Philippines.
| |
Collapse
|
15
|
Yan C, Song MH, Jiang D, Ren JL, Lv Y, Chang J, Huang S, Zaher H, Li JT. Genomic evidence reveals intraspecific divergence of the hot-spring snake (Thermophis baileyi), an endangered reptile endemic to the Qinghai-Tibet plateau. Mol Ecol 2023; 32:1335-1350. [PMID: 36073004 DOI: 10.1111/mec.16687] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 09/04/2022] [Accepted: 09/06/2022] [Indexed: 11/27/2022]
Abstract
Understanding how and why species evolve requires knowledge on intraspecific divergence. In this study, we examined intraspecific divergence in the endangered hot-spring snake (Thermophis baileyi), an endemic species on the Qinghai-Tibet Plateau (QTP). Whole-genome resequencing of 58 sampled individuals from 15 populations was performed to identify the drivers of intraspecific divergence and explore the potential roles of genes under selection. Our analyses resolved three groups, with major intergroup admixture occurring in regions of group contact. Divergence probably occurred during the Pleistocene as a result of glacial climatic oscillations, Yadong-Gulu rift, and geothermal fields differentiation, while complex gene flow between group pairs reflected a unique intraspecific divergence pattern on the QTP. Intergroup fixed loci involved selected genes functionally related to divergence and local adaptation, especially adaptation to hot spring microenvironments in different geothermal fields. Analysis of structural variants, genetic diversity, inbreeding, and genetic load indicated that the hot-spring snake population has declined to a low level with decreased diversity, which is important for the conservation management of this endangered species. Our study demonstrated that the integration of demographic history, gene flow, genomic divergence genes, and other information is necessary to distinguish the evolutionary processes involved in speciation.
Collapse
Affiliation(s)
- Chaochao Yan
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, China
| | - Meng-Huan Song
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Dechun Jiang
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, China
| | - Jin-Long Ren
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Yunyun Lv
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, China
| | - Jiang Chang
- State Key Laboratory of Environmental Criteria and Risk Assessment, Chinese Research Academy of Environmental Sciences, Beijing, China
| | - Song Huang
- College of Life Sciences, Anhui Normal University, Wuhu, China
| | - Hussam Zaher
- Museu de Zoologia, Universidade de São Paulo, São Paulo, São Paulo, Brazil
| | - Jia-Tang Li
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, China.,University of Chinese Academy of Sciences, Beijing, China.,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China.,Mangkang Biodiversity and Ecological Station, Tibet Ecological Safety Monitor Network, Changdu, China
| |
Collapse
|
16
|
Weisweiler M, Stich B. Benchmarking of structural variant detection in the tetraploid potato genome using linked-read sequencing. Genomics 2023; 115:110568. [PMID: 36702293 DOI: 10.1016/j.ygeno.2023.110568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 01/12/2023] [Accepted: 01/18/2023] [Indexed: 01/25/2023]
Abstract
It has recently been shown that structural variants (SV) can have a higher impact on gene expression variation compared to single nucleotide variants (SNV) in different plant species. Additionally, SV were associated with phenotypic variation in several crops. However, compared to the established SV detection based on short-read sequencing, less approaches were described for linked-read based SV calling. We therefore evaluated the performance of six linked-read SV callers compared to an established short-read SV caller based on simulated linked-reads in tetraploid potato. The objectives of our study were to i) compare the performance of SV callers based on linked-read sequencing to short-read sequencing, ii) examine the influence of SV type, SV length, haplotype incidence (HI), as well as sequencing coverage on the SV calling performance in the tetraploid potato genome, and iii) evaluate the accuracy of detecting insertions by linked-read compared to short-read sequencing. We observed high break point resolutions (BPR) detecting short SV and slightly lower BPR for large SV. Our observations highlighted the importance of short-read signals provided by Manta and LinkedSV to detect short SV. Manta and NAIBR performed well for detecting larger deletions, inversions, and duplications. Detected large SV were weakly influenced by the HI. Furthermore, we illustrated that large insertions can be assembled by Novel-X. Our results suggest the usage of the short-read and linked-read SV callers Manta, NAIBR, LinkedSV, and Novel-X based on at least 90x linked-read sequencing coverage to ensure the detection of a broad range of SV in the tetraploid potato genome.
Collapse
Affiliation(s)
- Marius Weisweiler
- Institute for Quantitative Genetics and Genomics of Plants, Universitätsstraße 1, 40225 Düsseldorf, Germany
| | - Benjamin Stich
- Institute for Quantitative Genetics and Genomics of Plants, Universitätsstraße 1, 40225 Düsseldorf, Germany; Cluster of Excellence on Plant Sciences, From Complex Traits towards Synthetic Modules, Universitätsstraße 1, 40225 Düsseldorf, Germany; Max Planck Institute for Plant Breeding Research, Carl-von-Linne-Weg 10, 50829 Köln, Germany.
| |
Collapse
|
17
|
Yan H, Sun M, Zhang Z, Jin Y, Zhang A, Lin C, Wu B, He M, Xu B, Wang J, Qin P, Mendieta JP, Nie G, Wang J, Jones CS, Feng G, Srivastava RK, Zhang X, Bombarely A, Luo D, Jin L, Peng Y, Wang X, Ji Y, Tian S, Huang L. Pangenomic analysis identifies structural variation associated with heat tolerance in pearl millet. Nat Genet 2023; 55:507-518. [PMID: 36864101 PMCID: PMC10011142 DOI: 10.1038/s41588-023-01302-4] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Accepted: 01/18/2023] [Indexed: 03/04/2023]
Abstract
Pearl millet is an important cereal crop worldwide and shows superior heat tolerance. Here, we developed a graph-based pan-genome by assembling ten chromosomal genomes with one existing assembly adapted to different climates worldwide and captured 424,085 genomic structural variations (SVs). Comparative genomics and transcriptomics analyses revealed the expansion of the RWP-RK transcription factor family and the involvement of endoplasmic reticulum (ER)-related genes in heat tolerance. The overexpression of one RWP-RK gene led to enhanced plant heat tolerance and transactivated ER-related genes quickly, supporting the important roles of RWP-RK transcription factors and ER system in heat tolerance. Furthermore, we found that some SVs affected the gene expression associated with heat tolerance and SVs surrounding ER-related genes shaped adaptation to heat tolerance during domestication in the population. Our study provides a comprehensive genomic resource revealing insights into heat tolerance and laying a foundation for generating more robust crops under the changing climate.
Collapse
Affiliation(s)
- Haidong Yan
- College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, China
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, VA, USA
- Department of Genetics, University of Georgia, Athens, GA, USA
| | - Min Sun
- College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, China
| | | | - Yarong Jin
- College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Ailing Zhang
- College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Chuang Lin
- College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Bingchao Wu
- College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Min He
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Sichuan Agricultural University, Chengdu, China
| | - Bin Xu
- College of Grassland Science, Nanjing Agricultural University, Nanjing, China
| | - Jing Wang
- Key Laboratory of Bio-Source and Environmental Conservation, School of Life Science, Sichuan University, Chengdu, China
| | - Peng Qin
- Rice Research Institute, Sichuan Agricultural University, Chengdu, China
| | | | - Gang Nie
- College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Jianping Wang
- Agronomy Department, University of Florida, Gainesville, FL, USA
| | - Chris S Jones
- Feed and Forage Development, International Livestock Research Institute, Nairobi, Kenya
| | - Guangyan Feng
- College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Rakesh K Srivastava
- International Crops Research Institute for the Semi-Arid Tropics, Hyderabad, India
| | - Xinquan Zhang
- College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Aureliano Bombarely
- Instituto de Biologia Molecular y Celular de Plantas, UPV-CSIC, Valencia, Spain
| | - Dan Luo
- College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Long Jin
- College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Yuanying Peng
- Triticeae Research Institute, Sichuan Agricultural University, Chengdu, China
| | - Xiaoshan Wang
- College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Yang Ji
- Sichuan Animal Science Academy, Chengdu, China
| | - Shilin Tian
- Novogene Bioinformatics Institute, Beijing, China.
- Department of Ecology, Hubei Key Laboratory of Cell Homeostasis, College of Life Sciences, Wuhan University, Wuhan, China.
| | - Linkai Huang
- College of Grassland Science and Technology, Sichuan Agricultural University, Chengdu, China.
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Sichuan Agricultural University, Chengdu, China.
| |
Collapse
|
18
|
Zheng X, Zhong L, Pang H, Wen S, Li F, Lou D, Ge J, Fan W, Wang T, Han Z, Qiao W, Pan X, Zhu Y, Wang J, Tang C, Wang X, Zhang J, Xu Z, Kim SR, Kohli A, Ye G, Olsen KM, Fang W, Yang Q. Lost genome segments associate with trait diversity during rice domestication. BMC Biol 2023; 21:20. [PMID: 36726089 PMCID: PMC9893545 DOI: 10.1186/s12915-023-01512-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2022] [Accepted: 01/10/2023] [Indexed: 02/03/2023] Open
Abstract
BACKGROUND DNA mutations of diverse types provide the raw material required for phenotypic variation and evolution. In the case of crop species, previous research aimed to elucidate the changing patterns of repetitive sequences, single-nucleotide polymorphisms (SNPs), and small InDels during domestication to explain morphological evolution and adaptation to different environments. Additionally, structural variations (SVs) encompassing larger stretches of DNA are more likely to alter gene expression levels leading to phenotypic variation affecting plant phenotypes and stress resistance. Previous studies on SVs in rice were hampered by reliance on short-read sequencing limiting the quantity and quality of SV identification, while SV data are currently only available for cultivated rice, with wild rice largely uncharacterized. Here, we generated two genome assemblies for O. rufipogon using long-read sequencing and provide insights on the evolutionary pattern and effect of SVs on morphological traits during rice domestication. RESULTS In this study, we identified 318,589 SVs in cultivated and wild rice populations through a comprehensive analysis of 13 high-quality rice genomes and found that wild rice genomes contain 49% of unique SVs and an average of 1.76% of genes were lost during rice domestication. These SVs were further genotyped for 649 rice accessions, their evolutionary pattern during rice domestication and potential association with the diversity of important agronomic traits were examined. Genome-wide association studies between these SVs and nine agronomic traits identified 413 candidate causal variants, which together affect 361 genes. An 824-bp deletion in japonica rice, which encodes a serine carboxypeptidase family protein, is shown to be associated with grain length. CONCLUSIONS We provide relatively accurate and complete SV datasets for cultivated and wild rice accessions, especially in TE-rich regions, by comparing long-read sequencing data for 13 representative varieties. The integrated rice SV map and the identified candidate genes and variants represent valuable resources for future genomic research and breeding in rice.
Collapse
Affiliation(s)
- Xiaoming Zheng
- grid.410727.70000 0001 0526 1937National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China ,grid.419387.00000 0001 0729 330XInternational Rice Research Institute, DAPO box 7777, Metro Manila, the Philippines ,grid.410727.70000 0001 0526 1937Sanya National Research Institute of Breeding in Hainan, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Limei Zhong
- grid.260463.50000 0001 2182 8825College of life science, Nanchang University, Nanchang, China
| | - Hongbo Pang
- grid.263484.f0000 0004 1759 8467College of Life Science, Shenyang Normal University, Shenyang, China
| | - Siyu Wen
- grid.410727.70000 0001 0526 1937Sanya National Research Institute of Breeding in Hainan, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Fei Li
- grid.410727.70000 0001 0526 1937National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China
| | - Danjing Lou
- grid.410727.70000 0001 0526 1937National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China
| | - Jinyue Ge
- grid.410727.70000 0001 0526 1937National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China
| | - Weiya Fan
- grid.410727.70000 0001 0526 1937National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China
| | - Tianyi Wang
- Smartgenomics Technology Institute, Tianjin, China
| | - Zhenyun Han
- grid.410727.70000 0001 0526 1937National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China
| | - Weihua Qiao
- grid.410727.70000 0001 0526 1937National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China
| | - Xiaowu Pan
- grid.410598.10000 0004 4911 9766Rice Research Institute, Hunan Academy of Agricultural Sciences, Changsha, China
| | - Yebao Zhu
- grid.418033.d0000 0001 2229 4212Rice Research Institute, Fujian Academy of Agricultural Sciences, Fuzhou, China
| | - Jilin Wang
- grid.464380.d0000 0000 9885 0994Rice Research Institute, Jiangxi Academy of Agricultural Sciences, Nanchang, China
| | - Cuifeng Tang
- grid.410732.30000 0004 1799 1111Biotechnology and Germplasm Resources Institute, Yunnan Academy of Agricultural Sciences, Kunming, China
| | - Xinhua Wang
- grid.464347.6Institute of Food Crops, Hainan Academy of Agricultural Sciences, Haikou, China
| | - Jing Zhang
- grid.135769.f0000 0001 0561 6611Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, China ,grid.484195.5Guangdong Provincial Key Laboratory of New Technology in Rice Breeding, Guangzhou, China
| | - Zhijian Xu
- grid.452720.60000 0004 0415 7259Rice Research Institute, Guangxi Academy of Agricultural Sciences, Nanning, China
| | - Sung Ryul Kim
- grid.419387.00000 0001 0729 330XInternational Rice Research Institute, DAPO box 7777, Metro Manila, the Philippines
| | - Ajay Kohli
- grid.419387.00000 0001 0729 330XInternational Rice Research Institute, DAPO box 7777, Metro Manila, the Philippines
| | - Guoyou Ye
- grid.419387.00000 0001 0729 330XInternational Rice Research Institute, DAPO box 7777, Metro Manila, the Philippines ,grid.289247.20000 0001 2171 7818Crop Biotech Institute & Department of Genetic Engineering, Kyung Hee University, Yongin, 446-701 Republic of Korea
| | - Kenneth M. Olsen
- grid.4367.60000 0001 2355 7002Biology Department, Washington University, Campus Box 1137, St. Louis, MO 63130 USA
| | - Wei Fang
- grid.410727.70000 0001 0526 1937National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China
| | - Qingwen Yang
- grid.410727.70000 0001 0526 1937National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China
| |
Collapse
|
19
|
Wang X, Li Z, Feng T, Luo X, Xue L, Mao C, Cui K, Li H, Huang J, Huang K, Rehman SU, Shi D, Wu D, Ruan J, Liu Q. Chromosome-level genome and recombination map of the male buffalo. Gigascience 2022; 12:giad063. [PMID: 37589307 PMCID: PMC10433102 DOI: 10.1093/gigascience/giad063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Revised: 03/20/2023] [Accepted: 07/13/2023] [Indexed: 08/18/2023] Open
Abstract
BACKGROUND The swamp buffalo (Bubalus bubalis carabanesis) is an economically important livestock supplying milk, meat, leather, and draft power. Several female buffalo genomes have been available, but the lack of high-quality male genomes hinders studies on chromosome evolution, especially Y, as well as meiotic recombination. RESULTS Here, a chromosome-level genome with a contig N50 of 72.2 Mb and a fine-scale recombination map of male buffalo were reported. We found that transposable elements (TEs) and structural variants (SVs) may contribute to buffalo evolution by influencing adjacent gene expression. We further found that the pseudoautosomal region (PAR) of the Y chromosome is subject to stronger purification selection. The meiotic recombination map showed that there were 2 obvious recombination hotspots on chromosome 8, and the genes around them were mainly related to tooth development, which may have helped to enhance the adaption of buffalo to inferior feed. Among several genomic features, TE density has the strongest correlation with recombination rates. Moreover, the TE subfamily, SINE/tRNA, is likely to play a role in driving recombination into SVs. CONCLUSIONS The male genome and sperm sequencing will facilitate the understanding of the buffalo genomic evolution and functional research.
Collapse
Affiliation(s)
- Xiaobo Wang
- Guangdong Provincial Key Laboratory of Animal Molecular Design and Precise Breeding, School of Life Science and Engineering, Foshan University, Foshan 528225, China
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Zhipeng Li
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Tong Feng
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Xier Luo
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Lintao Xue
- Reproductive Medical and Genetic Center, The People's Hospital of Guangxi Zhuang Autonomous Region, Nanning, Guangxi 530021, China
| | - Chonghui Mao
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Kuiqing Cui
- Guangdong Provincial Key Laboratory of Animal Molecular Design and Precise Breeding, School of Life Science and Engineering, Foshan University, Foshan 528225, China
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Hui Li
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Jieping Huang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Kongwei Huang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Saif-ur Rehman
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Deshun Shi
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | - Dongdong Wu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
| | - Jue Ruan
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Qingyou Liu
- Guangdong Provincial Key Laboratory of Animal Molecular Design and Precise Breeding, School of Life Science and Engineering, Foshan University, Foshan 528225, China
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| |
Collapse
|
20
|
Jankowicz-Cieslak J, Hofinger BJ, Jarc L, Junttila S, Galik B, Gyenesei A, Ingelbrecht IL, Till BJ. Spectrum and Density of Gamma and X-ray Induced Mutations in a Non-Model Rice Cultivar. Plants (Basel) 2022; 11:3232. [PMID: 36501272 PMCID: PMC9741009 DOI: 10.3390/plants11233232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 11/19/2022] [Accepted: 11/20/2022] [Indexed: 06/17/2023]
Abstract
Physical mutagens are a powerful tool used for genetic research and breeding for over eight decades. Yet, when compared to chemical mutagens, data sets on the effect of different mutagens and dosages on the spectrum and density of induced mutations remain lacking. To address this, we investigated the landscape of mutations induced by gamma and X-ray radiation in the most widely cultivated crop species: rice. A mutant population of a tropical upland rice, Oryza sativa L., was generated and propagated via self-fertilization for seven generations. Five dosages ranging from 75 Gy to 600 Gy in both X-ray and gamma-irradiated material were applied. In the process of a forward genetic screens, 11 unique rice mutant lines showing phenotypic variation were selected for mutation analysis via whole-genome sequencing. Thousands of candidate mutations were recovered in each mutant with single base substitutions being the most common, followed by small indels and structural variants. Higher dosages resulted in a higher accumulation of mutations in gamma-irradiated material, but not in X-ray-treated plants. The in vivo role of all annotated rice genes is yet to be directly investigated. The ability to induce a high density of single nucleotide and structural variants through mutagenesis will likely remain an important approach for functional genomics and breeding.
Collapse
Affiliation(s)
- Joanna Jankowicz-Cieslak
- Plant Breeding and Genetics Laboratory, FAO/IAEA Joint Division, International Atomic Energy Agency (IAEA), 2444 Seibersdorf, Austria
| | - Bernhard J. Hofinger
- Plant Breeding and Genetics Laboratory, FAO/IAEA Joint Division, International Atomic Energy Agency (IAEA), 2444 Seibersdorf, Austria
| | - Luka Jarc
- Plant Breeding and Genetics Laboratory, FAO/IAEA Joint Division, International Atomic Energy Agency (IAEA), 2444 Seibersdorf, Austria
| | - Sini Junttila
- Bioinformatics and Scientific Computing Core, Vienna Biocenter Core Facilities GmbH, Dr-Bohr-Gasse 3, 1030 Vienna, Austria
- Medical Bioinformatics Centre, Turku Bioscience Centre, University of Turku, Tykistökatu 6, 20520 Turku, Finland
- Medical Bioinformatics Centre, Turku Bioscience Centre, Åbo Akademi University, Tykistökatu 6, 20520 Turku, Finland
| | - Bence Galik
- Bioinformatics and Scientific Computing Core, Vienna Biocenter Core Facilities GmbH, Dr-Bohr-Gasse 3, 1030 Vienna, Austria
- Department of Clinical Molecular Biology, Medical University of Bialystok, 15-269 Bialystok, Poland
- Bioinformatics Research Group, Genomics and Bioinformatics Core Facility Szentágothai Research Centre, University of Pécs, H-7622 Pecs, Hungary
| | - Attila Gyenesei
- Bioinformatics and Scientific Computing Core, Vienna Biocenter Core Facilities GmbH, Dr-Bohr-Gasse 3, 1030 Vienna, Austria
- Bioinformatics Research Group, Genomics and Bioinformatics Core Facility Szentágothai Research Centre, University of Pécs, H-7622 Pecs, Hungary
| | - Ivan L. Ingelbrecht
- Plant Breeding and Genetics Laboratory, FAO/IAEA Joint Division, International Atomic Energy Agency (IAEA), 2444 Seibersdorf, Austria
| | - Bradley J. Till
- Plant Breeding and Genetics Laboratory, FAO/IAEA Joint Division, International Atomic Energy Agency (IAEA), 2444 Seibersdorf, Austria
- Veterinary Genetics Laboratory, University of California, Old Davis Road, Davis, CA 95616, USA
| |
Collapse
|
21
|
Chen Y, Miao Y, Bai W, Lin K, Pang E. Characteristics and potential functional effects of long insertions in Asian butternuts. BMC Genomics 2022; 23:732. [PMID: 36307757 PMCID: PMC9617325 DOI: 10.1186/s12864-022-08961-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Accepted: 10/17/2022] [Indexed: 11/10/2022] Open
Abstract
Abstract
Background
Structural variants (SVs) play important roles in adaptation evolution and species diversification. Especially, in plants, many phenotypes of response to the environment were found to be associated with SVs. Despite the prevalence and significance of SVs, long insertions remain poorly detected and studied in all but model species.
Results
We used whole-genome resequencing of paired reads from 80 Asian butternuts to detect long insertions and further analyse their characteristics and potential functional effects. By combining of mapping-based and de novo assembly-based methods, we obtained a multiple related species pangenome representing higher taxonomic groups. We obtained 89,312 distinct contigs totaling 147,773,999 base pair (bp) of new sequences, of which 347 were putative long insertions placed in the reference genome. Most of the putative long insertions appeared in multiple species; in contrast, only 62 putative long insertions appeared in one species, which may be involved in the response to the environment. 65 putative long insertions fell into 61 distinct protein-coding genes involved in plant development, and 105 putative long insertions fell into upstream of 106 distinct protein-coding genes involved in cellular respiration. 3,367 genes were annotated in 2,606 contigs. We propose PLAINS (https://github.com/CMB-BNU/PLAINS.git), a streamlined, comprehensive pipeline for the prediction and analysis of long insertions using whole-genome resequencing.
Conclusions
Our study lays down an important foundation for further whole-genome long insertion studies, allowing the investigation of their effects by experiments.
Collapse
|
22
|
Wu X, Jiang W, Fragoso C, Huang J, Zhou G, Zhao H, Dellaporta S. Prioritized candidate causal haplotype blocks in plant genome-wide association studies. PLoS Genet 2022; 18:e1010437. [PMID: 36251695 DOI: 10.1371/journal.pgen.1010437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Revised: 10/27/2022] [Accepted: 09/20/2022] [Indexed: 11/05/2022] Open
Abstract
Genome wide association studies (GWAS) can play an essential role in understanding genetic basis of complex traits in plants and animals. Conventional SNP-based linear mixed models (LMM) that marginally test single nucleotide polymorphisms (SNPs) have successfully identified many loci with major and minor effects in many GWAS. In plant, the relatively small population size in GWAS and the high genetic diversity found in many plant species can impede mapping efforts on complex traits. Here we present a novel haplotype-based trait fine-mapping framework, HapFM, to supplement current GWAS methods. HapFM uses genotype data to partition the genome into haplotype blocks, identifies haplotype clusters within each block, and then performs genome-wide haplotype fine-mapping to prioritize the candidate causal haplotype blocks of trait. We benchmarked HapFM, GEMMA, BSLMM, GMMAT, and BLINK in both simulated and real plant GWAS datasets. HapFM consistently resulted in higher mapping power than the other GWAS methods in high polygenicity simulation setting. Moreover, it resulted in smaller mapping intervals, especially in regions of high LD, achieved by prioritizing small candidate causal blocks in the larger haplotype blocks. In the Arabidopsis flowering time (FT10) datasets, HapFM identified four novel loci compared to GEMMA’s results, and the average mapping interval of HapFM was 9.6 times smaller than that of GEMMA. In conclusion, HapFM is tailored for plant GWAS to result in high mapping power on complex traits and improved on mapping resolution to facilitate crop improvement. Genome-wide association studies (GWAS) are commonly used in human and plant studies to identify genetic variants responsible for the phenotype of interest and provide foundations for studying disease mechanisms and crop improvement. Most GWAS models are developed and optimized using human datasets. However, the difference between human and plant datasets essentially limits their applications in plant studies, especially when mapping complex traits such as drought resistance and yield. In this study, we present a novel GWAS method, HapFM, tailored for plant datasets to overcome the difficulties of many conventional GWAS methods. HapFM resulted in higher statistical power than conventional GWAS methods for mapping complex traits in our simulation and real dataset analyses. In addition, HapFM reduced the mapping interval by prioritizing candidate causal regions in the genome, which benefits the downstream experimental studies. Last but not least, HapFM can incorporate biological annotations to increase statistical power further. Overall, HapFM balances statistical power, result interpretability, and downstream experimental verifiability.
Collapse
|
23
|
Weisweiler M, Arlt C, Wu PY, Van Inghelandt D, Hartwig T, Stich B. Structural variants in the barley gene pool: precision and sensitivity to detect them using short-read sequencing and their association with gene expression and phenotypic variation. Theor Appl Genet 2022; 135:3511-3529. [PMID: 36029318 PMCID: PMC9519679 DOI: 10.1007/s00122-022-04197-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Accepted: 08/03/2022] [Indexed: 06/15/2023]
Abstract
Structural variants (SV) of 23 barley inbreds, detected by the best combination of SV callers based on short-read sequencing, were associated with genome-wide and gene-specific gene expression and, thus, were evaluated to predict agronomic traits. In human genetics, several studies have shown that phenotypic variation is more likely to be caused by structural variants (SV) than by single nucleotide variants. However, accurate while cost-efficient discovery of SV in complex genomes remains challenging. The objectives of our study were to (i) facilitate SV discovery studies by benchmarking SV callers and their combinations with respect to their sensitivity and precision to detect SV in the barley genome, (ii) characterize the occurrence and distribution of SV clusters in the genomes of 23 barley inbreds that are the parents of a unique resource for mapping quantitative traits, the double round robin population, (iii) quantify the association of SV clusters with transcript abundance, and (iv) evaluate the use of SV clusters for the prediction of phenotypic traits. In our computer simulations based on a sequencing coverage of 25x, a sensitivity > 70% and precision > 95% was observed for all combinations of SV types and SV length categories if the best combination of SV callers was used. We observed a significant (P < 0.05) association of gene-associated SV clusters with global gene-specific gene expression. Furthermore, about 9% of all SV clusters that were within 5 kb of a gene were significantly (P < 0.05) associated with the gene expression of the corresponding gene. The prediction ability of SV clusters was higher compared to that of single-nucleotide polymorphisms from an array across the seven studied phenotypic traits. These findings suggest the usefulness of exploiting SV information when fine mapping and cloning the causal genes underlying quantitative traits as well as the high potential of using SV clusters for the prediction of phenotypes in diverse germplasm sets.
Collapse
Affiliation(s)
- Marius Weisweiler
- Institute for Quantitative Genetics and Genomics of Plants, Universitätsstraße 1, 40225, Düsseldorf, Germany
| | - Christopher Arlt
- Institute for Quantitative Genetics and Genomics of Plants, Universitätsstraße 1, 40225, Düsseldorf, Germany
| | - Po-Ya Wu
- Institute for Quantitative Genetics and Genomics of Plants, Universitätsstraße 1, 40225, Düsseldorf, Germany
| | - Delphine Van Inghelandt
- Institute for Quantitative Genetics and Genomics of Plants, Universitätsstraße 1, 40225, Düsseldorf, Germany
| | - Thomas Hartwig
- Institute for Molecular Physiology, Universitätsstraße 1, 40225, Düsseldorf, Germany
| | - Benjamin Stich
- Institute for Quantitative Genetics and Genomics of Plants, Universitätsstraße 1, 40225, Düsseldorf, Germany.
- Cluster of Excellence on Plant Sciences, From Complex Traits towards Synthetic Modules, Universitätsstraße 1, 40225, Düsseldorf, Germany.
| |
Collapse
|
24
|
Vourlaki IT, Castanera R, Ramos-Onsins SE, Casacuberta JM, Pérez-Enciso M. Transposable element polymorphisms improve prediction of complex agronomic traits in rice. Theor Appl Genet 2022; 135:3211-3222. [PMID: 35931838 PMCID: PMC9482605 DOI: 10.1007/s00122-022-04180-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Accepted: 07/12/2022] [Indexed: 06/15/2023]
Abstract
Transposon insertion polymorphisms can improve prediction of complex agronomic traits in rice compared to using SNPs only, especially when accessions to be predicted are less related to the training set. Transposon insertion polymorphisms (TIPs) are significant sources of genetic variation. Previous work has shown that TIPs can improve detection of causative loci on agronomic traits in rice. Here, we quantify the fraction of variance explained by single nucleotide polymorphisms (SNPs) compared to TIPs, and we explore whether TIPs can improve prediction of traits when compared to using only SNPs. We used eleven traits of agronomic relevance from by five different rice population groups (Aus, Indica, Aromatic, Japonica, and Admixed), 738 accessions in total. We assess prediction by applying data split validation in two scenarios. In the within-population scenario, we predicted performance of improved Indica varieties using the rest of Indica accessions. In the across population scenario, we predicted all Aromatic and Admixed accessions using the rest of populations. In each scenario, Bayes C and a Bayesian reproducible kernel Hilbert space regression were compared. We find that TIPs can explain an important fraction of total genetic variance and that they also improve genomic prediction. In the across population prediction scenario, TIPs outperformed SNPs in nine out of the eleven traits analyzed. In some traits like leaf senescence or grain width, using TIPs increased predictive correlation by 30-50%. Our results evidence, for the first time, that TIPs genotyping can improve prediction on complex agronomic traits in rice, especially when accessions to be predicted are less related to training accessions.
Collapse
Affiliation(s)
- Ioanna-Theoni Vourlaki
- Universitat Autònoma de Barcelona, Department of Animal Production, 08193, Bellaterra, Barcelona, Spain.
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, 08193, Bellaterra, Barcelona, Spain.
| | - Raúl Castanera
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, 08193, Bellaterra, Barcelona, Spain
| | - Sebastián E Ramos-Onsins
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, 08193, Bellaterra, Barcelona, Spain
| | - Josep M Casacuberta
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, 08193, Bellaterra, Barcelona, Spain
| | - Miguel Pérez-Enciso
- Universitat Autònoma de Barcelona, Department of Animal Production, 08193, Bellaterra, Barcelona, Spain.
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, 08193, Bellaterra, Barcelona, Spain.
- Catalan Institute for Research and Advanced studies, ICREA, 08010, Barcelona, Spain.
| |
Collapse
|
25
|
Gui S, Wei W, Jiang C, Luo J, Chen L, Wu S, Li W, Wang Y, Li S, Yang N, Li Q, Fernie AR, Yan J. A pan-Zea genome map for enhancing maize improvement. Genome Biol 2022; 23:178. [PMID: 35999561 PMCID: PMC9396798 DOI: 10.1186/s13059-022-02742-7] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Accepted: 07/27/2022] [Indexed: 12/22/2022] Open
Abstract
Background Maize (Zea mays L.) is at the vanguard facing the upcoming breeding challenges. However, both a super pan-genome for the Zea genus and a comprehensive genetic variation map for maize breeding are still lacking. Results Here, we construct an approximately 6.71-Gb pan-Zea genome that contains around 4.57-Gb non-B73 reference sequences from fragmented de novo assemblies of 721 pan-Zea individuals. We annotate a total of 58,944 pan-Zea genes and find around 44.34% of them are dispensable in the pan-Zea population. Moreover, 255,821 common structural variations are identified and genotyped in a maize association mapping panel. Further analyses reveal gene presence/absence variants and their potential roles during domestication of maize. Combining genetic analyses with multi-omics data, we demonstrate how structural variants are associated with complex agronomic traits. Conclusions Our results highlight the underexplored role of the pan-Zea genome and structural variations to further understand domestication of maize and explore their potential utilization in crop improvement. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02742-7.
Collapse
Affiliation(s)
- Songtao Gui
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Wenjie Wei
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Chenglin Jiang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Jingyun Luo
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Lu Chen
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Shenshen Wu
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Wenqiang Li
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Yuebin Wang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Shuyan Li
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Ning Yang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China.,Hubei Hongshan Laboratory, Wuhan, 430070, China
| | - Qing Li
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China.,Hubei Hongshan Laboratory, Wuhan, 430070, China
| | - Alisdair R Fernie
- Department of Molecular Physiology, Max-Planck-Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476, Potsdam, Golm, Germany
| | - Jianbing Yan
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China. .,Hubei Hongshan Laboratory, Wuhan, 430070, China.
| |
Collapse
|
26
|
Abstract
Structural variants (SVs) underlie genomic variation but are often overlooked due to difficult detection from short reads. Most algorithms have been tested on humans, and it remains unclear how applicable they are in other organisms. To solve this, we develop perSVade (personalized structural variation detection), a sample-tailored pipeline that provides optimally called SVs and their inferred accuracy, as well as small and copy number variants. PerSVade increases SV calling accuracy on a benchmark of six eukaryotes. We find no universal set of optimal parameters, underscoring the need for sample-specific parameter optimization. PerSVade will facilitate SV detection and study across diverse organisms.
Collapse
Affiliation(s)
- Miquel Àngel Schikora-Tamarit
- Barcelona Supercomputing Centre (BSC-CNS), Plaça Eusebi Güell, 1-3, 08034, Barcelona, Spain
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028, Barcelona, Spain
| | - Toni Gabaldón
- Barcelona Supercomputing Centre (BSC-CNS), Plaça Eusebi Güell, 1-3, 08034, Barcelona, Spain.
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028, Barcelona, Spain.
- Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain.
- Centro Investigación Biomédica En Red de Enfermedades Infecciosas, Barcelona, Spain.
| |
Collapse
|
27
|
Guerra‐Garcia A, Haile T, Ogutcen E, Bett KE, von Wettberg EJ. An evolutionary look into the history of lentil reveals unexpected diversity. Evol Appl 2022; 15:1313-1325. [PMID: 36051460 PMCID: PMC9423085 DOI: 10.1111/eva.13467] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 07/12/2022] [Accepted: 08/01/2022] [Indexed: 11/29/2022] Open
Abstract
The characterization and preservation of genetic variation in crops is critical to meeting the challenges of breeding in the face of changing climates and markets. In recent years, the use of single nucleotide polymorphisms (SNPs) has become routine, allowing us to understand the population structure, find divergent lines for crosses, and illuminate the origin of crops. However, the focus on SNPs overlooks other forms of variation, such as copy number variation (CNVs). Lentil is the third most important cold‐season legume and was domesticated in the Fertile Crescent. We genotyped 324 accessions that represent its global diversity, and using both SNPs and CNVs, we dissected the population structure and genetic variation, and identified candidate genes. Eight clusters were detected, most of them located in or near the Fertile Crescent, even though different clusters were present in distinct regions. The cluster from South Asia was particularly differentiated and presented low diversity, contrasting with the clusters from the Mediterranean and the northern temperate. Accessions from North America were mainly assigned to one cluster and were highly diverse, reflecting the efforts of breeding programs to integrate variation. Thirty‐three genes were identified as candidates under selection and among their functions were sporopollenin synthesis in pollen, a component of chlorophyll B reductase that partially determines the antenna size, and two genes related to the import system of chloroplasts. Eleven percent of all lentil genes and 21% of lentil disease resistance genes were affected by CNVs. The gene categories overrepresented in these genes were “enzymes,” “Cell Wall Organization,” and “external stimuli response.” All the genes found in the latter were associated with pathogen response. CNVs provided information about population structure and might have played a role in adaptation. The incorporation of CNVs in diversity studies is needed for a broader understanding of how they evolve and contribute to domestication.
Collapse
Affiliation(s)
| | - Teketel Haile
- Department of Plant Sciences University of Saskatchewan Saskatoon SK Canada
| | - Ezgi Ogutcen
- Conservatoire et Jardin Botaniques de la Ville de Genève Geneva Switzerland
| | - Kirstin E. Bett
- Department of Plant Sciences University of Saskatchewan Saskatoon SK Canada
| | - Eric J. von Wettberg
- Plant and Soil Science and Gund Institute for the Environment University of Vermont Burlington VT USA
| |
Collapse
|
28
|
Riehl K, Riccio C, Miska EA, Hemberg M. TransposonUltimate: software for transposon classification, annotation and detection. Nucleic Acids Res 2022; 50:e64. [PMID: 35234904 PMCID: PMC9226531 DOI: 10.1093/nar/gkac136] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Revised: 02/09/2022] [Accepted: 02/14/2022] [Indexed: 12/17/2022] Open
Abstract
Most genomes harbor a large number of transposons, and they play an important role in evolution and gene regulation. They are also of interest to clinicians as they are involved in several diseases, including cancer and neurodegeneration. Although several methods for transposon identification are available, they are often highly specialised towards specific tasks or classes of transposons, and they lack common standards such as a unified taxonomy scheme and output file format. We present TransposonUltimate, a powerful bundle of three modules for transposon classification, annotation, and detection of transposition events. TransposonUltimate comes as a Conda package under the GPL-3.0 licence, is well documented and it is easy to install through https://github.com/DerKevinRiehl/TransposonUltimate. We benchmark the classification module on the large TransposonDB covering 891,051 sequences to demonstrate that it outperforms the currently best existing solutions. The annotation and detection modules combine sixteen existing softwares, and we illustrate its use by annotating Caenorhabditis elegans, Rhizophagus irregularis and Oryza sativa subs. japonica genomes. Finally, we use the detection module to discover 29 554 transposition events in the genomes of 20 wild type strains of C. elegans. Databases, assemblies, annotations and further findings can be downloaded from (https://doi.org/10.5281/zenodo.5518085).
Collapse
Affiliation(s)
- Kevin Riehl
- Gurdon Institute, University of Cambridge, Cambridge CB2 1QN, UK
| | - Cristian Riccio
- Gurdon Institute, University of Cambridge, Cambridge CB2 1QN, UK
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
| | - Eric A Miska
- Gurdon Institute, University of Cambridge, Cambridge CB2 1QN, UK
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, UK
| | - Martin Hemberg
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
- Evergrande Center for Immunologic Diseases, Harvard Medical School and Brigham and Women’s Hospital, 75 Francis Street, Boston, MA 02215, USA
| |
Collapse
|
29
|
Mohd Saad NS, Neik TX, Thomas WJW, Amas JC, Cantila AY, Craig RJ, Edwards D, Batley J. Advancing designer crops for climate resilience through an integrated genomics approach. Curr Opin Plant Biol 2022; 67:102220. [PMID: 35489163 DOI: 10.1016/j.pbi.2022.102220] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Revised: 03/15/2022] [Accepted: 03/25/2022] [Indexed: 06/14/2023]
Abstract
Climate change and exponential population growth are exposing an immediate need for developing future crops that are highly resilient and adaptable to changing environments to maintain global food security in the next decade. Rigorous selection from long domestication history has rendered cultivated crops genetically disadvantaged, raising concerns in their ability to adapt to these new challenges and limiting their usefulness in breeding programmes. As a result, future crop improvement efforts must rely on integrating various genomic strategies ranging from high-throughput sequencing to machine learning, in order to exploit germplasm diversity and overcome bottlenecks created by domestication, expansive multi-dimensional phenotypes, arduous breeding processes, complex traits and big data.
Collapse
Affiliation(s)
- Nur Shuhadah Mohd Saad
- UWA School of Biological Sciences and the UWA Institute of Agriculture, University of Western Australia, Crawley, WA, Australia
| | - Ting Xiang Neik
- Sunway College Kuala Lumpur, Bandar Sunway, 47500, Selangor, Malaysia
| | - William J W Thomas
- UWA School of Biological Sciences and the UWA Institute of Agriculture, University of Western Australia, Crawley, WA, Australia
| | - Junrey C Amas
- UWA School of Biological Sciences and the UWA Institute of Agriculture, University of Western Australia, Crawley, WA, Australia
| | - Aldrin Y Cantila
- UWA School of Biological Sciences and the UWA Institute of Agriculture, University of Western Australia, Crawley, WA, Australia
| | - Ryan J Craig
- UWA School of Biological Sciences and the UWA Institute of Agriculture, University of Western Australia, Crawley, WA, Australia
| | - David Edwards
- UWA School of Biological Sciences and the UWA Institute of Agriculture, University of Western Australia, Crawley, WA, Australia
| | - Jacqueline Batley
- UWA School of Biological Sciences and the UWA Institute of Agriculture, University of Western Australia, Crawley, WA, Australia.
| |
Collapse
|
30
|
Canaguier A, Guilbaud R, Denis E, Magdelenat G, Belser C, Istace B, Cruaud C, Wincker P, Le Paslier MC, Faivre-Rampant P, Barbe V. Oxford Nanopore and Bionano Genomics technologies evaluation for plant structural variation detection. BMC Genomics 2022; 23:317. [PMID: 35448948 PMCID: PMC9026655 DOI: 10.1186/s12864-022-08499-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Accepted: 03/17/2022] [Indexed: 11/10/2022] Open
Abstract
Background Structural Variations (SVs) are genomic rearrangements derived from duplication, deletion, insertion, inversion, and translocation events. In the past, SVs detection was limited to cytological approaches, then to Next-Generation Sequencing (NGS) short reads and partitioned assemblies. Nowadays, technologies such as DNA long read sequencing and optical mapping have revolutionized the understanding of SVs in genomes, due to the enhancement of the power of SVs detection. This study aims to investigate performance of two techniques, 1) long-read sequencing obtained with the MinION device (Oxford Nanopore Technologies) and 2) optical mapping obtained with Saphyr device (Bionano Genomics) to detect and characterize SVs in the genomes of the two ecotypes of Arabidopsis thaliana, Columbia-0 (Col-0) and Landsberg erecta 1 (Ler-1). Results We described the SVs detected from the alignment of the best ONT assembly and DLE-1 optical maps of A. thaliana Ler-1 against the public reference genome Col-0 TAIR10.1. After filtering (SV > 1 kb), 1184 and 591 Ler-1 SVs were retained from ONT and Bionano technologies respectively. A total of 948 Ler-1 ONT SVs (80.1%) corresponded to 563 Bionano SVs (95.3%) leading to 563 common locations. The specific locations were scrutinized to assess improvement in SV detection by either technology. The ONT SVs were mostly detected near TE and gene features, and resistance genes seemed particularly impacted. Conclusions Structural variations linked to ONT sequencing error were removed and false positives limited, with high quality Bionano SVs being conserved. When compared with the Col-0 TAIR10.1 reference genome, most of the detected SVs discovered by both technologies were found in the same locations. ONT assembly sequence leads to more specific SVs than Bionano one, the latter being more efficient to characterize large SVs. Even if both technologies are complementary approaches, ONT data appears to be more adapted to large scale populations studies, while Bionano performs better in improving assembly and describing specificity of a genome compared to a reference. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08499-4.
Collapse
Affiliation(s)
- Aurélie Canaguier
- Université Paris-Saclay, INRAE, Etude du Polymorphisme des Génomes Végétaux EPGV, 91000, Evry-Courcouronnes, France
| | - Romane Guilbaud
- Université Paris-Saclay, INRAE, Etude du Polymorphisme des Génomes Végétaux EPGV, 91000, Evry-Courcouronnes, France
| | - Erwan Denis
- Genoscope, Institut de biologie François-Jacob, Commissariat à l'Energie Atomique CEA, Université Paris-Saclay, Evry, France
| | - Ghislaine Magdelenat
- Genoscope, Institut de biologie François-Jacob, Commissariat à l'Energie Atomique CEA, Université Paris-Saclay, Evry, France
| | - Caroline Belser
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Benjamin Istace
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Corinne Cruaud
- Genoscope, Institut de biologie François-Jacob, Commissariat à l'Energie Atomique CEA, Université Paris-Saclay, Evry, France
| | - Patrick Wincker
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Marie-Christine Le Paslier
- Université Paris-Saclay, INRAE, Etude du Polymorphisme des Génomes Végétaux EPGV, 91000, Evry-Courcouronnes, France
| | - Patricia Faivre-Rampant
- Université Paris-Saclay, INRAE, Etude du Polymorphisme des Génomes Végétaux EPGV, 91000, Evry-Courcouronnes, France.
| | - Valérie Barbe
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| |
Collapse
|
31
|
Wang C, Han B. Twenty years of rice genomics research: From sequencing and functional genomics to quantitative genomics. Mol Plant 2022; 15:593-619. [PMID: 35331914 DOI: 10.1016/j.molp.2022.03.009] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Revised: 03/04/2022] [Accepted: 03/18/2022] [Indexed: 06/14/2023]
Abstract
Since the completion of the rice genome sequencing project in 2005, we have entered the era of rice genomics, which is still in its ascendancy. Rice genomics studies can be classified into three stages: structural genomics, functional genomics, and quantitative genomics. Structural genomics refers primarily to genome sequencing for the construction of a complete map of rice genome sequence. This is fundamental for rice genetics and molecular biology research. Functional genomics aims to decode the functions of rice genes. Quantitative genomics is large-scale sequence- and statistics-based research to define the quantitative traits and genetic features of rice populations. Rice genomics has been a transformative influence on rice biological research and contributes significantly to rice breeding, making rice a good model plant for studying crop sciences.
Collapse
Affiliation(s)
- Changsheng Wang
- National Center for Gene Research, State Key Laboratory of Plant Molecular Genetics, Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai 200233, China.
| | - Bin Han
- National Center for Gene Research, State Key Laboratory of Plant Molecular Genetics, Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai 200233, China.
| |
Collapse
|
32
|
Zhu F, Ahchige MW, Brotman Y, Alseekh S, Zsögön A, Fernie AR. Bringing more players into play: Leveraging stress in genome wide association studies. J Plant Physiol 2022; 271:153657. [PMID: 35231821 DOI: 10.1016/j.jplph.2022.153657] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Revised: 02/14/2022] [Accepted: 02/21/2022] [Indexed: 06/14/2023]
Abstract
In order to meet the demand of the burgeoning human population as well as to adapt crops to the enhanced abiotic and biotic stress caused by the global climatic change, breeders focus on identifying valuable genes to improve both crop stress tolerance and crop quality. Recently, with the development of next-generation sequencing methods, millions of high quality single-nucleotide polymorphisms (SNPs) have been made available and genome-wide association studies (GWAS) are widely used in crop improvement studies to identify the associations between genetic variants of genomes and relevant crop agronomic traits. Here, we review classic cases of use of GWAS to identify genetic variants associated with valuable traits such as geographic adaptation, crop quality and metabolites. We discuss the power of stress GWAS to identify further associations including those with genes that are not, or only lowly, expressed during optimal growth conditions. Finally, we emphasize recent demonstrations of the efficiency and accuracy of time-resolved dynamic stress GWAS and GWAS based on genomic gene expression and structural variations, which can be applied to resolve more comprehensively the genetic regulation mechanisms of complex traits.
Collapse
Affiliation(s)
- Feng Zhu
- Max-Planck-Institut für Molekulare Pflanzenphysiologie, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany; National R&D Center for Citrus Preservation, Key Laboratory of Horticultural Plant Biology, Ministry of Education, Huazhong Agricultural University, 430070, Wuhan, China
| | - Micha Wijesingha Ahchige
- Max-Planck-Institut für Molekulare Pflanzenphysiologie, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany
| | - Yariv Brotman
- Max-Planck-Institut für Molekulare Pflanzenphysiologie, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany; Department of Life Sciences, Ben-Gurion University of the Negev, Beersheba, Israel
| | - Saleh Alseekh
- Max-Planck-Institut für Molekulare Pflanzenphysiologie, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany; Center of Plant Systems Biology and Biotechnology, 4000, Plovdiv, Bulgaria
| | - Agustin Zsögön
- Max-Planck-Institut für Molekulare Pflanzenphysiologie, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany; Departamento de Biologia Vegetal, Universidade Federal de Viçosa, CEP 36570-900, Viçosa, MG, Brazil
| | - Alisdair R Fernie
- Max-Planck-Institut für Molekulare Pflanzenphysiologie, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany; Center of Plant Systems Biology and Biotechnology, 4000, Plovdiv, Bulgaria.
| |
Collapse
|
33
|
Abstract
It is well known that large genomic variations can greatly impact the phenotype of an organism. Structural Variants (SVs) encompass any genomic variation larger than 30 base pairs, and include changes caused by deletions, inversions, duplications, transversions, and other genome modifications. Due to their size and complex nature, until recently, it has been difficult to truly capture these variations. Recent advances in sequencing technology and computational analyses now permit more extensive studies of SVs in plant genomes. In tomato, advances in sequencing technology have allowed researchers to sequence hundreds of genomes from tomatoes, and tomato relatives. These studies have identified SVs related to fruit size and flavor, as well as plant disease response, resistance/susceptibility, and the ability of plants to detect pathogens (immunity). In this review, we discuss the implications for genomic structural variation in plants with a focus on its role in tomato immunity. We also discuss how advances in sequencing technology have led to new discoveries of SVs in more complex genomes, the current evidence for the role of SVs in biotic and abiotic stress responses, and the outlook for genetic modification of SVs to advance plant breeding objectives.
Collapse
Affiliation(s)
- Emma Jobson
- Montana State University Extension, Montana State University, Bozeman, MT, 59717, United States
| | - Robyn Roberts
- Agricultural Biology Department, College of Agricultural Sciences, Colorado State University, Fort Collins, CO, USA.
| |
Collapse
|
34
|
Lemay MA, Sibbesen JA, Torkamaneh D, Hamel J, Levesque RC, Belzile F. Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology. BMC Biol 2022; 20:53. [PMID: 35197050 PMCID: PMC8867729 DOI: 10.1186/s12915-022-01255-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 02/16/2022] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND Structural variants (SVs), including deletions, insertions, duplications, and inversions, are relatively long genomic variations implicated in a diverse range of processes from human disease to ecology and evolution. Given their complex signatures, tendency to occur in repeated regions, and large size, discovering SVs based on short reads is challenging compared to single-nucleotide variants. The increasing availability of long-read technologies has greatly facilitated SV discovery; however, these technologies remain too costly to apply routinely to population-level studies. Here, we combined short-read and long-read sequencing technologies to provide a comprehensive population-scale assessment of structural variation in a panel of Canadian soybean cultivars. RESULTS We used Oxford Nanopore long-read sequencing data (~12× mean coverage) for 17 samples to both benchmark SV calls made from Illumina short-read data and predict SVs that were subsequently genotyped in a population of 102 samples using Illumina data. Benchmarking results show that variants discovered using Oxford Nanopore can be accurately genotyped from the Illumina data. We first use the genotyped deletions and insertions for population genetics analyses and show that results are comparable to those based on single-nucleotide variants. We observe that the population frequency and distribution within the genome of deletions and insertions are constrained by the location of genes. Gene Ontology and PFAM domain enrichment analyses also confirm previous reports that genes harboring high-frequency deletions and insertions are enriched for functions in defense response. Finally, we discover polymorphic transposable elements from the deletions and insertions and report evidence of the recent activity of a Stowaway MITE. CONCLUSIONS We show that structural variants discovered using Oxford Nanopore data can be genotyped with high accuracy from Illumina data. Our results demonstrate that long-read and short-read sequencing technologies can be efficiently combined to enhance SV analysis in large populations, providing a reusable framework for their study in a wider range of samples and non-model species.
Collapse
Affiliation(s)
- Marc-André Lemay
- Département de phytologie, Université Laval, Quebec, Canada
- Institut de biologie intégrative et des systèmes, Université Laval, Quebec, Canada
| | | | - Davoud Torkamaneh
- Département de phytologie, Université Laval, Quebec, Canada
- Institut de biologie intégrative et des systèmes, Université Laval, Quebec, Canada
| | - Jérémie Hamel
- Institut de biologie intégrative et des systèmes, Université Laval, Quebec, Canada
- Département de microbiologie-infectiologie et d’immunologie, Université Laval, Quebec, Canada
| | - Roger C. Levesque
- Institut de biologie intégrative et des systèmes, Université Laval, Quebec, Canada
- Département de microbiologie-infectiologie et d’immunologie, Université Laval, Quebec, Canada
| | - François Belzile
- Département de phytologie, Université Laval, Quebec, Canada
- Institut de biologie intégrative et des systèmes, Université Laval, Quebec, Canada
| |
Collapse
|
35
|
Li A, Liu A, Wu S, Qu K, Hu H, Yang J, Shrestha N, Liu J, Ren G. Comparison of structural variants in the whole genome sequences of two Medicago truncatula ecotypes: Jemalong A17 and R108. BMC Plant Biol 2022; 22:77. [PMID: 35193491 PMCID: PMC8862580 DOI: 10.1186/s12870-022-03469-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Accepted: 02/14/2022] [Indexed: 06/14/2023]
Abstract
BACKGROUND Structural variants (SVs) constitute a large proportion of the genomic variation that results in phenotypic variation in plants. However, they are still a largely unexplored feature in most plant genomes. Here, we present the whole-genome landscape of SVs between two model legume Medicago truncatula ecotypes-Jemalong A17 and R108- that have been extensively used in various legume biology studies. RESULTS To catalogue SVs, we first resolved the previously published R108 genome assembly (R108 v1.0) to chromosome-scale using 124 × Hi-C data, resulting in a high-quality genome assembly. The inter-chromosomal reciprocal translocations between chromosomes 4 and 8 were confirmed by performing syntenic analysis between the two genomes. Combined with the Hi-C data, it appears that these translocation events had a significant effect on chromatin organization. Using both whole-genome and short-read alignments, we identified the genomic landscape of SVs between the two genomes, some of which may account for several phenotypic differences, including their differential responses to aluminum toxicity and iron deficiency, and the development of different anthocyanin leaf markings. We also found extensive SVs within the nodule-specific cysteine-rich gene family which encodes antimicrobial peptides essential for terminal bacteroid differentiation during nitrogen-fixing symbiosis. CONCLUSIONS Our results provide a near-complete R108 genome assembly and the first genomic landscape of SVs obtained by comparing two M. truncatula ecotypes. This may provide valuable genomic resources for the functional and molecular research of legume biology in the future.
Collapse
Affiliation(s)
- Ao Li
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Ai Liu
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Shuang Wu
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Kunjing Qu
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Hongyin Hu
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Jinli Yang
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Nawal Shrestha
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Jianquan Liu
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education & State Key Lab of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, China
| | - Guangpeng Ren
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, China.
| |
Collapse
|
36
|
Vieira MB, Faustino MV, Lourenço TF, Oliveira MM. DNA-Based Tools to Certify Authenticity of Rice Varieties—An Overview. Foods 2022; 11:258. [PMID: 35159410 PMCID: PMC8834242 DOI: 10.3390/foods11030258] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Revised: 01/04/2022] [Accepted: 01/12/2022] [Indexed: 02/07/2023] Open
Abstract
Rice (Oryza sativa L.) is one of the most cultivated and consumed crops worldwide. It is mainly produced in Asia but, due to its large genetic pool, it has expanded to several ecosystems, latitudes and climatic conditions. Europe is a rice producing region, especially in the Mediterranean countries, that grow mostly typical japonica varieties. The European consumer interest in rice has increased over the last decades towards more exotic types, often more expensive (e.g., aromatic rice) and Europe is a net importer of this commodity. This has increased food fraud opportunities in the rice supply chain, which may deliver mixtures with lower quality rice, a problem that is now global. The development of tools to clearly identify undesirable mixtures thus became urgent. Among the various tools available, DNA-based markers are considered particularly reliable and stable for discrimination of rice varieties. This review covers aspects ranging from rice diversity and fraud issues to the DNA-based methods used to distinguish varieties and detect unwanted mixtures. Although not exhaustive, the review covers the diversity of strategies and ongoing improvements already tested, highlighting important advantages and disadvantages in terms of costs, reliability, labor-effort and potential scalability for routine fraud detection.
Collapse
|
37
|
Suvakov M, Panda A, Diesh C, Holmes I, Abyzov A. CNVpytor: a tool for copy number variation detection and analysis from read depth and allele imbalance in whole-genome sequencing. Gigascience 2021; 10:giab074. [PMID: 34817058 PMCID: PMC8612020 DOI: 10.1093/gigascience/giab074] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2021] [Revised: 08/21/2021] [Accepted: 10/22/2021] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Detecting copy number variations (CNVs) and copy number alterations (CNAs) based on whole-genome sequencing data is important for personalized genomics and treatment. CNVnator is one of the most popular tools for CNV/CNA discovery and analysis based on read depth. FINDINGS Herein, we present an extension of CNVnator developed in Python-CNVpytor. CNVpytor inherits the reimplemented core engine of its predecessor and extends visualization, modularization, performance, and functionality. Additionally, CNVpytor uses B-allele frequency likelihood information from single-nucleotide polymorphisms and small indels data as additional evidence for CNVs/CNAs and as primary information for copy number-neutral losses of heterozygosity. CONCLUSIONS CNVpytor is significantly faster than CNVnator-particularly for parsing alignment files (2-20 times faster)-and has (20-50 times) smaller intermediate files. CNV calls can be filtered using several criteria, annotated, and merged over multiple samples. Modular architecture allows it to be used in shared and cloud environments such as Google Colab and Jupyter notebook. Data can be exported into JBrowse, while a lightweight plugin version of CNVpytor for JBrowse enables nearly instant and GUI-assisted analysis of CNVs by any user. CNVpytor release and the source code are available on GitHub at https://github.com/abyzovlab/CNVpytor under the MIT license.
Collapse
Affiliation(s)
- Milovan Suvakov
- Department of Quantitative Health Sciences, Center for Individualized Medicine, Mayo Clinic, Rochester, MN 55905, USA
| | - Arijit Panda
- Department of Quantitative Health Sciences, Center for Individualized Medicine, Mayo Clinic, Rochester, MN 55905, USA
| | - Colin Diesh
- Department of Bioengineering, University of California, Berkeley, CA 94720, USA
| | - Ian Holmes
- Department of Bioengineering, University of California, Berkeley, CA 94720, USA
| | - Alexej Abyzov
- Department of Quantitative Health Sciences, Center for Individualized Medicine, Mayo Clinic, Rochester, MN 55905, USA
| |
Collapse
|
38
|
Liu DX, Rajaby R, Wei LL, Zhang L, Yang ZQ, Yang QY, Sung WK. Calling large indels in 1047 Arabidopsis with IndelEnsembler. Nucleic Acids Res 2021; 49:10879-10894. [PMID: 34643730 PMCID: PMC8565333 DOI: 10.1093/nar/gkab904] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 09/01/2021] [Accepted: 09/28/2021] [Indexed: 01/23/2023] Open
Abstract
Large indels greatly impact the observable phenotypes in different organisms including plants and human. Hence, extracting large indels with high precision and sensitivity is important. Here, we developed IndelEnsembler to detect large indels in 1047 Arabidopsis whole-genome sequencing data. IndelEnsembler identified 34 093 deletions, 12 913 tandem duplications and 9773 insertions. Our large indel dataset was more comprehensive and accurate compared with the previous dataset of AthCNV (1). We captured nearly twice of the ground truth deletions and on average 27% more ground truth duplications compared with AthCNV, though our dataset has less number of large indels compared with AthCNV. Our large indels were positively correlated with transposon elements across the Arabidopsis genome. The non-homologous recombination events were the major formation mechanism of deletions in Arabidopsis genome. The Neighbor joining (NJ) tree constructed based on IndelEnsembler's deletions clearly divided the geographic subgroups of 1047 Arabidopsis. More importantly, our large indels represent a previously unassessed source of genetic variation. Approximately 49% of the deletions have low linkage disequilibrium (LD) with surrounding single nucleotide polymorphisms. Some of them could affect trait performance. For instance, using deletion-based genome-wide association study (DEL-GWAS), the accessions containing a 182-bp deletion in AT1G11520 had delayed flowering time and all accessions in north Sweden had the 182-bp deletion. We also found the accessions with 65-bp deletion in the first exon of AT4G00650 (FRI) flowered earlier than those without it. These two deletions cannot be detected in AthCNV and, interestingly, they do not co-occur in any Arabidopsis thaliana accession. By SNP-GWAS, surrounding SNPs of these two deletions do not correlate with flowering time. This example demonstrated that existing large indel datasets miss phenotypic variations and our large indel dataset filled in the gap.
Collapse
Affiliation(s)
- Dong-Xu Liu
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.,Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Ramesh Rajaby
- School of Computing, National University of Singapore, 117417 Singapore.,NUS Graduate School for Integrative Sciences and Engineering, National University of Singapore, 117456, Singapore
| | - Lu-Lu Wei
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.,Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Lei Zhang
- Precision Medical Laboratory, Wuhan Children's Hospital (Wuhan Maternal and Child Healthcare Hospital), Tongji Medical College, Huazhong University of Science & Technology, Wuhan 430016, China
| | - Zhi-Quan Yang
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.,Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Qing-Yong Yang
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.,Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.,School of Computing, National University of Singapore, 117417 Singapore
| | - Wing-Kin Sung
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.,Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.,School of Computing, National University of Singapore, 117417 Singapore.,Genome Institute of Singapore, Genome, 138672 Singapore
| |
Collapse
|
39
|
Yuan Y, Bayer PE, Batley J, Edwards D. Current status of structural variation studies in plants. Plant Biotechnol J 2021; 19:2153-2163. [PMID: 34101329 PMCID: PMC8541774 DOI: 10.1111/pbi.13646] [Citation(s) in RCA: 46] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Revised: 05/31/2021] [Accepted: 06/03/2021] [Indexed: 05/23/2023]
Abstract
Structural variations (SVs) including gene presence/absence variations and copy number variations are a common feature of genomes in plants and, together with single nucleotide polymorphisms and epigenetic differences, are responsible for the heritable phenotypic diversity observed within and between species. Understanding the contribution of SVs to plant phenotypic variation is important for plant breeders to assist in producing improved varieties. The low resolution of early genetic technologies and inefficient methods have previously limited our understanding of SVs in plants. However, with the rapid expansion in genomic technologies, it is possible to assess SVs with an ever-greater resolution and accuracy. Here, we review the current status of SV studies in plants, examine the roles that SVs play in phenotypic traits, compare current technologies and assess future challenges for SV studies.
Collapse
Affiliation(s)
- Yuxuan Yuan
- School of Biological Sciences and Institute of AgricultureThe University of Western AustraliaPerthWAAustralia
- School of Life Sciences and State Key Laboratory for AgrobiotechnologyThe Chinese University of Hong KongHong Kong SARChina
| | - Philipp E. Bayer
- School of Biological Sciences and Institute of AgricultureThe University of Western AustraliaPerthWAAustralia
| | - Jacqueline Batley
- School of Biological Sciences and Institute of AgricultureThe University of Western AustraliaPerthWAAustralia
| | - David Edwards
- School of Biological Sciences and Institute of AgricultureThe University of Western AustraliaPerthWAAustralia
| |
Collapse
|
40
|
Tsugawa H, Rai A, Saito K, Nakabayashi R. Metabolomics and complementary techniques to investigate the plant phytochemical cosmos. Nat Prod Rep 2021; 38:1729-1759. [PMID: 34668509 DOI: 10.1039/d1np00014d] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Covering: up to 2021Plants and their associated microbial communities are known to produce millions of metabolites, a majority of which are still not characterized and are speculated to possess novel bioactive properties. In addition to their role in plant physiology, these metabolites are also relevant as existing and next-generation medicine candidates. Elucidation of the plant metabolite diversity is thus valuable for the successful exploitation of natural resources for humankind. Herein, we present a comprehensive review on recent metabolomics approaches to illuminate molecular networks in plants, including chemical isolation and enzymatic production as well as the modern metabolomics approaches such as stable isotope labeling, ultrahigh-resolution mass spectrometry, metabolome imaging (spatial metabolomics), single-cell analysis, cheminformatics, and computational mass spectrometry. Mass spectrometry-based strategies to characterize plant metabolomes through metabolite identification and annotation are described in detail. We also highlight the use of phytochemical genomics to mine genes associated with specialized metabolites' biosynthesis. Understanding the metabolic diversity through biotechnological advances is fundamental to elucidate the functions of the plant-derived specialized metabolome.
Collapse
Affiliation(s)
- Hiroshi Tsugawa
- RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan. .,RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan.,Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology, 2-24-16 Nakamachi, Koganei, Tokyo 184-8588, Japan.,Graduate School of Medical Life Science, Yokohama City University, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan
| | - Amit Rai
- RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan. .,Plant Molecular Science Center, Chiba University, 1-8-1 Inohana, Chuo-ku, Chiba 260-8675, Japan
| | - Kazuki Saito
- RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan. .,Plant Molecular Science Center, Chiba University, 1-8-1 Inohana, Chuo-ku, Chiba 260-8675, Japan
| | - Ryo Nakabayashi
- RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan.
| |
Collapse
|
41
|
Liu Z, Zhao H, Yan Y, Wei MX, Zheng YC, Yue EK, Alam MS, Smartt KO, Duan MH, Xu JH. Extensively Current Activity of Transposable Elements in Natural Rice Accessions Revealed by Singleton Insertions. Front Plant Sci 2021; 12:745526. [PMID: 34650583 PMCID: PMC8505701 DOI: 10.3389/fpls.2021.745526] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2021] [Accepted: 09/08/2021] [Indexed: 06/01/2023]
Abstract
Active transposable elements (TEs) have drawn more attention as they continue to create new insertions and contribute to genetic diversity of the genome. However, only a few have been discovered in rice up to now, and their activities are mostly induced by artificial treatments (e.g., tissue culture, hybridization etc.) rather than under normal growth conditions. To systematically survey the current activity of TEs in natural rice accessions and identify rice accessions carrying highly active TEs, the transposon insertion polymorphisms (TIPs) profile was used to identify singleton insertions, which were unique to a single accession and represented the new insertion of TEs in the genome. As a result, 10,924 high-confidence singletons from 251 TE families were obtained, covering all investigated TE types. The number of singletons varied substantially among different superfamilies/families, perhaps reflecting distinct current activity. Particularly, eight TE families maintained potentially higher activity in 3,000 natural rice accessions. Sixty percent of rice accessions were detected to contain singletons, indicating the extensive activity of TEs in natural rice accessions. Thirty-five TE families exhibited potentially high activity in at least one rice accession, and the majority of them showed variable activity among different rice groups/subgroups. These naturally active TEs would be ideal candidates for elucidating the molecular mechanisms underlying the transposition and activation of TEs, as well as investigating the interactions between TEs and the host genome.
Collapse
Affiliation(s)
- Zhen Liu
- Hainan Institute, Zhejiang University, Sanya, China
- Zhejiang Key Laboratory of Crop Germplasm, Institute of Crop Science, Zhejiang University, Hangzhou, China
| | - Han Zhao
- Jiangsu Provincial Key Laboratory of Agrobiology, Institute of Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing, China
| | - Yan Yan
- Zhejiang Key Laboratory of Crop Germplasm, Institute of Crop Science, Zhejiang University, Hangzhou, China
| | - Ming-Xiao Wei
- Zhejiang Key Laboratory of Crop Germplasm, Institute of Crop Science, Zhejiang University, Hangzhou, China
| | - Yun-Chao Zheng
- Zhejiang Key Laboratory of Crop Germplasm, Institute of Crop Science, Zhejiang University, Hangzhou, China
| | - Er-Kui Yue
- Zhejiang Key Laboratory of Crop Germplasm, Institute of Crop Science, Zhejiang University, Hangzhou, China
| | - Mohammad Shah Alam
- Zhejiang Key Laboratory of Crop Germplasm, Institute of Crop Science, Zhejiang University, Hangzhou, China
| | - Kwesi Odel Smartt
- Zhejiang Key Laboratory of Crop Germplasm, Institute of Crop Science, Zhejiang University, Hangzhou, China
| | - Ming-Hua Duan
- Zhejiang Zhengjingyuan Pharmacy Chain Co., Ltd., Hangzhou, China
- Hangzhou Zhengcaiyuan Pharmaceutical Co., Ltd., Hangzhou, China
| | - Jian-Hong Xu
- Hainan Institute, Zhejiang University, Sanya, China
- Zhejiang Key Laboratory of Crop Germplasm, Institute of Crop Science, Zhejiang University, Hangzhou, China
| |
Collapse
|
42
|
Gupta PK. GWAS for genetics of complex quantitative traits: Genome to pangenome and SNPs to SVs and k-mers. Bioessays 2021; 43:e2100109. [PMID: 34486143 DOI: 10.1002/bies.202100109] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Revised: 08/21/2021] [Accepted: 08/23/2021] [Indexed: 12/22/2022]
Abstract
The development of improved methods for genome-wide association studies (GWAS) for genetics of quantitative traits has been an active area of research during the last 25 years. This activity initially started with the use of mixed linear model (MLM), which was variously modified. During the last decade, however, with the availability of high throughput next generation sequencing (NGS) technology, development and use of pangenomes and novel markers including structural variations (SVs) and k-mers for GWAS has taken over as a new thrust area of research. Pangenomes and SVs are now available in humans, livestock, and a number of plant species, so that these resources along with k-mers are being used in GWAS for exploring additional genetic variation that was hitherto not available for analysis. These developments have resulted in significant improvement in GWAS methodology for detection of marker-trait associations (MTAs) that are relevant to human healthcare and crop improvement.
Collapse
Affiliation(s)
- Pushpendra K Gupta
- Department of Genetics and Plant Breeding, Ch. Charan Singh University Meerut, Meerut, Uttar Pradesh, India
| |
Collapse
|
43
|
Zhang S, Liu W, Liu X, Du X, Zhang K, Zhang Y, Song Y, Zi Y, Qiu Q, Lenstra JA, Liu J. Structural Variants Selected during Yak Domestication Inferred from Long-Read Whole-Genome Sequencing. Mol Biol Evol 2021; 38:3676-3680. [PMID: 33944937 PMCID: PMC8382902 DOI: 10.1093/molbev/msab134] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Structural variants (SVs) represent an important genetic resource for both natural and artificial selection. Here we present a chromosome-scale reference genome for domestic yak (Bos grunniens) that has longer contigs and scaffolds (N50 44.72 and 114.39 Mb, respectively) than reported for any other ruminant genome. We further obtained long-read resequencing data for 6 wild and 23 domestic yaks and constructed a genetic SV map of 372,220 SVs that covers the geographic range of the yaks. The majority of the SVs contains repetitive sequences and several are in or near genes. By comparing SVs in domestic and wild yaks, we identified genes that are predominantly related to the nervous system, behavior, immunity, and reproduction and may have been targeted by artificial selection during yak domestication. These findings provide new insights in the domestication of animals living at high altitude and highlight the importance of SVs in animal domestication.
Collapse
Affiliation(s)
- Shangzhe Zhang
- State Key Laboratory of Grassland and Agro-ecosystem, Institute of Innovation Ecology and School of Life Science, Lanzhou University, Lanzhou, China
| | - Wenyu Liu
- State Key Laboratory of Grassland and Agro-ecosystem, Institute of Innovation Ecology and School of Life Science, Lanzhou University, Lanzhou, China
| | - Xinfeng Liu
- State Key Laboratory of Grassland and Agro-ecosystem, Institute of Innovation Ecology and School of Life Science, Lanzhou University, Lanzhou, China
| | - Xin Du
- State Key Laboratory of Grassland and Agro-ecosystem, Institute of Innovation Ecology and School of Life Science, Lanzhou University, Lanzhou, China
| | - Ke Zhang
- State Key Laboratory of Grassland and Agro-ecosystem, Institute of Innovation Ecology and School of Life Science, Lanzhou University, Lanzhou, China
| | - Yang Zhang
- The Supercomputing Center, Lanzhou University, Lanzhou, China
| | - Yongwu Song
- Animal Disease Prevention and Control Center of Gangcha County, Haibei Tibetan Autonomous Prefecture, China
| | - Yunnan Zi
- Animal Husbandry Workstation of Xiahe County, Gannan Tibetan Autonomous Prefecture, China
| | - Qiang Qiu
- State Key Laboratory of Grassland and Agro-ecosystem, Institute of Innovation Ecology and School of Life Science, Lanzhou University, Lanzhou, China
| | - Johannes A Lenstra
- Faculty of Veterinary Medicine, Utrecht University, Utrecht, The Netherlands
| | - Jianquan Liu
- State Key Laboratory of Grassland and Agro-ecosystem, Institute of Innovation Ecology and School of Life Science, Lanzhou University, Lanzhou, China
| |
Collapse
|
44
|
Demirci S, Fuentes RR, van Dooijeweert W, Aflitos S, Schijlen E, Hesselink T, de Ridder D, van Dijk ADJ, Peters S. Chasing breeding footprints through structural variations in Cucumis melo and wild relatives. G3 (Bethesda) 2021; 11:6044141. [PMID: 33561242 PMCID: PMC8022733 DOI: 10.1093/g3journal/jkaa038] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Accepted: 11/24/2020] [Indexed: 02/07/2023]
Abstract
Cucumis melo (melon or muskmelon) is an important crop in the family of the Cucurbitaceae. Melon is cross pollinated and domesticated at several locations throughout the breeding history, resulting in highly diverse genetic structure in the germplasm. Yet, the relations among the groups and cultivars are still incomplete. We shed light on the melonbreeding history, analyzing structural variations ranging from 50 bp up to 100 kb, identified from whole genome sequences of 100 selected melon accessions and wild relatives. Phylogenetic trees based on SV types completely resolve cultivars and wild accessions into two monophyletic groups and clustering of cultivars largely correlates with their geographic origin. Taking into account morphology, we found six mis-categorized cultivars. Unique inversions are more often shared between cultivars, carrying advantageous genes and do not directly originate from wild species. Approximately 60% of the inversion breaks carry a long poly A/T motif, and following observations in other plant species, suggest that inversions in melon likely resulted from meiotic recombination events. We show that resistance genes in the linkage V region are expanded in the cultivar genomes compared to wild relatives. Furthermore, particular agronomic traits such as fruit ripening, fragrance, and stress response are specifically selected for in the melon subspecies. These results represent distinctive footprints of selective breeding that shaped today's melon. The sequences and genomic relations between land races, wild relatives, and cultivars will serve the community to identify genetic diversity, optimize experimental designs, and enhance crop development.
Collapse
Affiliation(s)
- Sevgin Demirci
- Bioinformatics Group, Wageningen University & Research, 6708 PB, Wageningen, the Netherlands.,Department of Bioscience, Wageningen Plant Research, Wageningen University & Research, 6708 PB, Wageningen, the Netherlands.,Keygene N.V., 6708 PW Wageningen, the Netherlands
| | - Roven Rommel Fuentes
- Bioinformatics Group, Wageningen University & Research, 6708 PB, Wageningen, the Netherlands
| | - Willem van Dooijeweert
- Centre for Genetic Resources, Wageningen University & Research, 6708PB, Wageningen, the Netherlands
| | - Saulo Aflitos
- Bejo Zaden B.V., 1749 CZ Warmenhuizen, the Netherlands
| | - Elio Schijlen
- Department of Bioscience, Wageningen Plant Research, Wageningen University & Research, 6708 PB, Wageningen, the Netherlands
| | - Thamara Hesselink
- Department of Bioscience, Wageningen Plant Research, Wageningen University & Research, 6708 PB, Wageningen, the Netherlands
| | - Dick de Ridder
- Bioinformatics Group, Wageningen University & Research, 6708 PB, Wageningen, the Netherlands
| | - Aalt D J van Dijk
- Bioinformatics Group, Wageningen University & Research, 6708 PB, Wageningen, the Netherlands.,Biometris, Wageningen University & Research, 6708PB Wageningen, the Netherlands
| | - Sander Peters
- Department of Bioscience, Wageningen Plant Research, Wageningen University & Research, 6708 PB, Wageningen, the Netherlands
| |
Collapse
|
45
|
Torres DE, Thomma BPHJ, Seidl MF. Transposable Elements Contribute to Genome Dynamics and Gene Expression Variation in the Fungal Plant Pathogen Verticillium dahliae. Genome Biol Evol 2021; 13:evab135. [PMID: 34100895 PMCID: PMC8290119 DOI: 10.1093/gbe/evab135] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/04/2021] [Indexed: 12/12/2022] Open
Abstract
Transposable elements (TEs) are a major source of genetic and regulatory variation in their host genome and are consequently thought to play important roles in evolution. Many fungal and oomycete plant pathogens have evolved dynamic and TE-rich genomic regions containing genes that are implicated in host colonization and adaptation. TEs embedded in these regions have typically been thought to accelerate the evolution of these genomic compartments, but little is known about their dynamics in strains that harbor them. Here, we used whole-genome sequencing data of 42 strains of the fungal plant pathogen Verticillium dahliae to systematically identify polymorphic TEs that may be implicated in genomic as well as in gene expression variation. We identified 2,523 TE polymorphisms and characterize a subset of 8% of the TEs as polymorphic elements that are evolutionary younger, less methylated, and more highly expressed when compared with the remaining 92% of the total TE complement. As expected, the polyrmorphic TEs are enriched in the adaptive genomic regions. Besides, we observed an association of polymorphic TEs with pathogenicity-related genes that localize nearby and that display high expression levels. Collectively, our analyses demonstrate that TE dynamics in V. dahliae contributes to genomic variation, correlates with expression of pathogenicity-related genes, and potentially impacts the evolution of adaptive genomic regions.
Collapse
Affiliation(s)
- David E Torres
- Theoretical Biology and Bioinformatics Group, Department of Biology, Utrecht University, The Netherlands
- Laboratory of Phytopathology, Wageningen University and Research, The Netherlands
| | - Bart P H J Thomma
- Laboratory of Phytopathology, Wageningen University and Research, The Netherlands
- Cluster of Excellence on Plant Sciences (CEPLAS), Institute for Plant Sciences, University of Cologne, Germany
| | - Michael F Seidl
- Theoretical Biology and Bioinformatics Group, Department of Biology, Utrecht University, The Netherlands
| |
Collapse
|
46
|
Jayakodi M, Schreiber M, Stein N, Mascher M. Building pan-genome infrastructures for crop plants and their use in association genetics. DNA Res 2021; 28:6117190. [PMID: 33484244 PMCID: PMC7934568 DOI: 10.1093/dnares/dsaa030] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Indexed: 12/20/2022] Open
Abstract
Pan-genomic studies aim at representing the entire sequence diversity within a species to provide useful resources for evolutionary studies, functional genomics and breeding of cultivated plants. Cost reductions in high-throughput sequencing and advances in sequence assembly algorithms have made it possible to create multiple reference genomes along with a catalogue of all forms of genetic variations in plant species with large and complex or polyploid genomes. In this review, we summarize the current approaches to building pan-genomes as an in silico representation of plant sequence diversity and outline relevant methods for their effective utilization in linking structural with phenotypic variation. We propose as future research avenues (i) transcriptomic and epigenomic studies across multiple reference genomes and (ii) the development of user-friendly and feature-rich pan-genome browsers.
Collapse
Affiliation(s)
- Murukarthick Jayakodi
- Department of Genebank, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Mona Schreiber
- Department of Genebank, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Nils Stein
- Department of Genebank, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany.,Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, Göttingen, Germany
| | - Martin Mascher
- Department of Genebank, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany.,German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Saxony, Germany
| |
Collapse
|
47
|
Zhang H, Wang Y, Deng C, Zhao S, Zhang P, Feng J, Huang W, Kang S, Qian Q, Xiong G, Chang Y. High-quality genome assembly of Huazhan and Tianfeng, the parents of an elite rice hybrid Tian-you-hua-zhan. Sci China Life Sci 2021; 65:398-411. [PMID: 34251582 DOI: 10.1007/s11427-020-1940-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Accepted: 05/02/2021] [Indexed: 12/24/2022]
Abstract
High-quality rice reference genomes have accelerated the comprehensive identification of genome-wide variations and research on functional genomics and breeding. Tian-you-hua-zhan has been a leading hybrid in China over the past decade. Here, de novo genome assembly strategy optimization for the rice indica lines Huazhan (HZ) and Tianfeng (TF), including sequencing platforms, assembly pipelines and sequence depth, was carried out. The PacBio and Nanopore platforms for long-read sequencing were utilized, with the Canu, wtdbg2, SMARTdenovo, Flye, Canu-wtdbg2, Canu-SMARTdenovo and Canu-Flye assemblers. The combination of PacBio and Canu was optimal, considering the contig N50 length, contig number, assembled genome size and polishing process. The assembled contigs were scaffolded with Hi-C data, resulting in two "golden quality" rice reference genomes, and evaluated using the scaffold N50, BUSCO, and LTR assembly index. Furthermore, 42,625 and 41,815 non-transposable element genes were annotated for HZ and TF, respectively. Based on our assembly of HZ and TF, as well as Zhenshan97, Minghui63, Shuhui498 and 9311, comprehensive variations were identified using Nipponbare as a reference. The de novo assembly strategy for rice we optimized and the "golden quality" rice genomes we produced for HZ and TF will benefit rice genomics and breeding research, especially with respect to uncovering the genomic basis of the elite traits of HZ and TF.
Collapse
Affiliation(s)
- Hui Zhang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China.,State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Yuexing Wang
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, 310006, China
| | - Ce Deng
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China.,National Key Laboratory of Wheat and Maize Crop Science, College of Agronomy, Henan Agricultural University, Zhengzhou, 450002, China
| | - Sheng Zhao
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Peng Zhang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China.,College of Life Science and Technology, Guangxi University, Nanning, 530004, China
| | - Jie Feng
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China.,College of Life Science and Technology, Guangxi University, Nanning, 530004, China
| | - Wei Huang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Shujing Kang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China.,State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, 310006, China
| | - Qian Qian
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, 310006, China
| | - Guosheng Xiong
- Plant Phenomics Research Center, Nanjing Agricultural University, Nanjing, 210095, China.
| | - Yuxiao Chang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China.
| |
Collapse
|
48
|
Zheng Y, Li S, Huang J, Fu H, Zhou L, Furusawa Y, Shu Q. Identification and characterization of inheritable structural variations induced by ion beam radiations in rice. Mutat Res 2021; 823:111757. [PMID: 34271440 DOI: 10.1016/j.mrfmmm.2021.111757] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2020] [Revised: 06/15/2021] [Accepted: 06/17/2021] [Indexed: 12/01/2022]
Abstract
High energy ion beams are effective physical mutagens for mutation induction in plants. Due to their high linear energy transfer (LET) property, they are known to generate single nucleotide variations (SNVs) and insertion/deletions (InDels, <50 bp) as well as structural variations (SVs). However, due to the technical difficulties to identify SVs, studies on ion beam induced SVs by genome sequencing have so far been limited in numbers and inadequate in nature, and knowledge of SVs is scarce with regards to their characteristics. In the present study, we identified and validated SVs in six M4 plants (designated as Ar_50, Ar_100, C_150, C_200, Ne_50 and Ne_100 according to ion beam types and irradiation doses), two each induced by argon (40Ar18+), carbon (12C6+) and neon (20Ne10+) ion beams and performed in depth analyses of their characteristics. In total, 22 SVs were identified and validated, consisting of 11 deletions, 1 duplication, and 4 intra-chromosomal and 6 inter-chromosomal translocations. There were several SVs larger than 1 kbp. The SVs were distributed across the whole genome with an aggregation with SNVs and InDels only in the Ne_50 mutants. An enrichment of a 11-bp wide G-rich DNA motif 'GAAGGWGGRGG' was identified around the SV breakpoints. Three mechanisms might be involved in the SV formation, i.e., the expansion of tandem repeats, transposable element insertion, and non-allelic homologous recombination. Put together, the present study provides a preliminary view of SVs induced by Ar, C and Ne ion beam radiations, and as a pilot study, it contributes to our understanding of how SVs might form after ion beam irradiation in rice.
Collapse
Affiliation(s)
- Yunchao Zheng
- National Key Laboratory of Rice Biology, Institute of Crop Sciences, Zhejiang University, Hangzhou, 310058, China; Institute of Nuclear-Agricultural Sciences, Zhejiang University, Hangzhou, 310058, China.
| | - Shan Li
- National Key Laboratory of Rice Biology, Institute of Crop Sciences, Zhejiang University, Hangzhou, 310058, China; Zhejiang Provincial Key Laboratory of Crop Germplasm, Zhejiang University, Hangzhou, 310058, China.
| | - Jianzhong Huang
- Institute of Nuclear-Agricultural Sciences, Zhejiang University, Hangzhou, 310058, China.
| | - Haowei Fu
- Jiaxing Academy of Agricultural Science, Jiaxing, Zhejiang, 314016, China.
| | - Libin Zhou
- Biophysics Group, Biomedical Research Center, Institute of Modern Physics, Chinese Academy of Science, Lanzhou, 730000, China.
| | - Yoshiya Furusawa
- Department of Basic Medical Sciences for Radiation Damages, National Institute of Radiological Sciences, National Institutes for Quantum and Radiological Science and Technology, Chiba, 263-8555, Japan.
| | - Qingyao Shu
- National Key Laboratory of Rice Biology, Institute of Crop Sciences, Zhejiang University, Hangzhou, 310058, China; Zhejiang Provincial Key Laboratory of Crop Germplasm, Zhejiang University, Hangzhou, 310058, China.
| |
Collapse
|
49
|
Badet T, Fouché S, Hartmann FE, Zala M, Croll D. Machine-learning predicts genomic determinants of meiosis-driven structural variation in a eukaryotic pathogen. Nat Commun 2021; 12:3551. [PMID: 34112792 PMCID: PMC8192914 DOI: 10.1038/s41467-021-23862-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Accepted: 05/11/2021] [Indexed: 02/05/2023] Open
Abstract
Species harbor extensive structural variation underpinning recent adaptive evolution. However, the causality between genomic features and the induction of new rearrangements is poorly established. Here, we analyze a global set of telomere-to-telomere genome assemblies of a fungal pathogen of wheat to establish a nucleotide-level map of structural variation. We show that the recent emergence of pesticide resistance has been disproportionally driven by rearrangements. We use machine learning to train a model on structural variation events based on 30 chromosomal sequence features. We show that base composition and gene density are the major determinants of structural variation. Retrotransposons explain most inversion, indel and duplication events. We apply our model to Arabidopsis thaliana and show that our approach extends to more complex genomes. Finally, we analyze complete genomes of haploid offspring in a four-generation pedigree. Meiotic crossover locations are enriched for new rearrangements consistent with crossovers being mutational hotspots. The model trained on species-wide structural variation accurately predicts the position of >74% of newly generated variants along the pedigree. The predictive power highlights causality between specific sequence features and the induction of chromosomal rearrangements. Our work demonstrates that training sequence-derived models can accurately identify regions of intrinsic DNA instability in eukaryotic genomes.
Collapse
Affiliation(s)
- Thomas Badet
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Simone Fouché
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
- Plant Pathology, Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland
| | - Fanny E Hartmann
- Ecologie Systématique Evolution, Bâtiment 360, Univ. Paris-Sud, AgroParisTech, CNRS, Université Paris-Saclay, Orsay, France
| | - Marcello Zala
- Plant Pathology, Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland
| | - Daniel Croll
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland.
| |
Collapse
|
50
|
Qin P, Lu H, Du H, Wang H, Chen W, Chen Z, He Q, Ou S, Zhang H, Li X, Li X, Li Y, Liao Y, Gao Q, Tu B, Yuan H, Ma B, Wang Y, Qian Y, Fan S, Li W, Wang J, He M, Yin J, Li T, Jiang N, Chen X, Liang C, Li S. Pan-genome analysis of 33 genetically diverse rice accessions reveals hidden genomic variations. Cell 2021; 184:3542-3558.e16. [PMID: 34051138 DOI: 10.1016/j.cell.2021.04.046] [Citation(s) in RCA: 188] [Impact Index Per Article: 62.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Revised: 01/31/2021] [Accepted: 04/24/2021] [Indexed: 12/30/2022]
Abstract
Structural variations (SVs) and gene copy number variations (gCNVs) have contributed to crop evolution, domestication, and improvement. Here, we assembled 31 high-quality genomes of genetically diverse rice accessions. Coupling with two existing assemblies, we developed pan-genome-scale genomic resources including a graph-based genome, providing access to rice genomic variations. Specifically, we discovered 171,072 SVs and 25,549 gCNVs and used an Oryza glaberrima assembly to infer the derived states of SVs in the Oryza sativa population. Our analyses of SV formation mechanisms, impacts on gene expression, and distributions among subpopulations illustrate the utility of these resources for understanding how SVs and gCNVs shaped rice environmental adaptation and domestication. Our graph-based genome enabled genome-wide association study (GWAS)-based identification of phenotype-associated genetic variations undetectable when using only SNPs and a single reference assembly. Our work provides rich population-scale resources paired with easy-to-access tools to facilitate rice breeding as well as plant functional genomics and evolutionary biology research.
Collapse
Affiliation(s)
- Peng Qin
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China.
| | - Hongwei Lu
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Huilong Du
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China; School of Life Sciences, Institute of Life Sciences and Green Development, Hebei University, Baoding, China
| | - Hao Wang
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Weilan Chen
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Zhuo Chen
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Qiang He
- School of Life Sciences, Institute of Life Sciences and Green Development, Hebei University, Baoding, China
| | - Shujun Ou
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, USA
| | - Hongyu Zhang
- School of Life Sciences, Institute of Life Sciences and Green Development, Hebei University, Baoding, China
| | - Xuanzhao Li
- School of Life Sciences, Institute of Life Sciences and Green Development, Hebei University, Baoding, China
| | - Xiuxiu Li
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Yan Li
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Yi Liao
- Department of Ecology and Evolutionary Biology, University of California, Irvine, Irvine, CA, USA
| | - Qiang Gao
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Bin Tu
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Hua Yuan
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Bingtian Ma
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Yuping Wang
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Yangwen Qian
- Biogle Genome Editing Center, Changzhou, Jiangsu, China
| | - Shijun Fan
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Weitao Li
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Jing Wang
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Min He
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Junjie Yin
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Ting Li
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI, USA
| | - Xuewei Chen
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Chengzhi Liang
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China.
| | - Shigui Li
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China.
| |
Collapse
|