1
|
A joint learning approach for genomic prediction in polyploid grasses. Sci Rep 2022; 12:12499. [PMID: 35864135 PMCID: PMC9304331 DOI: 10.1038/s41598-022-16417-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 07/11/2022] [Indexed: 12/20/2022] Open
Abstract
Poaceae, among the most abundant plant families, includes many economically important polyploid species, such as forage grasses and sugarcane (Saccharum spp.). These species have elevated genomic complexities and limited genetic resources, hindering the application of marker-assisted selection strategies. Currently, the most promising approach for increasing genetic gains in plant breeding is genomic selection. However, due to the polyploidy nature of these polyploid species, more accurate models for incorporating genomic selection into breeding schemes are needed. This study aims to develop a machine learning method by using a joint learning approach to predict complex traits from genotypic data. Biparental populations of sugarcane and two species of forage grasses (Urochloa decumbens, Megathyrsus maximus) were genotyped, and several quantitative traits were measured. High-quality markers were used to predict several traits in different cross-validation scenarios. By combining classification and regression strategies, we developed a predictive system with promising results. Compared with traditional genomic prediction methods, the proposed strategy achieved accuracy improvements exceeding 50%. Our results suggest that the developed methodology could be implemented in breeding programs, helping reduce breeding cycles and increase genetic gains.
Collapse
|
2
|
de Oliveira LP, Navarro BV, de Jesus Pereira JP, Lopes AR, Martins MCM, Riaño-Pachón DM, Buckeridge MS. Bioinformatic analyses to uncover genes involved in trehalose metabolism in the polyploid sugarcane. Sci Rep 2022; 12:7516. [PMID: 35525890 PMCID: PMC9079074 DOI: 10.1038/s41598-022-11508-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Accepted: 03/22/2022] [Indexed: 11/09/2022] Open
Abstract
Trehalose-6-phosphate (T6P) is an intermediate of trehalose biosynthesis that plays an essential role in plant metabolism and development. Here, we comprehensively analyzed sequences from enzymes of trehalose metabolism in sugarcane, one of the main crops used for bioenergy production. We identified protein domains, phylogeny, and in silico expression levels for all classes of enzymes. However, post-translational modifications and residues involved in catalysis and substrate binding were analyzed only in trehalose-6-phosphate synthase (TPS) sequences. We retrieved 71 putative full-length TPS, 93 trehalose-6-phosphate phosphatase (TPP), and 3 trehalase (TRE) of sugarcane, showing all their conserved domains, respectively. Putative TPS (Classes I and II) and TPP sugarcane sequences were categorized into well-known groups reported in the literature. We measured the expression levels of the sequences from one sugarcane leaf transcriptomic dataset. Furthermore, TPS Class I has specific N-glycosylation sites inserted in conserved motifs and carries catalytic and binding residues in its TPS domain. Some of these residues are mutated in TPS Class II members, which implies loss of enzyme activity. Our approach retrieved many homo(eo)logous sequences for genes involved in trehalose metabolism, paving the way to discover the role of T6P signaling in sugarcane.
Collapse
Affiliation(s)
- Lauana Pereira de Oliveira
- Laboratório de Fisiologia Ecológica de Plantas, Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil.,Instituto Nacional de Ciência e Tecnologia do Bioetanol, São Paulo, Brazil
| | - Bruno Viana Navarro
- Laboratório de Fisiologia Ecológica de Plantas, Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil.,Instituto Nacional de Ciência e Tecnologia do Bioetanol, São Paulo, Brazil
| | - João Pedro de Jesus Pereira
- Laboratório de Fisiologia Ecológica de Plantas, Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil.,Instituto Nacional de Ciência e Tecnologia do Bioetanol, São Paulo, Brazil
| | | | - Marina C M Martins
- Laboratório de Fisiologia Ecológica de Plantas, Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil.,Instituto Nacional de Ciência e Tecnologia do Bioetanol, São Paulo, Brazil
| | - Diego Mauricio Riaño-Pachón
- Laboratório de Biologia Computacional, Centro de Energia Nuclear na Agricultura, Evolutiva e de Sistemas, Universidade de São Paulo, São Paulo, Brazil. .,Instituto Nacional de Ciência e Tecnologia do Bioetanol, São Paulo, Brazil.
| | - Marcos Silveira Buckeridge
- Laboratório de Fisiologia Ecológica de Plantas, Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil. .,Instituto Nacional de Ciência e Tecnologia do Bioetanol, São Paulo, Brazil.
| |
Collapse
|
3
|
de Oliveira Silva L, da Silva Pereira L, Pereira JL, Gomes VM, Grativol C. Divergence and conservation of defensins and lipid transfer proteins (LTPs) from sugarcane wild species and modern cultivar genomes. Funct Integr Genomics 2022; 22:235-250. [PMID: 35195843 DOI: 10.1007/s10142-022-00832-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 09/24/2021] [Accepted: 02/15/2022] [Indexed: 11/04/2022]
Abstract
Plant defensins and lipid transfer proteins (LTPs) constitute a large and evolutionarily diverse family of antimicrobial peptides. Defensins and LTPs are two pathogenesis-related proteins (PR proteins) whose characterization may help to uncover aspects about the sugarcane response to pathogens attack. LTPs have also been investigated for their participation in the response to different types of stress. Despite the important roles of defensins and LTPs in biotic and abiotic stresses, scarce knowledge is found about these proteins in sugarcane. By using bioinformatics approaches, we characterized defensins and LTPs in the sugarcane wild species and modern cultivar genomes. The identification of defensins and LTPs showed that all five defensins groups and eight of the nine LTPs have their respective genes loci, although some was only identified in the cultivar genome. Phylogenetic analysis showed that defensins appear to be more conserved among groups of plants than LTPs. Some defensins and LTPs showed opposite expression during pathogenic and benefic bacterial interactions. Interestingly, the expression of defensins and LTPs in shoots and roots was completely different in plants submitted to benefic bacteria or water depletion. Finally, the modeling and comparison of isoforms of LTPs and defensins in wild species and cultivars revealed a high conservation of tertiary structures, with variation of amino acids in different regions of proteins, which could impact their antimicrobial activity. Our data contributed to the characterization of defensins and LTPs in sugarcane and provided new elements for understanding the involvement of these proteins in sugarcane response to different types of stress.
Collapse
Affiliation(s)
- Leandro de Oliveira Silva
- Laboratório de Química, Função de Proteínas E Peptídeos, Centro de Biociências E Biotecnologia, Universidade Estadual Do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, Brazil
| | - Lídia da Silva Pereira
- Laboratório de Fisiologia E Bioquímica de Microrganismos, Centro de Biociências E Biotecnologia, Universidade Estadual Do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, Brazil
| | - Jacymara Lopes Pereira
- Laboratório de Química, Função de Proteínas E Peptídeos, Centro de Biociências E Biotecnologia, Universidade Estadual Do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, Brazil
| | - Valdirene Moreira Gomes
- Laboratório de Fisiologia E Bioquímica de Microrganismos, Centro de Biociências E Biotecnologia, Universidade Estadual Do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, Brazil
| | - Clícia Grativol
- Laboratório de Química, Função de Proteínas E Peptídeos, Centro de Biociências E Biotecnologia, Universidade Estadual Do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, Brazil.
| |
Collapse
|
4
|
Batista LG, Mello VH, Souza AP, Margarido GRA. Genomic prediction with allele dosage information in highly polyploid species. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:723-739. [PMID: 34800132 DOI: 10.1007/s00122-021-03994-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Accepted: 11/06/2021] [Indexed: 06/13/2023]
Abstract
Including allele, dosage can improve genomic selection in highly polyploid species under higher frequency of different heterozygous genotypic classes and high dominance degree levels. Several studies have shown how to leverage allele dosage information to improve the accuracy of genomic selection models in autotetraploid. In this study, we expanded the methodology used for genomic selection in autotetraploid to higher (and mixed) ploidy levels. We adapted the models to build covariance matrices of both additive and digenic dominance effects that are subsequently used in genomic selection models. We applied these models using estimates of ploidy and allele dosage to sugarcane and sweet potato datasets and validated our results by also applying the models in simulated data. For the simulated datasets, including allele dosage information led up to 140% higher mean predictive abilities in comparison to using diploidized markers. Including dominance effects were highly advantageous when using diploidized markers, leading to mean predictive abilities which were up to 115% higher in comparison to only including additive effects. When the frequency of heterozygous genotypes in the population was low, such as in the sugarcane and sweet potato datasets, there was little advantage in including allele dosage information in the models. Overall, we show that including allele dosage can improve genomic selection in highly polyploid species under higher frequency of different heterozygous genotypic classes and high dominance degree levels.
Collapse
Affiliation(s)
- Lorena G Batista
- Luiz de Queiroz" College of Agriculture, University of São Paulo, Piracicaba, SP, 13418-900, Brazil
| | - Victor H Mello
- Luiz de Queiroz" College of Agriculture, University of São Paulo, Piracicaba, SP, 13418-900, Brazil
| | - Anete P Souza
- Center of Molecular Biology and Genetic Engineering, University of Campinas, Campinas, SP, 13083-970, Brazil
| | - Gabriel R A Margarido
- Luiz de Queiroz" College of Agriculture, University of São Paulo, Piracicaba, SP, 13418-900, Brazil.
| |
Collapse
|
5
|
Genome-wide approaches for the identification of markers and genes associated with sugarcane yellow leaf virus resistance. Sci Rep 2021; 11:15730. [PMID: 34344928 PMCID: PMC8333424 DOI: 10.1038/s41598-021-95116-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Accepted: 07/19/2021] [Indexed: 11/10/2022] Open
Abstract
Sugarcane yellow leaf (SCYL), caused by the sugarcane yellow leaf virus (SCYLV) is a major disease affecting sugarcane, a leading sugar and energy crop. Despite damages caused by SCYLV, the genetic base of resistance to this virus remains largely unknown. Several methodologies have arisen to identify molecular markers associated with SCYLV resistance, which are crucial for marker-assisted selection and understanding response mechanisms to this virus. We investigated the genetic base of SCYLV resistance using dominant and codominant markers and genotypes of interest for sugarcane breeding. A sugarcane panel inoculated with SCYLV was analyzed for SCYL symptoms, and viral titer was estimated by RT-qPCR. This panel was genotyped with 662 dominant markers and 70,888 SNPs and indels with allele proportion information. We used polyploid-adapted genome-wide association analyses and machine-learning algorithms coupled with feature selection methods to establish marker-trait associations. While each approach identified unique marker sets associated with phenotypes, convergences were observed between them and demonstrated their complementarity. Lastly, we annotated these markers, identifying genes encoding emblematic participants in virus resistance mechanisms and previously unreported candidates involved in viral responses. Our approach could accelerate sugarcane breeding targeting SCYLV resistance and facilitate studies on biological processes leading to this trait.
Collapse
|
6
|
Calderan-Rodrigues MJ, de Barros Dantas LL, Cheavegatti Gianotto A, Caldana C. Applying Molecular Phenotyping Tools to Explore Sugarcane Carbon Potential. FRONTIERS IN PLANT SCIENCE 2021; 12:637166. [PMID: 33679852 PMCID: PMC7935522 DOI: 10.3389/fpls.2021.637166] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Accepted: 01/27/2021] [Indexed: 05/21/2023]
Abstract
Sugarcane (Saccharum spp.), a C4 grass, has a peculiar feature: it accumulates, gradient-wise, large amounts of carbon (C) as sucrose in its culms through a complex pathway. Apart from being a sustainable crop concerning C efficiency and bioenergetic yield per hectare, sugarcane is used as feedstock for producing ethanol, sugar, high-value compounds, and products (e.g., polymers and succinate), and bioelectricity, earning the title of the world's leading biomass crop. Commercial cultivars, hybrids bearing high levels of polyploidy, and aneuploidy, are selected from a large number of crosses among suitable parental genotypes followed by the cloning of superior individuals among the progeny. Traditionally, these classical breeding strategies have been favoring the selection of cultivars with high sucrose content and resistance to environmental stresses. A current paradigm change in sugarcane breeding programs aims to alter the balance of C partitioning as a means to provide more plasticity in the sustainable use of this biomass for metabolic engineering and green chemistry. The recently available sugarcane genetic assemblies powered by data science provide exciting perspectives to increase biomass, as the current sugarcane yield is roughly 20% of its predicted potential. Nowadays, several molecular phenotyping tools can be applied to meet the predicted sugarcane C potential, mainly targeting two competing pathways: sucrose production/storage and biomass accumulation. Here we discuss how molecular phenotyping can be a powerful tool to assist breeding programs and which strategies could be adopted depending on the desired final products. We also tackle the advances in genetic markers and mapping as well as how functional genomics and genetic transformation might be able to improve yield and saccharification rates. Finally, we review how "omics" advances are promising to speed up plant breeding and reach the unexplored potential of sugarcane in terms of sucrose and biomass production.
Collapse
Affiliation(s)
| | | | | | - Camila Caldana
- Max Planck Institute of Molecular Plant Physiology, Potsdam, Germany
- *Correspondence: Camila Caldana,
| |
Collapse
|
7
|
Plant Proteomics and Systems Biology. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2021; 1346:51-66. [DOI: 10.1007/978-3-030-80352-0_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
8
|
Aono AH, Costa EA, Rody HVS, Nagai JS, Pimenta RJG, Mancini MC, Dos Santos FRC, Pinto LR, Landell MGDA, de Souza AP, Kuroshu RM. Machine learning approaches reveal genomic regions associated with sugarcane brown rust resistance. Sci Rep 2020; 10:20057. [PMID: 33208862 PMCID: PMC7676261 DOI: 10.1038/s41598-020-77063-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2020] [Accepted: 08/24/2020] [Indexed: 12/18/2022] Open
Abstract
Sugarcane is an economically important crop, but its genomic complexity has hindered advances in molecular approaches for genetic breeding. New cultivars are released based on the identification of interesting traits, and for sugarcane, brown rust resistance is a desirable characteristic due to the large economic impact of the disease. Although marker-assisted selection for rust resistance has been successful, the genes involved are still unknown, and the associated regions vary among cultivars, thus restricting methodological generalization. We used genotyping by sequencing of full-sib progeny to relate genomic regions with brown rust phenotypes. We established a pipeline to identify reliable SNPs in complex polyploid data, which were used for phenotypic prediction via machine learning. We identified 14,540 SNPs, which led to a mean prediction accuracy of 50% when using different models. We also tested feature selection algorithms to increase predictive accuracy, resulting in a reduced dataset with more explanatory power for rust phenotypes. As a result of this approach, we achieved an accuracy of up to 95% with a dataset of 131 SNPs related to brown rust QTL regions and auxiliary genes. Therefore, our novel strategy has the potential to assist studies of the genomic organization of brown rust resistance in sugarcane.
Collapse
Affiliation(s)
- Alexandre Hild Aono
- Molecular Biology and Genetic Engineering Center (CBMEG), University of Campinas (UNICAMP), Campinas, SP, Brazil
| | - Estela Araujo Costa
- Instituto de Ciência e Tecnologia (ICT), Universidade Federal de São Paulo (UNIFESP), São José dos Campos, SP, Brazil
| | - Hugo Vianna Silva Rody
- Instituto de Ciência e Tecnologia (ICT), Universidade Federal de São Paulo (UNIFESP), São José dos Campos, SP, Brazil
| | - James Shiniti Nagai
- Instituto de Ciência e Tecnologia (ICT), Universidade Federal de São Paulo (UNIFESP), São José dos Campos, SP, Brazil
| | - Ricardo José Gonzaga Pimenta
- Molecular Biology and Genetic Engineering Center (CBMEG), University of Campinas (UNICAMP), Campinas, SP, Brazil
| | - Melina Cristina Mancini
- Molecular Biology and Genetic Engineering Center (CBMEG), University of Campinas (UNICAMP), Campinas, SP, Brazil
| | | | - Luciana Rossini Pinto
- Advanced Center of Sugarcane Agrobusiness Technological Research, Agronomic Institute of Campinas (IAC), Ribeirão Preto, SP, Brazil
| | | | - Anete Pereira de Souza
- Molecular Biology and Genetic Engineering Center (CBMEG), University of Campinas (UNICAMP), Campinas, SP, Brazil.
- Department of Plant Biology, Institute of Biology (IB), University of Campinas (UNICAMP), Campinas, SP, Brazil.
| | - Reginaldo Massanobu Kuroshu
- Instituto de Ciência e Tecnologia (ICT), Universidade Federal de São Paulo (UNIFESP), São José dos Campos, SP, Brazil.
| |
Collapse
|
9
|
da Silva MF, Gonçalves MC, Brito MDS, Medeiros CN, Harakava R, Landell MGDA, Pinto LR. Sugarcane mosaic virus mediated changes in cytosine methylation pattern and differentially transcribed fragments in resistance-contrasting sugarcane genotypes. PLoS One 2020; 15:e0241493. [PMID: 33166323 PMCID: PMC7652275 DOI: 10.1371/journal.pone.0241493] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Accepted: 10/16/2020] [Indexed: 12/24/2022] Open
Abstract
Sugarcane mosaic virus (SCMV) is the causal agent of sugarcane mosaic disease (SMD) in Brazil; it is mainly controlled by using resistant cultivars. Studies on the changes in sugarcane transcriptome provided the first insights about the molecular basis underlying the genetic resistance to SMD; nonetheless, epigenetic modifications such as cytosine methylation is also informative, considering its roles in gene expression regulation. In our previous study, differentially transcribed fragments (DTFs) were obtained using cDNA-amplified fragment length polymorphism by comparing mock- and SCMV-inoculated plants from two sugarcane cultivars with contrasting responses to SMD. In this study, the identification of unexplored DTFs was continued while the same leaf samples were used to evaluate SCMV-mediated changes in the cytosine methylation pattern by using methylation-sensitive amplification polymorphism. This analysis revealed minor changes in cytosine methylation in response to SCMV infection, but distinct changes between the cultivars with contrasting responses to SMD, with higher hypomethylation events 24 and 72 h post-inoculation in the resistant cultivar. The differentially methylated fragments (DMFs) aligned with transcripts, putative promoters, and genomic regions, with a preponderant distribution within CpG islands. The transcripts found were associated with plant immunity and other stress responses, epigenetic changes, and transposable elements. The DTFs aligned with transcripts assigned to stress responses, epigenetic changes, photosynthesis, lipid transport, and oxidoreductases, in which the transcriptional start site is located in proximity with CpG islands and tandem repeats. Real-time quantitative polymerase chain reaction results revealed significant upregulation in the resistant cultivar of aspartyl protease and VQ protein, respectively, selected from DMF and DTF alignments, suggesting their roles in genetic resistance to SMD and supporting the influence of cytosine methylation in gene expression. Thus, we identified new candidate genes for further validation and showed that the changes in cytosine methylation may regulate important mechanisms underlying the genetic resistance to SMD.
Collapse
Affiliation(s)
- Marcel Fernando da Silva
- Biologia Aplicada à Agropecuária, Faculdade de Ciências Agrárias e Veterinárias (FCAV) Universidade Estadual Paulista “Júlio de Mesquita Filho”, Jaboticabal, São Paulo, Brazil
| | | | - Michael dos Santos Brito
- Departamento de Ciência e Tecnologia, Instituto de Ciência e Tecnologia da Universidade Federal de São Paulo, São José dos Campos, São Paulo, Brazil
| | | | - Ricardo Harakava
- Crop Protection Research Centre, Instituto Biológico, São Paulo, Brazil
| | | | | |
Collapse
|
10
|
Manimekalai R, Suresh G, Govinda Kurup H, Athiappan S, Kandalam M. Role of NGS and SNP genotyping methods in sugarcane improvement programs. Crit Rev Biotechnol 2020; 40:865-880. [PMID: 32508157 DOI: 10.1080/07388551.2020.1765730] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Sugarcane (Saccharum spp.) is one of the most economically significant crops because of its high sucrose content and it is a promising biomass feedstock for biofuel production. Sugarcane genome sequencing and analysis is a difficult task due to its heterozygosity and polyploidy. Long sequence read technologies, PacBio Single-Molecule Real-Time (SMRT) sequencing, the Illumina TruSeq, and the Oxford Nanopore sequencing could solve the problem of genome assembly. On the applications side, next generation sequencing (NGS) technologies played a major role in the discovery of single nucleotide polymorphism (SNP) and the development of low to high throughput genotyping platforms. The two mainstream high throughput genotyping platforms are the SNP microarray and genotyping by sequencing (GBS). This paper reviews the NGS in sugarcane genomics, genotyping methodologies, and the choice of these methods. Array-based SNP genotyping is robust, provides consistent SNPs, and relatively easier downstream data analysis. The GBS method identifies large scale SNPs across the germplasm. A combination of targeted GBS and array-based genotyping methods should be used to increase the accuracy of genomic selection and marker-assisted breeding.
Collapse
Affiliation(s)
- Ramaswamy Manimekalai
- Crop Improvement Division, ICAR - Sugarcane Breeding Institute, Indian Council of Agricultural Research (ICAR), Coimbatore, Tamil Nadu, India
| | - Gayathri Suresh
- Crop Improvement Division, ICAR - Sugarcane Breeding Institute, Indian Council of Agricultural Research (ICAR), Coimbatore, Tamil Nadu, India
| | - Hemaprabha Govinda Kurup
- Crop Improvement Division, ICAR - Sugarcane Breeding Institute, Indian Council of Agricultural Research (ICAR), Coimbatore, Tamil Nadu, India
| | - Selvi Athiappan
- Crop Improvement Division, ICAR - Sugarcane Breeding Institute, Indian Council of Agricultural Research (ICAR), Coimbatore, Tamil Nadu, India
| | - Mallikarjuna Kandalam
- Business Development, Asia Pacific Japan region, Thermo Fisher Scientific, Waltham, MA, USA
| |
Collapse
|
11
|
Lloyd Evans D, Hlongwane TT, Joshi SV, Riaño Pachón DM. The sugarcane mitochondrial genome: assembly, phylogenetics and transcriptomics. PeerJ 2019; 7:e7558. [PMID: 31579570 PMCID: PMC6764373 DOI: 10.7717/peerj.7558] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2019] [Accepted: 07/26/2019] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Chloroplast genomes provide insufficient phylogenetic information to distinguish between closely related sugarcane cultivars, due to the recent origin of many cultivars and the conserved sequence of the chloroplast. In comparison, the mitochondrial genome of plants is much larger and more plastic and could contain increased phylogenetic signals. We assembled a consensus reference mitochondrion with Illumina TruSeq synthetic long reads and Oxford Nanopore Technologies MinION long reads. Based on this assembly we also analyzed the mitochondrial transcriptomes of sugarcane and sorghum and improved the annotation of the sugarcane mitochondrion as compared with other species. METHODS Mitochondrial genomes were assembled from genomic read pools using a bait and assemble methodology. The mitogenome was exhaustively annotated using BLAST and transcript datasets were mapped with HISAT2 prior to analysis with the Integrated Genome Viewer. RESULTS The sugarcane mitochondrion is comprised of two independent chromosomes, for which there is no evidence of recombination. Based on the reference assembly from the sugarcane cultivar SP80-3280 the mitogenomes of four additional cultivars (R570, LCP85-384, RB72343 and SP70-1143) were assembled (with the SP70-1143 assembly utilizing both genomic and transcriptomic data). We demonstrate that the sugarcane plastome is completely transcribed and we assembled the chloroplast genome of SP80-3280 using transcriptomic data only. Phylogenomic analysis using mitogenomes allow closely related sugarcane cultivars to be distinguished and supports the discrimination between Saccharum officinarum and Saccharum cultum as modern sugarcane's female parent. From whole chloroplast comparisons, we demonstrate that modern sugarcane arose from a limited number of Saccharum cultum female founders. Transcriptomic and spliceosomal analyses reveal that the two chromosomes of the sugarcane mitochondrion are combined at the transcript level and that splice sites occur more frequently within gene coding regions than without. We reveal one confirmed and one potential cytoplasmic male sterility (CMS) factor in the sugarcane mitochondrion, both of which are transcribed. CONCLUSION Transcript processing in the sugarcane mitochondrion is highly complex with diverse splice events, the majority of which span the two chromosomes. PolyA baited transcripts are consistent with the use of polyadenylation for transcript degradation. For the first time we annotate two CMS factors within the sugarcane mitochondrion and demonstrate that sugarcane possesses all the molecular machinery required for CMS and rescue. A mechanism of cross-chromosomal splicing based on guide RNAs is proposed. We also demonstrate that mitogenomes can be used to perform phylogenomic studies on sugarcane cultivars.
Collapse
Affiliation(s)
- Dyfed Lloyd Evans
- Plant Breeding, South African Sugarcane Research Institute, Durban, KwaZulu-Natal, South Africa
- Cambridge Sequence Services (CSS), Waterbeach, Cambridgeshire, UK
- Department of Computer Sciences, Université Cheikh Anta Diop de Dakar, Dakar, Sénégal
| | | | - Shailesh V. Joshi
- Plant Breeding, South African Sugarcane Research Institute, Durban, KwaZulu-Natal, South Africa
- School of Life Sciences, College of Agriculture Engineering and Science, University of KwaZulu-Natal, Durban, KwaZulu-Natal, South Africa
| | - Diego M. Riaño Pachón
- Computational, Evolutionary and Systems Biology Laboratory, Center for Nuclear Energy in Agriculture, University of São Paulo, Piracicaba, São Paulo, Brazil
| |
Collapse
|
12
|
Grativol C, Thiebaut F, Sangi S, Montessoro P, Santos WDS, Hemerly AS, Ferreira PC. A miniature inverted-repeat transposable element, AddIn-MITE, located inside a WD40 gene is conserved in Andropogoneae grasses. PeerJ 2019; 7:e6080. [PMID: 30648010 PMCID: PMC6331000 DOI: 10.7717/peerj.6080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2018] [Accepted: 11/07/2018] [Indexed: 11/25/2022] Open
Abstract
Miniature inverted-repeat transposable elements (MITEs) have been associated with genic regions in plant genomes and may play important roles in the regulation of nearby genes via recruitment of small RNAs (sRNA) to the MITEs loci. We identified eight families of MITEs in the sugarcane genome assembly with MITE-Hunter pipeline. These sequences were found to be upstream, downstream or inserted into 67 genic regions in the genome. The position of the most abundant MITE (Stowaway-like) in genic regions, which we call AddIn-MITE, was confirmed in a WD40 gene. The analysis of four monocot species showed conservation of the AddIn-MITE sequence, with a large number of copies in their genomes. We also investigated the conservation of the AddIn-MITE’ position in the WD40 genes from sorghum, maize and, in sugarcane cultivars and wild Saccharum species. In all analyzed plants, AddIn-MITE has located in WD40 intronic region. Furthermore, the role of AddIn-MITE-related sRNA in WD40 genic region was investigated. We found sRNAs preferentially mapped to the AddIn-MITE than to other regions in the WD40 gene in sugarcane. In addition, the analysis of the small RNA distribution patterns in the WD40 gene and the structure of AddIn-MITE, suggests that the MITE region is a proto-miRNA locus in sugarcane. Together, these data provide insights into the AddIn-MITE role in Andropogoneae grasses.
Collapse
Affiliation(s)
- Clicia Grativol
- Laboratório de Química e Função de Proteínas e Peptídeos/Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense, Campos dos Goytacazes, Rio de Janeiro, Brazil
| | - Flavia Thiebaut
- Laboratório de Biologia Molecular de Plantas/Instituto de Bioquímica Médica Leopoldo De Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Rio de Janeiro, Brazil
| | - Sara Sangi
- Laboratório de Química e Função de Proteínas e Peptídeos/Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense, Campos dos Goytacazes, Rio de Janeiro, Brazil
| | - Patricia Montessoro
- Laboratório de Biologia Molecular de Plantas/Instituto de Bioquímica Médica Leopoldo De Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Rio de Janeiro, Brazil
| | - Walaci da Silva Santos
- Laboratório de Química e Função de Proteínas e Peptídeos/Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense, Campos dos Goytacazes, Rio de Janeiro, Brazil
| | - Adriana S. Hemerly
- Laboratório de Biologia Molecular de Plantas/Instituto de Bioquímica Médica Leopoldo De Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Rio de Janeiro, Brazil
| | - Paulo C.G. Ferreira
- Laboratório de Biologia Molecular de Plantas/Instituto de Bioquímica Médica Leopoldo De Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Rio de Janeiro, Brazil
| |
Collapse
|
13
|
Thirugnanasambandam PP, Hoang NV, Henry RJ. The Challenge of Analyzing the Sugarcane Genome. FRONTIERS IN PLANT SCIENCE 2018; 9:616. [PMID: 29868072 PMCID: PMC5961476 DOI: 10.3389/fpls.2018.00616] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2018] [Accepted: 04/18/2018] [Indexed: 05/04/2023]
Abstract
Reference genome sequences have become key platforms for genetics and breeding of the major crop species. Sugarcane is probably the largest crop produced in the world (in weight of crop harvested) but lacks a reference genome sequence. Sugarcane has one of the most complex genomes in crop plants due to the extreme level of polyploidy. The genome of modern sugarcane hybrids includes sub-genomes from two progenitors Saccharum officinarum and S. spontaneum with some chromosomes resulting from recombination between these sub-genomes. Advancing DNA sequencing technologies and strategies for genome assembly are making the sugarcane genome more tractable. Advances in long read sequencing have allowed the generation of a more complete set of sugarcane gene transcripts. This is supporting transcript profiling in genetic research. The progenitor genomes are being sequenced. A monoploid coverage of the hybrid genome has been obtained by sequencing BAC clones that cover the gene space of the closely related sorghum genome. The complete polyploid genome is now being sequenced and assembled. The emerging genome will allow comparison of related genomes and increase understanding of the functioning of this polyploidy system. Sugarcane breeding for traditional sugar and new energy and biomaterial uses will be enhanced by the availability of these genomic resources.
Collapse
Affiliation(s)
- Prathima P. Thirugnanasambandam
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, Australia
- ICAR - Sugarcane Breeding Institute, Coimbatore, India
- *Correspondence: Prathima P. Thirugnanasambandam,
| | - Nam V. Hoang
- College of Agriculture and Forestry, Hue University, Hue, Vietnam
| | - Robert J. Henry
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, Australia
| |
Collapse
|
14
|
Thiebaut F, Rojas CA, Grativol C, Calixto EPDR, Motta MR, Ballesteros HGF, Peixoto B, de Lima BNS, Vieira LM, Walter ME, de Armas EM, Entenza JOP, Lifschitz S, Farinelli L, Hemerly AS, Ferreira PCG. Roles of Non-Coding RNA in Sugarcane-Microbe Interaction. Noncoding RNA 2017; 3:E25. [PMID: 29657296 PMCID: PMC5831913 DOI: 10.3390/ncrna3040025] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2017] [Revised: 12/11/2017] [Accepted: 12/19/2017] [Indexed: 12/19/2022] Open
Abstract
Studies have highlighted the importance of non-coding RNA regulation in plant-microbe interaction. However, the roles of sugarcane microRNAs (miRNAs) in the regulation of disease responses have not been investigated. Firstly, we screened the sRNA transcriptome of sugarcane infected with Acidovorax avenae. Conserved and novel miRNAs were identified. Additionally, small interfering RNAs (siRNAs) were aligned to differentially expressed sequences from the sugarcane transcriptome. Interestingly, many siRNAs aligned to a transcript encoding a copper-transporter gene whose expression was induced in the presence of A. avenae, while the siRNAs were repressed in the presence of A. avenae. Moreover, a long intergenic non-coding RNA was identified as a potential target or decoy of miR408. To extend the bioinformatics analysis, we carried out independent inoculations and the expression patterns of six miRNAs were validated by quantitative reverse transcription-PCR (qRT-PCR). Among these miRNAs, miR408-a copper-microRNA-was downregulated. The cleavage of a putative miR408 target, a laccase, was confirmed by a modified 5'RACE (rapid amplification of cDNA ends) assay. MiR408 was also downregulated in samples infected with other pathogens, but it was upregulated in the presence of a beneficial diazotrophic bacteria. Our results suggest that regulation by miR408 is important in sugarcane sensing whether microorganisms are either pathogenic or beneficial, triggering specific miRNA-mediated regulatory mechanisms accordingly.
Collapse
Affiliation(s)
- Flávia Thiebaut
- Laboratório de Biologia Molecular de Plantas, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-901, Brazil.
| | - Cristian A Rojas
- Universidade Federal da INTEGRAÇÃO Latino-Americana, Foz do Iguaçu 85866-000, Brazil.
| | - Clícia Grativol
- Laboratório de Química e Função de Proteínas e Peptídeos, Universidade Estadual do Norte Fluminense, Campos dos Goytacazes 28013-602, Brazil.
| | - Edmundo P da R Calixto
- Laboratório de Biologia Molecular de Plantas, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-901, Brazil.
| | - Mariana R Motta
- Laboratório de Biologia Molecular de Plantas, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-901, Brazil.
| | - Helkin G F Ballesteros
- Laboratório de Biologia Molecular de Plantas, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-901, Brazil.
| | - Barbara Peixoto
- Laboratório de Biologia Molecular de Plantas, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-901, Brazil.
| | - Berenice N S de Lima
- Laboratório de Biologia Molecular de Plantas, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-901, Brazil.
| | - Lucas M Vieira
- Departamento de Ciência da Computação, Universidade de Brasília, Brasília 70910-900, Brazil.
| | - Maria Emilia Walter
- Departamento de Ciência da Computação, Universidade de Brasília, Brasília 70910-900, Brazil.
| | - Elvismary M de Armas
- Departamento de Informática, Pontifícia Universidade Católica do Rio de Janeiro, Rio de Janeiro 22451-900, Brazil.
| | - Júlio O P Entenza
- Departamento de Informática, Pontifícia Universidade Católica do Rio de Janeiro, Rio de Janeiro 22451-900, Brazil.
| | - Sergio Lifschitz
- Departamento de Informática, Pontifícia Universidade Católica do Rio de Janeiro, Rio de Janeiro 22451-900, Brazil.
| | | | - Adriana S Hemerly
- Laboratório de Biologia Molecular de Plantas, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-901, Brazil.
| | - Paulo C G Ferreira
- Laboratório de Biologia Molecular de Plantas, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-901, Brazil.
| |
Collapse
|
15
|
Vilela MDM, Del Bem LE, Van Sluys MA, de Setta N, Kitajima JP, Cruz GMQ, Sforça DA, de Souza AP, Ferreira PCG, Grativol C, Cardoso-Silva CB, Vicentini R, Vincentz M. Analysis of Three Sugarcane Homo/Homeologous Regions Suggests Independent Polyploidization Events of Saccharum officinarum and Saccharum spontaneum. Genome Biol Evol 2017; 9:266-278. [PMID: 28082603 PMCID: PMC5381655 DOI: 10.1093/gbe/evw293] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/12/2016] [Indexed: 12/23/2022] Open
Abstract
Whole genome duplication has played an important role in plant evolution and diversification. Sugarcane is an important crop with a complex hybrid polyploid genome, for which the process of adaptation to polyploidy is still poorly understood. In order to improve our knowledge about sugarcane genome evolution and the homo/homeologous gene expression balance, we sequenced and analyzed 27 BACs (Bacterial Artificial Chromosome) of sugarcane R570 cultivar, containing the putative single-copy genes LFY (seven haplotypes), PHYC (four haplotypes), and TOR (seven haplotypes). Comparative genomic approaches showed that these sugarcane loci presented a high degree of conservation of gene content and collinearity (synteny) with sorghum and rice orthologous regions, but were invaded by transposable elements (TE). All the homo/homeologous haplotypes of LFY, PHYC, and TOR are likely to be functional, because they are all under purifying selection (dN/dS ≪ 1). However, they were found to participate in a nonequivalently manner to the overall expression of the corresponding gene. SNPs, indels, and amino acid substitutions allowed inferring the S. officinarum or S. spontaneum origin of the TOR haplotypes, which further led to the estimation that these two sugarcane ancestral species diverged between 2.5 and 3.5 Ma. In addition, analysis of shared TE insertions in TOR haplotypes suggested that two autopolyploidization may have occurred in the lineage that gave rise to S. officinarum, after its divergence from S. spontaneum.
Collapse
Affiliation(s)
- Mariane de Mendonça Vilela
- Centro de Biologia Molecular e Engenharia Genética, Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Campinas, SP, Brazil
| | - Luiz Eduardo Del Bem
- Centro de Biologia Molecular e Engenharia Genética, Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Campinas, SP, Brazil
| | - Marie-Anne Van Sluys
- Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, SP, Brazil
| | - Nathalia de Setta
- Universidade Federal do ABC (UFABC), São Bernardo do Campo, SP, Brazil
| | | | | | - Danilo Augusto Sforça
- Centro de Biologia Molecular e Engenharia Genética, Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Campinas, SP, Brazil
| | - Anete Pereira de Souza
- Centro de Biologia Molecular e Engenharia Genética, Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Campinas, SP, Brazil
| | | | - Clícia Grativol
- Laboratório de Química e Função de Proteínas e Peptídeos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Parque Califórnia, Campos dos Goytacazes, RJ, Brazil
| | - Claudio Benicio Cardoso-Silva
- Centro de Biologia Molecular e Engenharia Genética, Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Campinas, SP, Brazil
| | - Renato Vicentini
- Centro de Biologia Molecular e Engenharia Genética, Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Campinas, SP, Brazil
| | - Michel Vincentz
- Centro de Biologia Molecular e Engenharia Genética, Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Campinas, SP, Brazil
| |
Collapse
|
16
|
Yang X, Song J, You Q, Paudel DR, Zhang J, Wang J. Mining sequence variations in representative polyploid sugarcane germplasm accessions. BMC Genomics 2017; 18:594. [PMID: 28793856 PMCID: PMC5551020 DOI: 10.1186/s12864-017-3980-3] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2016] [Accepted: 08/01/2017] [Indexed: 11/10/2022] Open
Abstract
Background Sugarcane (Saccharum spp.) is one of the most important economic crops because of its high sugar production and biofuel potential. Due to the high polyploid level and complex genome of sugarcane, it has been a huge challenge to investigate genomic sequence variations, which are critical for identifying alleles contributing to important agronomic traits. In order to mine the genetic variations in sugarcane, genotyping by sequencing (GBS), was used to genotype 14 representative Saccharum complex accessions. GBS is a method to generate a large number of markers, enabled by next generation sequencing (NGS) and the genome complexity reduction using restriction enzymes. Results To use GBS for high throughput genotyping highly polyploid sugarcane, the GBS analysis pipelines in 14 Saccharum complex accessions were established by evaluating different alignment methods, sequence variants callers, and sequence depth for single nucleotide polymorphism (SNP) filtering. By using the established pipeline, a total of 76,251 non-redundant SNPs, 5642 InDels, 6380 presence/absence variants (PAVs), and 826 copy number variations (CNVs) were detected among the 14 accessions. In addition, non-reference based universal network enabled analysis kit and Stacks de novo called 34,353 and 109,043 SNPs, respectively. In the 14 accessions, the percentages of single dose SNPs ranged from 38.3% to 62.3% with an average of 49.6%, much more than the portions of multiple dosage SNPs. Concordantly called SNPs were used to evaluate the phylogenetic relationship among the 14 accessions. The results showed that the divergence time between the Erianthus genus and the Saccharum genus was more than 10 million years ago (MYA). The Saccharum species separated from their common ancestors ranging from 0.19 to 1.65 MYA. Conclusions The GBS pipelines including the reference sequences, alignment methods, sequence variant callers, and sequence depth were recommended and discussed for the Saccharum complex and other related species. A large number of sequence variations were discovered in the Saccharum complex, including SNPs, InDels, PAVs, and CNVs. Genome-wide SNPs were further used to illustrate sequence features of polyploid species and demonstrated the divergence of different species in the Saccharum complex. The results of this study showed that GBS was an effective NGS-based method to discover genomic sequence variations in highly polyploid and heterozygous species.
Collapse
Affiliation(s)
- Xiping Yang
- Department of Agronomy, University of Florida, Gainesville, FL, 32610, USA
| | - Jian Song
- Department of Agronomy, University of Florida, Gainesville, FL, 32610, USA
| | - Qian You
- Department of Agronomy, University of Florida, Gainesville, FL, 32610, USA
| | - Dev R Paudel
- Department of Agronomy, University of Florida, Gainesville, FL, 32610, USA
| | - Jisen Zhang
- FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Haixia Institute of Science and Techonology, Fujian Agriculture and Forestry University, Fuzhou, Fujian, 350002, China
| | - Jianping Wang
- Department of Agronomy, University of Florida, Gainesville, FL, 32610, USA. .,FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Haixia Institute of Science and Techonology, Fujian Agriculture and Forestry University, Fuzhou, Fujian, 350002, China. .,Genetics Institute, Plant Molecular and Biology Program, University of Florida, Gainesville, FL, 32610, USA.
| |
Collapse
|
17
|
Identification, classification and transcriptional profiles of dirigent domain-containing proteins in sugarcane. Mol Genet Genomics 2017; 292:1323-1340. [PMID: 28699001 DOI: 10.1007/s00438-017-1349-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2017] [Accepted: 07/04/2017] [Indexed: 01/13/2023]
Abstract
Dirigent (DIR) proteins, encoded by DIR genes, are referred to as "dirigent" because they direct the outcome of the coupling of the monolignol coniferyl alcohol into (+) or (-) pinoresinol, the first intermediates in the enantiocomplementary pathways for lignan biosynthesis. DIR domain-containing or DIR-like proteins are, thus, termed for not having a clear characterization. A transcriptome- and genome-wide survey of DIR domain-containing proteins in sugarcane was carried out, in addition to phylogenetic, physicochemical and transcriptional analyses. A total of 120 non-redundant sequences containing the DIR domain were identified and classified into 64 groups according to phylogenetic and sequence alignment analyses. In silico analysis of transcript abundance showed that these sequences are expressed at low levels in leaves and genes in the same phylogenetic clade have similar expression patterns. Expression analysis of ShDIR1-like transcripts in the culm internodes of sugarcane demonstrates their abundance in mature internodes, their induction by nitrogen fertilization and their predominant expression in cells that have a lignified secondary cell wall, such as vascular bundles of young internodes and parenchymal cells of the pith of mature internodes. Due to the lack of information about the functional role of DIR in plants, a possible relationship is discussed between the ShDIR1-like transcriptional profile and cell wall development in parenchyma cells of sugarcane culm, which typically accumulates large amounts of sucrose. The number of genes encoding the DIR domain-containing proteins in sugarcane is intriguing and is an indication per se that these proteins may have an important metabolic role and thus deserve to be better studied.
Collapse
|
18
|
Abstract
Sugarcane commercial cultivar SP80-3280 has been used as a model for genomic analyses in Brazil. Here we present a draft genome sequence employing Illumina TruSeq Synthetic Long reads. The dataset is available from NCBI BioProject with accession PRJNA272769.
Collapse
Affiliation(s)
- Diego Mauricio Riaño-Pachón
- Brazilian Bioethanol Science and Technology Laboratory (CTBE), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, SP, Brazil
- Laboratory of Regulatory Systems Biology, Department of Biochemistry, Institute of Chemistry, University of São Paulo, São Paulo, SP, Brazil
| | - Lucia Mattiello
- Brazilian Bioethanol Science and Technology Laboratory (CTBE), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, SP, Brazil
- Functional Genome Laboratory, Department of Genetics, Evolution and Bioagents, Institute of Biology, State University of Campinas, Campinas, SP, Brazil
| |
Collapse
|
19
|
Abstract
Sugarcane commercial cultivar SP80-3280 has been used as a model for genomic analyses in Brazil. Here we present a draft genome sequence employing Illumina TruSeq Synthetic Long reads. The dataset is available from NCBI BioProject with accession PRJNA272769.
Collapse
Affiliation(s)
- Diego Mauricio Riaño-Pachón
- Brazilian Bioethanol Science and Technology Laboratory (CTBE), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, SP, Brazil.,Laboratory of Regulatory Systems Biology, Department of Biochemistry, Institute of Chemistry, University of São Paulo, São Paulo, SP, Brazil
| | - Lucia Mattiello
- Brazilian Bioethanol Science and Technology Laboratory (CTBE), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, SP, Brazil.,Functional Genome Laboratory, Department of Genetics, Evolution and Bioagents, Institute of Biology, State University of Campinas, Campinas, SP, Brazil
| |
Collapse
|
20
|
Balsalobre TWA, da Silva Pereira G, Margarido GRA, Gazaffi R, Barreto FZ, Anoni CO, Cardoso-Silva CB, Costa EA, Mancini MC, Hoffmann HP, de Souza AP, Garcia AAF, Carneiro MS. GBS-based single dosage markers for linkage and QTL mapping allow gene mining for yield-related traits in sugarcane. BMC Genomics 2017; 18:72. [PMID: 28077090 PMCID: PMC5225503 DOI: 10.1186/s12864-016-3383-x] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Accepted: 12/07/2016] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Sugarcane (Saccharum spp.) is predominantly an autopolyploid plant with a variable ploidy level, frequent aneuploidy and a large genome that hampers investigation of its organization. Genetic architecture studies are important for identifying genomic regions associated with traits of interest. However, due to the genetic complexity of sugarcane, the practical applications of genomic tools have been notably delayed in this crop, in contrast to other crops that have already advanced to marker-assisted selection (MAS) and genomic selection. High-throughput next-generation sequencing (NGS) technologies have opened new opportunities for discovering molecular markers, especially single nucleotide polymorphisms (SNPs) and insertion-deletion (indels), at the genome-wide level. The objectives of this study were to (i) establish a pipeline for identifying variants from genotyping-by-sequencing (GBS) data in sugarcane, (ii) construct an integrated genetic map with GBS-based markers plus target region amplification polymorphisms and microsatellites, (iii) detect QTLs related to yield component traits, and (iv) perform annotation of the sequences that originated the associated markers with mapped QTLs to search putative candidate genes. RESULTS We used four pseudo-references to align the GBS reads. Depending on the reference, from 3,433 to 15,906 high-quality markers were discovered, and half of them segregated as single-dose markers (SDMs) on average. In addition to 7,049 non-redundant SDMs from GBS, 629 gel-based markers were used in a subsequent linkage analysis. Of 7,678 SDMs, 993 were mapped. These markers were distributed throughout 223 linkage groups, which were clustered in 18 homo(eo)logous groups (HGs), with a cumulative map length of 3,682.04 cM and an average marker density of 3.70 cM. We performed QTL mapping of four traits and found seven QTLs. Our results suggest the presence of a stable QTL across locations. Furthermore, QTLs to soluble solid content (BRIX) and fiber content (FIB) traits had markers linked to putative candidate genes. CONCLUSIONS This study is the first to report the use of GBS for large-scale variant discovery and genotyping of a mapping population in sugarcane, providing several insights regarding the use of NGS data in a polyploid, non-model species. The use of GBS generated a large number of markers and still enabled ploidy and allelic dosage estimation. Moreover, we were able to identify seven QTLs, two of which had great potential for validation and future use for molecular breeding in sugarcane.
Collapse
Affiliation(s)
- Thiago Willian Almeida Balsalobre
- Departamento de Biotecnologia e Produção Vegetal e Animal, Centro de Ciências Agrárias, Universidade Federal de São Carlos, Rodovia Anhanguera, Km 174, Araras, CEP 13600-970 São Paulo Brazil
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Avenida Monteiro Lobato 255, Campinas, CEP 13083-862 São Paulo Brazil
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas, Avenida Candido Rondon 400, Campinas, CEP 13083-875 São Paulo Brazil
| | - Guilherme da Silva Pereira
- Departamento de Genética, Escola Superior de Agricultura Luiz de Queiroz, Universidade de São Paulo, Avenida Pádua Dias 11, Piracicaba, CEP 13418-900 São Paulo Brazil
| | - Gabriel Rodrigues Alves Margarido
- Departamento de Genética, Escola Superior de Agricultura Luiz de Queiroz, Universidade de São Paulo, Avenida Pádua Dias 11, Piracicaba, CEP 13418-900 São Paulo Brazil
| | - Rodrigo Gazaffi
- Departamento de Biotecnologia e Produção Vegetal e Animal, Centro de Ciências Agrárias, Universidade Federal de São Carlos, Rodovia Anhanguera, Km 174, Araras, CEP 13600-970 São Paulo Brazil
| | - Fernanda Zatti Barreto
- Departamento de Biotecnologia e Produção Vegetal e Animal, Centro de Ciências Agrárias, Universidade Federal de São Carlos, Rodovia Anhanguera, Km 174, Araras, CEP 13600-970 São Paulo Brazil
| | - Carina Oliveira Anoni
- Departamento de Genética, Escola Superior de Agricultura Luiz de Queiroz, Universidade de São Paulo, Avenida Pádua Dias 11, Piracicaba, CEP 13418-900 São Paulo Brazil
| | - Cláudio Benício Cardoso-Silva
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Avenida Monteiro Lobato 255, Campinas, CEP 13083-862 São Paulo Brazil
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas, Avenida Candido Rondon 400, Campinas, CEP 13083-875 São Paulo Brazil
| | - Estela Araújo Costa
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Avenida Monteiro Lobato 255, Campinas, CEP 13083-862 São Paulo Brazil
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas, Avenida Candido Rondon 400, Campinas, CEP 13083-875 São Paulo Brazil
| | - Melina Cristina Mancini
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Avenida Monteiro Lobato 255, Campinas, CEP 13083-862 São Paulo Brazil
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas, Avenida Candido Rondon 400, Campinas, CEP 13083-875 São Paulo Brazil
| | - Hermann Paulo Hoffmann
- Departamento de Biotecnologia e Produção Vegetal e Animal, Centro de Ciências Agrárias, Universidade Federal de São Carlos, Rodovia Anhanguera, Km 174, Araras, CEP 13600-970 São Paulo Brazil
| | - Anete Pereira de Souza
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Avenida Monteiro Lobato 255, Campinas, CEP 13083-862 São Paulo Brazil
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas, Avenida Candido Rondon 400, Campinas, CEP 13083-875 São Paulo Brazil
| | - Antonio Augusto Franco Garcia
- Departamento de Genética, Escola Superior de Agricultura Luiz de Queiroz, Universidade de São Paulo, Avenida Pádua Dias 11, Piracicaba, CEP 13418-900 São Paulo Brazil
| | - Monalisa Sampaio Carneiro
- Departamento de Biotecnologia e Produção Vegetal e Animal, Centro de Ciências Agrárias, Universidade Federal de São Carlos, Rodovia Anhanguera, Km 174, Araras, CEP 13600-970 São Paulo Brazil
| |
Collapse
|
21
|
Riaño-Pachón DM, Mattiello L. Draft genome sequencing of the sugarcane hybrid SP80-3280. F1000Res 2017. [PMID: 28713559 DOI: 10.12688/f1000research10.12688/f1000research.11859.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 05/07/2023] Open
Abstract
Sugarcane commercial cultivar SP80-3280 has been used as a model for genomic analyses in Brazil. Here we present a draft genome sequence employing Illumina TruSeq Synthetic Long reads. The dataset is available from NCBI BioProject with accession PRJNA272769.
Collapse
Affiliation(s)
- Diego Mauricio Riaño-Pachón
- Brazilian Bioethanol Science and Technology Laboratory (CTBE), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, SP, Brazil
- Laboratory of Regulatory Systems Biology, Department of Biochemistry, Institute of Chemistry, University of São Paulo, São Paulo, SP, Brazil
| | - Lucia Mattiello
- Brazilian Bioethanol Science and Technology Laboratory (CTBE), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, SP, Brazil
- Functional Genome Laboratory, Department of Genetics, Evolution and Bioagents, Institute of Biology, State University of Campinas, Campinas, SP, Brazil
| |
Collapse
|
22
|
Shearman JR, Sonthirod C, Naktang C, Pootakham W, Yoocha T, Sangsrakru D, Jomchai N, Tragoonrung S, Tangphatsornruang S. The two chromosomes of the mitochondrial genome of a sugarcane cultivar: assembly and recombination analysis using long PacBio reads. Sci Rep 2016; 6:31533. [PMID: 27530092 PMCID: PMC4987617 DOI: 10.1038/srep31533] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2016] [Accepted: 07/21/2016] [Indexed: 11/20/2022] Open
Abstract
Sugarcane accounts for a large portion of the worlds sugar production. Modern commercial cultivars are complex hybrids of S. officinarum and several other Saccharum species. Historical records identify New Guinea as the origin of S. officinarum and that a small number of plants originating from there were used to generate all modern commercial cultivars. The mitochondrial genome can be a useful way to identify the maternal origin of commercial cultivars. We have used the PacBio RSII to sequence and assemble the mitochondrial genome of a South East Asian commercial cultivar, known as Khon Kaen 3. The long read length of this sequencing technology allowed for the mitochondrial genome to be assembled into two distinct circular chromosomes with all repeat sequences spanned by individual reads. Comparison of five commercial hybrids, two S. officinarum and one S. spontaneum to our assembly reveals no structural rearrangements between our assembly, the commercial hybrids and an S. officinarum from New Guinea. The S. spontaneum, from India, and one sample of S. officinarum (unknown origin) are substantially rearranged and have a large number of homozygous variants. This supports the record that S. officinarum plants from New Guinea are the maternal source of all modern commercial hybrids.
Collapse
Affiliation(s)
- Jeremy R Shearman
- National Center for Genetic Engineering and Biotechnology, 113 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani, 12120, Thailand
| | - Chutima Sonthirod
- National Center for Genetic Engineering and Biotechnology, 113 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani, 12120, Thailand
| | - Chaiwat Naktang
- National Center for Genetic Engineering and Biotechnology, 113 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani, 12120, Thailand
| | - Wirulda Pootakham
- National Center for Genetic Engineering and Biotechnology, 113 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani, 12120, Thailand
| | - Thippawan Yoocha
- National Center for Genetic Engineering and Biotechnology, 113 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani, 12120, Thailand
| | - Duangjai Sangsrakru
- National Center for Genetic Engineering and Biotechnology, 113 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani, 12120, Thailand
| | - Nukoon Jomchai
- National Center for Genetic Engineering and Biotechnology, 113 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani, 12120, Thailand
| | - Somvong Tragoonrung
- National Center for Genetic Engineering and Biotechnology, 113 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani, 12120, Thailand
| | - Sithichoke Tangphatsornruang
- National Center for Genetic Engineering and Biotechnology, 113 Thailand Science Park, Paholyothin Road, Khlong Nueng, Khlong Luang, Pathumthani, 12120, Thailand
| |
Collapse
|
23
|
Song J, Yang X, Resende MFR, Neves LG, Todd J, Zhang J, Comstock JC, Wang J. Natural Allelic Variations in Highly Polyploidy Saccharum Complex. FRONTIERS IN PLANT SCIENCE 2016; 7:804. [PMID: 27375658 PMCID: PMC4896942 DOI: 10.3389/fpls.2016.00804] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/20/2016] [Accepted: 05/23/2016] [Indexed: 05/20/2023]
Abstract
Sugarcane (Saccharum spp.) is an important sugar and biofuel crop with high polyploid and complex genomes. The Saccharum complex, comprised of Saccharum genus and a few related genera, are important genetic resources for sugarcane breeding. A large amount of natural variation exists within the Saccharum complex. Though understanding their allelic variation has been challenging, it is critical to dissect allelic structure and to identify the alleles controlling important traits in sugarcane. To characterize natural variations in Saccharum complex, a target enrichment sequencing approach was used to assay 12 representative germplasm accessions. In total, 55,946 highly efficient probes were designed based on the sorghum genome and sugarcane unigene set targeting a total of 6 Mb of the sugarcane genome. A pipeline specifically tailored for polyploid sequence variants and genotype calling was established. BWA-mem and sorghum genome approved to be an acceptable aligner and reference for sugarcane target enrichment sequence analysis, respectively. Genetic variations including 1,166,066 non-redundant SNPs, 150,421 InDels, 919 gene copy number variations, and 1,257 gene presence/absence variations were detected. SNPs from three different callers (Samtools, Freebayes, and GATK) were compared and the validation rates were nearly 90%. Based on the SNP loci of each accession and their ploidy levels, 999,258 single dosage SNPs were identified and most loci were estimated as largely homozygotes. An average of 34,397 haplotype blocks for each accession was inferred. The highest divergence time among the Saccharum spp. was estimated as 1.2 million years ago (MYA). Saccharum spp. diverged from Erianthus and Sorghum approximately 5 and 6 MYA, respectively. The target enrichment sequencing approach provided an effective way to discover and catalog natural allelic variation in highly polyploid or heterozygous genomes.
Collapse
Affiliation(s)
- Jian Song
- Agronomy Department, University of FloridaGainesville, FL, USA
- College of Life Sciences, Dezhou UniversityDezhou, China
| | - Xiping Yang
- Agronomy Department, University of FloridaGainesville, FL, USA
| | | | | | - James Todd
- Sugarcane Research Unit, United States Department of Agriculture-Agricultural Research ServiceHouma, LA, USA
- Sugarcane Field Station, United States Department of Agriculture-Agricultural Research Service, Canal PointFL, USA
| | - Jisen Zhang
- Center for Genomics and Biotechnology, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry UniversityFuzhou, China
| | - Jack C. Comstock
- Sugarcane Field Station, United States Department of Agriculture-Agricultural Research Service, Canal PointFL, USA
| | - Jianping Wang
- Agronomy Department, University of FloridaGainesville, FL, USA
- Center for Genomics and Biotechnology, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry UniversityFuzhou, China
- Plant Molecular and Biology Program, Genetics Institute, University of FloridaGainesville, FL, USA
- *Correspondence: Jianping Wang,
| |
Collapse
|
24
|
How to Isolate a Plant's Hypomethylome in One Shot. BIOMED RESEARCH INTERNATIONAL 2015; 2015:570568. [PMID: 26421293 PMCID: PMC4573423 DOI: 10.1155/2015/570568] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/05/2014] [Revised: 03/24/2015] [Accepted: 03/30/2015] [Indexed: 11/17/2022]
Abstract
Genome assembly remains a challenge for large and/or complex plant genomes due to their abundant repetitive regions resulting in studies focusing on gene space instead of the whole genome. Thus, DNA enrichment strategies facilitate the assembly by increasing the coverage and simultaneously reducing the complexity of the whole genome. In this paper we provide an easy, fast, and cost-effective variant of MRE-seq to obtain a plant's hypomethylome by an optimized methyl filtration protocol followed by next generation sequencing. The method is demonstrated on three plant species with knowingly large and/or complex (polyploid) genomes: Oryza sativa, Picea abies, and Crocus sativus. The identified hypomethylomes show clear enrichment for genes and their flanking regions and clear reduction of transposable elements. Additionally, genomic sequences around genes are captured including regulatory elements in introns and up- and downstream flanks. High similarity of the results obtained by a de novo assembly approach with a reference based mapping in rice supports the applicability for studying and understanding the genomes of nonmodel organisms. Hence we show the high potential of MRE-seq in a wide range of scenarios for the direct analysis of methylation differences, for example, between ecotypes, individuals, within or across species harbouring large, and complex genomes.
Collapse
|
25
|
Metcalfe CJ, Oliveira SG, Gaiarsa JW, Aitken KS, Carneiro MS, Zatti F, Van Sluys MA. Using quantitative PCR with retrotransposon-based insertion polymorphisms as markers in sugarcane. JOURNAL OF EXPERIMENTAL BOTANY 2015; 66:4239-50. [PMID: 26093024 PMCID: PMC4493790 DOI: 10.1093/jxb/erv283] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
Sugarcane is the main source of the world's sugar and is becoming increasingly important as a source of biofuel. The highly polyploid and heterozygous nature of the sugarcane genome has meant that characterization of the genome has lagged behind that of other important crops. Here we developed a method using a combination of quantitative PCR with a transposable marker system to score the relative number of alleles with a transposable element (TE) present at a particular locus. We screened two genera closely related to Saccharum (Miscanthus and Erianthus), wild Saccharum, traditional cultivars, and 127 modern cultivars from Brazilian and Australian breeding programmes. We showed how this method could be used in various ways. First, we showed that the method could be extended to be used as part of a genotyping system. Secondly, the history of insertion and timing of the three TEs examined supports our current understanding of the evolution of the Saccharum complex. Thirdly, all three TEs were found in only one of the two main lineages leading to the modern sugarcane cultivars and are therefore the first TEs identified that could potentially be used as markers for Saccharum spontaneum.
Collapse
Affiliation(s)
- Cushla J Metcalfe
- GaTE-Lab, Departamento de Botânica, IBUSP, Universidade de São Paulo, rua do Matao 277, 05508-090, SP, Brazil
| | - Sarah G Oliveira
- GaTE-Lab, Departamento de Botânica, IBUSP, Universidade de São Paulo, rua do Matao 277, 05508-090, SP, Brazil
| | - Jonas W Gaiarsa
- GaTE-Lab, Departamento de Botânica, IBUSP, Universidade de São Paulo, rua do Matao 277, 05508-090, SP, Brazil
| | - Karen S Aitken
- CSIRO Agriculture Flagship, Queensland Bioscience Precinct, 306 Carmody Road, St Lucia, QLD 4072, Australia
| | - Monalisa S Carneiro
- Centro de Ciências Agrárias, Universidade Federal de São Carlos, Araras, 13600-970, SP, Brazil
| | - Fernanda Zatti
- Centro de Ciências Agrárias, Universidade Federal de São Carlos, Araras, 13600-970, SP, Brazil
| | - Marie-Anne Van Sluys
- GaTE-Lab, Departamento de Botânica, IBUSP, Universidade de São Paulo, rua do Matao 277, 05508-090, SP, Brazil
| |
Collapse
|