1. Sun J, Tian Z, Li X, Li S, Li Z, Wang J, Hu Z, Chen H, Guo C, Xie M, Xu R. Systematic analysis of the pectin methylesterase gene family in Nicotiana tabacum and reveal their multiple roles in plant development and abiotic stresses. Frontiers in Plant Science 2022; 13:998841. [PMID: 36247564; PMCID: PMC9554592; DOI: 10.3389/fpls.2022.998841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/20/2022] [Accepted: 09/05/2022] [Indexed: 06/16/2023]
Abstract
Pectin methylesterases (PMEs) play multiple roles in regulating plant development and responses to various stresses. In our study, a total of 121 PME genes were identified in the tobacco genome and clustered into two groups based on phylogenetic analysis together with Arabidopsis members. Analyses of gene structure and conserved motifs indicated that exon/intron and motif organizations were relatively conserved within each group. Additionally, several stress-related elements were identified in the promoter regions of these genes. A survey of duplication events revealed that segmental duplications were critical to the expansion of the PME gene family in tobacco. Expression profile analysis revealed that these genes were expressed in various tissues and could be induced by diverse abiotic stresses. Notably, NtPME029 and NtPME043 were identified as homologues of AtPME3 and AtPME31, respectively. NtPME029 was highly expressed in roots, and over-expression of the NtPME029 gene promoted root development. NtPME043, in contrast, was induced by salt and ABA treatments, and over-expression of the NtPME043 gene significantly enhanced salt-stress tolerance in tobacco. Overall, these findings may shed light on the biological and functional characterization of NtPME genes in tobacco.
Affiliation(s)
- Jinhao Sun
  - Technology Center, China Tobacco Jiangsu Industrial Co., Ltd., Nanjing, China
  - Key Laboratory for Tobacco Gene Resources, Tobacco Research Institute, Chinese Academy of Agricultural Sciences, Qingdao, China
- Zhen Tian
  - Technology Center, China Tobacco Jiangsu Industrial Co., Ltd., Nanjing, China
- Xiaoxu Li
  - Key Laboratory for Tobacco Gene Resources, Tobacco Research Institute, Chinese Academy of Agricultural Sciences, Qingdao, China
- Shaopeng Li
  - Technology Center, China Tobacco Jiangsu Industrial Co., Ltd., Nanjing, China
- Zhiyuan Li
  - Key Laboratory for Tobacco Gene Resources, Tobacco Research Institute, Chinese Academy of Agricultural Sciences, Qingdao, China
- Jinling Wang
  - Technology Center, China Tobacco Jiangsu Industrial Co., Ltd., Nanjing, China
- Zongyu Hu
  - Technology Center, China Tobacco Jiangsu Industrial Co., Ltd., Nanjing, China
- Haiqing Chen
  - Technology Center, China Tobacco Jiangsu Industrial Co., Ltd., Nanjing, China
- Cun Guo
  - Key Laboratory for Tobacco Gene Resources, Tobacco Research Institute, Chinese Academy of Agricultural Sciences, Qingdao, China
  - Kunming Branch of Yunnan Provincial Tobacco Company, Kunming, China
- Minmin Xie
  - Key Laboratory for Tobacco Gene Resources, Tobacco Research Institute, Chinese Academy of Agricultural Sciences, Qingdao, China
- Ruyan Xu
  - Technology Center, China Tobacco Jiangsu Industrial Co., Ltd., Nanjing, China
2. Prasad P, Khatoon U, Verma RK, Aalam S, Kumar A, Mohapatra D, Bhattacharya P, Bag SK, Sawant SV. Transcriptional Landscape of Cotton Fiber Development and Its Alliance With Fiber-Associated Traits. Frontiers in Plant Science 2022; 13:811655. [PMID: 35283936; PMCID: PMC8908376; DOI: 10.3389/fpls.2022.811655] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 11/09/2021] [Accepted: 01/10/2022] [Indexed: 06/14/2023]
Abstract
How cotton fiber cells commit to and proceed through development remains an open question. At different fiber developmental stages, many genes change their expression pattern and play a pivotal role in fiber quality and yield. Recently, numerous studies have examined the transcriptional regulation of fiber, and their raw data were deposited in public repositories, enabling comprehensive integrative analysis. Here, we remapped more than 380 cotton RNA-seq datasets with a uniform mapping strategy, spanning ~400-fold coverage of the genome. We identified stage-specific features related to fiber cell commitment, initiation, elongation, and Secondary Cell Wall (SCW) synthesis, along with their putative cis-regulatory elements for specific regulation in fiber development. We also mined Exclusively Expressed Transcripts (EETs) that were positively selected during cotton fiber evolution and domestication. Furthermore, the expression of EETs was validated in 100 cotton genotypes through the nCounter assay and correlated with different fiber-related traits. Thus, our data mining study reveals several important features related to cotton fiber development and improvement, which were consolidated in the "CottonExpress-omics" database.
Affiliation(s)
- Priti Prasad
  - Division of Molecular Biology and Biotechnology, CSIR-National Botanical Research Institute, Lucknow, India
  - Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
- Uzma Khatoon
  - Division of Molecular Biology and Biotechnology, CSIR-National Botanical Research Institute, Lucknow, India
  - Department of Botany, University of Lucknow, Lucknow, India
- Rishi Kumar Verma
  - Division of Molecular Biology and Biotechnology, CSIR-National Botanical Research Institute, Lucknow, India
  - Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
- Shahre Aalam
  - Division of Molecular Biology and Biotechnology, CSIR-National Botanical Research Institute, Lucknow, India
- Ajay Kumar
  - Division of Molecular Biology and Biotechnology, CSIR-National Botanical Research Institute, Lucknow, India
- Sumit K. Bag
  - Division of Molecular Biology and Biotechnology, CSIR-National Botanical Research Institute, Lucknow, India
  - Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
- Samir V. Sawant
  - Division of Molecular Biology and Biotechnology, CSIR-National Botanical Research Institute, Lucknow, India
  - Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
3. Grover CE, Arick MA, Thrash A, Conover JL, Sanders WS, Peterson DG, Frelichowski JE, Scheffler JA, Scheffler BE, Wendel JF. Insights into the Evolution of the New World Diploid Cottons (Gossypium, Subgenus Houzingenia) Based on Genome Sequencing. Genome Biol Evol 2019; 11:53-71. [PMID: 30476109; PMCID: PMC6320677; DOI: 10.1093/gbe/evy256] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Accepted: 11/20/2018] [Indexed: 12/24/2022]
Abstract
We employed phylogenomic methods to study molecular evolutionary processes and phylogeny in the geographically widely dispersed New World diploid cottons (Gossypium, subg. Houzingenia). Whole genome resequencing data (average of 33× genomic coverage) were generated to reassess the phylogenetic history of the subgenus and provide a temporal framework for its diversification. Phylogenetic analyses indicate that the subgenus likely originated following transoceanic dispersal from Africa about 6.6 Ma, but that nearly all of the biodiversity evolved following rapid diversification in the mid-Pleistocene (0.5-2.0 Ma), with multiple long-distance dispersals required to account for range expansion to Arizona, the Galapagos Islands, and Peru. Comparative analyses of cpDNA versus nuclear data indicate that this history was accompanied by several clear cases of interspecific introgression. Repetitive DNAs contribute roughly half of the total 880 Mb genome, but most transposable element families are relatively old and stable among species. In the genic fraction, pairwise synonymous mutation rates average 1% per Myr, with nonsynonymous changes being about seven times less frequent. Over 1.1 million indels were detected and phylogenetically polarized, revealing a 2-fold bias toward deletions over small insertions. We suggest that this genome down-sizing bias counteracts genome size growth by TE amplification and insertions, and helps explain the relatively small genomes that are restricted to this subgenus. Compared with the rate of nucleotide substitution, the rate of indel occurrence is much lower, averaging about 17 nucleotide substitutions per indel event.
Affiliation(s)
- Corrinne E Grover
  - Department of Ecology, Evolution, and Organismal Biology, Iowa State University
- Mark A Arick
  - Institute for Genomics, Biocomputing, and Biotechnology, Mississippi State University
- Adam Thrash
  - Institute for Genomics, Biocomputing, and Biotechnology, Mississippi State University
- Justin L Conover
  - Department of Ecology, Evolution, and Organismal Biology, Iowa State University
- William S Sanders
  - Institute for Genomics, Biocomputing, and Biotechnology, Mississippi State University
  - Department of Computer Science & Engineering, Mississippi State University
  - The Jackson Laboratory, Connecticut
- Daniel G Peterson
  - Institute for Genomics, Biocomputing, and Biotechnology, Mississippi State University
- Brian E Scheffler
  - USDA, Genomics and Bioinformatics Research Unit, Stoneville, Mississippi
- Jonathan F Wendel
  - Department of Ecology, Evolution, and Organismal Biology, Iowa State University
4. Ulloa M, Hulse-Kemp AM, De Santiago LM, Stelly DM, Burke JJ. Insights Into Upland Cotton (Gossypium hirsutum L.) Genetic Recombination Based on 3 High-Density Single-Nucleotide Polymorphism and a Consensus Map Developed Independently With Common Parents. Genomics Insights 2017; 10:1178631017735104. [PMID: 29308012; PMCID: PMC5751910; DOI: 10.1177/1178631017735104] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Received: 04/16/2017] [Accepted: 09/10/2017] [Indexed: 11/17/2022]
Abstract
High-density linkage maps are vital to supporting the correct placement of scaffolds and gene sequences on chromosomes and are fundamental to contemporary organismal research and scientific approaches to genetic improvement, especially in paleopolyploids with exceptionally complex genomes, e.g., upland cotton (Gossypium hirsutum L., 2n = 52). Three independently developed intraspecific upland mapping populations were analyzed to generate 3 high-density genetic linkage single-nucleotide polymorphism (SNP) maps and a consensus map using the CottonSNP63K array. The populations consisted of a previously reported F2, a recombinant inbred line (RIL), and a reciprocal RIL population, from the "Phytogen 72" and "Stoneville 474" cultivars. The cluster file provided 7417 genotyped SNP markers, resulting in 26 linkage groups corresponding to the 26 chromosomes (c) of the allotetraploid upland cotton (AD)1, which arose from the merging of 2 genomes ("A" Old World and "D" New World). Patterns of chromosome-specific recombination were largely consistent across mapping populations. The high-density genetic consensus map included 7244 SNP markers that spanned 3538 cM and comprised 3824 SNP bins, of which 1783 and 2041 were in the At and Dt subgenomes with 1825 and 1713 cM map lengths, respectively. Subgenome average distances were nearly identical, indicating that subgenomic differences in bin number arose due to the high numbers of SNPs on the Dt subgenome. Examination of expected recombination frequency or crossovers (COs) on the chromosomes within each population of the 2 subgenomes revealed that COs were also not affected by the SNPs or SNP bin number in these subgenomes. Comparative alignment analyses identified historical ancestral At-subgenomic translocations of c02 and c03, as well as of c04 and c05. The consensus map SNP sequences aligned with high congruency to the NBI assembly of Gossypium hirsutum. However, the genomic comparisons revealed evidence of additional unconfirmed possible duplications, inversions and translocations, and unbalanced SNP sequence homology or SNP sequence/loci genomic dominance, or homeolog loci bias of the upland tetraploid At and Dt subgenomes. The alignments indicated that 364 SNP-associated, previously unintegrated scaffolds can be placed in pseudochromosomes of the NBI G. hirsutum assembly. This is the first intraspecific SNP genetic linkage consensus map assembled in G. hirsutum with a core of reproducible Mendelian SNP markers assayed on different populations, and it provides further knowledge of the chromosome arrangement of genic and nongenic SNPs. Together, the consensus map and RIL populations provide a synergistically useful platform for localizing and identifying agronomically important loci for improvement of the cotton crop.
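The bin counts and map lengths quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the variable and function names are hypothetical, the figures are copied from the abstract, and "cM per bin" is a simple ratio assumed here for illustration, which may not be the measure the authors used when comparing subgenome average distances.

```python
# Figures copied from the abstract; all names below are hypothetical.
consensus = {"bins": 3824, "length_cM": 3538}
subgenomes = {
    "At": {"bins": 1783, "length_cM": 1825},
    "Dt": {"bins": 2041, "length_cM": 1713},
}


def cm_per_bin(entry):
    """Average map length per SNP bin (a plain ratio, assumed for illustration)."""
    return entry["length_cM"] / entry["bins"]


# The two subgenome totals should add up to the consensus map totals.
total_bins = sum(s["bins"] for s in subgenomes.values())
total_length = sum(s["length_cM"] for s in subgenomes.values())

if __name__ == "__main__":
    print(f"bins: {total_bins} (consensus map: {consensus['bins']})")
    print(f"length: {total_length} cM (consensus map: {consensus['length_cM']} cM)")
    for name, entry in subgenomes.items():
        print(f"{name}: {cm_per_bin(entry):.3f} cM per bin")
```

Under this ratio the subgenome totals reconcile with the consensus map, and the Dt subgenome shows more bins over a slightly shorter map length, in line with the abstract's remark about higher SNP numbers on Dt.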
Affiliation(s)
- Mauricio Ulloa
  - USDA-ARS, PA, Plant Stress and Germplasm Development Research Unit, Lubbock, TX, USA
- Amanda M Hulse-Kemp
  - Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, USA
- Luis M De Santiago
  - Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, USA
  - Interdisciplinary Program in Genetics, Texas A&M University, College Station, TX, USA
- David M Stelly
  - Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, USA
- John J Burke
  - USDA-ARS, PA, Plant Stress and Germplasm Development Research Unit, Lubbock, TX, USA
5. Boutte J, Ferreira de Carvalho J, Rousseau-Gueutin M, Poulain J, Da Silva C, Wincker P, Ainouche M, Salmon A. Reference Transcriptomes and Detection of Duplicated Copies in Hexaploid and Allododecaploid Spartina Species (Poaceae). Genome Biol Evol 2016; 8:3030-3044. [PMID: 27614235; PMCID: PMC5633685; DOI: 10.1093/gbe/evw209] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Accepted: 08/20/2016] [Indexed: 01/19/2023]
Abstract
In this study, we report the assembly and annotation of five reference transcriptomes for the European hexaploid Spartina species (S. maritima, S. alterniflora and their homoploid hybrids S. × townsendii and S. × neyrautii) and the allododecaploid invasive species S. anglica. These transcriptomes were constructed from various leaf and root cDNA libraries that were sequenced using both Roche-454 and Illumina technologies. Considering the high ploidy levels of the Spartina genomes under study, the absence of a diploid reference genome, and the need for an appropriate analytical strategy, we developed generic bioinformatics tools to (1) detect different haplotypes of each gene within each species and (2) assign a parental origin to haplotypes detected in the hexaploid hybrids and the neo-allopolyploid. The approach described here allows the detection of putative homeologs from sets of short reads. Synonymous substitution rate (KS) comparisons between haplotypes from the hexaploid species revealed the presence of one KS peak (likely resulting from the tetraploid duplication event). The procedure developed in this study can be applied in future differential gene expression or genomics experiments to study the fate of duplicated genes in the invasive allododecaploid S. anglica.
Affiliation(s)
- Julien Boutte
  - UMR CNRS 6553 Ecobio, OSUR (Observatoire des Sciences de l'Univers de Rennes), University of Rennes 1, Rennes Cedex, France
- Julie Ferreira de Carvalho
  - UMR CNRS 6553 Ecobio, OSUR (Observatoire des Sciences de l'Univers de Rennes), University of Rennes 1, Rennes Cedex, France
- Mathieu Rousseau-Gueutin
  - UMR CNRS 6553 Ecobio, OSUR (Observatoire des Sciences de l'Univers de Rennes), University of Rennes 1, Rennes Cedex, France
  - UMR Institut de Génétique, Environnement et Protection des Plantes, Institut National de la Recherche Agronomique, Le Rheu Cedex, France
- Malika Ainouche
  - UMR CNRS 6553 Ecobio, OSUR (Observatoire des Sciences de l'Univers de Rennes), University of Rennes 1, Rennes Cedex, France
- Armel Salmon
  - UMR CNRS 6553 Ecobio, OSUR (Observatoire des Sciences de l'Univers de Rennes), University of Rennes 1, Rennes Cedex, France
6. Kottapalli P, Ulloa M, Kottapalli KR, Payton P, Burke J. SNP Marker Discovery in Pima Cotton (Gossypium barbadense L.) Leaf Transcriptomes. Genomics Insights 2016; 9:51-60. [PMID: 27721653; PMCID: PMC5049682; DOI: 10.4137/gei.s40377] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Received: 06/10/2016] [Revised: 08/22/2016] [Accepted: 08/24/2016] [Indexed: 11/17/2022]
Abstract
The objective of this study was to explore the known narrow genetic diversity and discover single-nucleotide polymorphic (SNP) markers for marker-assisted breeding within Pima cotton (Gossypium barbadense L.) leaf transcriptomes. cDNA from 25-day-old plants of three diverse cotton genotypes [Pima S6 (PS6), Pima S7 (PS7), and Pima 3-79 (P3-79)] was sequenced on an Illumina sequencing platform. A total of 28.9 million reads (average read length of 138 bp) were generated by sequencing cDNA libraries of these three genotypes. The de novo assembly of reads generated transcriptome sets of 26,369 contigs for PS6, 25,870 contigs for PS7, and 24,796 contigs for P3-79. A Pima leaf reference transcriptome was generated consisting of 42,695 contigs. More than 10,000 single-nucleotide polymorphisms (SNPs) were identified between the genotypes, with 100% SNP frequency and a minimum of eight sequencing reads. The most prevalent SNP substitutions were C-T and A-G in these cotton genotypes. The putative SNPs identified can be utilized for characterizing genetic diversity, genotyping, and eventually in Pima cotton breeding through marker-assisted selection.
Affiliation(s)
- Pratibha Kottapalli
  - Center for Biotechnology and Genomics, Texas Tech University, Lubbock, TX, USA
- Mauricio Ulloa
  - USDA-ARS, PA, CSRL, Plant Stress and Germplasm Development Research, Lubbock, TX, USA
- Paxton Payton
  - USDA-ARS, PA, CSRL, Plant Stress and Germplasm Development Research, Lubbock, TX, USA
- John Burke
  - USDA-ARS, PA, CSRL, Plant Stress and Germplasm Development Research, Lubbock, TX, USA
7. Glover NM, Redestig H, Dessimoz C. Homoeologs: What Are They and How Do We Infer Them? Trends in Plant Science 2016; 21:609-621. [PMID: 27021699; PMCID: PMC4920642; DOI: 10.1016/j.tplants.2016.02.005] [Citation(s) in RCA: 97] [Impact Index Per Article: 12.1] [Received: 09/22/2015] [Revised: 02/09/2016] [Accepted: 02/20/2016] [Indexed: 05/18/2023]
Abstract
The evolutionary history of nearly all flowering plants includes a polyploidization event. Homologous genes resulting from allopolyploidy are commonly referred to as 'homoeologs', although this term has not always been used precisely or consistently in the literature. With several allopolyploid genome sequencing projects under way, there is a pressing need for computational methods for homoeology inference. Here we review the definition of homoeology in historical and modern contexts and propose a precise and testable definition highlighting the connection between homoeologs and orthologs. In the second part, we survey experimental and computational methods of homoeolog inference, considering the strengths and limitations of each approach. Establishing a precise and evolutionarily meaningful definition of homoeology is essential for understanding the evolutionary consequences of polyploidization.
Affiliation(s)
- Natasha M Glover
  - Bayer CropScience NV, Technologiepark 38, 9052 Gent, Belgium
  - University College London, Gower Street, London WC1E 6BT, UK
- Christophe Dessimoz
  - University College London, Gower Street, London WC1E 6BT, UK
  - University of Lausanne, Biophore, 1015 Lausanne, Switzerland
  - Swiss Institute of Bioinformatics, Biophore, 1015 Lausanne, Switzerland
8. Zhou B, Zhang L, Ullah A, Jin X, Yang X, Zhang X. Identification of Multiple Stress Responsive Genes by Sequencing a Normalized cDNA Library from Sea-Land Cotton (Gossypium barbadense L.). PLoS One 2016; 11:e0152927. [PMID: 27031331; PMCID: PMC4816313; DOI: 10.1371/journal.pone.0152927] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Received: 12/14/2015] [Accepted: 03/20/2016] [Indexed: 11/18/2022]
Abstract
Background: Plants often face multiple stresses during growth and development, including drought, extreme temperature, salinity, nutrient deficiency and biotic stresses. These stresses trigger a series of physiological and metabolic reactions that reversibly inhibit metabolism and growth and can cause serious irreversible damage, even death. At each stage of cotton growth, environmental stress conditions pose devastating threats to plant growth and development, especially yield and quality. Due to the complex stress conditions and unclear molecular mechanisms of stress response, there is an urgent need to explore the mechanisms of cotton response to abiotic stresses.
Methodology and Principal Findings: A normalized cDNA library was constructed using Gossypium barbadense Hai-7124 treated with different stress conditions (heat, cold, salt, drought, potassium and phosphorus deficit and Verticillium dahliae infection). Random sequencing of this library generated 6,047 high-quality expressed sequence tags (ESTs). The ESTs were clustered and assembled into 3,135 uniESTs, composed of 2,497 contigs and 638 singletons. The blastx results showed 2,746 uniESTs with significant similarity to known genes and 74 uniESTs with significant similarity to genes of predicted proteins, while 315 uniESTs remain uncharacterized. Functional classification revealed an abundance of uniESTs in binding, catalytic activity, and structural molecule activity. Annotation of the uniESTs against the plant transcription factor database (PlantTFDB) and the Plant Stress Protein Database (PSPDB) showed that transcription factors and stress-related genes were enriched in the current library. The expression of some transcription factors and specific stress-related genes was verified by RT-PCR under various stress conditions.
Conclusions/Significance: Annotation results showed that a large number of genes respond to stress in our study, including members of the MYB-related, C2H2, FAR1, bHLH, bZIP, MADS, and mTERF families. These results will improve our knowledge of stress tolerance in cotton and help in discovering candidate genes related to stress tolerance. The publicly available ESTs from G. barbadense are a valuable genomic resource that will facilitate further molecular study and breeding of stress-tolerant cotton.
Affiliation(s)
- Bin Zhou
  - National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei 430070, P. R. China
- Lin Zhang
  - National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei 430070, P. R. China
- Abid Ullah
  - National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei 430070, P. R. China
- Xin Jin
  - National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei 430070, P. R. China
- Xiyan Yang
  - National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei 430070, P. R. China
- Xianlong Zhang
  - National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei 430070, P. R. China
9. Lambret-Frotté J, Artico S, Muniz Nardeli S, Fonseca F, Brilhante Oliveira-Neto O, Grossi-de-Sá MF, Alves-Ferreira M. Promoter isolation and characterization of GhAO-like1, a Gossypium hirsutum gene similar to multicopper oxidases that is highly expressed in reproductive organs. Genome 2015; 59:23-36. [PMID: 26692462; DOI: 10.1139/gen-2015-0055] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 01/28/2023]
Abstract
Cotton is one of the most economically important cultivated crops. It is the major source of natural fiber for the textile industry and an important target for genetic modification for both biotic stress and herbicide tolerance. Therefore, the characterization of genes and regulatory regions that might be useful for genetic transformation is indispensable. The isolation and characterization of new regulatory regions is of great importance to drive transgene expression in genetically modified crops. One of the major drawbacks in cotton production is pest damage; therefore, the most promising, cost-effective, and sustainable method for pest control is the development of genetically resistant cotton lines. Considering this scenario, our group isolated and characterized the promoter region of an MCO (multicopper oxidase) from Gossypium hirsutum, named GhAO-like1 (ascorbate oxidase-like1). The quantitative expression, together with the in vivo characterization of the promoter region, reveals that GhAO-like1 has a flower- and fruit-specific expression pattern. The GUS activity is mainly observed in stamens, as expected considering that the GhAO-like1 regulatory sequence is enriched in cis elements that have been characterized as targets of reproductive-tissue-specific transcription factors. Both histological and quantitative analyses in Arabidopsis thaliana have confirmed flower (mainly in stamens) and fruit expression of GhAO-like1. In the present paper, we isolated and characterized, both in silico and in vivo, the promoter region of the GhAO-like1 gene. The regulatory region of GhAO-like1 might be useful to confer tissue-specific expression in genetically modified plants.
Affiliation(s)
- Julia Lambret-Frotté
  - Department of Genetics, Universidade Federal do Rio de Janeiro (UFRJ), Rio de Janeiro, RJ, Brazil
- Sinara Artico
  - Department of Genetics, Universidade Federal do Rio de Janeiro (UFRJ), Rio de Janeiro, RJ, Brazil
- Sarah Muniz Nardeli
  - Department of Genetics, Universidade Federal do Rio de Janeiro (UFRJ), Rio de Janeiro, RJ, Brazil
- Fernando Fonseca
  - Embrapa Genetic Resources and Biotechnology, Brasilia, DF, Brazil
- Maria Fatima Grossi-de-Sá
  - Embrapa Genetic Resources and Biotechnology, Brasilia, DF, Brazil
  - Universidade Católica de Brasília (UCB), Brasilia, DF, Brazil
- Marcio Alves-Ferreira
  - Department of Genetics, Universidade Federal do Rio de Janeiro (UFRJ), Rio de Janeiro, RJ, Brazil
10. Haplotype Detection from Next-Generation Sequencing in High-Ploidy-Level Species: 45S rDNA Gene Copies in the Hexaploid Spartina maritima. G3: Genes, Genomes, Genetics 2015; 6:29-40. [PMID: 26530424; PMCID: PMC4704722; DOI: 10.1534/g3.115.023242] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Indexed: 01/21/2023]
Abstract
Gene and whole-genome duplications are widespread in plant nuclear genomes, resulting in sequence heterogeneity. Identification of duplicated genes may be particularly challenging in highly redundant genomes, especially when there are no diploid parents as a reference. Here, we developed a pipeline to detect the different copies in the ribosomal RNA gene family in the hexaploid grass Spartina maritima from next-generation sequencing (Roche-454) reads. The heterogeneity of the different domains of the highly repeated 45S unit was explored by identifying single nucleotide polymorphisms (SNPs) and assembling reads based on shared polymorphisms. SNPs were validated using comparisons with Illumina sequence data sets and by cloning and Sanger (re)sequencing. Using this approach, 29 validated polymorphisms and 11 validated haplotypes were reported (out of 34 and 20, respectively, that were initially predicted by our program). The rDNA domains of S. maritima have lengths similar to those found in other Poaceae, apart from the 5′-ETS, which is approximately twice as long in S. maritima. Sequence homogeneity was encountered in coding regions and both internal transcribed spacers (ITS), whereas high intragenomic variability was detected in the intergenic spacer (IGS) and the external transcribed spacer (ETS). Molecular cytogenetic analysis by fluorescent in situ hybridization (FISH) revealed the presence of one pair of 45S rDNA signals on the chromosomes of S. maritima instead of the three pairs expected for a hexaploid genome, indicating loss of duplicated homeologous loci through the diploidization process. The procedure developed here may be used at any ploidy level and with different sequencing technologies.
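The idea of calling SNPs and then grouping reads by the polymorphisms they share can be illustrated with a toy example. This is a minimal sketch under strong assumptions, not the authors' pipeline: the function names, the `min_minor_count` threshold, and the input reads are all hypothetical, and the reads are assumed to be gap-free, equal in length, and already aligned to the same reference window.

```python
from collections import Counter, defaultdict


def polymorphic_sites(reads, min_minor_count=2):
    """Return columns where the second most common base occurs at least min_minor_count times."""
    sites = []
    for col in range(len(reads[0])):
        counts = Counter(read[col] for read in reads)
        ranked = counts.most_common(2)
        if len(ranked) == 2 and ranked[1][1] >= min_minor_count:
            sites.append(col)
    return sites


def group_by_shared_alleles(reads, sites):
    """Group reads by the bases they carry at the polymorphic columns."""
    groups = defaultdict(list)
    for read in reads:
        key = "".join(read[col] for col in sites)
        groups[key].append(read)
    return dict(groups)


# Hypothetical pre-aligned reads covering the same eight positions.
toy_reads = [
    "ACGTACGT",
    "ACGTACGT",
    "ACCTACGA",
    "ACCTACGA",
    "ACGTACGA",
    "ACGTACGA",
]

sites = polymorphic_sites(toy_reads)
haplotypes = group_by_shared_alleles(toy_reads, sites)

if __name__ == "__main__":
    print("polymorphic columns:", sites)
    for alleles, members in sorted(haplotypes.items()):
        print(alleles, len(members))
```

Requiring the minor base to appear in at least two reads is a crude stand-in for separating real variants from sequencing errors; real data would also need quality filtering, partial read overlap handling, and independent validation of the kind the abstract describes.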
12. Shi G, Guo X, Guo J, Liu L, Hua J. Analyzing serial cDNA libraries revealed reactive oxygen species and gibberellins signaling pathways in the salt response of Upland cotton (Gossypium hirsutum L.). Plant Cell Reports 2015; 34:1005-23. [PMID: 25700980; DOI: 10.1007/s00299-015-1761-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Received: 11/10/2014] [Revised: 01/27/2015] [Accepted: 02/08/2015] [Indexed: 05/22/2023]
Abstract
By comparing serial full-length cDNA libraries from stressed and control plants, the dynamic process of the salt stress response in Upland cotton was studied, and reactive oxygen species and gibberellins signaling pathways were proposed. Upland cotton is the most important fiber plant and has high salt tolerance. However, the molecular mechanism underlying salt tolerance in domesticated cotton was unclear. Here, seven full-length cDNA libraries were constructed for seedling roots of Upland cotton 'Zhong G 5' at 0, 3, 12 and 48 h after control treatment or 150 mM NaCl stress. About 3300 colonies in each library were selected robotically for 5'-end pyrosequencing, resulting in 20,358 expressed sequence tags (ESTs) in total. These were assembled into 8516 uniESTs, comprising 2914 contigs and 5602 singletons, which were explored for Gene Ontology (GO) function. GO comparison between the serial stress libraries and the controls indicated that growth regulation, stimulus response, signal transduction and biological regulation processes changed dynamically in response to salt stress. The MYB, MYB-related, WRKY, bHLH, GRAS and ERF families of transcription factors were significantly enriched in the early response. Sixty-five differentially expressed genes (DEGs), mainly associated with reactive oxygen species (ROS) scavenging, gibberellins (GAs) metabolism, signal transduction, transcription regulation, stress response and transmembrane transport, were identified and confirmed by quantitative real-time PCR. Overexpression of selected DEGs increased tolerance to salt stress in transgenic yeast. The results of this study support the activation of a ROS-GAs interacting signaling pathway in the salt stress response of Upland cotton. Our results provide valuable gene resources for further investigation of the molecular mechanism of salinity tolerance.
Affiliation(s)
- Gongyao Shi
- Key Lab of Crop Heterosis and Utilization of Ministry of Education, College of Agronomy and Biotechnology, Beijing Key Lab of Crop Genetic Improvement, China Agricultural University, Beijing, 100193, China
|
13
|
Xu Z, Yu J, Kohel RJ, Percy RG, Beavis WD, Main D, Yu JZ. Distribution and evolution of cotton fiber development genes in the fibreless Gossypium raimondii genome. Genomics 2015; 106:61-9. [PMID: 25796538 DOI: 10.1016/j.ygeno.2015.03.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Received: 09/19/2014] [Revised: 02/05/2015] [Accepted: 03/11/2015] [Indexed: 01/15/2023]
Abstract
Cotton fiber represents the largest single cell in plants and serves as a model to study cell development. This study investigated the distribution and evolution of fiber Unigenes anchored to recombination hotspots between tetraploid cotton (Gossypium hirsutum) At and Dt subgenomes, and within a parental diploid cotton (Gossypium raimondii) D genome. Comparative analysis of At vs D and Dt vs D showed that 1) the D genome provides many fiber genes after its merger with another parental diploid cotton (Gossypium arboreum) A genome, although the D genome itself does not produce any spinnable fiber; 2) similarity of fiber genes is higher between At vs D than between Dt vs D genomic hotspots. This is the first report that fiber genes have higher similarity between At and D than between Dt and D. The finding provides new insights into cotton genomic regions that would facilitate genetic improvement of natural fiber properties.
Affiliation(s)
- Zhanyou Xu
- USDA-ARS, Southern Plains Agricultural Research Center, College Station, TX, USA; Bioinformatics and Computational Biology, Iowa State University, Ames, IA, USA
- Jing Yu
- USDA-ARS, Southern Plains Agricultural Research Center, College Station, TX, USA; Department of Horticulture, Washington State University, Pullman, WA, USA
- Russell J Kohel
- USDA-ARS, Southern Plains Agricultural Research Center, College Station, TX, USA
- Richard G Percy
- USDA-ARS, Southern Plains Agricultural Research Center, College Station, TX, USA
- William D Beavis
- Bioinformatics and Computational Biology, Iowa State University, Ames, IA, USA
- Dorrie Main
- Department of Horticulture, Washington State University, Pullman, WA, USA
- John Z Yu
- USDA-ARS, Southern Plains Agricultural Research Center, College Station, TX, USA
|
14
|
Kebede H, Payton P, Pham HTM, Allen RD, Wright RJ. Toward Coalescing Gene Expression and Function with QTLs of Water-Deficit Stress in Cotton. INTERNATIONAL JOURNAL OF PLANT GENOMICS 2015; 2015:892716. [PMID: 26167172 PMCID: PMC4488579 DOI: 10.1155/2015/892716] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 01/21/2015] [Revised: 05/08/2015] [Accepted: 05/13/2015] [Indexed: 05/08/2023]
Abstract
Cotton exhibits moderately high vegetative tolerance to water-deficit stress but lint production is restricted by the available rainfed and irrigation capacity. We have described the impact of water-deficit stress on the genetic and metabolic control of fiber quality and production. Here we examine the association of tentative consensus sequences (TCs) derived from various cotton tissues under irrigated and water-limited conditions with stress-responsive QTLs. Three thousand sixteen mapped sequence-tagged-sites were used as anchored targets to examine sequence homology with 15,784 TCs to test the hypothesis that putative stress-responsive genes will map within QTLs associated with stress-related phenotypic variation more frequently than with other genomic regions not associated with these QTLs. Approximately 1,906 of 15,784 TCs were mapped to the consensus map. About 35% of the annotated TCs that mapped within QTL regions were genes involved in an abiotic stress response. By comparison, only 14.5% of the annotated TCs mapped outside these QTLs were classified as abiotic stress genes. A simple binomial probability calculation of this degree of bias being observed if QTL and non-QTL regions are equally likely to contain stress genes was P(x ≥ 85) = 7.99 × 10^-15. These results suggest that the QTL regions have a higher propensity to contain stress genes.
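The kind of one-sided binomial calculation mentioned in this abstract can be sketched as follows. This is only an illustration under stated assumptions, not the authors' procedure: the abstract gives the threshold (x ≥ 85) and the two proportions (about 35% inside QTLs, 14.5% outside) but not the sample size, so `n_annotated_in_qtl` below is a hypothetical value chosen so that 85/n is close to 35%. The function name `binomial_upper_tail` is likewise illustrative, and the printed value is not expected to reproduce the reported P exactly.

```python
from math import comb


def binomial_upper_tail(n: int, k: int, p: float) -> float:
    """Return P(X >= k) for X ~ Binomial(n, p), summing the exact terms."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


# Hypothetical inputs: only k and the two rates come from the abstract.
n_annotated_in_qtl = 243  # ASSUMED sample size (not stated in the abstract)
k_stress_in_qtl = 85      # threshold reported as P(x >= 85)
p_null = 0.145            # stress-gene rate outside QTLs, used as the null rate

tail = binomial_upper_tail(n_annotated_in_qtl, k_stress_in_qtl, p_null)
print(f"P(X >= {k_stress_in_qtl}) under the null = {tail:.3e}")
```

Under the null hypothesis that QTL and non-QTL regions are equally likely to contain stress genes, a very small upper-tail probability indicates that the observed enrichment is unlikely to arise by chance.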
Affiliation(s)
- Hirut Kebede
- USDA-ARS Crop Genetics Research Unit, Stoneville, MS 38776, USA
- Paxton Payton
- USDA-ARS Cropping Systems Research Laboratory, Lubbock, TX 79415, USA
- Hanh Thi My Pham
- Department of Plant and Soil Science, Texas Tech University, Lubbock, TX 79409, USA
- Randy D. Allen
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK 73401, USA
- Robert J. Wright
- Department of Plant and Soil Science, Texas Tech University, Lubbock, TX 79409, USA
- *Robert J. Wright:
|
15
|
Molecular markers and cotton genetic improvement: current status and future prospects. ScientificWorldJournal 2014; 2014:607091. [PMID: 25401149 PMCID: PMC4226190 DOI: 10.1155/2014/607091] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Received: 07/21/2014] [Accepted: 09/17/2014] [Indexed: 11/17/2022]
Abstract
The narrow genetic base and complex allotetraploid genome of cotton (Gossypium hirsutum L.) are stimulating efforts to obtain the polymorphism required for marker-based breeding. The availability of draft genome sequences of G. raimondii and G. arboreum and of next generation sequencing (NGS) technologies has facilitated the development of high-throughput marker technologies in cotton. The concepts of genetic diversity, QTL mapping, and marker assisted selection (MAS) are evolving into more efficient concepts of linkage disequilibrium, association mapping, and genomic selection, respectively. The objective of the current review is to analyze the pace of evolution in the molecular marker technologies in cotton during the last ten years in the following four areas: (i) comparative analysis of low- and high-throughput marker technologies available in cotton, (ii) genetic diversity in the available wild and improved gene pools of cotton, (iii) identification of the genomic regions within the cotton genome underlying economic traits, and (iv) marker-based selection methodologies. Moreover, the applications of marker technologies to enhance breeding efficiency in cotton are also summarized. The aforementioned genomic technologies and the integration of several other omics resources are expected to enhance cotton productivity and meet global fiber quantity and quality demands.
|
16
|
Homeologous genes involved in mannitol synthesis reveal unequal contributions in response to abiotic stress in Coffea arabica. Mol Genet Genomics 2014; 289:951-63. [PMID: 24861101 DOI: 10.1007/s00438-014-0864-y] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Received: 11/06/2013] [Accepted: 05/06/2014] [Indexed: 01/10/2023]
Abstract
Polyploid plants can exhibit transcriptional modulation in homeologous genes in response to abiotic stresses. Coffea arabica, an allotetraploid, accounts for 75% of the world's coffee production. Extreme temperatures, salinity and drought limit crop productivity, which includes coffee plants. Mannitol is known to be involved in abiotic stress tolerance in higher plants. This study aimed to investigate the transcriptional responses of genes involved in mannitol biosynthesis and catabolism in C. arabica leaves under water deficit, salt stress and high temperature. Mannitol concentration was significantly increased in leaves of plants under drought and salinity, but reduced by heat stress. Fructose content followed the level of mannitol only in heat-stressed plants, suggesting the partitioning of the former into other metabolites during drought and salt stress conditions. Transcripts of the key enzymes involved in mannitol biosynthesis, CaM6PR, CaPMI and CaMTD, were modulated in distinct ways depending on the abiotic stress. Our data suggest that changes in mannitol accumulation during drought and salt stress in leaves of C. arabica are due, at least in part, to the increased expression of the key genes involved in mannitol biosynthesis. In addition, the homeologs of the Coffea canephora subgenome did not present the same pattern of overall transcriptional response, indicating differential regulation of these genes by the same stimulus. In this way, this study adds new information on the differential expression of C. arabica homeologous genes under adverse environmental conditions showing that abiotic stresses can influence the homeologous gene regulation pattern, in this case, mainly on those involved in mannitol pathway.
|
17
|
Nigam D, Kavita P, Tripathi RK, Ranjan A, Goel R, Asif M, Shukla A, Singh G, Rana D, Sawant SV. Transcriptome dynamics during fibre development in contrasting genotypes of Gossypium hirsutum L. PLANT BIOTECHNOLOGY JOURNAL 2014; 12:204-218. [PMID: 24119257 DOI: 10.1111/pbi.12129] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Received: 06/21/2013] [Revised: 08/29/2013] [Accepted: 09/03/2013] [Indexed: 06/02/2023]
Abstract
Understanding the contribution of genetic background in fibre quality traits is important for the development of future cotton varieties with superior fibre quality. We used Affymetrix microarray (Santa Clara, CA) and Roche 454 GSFLX (Branford, CT) for comparative transcriptome analysis between two superior and three inferior genotypes at six fibre developmental stages. Microarray-based analysis of variance (ANOVA) for 89 microarrays encompassing five contrasting genotypes and six developmental stages suggests that the stages of the fibre development have a more pronounced effect on the differentially expressed genes (DEGs) than the genetic background of genotypes. Superior genotypes showed enriched activity of cell wall enzymes, such as pectin methyl esterase, at early elongation stage, enriched metabolic activities such as lipid, amino acid and ribosomal protein subunits at peak elongation, and prolonged combinatorial regulation of brassinosteroid and auxin at later stages. Our efforts on transcriptome sequencing were focused on changes in gene expression at 25 DPA. Transcriptome sequencing resulted in the generation of 475 658 and 429 408 high-quality reads from superior and inferior genotypes, respectively. A total of 24 609 novel transcripts were identified manually for Gossypium hirsutum with no hits in NCBI 'nr' database. Gene ontology analyses showed that the genes for ribosome biogenesis, protein transport and fatty acid biosynthesis were over-represented in superior genotype, whereas salt stress, abscisic acid stimuli and water deprivation leading to the increased proteolytic activity were more pronounced in inferior genotype.
Affiliation(s)
- Deepti Nigam
- Plant Molecular Biology Laboratory, CSIR-National Botanical Research Institute, Rana Pratap Marg, Lucknow, India
|
18
|
Rambani A, Page JT, Udall JA. Polyploidy and the petal transcriptome of Gossypium. BMC PLANT BIOLOGY 2014; 14:3. [PMID: 24393201 PMCID: PMC3890615 DOI: 10.1186/1471-2229-14-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Received: 07/31/2013] [Accepted: 10/08/2013] [Indexed: 05/02/2023]
Abstract
Background: Genes duplicated by polyploidy (homoeologs) may be differentially expressed in plant tissues. Recent research using DNA microarrays and RNAseq data has described a cacophony of complex expression patterns during development of cotton fibers, petals, and leaves. Because of its highly canalized development, petal tissue has been used as a model tissue for gene expression in cotton. Recent advances in cotton genome annotation and assembly now permit an enhanced analysis of duplicate gene deployment in petals from allopolyploid cotton. Results: Homoeologous gene expression levels were quantified in diploid and tetraploid flower petals of Gossypium using the Gossypium raimondii genome sequence as a reference. In the polyploid, most homoeologous genes were expressed at equal levels, though a subset had an expression bias of AT and DT copies. The direction of gene expression bias was conserved in natural and recent polyploids of cotton. Conservation of direction of bias and additional comparisons between the diploids and tetraploids suggested different regulation mechanisms of gene expression. We described three phases in the evolution of cotton genomes that contribute to gene expression in the polyploid nucleus. Conclusions: Compared to previous studies, a surprising level of expression homeostasis was observed in the expression patterns of polyploid genomes. Conserved expression bias in polyploid petals may have resulted from cis-acting modifications that occurred prior to polyploidization. Some duplicated genes were intriguing exceptions to general trends. Mechanisms of gene regulation for these and other genes in the cotton genome warrant further investigation.
Affiliation(s)
- Aditi Rambani
- Plant and Wildlife Science Department, Brigham Young University, Provo, UT 84602, USA
- Justin T Page
- Plant and Wildlife Science Department, Brigham Young University, Provo, UT 84602, USA
- Joshua A Udall
- Plant and Wildlife Science Department, Brigham Young University, Provo, UT 84602, USA
|
19
|
Yoo MJ, Wendel JF. Comparative evolutionary and developmental dynamics of the cotton (Gossypium hirsutum) fiber transcriptome. PLoS Genet 2014; 10:e1004073. [PMID: 24391525 PMCID: PMC3879233 DOI: 10.1371/journal.pgen.1004073] [Citation(s) in RCA: 104] [Impact Index Per Article: 10.4] [Received: 09/02/2013] [Accepted: 11/15/2013] [Indexed: 01/05/2023]
Abstract
The single-celled cotton (Gossypium hirsutum) fiber provides an excellent model to investigate how human selection affects phenotypic evolution. To gain insight into the evolutionary genomics of cotton domestication, we conducted comparative transcriptome profiling of developing cotton fibers using RNA-Seq. Analysis of single-celled fiber transcriptomes from four wild and five domesticated accessions from two developmental time points revealed that at least one-third and likely one-half of the genes in the genome are expressed at any one stage during cotton fiber development. Among these, ∼5,000 genes are differentially expressed during primary and secondary cell wall synthesis between wild and domesticated cottons, with a biased distribution among chromosomes. Transcriptome data implicate a number of biological processes affected by human selection, and suggest that the domestication process has prolonged the duration of fiber elongation in modern cultivated forms. Functional analysis suggested that wild cottons allocate greater resources to stress response pathways, while domestication led to reprogrammed resource allocation toward increased fiber growth, possibly through modulating stress-response networks. This first global transcriptomic analysis using multiple accessions of wild and domesticated cottons is an important step toward a more comprehensive systems perspective on cotton fiber evolution. The understanding that human selection over the past 5,000+ years has dramatically re-wired the cotton fiber transcriptome sets the stage for a deeper understanding of the genetic architecture underlying cotton fiber synthesis and phenotypic evolution.
Ever since Darwin, biologists have recognized that comparative study of crop plants and their wild relatives offers a powerful framework for generating insights into the mechanisms that underlie evolutionary change. Here, we study the domestication process in cotton, Gossypium hirsutum, an allopolyploid species (containing two different genomes) which initially was domesticated approximately 5000 years ago, and which primarily is grown for its single-celled seed fibers. Strong directional selection over the millennia was accompanied by transformation of the short, coarse, and brown fibers of wild plants into the long, strong, and fine white fibers of the modern cotton crop plant. To explore the evolutionary genetics of cotton domestication, we conducted transcriptome profiling of developing cotton fibers from multiple accessions of wild and domesticated cottons. Comparative analysis revealed that the domestication process dramatically rewired the transcriptome, affecting more than 5,000 genes, and with a more evenly balanced usage of the duplicated copies arising from genome doubling. We identify many different biological processes that were involved in this transformation, including those leading to a prolongation of fiber elongation and a reallocation of resources toward increased fiber growth in modern forms. The data provide a rich resource for future functional analyses targeting crop improvement and evolutionary objectives.
Affiliation(s)
- Mi-Jeong Yoo
- Department of Biology, University of Florida, Gainesville, Florida, United States of America
- Jonathan F. Wendel
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa, United States of America
|
20
|
Bowman MJ, Park W, Bauer PJ, Udall JA, Page JT, Raney J, Scheffler BE, Jones DC, Campbell BT. RNA-Seq transcriptome profiling of upland cotton (Gossypium hirsutum L.) root tissue under water-deficit stress. PLoS One 2013; 8:e82634. [PMID: 24324815 PMCID: PMC3855774 DOI: 10.1371/journal.pone.0082634] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Received: 08/03/2013] [Accepted: 11/04/2013] [Indexed: 11/19/2022]
Abstract
An RNA-Seq experiment was performed using field grown well-watered and naturally rain fed cotton plants to identify differentially expressed transcripts under water-deficit stress. Our work constitutes the first application of the newly published diploid D5 Gossypium raimondii sequence in the study of tetraploid AD1 upland cotton RNA-seq transcriptome analysis. A total of 1,530 transcripts were differentially expressed between well-watered and water-deficit stressed root tissues, in patterns that confirm the accuracy of this technique for future studies in cotton genomics. Additionally, putative sequence based genome localization of differentially expressed transcripts detected A2 genome specific gene expression under water-deficit stress. These data will facilitate efforts to understand the complex responses governing transcriptomic regulatory mechanisms and to identify candidate genes that may benefit applied plant breeding programs.
Affiliation(s)
- Megan J. Bowman
- USDA-ARS, Coastal Plains Soil, Water and Plant Research Center, Florence, South Carolina, United States of America
- Wonkeun Park
- USDA-ARS, Coastal Plains Soil, Water and Plant Research Center, Florence, South Carolina, United States of America
- Clemson University Pee Dee Research and Education Center, Florence, South Carolina, United States of America
- Philip J. Bauer
- USDA-ARS, Coastal Plains Soil, Water and Plant Research Center, Florence, South Carolina, United States of America
- Joshua A. Udall
- Brigham Young University, Provo, Utah, United States of America
- Justin T. Page
- Brigham Young University, Provo, Utah, United States of America
- Joshua Raney
- Brigham Young University, Provo, Utah, United States of America
- Don. C. Jones
- Cotton Incorporated, Agricultural and Environmental Research, Cary, North Carolina, United States of America
- B. Todd Campbell
- USDA-ARS, Coastal Plains Soil, Water and Plant Research Center, Florence, South Carolina, United States of America
|
21
|
Yu J, Jung S, Cheng CH, Ficklin SP, Lee T, Zheng P, Jones D, Percy RG, Main D. CottonGen: a genomics, genetics and breeding database for cotton research. Nucleic Acids Res 2013; 42:D1229-36. [PMID: 24203703 PMCID: PMC3964939 DOI: 10.1093/nar/gkt1064] [Citation(s) in RCA: 185] [Impact Index Per Article: 16.8] [Indexed: 11/25/2022]
Abstract
CottonGen (http://www.cottongen.org) is a curated and integrated web-based relational database providing access to publicly available genomic, genetic and breeding data for cotton. CottonGen supersedes CottonDB and the Cotton Marker Database, with enhanced tools for easier data sharing, mining, visualization and data retrieval of cotton research data. CottonGen contains annotated whole genome sequences, unigenes from expressed sequence tags (ESTs), markers, trait loci, genetic maps, genes, taxonomy, germplasm, publications and communication resources for the cotton community. Annotated whole genome sequences of Gossypium raimondii are available with aligned genetic markers and transcripts. These whole genome data can be accessed through genome pages, search tools and GBrowse, a popular genome browser. Most of the published cotton genetic maps can be viewed and compared using CMap, a comparative map viewer, and are searchable via map search tools. Search tools also exist for markers, quantitative trait loci (QTLs), germplasm, publications and trait evaluation data. CottonGen also provides online analysis tools such as NCBI BLAST and Batch BLAST.
Affiliation(s)
- Jing Yu
- Department of Horticulture, Washington State University, Pullman, WA 99164-6414, USA, Cotton Incorporated, Cary, NC 27513, USA and Crop Germplasm Research Unit, USDA-ARS-SPARC, College Station, TX 77845, USA
|
22
|
Generation and analysis of a large-scale expressed sequence Tag database from a full-length enriched cDNA library of developing leaves of Gossypium hirsutum L. PLoS One 2013; 8:e76443. [PMID: 24146870 PMCID: PMC3795732 DOI: 10.1371/journal.pone.0076443] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Received: 04/15/2013] [Accepted: 08/24/2013] [Indexed: 11/21/2022]
Abstract
Background: Cotton (Gossypium hirsutum L.) is one of the world’s most economically important crops. However, its entire genome has not been sequenced, and limited resources are available in GenBank for understanding the molecular mechanisms underlying leaf development and senescence. Methodology/Principal Findings: In this study, 9,874 high-quality ESTs were generated from a normalized, full-length cDNA library derived from pooled RNA isolated from throughout leaf development during the plant blooming stage. After clustering and assembly of these ESTs, 5,191 unique sequences, representing 1,652 contigs and 3,539 singletons, were obtained. The average unique sequence length was 682 bp. Annotation of these unique sequences revealed that 84.4% showed significant homology to sequences in the NCBI non-redundant protein database, and 57.3% had significant hits to known proteins in the Swiss-Prot database. Comparative analysis indicated that our library added 2,400 ESTs and 991 unique sequences to those known for cotton. The unigenes were functionally characterized by gene ontology annotation. We identified 1,339 and 200 unigenes as potential leaf senescence-related genes and transcription factors, respectively. Moreover, nine genes related to leaf senescence and eleven MYB transcription factors were randomly selected for quantitative real-time PCR (qRT-PCR), which revealed that these genes were regulated differentially during senescence. The qRT-PCR for three GhYLSs revealed that these genes are expressed preferentially in senescent leaves. Conclusions/Significance: These EST resources will provide valuable sequence information for gene expression profiling analyses and functional genomics studies to elucidate their roles, as well as for studying the mechanisms of leaf development and senescence in cotton and discovering candidate genes related to important agronomic traits of cotton. These data will also facilitate future whole-genome sequence assembly and annotation in G. hirsutum and comparative genomics among Gossypium species.
|
23
|
Buyyarapu R, Kantety RV, Yu JZ, Xu Z, Kohel RJ, Percy RG, Macmil S, Wiley GB, Roe BA, Sharma GC. BAC-pool sequencing and analysis of large segments of A12 and D12 homoeologous chromosomes in upland cotton. PLoS One 2013; 8:e76757. [PMID: 24116150 PMCID: PMC3792896 DOI: 10.1371/journal.pone.0076757] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 03/25/2013] [Accepted: 08/28/2013] [Indexed: 11/19/2022]
Abstract
Although new and emerging next-generation sequencing (NGS) technologies have reduced sequencing costs significantly, much work remains to implement them for de novo sequencing of complex and highly repetitive genomes such as the tetraploid genome of Upland cotton (Gossypium hirsutum L.). Herein we report the results from implementing a novel, hybrid Sanger/454-based BAC-pool sequencing strategy using minimum tiling path (MTP) BACs from Ctg-3301 and Ctg-465, two large genomic segments in A12 and D12 homoeologous chromosomes (Ctg). To enable generation of longer contig sequences in assembly, we implemented a hybrid assembly method to process ~35x data from 454 technology and 2.8-3x data from Sanger method. Hybrid assemblies offered higher sequence coverage and better sequence assemblies. Homology studies revealed the presence of retrotransposon regions like Copia and Gypsy elements in these contigs and also helped in identifying new genomic SSRs. Unigenes were anchored to the sequences in Ctg-3301 and Ctg-465 to support the physical map. Gene density, gene structure and protein sequence information derived from protein prediction programs were used to obtain the functional annotation of these genes. Comparative analysis of both contigs with Arabidopsis genome exhibited synteny and microcollinearity with a conserved gene order in both genomes. This study provides insight about use of MTP-based BAC-pool sequencing approach for sequencing complex polyploid genomes with limited constraints in generating better sequence assemblies to build reference scaffold sequences. Combining the utilities of MTP-based BAC-pool sequencing with current longer and short read NGS technologies in multiplexed format would provide a new direction to cost-effectively and precisely sequence complex plant genomes.
Affiliation(s)
- Ramesh Buyyarapu
- Center for Molecular Biology, Department of Biological and Environmental Sciences, Alabama Agricultural & Mechanical University, Normal, Alabama, United States of America
- Ramesh V. Kantety
- Center for Molecular Biology, Department of Biological and Environmental Sciences, Alabama Agricultural & Mechanical University, Normal, Alabama, United States of America
- John Z. Yu
- United States Department of Agriculture, Agricultural Research Service, Southern Plains Agricultural Research Center, Crop Germplasm Research Unit, College Station, Texas, United States of America
- Zhanyou Xu
- United States Department of Agriculture, Agricultural Research Service, Southern Plains Agricultural Research Center, Crop Germplasm Research Unit, College Station, Texas, United States of America
- Russell J. Kohel
- United States Department of Agriculture, Agricultural Research Service, Southern Plains Agricultural Research Center, Crop Germplasm Research Unit, College Station, Texas, United States of America
- Richard G. Percy
- United States Department of Agriculture, Agricultural Research Service, Southern Plains Agricultural Research Center, Crop Germplasm Research Unit, College Station, Texas, United States of America
- Simone Macmil
- Gene Structure and Function Laboratory, University of Otago, Dunedin, New Zealand
- Graham B. Wiley
- Arthritis & Immunology Department, Oklahoma Medical Research Foundation, Oklahoma City, Oklahoma, United States of America
- Bruce A. Roe
- Advanced Center for Genome Technology, University of Oklahoma, Norman, Oklahoma, United States of America
- Govind C. Sharma
- Center for Molecular Biology, Department of Biological and Environmental Sciences, Alabama Agricultural & Mechanical University, Normal, Alabama, United States of America
|
24
|
Insights into the evolution of cotton diploids and polyploids from whole-genome re-sequencing. G3-GENES GENOMES GENETICS 2013; 3:1809-18. [PMID: 23979935 PMCID: PMC3789805 DOI: 10.1534/g3.113.007229] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.3] [Indexed: 11/22/2022]
Abstract
Understanding the composition, evolution, and function of the Gossypium hirsutum (cotton) genome is complicated by the joint presence of two genomes in its nucleus (AT and DT genomes). These two genomes were derived from progenitor A-genome and D-genome diploids involved in ancestral allopolyploidization. To better understand the allopolyploid genome, we re-sequenced the genomes of extant diploid relatives that contain the A1 (Gossypium herbaceum), A2 (Gossypium arboreum), or D5 (Gossypium raimondii) genomes. We conducted a comparative analysis using deep re-sequencing of multiple accessions of each diploid species and identified 24 million SNPs between the A-diploid and D-diploid genomes. These analyses facilitated the construction of a robust index of conserved SNPs between the A-genomes and D-genomes at all detected polymorphic loci. This index is widely applicable for read mapping efforts of other diploid and allopolyploid Gossypium accessions. Further analysis also revealed locations of putative duplications and deletions in the A-genome relative to the D-genome reference sequence. The approximately 25,400 deleted regions included more than 50% deletion of 978 genes, including many involved with starch synthesis. In the polyploid genome, we also detected 1,472 conversion events between homoeologous chromosomes, including events that overlapped 113 genes. Continued characterization of the Gossypium genomes will further enhance our ability to manipulate fiber and agronomic production of cotton.
|
25
|
Liu Chuanxiang, Yuan Daojun, Zhang Xianlong, Lin Zhongxu. Isolation, characterization and mapping of genes differentially expressed during fibre development between Gossypium hirsutum and G. barbadense by cDNA-SRAP. J Genet 2013. [DOI: 10.1007/s12041-013-0238-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Indexed: 12/17/2022]
|
26
|
Shang H, Li W, Zou C, Yuan Y. Analyses of the NAC transcription factor gene family in Gossypium raimondii Ulbr.: chromosomal location, structure, phylogeny, and expression patterns. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2013; 55:663-76. [PMID: 23756542 DOI: 10.1111/jipb.12085] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Received: 04/01/2013] [Accepted: 06/02/2012] [Indexed: 05/18/2023]
Abstract
NAC domain proteins are plant-specific transcription factors known to play diverse roles in various plant developmental processes. In the present study, we performed the first comprehensive study of the NAC gene family in Gossypium raimondii Ulbr., incorporating phylogenetic, chromosomal location, gene structure, conserved motif, and expression profiling analyses. We identified 145 NAC transcription factor (NAC-TF) genes that were phylogenetically clustered into 18 distinct subfamilies. Of these, 127 NAC-TF genes were distributed across the 13 chromosomes, 80 (55%) were preferentially retained duplicates located in both duplicated regions and six were located in triplicated chromosomal regions. The majority of NAC-TF genes showed temporal-, spatial-, and tissue-specific expression patterns based on transcriptomic and qRT-PCR analyses. However, the expression patterns of several duplicate genes were partially redundant, suggesting the occurrence of sub-functionalization during their evolution. Based on their genomic organization, we concluded that genomic duplications contributed significantly to the expansion of the NAC-TF gene family in G. raimondii. Comprehensive analysis of their expression profiles could provide novel insights into the functional divergence among members of the NAC gene family in G. raimondii.
Affiliation(s)
- Haihong Shang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, the Chinese Academy of Agricultural Sciences, Anyang, 455004, China
|
27
|
Jin X, Li Q, Xiao G, Zhu YX. Using genome-referenced expressed sequence tag assembly to analyze the origin and expression patterns of Gossypium hirsutum transcripts. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2013; 55:576-85. [PMID: 23675784 DOI: 10.1111/jipb.12066] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Received: 03/29/2013] [Accepted: 05/12/2013] [Indexed: 05/09/2023]
Abstract
We assembled a total of 297,239 Gossypium hirsutum (Gh, a tetraploid cotton, AADD) expressed sequence tag (EST) sequences that were available in the National Center for Biotechnology Information database, with reference to the recently published G. raimondii (Gr, a diploid cotton, DD) genome, and obtained 49,125 UniGenes. The average length of the UniGenes increased from 804 and 791 bp in two previous EST assemblies to 1,019 bp in the current analysis. The number of putative cotton UniGenes with lengths of 3 kb or more increased from 25 or 34 to 1,223. As a result, thousands of originally independent G. hirsutum ESTs were aligned to produce large contigs encoding transcripts with very long open reading frames, indicating that the G. raimondii genome sequence provided remarkable advantages for assembling the tetraploid cotton transcriptome. Significantly different distribution patterns within several GO terms, including transcription factor activity, were observed between D- and A-derived assemblies. Transcriptome analysis showed that, in a tetraploid cotton cell, 29,547 UniGenes were possibly derived from the D subgenome while another 19,578 may come from the A subgenome. Finally, some of the in silico data were confirmed by reverse transcription polymerase chain reaction experiments showing the changes in transcript levels for several gene families known to play key roles in cotton fiber development. We believe that our work provides a useful platform for functional and evolutionary genomic studies in cotton.
Affiliation(s)
- Xiang Jin
- State Key Laboratory of Protein and Plant Gene Research, College of Life Sciences, Peking University, Beijing 100871, China
|
28
|
Tan J, Tu L, Deng F, Hu H, Nie Y, Zhang X. A genetic and metabolic analysis revealed that cotton fiber cell development was retarded by flavonoid naringenin. PLANT PHYSIOLOGY 2013; 162:86-95. [PMID: 23535943 PMCID: PMC3641232 DOI: 10.1104/pp.112.212142] [Citation(s) in RCA: 84] [Impact Index Per Article: 7.6] [Received: 12/04/2012] [Accepted: 03/24/2013] [Indexed: 05/18/2023]
Abstract
The cotton (Gossypium spp.) fiber is a unique elongated cell that is useful for investigating cell differentiation. Previous studies have demonstrated the importance of factors such as sugar metabolism, the cytoskeleton, and hormones, which are commonly known to be involved in plant cell development, while the secondary metabolites have been less regarded. By mining public data and comparing analyses of fiber from two cotton species (Gossypium hirsutum and Gossypium barbadense), we found that the flavonoid metabolism is active in early fiber cell development. Different flavonoids exhibited distinct effects on fiber development during ovule culture; among them, naringenin (NAR) could significantly retard fiber development. NAR is a substrate of flavanone 3-hydroxylase (F3H), and silencing the F3H gene significantly increased the NAR content of fiber cells. Fiber development was suppressed following F3H silencing, but the overexpression of F3H caused no obvious effects. Significant retardation of fiber growth was observed after the introduction of the F3H-RNA interference segment into the high-flavonoid brown fiber G. hirsutum T586 line by cross. A greater accumulation of NAR as well as much shorter fibers were also observed in the BC1 generation plants. These results suggest that NAR is negatively associated with fiber development and that the metabolism mediated by F3H is important in fiber development, thus highlighting that flavonoid metabolism represents a novel pathway with the potential for cotton fiber improvement.
Affiliation(s)
- Jiafu Tan
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei 430070, China
- Lili Tu
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei 430070, China
- Fenglin Deng
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei 430070, China
- Haiyan Hu
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei 430070, China
- Yichun Nie
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei 430070, China
- Xianlong Zhang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei 430070, China
|
29
|
Lv Y, Zhao L, Xu X, Wang L, Wang C, Zhang T, Guo W. Characterization of expressed sequence tags from developing fibers of Gossypium barbadense and evaluation of insertion-deletion variation in tetraploid cultivated cotton species. BMC Genomics 2013; 14:170. [PMID: 23497047 PMCID: PMC3600364 DOI: 10.1186/1471-2164-14-170] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Received: 10/03/2012] [Accepted: 03/06/2013] [Indexed: 12/21/2022] Open
Abstract
Background: Cotton is the leading fiber crop worldwide. Gossypium barbadense is an important species of cotton because of its extra-long staple fibers with superior luster and silkiness. However, cDNA sequences from G. barbadense fiber development have not yet been systematically analyzed or utilized.
Results: A total of 21,079 high-quality sequences were generated from two non-normalized cDNA libraries prepared from a mixture of G. barbadense Hai7124 fibers and ovules. After assembly, a set of 8,653 unigenes was obtained. Of those, 7,786 were matched to known proteins and 7,316 were assigned to functional categories. The molecular functions of these unigenes were mostly related to binding and catalytic activity, and carbohydrate, amino acid, and energy metabolism were the major contributors among the metabolism subsets. Sequence comparison between G. barbadense and G. hirsutum revealed that 8,245 G. barbadense unigenes were similar to publicly released G. hirsutum sequences, whereas the remaining 408 sequences had no hits against the G. hirsutum unigene database. Furthermore, 13,275 putative EST InDel loci involved in the orthologous and/or homoeologous differences between/within G. barbadense and G. hirsutum were discovered by in silico analyses, and 2,160 InDel markers were developed from ESTs with more than five insertions or deletions. By gel electrophoresis combined with sequencing verification, 71.11% of the candidate InDel loci were reconfirmed as orthologous and/or homoeologous polymorphisms using G. hirsutum acc. TM-1 and G. barbadense cv. Hai7124. Blastx results showed that, among the 2,160 InDel loci, 81 had significant functional similarity to known genes associated with the secondary wall synthesis process, indicating important roles in fiber quality in tetraploid cultivated cotton species.
Conclusion: Sequence comparisons and InDel marker development will lay the groundwork for promoting the identification of genes related to superior agronomic traits, genetic differentiation and comparative genomic studies between G. hirsutum and G. barbadense.
Affiliation(s)
- Yuanda Lv
- National Key Laboratory of Crop Genetics & Germplasm Enhancement, Cotton Research Institute, Nanjing Agricultural University, Nanjing, 210095, China
|
30
|
Whole-genome thermodynamic analysis reduces siRNA off-target effects. PLoS One 2013; 8:e58326. [PMID: 23484018 PMCID: PMC3590146 DOI: 10.1371/journal.pone.0058326] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Received: 10/22/2012] [Accepted: 02/01/2013] [Indexed: 11/19/2022] Open
Abstract
Small interfering RNAs (siRNAs) are important tools for knocking down targeted genes, and have been widely applied to biological and biomedical research. To design siRNAs, two important aspects must be considered: the potency in knocking down target genes and the off-target effect on any nontarget genes. Although many studies have produced useful tools to design potent siRNAs, off-target prevention has mostly been delegated to sequence-level alignment tools such as BLAST. We hypothesize that whole-genome thermodynamic analysis can identify potential off-targets with higher precision and help us avoid siRNAs that may have strong off-target effects. To validate this hypothesis, two siRNA sets were designed to target three human genes IDH1, ITPR2 and TRIM28. They were selected from the output of two popular siRNA design tools, siDirect and siDesign. Both siRNA design tools have incorporated sequence-level screening to avoid off-targets, thus their output is believed to be optimal. However, one of the sets we tested has off-target genes predicted by Picky, a whole-genome thermodynamic analysis tool. Picky can identify off-target genes that may hybridize to a siRNA within a user-specified melting temperature range. Our experiments validated that some off-target genes predicted by Picky can indeed be inhibited by siRNAs. Similar experiments were performed using commercially available siRNAs and a few off-target genes were also found to be inhibited as predicted by Picky. In summary, we demonstrate that whole-genome thermodynamic analysis can identify off-target genes that are missed in sequence-level screening. Because Picky prediction is deterministic according to thermodynamics, if a siRNA candidate has no Picky predicted off-targets, it is unlikely to cause off-target effects. Therefore, we recommend including Picky as an additional screening step in siRNA design.
|
31
|
PolyCat: a resource for genome categorization of sequencing reads from allopolyploid organisms. G3-GENES GENOMES GENETICS 2013; 3:517-25. [PMID: 23450226 PMCID: PMC3583458 DOI: 10.1534/g3.112.005298] [Citation(s) in RCA: 68] [Impact Index Per Article: 6.2] [Received: 12/07/2012] [Accepted: 01/10/2013] [Indexed: 01/05/2023]
Abstract
Read mapping is a fundamental part of next-generation genomic research but is complicated by genome duplication in many plants. Categorizing DNA sequence reads into their respective genomes enables current methods to analyze polyploid genomes as if they were diploid. We present PolyCat, a pipeline for mapping and categorizing all types of next-generation sequence data produced from allopolyploid organisms. PolyCat uses GSNAP's single-nucleotide polymorphism (SNP)-tolerant mapping to minimize the mapping efficiency bias caused by SNPs between genomes. PolyCat then uses SNPs between genomes to categorize reads according to their respective genomes. Bisulfite-treated reads have a significant reduction in nucleotide complexity because nucleotide conversion events are confounded with transition substitutions. PolyCat includes special provisions to properly handle bisulfite-treated data. We demonstrate the functionality of PolyCat on allotetraploid cotton, Gossypium hirsutum, and create a functional SNP index for efficiently mapping sequence reads to the D-genome sequence of G. raimondii. PolyCat is appropriate for all allopolyploids and all types of next-generation genome analysis, including differential expression (RNA sequencing), differential methylation (bisulfite sequencing), differential DNA-protein binding (chromatin immunoprecipitation sequencing), and population diversity.
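The categorization step described in this abstract, assigning mapped reads to a genome by the alleles they carry at SNPs between genomes, can be illustrated with a minimal sketch. This is not PolyCat's implementation: the SNP index contents, the function name `categorize_read`, the category labels, and the restriction to ungapped reads are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical SNP index (illustrative values only):
# reference position -> (A-genome base, D-genome base).
SNP_INDEX = {
    101: ("A", "G"),
    150: ("C", "T"),
    212: ("G", "A"),
}


def categorize_read(start, sequence, snp_index=SNP_INDEX):
    """Assign an ungapped mapped read to 'A', 'D', 'chimeric' or 'unassigned'.

    start is the 1-based reference position of the read's first base.
    """
    votes = Counter()
    for offset, base in enumerate(sequence):
        alleles = snp_index.get(start + offset)
        if alleles is None:
            continue  # position is not in the SNP index
        a_base, d_base = alleles
        if base == a_base:
            votes["A"] += 1
        elif base == d_base:
            votes["D"] += 1
        # any other base (e.g. a sequencing error) casts no vote
    if votes["A"] and votes["D"]:
        return "chimeric"  # read carries alleles from both genomes
    if votes["A"]:
        return "A"
    if votes["D"]:
        return "D"
    return "unassigned"  # read overlaps no informative SNP
```

For example, under these assumed index values `categorize_read(100, "TAC")` covers position 101 with the A-genome base and is assigned to `"A"`, while a read that overlaps no indexed position stays `"unassigned"`.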
|
32
|
Zhang B, Yang YW, Zhang Y, Liu JY. A high-confidence reference dataset of differentially expressed proteins in elongating cotton fiber cells. Proteomics 2013; 13:1159-63. [DOI: 10.1002/pmic.201200176] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Received: 05/29/2012] [Revised: 10/16/2012] [Accepted: 01/09/2013] [Indexed: 11/09/2022]
Affiliation(s)
- Bing Zhang
- Laboratory of Molecular Biology and MOE Laboratory of Protein Science; School of Life Sciences; Tsinghua University; Beijing P. R. China
- Yi-Wei Yang
- Laboratory of Molecular Biology and MOE Laboratory of Protein Science; School of Life Sciences; Tsinghua University; Beijing P. R. China
- Yu Zhang
- Laboratory of Molecular Biology and MOE Laboratory of Protein Science; School of Life Sciences; Tsinghua University; Beijing P. R. China
- Jin-Yuan Liu
- Laboratory of Molecular Biology and MOE Laboratory of Protein Science; School of Life Sciences; Tsinghua University; Beijing P. R. China
|
33
|
Liu C, Yuan D, Zhang X, Lin Z. Isolation, characterization and mapping of genes differentially expressed during fibre development between Gossypium hirsutum and G. barbadense by cDNA-SRAP. J Genet 2013; 92:175-181. [PMID: 23970073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/02/2023]
Abstract
Gossypium hirsutum and G. barbadense are two cultivated tetraploid cotton species with differences in fibre quality. The fibre of G. barbadense is longer, stronger and finer than that of G. hirsutum. To isolate genes expressed differently between the two species during fibre development, cDNA-SRAP (sequence-related amplified polymorphism) was applied. This technique was used to analyse genes at different stages of fibre development in G. hirsutum cv. Emian22 and G. barbadense acc. 3-79, the parents of our interspecific mapping population. A total of 4,096 SRAP primer combinations were used to screen for polymorphism between the DNA of the parents, and 275 highly polymorphic primers were selected to analyse DNA and RNA from leaves and fibres at different developmental stages of the parents. A total of 168 DNA fragments were isolated from gels and sequenced: 54, 30, 38 and 41 from fibres of 5, 10, 15 and 20 days post-anthesis, respectively, and five from multiple stages. To genetically map these sequences, 104 sequence-specific primers were developed and used to screen for polymorphism between the mapping parents. Finally, six markers were mapped on six chromosomes of our backbone interspecific genetic map. This work provides preliminary knowledge of the differences in the mechanisms of fibre development between G. hirsutum and G. barbadense.
Affiliation(s)
- Chuanxiang Liu
- National Key Laboratory of Crop Genetic Improvement and National Centre of Plant Gene Research, Huazhong Agricultural University, Wuhan 430070, Hubei, People's Republic of China.
|
34
|
Kumari A, Kumar A, Wany A, Prajapati GK, Pandey DM. Identification and annotation of abiotic stress responsive candidate genes in peanut ESTs. Bioinformation 2012; 8:1211-9. [PMID: 23275722 PMCID: PMC3530874 DOI: 10.6026/97320630081211] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Received: 10/30/2012] [Accepted: 11/11/2012] [Indexed: 11/23/2022] Open
Abstract
Peanut (Arachis hypogaea L.) ranks fifth among the world's oil crops and is widely grown in India and neighbouring countries. Because of its large genome of unknown size, genomic and genetic-modification studies of peanut remain scarce compared with those of other model crops such as Arabidopsis, rice, cotton and soybean. Because peanut is well suited to cultivation in semi-arid regions, studying its abiotic stress responsive genes and their regulation is important. We therefore aimed to identify and annotate abiotic stress responsive candidate genes in peanut ESTs. Expression data for drought stress responsive genes and the corresponding EST sequences were screened from the dot blot experiments reported by Govind et al. (2009), where they are shown as heat maps and supplementary tables, respectively. For screened genes with no EST information in those supplementary tables, sequences were retrieved from NCBI. A phylogenetic analysis was performed to find the group of most similar ESTs for each selected gene. Each EST in that group was then searched against the peanut ESTs (178,490 whole EST sequences) using standalone BLAST. For the prediction and annotation of the selected abiotic stress responsive genes, various tools (such as VecScreen, RepeatMasker, EST-Trimmer, DNA Baser, WISE2 and I-TASSER) were used. Here we report the predicted contigs, domains and 3D structures for the HSP 17.3 kDa protein, DnaJ protein and Type 2 Metallothionein protein.
Affiliation(s)
- Archana Kumari
- Department of Biotechnology, Birla Institute of Technology, Mesra, Ranchi, Jharkhand-835215, India
- Ashutosh Kumar
- Department of Biotechnology, Birla Institute of Technology, Mesra, Ranchi, Jharkhand-835215, India
- Aakanksha Wany
- Department of Biotechnology, Birla Institute of Technology, Mesra, Ranchi, Jharkhand-835215, India
- Gopal Kumar Prajapati
- Department of Biotechnology, Birla Institute of Technology, Mesra, Ranchi, Jharkhand-835215, India
- Dev Mani Pandey
- Department of Biotechnology, Birla Institute of Technology, Mesra, Ranchi, Jharkhand-835215, India
|
35
|
Lacape JM, Claverie M, Vidal RO, Carazzolle MF, Guimarães Pereira GA, Ruiz M, Pré M, Llewellyn D, Al-Ghazi Y, Jacobs J, Dereeper A, Huguet S, Giband M, Lanaud C. Deep sequencing reveals differences in the transcriptional landscapes of fibers from two cultivated species of cotton. PLoS One 2012; 7:e48855. [PMID: 23166598 PMCID: PMC3499527 DOI: 10.1371/journal.pone.0048855] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Received: 04/27/2012] [Accepted: 10/01/2012] [Indexed: 01/17/2023] Open
Abstract
Cotton (Gossypium) fiber is the most prevalent natural product used in the textile industry. The two major cultivated species, G. hirsutum (Gh) and G. barbadense (Gb), are allotetraploids with contrasting fiber quality properties. To better understand the molecular basis for their fiber differences, EST pyrosequencing was used to document the fiber transcriptomes at two key development stages, 10 days post anthesis (dpa), representing the peak of fiber elongation, and 22 dpa, representing the transition to secondary cell wall synthesis. The 617,000 high quality reads (89% of the total 692,000 reads) from 4 libraries were assembled into 46,072 unigenes, comprising 38,297 contigs and 7,775 singletons. Functional annotation of the unigenes together with comparative digital gene expression (DGE) revealed a diverse set of functions and processes that were partly linked to specific fiber stages. Globally, 2,770 contigs (7%) showed differential expression (>2-fold) between 10 and 22 dpa (irrespective of genotype), with 70% more highly expressed at 10 dpa, while 2,248 (6%) were differentially expressed between the genotypes (irrespective of stage). The most significant genes with differential DGE at 10 dpa included expansins and lipid transfer proteins (higher in Gb), while at 22 dpa tubulins, cellulose, and sucrose synthases showed higher expression in Gb. DGE was compared with expression data of 10 dpa-old fibers from Affymetrix microarrays. Among 543 contigs showing differential expression on both platforms, 74% were consistent in being either over-expressed in Gh (242 genes) or in Gb (161 genes). Furthermore, the unigene set served to identify 339 new SSRs and close to 21,000 inter-genotypic SNPs. Subsets of 88 SSRs and 48 SNPs were validated through mapping and added 65 new loci to a RIL genetic map. The new set of fiber ESTs and the gene-based markers complement existing available resources useful in basic and applied research for crop improvement in cotton.
|
36
|
Targeted capture of homoeologous coding and noncoding sequence in polyploid cotton. G3-GENES GENOMES GENETICS 2012; 2:921-30. [PMID: 22908041 PMCID: PMC3411248 DOI: 10.1534/g3.112.003392] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Received: 04/02/2012] [Accepted: 06/15/2012] [Indexed: 12/30/2022]
Abstract
Targeted sequence capture is a promising technology in many areas in biology. These methods enable efficient and relatively inexpensive sequencing of hundreds to thousands of genes or genomic regions from many more individuals than is practical using whole-genome sequencing approaches. Here, we demonstrate the feasibility of target enrichment using sequence capture in polyploid cotton. To capture and sequence both members of each gene pair (homeologs) of wild and domesticated Gossypium hirsutum, we created custom hybridization probes to target 1000 genes (500 pairs of homeologs) using information from the cotton transcriptome. Two widely divergent samples of G. hirsutum were hybridized to four custom NimbleGen capture arrays containing probes for targeted genes. We show that the two coresident homeologs in the allopolyploid nucleus were efficiently captured with high coverage. The capture efficiency was similar between the two accessions and independent of whether the samples were multiplexed. A significant amount of flanking, nontargeted sequence (untranslated regions and introns) was also captured and sequenced along with the targeted exons. Intraindividual heterozygosity is low in both wild and cultivated Upland cotton, as expected from the high level of inbreeding in natural G. hirsutum and bottlenecks accompanying domestication. In addition, levels of heterozygosity appeared asymmetrical with respect to genome (AT or DT) in cultivated cotton. The approach used here is general, scalable, and may be adapted for many different research inquiries involving polyploid plant genomes.
|
38
|
Flagel LE, Wendel JF, Udall JA. Duplicate gene evolution, homoeologous recombination, and transcriptome characterization in allopolyploid cotton. BMC Genomics 2012; 13:302. [PMID: 22768919 PMCID: PMC3427041 DOI: 10.1186/1471-2164-13-302] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.4] [Received: 06/09/2011] [Accepted: 07/06/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Modern allotetraploid cotton contains an "A" and "D" genome from an ancestral polyploidy event that occurred approximately 1-2 million years ago. Diploid A- and D-genome species can be compared to the A- and D-genomes found within these allotetraploids to make evolutionary inferences about polyploidy. In this paper we present a comprehensive EST assembly derived from diploid and model allotetraploid cottons and demonstrate several evolutionary inferences regarding genic evolution that can be drawn from these data. RESULTS We generated a set of cotton expressed sequence tags (ESTs), comprising approximately 4.4 million Sanger and next-generation (454) transcripts supplemented by approximately 152 million Illumina reads from diploid and allotetraploid cottons. From the EST alignments we inferred 259,192 genome-specific single nucleotide polymorphisms (SNPs). Molecular evolutionary analyses of protein-coding regions demonstrate that the rate of nucleotide substitution has increased among both allotetraploid genomes relative to the diploids, and that the ratio of nonsynonymous to synonymous substitutions has increased in one of the two polyploid lineages we sampled. We also use these SNPs to show that a surprisingly high percentage of duplicate genes (~7 %) show a signature of non-independent evolution in the allotetraploid nucleus, having experienced one or more episodes of nonreciprocal homoeologous recombination (NRHR). CONCLUSIONS In this study we characterize the functional and mutational properties of the cotton transcriptome, produce a large genome-specific SNP database, and detect illegitimate genetic exchanges between duplicate genomes sharing a common allotetraploid nucleus. Our findings have important implications for our understanding of the consequences of polyploidy and duplicate gene evolution. 
We demonstrate that cotton genes have experienced an increased rate of molecular evolution following duplication by polyploidy, and that polyploidy has enabled considerable levels of nonreciprocal exchange between homoeologous genes.
Affiliation(s)
- Lex E Flagel
- Department of Biology, Duke University, Durham, NC 27708, USA
|
39
|
Park W, Scheffler BE, Bauer PJ, Campbell BT. Genome-wide identification of differentially expressed genes under water deficit stress in upland cotton (Gossypium hirsutum L.). BMC PLANT BIOLOGY 2012; 12:90. [PMID: 22703539 PMCID: PMC3438127 DOI: 10.1186/1471-2229-12-90] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Received: 09/23/2011] [Accepted: 05/10/2012] [Indexed: 05/17/2023]
Abstract
BACKGROUND: Cotton is the world's primary fiber crop and is a major agricultural commodity in over 30 countries. Like many other global commodities, sustainable cotton production is challenged by restricted natural resources. In response to the anticipated increase of agricultural water demand, a major research direction involves developing crops that use less water or that use water more efficiently. In this study, our objective was to identify differentially expressed genes in response to water deficit stress in cotton. A global expression analysis using cDNA-Amplified Fragment Length Polymorphism was conducted to compare root and leaf gene expression profiles from a putative drought resistant cotton cultivar grown under water deficit stressed and well watered field conditions.
RESULTS: We identified a total of 519 differentially expressed transcript derived fragments. Of these, 147 transcript derived fragment sequences were functionally annotated according to their gene ontology. Nearly 70 percent of transcript derived fragments belonged to four major categories: 1) unclassified, 2) stress/defense, 3) metabolism, and 4) gene regulation. We found heat shock protein-related and reactive oxygen species-related transcript derived fragments to be among the major parts of functional pathways induced by water deficit stress. Also, twelve novel transcripts were identified as both water deficit responsive and cotton specific. A subset of differentially expressed transcript derived fragments was verified using reverse transcription-polymerase chain reaction. Differential expression analysis also identified five pairs of duplicated transcript derived fragments in which four pairs responded differentially between each of their two homologues under water deficit stress.
CONCLUSIONS: In this study, we detected differentially expressed transcript derived fragments from water deficit stressed root and leaf tissues in tetraploid cotton and provided their gene ontology, functional/biological distribution, and possible roles of gene duplication. This discovery demonstrates complex mechanisms involved with polyploid cotton's transcriptome response to naturally occurring field water deficit stress. The genes identified in this study will provide candidate targets to manipulate the water use characteristics of cotton at the molecular level.
Affiliation(s)
- Wonkeun Park
- USDA-ARS, Coastal Plains Soil, Water and Plant Research Center, Florence, SC, USA
- Philip J Bauer
- USDA-ARS, Coastal Plains Soil, Water and Plant Research Center, Florence, SC, USA
- B Todd Campbell
- USDA-ARS, Coastal Plains Soil, Water and Plant Research Center, Florence, SC, USA
40
|
Hand ML, Cogan NOI, Forster JW. Genome-wide SNP identification in multiple morphotypes of allohexaploid tall fescue (Festuca arundinacea Schreb). BMC Genomics 2012; 13:219. [PMID: 22672128 PMCID: PMC3444928 DOI: 10.1186/1471-2164-13-219] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Received: 02/15/2012] [Accepted: 05/15/2012] [Indexed: 01/29/2023]
Abstract
BACKGROUND: Single nucleotide polymorphisms (SNPs) provide essential tools for the advancement of research in plant genomics, and the development of SNP resources for many species has been accelerated by the capabilities of second-generation sequencing technologies. The current study aimed to develop and use a novel bioinformatic pipeline to generate a comprehensive collection of SNP markers within the agriculturally important pasture grass tall fescue; an outbreeding allopolyploid species displaying three distinct morphotypes: Continental, Mediterranean and rhizomatous.
RESULTS: A bioinformatic pipeline was developed that successfully identified SNPs within genotypes from distinct tall fescue morphotypes, following the sequencing of 414 polymerase chain reaction (PCR)-generated amplicons using 454 GS FLX technology. Equivalent amplicon sets were derived from representative genotypes of each morphotype, including six Continental, five Mediterranean and one rhizomatous. A total of 8,584 and 2,292 SNPs were identified with high confidence within the Continental and Mediterranean morphotypes respectively. The success of the bioinformatic approach was demonstrated through validation (at a rate of 70%) of a subset of 141 SNPs using both SNaPshot™ and GoldenGate™ assay chemistries. Furthermore, the quantitative genotyping capability of the GoldenGate™ assay revealed that approximately 30% of the putative SNPs were accessible to co-dominant scoring, despite the hexaploid genome structure. The sub-genome-specific origin of each SNP validated from Continental tall fescue was predicted using a phylogenetic approach based on comparison with orthologous sequences from predicted progenitor species.
CONCLUSIONS: Using the appropriate bioinformatic approach, amplicon resequencing based on 454 GS FLX technology is an effective method for the identification of polymorphic SNPs within the genomes of Continental and Mediterranean tall fescue. The GoldenGate™ assay is capable of high-throughput co-dominant SNP allele detection, and minimises the problems associated with SNP genotyping in a polyploid by effectively reducing the complexity to a diploid system. This SNP collection may now be refined and used in applications such as cultivar identification, genetic linkage map construction, genome-wide association studies and genomic selection in tall fescue. The bioinformatic pipeline described here represents an effective general method for SNP discovery within outbreeding allopolyploid species.
Affiliation(s)
- Melanie L Hand
- Department of Primary Industries, Biosciences Research Division, Victorian AgriBiosciences Centre, 1 Park Drive, La Trobe University Research and Development Park, Bundoora, VIC 3083, Australia
41
|
Byers RL, Harker DB, Yourstone SM, Maughan PJ, Udall JA. Development and mapping of SNP assays in allotetraploid cotton. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2012; 124:1201-14. [PMID: 22252442 PMCID: PMC3324690 DOI: 10.1007/s00122-011-1780-8] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Received: 08/03/2011] [Accepted: 12/22/2011] [Indexed: 05/06/2023]
Abstract
A narrow germplasm base and a complex allotetraploid genome have made the discovery of single nucleotide polymorphism (SNP) markers difficult in cotton (Gossypium hirsutum). To generate sequence for SNP discovery, we conducted a genome reduction experiment (EcoRI, BafI double digest, followed by adapter ligation, biotin-streptavidin purification, and agarose gel separation) on two accessions of G. hirsutum and two accessions of G. barbadense. From the genome reduction experiment, a total of 2.04 million genomic sequence reads were assembled into contigs with an N(50) of 508 bp and analyzed for SNPs. A previously generated assembly of expressed sequence tags (ESTs) provided an additional source for SNP discovery. Using highly conservative parameters (minimum coverage of 8× at each SNP and 20% minor allele frequency), a total of 11,834 and 1,679 non-genic SNPs were identified between accessions of G. hirsutum and G. barbadense in genome reduction assemblies, respectively. An additional 4,327 genic SNPs were also identified between accessions of G. hirsutum in the EST assembly. KBioscience KASPar assays were designed for a portion of the intra-specific G. hirsutum SNPs. From 704 non-genic and 348 genic markers developed, a total of 367 (267 non-genic, 100 genic) mapped in a segregating F(2) population (Acala Maxxa × TX2094) using the Fluidigm EP1 system. A G. hirsutum genetic linkage map of 1,688 cM was constructed based entirely on these new SNP markers. Of the genic-based SNPs, we were able to identify within which genome ('A' or 'D') each SNP resided using diploid species sequence data. Genetic maps generated by these newly identified markers are being used to locate quantitative, economically important regions within the cotton genome.
Affiliation(s)
- Robert L. Byers
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT 84602 USA
- David B. Harker
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT 84602 USA
- Scott M. Yourstone
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT 84602 USA
- Peter J. Maughan
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT 84602 USA
- Joshua A. Udall
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT 84602 USA
42
|
Gong L, Salmon A, Yoo MJ, Grupp KK, Wang Z, Paterson AH, Wendel JF. The cytonuclear dimension of allopolyploid evolution: an example from cotton using rubisco. Mol Biol Evol 2012; 29:3023-36. [PMID: 22490824 DOI: 10.1093/molbev/mss110] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Indexed: 12/25/2022]
Abstract
During allopolyploid speciation, two divergent nuclear genomes merge, yet only one (usually the maternal) of the two sets of progenitor organellar genomes is maintained. Rubisco (ribulose-1,5-bisphosphate carboxylase/oxygenase) is composed of nuclear-encoded small subunits (SSUs) and plastome-encoded large subunits (LSUs), providing an ideal system to explore the evolutionary process of cytonuclear accommodation. Here, we take initial steps in this direction, using Gossypium allopolyploids as our model. SSU copies from divergent (5-10 My) progenitor diploids ("A" and "D" genomes) were combined at the time of polyploid formation 1-2 Ma, with the LSU encoded by the maternal A-genome parent. LSU genes from A- and D-genome diploids and AD-genome allopolyploids were sequenced, revealing several nonsynonymous substitutions and suggesting the possibility of differential selection on the nuclear-encoded rbcS partner following allopolyploid formation. Sequence data for the rbcS gene family revealed nonreciprocal homoeologous recombination between A- and D-rbcS homoeologs in all polyploid species but not in a synthetic intergenomic F1 hybrid, demonstrating "gene conversion" during allopolyploid evolution. All progenitor rbcS genes are retained and expressed in the five extant allopolyploid species, but analysis of the leaf transcriptome showed that A-homoeologs are preferentially expressed in both the allopolyploid and hybrid, consistent with the maternal origin of rbcL. Although rbcS genes from both progenitor genomes are expressed, some appear to have experienced mutations that may represent cytonuclear coevolution.
Affiliation(s)
- Lei Gong
- Department of Ecology, Evolution and Organismal Biology, Iowa State University, IA, USA
43
|
Manickavelu A, Kawaura K, Oishi K, Shin-I T, Kohara Y, Yahiaoui N, Keller B, Abe R, Suzuki A, Nagayama T, Yano K, Ogihara Y. Comprehensive functional analyses of expressed sequence tags in common wheat (Triticum aestivum). DNA Res 2012; 19:165-77. [PMID: 22334568 PMCID: PMC3325080 DOI: 10.1093/dnares/dss001] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Indexed: 01/21/2023]
Abstract
About 1 million expressed sequence tag (EST) sequences comprising 125.3 Mb nucleotides were accreted from 51 cDNA libraries constructed from a variety of tissues and organs under a range of conditions, including abiotic stresses and pathogen challenges in common wheat (Triticum aestivum). Expressed sequence tags were assembled with stringent parameters after processing with in-built scripts, resulting in 37,138 contigs and 215,199 singlets. In the assembled sequences, 10.6% presented no matches with existing sequences in public databases. Functional characterization of wheat unigenes by gene ontology annotation, mining transcription factors, full-length cDNA, and miRNA targeting sites was carried out. A bioinformatics strategy was developed to discover single-nucleotide polymorphisms (SNPs) within our large EST resource, and SNPs between and within (homoeologous) cultivars were reported. Digital gene expression was performed to find the tissue-specific gene expression, and correspondence analysis was executed to identify common and specific gene expression by selecting four biotic stress-related libraries. The assembly and associated information provide a framework for future investigation in functional genomics.
Affiliation(s)
- Alagu Manickavelu
- Kihara Institute for Biological Research, Yokohama City University, Yokohama, Japan
44
|
Byers RL, Harker DB, Yourstone SM, Maughan PJ, Udall JA. Development and mapping of SNP assays in allotetraploid cotton. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2012. [PMID: 22252442 DOI: 10.1007/s00122-011-1780-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Indexed: 11/25/2022]
Affiliation(s)
- Robert L Byers
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT 84602, USA
45
|
Buyyarapu R, Kantety RV, Yu JZ, Saha S, Sharma GC. Development of New Candidate Gene and EST-Based Molecular Markers for Gossypium Species. INTERNATIONAL JOURNAL OF PLANT GENOMICS 2012; 2011:894598. [PMID: 22315588 PMCID: PMC3270397 DOI: 10.1155/2011/894598] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Received: 08/31/2011] [Revised: 10/26/2011] [Accepted: 11/09/2011] [Indexed: 05/31/2023]
Abstract
New sources of molecular markers accelerate efforts to improve cotton fiber traits and aid in developing high-density integrated genetic maps. We developed new markers based on candidate genes and G. arboreum EST sequences that were used for polymorphism detection followed by genetic and physical mapping. Nineteen gene-based markers were surveyed for polymorphism detection in 26 Gossypium species. Cluster analysis generated a phylogenetic tree with four major sub-clusters for 23 species while three species branched out individually. CAP method enhanced the rate of polymorphism of candidate gene-based markers between G. hirsutum and G. barbadense. Two hundred A-genome based SSR markers were designed after data mining of G. arboreum EST sequences (Mississippi Gossypium arboreum EST-SSR: MGAES). Over 70% of MGAES markers successfully produced amplicons, while 65 of them demonstrated polymorphism between the parents of a G. hirsutum and G. barbadense RIL population and formed 14 linkage groups. Chromosomal localization of both candidate gene-based and MGAES markers was assisted by euploid and hypoaneuploid CS-B analysis. Gene-based and MGAES markers were highly informative as they were designed from candidate genes and fiber transcriptome with a potential to be integrated into the existing cotton genetic and physical maps.
Affiliation(s)
- Ramesh Buyyarapu
- Center for Molecular Biology, Department of Natural Resources and Environmental Sciences, Alabama A&M University, 134 ARC Building, P.O. Box 1927, Normal, AL 35762, USA
- Ramesh V. Kantety
- Center for Molecular Biology, Department of Natural Resources and Environmental Sciences, Alabama A&M University, 134 ARC Building, P.O. Box 1927, Normal, AL 35762, USA
- John Z. Yu
- Southern Plains Agricultural Research Center, USDA-ARS, 2881 F&B Road, College Station, TX 77845, USA
- Sukumar Saha
- Genetics and Precision Agriculture Research Unit, USDA-ARS, P.O. Box 5367, MS 39762, USA
- Govind C. Sharma
- Center for Molecular Biology, Department of Natural Resources and Environmental Sciences, Alabama A&M University, 134 ARC Building, P.O. Box 1927, Normal, AL 35762, USA
46
|
Lai D, Li H, Fan S, Song M, Pang C, Wei H, Liu J, Wu D, Gong W, Yu S. Generation of ESTs for flowering gene discovery and SSR marker development in upland cotton. PLoS One 2011; 6:e28676. [PMID: 22163052 PMCID: PMC3232235 DOI: 10.1371/journal.pone.0028676] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Received: 06/06/2011] [Accepted: 11/13/2011] [Indexed: 11/18/2022]
Abstract
Background: Upland cotton, Gossypium hirsutum L., is one of the world's most important economic crops. In the absence of the entire genomic sequence, a large number of expressed sequence tag (EST) resources of upland cotton have been generated and used in several studies. However, information about the flower development of this species is rare.
Methodology/Principal Findings: To clarify the molecular mechanism of flower development in upland cotton, 22,915 high-quality ESTs were generated and assembled into 14,373 unique sequences consisting of 4,563 contigs and 9,810 singletons from a normalized and full-length cDNA library constructed from pooled RNA isolated from shoot apexes, squares, and flowers. Comparative analysis indicated that 5,352 unique sequences had no high-degree matches to the cotton public database. Functional annotation showed that several upland cotton homologs with flowering-related genes were identified in our library. The majority of these genes were specifically expressed in flowering-related tissues. Three GhSEP (G. hirsutum L. SEPALLATA) genes determining floral organ development were cloned, and quantitative real-time PCR (qRT-PCR) revealed that these genes were expressed preferentially in squares or flowers. Furthermore, 670 new putative microsatellites with flanking sequences sufficient for primer design were identified from the 645 unigenes. Twenty-five EST–simple sequence repeats were randomly selected for validation and transferability testing in 17 Gossypium species. Of these, 23 were identified as true-to-type simple sequence repeat loci and were highly transferable among Gossypium species.
Conclusions/Significance: A high-quality, normalized, full-length cDNA library with a total of 14,373 unique ESTs was generated to provide sequence information for gene discovery and marker development related to upland cotton flower development. These EST resources form a valuable foundation for gene expression profiling analysis, functional analysis of newly discovered genes, genetic linkage, and quantitative trait loci analysis.
Affiliation(s)
- Deyong Lai
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, People's Republic of China
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
- Huaizhu Li
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
- College of Agronomy, Northwest A&F University, Yangling, Shanxi, People's Republic of China
- Shuli Fan
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
- Meizhen Song
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
- Chaoyou Pang
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
- Hengling Wei
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
- College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, Zhejiang, People's Republic of China
- Junjie Liu
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
- Dong Wu
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
- Wenfang Gong
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
- Shuxun Yu
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
47
|
Xie F, Sun G, Stiller JW, Zhang B. Genome-wide functional analysis of the cotton transcriptome by creating an integrated EST database. PLoS One 2011; 6:e26980. [PMID: 22087239 PMCID: PMC3210780 DOI: 10.1371/journal.pone.0026980] [Citation(s) in RCA: 91] [Impact Index Per Article: 7.0] [Received: 07/06/2011] [Accepted: 10/07/2011] [Indexed: 12/20/2022]
Abstract
A total of 28,432 unique contigs (25,371 in consensus contigs and 3,061 as singletons) were assembled from all 268,786 cotton ESTs currently available. Several in silico approaches [comparative genomics, Blast, Gene Ontology (GO) analysis, and pathway enrichment by Kyoto Encyclopedia of Genes and Genomes (KEGG)] were employed to investigate global functions of the cotton transcriptome. Cotton EST contigs were clustered into 5,461 groups with a maximum cluster size of 196 members. A total of 27,956 indel mutants and 149,616 single nucleotide polymorphisms (SNPs) were identified from consensus contigs. Interestingly, many contigs with significantly high frequencies of indels or SNPs encode transcription factors and protein kinases. In a comparison with six model plant species, cotton ESTs show the highest overall similarity to grape. A total of 87 cotton miRNAs were identified; 59 of these have not been reported previously from experimental or bioinformatics investigations. We also predicted 3,260 genes as miRNAs targets, which are associated with multiple biological functions, including stress response, metabolism, hormone signal transduction and fiber development. We identified 151 and 4,214 EST-simple sequence repeats (SSRs) from contigs and raw ESTs respectively. To make these data widely available, and to facilitate access to EST-related genetic information, we integrated our results into a comprehensive, fully downloadable web-based cotton EST database (www.leonxie.com).
Affiliation(s)
- Fuliang Xie
- Department of Biology, East Carolina University, Greenville, North Carolina, United States of America
- Guiling Sun
- Department of Biology, East Carolina University, Greenville, North Carolina, United States of America
- John W. Stiller
- Department of Biology, East Carolina University, Greenville, North Carolina, United States of America
- Baohong Zhang
- Department of Biology, East Carolina University, Greenville, North Carolina, United States of America
48
|
Abstract
Allopolyploidy is an important process during plant evolution that results in the reunion of two divergent genomes into a common nucleus. Many of the immediate as well as longer-term genomic and epigenetic responses to polyploidy have become appreciated. To investigate the modifications of gene expression at the proteome level caused by allopolyploid formation, we conducted a comparative analysis of cotton seed proteomes from the allopolyploid Gossypium hirsutum (AD genome) and its model A-genome and D-genome diploid progenitors. An unexpectedly high level of divergence among the three proteomes was found, with about one-third of all protein forms being genome specific. Comparative analysis showed that there is a higher degree of proteomic similarity between the allopolyploid and its D-genome donor than its A-genome donor, reflecting a biased accumulation of seed proteins in the allopolyploid. Protein identification and genetic characterization of high-abundance proteins revealed that two classes of seed storage proteins, vicilins and legumins, compose the major component of cotton seed proteomes. Analyses further indicate differential regulation or modification of homoeologous gene products, as well as novel patterns in the polyploid proteome that may result from the interaction between homoeologous gene products. Our findings demonstrate that genomic merger and doubling have consequences that extend beyond the transcriptome into the realm of the proteome and that unequal expression of proteins from diploid parental genomes may occur in allopolyploids.
49
|
Generation, annotation and analysis of first large-scale expressed sequence tags from developing fiber of Gossypium barbadense L. PLoS One 2011; 6:e22758. [PMID: 21829504 PMCID: PMC3145671 DOI: 10.1371/journal.pone.0022758] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Received: 04/14/2011] [Accepted: 06/29/2011] [Indexed: 11/22/2022]
Abstract
Background: Cotton fiber is the world's leading natural fiber used in the manufacture of textiles. Gossypium is also the model plant in the study of polyploidization, evolution, cell elongation, cell wall development, and cellulose biosynthesis. G. barbadense L. is an ideal candidate for providing new genetic variations useful to improve fiber quality for its superior properties. However, little is known about fiber development mechanisms of G. barbadense and only a few molecular resources are available in GenBank.
Methodology and Principal Findings: In total, 10,979 high-quality expressed sequence tags (ESTs) were generated from a normalized fiber cDNA library of G. barbadense. The ESTs were clustered and assembled into 5852 unigenes, consisting of 1492 contigs and 4360 singletons. The blastx result showed 2165 unigenes with significant similarity to known genes and 2687 unigenes with significant similarity to genes of predicted proteins. Functional classification revealed that unigenes were abundant in the functions of binding, catalytic activity, and metabolic pathways of carbohydrate, amino acid, energy, and lipids. The function motif/domain-related cytoskeleton and redox homeostasis were enriched. Among the 5852 unigenes, 282 and 736 unigenes were identified as potential cell wall biosynthesis and transcription factors, respectively. Furthermore, the relationships among cotton species or between cotton and other model plant systems were analyzed. Some putative species-specific unigenes of G. barbadense were highlighted.
Conclusions/Significance: The ESTs generated in this study are from the first large-scale EST project for G. barbadense and significantly enhance the number of G. barbadense ESTs in public databases. This knowledge will contribute to cotton improvements by studying fiber development mechanisms of G. barbadense, establishing a breeding program using marker-assisted selection, and discovering candidate genes related to important agronomic traits of cotton through oligonucleotide array. Our work will also provide important resources for comparative genomics, polyploidization, and genome evolution among Gossypium species.
50
|
Levi A, Paterson AH, Cakmak I, Saranga Y. Metabolite and mineral analyses of cotton near-isogenic lines introgressed with QTLs for productivity and drought-related traits. PHYSIOLOGIA PLANTARUM 2011; 141:265-275. [PMID: 21143238 DOI: 10.1111/j.1399-3054.2010.01438.x] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Indexed: 05/28/2023]
Abstract
Quantitative trait loci (QTLs) for yield and drought-related traits were exchanged via marker-assisted selection between elite cultivars of two cotton species, Gossypium barbadense (GB) cv. F-177 and Gossypium hirsutum (GH) cv. Siv'on. Three of the resultant near-isogenic lines (NILs), each introgressed with a different QTL region, expressed an advantage in osmotic adjustment (OA) and other drought-related traits relative to their recipient parents. These NILs and the parental genotypes were field-grown under well-watered and water-limited conditions, and characterized for their metabolic and mineral compositions. Comparisons were then made between (1) GB and GH genotypes, (2) the contrasting water regimes and (3) each NIL and its recipient parent. Hierarchical clustering analysis clearly distinguished between GB and GH genotypes based on either metabolite or mineral composition. Comparisons between well-watered and water-limited conditions in each of the genotypes showed differing trends in the various solutes. The greater concentrations of potassium, magnesium and calcium under water stress, when compared with well-watered conditions, may have enhanced OA or osmoprotection. All NILs exhibited significantly modified solute composition relative to their recipient parents. In particular, increased levels of alanine, aspartic acid, citric acid, malic acid, glycerol, myoinositol, threonic acid, potassium, magnesium and calcium were found under drought conditions in one or more of the NILs relative to their recipient parents. The increased values of these solutes could contribute to the superior capacity of these NILs to cope with drought.
Affiliation(s)
- Avishag Levi
- The Robert H. Smith Institute of Plant Sciences and Genetics in Agriculture, The Hebrew University of Jerusalem, Rehovot, Israel