1
|
Huded AKC, Jingade P, Mishra MK, Ercisli S, Ilhan G, Marc RA, Vodnar D. Comparative genomic analysis and phylogeny of NAC25 gene from cultivated and wild Coffea species. FRONTIERS IN PLANT SCIENCE 2022; 13:1009733. [PMID: 36186041 PMCID: PMC9523601 DOI: 10.3389/fpls.2022.1009733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Accepted: 08/30/2022] [Indexed: 06/16/2023]
Abstract
Coffee is a high value agricultural commodity grown in about 80 countries. Sustainable coffee cultivation is hampered by multiple biotic and abiotic stress conditions predominantly driven by climate change. The NAC proteins are plants specific transcription factors associated with various physiological functions in plants which include cell division, secondary wall formation, formation of shoot apical meristem, leaf senescence, flowering embryo and seed development. Besides, they are also involved in biotic and abiotic stress regulation. Due to their ubiquitous influence, studies on NAC transcription factors have gained momentum in different crop plant species. In the present study, NAC25 like transcription factor was isolated and characterized from two cultivated coffee species, Coffea arabica and Coffea canephora and five Indian wild coffee species for the first time. The full-length NAC25 gene varied from 2,456 bp in Coffea jenkinsii to 2,493 bp in C. arabica. In all the seven coffee species, sequencing of the NAC25 gene revealed 3 exons and 2 introns. The NAC25 gene is characterized by a highly conserved 377 bp NAM domain (N-terminus) and a highly variable C terminus region. The sequence analysis revealed an average of one SNP per every 40.92 bp in the coding region and 37.7 bp in the intronic region. Further, the non-synonymous SNPs are 8-11 fold higher compared to synonymous SNPs in the non-coding and coding region of the NAC25 gene, respectively. The expression of NAC25 gene was studied in six different tissue types in C. canephora and higher expression levels were observed in leaf and flower tissues. Further, the relative expression of NAC25 in comparison with the GAPDH gene revealed four folds and eight folds increase in expression levels in green fruit and ripen fruit, respectively. The evolutionary relationship revealed the independent evolution of the NAC25 gene in coffee.
Collapse
Affiliation(s)
- Arun Kumar C. Huded
- Plant Biotechnology Division, Unit of Central Coffee Research Institute, Coffee Board, Mysore, Karnataka, India
| | - Pavankumar Jingade
- Plant Biotechnology Division, Unit of Central Coffee Research Institute, Coffee Board, Mysore, Karnataka, India
| | - Manoj Kumar Mishra
- Plant Biotechnology Division, Unit of Central Coffee Research Institute, Coffee Board, Mysore, Karnataka, India
| | - Sezai Ercisli
- Department of Horticulture, Faculty of Agriculture, Erzurum, Turkey
| | - Gulce Ilhan
- Department of Horticulture, Faculty of Agriculture, Erzurum, Turkey
| | - Romina Alina Marc
- Food Engineering Department, Faculty of Food Science and Technology, University of Agricultural Sciences and Veterinary Medicine, Cluj-Napoca, Romania
| | - Dan Vodnar
- Institute of Life Sciences, Faculty of Food Science and Technology, University of Agricultural Sciences and Veterinary Medicine, Cluj-Napoca, Romania
| |
Collapse
|
2
|
Ma Q, Wu M, Pei W, Wang X, Zhai H, Wang W, Li X, Zhang J, Yu J, Yu S. RNA-Seq-Mediated Transcriptome Analysis of a Fiberless Mutant Cotton and Its Possible Origin Based on SNP Markers. PLoS One 2016; 11:e0151994. [PMID: 26990639 PMCID: PMC4798417 DOI: 10.1371/journal.pone.0151994] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2016] [Accepted: 03/07/2016] [Indexed: 01/01/2023] Open
Abstract
As the longest known single-celled trichomes, cotton (Gossypium L.) fibers constitute a classic model system to investigate cell initiation and elongation. In this study, we used a high-throughput transcriptome sequencing technology to identify fiber-initiation-related single nucleotide polymorphism (SNP) markers and differentially expressed genes (DEGs) between the wild-type (WT) Upland cotton (G. hirsutum) Xuzhou 142 and its natural fuzzless-lintless mutant Xuzhou 142 fl. Approximately 700 million high-quality cDNA reads representing over 58 Gb of sequences were obtained, resulting in the identification of 28,610 SNPs--of which 17,479 were novel--from 13,960 expressed genes. Of these SNPs, 50% of SNPs in fl were identical to those of G. barbadense, which suggests the likely origin of the fl mutant from an interspecific hybridization between Xuzhou 142 and an unknown G. barbadense genotype. Of all detected SNPs, 15,555, 12,750, and 305 were classified as non-synonymous, synonymous, and pre-terminated ones, respectively. Moreover, 1,352 insertion/deletion polymorphisms (InDels) were also detected. A total of 865 DEGs were identified between the WT and fl in ovules at -3 and 0 days post-anthesis, with 302 candidate SNPs selected from these DEGs for validation by a high-resolution melting analysis and Sanger sequencing in seven cotton genotypes. The number of genotypic pairwise polymorphisms varied from 43 to 302, indicating that the identified SNPs are reliable. These SNPs should serve as good resources for breeding and genetic studies in cotton.
Collapse
Affiliation(s)
- Qifeng Ma
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of the Chinese Academy of Agricultural Sciences, Anyang, China
| | - Man Wu
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of the Chinese Academy of Agricultural Sciences, Anyang, China
| | - Wenfeng Pei
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of the Chinese Academy of Agricultural Sciences, Anyang, China
| | - Xiaoyan Wang
- College of Biology and Food Technology, Anyang Institute of Technology, Anyang, China
| | - Honghong Zhai
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of the Chinese Academy of Agricultural Sciences, Anyang, China
| | - Wenkui Wang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of the Chinese Academy of Agricultural Sciences, Anyang, China
| | - Xingli Li
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of the Chinese Academy of Agricultural Sciences, Anyang, China
| | - Jinfa Zhang
- Department of Plant and Environmental Sciences, New Mexico State University, Las Cruces, New Mexico, United States of America
| | - Jiwen Yu
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of the Chinese Academy of Agricultural Sciences, Anyang, China
| | - Shuxun Yu
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of the Chinese Academy of Agricultural Sciences, Anyang, China
| |
Collapse
|
3
|
Pereira L, Martins-Lopes P. Vitis vinifera L. Single-Nucleotide Polymorphism Detection with High-Resolution Melting Analysis Based on the UDP-Glucose:Flavonoid 3-O-Glucosyltransferase Gene. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2015; 63:9165-9174. [PMID: 26422991 DOI: 10.1021/acs.jafc.5b03463] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Vitis vinifera L. is a species with a large number of varieties, which differ in terms of anthocyanin content. The genes involved in the anthocyanin biosynthesis pathway have a direct effect in the anthocyanin profile of each variety, being potentially interesting for varietal identification. The current study aimed at the design of an assay suitable for the discrimination of the largest number of grapevine varieties. Two genes of the anthocyanin pathway, chalcone isomerase (CHI) and UDP-glucose:flavonoid 3-O-glucosyltransferase (UFGT), were sequenced in 22 grapevine varieties. The CHI gene presented 5 SNPs within the sequence. A total of 58 SNPs and 1 INDEL were found among the UFGT gene, allowing the discrimination of 18 different genotypes within the 22 grapevine varieties. A HRM assay designed for UFGT, containing 704 bp, produced differentiated melting curves for each of the 18 haplotypes. The developed HRM assay is efficient in grapevine varietal discrimination.
Collapse
Affiliation(s)
- Leonor Pereira
- University of Trás-os-Montes and Alto Douro , P.O. Box 1013, 5000-911 Vila Real, Portugal
- Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, University of Lisboa , Campo Grande, 1749-016 Lisboa, Portugal
| | - Paula Martins-Lopes
- University of Trás-os-Montes and Alto Douro , P.O. Box 1013, 5000-911 Vila Real, Portugal
- Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, University of Lisboa , Campo Grande, 1749-016 Lisboa, Portugal
| |
Collapse
|
4
|
Chen X, Jin X, Li X, Lin Z. Genetic mapping and comparative expression analysis of transcription factors in cotton. PLoS One 2015; 10:e0126150. [PMID: 25946129 PMCID: PMC4422734 DOI: 10.1371/journal.pone.0126150] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2015] [Accepted: 03/30/2015] [Indexed: 12/03/2022] Open
Abstract
Transcription factors (TFs) play an important role in the regulation of plant growth and development. The study of the structure and function of TFs represents a research frontier in plant molecular biology. The findings of these studies will provide significant information regarding genetic improvement traits in crops. Currently, a large number of TFs have been cloned, and their function has been verified. However, relatively few studies that genetically map TFs in cotton are available. To genetically map TFs in cotton in this study, specific primers were designed for TF genes that were published in the Plant Transcription Factor Database. A total of 977 TF primers were obtained, and 31 TF polymorphic loci were mapped on 15 cotton chromosomes. These polymorphic loci were clearly preferentially distributed on chromosomes 5, 11, 19 and 20; and TFs from the same family mapped to homologous cotton chromosomes. In-silico mapping verified that many mapped TFs were mapped on their corresponding chromosomes or their homologous chromosomes’ corresponding chromosomes in the diploid genomes. QTL mapping for fiber quality revealed that TF-Ghi005602-2 mapped on Chr19 was associated with fiber length. Eighty-five TF genes were selected for RT-PCR analysis, and 4 TFs were selected for qRT-PCR analysis, revealing unique expression patterns across different stages of fiber development between the mapping parents. Our data offer an overview of the chromosomal distribution of TFs in cotton, and the comparative expression analysis between Gossypium hirsutum and G. barbadense provides a rough understanding of the regulation of TFs during cotton fiber development.
Collapse
Affiliation(s)
- Xuemei Chen
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, Hubei, China
| | - Xin Jin
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, Hubei, China
| | - Ximei Li
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, Hubei, China
| | - Zhongxu Lin
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, Hubei, China
- * E-mail:
| |
Collapse
|
5
|
Li X, Gao W, Guo H, Zhang X, Fang DD, Lin Z. Development of EST-based SNP and InDel markers and their utilization in tetraploid cotton genetic mapping. BMC Genomics 2014; 15:1046. [PMID: 25442170 PMCID: PMC4265408 DOI: 10.1186/1471-2164-15-1046] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2014] [Accepted: 11/14/2014] [Indexed: 12/19/2022] Open
Abstract
Background Availability of molecular markers has proven to be an efficient tool in facilitating progress in plant breeding, which is particularly important in the case of less researched crops such as cotton. Considering the obvious advantages of single nucleotide polymorphisms (SNPs) and insertion-deletion polymorphisms (InDels), expressed sequence tags (ESTs) were analyzed in silico to identify SNPs and InDels in this study, aiming to develop more molecular markers in cotton. Results A total of 1,349 EST-based SNP and InDel markers were developed by comparing ESTs between Gossypium hirsutum and G. barbadense, mining G. hirsutum unigenes, and analyzing 3′ untranslated region (3′UTR) sequences. The marker polymorphisms were investigated using the two parents of the mapping population based on the single-strand conformation polymorphism (SSCP) analysis. Of all the markers, 137 (10.16%) were polymorphic, and revealed 142 loci. Linkage analysis using a BC1 population mapped 133 loci on the 26 chromosomes. Statistical analysis of base variations in SNPs showed that base transitions accounted for 55.78% of the total base variations and gene ontology indicated that cotton genes varied greatly in harboring SNPs ranging from 1.00 to 24.00 SNPs per gene. Sanger sequencing of three randomly selected SNP markers revealed discrepancy between the in silico predicted sequences and the actual sequencing results. Conclusions In silico analysis is a double-edged blade to develop EST-SNP/InDel markers. On the one hand, the designed markers can be well used in tetraploid cotton genetic mapping. And it plays a certain role in revealing transition preference and SNP frequency of cotton genes. On the other hand, the developmental efficiency of markers and polymorphism of designed primers are comparatively low. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-1046) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | - Zhongxu Lin
- National Key Laboratory of Crop Genetic Improvement & National Centre of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan 430070, Hubei, China.
| |
Collapse
|
6
|
Hulse-Kemp AM, Ashrafi H, Zheng X, Wang F, Hoegenauer KA, Maeda ABV, Yang SS, Stoffel K, Matvienko M, Clemons K, Udall JA, Van Deynze A, Jones DC, Stelly DM. Development and bin mapping of gene-associated interspecific SNPs for cotton (Gossypium hirsutum L.) introgression breeding efforts. BMC Genomics 2014. [PMID: 25359292 DOI: 10.1186/1471‐2164‐15‐945] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Cotton (Gossypium spp.) is the largest producer of natural fibers for textile and is an important crop worldwide. Crop production is comprised primarily of G. hirsutum L., an allotetraploid. However, elite cultivars express very small amounts of variation due to the species monophyletic origin, domestication and further bottlenecks due to selection. Conversely, wild cotton species harbor extensive genetic diversity of prospective utility to improve many beneficial agronomic traits, fiber characteristics, and resistance to disease and drought. Introgression of traits from wild species can provide a natural way to incorporate advantageous traits through breeding to generate higher-producing cotton cultivars and more sustainable production systems. Interspecific introgression efforts by conventional methods are very time-consuming and costly, but can be expedited using marker-assisted selection. RESULTS Using transcriptome sequencing we have developed the first gene-associated single nucleotide polymorphism (SNP) markers for wild cotton species G. tomentosum, G. mustelinum, G. armourianum and G. longicalyx. Markers were also developed for a secondary cultivated species G. barbadense cv. 3-79. A total of 62,832 non-redundant SNP markers were developed from the five wild species which can be utilized for interspecific germplasm introgression into cultivated G. hirsutum and are directly associated with genes. Over 500 of the G. barbadense markers have been validated by whole-genome radiation hybrid mapping. Overall 1,060 SNPs from the five different species have been screened and shown to produce acceptable genotyping assays. CONCLUSIONS This large set of 62,832 SNPs relative to cultivated G. hirsutum will allow for the first high-density mapping of genes from five wild species that affect traits of interest, including beneficial agronomic and fiber characteristics. Upon mapping, the markers can be utilized for marker-assisted introgression of new germplasm into cultivated cotton and in subsequent breeding of agronomically adapted types, including cultivar development.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | - David M Stelly
- Department of Soil and Crop Sciences, Texas A&M University, College Station, Texas, USA.
| |
Collapse
|
7
|
Hulse-Kemp AM, Ashrafi H, Zheng X, Wang F, Hoegenauer KA, Maeda ABV, Yang SS, Stoffel K, Matvienko M, Clemons K, Udall JA, Van Deynze A, Jones DC, Stelly DM. Development and bin mapping of gene-associated interspecific SNPs for cotton (Gossypium hirsutum L.) introgression breeding efforts. BMC Genomics 2014; 15:945. [PMID: 25359292 PMCID: PMC4298081 DOI: 10.1186/1471-2164-15-945] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2014] [Accepted: 10/03/2014] [Indexed: 11/18/2022] Open
Abstract
Background Cotton (Gossypium spp.) is the largest producer of natural fibers for textile and is an important crop worldwide. Crop production is comprised primarily of G. hirsutum L., an allotetraploid. However, elite cultivars express very small amounts of variation due to the species monophyletic origin, domestication and further bottlenecks due to selection. Conversely, wild cotton species harbor extensive genetic diversity of prospective utility to improve many beneficial agronomic traits, fiber characteristics, and resistance to disease and drought. Introgression of traits from wild species can provide a natural way to incorporate advantageous traits through breeding to generate higher-producing cotton cultivars and more sustainable production systems. Interspecific introgression efforts by conventional methods are very time-consuming and costly, but can be expedited using marker-assisted selection. Results Using transcriptome sequencing we have developed the first gene-associated single nucleotide polymorphism (SNP) markers for wild cotton species G. tomentosum, G. mustelinum, G. armourianum and G. longicalyx. Markers were also developed for a secondary cultivated species G. barbadense cv. 3–79. A total of 62,832 non-redundant SNP markers were developed from the five wild species which can be utilized for interspecific germplasm introgression into cultivated G. hirsutum and are directly associated with genes. Over 500 of the G. barbadense markers have been validated by whole-genome radiation hybrid mapping. Overall 1,060 SNPs from the five different species have been screened and shown to produce acceptable genotyping assays. Conclusions This large set of 62,832 SNPs relative to cultivated G. hirsutum will allow for the first high-density mapping of genes from five wild species that affect traits of interest, including beneficial agronomic and fiber characteristics. Upon mapping, the markers can be utilized for marker-assisted introgression of new germplasm into cultivated cotton and in subsequent breeding of agronomically adapted types, including cultivar development. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-945) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | - David M Stelly
- Department of Soil and Crop Sciences, Texas A&M University, College Station, Texas, USA.
| |
Collapse
|
8
|
Molecular markers and cotton genetic improvement: current status and future prospects. ScientificWorldJournal 2014; 2014:607091. [PMID: 25401149 PMCID: PMC4226190 DOI: 10.1155/2014/607091] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2014] [Accepted: 09/17/2014] [Indexed: 11/17/2022] Open
Abstract
Narrow genetic base and complex allotetraploid genome of cotton (Gossypium hirsutum L.) is stimulating efforts to avail required polymorphism for marker based breeding. The availability of draft genome sequence of G. raimondii and G. arboreum and next generation sequencing (NGS) technologies facilitated the development of high-throughput marker technologies in cotton. The concepts of genetic diversity, QTL mapping, and marker assisted selection (MAS) are evolving into more efficient concepts of linkage disequilibrium, association mapping, and genomic selection, respectively. The objective of the current review is to analyze the pace of evolution in the molecular marker technologies in cotton during the last ten years into the following four areas: (i) comparative analysis of low- and high-throughput marker technologies available in cotton, (ii) genetic diversity in the available wild and improved gene pools of cotton, (iii) identification of the genomic regions within cotton genome underlying economic traits, and (iv) marker based selection methodologies. Moreover, the applications of marker technologies to enhance the breeding efficiency in cotton are also summarized. Aforementioned genomic technologies and the integration of several other omics resources are expected to enhance the cotton productivity and meet the global fiber quantity and quality demands.
Collapse
|
9
|
Transcriptome and complexity-reduced, DNA-based identification of intraspecies single-nucleotide polymorphisms in the polyploid Gossypium hirsutum L. G3-GENES GENOMES GENETICS 2014; 4:1893-905. [PMID: 25106949 PMCID: PMC4199696 DOI: 10.1534/g3.114.012542] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Varietal single nucleotide polymorphisms (SNPs) are the differences within one of the two subgenomes between different tetraploid cotton varieties and have not been practically used in cotton genetics and breeding because they are difficult to identify due to low genetic diversity and very high sequence identity between homeologous genes in cotton. We have used transcriptome and restriction site-associated DNA sequencing to identify varietal SNPs among 18 G. hirsutum varieties based on the rationale that varietal SNPs can be more confidently called when flanked by subgenome-specific SNPs. Using transcriptome data, we successfully identified 37,413 varietal SNPs and, of these, 22,121 did not have an additional varietal SNP within their 20-bp flanking regions so can be used in most SNP genotyping assays. From restriction site-associated DNA sequencing data, we identified an additional 3090 varietal SNPs between two of the varieties. Of the 1583 successful SNP assays achieved using different genotyping platforms, 1363 were verified. Many of the SNPs behaved as dominant markers because of coamplification from homeologous loci, but the number of SNPs acting as codominant markers increased when one or more subgenome-specific SNP(s) were incorporated in their assay primers, giving them greater utility for breeding applications. A G. hirsutum genetic map with 1244 SNP markers was constructed covering 5557.42 centiMorgan and used to map qualitative and quantitative traits. This collection of G. hirsutum varietal SNPs complements existing intra-specific SNPs and provides the cotton community with a valuable marker resource applicable to genetic analyses and breeding programs.
Collapse
|
10
|
Molecular evolution and phylogenetic analysis of genes related to cotton fibers development from wild and domesticated cotton species in Gossypium. Mol Phylogenet Evol 2012; 63:589-97. [PMID: 22381639 DOI: 10.1016/j.ympev.2012.01.025] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2011] [Revised: 01/30/2012] [Accepted: 01/31/2012] [Indexed: 12/23/2022]
Abstract
The domestication of both diploid and tetraploid cotton species was carried out for fiber utilization. To understand the origin and domestication of fibers, 18 genes related to fiber development were individually cloned and sequenced from 22 different cotton species. Their structures, phylogenetic relationship and molecular evolution were further studied. In the orthologous and homeologous loci of the 18 genes, the sequence and structure of 72.22% were conserved and 27.78% were diverse. Tree topologies constructed based on the combined sequences showed that all 13 D-genome species were congruent with Fryxell's subsection taxonomy, the A- and D-subgenomes independently evolved in the allopolyploid after polyploid formation, and Gossypium raimondii had the closest relationship with all allotetraploids of D-subgenomes. The molecular evolutionary rates revealed approximately equivalent rates among different D-genome species, and purifying selection acted on all genes in the wild D-genome species. Among orthologs and homeologs, the D-subgenomes had higher evolutionary rates than the A-subgenomes in tetraploid cotton species, and the cultivars had higher evolutionary rates than either the semi-domesticated or wild species. Our study revealed that human domestication altered the molecular evolutionary pattern of genes related to fiber development, and Gossypium hirsutum endured greater selective pressures than Gossypium barbadense during the domestication process.
Collapse
|
11
|
Claverie M, Souquet M, Jean J, Forestier-Chiron N, Lepitre V, Pré M, Jacobs J, Llewellyn D, Lacape JM. cDNA-AFLP-based genetical genomics in cotton fibers. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2012; 124:665-683. [PMID: 22080217 DOI: 10.1007/s00122-011-1738-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/04/2010] [Accepted: 10/18/2011] [Indexed: 05/31/2023]
Abstract
Genetical genomics, or genetic analysis applied to gene expression data, has not been widely used in plants. We used quantitative cDNA-AFLP to monitor the variation in the expression level of cotton fiber transcripts among a population of inter-specific Gossypium hirsutum × G. barbadense recombinant inbred lines (RILs). Two key fiber developmental stages, elongation (10 days post anthesis, dpa), and secondary cell wall thickening (22 dpa), were studied. Normalized intensity ratios of 3,263 and 1,201 transcript-derived fragments (TDFs) segregating over 88 RILs were analyzed for quantitative trait loci (QTL) mapping for the 10 and 22 dpa fibers, respectively. Two-thirds of all TDFs mapped between 1 and 6 eQTLs (LOD > 3.5). Chromosome 21 had a higher density of eQTLs than other chromosomes in both data sets and, within chromosomes, hotspots of presumably trans-acting eQTLs were identified. The eQTL hotspots were compared to the location of phenotypic QTLs for fiber characteristics among the RILs, and several cases of co-localization were detected. Quantitative RT-PCR for 15 sequenced TDFs showed that 3 TDFs had at least one eQTL at a similar location to those identified by cDNA-AFLP, while 3 other TDFs mapped an eQTL at a similar location but with opposite additive effect. In conclusion, cDNA-AFLP proved to be a cost-effective and highly transferable platform for genome-wide and population-wide gene expression profiling. Because TDFs are anonymous, further validation and interpretation (in silico analysis, qPCR gene profiling) of the eQTL and eQTL hotspots will be facilitated by the increasing availability of cDNA and genomic sequence resources in cotton.
Collapse
Affiliation(s)
- Michel Claverie
- UMR AGAP, CIRAD, Avenue Agropolis, 34398, Montpellier, France
| | | | | | | | | | | | | | | | | |
Collapse
|
12
|
A high-density simple sequence repeat and single nucleotide polymorphism genetic map of the tetraploid cotton genome. G3-GENES GENOMES GENETICS 2012; 2:43-58. [PMID: 22384381 PMCID: PMC3276184 DOI: 10.1534/g3.111.001552] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/24/2011] [Accepted: 11/04/2011] [Indexed: 11/18/2022]
Abstract
Genetic linkage maps play fundamental roles in understanding genome structure, explaining genome formation events during evolution, and discovering the genetic bases of important traits. A high-density cotton (Gossypium spp.) genetic map was developed using representative sets of simple sequence repeat (SSR) and the first public set of single nucleotide polymorphism (SNP) markers to genotype 186 recombinant inbred lines (RILs) derived from an interspecific cross between Gossypium hirsutum L. (TM-1) and G. barbadense L. (3-79). The genetic map comprised 2072 loci (1825 SSRs and 247 SNPs) and covered 3380 centiMorgan (cM) of the cotton genome (AD) with an average marker interval of 1.63 cM. The allotetraploid cotton genome produced equivalent recombination frequencies in its two subgenomes (At and Dt). Of the 2072 loci, 1138 (54.9%) were mapped to 13 At-subgenome chromosomes, covering 1726.8 cM (51.1%), and 934 (45.1%) mapped to 13 Dt-subgenome chromosomes, covering 1653.1 cM (48.9%). The genetically smallest homeologous chromosome pair was Chr. 04 (A04) and 22 (D04), and the largest was Chr. 05 (A05) and 19 (D05). Duplicate loci between and within homeologous chromosomes were identified that facilitate investigations of chromosome translocations. The map augments evidence of reciprocal rearrangement between ancestral forms of Chr. 02 and 03 versus segmental homeologs 14 and 17 as centromeric regions show homeologous between Chr. 02 (A02) and 17 (D02), as well as between Chr. 03 (A03) and 14 (D03). This research represents an important foundation for studies on polyploid cottons, including germplasm characterization, gene discovery, and genome sequence assembly.
Collapse
|
13
|
Xie F, Sun G, Stiller JW, Zhang B. Genome-wide functional analysis of the cotton transcriptome by creating an integrated EST database. PLoS One 2011; 6:e26980. [PMID: 22087239 PMCID: PMC3210780 DOI: 10.1371/journal.pone.0026980] [Citation(s) in RCA: 91] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2011] [Accepted: 10/07/2011] [Indexed: 12/20/2022] Open
Abstract
A total of 28,432 unique contigs (25,371 in consensus contigs and 3,061 as singletons) were assembled from all 268,786 cotton ESTs currently available. Several in silico approaches [comparative genomics, Blast, Gene Ontology (GO) analysis, and pathway enrichment by Kyoto Encyclopedia of Genes and Genomes (KEGG)] were employed to investigate global functions of the cotton transcriptome. Cotton EST contigs were clustered into 5,461 groups with a maximum cluster size of 196 members. A total of 27,956 indel mutants and 149,616 single nucleotide polymorphisms (SNPs) were identified from consensus contigs. Interestingly, many contigs with significantly high frequencies of indels or SNPs encode transcription factors and protein kinases. In a comparison with six model plant species, cotton ESTs show the highest overall similarity to grape. A total of 87 cotton miRNAs were identified; 59 of these have not been reported previously from experimental or bioinformatics investigations. We also predicted 3,260 genes as miRNAs targets, which are associated with multiple biological functions, including stress response, metabolism, hormone signal transduction and fiber development. We identified 151 and 4,214 EST-simple sequence repeats (SSRs) from contigs and raw ESTs respectively. To make these data widely available, and to facilitate access to EST-related genetic information, we integrated our results into a comprehensive, fully downloadable web-based cotton EST database (www.leonxie.com).
Collapse
Affiliation(s)
- Fuliang Xie
- Department of Biology, East Carolina University, Greenville, North Carolina, United States of America
| | - Guiling Sun
- Department of Biology, East Carolina University, Greenville, North Carolina, United States of America
| | - John W. Stiller
- Department of Biology, East Carolina University, Greenville, North Carolina, United States of America
| | - Baohong Zhang
- Department of Biology, East Carolina University, Greenville, North Carolina, United States of America
- * E-mail:
| |
Collapse
|
14
|
Liu R, Wang B, Guo W, Wang L, Zhang T. Differential gene expression and associated QTL mapping for cotton yield based on a cDNA-AFLP transcriptome map in an immortalized F2. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2011; 123:439-54. [PMID: 21512772 DOI: 10.1007/s00122-011-1597-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2010] [Accepted: 04/05/2011] [Indexed: 05/12/2023]
Abstract
cDNA-AFLP techniques have found new applications in recent years. Currently, the methodology is used to establish differential gene expression and construct linkage maps. In the present study, a transcriptome map based on cDNA-AFLP techniques was constructed using an immortalized F(2) (IF(2)) population of 171 lines. The lines were derived from intercrosses between 180 recombinant inbred lines (RILs) of the cotton hybrid Xiangzamian 2 (Gossypium. hirsutum L.). A total of 302 transcriptome-derived fragments (TDFs) were mapped onto 26 linkage groups that covered 2,477.06 cM in length with an average distance of 8.23 cM between two markers. Seventy-one QTL for yield and yield component traits were detected by CIM procedures based on four environments, with 13 QTL identified in at least two environments. Some TDFs co-located with yield QTL were subsequently sequenced and analyzed by online homology searches. Potential candidate genes for yield and yield component traits were found to encode proteins involved in DNA replication, transcription, translation, and biosynthesis regulation. Furthermore, genes regulating metabolic processes signal transduction, transport, and structural components of organelles were identified. Correlation analysis between expression patterns of TDFs and trait performance detected six TDFs positively correlated to both yield and yield heterosis: six TDFs positively correlated to yield, and seven TDFs to yield heterosis. These TDFs have potential for cloning the functional genes responsible for each corresponding trait and have future value in marker-assisted selection.
Collapse
Affiliation(s)
- Renzhong Liu
- National Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Research Institute, Nanjing Agricultural University, Nanjing, China
| | | | | | | | | |
Collapse
|
15
|
Zhu H, Han X, Lv J, Zhao L, Xu X, Zhang T, Guo W. Structure, expression differentiation and evolution of duplicated fiber developmental genes in Gossypium barbadense and G. hirsutum. BMC PLANT BIOLOGY 2011; 11:40. [PMID: 21349199 PMCID: PMC3050799 DOI: 10.1186/1471-2229-11-40] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/26/2010] [Accepted: 02/25/2011] [Indexed: 05/08/2023]
Abstract
BACKGROUND Both Gossypium hirsutum and G. barbadense probably originated from a common ancestor, but they have very different agronomic and fiber quality characters. Here we selected 17 fiber development-related genes to study their structures, tree topologies, chromosomal location and expression patterns to better understand the interspecific divergence of fiber development genes in the two cultivated tetraploid species. RESULTS The sequence and structure of 70.59% genes were conserved with the same exon length and numbers in different species, while 29.41% genes showed diversity. There were 15 genes showing independent evolution between the A- and D-subgenomes after polyploid formation, while two evolved via different degrees of colonization. Chromosomal location showed that 22 duplicate genes were located in which at least one fiber quality QTL was detected. The molecular evolutionary rates suggested that the D-subgenome of the allotetraploid underwent rapid evolutionary differentiation, and selection had acted at the tetraploid level. Expression profiles at fiber initiation and early elongation showed that the transcripts levels of most genes were higher in Hai7124 than in TM-1. During the primary-secondary transition period, expression of most genes peaked earlier in TM-1 than in Hai7124. Homeolog expression profile showed that A-subgenome, or the combination of A- and D-subgenomes, played critical roles in fiber quality divergence of G. hirsutum and G. barbadense. However, the expression of D-subgenome alone also played an important role. CONCLUSION Integrating analysis of the structure and expression to fiber development genes, suggests selective breeding for certain desirable fiber qualities played an important role in divergence of G. hirsutum and G. barbadense.
Collapse
Affiliation(s)
- Huayu Zhu
- National Key Laboratory of Crop Genetics & Germplasm Enhancement, Cotton Research Institute, Nanjing Agricultural University, Nanjing 210095, China
| | - Xiaoyong Han
- National Key Laboratory of Crop Genetics & Germplasm Enhancement, Cotton Research Institute, Nanjing Agricultural University, Nanjing 210095, China
| | - Junhong Lv
- National Key Laboratory of Crop Genetics & Germplasm Enhancement, Cotton Research Institute, Nanjing Agricultural University, Nanjing 210095, China
| | - Liang Zhao
- National Key Laboratory of Crop Genetics & Germplasm Enhancement, Cotton Research Institute, Nanjing Agricultural University, Nanjing 210095, China
| | - Xiaoyang Xu
- National Key Laboratory of Crop Genetics & Germplasm Enhancement, Cotton Research Institute, Nanjing Agricultural University, Nanjing 210095, China
| | - Tianzhen Zhang
- National Key Laboratory of Crop Genetics & Germplasm Enhancement, Cotton Research Institute, Nanjing Agricultural University, Nanjing 210095, China
| | - Wangzhen Guo
- National Key Laboratory of Crop Genetics & Germplasm Enhancement, Cotton Research Institute, Nanjing Agricultural University, Nanjing 210095, China
| |
Collapse
|
16
|
Xu Z, Yu JZ, Cho J, Yu J, Kohel RJ, Percy RG. Polyploidization altered gene functions in cotton (Gossypium spp.). PLoS One 2010; 5:e14351. [PMID: 21179551 PMCID: PMC3002935 DOI: 10.1371/journal.pone.0014351] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2010] [Accepted: 11/04/2010] [Indexed: 11/19/2022] Open
Abstract
Cotton (Gossypium spp.) is an important crop plant that is widely grown to produce both natural textile fibers and cottonseed oil. Cotton fibers, the economically more important product of the cotton plant, are seed trichomes derived from individual cells of the epidermal layer of the seed coat. It has been known for a long time that large numbers of genes determine the development of cotton fiber, and more recently it has been determined that these genes are distributed across At and Dt subgenomes of tetraploid AD cottons. In the present study, the organization and evolution of the fiber development genes were investigated through the construction of an integrated genetic and physical map of fiber development genes whose functions have been verified and confirmed. A total of 535 cotton fiber development genes, including 103 fiber transcription factors, 259 fiber development genes, and 173 SSR-contained fiber ESTs, were analyzed at the subgenome level. A total of 499 fiber related contigs were selected and assembled. Together these contigs covered about 151 Mb in physical length, or about 6.7% of the tetraploid cotton genome. Among the 499 contigs, 397 were anchored onto individual chromosomes. Results from our studies on the distribution patterns of the fiber development genes and transcription factors between the At and Dt subgenomes showed that more transcription factors were from Dt subgenome than At, whereas more fiber development genes were from At subgenome than Dt. Combining our mapping results with previous reports that more fiber QTLs were mapped in Dt subgenome than At subgenome, the results suggested a new functional hypothesis for tetraploid cotton. After the merging of the two diploid Gossypium genomes, the At subgenome has provided most of the genes for fiber development, because it continues to function similar to its fiber producing diploid A genome ancestor. On the other hand, the Dt subgenome, with its non-fiber producing D genome ancestor, provides more transcription factors that regulate the expression of the fiber genes in the At subgenome. This hypothesis would explain previously published mapping results. At the same time, this integrated map of fiber development genes would provide a framework to clone individual full-length fiber genes, to elucidate the physiological mechanisms of the fiber differentiation, elongation, and maturation, and to systematically study the functional network of these genes that interact during the process of fiber development in the tetraploid cottons.
Collapse
Affiliation(s)
- Zhanyou Xu
- Crop Germplasm Research Unit, Southern Plains Agricultural Research Center, United States Department of Agriculture-Agricultural Research Service (USDA-ARS), College Station, Texas, United States of America
| | - John Z. Yu
- Crop Germplasm Research Unit, Southern Plains Agricultural Research Center, United States Department of Agriculture-Agricultural Research Service (USDA-ARS), College Station, Texas, United States of America
- * E-mail:
| | - Jaemin Cho
- Crop Germplasm Research Unit, Southern Plains Agricultural Research Center, United States Department of Agriculture-Agricultural Research Service (USDA-ARS), College Station, Texas, United States of America
| | - Jing Yu
- Crop Germplasm Research Unit, Southern Plains Agricultural Research Center, United States Department of Agriculture-Agricultural Research Service (USDA-ARS), College Station, Texas, United States of America
| | - Russell J. Kohel
- Crop Germplasm Research Unit, Southern Plains Agricultural Research Center, United States Department of Agriculture-Agricultural Research Service (USDA-ARS), College Station, Texas, United States of America
| | - Richard G. Percy
- Crop Germplasm Research Unit, Southern Plains Agricultural Research Center, United States Department of Agriculture-Agricultural Research Service (USDA-ARS), College Station, Texas, United States of America
| |
Collapse
|
17
|
Buriev ZT, Saha S, Abdurakhmonov IY, Jenkins JN, Abdukarimov A, Scheffler BE, Stelly DM. Clustering, haplotype diversity and locations of MIC-3: a unique root-specific defense-related gene family in Upland cotton (Gossypium hirsutum L.). TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2010; 120:587-606. [PMID: 19862497 DOI: 10.1007/s00122-009-1178-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2009] [Accepted: 09/30/2009] [Indexed: 02/05/2023]
Abstract
MIC-3 is a recently identified gene family shown to exhibit increased root-specific expression following nematode infection of cotton plants that are resistant to root-knot nematode. Here, we cloned and sequenced MIC-3 genes from selected diploid and tetraploid cotton species to reveal sequence differences at the molecular level and identify chromosomal locations of MIC-3 genes in Gossypium species. Detailed sequence analysis and phylogenetic clustering of MIC-3 genes indicated the presence of multiple MIC-3 gene members in Gossypium species. Haplotypes of a MIC-3 gene family member were discovered by comparative analysis among consensus sequences across genotypes within an individual clade in the phylogram to overcome the problem of duplicated loci in the tetraploid cotton. Deficiency tests of the SNPs delimited six A(t)-genome members of the MIC-3 family clustered to chromosome arm 4sh, and one D(t)-genome member to chromosome 19. Clustering was confirmed by long-PCR amplification of the intergenic regions using A(t)-genome-specific MIC-3 primer pairs. The clustered distribution may have been favored by selection for responsiveness to evolving disease and/or pest pressures, because large variants of the MIC-3 gene family may have been recovered from small physical areas by recombination. This could give a buffer against selection pressure from a broad range of pest and pathogens in the future. To our knowledge, these are the first results on the evolution of clustering and genome-specific haplotype members of a unique cotton gene family associated with resistant response against a major pathogen.
Collapse
Affiliation(s)
- Zabardast T Buriev
- Center of Genomic Technologies, Institute of Genetics and Plant Experimental Biology, Academy of Sciences of Uzbekistan, Yuqori Yuz, Qibray Region, 111226 Tashkent, Uzbekistan.
| | | | | | | | | | | | | |
Collapse
|
18
|
Van Deynze A, Stoffel K, Lee M, Wilkins TA, Kozik A, Cantrell RG, Yu JZ, Kohel RJ, Stelly DM. Sampling nucleotide diversity in cotton. BMC PLANT BIOLOGY 2009; 9:125. [PMID: 19840401 PMCID: PMC2771027 DOI: 10.1186/1471-2229-9-125] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2008] [Accepted: 10/20/2009] [Indexed: 05/19/2023]
Abstract
BACKGROUND Cultivated cotton is an annual fiber crop derived mainly from two perennial species, Gossypium hirsutum L. or upland cotton, and G. barbadense L., extra long-staple fiber Pima or Egyptian cotton. These two cultivated species are among five allotetraploid species presumably derived monophyletically between G. arboreum and G. raimondii. Genomic-based approaches have been hindered by the limited variation within species. Yet, population-based methods are being used for genome-wide introgression of novel alleles from G. mustelinum and G. tomentosum into G. hirsutum using combinations of backcrossing, selfing, and inter-mating. Recombinant inbred line populations between genetics standards TM-1, (G. hirsutum) x 3-79 (G. barbadense) have been developed to allow high-density genetic mapping of traits. RESULTS This paper describes a strategy to efficiently characterize genomic variation (SNPs and indels) within and among cotton species. Over 1000 SNPs from 270 loci and 279 indels from 92 loci segregating in G. hirsutum and G. barbadense were genotyped across a standard panel of 24 lines, 16 of which are elite cotton breeding lines and 8 mapping parents of populations from six cotton species. Over 200 loci were genetically mapped in a core mapping population derived from TM-1 and 3-79 and in G. hirsutum breeding germplasm. CONCLUSION In this research, SNP and indel diversity is characterized for 270 single-copy polymorphic loci in cotton. A strategy for SNP discovery is defined to pre-screen loci for copy number and polymorphism. Our data indicate that the A and D genomes in both diploid and tetraploid cotton remain distinct from each such that paralogs can be distinguished. This research provides mapped DNA markers for intra-specific crosses and introgression of exotic germplasm in cotton.
Collapse
Affiliation(s)
- Allen Van Deynze
- Seed Biotechnology Center, University of California, 1 Shields Ave, Davis, CA, USA
| | - Kevin Stoffel
- Seed Biotechnology Center, University of California, 1 Shields Ave, Davis, CA, USA
| | - Mike Lee
- Seed Biotechnology Center, University of California, 1 Shields Ave, Davis, CA, USA
| | - Thea A Wilkins
- Department of Plant and Soil Science, Texas Tech University, Experimental Sciences Building, Room 215, Mail Stop 3132, Lubbock, TX 79409-3132, USA
| | - Alexander Kozik
- Genome and Biomedical Sciences Facility, University of California, 1 Shields Ave, Davis, CA, USA
| | - Roy G Cantrell
- Monsanto, 1 800 N. Lindbergh Blvd, St Louis, MO 63167, USA
| | - John Z Yu
- USDA-ARS, Southern Plains Agricultural Research Center, 2881 F&B Road, College Station, TX 77845, USA
| | - Russel J Kohel
- USDA-ARS, Southern Plains Agricultural Research Center, 2881 F&B Road, College Station, TX 77845, USA
| | - David M Stelly
- Department of Soil and Crop Sciences, Texas A & M University, College Station, TX 77843, USA
| |
Collapse
|