1
|
Mei H, Zhao T, Dong Z, Han J, Xu B, Chen R, Zhang J, Zhang J, Hu Y, Zhang T, Fang L. Population-Scale Polymorphic Short Tandem Repeat Provides an Alternative Strategy for Allele Mining in Cotton. Front Plant Sci 2022; 13:916830. [PMID: 35599867 PMCID: PMC9120961 DOI: 10.3389/fpls.2022.916830] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/10/2022] [Accepted: 04/20/2022] [Indexed: 06/15/2023]
Abstract
Short tandem repeats (STRs), which vary in size due to featuring variable numbers of repeat units, are present throughout most eukaryotic genomes. To date, few population-scale studies identifying STRs have been reported for crops. Here, we constructed a high-density polymorphic STR map by investigating polymorphic STRs from 911 Gossypium hirsutum accessions. In total, we identified 556,426 polymorphic STRs with an average length of 21.1 bp, of which 69.08% were biallelic. Moreover, 7,718 (1.39%) were identified in the exons of 6,021 genes, which were significantly enriched in transcription, ribosome biogenesis, and signal transduction. Only 5.88% of those exonic STRs altered open reading frames, of which 97.16% were trinucleotide. An alternative strategy STR-GWAS analysis revealed that 824 STRs were significantly associated with agronomic traits, including 491 novel alleles that undetectable by previous SNP-GWAS methods. For instance, a novel polymorphic STR consisting of GAACCA repeats was identified in GH_D06G1697, with its (GAACCA)5 allele increasing fiber length by 1.96-4.83% relative to the (GAACCA)4 allele. The database CottonSTRDB was further developed to facilitate use of STR datasets in breeding programs. Our study provides functional roles for STRs in influencing complex traits, an alternative strategy STR-GWAS for allele mining, and a database serving the cotton community as a valuable resource.
Collapse
Affiliation(s)
- Huan Mei
- Zhejiang Provincial Key Laboratory of Crop Genetic Resources, Institute of Crop Science, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
| | - Ting Zhao
- Zhejiang Provincial Key Laboratory of Crop Genetic Resources, Institute of Crop Science, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
| | - Zeyu Dong
- Zhejiang Provincial Key Laboratory of Crop Genetic Resources, Institute of Crop Science, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
| | - Jin Han
- Zhejiang Provincial Key Laboratory of Crop Genetic Resources, Institute of Crop Science, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
| | - Biyu Xu
- Zhejiang Provincial Key Laboratory of Crop Genetic Resources, Institute of Crop Science, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
| | - Rui Chen
- Zhejiang Provincial Key Laboratory of Crop Genetic Resources, Institute of Crop Science, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
| | - Jun Zhang
- Zhejiang Provincial Key Laboratory of Crop Genetic Resources, Institute of Crop Science, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
| | - Juncheng Zhang
- Zhejiang Provincial Key Laboratory of Crop Genetic Resources, Institute of Crop Science, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
| | - Yan Hu
- Zhejiang Provincial Key Laboratory of Crop Genetic Resources, Institute of Crop Science, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Hainan Institute of Zhejiang University, Sanya, China
| | - Tianzhen Zhang
- Zhejiang Provincial Key Laboratory of Crop Genetic Resources, Institute of Crop Science, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Hainan Institute of Zhejiang University, Sanya, China
| | - Lei Fang
- Zhejiang Provincial Key Laboratory of Crop Genetic Resources, Institute of Crop Science, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Hainan Institute of Zhejiang University, Sanya, China
| |
Collapse
|
2
|
Kumar S, Singh A, Shanker A. pSATdb: a database of mitochondrial common, polymorphic, and unique microsatellites. Life Sci Alliance 2022; 5:5/6/e202101307. [PMID: 35181599 PMCID: PMC8860089 DOI: 10.26508/lsa.202101307] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Revised: 02/03/2022] [Accepted: 02/04/2022] [Indexed: 12/02/2022] Open
Abstract
The polymorphic microSATellites database (pSATdb) provides information on common, polymorphic, and unique mitochondrial microsatellites. Microsatellites, also termed as simple sequence repeats, are repetitive tracts in a DNA sequence, typically consisting of one to six nucleotides. These repeats are found in all genomes and play key roles in phylogeny and species identification. Microsatellites are highly polymorphic, and their length may differ from species to species. There are several online resources dedicated to mitochondria; however, comprehensive information is not available about the length variation of mitochondrial microsatellites. Therefore, to explore it between species among a genus, we have developed a database named pSATdb (polymorphic microSATellites database; https://lms.snu.edu.in/pSATdb/). pSATdb contains 28,710 perfect microsatellites identified across 5,976 mitochondrial genome (mt-genome) sequences from 1,576 genera which includes 1,535 (5,846 mt-genome) and 41 (130 mt-genome) genera of Metazoa and Viridiplantae, respectively. pSATdb is the only database which provides genus-wise information about the length variation of mitochondrial microsatellites. Because of the emerging role of microsatellites in genomics studies, the identified common, polymorphic, and unique microsatellites stored in pSATdb will be effectively useful in various studies including genetic diversity, mapping, marker-assisted selection, and comparative population studies.
Collapse
Affiliation(s)
- Sonu Kumar
- Department of Bioinformatics, Central University of South Bihar, Gaya, India
| | - Ashutosh Singh
- Translational Bioinformatics Lab, Department of Life Sciences, Shiv Nadar University, Greater Noida, India
| | - Asheesh Shanker
- Department of Bioinformatics, Central University of South Bihar, Gaya, India
| |
Collapse
|
3
|
Yu J, Jung S, Cheng CH, Lee T, Zheng P, Buble K, Crabb J, Humann J, Hough H, Jones D, Campbell JT, Udall J, Main D. CottonGen: The Community Database for Cotton Genomics, Genetics, and Breeding Research. Plants (Basel) 2021; 10:plants10122805. [PMID: 34961276 PMCID: PMC8705096 DOI: 10.3390/plants10122805] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Revised: 12/11/2021] [Accepted: 12/12/2021] [Indexed: 05/12/2023]
Abstract
Over the last eight years, the volume of whole genome, gene expression, SNP genotyping, and phenotype data generated by the cotton research community has exponentially increased. The efficient utilization/re-utilization of these complex and large datasets for knowledge discovery, translation, and application in crop improvement requires them to be curated, integrated with other types of data, and made available for access and analysis through efficient online search tools. Initiated in 2012, CottonGen is an online community database providing access to integrated peer-reviewed cotton genomic, genetic, and breeding data, and analysis tools. Used by cotton researchers worldwide, and managed by experts with crop-specific knowledge, it continuous to be the logical choice to integrate new data and provide necessary interfaces for information retrieval. The repository in CottonGen contains colleague, gene, genome, genotype, germplasm, map, marker, metabolite, phenotype, publication, QTL, species, transcriptome, and trait data curated by the CottonGen team. The number of data entries housed in CottonGen has increased dramatically, for example, since 2014 there has been an 18-fold increase in genes/mRNAs, a 23-fold increase in whole genomes, and a 372-fold increase in genotype data. New tools include a genetic map viewer, a genome browser, a synteny viewer, a metabolite pathways browser, sequence retrieval, BLAST, and a breeding information management system (BIMS), as well as various search pages for new data types. CottonGen serves as the home to the International Cotton Genome Initiative, managing its elections and serving as a communication and coordination hub for the community. With its extensive curation and integration of data and online tools, CottonGen will continue to facilitate utilization of its critical resources to empower research for cotton crop improvement.
Collapse
Affiliation(s)
- Jing Yu
- Department of Horticulture, Washington State University, Pullman, WA 99164, USA; (J.Y.); (S.J.); (C.-H.C.); (T.L.); (P.Z.); (K.B.); (J.C.); (J.H.); (H.H.)
| | - Sook Jung
- Department of Horticulture, Washington State University, Pullman, WA 99164, USA; (J.Y.); (S.J.); (C.-H.C.); (T.L.); (P.Z.); (K.B.); (J.C.); (J.H.); (H.H.)
| | - Chun-Huai Cheng
- Department of Horticulture, Washington State University, Pullman, WA 99164, USA; (J.Y.); (S.J.); (C.-H.C.); (T.L.); (P.Z.); (K.B.); (J.C.); (J.H.); (H.H.)
| | - Taein Lee
- Department of Horticulture, Washington State University, Pullman, WA 99164, USA; (J.Y.); (S.J.); (C.-H.C.); (T.L.); (P.Z.); (K.B.); (J.C.); (J.H.); (H.H.)
| | - Ping Zheng
- Department of Horticulture, Washington State University, Pullman, WA 99164, USA; (J.Y.); (S.J.); (C.-H.C.); (T.L.); (P.Z.); (K.B.); (J.C.); (J.H.); (H.H.)
| | - Katheryn Buble
- Department of Horticulture, Washington State University, Pullman, WA 99164, USA; (J.Y.); (S.J.); (C.-H.C.); (T.L.); (P.Z.); (K.B.); (J.C.); (J.H.); (H.H.)
| | - James Crabb
- Department of Horticulture, Washington State University, Pullman, WA 99164, USA; (J.Y.); (S.J.); (C.-H.C.); (T.L.); (P.Z.); (K.B.); (J.C.); (J.H.); (H.H.)
| | - Jodi Humann
- Department of Horticulture, Washington State University, Pullman, WA 99164, USA; (J.Y.); (S.J.); (C.-H.C.); (T.L.); (P.Z.); (K.B.); (J.C.); (J.H.); (H.H.)
| | - Heidi Hough
- Department of Horticulture, Washington State University, Pullman, WA 99164, USA; (J.Y.); (S.J.); (C.-H.C.); (T.L.); (P.Z.); (K.B.); (J.C.); (J.H.); (H.H.)
| | - Don Jones
- Cotton Incorporated, Cary, NC 27513, USA;
| | - J. Todd Campbell
- The Agricultural Research Service of U.S. Department of Agriculture, Florence, SC 29501, USA;
| | - Josh Udall
- The Agricultural Research Service of U.S. Department of Agriculture, College Station, TX 77845, USA;
| | - Dorrie Main
- Department of Horticulture, Washington State University, Pullman, WA 99164, USA; (J.Y.); (S.J.); (C.-H.C.); (T.L.); (P.Z.); (K.B.); (J.C.); (J.H.); (H.H.)
- Correspondence: ; Tel.: +1-509-335-2774
| |
Collapse
|
4
|
Zhang C, Li L, Liu Q, Gu L, Huang J, Wei H, Wang H, Yu S. Identification of Loci and Candidate Genes Responsible for Fiber Length in Upland Cotton ( Gossypium hirsutum L.) via Association Mapping and Linkage Analyses. Front Plant Sci 2019; 10:53. [PMID: 30804954 PMCID: PMC6370998 DOI: 10.3389/fpls.2019.00053] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/06/2018] [Accepted: 01/16/2019] [Indexed: 05/12/2023]
Abstract
Fiber length (FL) is an important fiber quality trait in cotton. Although many fiber quality quantitative trait loci (QTL) responsible for FL have been identified, most cannot be applied to breeding programs, mainly due to unstable environments or large confidence intervals. In this study, we combined a genome-wide association study (GWAS) and linkage mapping to identify and validate high-quality QTLs responsible for FL. For the GWAS, we developed 93,250 high-quality single-nucleotide polymorphism (SNP) markers based on 355 accessions, and the FL was measured in eight different environments. For the linkage mapping, we constructed an F 2 population from two extreme accessions. The high-density linkage maps spanned 3,848.29 cM, with an average marker interval of 1.41 cM. In total, 14 and 13 QTLs were identified in the association and linkage mapping analyses, respectively. Most importantly, a major QTL on chromosome D03 identified in both populations explained more than 10% of the phenotypic variation (PV). Furthermore, we found that a sucrose synthesis-related gene (Gh_D03G1338) was associated with FL in this QTL region. The RNA-seq data showed that Gh_D03G1338 was highly expressed during the fiber development stage, and the qRT-PCR analysis showed significant expression differences between the long fiber and short fiber varieties. These results suggest that Gh_D03G1338 may determine cotton fiber elongation by regulating the synthesis of sucrose. Favorable QTLs and candidate genes should be useful for increasing fiber quality in cotton breeding.
Collapse
Affiliation(s)
- Chi Zhang
- College of Agronomy, Northwest A&F University, Yangling, China
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, China
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Libei Li
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, China
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Qibao Liu
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, China
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Lijiao Gu
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Jianqin Huang
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, China
| | - Hengling Wei
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Hantao Wang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Shuxun Yu
- College of Agronomy, Northwest A&F University, Yangling, China
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, China
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
- *Correspondence: Shuxun Yu,
| |
Collapse
|
5
|
Hamid R, Tomar RS, Marashi H, Shafaroudi SM, Golakiya BA, Mohsenpour M. Transcriptome profiling and cataloging differential gene expression in floral buds of fertile and sterile lines of cotton (Gossypium hirsutum L.). Gene 2018; 660:80-91. [PMID: 29577977 DOI: 10.1016/j.gene.2018.03.070] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2017] [Revised: 03/14/2018] [Accepted: 03/21/2018] [Indexed: 01/02/2023]
Abstract
Cytoplasmic Male Sterility is maternally inherited trait in plants, characterized by failure to produce functional pollen during anther development. Anther development is modulated through the interaction of nuclear and mitochondrial genes. In the present study, differential gene expression of floral buds at the sporogenous stage (SS) and microsporocyte stage (MS) between CGMS and its fertile maintainer line of cotton plants was studied. A total of 320 significantly differentially expressed genes, including 20 down-regulated and 37 up-regulated in CGMS comparing with its maintainer line at the SS stage, as well as and 89 down-regulated and 4 up-regulated in CGMS compared to the fertile line at MS stage. Comparing the two stages in the same line, there were 6 down-regulated differentially expressed genes only induced in CGMS and 9 up-regulated differentially expressed gene only induced in its maintainer. GO analysis revealed essential genes responsible for pollen development, and cytoskeleton category show differential expression between the fertile and CGMS lines. Validation studies by qRT-PCR shows concordance with RNA-seq result. A set of novel SSRs identified in this study can be used in evaluating genetic relationships among cultivars, QTL mapping, and marker-assisted breeding. We reported aberrant expression of genes related to pollen exine formation, and synthesis of pectin lyase, myosine heavy chain, tubulin, actin-beta, heat shock protein and myeloblastosis (MYB) protein as targets for CMS in cotton. The results of this study contribute to basic information for future screening of genes and identification of molecular portraits responsible for CMS as well as to elucidate molecular mechanisms that lead to CMS in cotton.
Collapse
Affiliation(s)
- Rasmieh Hamid
- Department of Biotechnology and Plant Breeding, Ferdowsi University of Mashhad, Iran
| | - Rukam S Tomar
- Department of Biotechnology and Biochemistry, Junagadh Agricultural University, Junagadh, Gujarat, India
| | - Hassan Marashi
- Department of Biotechnology and Plant Breeding, Ferdowsi University of Mashhad, Iran.
| | | | - Balaji A Golakiya
- Department of Biotechnology and Biochemistry, Junagadh Agricultural University, Junagadh, Gujarat, India
| | | |
Collapse
|
6
|
Ulloa M, Hulse-Kemp AM, De Santiago LM, Stelly DM, Burke JJ. Insights Into Upland Cotton ( Gossypium hirsutum L.) Genetic Recombination Based on 3 High-Density Single-Nucleotide Polymorphism and a Consensus Map Developed Independently With Common Parents. Genomics Insights 2017; 10:1178631017735104. [PMID: 29308012 PMCID: PMC5751910 DOI: 10.1177/1178631017735104] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/16/2017] [Accepted: 09/10/2017] [Indexed: 11/17/2022]
Abstract
High-density linkage maps are vital to supporting the correct placement of scaffolds and gene sequences on chromosomes and fundamental to contemporary organismal research and scientific approaches to genetic improvement, especially in paleopolyploids with exceptionally complex genomes, eg, upland cotton (Gossypium hirsutum L., "2n = 52"). Three independently developed intraspecific upland mapping populations were analyzed to generate 3 high-density genetic linkage single-nucleotide polymorphism (SNP) maps and a consensus map using the CottonSNP63K array. The populations consisted of a previously reported F2, a recombinant inbred line (RIL), and reciprocal RIL population, from "Phytogen 72" and "Stoneville 474" cultivars. The cluster file provided 7417 genotyped SNP markers, resulting in 26 linkage groups corresponding to the 26 chromosomes (c) of the allotetraploid upland cotton (AD)1 arisen from the merging of 2 genomes ("A" Old World and "D" New World). Patterns of chromosome-specific recombination were largely consistent across mapping populations. The high-density genetic consensus map included 7244 SNP markers that spanned 3538 cM and comprised 3824 SNP bins, of which 1783 and 2041 were in the At and Dt subgenomes with 1825 and 1713 cM map lengths, respectively. Subgenome average distances were nearly identical, indicating that subgenomic differences in bin number arose due to the high numbers of SNPs on the Dt subgenome. Examination of expected recombination frequency or crossovers (COs) on the chromosomes within each population of the 2 subgenomes revealed that COs were also not affected by the SNPs or SNP bin number in these subgenomes. Comparative alignment analyses identified historical ancestral At-subgenomic translocations of c02 and c03, as well as of c04 and c05. The consensus map SNP sequences aligned with high congruency to the NBI assembly of Gossypium hirsutum. However, the genomic comparisons revealed evidence of additional unconfirmed possible duplications, inversions and translocations, and unbalance SNP sequence homology or SNP sequence/loci genomic dominance, or homeolog loci bias of the upland tetraploid At and Dt subgenomes. The alignments indicated that 364 SNP-associated previously unintegrated scaffolds can be placed in pseudochromosomes of the NBI G hirsutum assembly. This is the first intraspecific SNP genetic linkage consensus map assembled in G hirsutum with a core of reproducible mendelian SNP markers assayed on different populations and it provides further knowledge of chromosome arrangement of genic and nongenic SNPs. Together, the consensus map and RIL populations provide a synergistically useful platform for localizing and identifying agronomically important loci for improvement of the cotton crop.
Collapse
Affiliation(s)
- Mauricio Ulloa
- USDA-ARS, PA, Plant Stress and Germplasm Development Research Unit, Lubbock, TX, USA
| | - Amanda M Hulse-Kemp
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, USA
| | - Luis M De Santiago
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, USA
- Interdisciplinary Program in Genetics, Texas A&M University, College Station, TX, USA
| | - David M Stelly
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, USA
| | - John J Burke
- USDA-ARS, PA, Plant Stress and Germplasm Development Research Unit, Lubbock, TX, USA
| |
Collapse
|
7
|
Abstract
Recombination is crucial for genetic evolution, which not only provides new allele combinations but also influences the biological evolution and efficacy of natural selection. However, recombination variation is not well understood outside of the complex species’ genomes, and it is particularly unclear in Gossypium. Cotton is the most important natural fibre crop and the second largest oil-seed crop. Here, we found that the genetic and physical maps distances did not have a simple linear relationship. Recombination rates were unevenly distributed throughout the cotton genome, which showed marked changes along the chromosome lengths and recombination was completely suppressed in the centromeric regions. Recombination rates significantly varied between A-subgenome (At) (range = 1.60 to 3.26 centimorgan/megabase [cM/Mb]) and D-subgenome (Dt) (range = 2.17 to 4.97 cM/Mb), which explained why the genetic maps of At and Dt are similar but the physical map of Dt is only half that of At. The translocation regions between A02 and A03 and between A04 and A05, and the inversion regions on A10, D10, A07 and D07 indicated relatively high recombination rates in the distal regions of the chromosomes. Recombination rates were positively correlated with the densities of genes, markers and the distance from the centromere, and negatively correlated with transposable elements (TEs). The gene ontology (GO) categories showed that genes in high recombination regions may tend to response to environmental stimuli, and genes in low recombination regions are related to mitosis and meiosis, which suggested that they may provide the primary driving force in adaptive evolution and assure the stability of basic cell cycle in a rapidly changing environment. Global knowledge of recombination rates will facilitate genetics and breeding in cotton.
Collapse
Affiliation(s)
- Chao Shen
- National Key Laboratory of Crop Genetic Improvement, College of Plant Science & Technology, Huazhong Agricultural University, Wuhan, Hubei, China
| | - Ximei Li
- National Key Laboratory of Crop Genetic Improvement, College of Plant Science & Technology, Huazhong Agricultural University, Wuhan, Hubei, China
- College of Agronomy and Plant Protection, Qingdao Agricultural University/Shandong Key Laboratory of Dryland Farming Technology, Qingdao, Shandong, China
| | - Ruiting Zhang
- National Key Laboratory of Crop Genetic Improvement, College of Plant Science & Technology, Huazhong Agricultural University, Wuhan, Hubei, China
| | - Zhongxu Lin
- National Key Laboratory of Crop Genetic Improvement, College of Plant Science & Technology, Huazhong Agricultural University, Wuhan, Hubei, China
- * E-mail:
| |
Collapse
|
8
|
Saha D, Rana RS, Chakraborty S, Datta S, Kumar AA, Chakraborty AK, Karmakar PG. Development of a set of SSR markers for genetic polymorphism detection and interspecific hybrid jute breeding. ACTA ACUST UNITED AC 2017. [DOI: 10.1016/j.cj.2017.02.006] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
9
|
Kottapalli P, Ulloa M, Kottapalli KR, Payton P, Burke J. SNP Marker Discovery in Pima Cotton ( Gossypium barbadense L.) Leaf Transcriptomes. Genomics Insights 2016; 9:51-60. [PMID: 27721653 PMCID: PMC5049682 DOI: 10.4137/gei.s40377] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/10/2016] [Revised: 08/22/2016] [Accepted: 08/24/2016] [Indexed: 11/17/2022]
Abstract
The objective of this study was to explore the known narrow genetic diversity and discover single-nucleotide polymorphic (SNP) markers for marker-assisted breeding within Pima cotton (Gossypium barbadense L.) leaf transcriptomes. cDNA from 25-day plants of three diverse cotton genotypes [Pima S6 (PS6), Pima S7 (PS7), and Pima 3-79 (P3-79)] was sequenced on Illumina sequencing platform. A total of 28.9 million reads (average read length of 138 bp) were generated by sequencing cDNA libraries of these three genotypes. The de novo assembly of reads generated transcriptome sets of 26,369 contigs for PS6, 25,870 contigs for PS7, and 24,796 contigs for P3-79. A Pima leaf reference transcriptome was generated consisting of 42,695 contigs. More than 10,000 single-nucleotide polymorphisms (SNPs) were identified between the genotypes, with 100% SNP frequency and a minimum of eight sequencing reads. The most prevalent SNP substitutions were C-T and A-G in these cotton genotypes. The putative SNPs identified can be utilized for characterizing genetic diversity, genotyping, and eventually in Pima cotton breeding through marker-assisted selection.
Collapse
Affiliation(s)
- Pratibha Kottapalli
- Center for Biotechnology and Genomics, Texas Tech University, Lubbock, TX, USA
| | - Mauricio Ulloa
- USDA-ARS, PA, CSRL, Plant Stress and Germplasm Development Research, Lubbock, TX, USA
| | | | - Paxton Payton
- USDA-ARS, PA, CSRL, Plant Stress and Germplasm Development Research, Lubbock, TX, USA
| | - John Burke
- USDA-ARS, PA, CSRL, Plant Stress and Germplasm Development Research, Lubbock, TX, USA
| |
Collapse
|
10
|
Deng P, Wang M, Feng K, Cui L, Tong W, Song W, Nie X. Genome-wide characterization of microsatellites in Triticeae species: abundance, distribution and evolution. Sci Rep 2016; 6:32224. [PMID: 27561724 DOI: 10.1038/srep32224] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2016] [Accepted: 07/20/2016] [Indexed: 11/08/2022] Open
Abstract
Microsatellites are an important constituent of plant genome and distributed across entire genome. In this study, genome-wide analysis of microsatellites in 8 Triticeae species and 9 model plants revealed that microsatellite characteristics were similar among the Triticeae species. Furthermore, genome-wide microsatellite markers were designed in wheat and then used to analyze the evolutionary relationship of wheat and other Triticeae species. Results displayed that Aegilops tauschii was found to be the closest species to Triticum aestivum, followed by Triticum urartu, Triticum turgidum and Aegilops speltoides, while Triticum monococcum, Aegilops sharonensis and Hordeum vulgare showed a relatively lower PCR amplification effectivity. Additionally, a significantly higher PCR amplification effectivity was found in chromosomes at the same subgenome than its homoeologous when these markers were subjected to search against different chromosomes in wheat. After a rigorous screening process, a total of 20,666 markers showed high amplification and polymorphic potential in wheat and its relatives, which were integrated with the public available wheat markers and then anchored to the genome of wheat (CS). This study not only provided the useful resource for SSR markers development in Triticeae species, but also shed light on the evolution of polyploid wheat from the perspective of microsatellites.
Collapse
|
11
|
Egamberdiev SS, Saha S, Salakhutdinov I, Jenkins JN, Deng D, Y Abdurakhmonov I. Comparative assessment of genetic diversity in cytoplasmic and nuclear genome of upland cotton. Genetica 2016; 144:289-306. [PMID: 27155886 DOI: 10.1007/s10709-016-9898-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2015] [Accepted: 04/07/2016] [Indexed: 02/05/2023]
Abstract
The importance of the cytoplasmic genome for many economically important traits is well documented in several crop species, including cotton. There is no report on application of cotton chloroplast specific SSR markers as a diagnostic tool to study genetic diversity among improved Upland cotton lines. The complete plastome sequence information in GenBank provided us an opportunity to report on 17 chloroplast specific SSR markers using a cost-effective data mining strategy. Here we report the comparative analysis of genetic diversity among a set of 42 improved Upland cotton lines using SSR markers specific to chloroplast and nuclear genome, respectively. Our results revealed that low to moderate level of genetic diversity existed in both nuclear and cytoplasm genome among this set of cotton lines. However, the specific estimation suggested that genetic diversity is lower in cytoplasmic genome compared to the nuclear genome among this set of Upland cotton lines. In summary, this research is important from several perspectives. We detected a set of cytoplasm genome specific SSR primer pairs by using a cost-effective data mining strategy. We reported for the first time the genetic diversity in the cytoplasmic genome within a set of improved Upland cotton accessions. Results revealed that the genetic diversity in cytoplasmic genome is narrow, compared to the nuclear genome within this set of Upland cotton accessions. Our results suggested that most of these polymorphic chloroplast SSRs would be a valuable complementary tool in addition to the nuclear SSR in the study of evolution, gene flow and genetic diversity in Upland cotton.
Collapse
Affiliation(s)
- Sharof S Egamberdiev
- Center of Genomics and Bioinformatics, Academy of Sciences of Uzbekistan, Tashkent, Uzbekistan, 111215
| | - Sukumar Saha
- Crop Science Research Laboratory, Genetics and Sustainable Agriculture Research Unit, USDA-ARS, Mississippi State, MS, 39762, USA.
| | - Ilkhom Salakhutdinov
- Center of Genomics and Bioinformatics, Academy of Sciences of Uzbekistan, Tashkent, Uzbekistan, 111215
| | - Johnie N Jenkins
- Crop Science Research Laboratory, Genetics and Sustainable Agriculture Research Unit, USDA-ARS, Mississippi State, MS, 39762, USA
| | - Dewayne Deng
- Crop Science Research Laboratory, Genetics and Sustainable Agriculture Research Unit, USDA-ARS, Mississippi State, MS, 39762, USA
| | - Ibrokhim Y Abdurakhmonov
- Center of Genomics and Bioinformatics, Academy of Sciences of Uzbekistan, Tashkent, Uzbekistan, 111215
| |
Collapse
|
12
|
Jung S, Lee T, Ficklin S, Yu J, Cheng CH, Main D. Chado use case: storing genomic, genetic and breeding data of Rosaceae and Gossypium crops in Chado. Database (Oxford) 2016; 2016:baw010. [PMID: 26989146 PMCID: PMC4795932 DOI: 10.1093/database/baw010] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/25/2015] [Accepted: 01/21/2016] [Indexed: 11/12/2022]
Abstract
The Genome Database for Rosaceae (GDR) and CottonGen are comprehensive online data repositories that provide access to integrated genomic, genetic and breeding data through search, visualization and analysis tools for Rosaceae crops and Gossypium (cotton). These online databases use Chado, an open-source, generic and ontology-driven database schema for biological data, as the primary data storage platform. Chado is highly normalized and uses ontologies to indicate the 'types' of data. Therefore, Chado is flexible such that it has been used to house genomic, genetic and breeding data for GDR and CottonGen. These data include whole genome sequence and annotation, transcripts, molecular markers, genetic maps, Quantitative Trait Loci, Mendelian Trait Loci, traits, germplasm, pedigrees, large scale phenotypic and genotypic data, ontologies and publications. We provide information about how to store these types of data in Chado using GDR and CottonGen as examples sites that were converted from an older legacy infrastructure. Database URL: GDR (www.rosaceae.org), CottonGen (www.cottongen.org).
Collapse
Affiliation(s)
- Sook Jung
- Department of Horticulture, Washington State University Pullman, WA, USA
| | - Taein Lee
- Department of Horticulture, Washington State University Pullman, WA, USA
| | - Stephen Ficklin
- Department of Horticulture, Washington State University Pullman, WA, USA
| | - Jing Yu
- Department of Horticulture, Washington State University Pullman, WA, USA
| | - Chun-Huai Cheng
- Department of Horticulture, Washington State University Pullman, WA, USA
| | - Dorrie Main
- Department of Horticulture, Washington State University Pullman, WA, USA
| |
Collapse
|
13
|
Zhang J, Yu J, Pei W, Li X, Said J, Song M, Sanogo S. Genetic analysis of Verticillium wilt resistance in a backcross inbred line population and a meta-analysis of quantitative trait loci for disease resistance in cotton. BMC Genomics 2015; 16:577. [PMID: 26239843 PMCID: PMC4524102 DOI: 10.1186/s12864-015-1682-2] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2015] [Accepted: 06/01/2015] [Indexed: 12/12/2022] Open
Abstract
Background Verticillium wilt (VW) and Fusarium wilt (FW), caused by the soil-borne fungi Verticillium dahliae and Fusarium oxysporum f. sp. vasinfectum, respectively, are two most destructive diseases in cotton production worldwide. Root-knot nematodes (Meloidogyne incognita, RKN) and reniform nematodes (Rotylenchulus reniformis, RN) cause the highest yield loss in the U.S. Planting disease resistant cultivars is the most cost effective control method. Numerous studies have reported mapping of quantitative trait loci (QTLs) for disease resistance in cotton; however, very few reliable QTLs were identified for use in genomic research and breeding. Results This study first performed a 4-year replicated test of a backcross inbred line (BIL) population for VW resistance, and 10 resistance QTLs were mapped based on a 2895 cM linkage map with 392 SSR markers. The 10 VW QTLs were then placed to a consensus linkage map with other 182 VW QTLs, 75 RKN QTLs, 27 FW QTLs, and 7 RN QTLs reported from 32 publications. A meta-analysis of QTLs identified 28 QTL clusters including 13, 8 and 3 QTL hotspots for resistance to VW, RKN and FW, respectively. The number of QTLs and QTL clusters on chromosomes especially in the A-subgenome was significantly correlated with the number of nucleotide-binding site (NBS) genes, and the distribution of QTLs between homeologous A- and D- subgenome chromosomes was also significantly correlated. Conclusions Ten VW resistance QTL identified in a 4-year replicated study have added useful information to the understanding of the genetic basis of VW resistance in cotton. Twenty-eight disease resistance QTL clusters and 24 hotspots identified from a total of 306 QTLs and linked SSR markers provide important information for marker-assisted selection and high resolution mapping of resistance QTLs and genes. The non-overlapping of most resistance QTL hotspots for different diseases indicates that their resistances are controlled by different genes. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1682-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jinfa Zhang
- Department of Plant and Environmental Sciences, New Mexico State University, Las Cruces, NM, 88003, USA.
| | - Jiwen Yu
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of China, Chinese Academy of Agricultural Science, Anyang, Henan, 455000, China.
| | - Wenfeng Pei
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of China, Chinese Academy of Agricultural Science, Anyang, Henan, 455000, China.
| | - Xingli Li
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of China, Chinese Academy of Agricultural Science, Anyang, Henan, 455000, China.
| | - Joseph Said
- Department of Plant and Environmental Sciences, New Mexico State University, Las Cruces, NM, 88003, USA.
| | - Mingzhou Song
- Department of Computer Science, New Mexico State University, Las Cruces, NM, 88003, USA.
| | - Soum Sanogo
- Department of Entomology, Plant Pathology and Weed Science, New Mexico State University, Las Cruces, NM, 88003, USA.
| |
Collapse
|
14
|
Wang Q, Fang L, Chen J, Hu Y, Si Z, Wang S, Chang L, Guo W, Zhang T. Genome-wide mining, characterization, and development of microsatellite markers in gossypium species. Sci Rep 2015; 5:10638. [PMID: 26030481 PMCID: PMC4650602 DOI: 10.1038/srep10638] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2014] [Accepted: 05/01/2015] [Indexed: 12/02/2022] Open
Abstract
Although much research has been conducted to characterize microsatellites and develop markers, the distribution of microsatellites remains ambiguous and the use of microsatellite markers in genomic studies and marker-assisted selection is limited. To identify microsatellites for cotton research, we mined 100,290, 83,160, and 56,937 microsatellites with frequencies of 41.2, 49.1, and 74.8 microsatellites per Mb in the recently sequenced Gossypium species: G. hirsutum, G. arboreum, and G. raimondii, respectively. The distributions of microsatellites in their genomes were non-random and were positively and negatively correlated with genes and transposable elements, respectively. Of the 77,996 developed microsatellite markers, 65,498 were physically anchored to the 26 chromosomes of G. hirsutum with an average marker density of 34 markers per Mb. We confirmed 67,880 (87%) universal and 7,705 (9.9%) new genic microsatellite markers. The polymorphism was estimated in above three species by in silico PCR and validated with 505 markers in G. hirsutum. We further predicted 8,825 polymorphic microsatellite markers within G. hirsutum acc. TM-1 and G. barbadense cv. Hai7124. In our study, genome-wide mining and characterization of microsatellites, and marker development were very useful for the saturation of the allotetraploid genetic linkage map, genome evolution studies and comparative genome mapping.
Collapse
Affiliation(s)
- Qiong Wang
- 1] State Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Hybrid R &D Engineering Center (the Ministry of Education), Nanjing Agricultural University, Nanjing 210095, China [2]
| | - Lei Fang
- 1] State Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Hybrid R &D Engineering Center (the Ministry of Education), Nanjing Agricultural University, Nanjing 210095, China [2]
| | - Jiedan Chen
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Hybrid R &D Engineering Center (the Ministry of Education), Nanjing Agricultural University, Nanjing 210095, China
| | - Yan Hu
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Hybrid R &D Engineering Center (the Ministry of Education), Nanjing Agricultural University, Nanjing 210095, China
| | - Zhanfeng Si
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Hybrid R &D Engineering Center (the Ministry of Education), Nanjing Agricultural University, Nanjing 210095, China
| | - Sen Wang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Hybrid R &D Engineering Center (the Ministry of Education), Nanjing Agricultural University, Nanjing 210095, China
| | - Lijing Chang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Hybrid R &D Engineering Center (the Ministry of Education), Nanjing Agricultural University, Nanjing 210095, China
| | - Wangzhen Guo
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Hybrid R &D Engineering Center (the Ministry of Education), Nanjing Agricultural University, Nanjing 210095, China
| | - Tianzhen Zhang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Hybrid R &D Engineering Center (the Ministry of Education), Nanjing Agricultural University, Nanjing 210095, China
| |
Collapse
|
15
|
Zhang L, Guo J, You Q, Yi X, Ling Y, Xu W, Hua J, Su Z. GraP: platform for functional genomics analysis of Gossypium raimondii. Database (Oxford) 2015; 2015:bav047. [PMID: 25982315 PMCID: PMC4433718 DOI: 10.1093/database/bav047] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/25/2014] [Accepted: 04/21/2015] [Indexed: 12/22/2022]
Abstract
Cotton (Gossypium spp.) is one of the most important natural fiber and oil crops worldwide. Improvement of fiber yield and quality under changing environments attract much attention from cotton researchers; however, a functional analysis platform integrating omics data is still missing. The success of cotton genome sequencing and large amount of available transcriptome data allows the opportunity to establish a comprehensive analysis platform for integrating these data and related information. A comprehensive database, Platform of Functional Genomics Analysis in Gossypium raimondii (GraP), was constructed to provide multi-dimensional analysis, integration and visualization tools. GraP includes updated functional annotation, gene family classifications, protein–protein interaction networks, co-expression networks and microRNA–target pairs. Moreover, gene set enrichment analysis and cis-element significance analysis tools are also provided for gene batch analysis of high-throughput data sets. Based on these effective services, GraP may offer further information for subsequent studies of functional genes and in-depth analysis of high-throughput data. GraP is publically accessible at http://structuralbiology.cau.edu.cn/GraP/, with all data available for downloading.
Collapse
Affiliation(s)
- Liwei Zhang
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China and College of Agriculture and Biotechnology, China Agricultural University, Beijing 100193, China
| | - Jinyan Guo
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China and College of Agriculture and Biotechnology, China Agricultural University, Beijing 100193, China State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China and College of Agriculture and Biotechnology, China Agricultural University, Beijing 100193, China
| | - Qi You
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China and College of Agriculture and Biotechnology, China Agricultural University, Beijing 100193, China
| | - Xin Yi
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China and College of Agriculture and Biotechnology, China Agricultural University, Beijing 100193, China
| | - Yi Ling
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China and College of Agriculture and Biotechnology, China Agricultural University, Beijing 100193, China
| | - Wenying Xu
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China and College of Agriculture and Biotechnology, China Agricultural University, Beijing 100193, China
| | - Jinping Hua
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China and College of Agriculture and Biotechnology, China Agricultural University, Beijing 100193, China
| | - Zhen Su
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China and College of Agriculture and Biotechnology, China Agricultural University, Beijing 100193, China
| |
Collapse
|
16
|
Said JI, Knapka JA, Song M, Zhang J. Cotton QTLdb: a cotton QTL database for QTL analysis, visualization, and comparison between Gossypium hirsutum and G. hirsutum × G. barbadense populations. Mol Genet Genomics 2015; 290:1615-25. [PMID: 25758743 DOI: 10.1007/s00438-015-1021-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
KEY MESSAGE A specialized database currently containing more than 2200 QTL is established, which allows graphic presentation, visualization and submission of QTL. In cotton quantitative trait loci (QTL), studies are focused on intraspecific Gossypium hirsutum and interspecific G. hirsutum × G. barbadense populations. These two populations are commercially important for the textile industry and are evaluated for fiber quality, yield, seed quality, resistance, physiological, and morphological trait QTL. With meta-analysis data based on the vast amount of QTL studies in cotton it will be beneficial to organize the data into a functional database for the cotton community. Here we provide a tool for cotton researchers to visualize previously identified QTL and submit their own QTL to the Cotton QTLdb database. The database provides the user with the option of selecting various QTL trait types from either the G. hirsutum or G. hirsutum × G. barbadense populations. Based on the user's QTL trait selection, graphical representations of chromosomes of the population selected are displayed in publication ready images. The database also provides users with trait information on QTL, LOD scores, and explained phenotypic variances for all QTL selected. The CottonQTLdb database provides cotton geneticist and breeders with statistical data on cotton QTL previously identified and provides a visualization tool to view QTL positions on chromosomes. Currently the database (Release 1) contains 2274 QTLs, and succeeding QTL studies will be updated regularly by the curators and members of the cotton community that contribute their data to keep the database current. The database is accessible from http://www.cottonqtldb.org.
Collapse
|
17
|
Said JI, Knapka JA, Song M, Zhang J. Cotton QTLdb: a cotton QTL database for QTL analysis, visualization, and comparison between Gossypium hirsutum and G. hirsutum × G. barbadense populations. Mol Genet Genomics 2015; 290:1615-25. [PMID: 25758743 DOI: 10.1007/s00438-015-1021-y] [Citation(s) in RCA: 83] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2014] [Accepted: 02/24/2015] [Indexed: 11/29/2022]
Abstract
KEY MESSAGE A specialized database currently containing more than 2200 QTL is established, which allows graphic presentation, visualization and submission of QTL. In cotton quantitative trait loci (QTL), studies are focused on intraspecific Gossypium hirsutum and interspecific G. hirsutum × G. barbadense populations. These two populations are commercially important for the textile industry and are evaluated for fiber quality, yield, seed quality, resistance, physiological, and morphological trait QTL. With meta-analysis data based on the vast amount of QTL studies in cotton it will be beneficial to organize the data into a functional database for the cotton community. Here we provide a tool for cotton researchers to visualize previously identified QTL and submit their own QTL to the Cotton QTLdb database. The database provides the user with the option of selecting various QTL trait types from either the G. hirsutum or G. hirsutum × G. barbadense populations. Based on the user's QTL trait selection, graphical representations of chromosomes of the population selected are displayed in publication ready images. The database also provides users with trait information on QTL, LOD scores, and explained phenotypic variances for all QTL selected. The CottonQTLdb database provides cotton geneticist and breeders with statistical data on cotton QTL previously identified and provides a visualization tool to view QTL positions on chromosomes. Currently the database (Release 1) contains 2274 QTLs, and succeeding QTL studies will be updated regularly by the curators and members of the cotton community that contribute their data to keep the database current. The database is accessible from http://www.cottonqtldb.org.
Collapse
Affiliation(s)
- Joseph I Said
- Department of Plant and Environmental Sciences, New Mexico State University, Las Cruces, NM, USA,
| | | | | | | |
Collapse
|
18
|
Abstract
The detection and analysis of genetic variation plays an important role in plant breeding and this role is increasing with the continued development of genome sequencing technologies. Molecular genetic markers are important tools to characterize genetic variation and assist with genomic breeding. Processing and storing the growing abundance of molecular marker data being produced requires the development of specific bioinformatics tools and advanced databases. Molecular marker databases range from species specific through to organism wide and often host a variety of additional related genetic, genomic, or phenotypic information. In this chapter, we will present some of the features of plant molecular genetic marker databases, highlight the various types of marker resources, and predict the potential future direction of crop marker databases.
Collapse
|
19
|
Said JI, Song M, Wang H, Lin Z, Zhang X, Fang DD, Zhang J. A comparative meta-analysis of QTL between intraspecific Gossypium hirsutum and interspecific G. hirsutum × G. barbadense populations. Mol Genet Genomics 2014; 290:1003-25. [PMID: 25501533 DOI: 10.1007/s00438-014-0963-9] [Citation(s) in RCA: 133] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2014] [Accepted: 11/18/2014] [Indexed: 12/16/2022]
Abstract
KEY MESSAGE Based on 1075 and 1059 QTL from intraspecific Upland and interspecific Upland × Pima populations, respectively, the identification of QTL clusters and hotspots provides a useful resource for cotton breeding. Mapping of quantitative trait loci (QTL) is a pre-requisite of marker-assisted selection for crop yield and quality. Recent meta-analysis of QTL in tetraploid cotton (Gossypium spp.) has identified regions of the genome with high concentrations of QTL for various traits called clusters and specific trait QTL called hotspots or meta-QTL (mQTL). However, the meta-analysis included all population types of Gossypium mixing both intraspecific G. hirsutum and interspecific G. hirsutum × G. barbadense populations. This study used 1,075 QTL from 58 publications on intraspecific G. hirsutum and 1,059 QTL from 30 publications on G. hirsutum × G. barbadense populations to perform a comprehensive comparative analysis of QTL clusters and hotspots between the two populations for yield, fiber and seed quality, and biotic and abiotic stress tolerance. QTL hotspots were further analyzed for mQTL within the hotspots using Biomercator V3 software. The ratio of QTL between the two population types was proportional yet differences in hotspot type and placement were observed between the two population types. However, on some chromosomes QTL clusters and hotspots were similar between the two populations. This shows that there are some universal QTL regions in the cultivated tetraploid cotton which remain consistent and some regions which differ between population types. This study for the first time elucidates the similarities and differences in QTL clusters and hotspots between intraspecific and interspecific populations, providing an important resource to cotton breeding programs in marker-assisted selection .
Collapse
Affiliation(s)
- Joseph I Said
- Department of Plant and Environmental Sciences, New Mexico State University, Las Cruces, NM, USA,
| | | | | | | | | | | | | |
Collapse
|
20
|
Kapil A, Rai PK, Shanker A. ChloroSSRdb: a repository of perfect and imperfect chloroplastic simple sequence repeats (cpSSRs) of green plants. Database (Oxford) 2014; 2014:bau107. [PMID: 25380781 PMCID: PMC4224265 DOI: 10.1093/database/bau107] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2014] [Revised: 09/19/2014] [Accepted: 10/08/2014] [Indexed: 01/08/2023]
Abstract
Simple sequence repeats (SSRs) are regions in DNA sequence that contain repeating motifs of length 1-6 nucleotides. These repeats are ubiquitously present and are found in both coding and non-coding regions of genome. A total of 534 complete chloroplast genome sequences (as on 18 September 2014) of Viridiplantae are available at NCBI organelle genome resource. It provides opportunity to mine these genomes for the detection of SSRs and store them in the form of a database. In an attempt to properly manage and retrieve chloroplastic SSRs, we designed ChloroSSRdb which is a relational database developed using SQL server 2008 and accessed through ASP.NET. It provides information of all the three types (perfect, imperfect and compound) of SSRs. At present, ChloroSSRdb contains 124 430 mined SSRs, with majority lying in non-coding region. Out of these, PCR primers were designed for 118 249 SSRs. Tetranucleotide repeats (47 079) were found to be the most frequent repeat type, whereas hexanucleotide repeats (6414) being the least abundant. Additionally, in each species statistical analyses were performed to calculate relative frequency, correlation coefficient and chi-square statistics of perfect and imperfect SSRs. In accordance with the growing interest in SSR studies, ChloroSSRdb will prove to be a useful resource in developing genetic markers, phylogenetic analysis, genetic mapping, etc. Moreover, it will serve as a ready reference for mined SSRs in available chloroplast genomes of green plants. Database URL: www.compubio.in/chlorossrdb/
Collapse
Affiliation(s)
- Aditi Kapil
- Department of Bioinformatics, Banasthali University, Banasthali, Rajasthan 304022, India and Department of Mathematics and Statistics, Banasthali University, Banasthali, Rajasthan 304022, India
| | - Piyush Kant Rai
- Department of Bioinformatics, Banasthali University, Banasthali, Rajasthan 304022, India and Department of Mathematics and Statistics, Banasthali University, Banasthali, Rajasthan 304022, India
| | - Asheesh Shanker
- Department of Bioinformatics, Banasthali University, Banasthali, Rajasthan 304022, India and Department of Mathematics and Statistics, Banasthali University, Banasthali, Rajasthan 304022, India
| |
Collapse
|
21
|
Doddamani D, Katta MAVSK, Khan AW, Agarwal G, Shah TM, Varshney RK. CicArMiSatDB: the chickpea microsatellite database. BMC Bioinformatics 2014; 15:212. [PMID: 24952649 PMCID: PMC4230034 DOI: 10.1186/1471-2105-15-212] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2013] [Accepted: 06/17/2014] [Indexed: 01/12/2023] Open
Abstract
Background Chickpea (Cicer arietinum) is a widely grown legume crop in tropical, sub-tropical and temperate regions. Molecular breeding approaches seem to be essential for enhancing crop productivity in chickpea. Until recently, limited numbers of molecular markers were available in the case of chickpea for use in molecular breeding. However, the recent advances in genomics facilitated the development of large scale markers especially SSRs (simple sequence repeats), the markers of choice in any breeding program. Availability of genome sequence very recently opens new avenues for accelerating molecular breeding approaches for chickpea improvement. Description In order to assist genetic studies and breeding applications, we have developed a user friendly relational database named the Chickpea Microsatellite Database (CicArMiSatDB http://cicarmisatdb.icrisat.org). This database provides detailed information on SSRs along with their features in the genome. SSRs have been classified and made accessible through an easy-to-use web interface. Conclusions This database is expected to help chickpea community in particular and legume community in general, to select SSRs of particular type or from a specific region in the genome to advance both basic genomics research as well as applied aspects of crop improvement.
Collapse
Affiliation(s)
| | | | | | | | | | - Rajeev K Varshney
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru 502324, India.
| |
Collapse
|
22
|
He Y, Kumar P, Shen X, Davis RF, Van Becelaere G, May OL, Nichols RL, Chee PW. Re-evaluation of the inheritance for root-knot nematode resistance in the Upland cotton germplasm line M-120 RNR revealed two epistatic QTLs conferring resistance. Theor Appl Genet 2014; 127:1343-51. [PMID: 24728014 DOI: 10.1007/s00122-014-2302-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2013] [Accepted: 03/17/2014] [Indexed: 05/24/2023]
Abstract
We report a second major QTL for root-knot nematode resistance in the highly resistant Upland cotton line M-120RNR and show epistasis between two resistant QTLs with different mechanisms conferring resistance. In an earlier study, we identified a major QTL on Chromosome 11 associated with resistance to root-knot nematode in the M-120 RNR Upland cotton line (Gossypium hirsutum L.) of the Auburn 623 RNR source. Herein, we re-evaluated the genetics of the resistance to root-knot nematode in the M-120 RNR × Pima S-6 population by linkage mapping using recently published SSR markers. The QTL analysis detected two regions significantly associated with the resistance phenotype. In addition to the QTL previously identified on Chromosome 11 (qMi-C11), a major QTL was identified on Chromosome 14 (qMi-C14). The resistance locus on qMi-C11 originated from the Clevewilt parent, while the qMi-C14 locus originated from the other resistant parent, Mexico Wild Jack Jones. The qMi-C14 locus had logarithms of odds score of 17 and accounted for 45 % of the total phenotype variation in egg production. It was also associated with galling index, but the percent variation explained was only 6 %, suggesting that the qMi-C11 locus had a much stronger effect on root gall suppression than egg production, while the qMi-C14 locus had a stronger effect on egg production than galling. The results also suggest that the transgressive segregation observed in the development of Auburn 623 RNR was due to the pyramiding of at least two main effect QTLs as well as an additive-by-additive epistatic effects between the two resistant loci. The SSRs markers tightly linked to the qMi-C11 and qMi-C14 loci will greatly facilitate the improvement of RKN resistance in cotton via marker-assisted breeding.
Collapse
Affiliation(s)
- Yajun He
- Cotton Molecular Breeding Laboratory, University of Georgia, Tifton, GA, 31793, USA
| | | | | | | | | | | | | | | |
Collapse
|
23
|
Fang DD, Jenkins JN, Deng DD, McCarty JC, Li P, Wu J. Quantitative trait loci analysis of fiber quality traits using a random-mated recombinant inbred population in Upland cotton (Gossypium hirsutum L.). BMC Genomics 2014; 15:397. [PMID: 24886099 PMCID: PMC4055785 DOI: 10.1186/1471-2164-15-397] [Citation(s) in RCA: 75] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2014] [Accepted: 05/19/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Upland cotton (Gossypium hirsutum L.) accounts for about 95% of world cotton production. Improving Upland cotton cultivars has been the focus of world-wide cotton breeding programs. Negative correlation between yield and fiber quality is an obstacle for cotton improvement. Random-mating provides a potential methodology to break this correlation. The suite of fiber quality traits that affect the yarn quality includes the length, strength, maturity, fineness, elongation, uniformity and color. Identification of stable fiber quantitative trait loci (QTL) in Upland cotton is essential in order to improve cotton cultivars with superior quality using marker-assisted selection (MAS) strategy. RESULTS Using 11 diverse Upland cotton cultivars as parents, a random-mated recombinant inbred (RI) population consisting of 550 RI lines was developed after 6 cycles of random-mating and 6 generations of self-pollination. The 550 RILs were planted in triplicates for two years in Mississippi State, MS, USA to obtain fiber quality data. After screening 15538 simple sequence repeat (SSR) markers, 2132 were polymorphic among the 11 parents. One thousand five hundred eighty-two markers covering 83% of cotton genome were used to genotype 275 RILs (Set 1). The marker-trait associations were analyzed using the software program TASSEL. At p < 0.01, 131 fiber QTLs and 37 QTL clusters were identified. These QTLs were responsible for the combined phenotypic variance ranging from 62.3% for short fiber content to 82.8% for elongation. The other 275 RILs (Set 2) were analyzed using a subset of 270 SSR markers, and the QTLs were confirmed. Two major QTL clusters were observed on chromosomes 7 and 16. Comparison of these 131 QTLs with the previously published QTLs indicated that 77 were identified before, and 54 appeared novel. CONCLUSIONS The 11 parents used in this study represent a diverse genetic pool of the US cultivated cotton, and 10 of them were elite commercial cultivars. The fiber QTLs, especially QTL clusters reported herein can be readily implemented in a cotton breeding program to improve fiber quality via MAS strategy. The consensus QTL regions warrant further investigation to better understand the genetics and molecular mechanisms underlying fiber development.
Collapse
Affiliation(s)
- David D Fang
- Cotton Fiber Bioscience Research Unit, USDA-ARS-SRRC, New Orleans, LA 70124, USA.
| | | | | | | | | | | |
Collapse
|
24
|
Fang DD, Jenkins JN, Deng DD, McCarty JC, Li P, Wu J. Quantitative trait loci analysis of fiber quality traits using a random-mated recombinant inbred population in Upland cotton (Gossypium hirsutum L.). BMC Genomics 2014. [PMID: 24886099 DOI: 10.1186/1471‐2164‐15‐397] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Upland cotton (Gossypium hirsutum L.) accounts for about 95% of world cotton production. Improving Upland cotton cultivars has been the focus of world-wide cotton breeding programs. Negative correlation between yield and fiber quality is an obstacle for cotton improvement. Random-mating provides a potential methodology to break this correlation. The suite of fiber quality traits that affect the yarn quality includes the length, strength, maturity, fineness, elongation, uniformity and color. Identification of stable fiber quantitative trait loci (QTL) in Upland cotton is essential in order to improve cotton cultivars with superior quality using marker-assisted selection (MAS) strategy. RESULTS Using 11 diverse Upland cotton cultivars as parents, a random-mated recombinant inbred (RI) population consisting of 550 RI lines was developed after 6 cycles of random-mating and 6 generations of self-pollination. The 550 RILs were planted in triplicates for two years in Mississippi State, MS, USA to obtain fiber quality data. After screening 15538 simple sequence repeat (SSR) markers, 2132 were polymorphic among the 11 parents. One thousand five hundred eighty-two markers covering 83% of cotton genome were used to genotype 275 RILs (Set 1). The marker-trait associations were analyzed using the software program TASSEL. At p < 0.01, 131 fiber QTLs and 37 QTL clusters were identified. These QTLs were responsible for the combined phenotypic variance ranging from 62.3% for short fiber content to 82.8% for elongation. The other 275 RILs (Set 2) were analyzed using a subset of 270 SSR markers, and the QTLs were confirmed. Two major QTL clusters were observed on chromosomes 7 and 16. Comparison of these 131 QTLs with the previously published QTLs indicated that 77 were identified before, and 54 appeared novel. CONCLUSIONS The 11 parents used in this study represent a diverse genetic pool of the US cultivated cotton, and 10 of them were elite commercial cultivars. The fiber QTLs, especially QTL clusters reported herein can be readily implemented in a cotton breeding program to improve fiber quality via MAS strategy. The consensus QTL regions warrant further investigation to better understand the genetics and molecular mechanisms underlying fiber development.
Collapse
Affiliation(s)
- David D Fang
- Cotton Fiber Bioscience Research Unit, USDA-ARS-SRRC, New Orleans, LA 70124, USA.
| | | | | | | | | | | |
Collapse
|
25
|
Fang DD, Jenkins JN, Deng DD, McCarty JC, Li P, Wu J. Quantitative trait loci analysis of fiber quality traits using a random-mated recombinant inbred population in Upland cotton (Gossypium hirsutum L.). BMC Genomics 2014. [PMID: 24886099 DOI: 10.1186/14712164-15-397] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/26/2023] Open
Abstract
BACKGROUND Upland cotton (Gossypium hirsutum L.) accounts for about 95% of world cotton production. Improving Upland cotton cultivars has been the focus of world-wide cotton breeding programs. Negative correlation between yield and fiber quality is an obstacle for cotton improvement. Random-mating provides a potential methodology to break this correlation. The suite of fiber quality traits that affect the yarn quality includes the length, strength, maturity, fineness, elongation, uniformity and color. Identification of stable fiber quantitative trait loci (QTL) in Upland cotton is essential in order to improve cotton cultivars with superior quality using marker-assisted selection (MAS) strategy. RESULTS Using 11 diverse Upland cotton cultivars as parents, a random-mated recombinant inbred (RI) population consisting of 550 RI lines was developed after 6 cycles of random-mating and 6 generations of self-pollination. The 550 RILs were planted in triplicates for two years in Mississippi State, MS, USA to obtain fiber quality data. After screening 15538 simple sequence repeat (SSR) markers, 2132 were polymorphic among the 11 parents. One thousand five hundred eighty-two markers covering 83% of cotton genome were used to genotype 275 RILs (Set 1). The marker-trait associations were analyzed using the software program TASSEL. At p < 0.01, 131 fiber QTLs and 37 QTL clusters were identified. These QTLs were responsible for the combined phenotypic variance ranging from 62.3% for short fiber content to 82.8% for elongation. The other 275 RILs (Set 2) were analyzed using a subset of 270 SSR markers, and the QTLs were confirmed. Two major QTL clusters were observed on chromosomes 7 and 16. Comparison of these 131 QTLs with the previously published QTLs indicated that 77 were identified before, and 54 appeared novel. CONCLUSIONS The 11 parents used in this study represent a diverse genetic pool of the US cultivated cotton, and 10 of them were elite commercial cultivars. The fiber QTLs, especially QTL clusters reported herein can be readily implemented in a cotton breeding program to improve fiber quality via MAS strategy. The consensus QTL regions warrant further investigation to better understand the genetics and molecular mechanisms underlying fiber development.
Collapse
Affiliation(s)
- David D Fang
- Cotton Fiber Bioscience Research Unit, USDA-ARS-SRRC, New Orleans, LA 70124, USA.
| | | | | | | | | | | |
Collapse
|
26
|
Kumar M, Kapil A, Shanker A. MitoSatPlant: mitochondrial microsatellites database of viridiplantae. Mitochondrion 2014; 19 Pt B:334-7. [PMID: 24561221 DOI: 10.1016/j.mito.2014.02.002] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2013] [Revised: 02/10/2014] [Accepted: 02/11/2014] [Indexed: 01/12/2023]
Abstract
Microsatellites also known as simple sequence repeats (SSRs) consist of 1-6 nucleotide long repeating units. The importance of mitochondrial SSRs (mtSSRs) in fields like population genetics, plant phylogenetics and genome mapping motivated us to develop MitoSatPlant, a repository of plant mtSSRs. It contains information for perfect, imperfect and compound SSRs mined from 92 mitochondrial genomes of green plants, available at NCBI (as of 1 Feb 2014). A total of 72,798 SSRs were found, of which PCR primers were designed for 72,495 SSRs. Among all sequences, tetranucleotide repeats (26,802) were found to be most abundant whereas hexanucleotide repeats (2751) were detected with least frequency. MitoSatPlant was developed using SQL server 2008 and can be accessed through a front end designed in ASP.Net. It is an easy to use, user-friendly database and will prove to be a useful resource for plant scientists. To the best of our knowledge MitoSatPlant is the only database available for plant mtSSRs and can be freely accessed at http://compubio.in/mitosatplant/.
Collapse
|
27
|
Abstract
The MICdb is a comprehensive relational database of perfect microsatellites extracted from completely sequenced and annotated genomes of bacteria and archaea. The current version MICdb3.0 is an updated and revised version of MICdb2.0. As compared with the previous version MICdb2.0, the current release is significantly improved in terms of much larger coverage of genomes, improved presentation of queried results, user-friendly administration module to manage Simple Sequence Repeat (SSR) data such as addition of new genomes, deletion of obsolete data, etc., and also removal of certain features deemed to be redundant. The new web-interface to the database called Microsatellite Analysis Server (MICAS) version 3.0 has been improved by the addition of powerful high-quality visualization tools to view the query results in the form of pie charts and bar graphs. All the query results and graphs can be exported in different formats so that the users can use them for further analysis. MICAS3.0 is also equipped with a unique genome comparison module using which users can do pair-wise comparison of genomes with regard to their microsatellite distribution. The advanced search module can be used to filter the repeats based on certain criteria such as filtering repeats of a particular motif/repeat size, extracting repeats of coding/non-coding regions, sort repeats, etc. The MICdb database has, therefore, been made portable to be administered by a person with the necessary administrative privileges. The MICdb3.0 database and analysis server can be accessed for free from www.cdfd.org.in/micas. Database URL:http://www.cdfd.org.in/micas
Collapse
Affiliation(s)
- Suresh B Mudunuri
- Department of Computer Science & Engineering, Grandhi Varalakshmi Venkatarao Institute of Technology, Bhimavaram, Andhra Pradesh 534 207, India, Training & Delivery Department, TalentSprint Educational Services, IIIT Campus, Hyderabad, Andhra Pradesh 500 032, India and Laboratory of Computational Biology, Centre for DNA Fingerprinting & Diagnostics, Hyderabad, Andhra Pradesh 500 001, India
| | | | | |
Collapse
|
28
|
Yu J, Jung S, Cheng CH, Ficklin SP, Lee T, Zheng P, Jones D, Percy RG, Main D. CottonGen: a genomics, genetics and breeding database for cotton research. Nucleic Acids Res 2013; 42:D1229-36. [PMID: 24203703 PMCID: PMC3964939 DOI: 10.1093/nar/gkt1064] [Citation(s) in RCA: 185] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
CottonGen (http://www.cottongen.org) is a curated and integrated web-based relational database providing access to publicly available genomic, genetic and breeding data for cotton. CottonGen supercedes CottonDB and the Cotton Marker Database, with enhanced tools for easier data sharing, mining, visualization and data retrieval of cotton research data. CottonGen contains annotated whole genome sequences, unigenes from expressed sequence tags (ESTs), markers, trait loci, genetic maps, genes, taxonomy, germplasm, publications and communication resources for the cotton community. Annotated whole genome sequences of Gossypium raimondii are available with aligned genetic markers and transcripts. These whole genome data can be accessed through genome pages, search tools and GBrowse, a popular genome browser. Most of the published cotton genetic maps can be viewed and compared using CMap, a comparative map viewer, and are searchable via map search tools. Search tools also exist for markers, quantitative trait loci (QTLs), germplasm, publications and trait evaluation data. CottonGen also provides online analysis tools such as NCBI BLAST and Batch BLAST.
Collapse
Affiliation(s)
- Jing Yu
- Department of Horticulture, Washington State University, Pullman, WA 99164-6414, USA, Cotton Incorporated, Cary, NC 27513, USA and Crop Germplasm Research Unit, USDA-ARS-SPARC, College Station, TX 77845, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Buyyarapu R, Kantety RV, Yu JZ, Xu Z, Kohel RJ, Percy RG, Macmil S, Wiley GB, Roe BA, Sharma GC. BAC-pool sequencing and analysis of large segments of A12 and D12 homoeologous chromosomes in upland cotton. PLoS One 2013; 8:e76757. [PMID: 24116150 PMCID: PMC3792896 DOI: 10.1371/journal.pone.0076757] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2013] [Accepted: 08/28/2013] [Indexed: 11/19/2022] Open
Abstract
Although new and emerging next-generation sequencing (NGS) technologies have reduced sequencing costs significantly, much work remains to implement them for de novo sequencing of complex and highly repetitive genomes such as the tetraploid genome of Upland cotton (Gossypium hirsutum L.). Herein we report the results from implementing a novel, hybrid Sanger/454-based BAC-pool sequencing strategy using minimum tiling path (MTP) BACs from Ctg-3301 and Ctg-465, two large genomic segments in A12 and D12 homoeologous chromosomes (Ctg). To enable generation of longer contig sequences in assembly, we implemented a hybrid assembly method to process ~35x data from 454 technology and 2.8-3x data from Sanger method. Hybrid assemblies offered higher sequence coverage and better sequence assemblies. Homology studies revealed the presence of retrotransposon regions like Copia and Gypsy elements in these contigs and also helped in identifying new genomic SSRs. Unigenes were anchored to the sequences in Ctg-3301 and Ctg-465 to support the physical map. Gene density, gene structure and protein sequence information derived from protein prediction programs were used to obtain the functional annotation of these genes. Comparative analysis of both contigs with Arabidopsis genome exhibited synteny and microcollinearity with a conserved gene order in both genomes. This study provides insight about use of MTP-based BAC-pool sequencing approach for sequencing complex polyploid genomes with limited constraints in generating better sequence assemblies to build reference scaffold sequences. Combining the utilities of MTP-based BAC-pool sequencing with current longer and short read NGS technologies in multiplexed format would provide a new direction to cost-effectively and precisely sequence complex plant genomes.
Collapse
Affiliation(s)
- Ramesh Buyyarapu
- Center for Molecular Biology, Department of Biological and Environmental Sciences, Alabama Agricultural & Mechanical University, Normal, Alabama, United States of America
| | - Ramesh V. Kantety
- Center for Molecular Biology, Department of Biological and Environmental Sciences, Alabama Agricultural & Mechanical University, Normal, Alabama, United States of America
| | - John Z. Yu
- United States Department of Agriculture, Agricultural Research Service, Southern Plains Agricultural Research Center, Crop Germplasm Research Unit, College Station, Texas, United States of America
| | - Zhanyou Xu
- United States Department of Agriculture, Agricultural Research Service, Southern Plains Agricultural Research Center, Crop Germplasm Research Unit, College Station, Texas, United States of America
| | - Russell J. Kohel
- United States Department of Agriculture, Agricultural Research Service, Southern Plains Agricultural Research Center, Crop Germplasm Research Unit, College Station, Texas, United States of America
| | - Richard G. Percy
- United States Department of Agriculture, Agricultural Research Service, Southern Plains Agricultural Research Center, Crop Germplasm Research Unit, College Station, Texas, United States of America
| | - Simone Macmil
- Gene Structure and Function Laboratory, University of Otago, Dunedin, New Zealand
| | - Graham B. Wiley
- Arthritis & Immunology Department, Oklahoma Medical Research Foundation, Oklahoma City, Oklahoma, United States of America
| | - Bruce A. Roe
- Advanced Center for Genome Technology, University of Oklahoma, Norman, Oklahoma, United States of America
| | - Govind C. Sharma
- Center for Molecular Biology, Department of Biological and Environmental Sciences, Alabama Agricultural & Mechanical University, Normal, Alabama, United States of America
| |
Collapse
|
30
|
Rai KM, Singh SK, Bhardwaj A, Kumar V, Lakhwani D, Srivastava A, Jena SN, Yadav HK, Bag SK, Sawant SV. Large-scale resource development in Gossypium hirsutum L. by 454 sequencing of genic-enriched libraries from six diverse genotypes. Plant Biotechnol J 2013; 11:953-963. [PMID: 23782852 DOI: 10.1111/pbi.12088] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2013] [Revised: 04/19/2013] [Accepted: 05/04/2013] [Indexed: 06/02/2023]
Abstract
The sequence information has been proved to be an essential genomic resource in case of crop plants for their genetic improvement and better utilization by humans. To dissect the Gossypium hirsutum genome for large-scale development of genomic resources, we adopted hypomethylated restriction-based genomic enrichment strategy to sequence six diverse genotypes. Approximately 5.2-Gb data (more than 18.36 million reads) was generated which, after assembly, represents nearly 1.27-Gb genomic sequences. We predicted a total of 93,363 gene models (21,399 full length) and identified 35,923 gene models which were validated against already sequenced plant genomes. A total of 1,093 transcription factor-encoding genes, 3,135 promoter sequences and 78 miRNA (including 17 newly identified in Gossypium) were predicted. We identified significant no. of molecular markers including 47,093 novel simple sequence repeats and 66,364 novel single nucleotide polymorphisms. In addition, we developed NBRI-Comprehensive Cotton Genomics database, a web resource to provide access of cotton-related genomic resources developed at NBRI. This study contributes considerable amount of genomic resources and suggests a potential role of genic-enriched sequencing in genomic resource development for orphan crop plants.
Collapse
Affiliation(s)
- Krishan Mohan Rai
- Plant Molecular Biology Laboratory, CSIR-National Botanical Research Institute, Lucknow, India
| | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
B VS, Muthamilarasan M, Misra G, Prasad M. FmMDb: a versatile database of foxtail millet markers for millets and bioenergy grasses research. PLoS One 2013; 8:e71418. [PMID: 23951158 PMCID: PMC3741111 DOI: 10.1371/journal.pone.0071418] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2013] [Accepted: 07/01/2013] [Indexed: 11/18/2022] Open
Abstract
The prominent attributes of foxtail millet (Setaria italica L.) including its small genome size, short life cycle, inbreeding nature, and phylogenetic proximity to various biofuel crops have made this crop an excellent model system to investigate various aspects of architectural, evolutionary and physiological significances in Panicoid bioenergy grasses. After release of its whole genome sequence, large-scale genomic resources in terms of molecular markers were generated for the improvement of both foxtail millet and its related species. Hence it is now essential to congregate, curate and make available these genomic resources for the benefit of researchers and breeders working towards crop improvement. In view of this, we have constructed the Foxtail millet Marker Database (FmMDb; http://www.nipgr.res.in/foxtail.html), a comprehensive online database for information retrieval, visualization and management of large-scale marker datasets with unrestricted public access. FmMDb is the first database which provides complete marker information to the plant science community attempting to produce elite cultivars of millet and bioenergy grass species, thus addressing global food insecurity.
Collapse
Affiliation(s)
| | | | - Gopal Misra
- National Institute of Plant Genome Research, New Delhi, India
| | - Manoj Prasad
- National Institute of Plant Genome Research, New Delhi, India
- * E-mail:
| |
Collapse
|
32
|
Du FK, Xu F, Qu H, Feng S, Tang J, Wu R. Exploiting the transcriptome of Euphrates Poplar, Populus euphratica (Salicaceae) to develop and characterize new EST-SSR markers and construct an EST-SSR database. PLoS One 2013; 8:e61337. [PMID: 23593466 PMCID: PMC3623821 DOI: 10.1371/journal.pone.0061337] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2012] [Accepted: 03/08/2013] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND Microsatellite markers or Simple Sequence Repeats (SSRs) are the most popular markers in population/conservation genetics. However, the development of novel microsatellite markers has been impeded by high costs, a lack of available sequence data and technical difficulties. New species-specific microsatellite markers were required to investigate the evolutionary history of the Euphratica tree, Populus euphratica, the only tree species found in the desert regions of Western China and adjacent Central Asian countries. METHODOLOGY/PRINCIPAL FINDINGS A total of 94,090 non-redundant Expressed Sequence Tags (ESTs) from P. euphratica comprising around 63 Mb of sequence data were searched for SSRs. 4,202 SSRs were found in 3,839 ESTs, with 311 ESTs containing multiple SSRs. The most common motif types were trinucleotides (37%) and hexanucleotides (33%) repeats. We developed primer pairs for all of the identified EST-SSRs (eSSRs) and selected 673 of these pairs at random for further validation. 575 pairs (85%) gave successful amplification, of which, 464 (80.7%) were polymorphic in six to 24 individuals from natural populations across Northern China. We also tested the transferability of the polymorphic eSSRs to nine other Populus species. In addition, to facilitate the use of these new eSSR markers by other researchers, we mapped them onto Populus trichocarpa scaffolds in silico and compiled our data into a web-based database (http://202.205.131.253:8080/poplar/resources/static_page/index.html). CONCLUSIONS The large set of validated eSSRs identified in this work will have many potential applications in studies on P. euphratica and other poplar species, in fields such as population genetics, comparative genomics, linkage mapping, QTL, and marker-assisted breeding. Their use will be facilitated by their incorporation into a user-friendly web-based database.
Collapse
Affiliation(s)
- Fang K. Du
- Center for Computational Biology, National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing, China
| | - Fang Xu
- Center for Computational Biology, National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing, China
| | - Hong Qu
- Center for Bioinformatics, National Laboratory of Protein Engineering and Plant Genetic Engineering, College of Life Sciences, Peking University, Beijing, China
| | - Sisi Feng
- Center for Computational Biology, National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing, China
| | - Jijun Tang
- Department of Computer Science and Engineering, University of South Carolina, Columbia, South Carolina, United States of America
| | - Rongling Wu
- Center for Computational Biology, National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing, China
| |
Collapse
|
33
|
Zou C, Lu C, Zhang Y, Song G. Distribution and characterization of simple sequence repeats in Gossypium raimondii genome. Bioinformation 2012; 8:801-6. [PMID: 23139588 PMCID: PMC3488841 DOI: 10.6026/97320630008801] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2012] [Accepted: 08/20/2012] [Indexed: 11/28/2022] Open
Abstract
Simple sequence repeats (SSRs) can be derived from the complete genome sequence. These markers are important for gene mapping as well as marker-assisted selection (MAS). To develop SSRs for cotton gene mapping, we selected the complete genome sequence of Gossypium raimondii, which consisted of 4447 non-redundant scaffolds. Out of 775.2 Mb sequence examined, a total of 136,345 microsatellites were identified with a density of 5.69 kb per SSR in the G. raimondii genome leading to development of 112,177 primer pairs. The distributions of SSRs in the genome were non-random. Among the different motifs ranging from 1 to 6 bp, penta-nucleotide repeats were most abundant (30.5%), followed by tetra-nucleotide repeats (18.2%) and di-nucleotide repeats (16.9%). Among all identified 457 motif types, the most frequently occurring repeat motifs were poly-AT/TA, which accounted for 79.8% of the total di-nt SSRs, followed by AAAT/TTTA with 51.5% of the total tetra-nucleotede. Further, 18,834 microsatellites were detected from the protein-coding genes, and the frequency of gene containing SSRs was 46.0% in 40,976 genes of G. raimondii. These genome-based SSRs developed in the present study will lay the groundwork for developing large numbers of SSR markers for genetic mapping, gene discovery, genetic diversity analysis, and MAS breeding in cotton.
Collapse
Affiliation(s)
- Changsong Zou
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, Henan, China
| | - Cairui Lu
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, Henan, China
| | - Youping Zhang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, Henan, China
| | - Guoli Song
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, Henan, China
| |
Collapse
|
34
|
Wang K, Wang Z, Li F, Ye W, Wang J, Song G, Yue Z, Cong L, Shang H, Zhu S, Zou C, Li Q, Yuan Y, Lu C, Wei H, Gou C, Zheng Z, Yin Y, Zhang X, Liu K, Wang B, Song C, Shi N, Kohel RJ, Percy RG, Yu JZ, Zhu YX, Wang J, Yu S. The draft genome of a diploid cotton Gossypium raimondii. Nat Genet 2012; 44:1098-103. [PMID: 22922876 DOI: 10.1038/ng.2371] [Citation(s) in RCA: 602] [Impact Index Per Article: 50.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2012] [Accepted: 07/05/2012] [Indexed: 11/08/2022]
Abstract
We have sequenced and assembled a draft genome of G. raimondii, whose progenitor is the putative contributor of the D subgenome to the economically important fiber-producing cotton species Gossypium hirsutum and Gossypium barbadense. Over 73% of the assembled sequences were anchored on 13 G. raimondii chromosomes. The genome contains 40,976 protein-coding genes, with 92.2% of these further confirmed by transcriptome data. Evidence of the hexaploidization event shared by the eudicots as well as of a cotton-specific whole-genome duplication approximately 13-20 million years ago was observed. We identified 2,355 syntenic blocks in the G. raimondii genome, and we found that approximately 40% of the paralogous genes were present in more than 1 block, which suggests that this genome has undergone substantial chromosome rearrangement during its evolution. Cotton, and probably Theobroma cacao, are the only sequenced plant species that possess an authentic CDN1 gene family for gossypol biosynthesis, as revealed by phylogenetic analysis.
Collapse
|
35
|
Kuester AP, Nason JD. Microsatellite loci for Gossypium davidsonii (Malvaceae) and other D-genome, Sonoran Desert endemic cotton species. Am J Bot 2012; 99:e91-e93. [PMID: 22358044 DOI: 10.3732/ajb.1100421] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
PREMISE OF THE STUDY Microsatellite primers previously developed for domesticated cotton (Gossypium hirsutum; tetraploid) were screened for their utility in investigating genetic structure and gene flow within G. davidsonii and five other wild, Mexican, D-genome cotton species (all diploid). METHODS AND RESULTS We screened 50 microsatellite primer pairs from the Cotton Marker Database, identifying 10 loci as polymorphic within G. davidsonii. In genotyping approximately 200 individuals from four populations, we found that the number of alleles per locus ranged from seven to 17, and mean observed and expected heterozygosity ranged from 0.145 to 0.492 and from 0.436 to 0.734, respectively. We genotyped six to 20 individuals from each of the remaining species, finding these 10 loci to cross-amplify in all cases and in most cases to be polymorphic. CONCLUSIONS These markers may be useful for further investigation of population genetics of G. davidsonii and other wild D-genome cotton species.
Collapse
Affiliation(s)
- Adam P Kuester
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa 50011 USA
| | | |
Collapse
|
36
|
Jena SN, Srivastava A, Rai KM, Ranjan A, Singh SK, Nisar T, Srivastava M, Bag SK, Mantri S, Asif MH, Yadav HK, Tuli R, Sawant SV. Development and characterization of genomic and expressed SSRs for levant cotton (Gossypium herbaceum L.). Theor Appl Genet 2012; 124:565-576. [PMID: 22038488 DOI: 10.1007/s00122-011-1729-y] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2011] [Accepted: 10/09/2011] [Indexed: 05/31/2023]
Abstract
Four microsatellite-enriched genomic libraries for CA(15), GA(15), AAG(8) and ATG(8) repeats and transcriptome sequences of five cDNA libraries of Gossypium herbaceum were explored to develop simple sequence repeat (SSR) markers. A total of 428 unique clones from repeat enriched genomic libraries were mined for 584 genomic SSRs (gSSRs). In addition, 99,780 unigenes from transcriptome sequencing were explored for 8,900 SSR containing sequences with 12,471 expressed SSRs. The present study adds 1,970 expressed SSRs and 263 gSSRs to the public domain for the use of genetic studies of cotton. When 150 gSSRs and 50 expressed SSRs were tested on a panel of four species of cotton, 68 gSSRs and 12 expressed SSRs revealed polymorphism. These 200 SSRs were further deployed on 15 genotypes of levant cotton for the genetic diversity assessment. This is the first report on the successful use of repeat enriched genomic library and expressed sequence database for microsatellite markers development in G. herbaceum.
Collapse
Affiliation(s)
- Satya Narayan Jena
- Plant Molecular Biology and Genetic Engineering Laboratory, National Botanical Research Institute, Rana Pratap Marg, Lucknow, India
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
37
|
Buyyarapu R, Kantety RV, Yu JZ, Saha S, Sharma GC. Development of New Candidate Gene and EST-Based Molecular Markers for Gossypium Species. Int J Plant Genomics 2012; 2011:894598. [PMID: 22315588 PMCID: PMC3270397 DOI: 10.1155/2011/894598] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2011] [Revised: 10/26/2011] [Accepted: 11/09/2011] [Indexed: 05/31/2023]
Abstract
New source of molecular markers accelerate the efforts in improving cotton fiber traits and aid in developing high-density integrated genetic maps. We developed new markers based on candidate genes and G. arboreum EST sequences that were used for polymorphism detection followed by genetic and physical mapping. Nineteen gene-based markers were surveyed for polymorphism detection in 26 Gossypium species. Cluster analysis generated a phylogenetic tree with four major sub-clusters for 23 species while three species branched out individually. CAP method enhanced the rate of polymorphism of candidate gene-based markers between G. hirsutum and G. barbadense. Two hundred A-genome based SSR markers were designed after datamining of G. arboreum EST sequences (Mississippi Gossypium arboreum EST-SSR: MGAES). Over 70% of MGAES markers successfully produced amplicons while 65 of them demonstrated polymorphism between the parents of G. hirsutum and G. barbadense RIL population and formed 14 linkage groups. Chromosomal localization of both candidate gene-based and MGAES markers was assisted by euploid and hypoaneuploid CS-B analysis. Gene-based and MGAES markers were highly informative as they were designed from candidate genes and fiber transcriptome with a potential to be integrated into the existing cotton genetic and physical maps.
Collapse
Affiliation(s)
- Ramesh Buyyarapu
- Center for Molecular Biology, Department of Natural Resources and Environmental Sciences, Alabama A&M University, 134 ARC Building, P.O. Box 1927, Normal, AL 35762, USA
| | - Ramesh V. Kantety
- Center for Molecular Biology, Department of Natural Resources and Environmental Sciences, Alabama A&M University, 134 ARC Building, P.O. Box 1927, Normal, AL 35762, USA
| | - John Z. Yu
- Southern Plains Agricultural Research Center, USDA-ARS, 2881 F&B Road, College Station, TX 77845, USA
| | - Sukumar Saha
- Genetics and Precision Agriculture Research Unit, USDA-ARS, P.O. Box 5367, MS 39762, USA
| | - Govind C. Sharma
- Center for Molecular Biology, Department of Natural Resources and Environmental Sciences, Alabama A&M University, 134 ARC Building, P.O. Box 1927, Normal, AL 35762, USA
| |
Collapse
|
38
|
Yu JZ, Kohel RJ, Fang DD, Cho J, Van Deynze A, Ulloa M, Hoffman SM, Pepper AE, Stelly DM, Jenkins JN, Saha S, Kumpatla SP, Shah MR, Hugie WV, Percy RG. A high-density simple sequence repeat and single nucleotide polymorphism genetic map of the tetraploid cotton genome. G3 (Bethesda) 2012; 2:43-58. [PMID: 22384381 DOI: 10.1534/g3.111.001552] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/24/2011] [Accepted: 11/04/2011] [Indexed: 11/18/2022]
Abstract
Genetic linkage maps play fundamental roles in understanding genome structure, explaining genome formation events during evolution, and discovering the genetic bases of important traits. A high-density cotton (Gossypium spp.) genetic map was developed using representative sets of simple sequence repeat (SSR) and the first public set of single nucleotide polymorphism (SNP) markers to genotype 186 recombinant inbred lines (RILs) derived from an interspecific cross between Gossypium hirsutum L. (TM-1) and G. barbadense L. (3-79). The genetic map comprised 2072 loci (1825 SSRs and 247 SNPs) and covered 3380 centiMorgan (cM) of the cotton genome (AD) with an average marker interval of 1.63 cM. The allotetraploid cotton genome produced equivalent recombination frequencies in its two subgenomes (At and Dt). Of the 2072 loci, 1138 (54.9%) were mapped to 13 At-subgenome chromosomes, covering 1726.8 cM (51.1%), and 934 (45.1%) mapped to 13 Dt-subgenome chromosomes, covering 1653.1 cM (48.9%). The genetically smallest homeologous chromosome pair was Chr. 04 (A04) and 22 (D04), and the largest was Chr. 05 (A05) and 19 (D05). Duplicate loci between and within homeologous chromosomes were identified that facilitate investigations of chromosome translocations. The map augments evidence of reciprocal rearrangement between ancestral forms of Chr. 02 and 03 versus segmental homeologs 14 and 17 as centromeric regions show homeologous between Chr. 02 (A02) and 17 (D02), as well as between Chr. 03 (A03) and 14 (D03). This research represents an important foundation for studies on polyploid cottons, including germplasm characterization, gene discovery, and genome sequence assembly.
Collapse
|
39
|
Lai D, Li H, Fan S, Song M, Pang C, Wei H, Liu J, Wu D, Gong W, Yu S. Generation of ESTs for flowering gene discovery and SSR marker development in upland cotton. PLoS One 2011; 6:e28676. [PMID: 22163052 PMCID: PMC3232235 DOI: 10.1371/journal.pone.0028676] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2011] [Accepted: 11/13/2011] [Indexed: 11/18/2022] Open
Abstract
Background Upland cotton, Gossypium hirsutum L., is one of the world's most important economic crops. In the absence of the entire genomic sequence, a large number of expressed sequence tag (EST) resources of upland cotton have been generated and used in several studies. However, information about the flower development of this species is rare. Methodology/Principal Findings To clarify the molecular mechanism of flower development in upland cotton, 22,915 high-quality ESTs were generated and assembled into 14,373 unique sequences consisting of 4,563 contigs and 9,810 singletons from a normalized and full-length cDNA library constructed from pooled RNA isolated from shoot apexes, squares, and flowers. Comparative analysis indicated that 5,352 unique sequences had no high-degree matches to the cotton public database. Functional annotation showed that several upland cotton homologs with flowering-related genes were identified in our library. The majority of these genes were specifically expressed in flowering-related tissues. Three GhSEP (G. hirsutum L. SEPALLATA) genes determining floral organ development were cloned, and quantitative real-time PCR (qRT-PCR) revealed that these genes were expressed preferentially in squares or flowers. Furthermore, 670 new putative microsatellites with flanking sequences sufficient for primer design were identified from the 645 unigenes. Twenty-five EST–simple sequence repeats were randomly selected for validation and transferability testing in 17 Gossypium species. Of these, 23 were identified as true-to-type simple sequence repeat loci and were highly transferable among Gossypium species. Conclusions/Significance A high-quality, normalized, full-length cDNA library with a total of 14,373 unique ESTs was generated to provide sequence information for gene discovery and marker development related to upland cotton flower development. These EST resources form a valuable foundation for gene expression profiling analysis, functional analysis of newly discovered genes, genetic linkage, and quantitative trait loci analysis.
Collapse
Affiliation(s)
- Deyong Lai
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, People's Republic of China
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
| | - Huaizhu Li
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
- College of Agronomy, Northwest A&F University, Yangling, Shanxi, People's Republic of China
| | - Shuli Fan
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
| | - Meizhen Song
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
| | - Chaoyou Pang
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
| | - Hengling Wei
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
- College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, Zhejiang, People's Republic of China
| | - Junjie Liu
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
| | - Dong Wu
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
| | - Wenfang Gong
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
| | - Shuxun Yu
- Key Laboratory of Cotton Genetic Improvement of Ministry of Agriculture, The Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan, People's Republic of China
- * E-mail:
| |
Collapse
|
40
|
Jena SN, Srivastava A, Rai KM, Ranjan A, Singh SK, Nisar T, Srivastava M, Bag SK, Mantri S, Asif MH, Yadav HK, Tuli R, Sawant SV. Development and characterization of genomic and expressed SSRs for levant cotton (Gossypium herbaceum L.). Theor Appl Genet 2011. [PMID: 22038488 DOI: 10.1007/s00122‐011‐1729‐y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Subscribe] [Scholar Register] [Indexed: 09/29/2022]
Abstract
Four microsatellite-enriched genomic libraries for CA(15), GA(15), AAG(8) and ATG(8) repeats and transcriptome sequences of five cDNA libraries of Gossypium herbaceum were explored to develop simple sequence repeat (SSR) markers. A total of 428 unique clones from repeat enriched genomic libraries were mined for 584 genomic SSRs (gSSRs). In addition, 99,780 unigenes from transcriptome sequencing were explored for 8,900 SSR containing sequences with 12,471 expressed SSRs. The present study adds 1,970 expressed SSRs and 263 gSSRs to the public domain for the use of genetic studies of cotton. When 150 gSSRs and 50 expressed SSRs were tested on a panel of four species of cotton, 68 gSSRs and 12 expressed SSRs revealed polymorphism. These 200 SSRs were further deployed on 15 genotypes of levant cotton for the genetic diversity assessment. This is the first report on the successful use of repeat enriched genomic library and expressed sequence database for microsatellite markers development in G. herbaceum.
Collapse
Affiliation(s)
- Satya Narayan Jena
- Plant Molecular Biology and Genetic Engineering Laboratory, National Botanical Research Institute, Rana Pratap Marg, Lucknow, India
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Reddy UK, Rong JK, Nimmakayala P, Vajja G, Rahman MA, Yu J, Soliman KM, Heller-Uszynska K, Kilian A, Paterson AH. Use of diversity arrays technology markers for integration into a cotton reference map and anchoring to a recombinant inbred line map. Genome 2011; 54:349-59. [PMID: 21524184 DOI: 10.1139/g11-001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
A diversity array technology (DArT) marker platform was developed for the cotton genome, to evaluate the use of DArT markers compared with AFLP markers in mapping and transferability across the mapping populations. We used a reference genetic map of tetraploid Gossypium L. that already contained ~5000 loci, which coalesced into 26 chromosomes, to anchor newly developed DArT and AFLP markers with the aim of further improving utility and map resolution. Our results indicated that the percentage of polymorphic DArT markers that could be genetically mapped (78.15%) was much higher than that of AFLP markers (22.28%). Sequence analysis of DArT markers indicated that a majority matched known expressed sequence tag (EST) sequences from tetraploid and diploid Gossypium species. A total of 794 Arabidopsis genes were homologous with various DArT marker sequences. Chromosomes 5(A), 7(A), 19(D), 23(D), and 24(D) had more Arabidopsis syntenic DArT markers than the other chromosomes. Anchoring DArT markers from the reference map to a recombinant inbred line (RIL) map indicated that DArT markers will speed the building of maps in de novo RIL populations.
Collapse
Affiliation(s)
- Umesh K Reddy
- Department of Biology and Gus R. Douglass Institute, West Virginia State University, Institute, WV 25112, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Hinchliffe DJ, Meredith WR, Yeater KM, Kim HJ, Woodward AW, Chen ZJ, Triplett BA. Near-isogenic cotton germplasm lines that differ in fiber-bundle strength have temporal differences in fiber gene expression patterns as revealed by comparative high-throughput profiling. Theor Appl Genet 2010; 120:1347-66. [PMID: 20087569 DOI: 10.1007/s00122-010-1260-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/14/2009] [Accepted: 12/27/2009] [Indexed: 05/10/2023]
Abstract
Gene expression profiles of developing cotton (Gossypium hirsutum L.) fibers from two near-isogenic lines (NILs) that differ in fiber-bundle strength, short-fiber content, and in fewer than two genetic loci were compared using an oligonucleotide microarray. Fiber gene expression was compared at five time points spanning fiber elongation and secondary cell wall (SCW) biosynthesis. Fiber samples were collected from field plots in a randomized, complete block design, with three spatially distinct biological replications for each NIL at each time point. Microarray hybridizations were performed in a loop experimental design that allowed comparisons of fiber gene expression profiles as a function of time between the two NILs. Overall, developmental expression patterns revealed by the microarray experiment agreed with previously reported cotton fiber gene expression patterns for specific genes. Additionally, genes expressed coordinately with the onset of SCW biosynthesis in cotton fiber correlated with gene expression patterns of other SCW-producing plant tissues. Functional classification and enrichment analysis of differentially expressed genes between the two NILs revealed that genes associated with SCW biosynthesis were significantly up-regulated in fibers of the high-fiber quality line at the transition stage of cotton fiber development. For independent corroboration of the microarray results, 15 genes were selected for quantitative reverse transcription PCR analysis of fiber gene expression. These analyses, conducted over multiple field years, confirmed the temporal difference in fiber gene expression between the two NILs. We hypothesize that the loci conferring temporal differences in fiber gene expression between the NILs are important regulatory sequences that offer the potential for more targeted manipulation of cotton fiber quality.
Collapse
Affiliation(s)
- Doug J Hinchliffe
- USDA-ARS-SRRC, 1100 Robert E. Lee Blvd., New Orleans, LA, 70124, USA.
| | | | | | | | | | | | | |
Collapse
|
43
|
Taliercio E, Scheffler J, Scheffler B. Characterization of two cotton (Gossypium hirsutum L) invertase genes. Mol Biol Rep 2010; 37:3915-20. [PMID: 20300865 DOI: 10.1007/s11033-010-0048-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2010] [Accepted: 03/05/2010] [Indexed: 12/13/2022]
Abstract
Two cotton vacuolar-invertase genes were identified and sequenced. Both genes had seven exons, including an unusually small second exon typical of acid invertases. These genes encode peptides with many features shared by acid invertases from other species including, leader sequences that probably target the peptide to the vacuole, active site motifs and substrate binding motifs. Expression analyses indicated that one of the genes was expressed in roots during the starch filling stage of development. However, expression of the same gene fluctuated during the starch utilization stage of development. Therefore this gene was unlikely to play a role in determining sink strength of this tissue. Both genes were expressed in elongating fibers where they were likely to play a role in cell expansion. The invertase gene uniquely expressed in fiber had a simple sequence repeat (SSR) in the third intron that was polymorphic among various cotton species. An EST was identified with an expansion of the SSR that included the third intron indicating this SSR is associated with a splice variant. The polymorphic SSR may be useful in investigating the function of this gene in fiber development.
Collapse
|
44
|
Fang DD, Xiao J, Canci PC, Cantrell RG. A new SNP haplotype associated with blue disease resistance gene in cotton (Gossypium hirsutum L.). Theor Appl Genet 2010; 120:943-53. [PMID: 19960336 DOI: 10.1007/s00122-009-1223-y] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2009] [Accepted: 11/12/2009] [Indexed: 05/10/2023]
Abstract
Resistance to cotton blue disease (CBD) was evaluated in 364 F(2.3) families of three populations derived from resistant variety 'Delta Opal'. The CBD resistance in 'Delta Opal' was controlled by one single dominant gene designated Cbd. Two simple sequence repeat (SSR) markers were identified as linked to Cbd by bulked segregant analysis. Cbd resides at the telomere region of chromosome 10. SSR marker DC20027 was 0.75 cM away from Cbd. DC20027 marker fragments amplified from 3 diploid species and 13 cotton varieties whose CBD resistance was known were cloned and sequenced. One single nucleotide polymorphism (SNP) was identified at the 136 th position by sequence alignment analysis. Screening SNP markers previously mapped on chromosome 10 identified an additional 3 SNP markers that were associated with Cbd. A strong association between a haplotype based on four SNP markers and Cbd was developed. This demonstrates one of the first examples in cotton where SNP markers were used to effectively tag a trait enabling marker-assisted selection for high levels of CBD resistance in breeding programs.
Collapse
Affiliation(s)
- David D Fang
- Cotton Fiber Bioscience Research Unit, USDA-ARS-SRRC, 1100 Robert E. Lee Blvd., New Orleans, LA 70124, USA
| | | | | | | |
Collapse
|
45
|
Van Deynze A, Stoffel K, Lee M, Wilkins TA, Kozik A, Cantrell RG, Yu JZ, Kohel RJ, Stelly DM. Sampling nucleotide diversity in cotton. BMC Plant Biol 2009; 9:125. [PMID: 19840401 PMCID: PMC2771027 DOI: 10.1186/1471-2229-9-125] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2008] [Accepted: 10/20/2009] [Indexed: 05/19/2023]
Abstract
BACKGROUND Cultivated cotton is an annual fiber crop derived mainly from two perennial species, Gossypium hirsutum L. or upland cotton, and G. barbadense L., extra long-staple fiber Pima or Egyptian cotton. These two cultivated species are among five allotetraploid species presumably derived monophyletically between G. arboreum and G. raimondii. Genomic-based approaches have been hindered by the limited variation within species. Yet, population-based methods are being used for genome-wide introgression of novel alleles from G. mustelinum and G. tomentosum into G. hirsutum using combinations of backcrossing, selfing, and inter-mating. Recombinant inbred line populations between genetics standards TM-1, (G. hirsutum) x 3-79 (G. barbadense) have been developed to allow high-density genetic mapping of traits. RESULTS This paper describes a strategy to efficiently characterize genomic variation (SNPs and indels) within and among cotton species. Over 1000 SNPs from 270 loci and 279 indels from 92 loci segregating in G. hirsutum and G. barbadense were genotyped across a standard panel of 24 lines, 16 of which are elite cotton breeding lines and 8 mapping parents of populations from six cotton species. Over 200 loci were genetically mapped in a core mapping population derived from TM-1 and 3-79 and in G. hirsutum breeding germplasm. CONCLUSION In this research, SNP and indel diversity is characterized for 270 single-copy polymorphic loci in cotton. A strategy for SNP discovery is defined to pre-screen loci for copy number and polymorphism. Our data indicate that the A and D genomes in both diploid and tetraploid cotton remain distinct from each such that paralogs can be distinguished. This research provides mapped DNA markers for intra-specific crosses and introgression of exotic germplasm in cotton.
Collapse
Affiliation(s)
- Allen Van Deynze
- Seed Biotechnology Center, University of California, 1 Shields Ave, Davis, CA, USA
| | - Kevin Stoffel
- Seed Biotechnology Center, University of California, 1 Shields Ave, Davis, CA, USA
| | - Mike Lee
- Seed Biotechnology Center, University of California, 1 Shields Ave, Davis, CA, USA
| | - Thea A Wilkins
- Department of Plant and Soil Science, Texas Tech University, Experimental Sciences Building, Room 215, Mail Stop 3132, Lubbock, TX 79409-3132, USA
| | - Alexander Kozik
- Genome and Biomedical Sciences Facility, University of California, 1 Shields Ave, Davis, CA, USA
| | - Roy G Cantrell
- Monsanto, 1 800 N. Lindbergh Blvd, St Louis, MO 63167, USA
| | - John Z Yu
- USDA-ARS, Southern Plains Agricultural Research Center, 2881 F&B Road, College Station, TX 77845, USA
| | - Russel J Kohel
- USDA-ARS, Southern Plains Agricultural Research Center, 2881 F&B Road, College Station, TX 77845, USA
| | - David M Stelly
- Department of Soil and Crop Sciences, Texas A & M University, College Station, TX 77843, USA
| |
Collapse
|
46
|
Abdurakhmonov IY, Saha S, Jenkins JN, Buriev ZT, Shermatov SE, Scheffler BE, Pepper AE, Yu JZ, Kohel RJ, Abdukarimov A. Linkage disequilibrium based association mapping of fiber quality traits in G. hirsutum L. variety germplasm. Genetica 2009; 136:401-17. [PMID: 19067183 DOI: 10.1007/s10709-008-9337-8] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2008] [Accepted: 11/17/2008] [Indexed: 02/08/2023]
Abstract
Cotton is the world's leading cash crop, but it lags behind other major crops for marker-assisted breeding due to limited polymorphisms and a genetic bottleneck through historic domestication. This underlies a need for characterization, tagging, and utilization of existing natural polymorphisms in cotton germplasm collections. Here we report genetic diversity, population characteristics, the extent of linkage disequilibrium (LD), and association mapping of fiber quality traits using 202 microsatellite marker primer pairs in 335 G. hirsutum germplasm grown in two diverse environments, Uzbekistan and Mexico. At the significance threshold (r (2) >or= 0.1), a genome-wide average of LD extended up to genetic distance of 25 cM in assayed cotton variety accessions. Genome wide LD at r (2) >or= 0.2 was reduced to approximately 5-6 cM, providing evidence of the potential for association mapping of agronomically important traits in cotton. Results suggest linkage, selection, inbreeding, population stratification, and genetic drift as the potential LD-generating factors in cotton. In two environments, an average of ~20 SSR markers was associated with each main fiber quality traits using a unified mixed liner model (MLM) incorporating population structure and kinship. These MLM-derived significant associations were confirmed in general linear model and structured association test, accounting for population structure and permutation-based multiple testing. Several common markers, showing the significant associations in both Uzbekistan and Mexican environments, were determined. Between 7 and 43% of the MLM-derived significant associations were supported by a minimum Bayes factor at 'moderate to strong' and 'strong to very strong' evidence levels, suggesting their usefulness for marker-assisted breeding programs and overall effectiveness of association mapping using cotton germplasm resources.
Collapse
Affiliation(s)
- Ibrokhim Y Abdurakhmonov
- Center of Genomic Technologies, Institute of Genetics and Plant Experimental Biology, Academy of Sciences of Uzbekistan, Tashkent, Uzbekistan.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Zeng L, Meredith WR, Gutiérrez OA, Boykin DL. Identification of associations between SSR markers and fiber traits in an exotic germplasm derived from multiple crosses among Gossypium tetraploid species. Theor Appl Genet 2009; 119:93-103. [PMID: 19360391 DOI: 10.1007/s00122-009-1020-7] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/04/2008] [Accepted: 03/20/2009] [Indexed: 05/04/2023]
Abstract
Genetic improvement in yield and fiber quality is needed for worldwide cotton production. Identification of molecular markers associated with fiber-related traits can facilitate selection for these traits in breeding. This study was designed to identify associations between SSR markers and fiber traits using an exotic germplasm population, species polycross (SP), derived from multiple crosses among Gossypium tetraploid species. The SP population underwent 11 generations of mixed random mating and selfing followed by 12 generations of selfing. A total of 260 lines were evaluated for fiber-related traits under three environments in 2005 and 2006. Large genotypic variance components in traits were identified relative to components of genotype x environment. Eighty-six primer pairs amplified a total of 314 polymorphic fragments among 260 lines. A total of 202 fragments with above 6% allele frequency were analyzed for associations. Fifty-nine markers were found to have a significant (P < 0.05, 0.01, or 0.001) association with six fiber traits. There were six groups identified within the population using structure analysis. Allele frequency divergence among six groups ranged from 0.11 to 0.27. Of the 59 marker-trait associations, 39 remained significant after correction for population structure and kinship using a mixed linear model. The effect of population sub-structure on associations was most significant in boll weight among the traits analyzed. The sub-structure among the SP lines may be caused by natural selection, the breeding method applied during development of inbred lines, and unknown factors. The identified marker-trait associations can be useful in breeding and help determine genetic mechanisms underlying interrelationships among fiber traits.
Collapse
Affiliation(s)
- Linghe Zeng
- USDA-ARS, Crop Genetics and Production Unit, Stoneville, MS 38776, USA.
| | | | | | | |
Collapse
|
48
|
Abstract
Cotton fiber cells elongate without dividing to form economically valuable spinnable fiber. Reports of the ploidy level of fiber cells are variable. Early reports indicated an increase in nuclear DNA content in young fibers; however, subsequent reports failed to observe such a significant increase in ploidy level. Evaluation and analysis of genes involved in regulation of DNA synthesis and other aspects of cell cycle regulation identified relevant genes that were present in fiber cells though usually at low levels. We report the isolation and characterization of another gene likely to be involved in cell cycle/DNA synthesis control. This gene was similar to a gene from Medicago species that controls entry into anaphase by regulating the activity of the anaphase promoting complex ability to ubiquinate selected proteins. The cotton gene was composed of nine exons and the deduced translational sequences have motifs similar to a Medicago gene expressed in highly polyploid cells. Based on this similarity the cotton gene was designated Ghcdh. Ghcdh is highly expressed in meristems and leaves but is present at much lower levels in fiber cells. These data are consistent with the lower levels of polyploidy reported for cotton fiber. A simple sequence repeat was identified in the gene that may be exploited as a marker to map this gene and associate it with important traits in cotton.
Collapse
|
49
|
Abdurakhmonov IY, Kohel RJ, Yu JZ, Pepper AE, Abdullaev AA, Kushanov FN, Salakhutdinov IB, Buriev ZT, Saha S, Scheffler BE, Jenkins JN, Abdukarimov A. Molecular diversity and association mapping of fiber quality traits in exotic G. hirsutum L. germplasm. Genomics 2008; 92:478-87. [PMID: 18801424 DOI: 10.1016/j.ygeno.2008.07.013] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2008] [Revised: 06/23/2008] [Accepted: 07/29/2008] [Indexed: 02/05/2023]
Abstract
The narrow genetic base of cultivated cotton germplasm is hindering the cotton productivity worldwide. Although potential genetic diversity exists in Gossypium genus, it is largely 'underutilized' due to photoperiodism and the lack of innovative tools to overcome such challenges. The application of linkage disequilibrium (LD)-based association mapping is an alternative powerful molecular tool to dissect and exploit the natural genetic diversity conserved within cotton germplasm collections, greatly accelerating still 'lagging' cotton marker-assisted selection (MAS) programs. However, the extent of genome-wide linkage disequilibrium (LD) has not been determined in cotton. We report the extent of genome-wide LD and association mapping of fiber quality traits by using a 95 core set of microsatellite markers in a total of 285 exotic Gossypium hirsutum accessions, comprising of 208 landrace stocks and 77 photoperiodic variety accessions. We demonstrated the existence of useful genetic diversity within exotic cotton germplasm. In this germplasm set, 11-12% of SSR loci pairs revealed a significant LD. At the significance threshold (r(2)>/=0.1), a genome-wide average of LD declines within the genetic distance at <10 cM in the landrace stocks germplasm and >30 cM in variety germplasm. Genome wide LD at r(2)>/=0.2 was reduced on average to approximately 1-2 cM in the landrace stock germplasm and 6-8 cM in variety germplasm, providing evidence of the potential for association mapping of agronomically important traits in cotton. We observed significant population structure and relatedness in assayed germplasm. Consequently, the application of the mixed liner model (MLM), considering both kinship (K) and population structure (Q) detected between 6% and 13% of SSR markers associated with the main fiber quality traits in cotton. Our results highlight for the first time the feasibility and potential of association mapping, with consideration of the population structure and stratification existing in cotton germplasm resources. The number of SSR markers associated with fiber quality traits in diverse cotton germplasm, which broadly covered many historical meiotic events, should be useful to effectively exploit potentially new genetic variation by using MAS programs.
Collapse
Affiliation(s)
- I Y Abdurakhmonov
- Center of Genomic Technologies, Institute of Genetics and Plant Experimental Biology, Academy of Sciences of Uzbekistan. Yuqori Yuz, Qibray region Tashkent district, 702151, Uzbekistan
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Abstract
Eucalyptus is an important short rotation pulpy woody plant, grown widely in the tropics. Recently, many genomic programmes are underway leading to the accumulation of voluminous genomic and expressed sequence tag sequences in public databases. These sequences can be utilized for analysis of simple sequence repeats (SSRs) and single nucleotide polymorphism (SNPs) available in the transcribed genes. In this study, in silico analysis of 15,285 sequences representing partial and full-length mRNA from Eucalyptus species for their use in developing SSRs or microsatellites were carried out. A total of 875 EST-SSRs were identified from 772 SSR containing ESTs. Motif size of 6 for dinucleotide and 5 for trinucleotide, tetranucleotide, and pentanucleotides were considered in locating the microsatellites. The average frequency of identified SSRs was 12.9%. The dinucleotide repeats were the most abundant among the dinucleotide, trinucleotide and tetranucleotide motifs and accounted for 50.9% of the Eucalyptus genome. Primer designing analysis showed that 571 sequences with SSRs had sufficient flanking regions for polymerase chain reaction (PCR) primer synthesis. Evaluation of the usefulness of the SSRs showed that EST-derived SSRs can generate polymorphic markers as all the primers showed allelic diversity among the 16 provenances of E. tereticornis.
Collapse
|