1
|
Abstract
Molecular markers are used to provide the link between genotype and phenotype, for the production of molecular genetic maps and to assess genetic diversity within and between related species. Single nucleotide polymorphisms (SNPs) are the most abundant molecular genetic marker. SNPs can be identified in silico, but care must be taken to ensure that the identified SNPs reflect true genetic variation and are not a result of errors associated with DNA sequencing. The SNP detection method autoSNP has been developed to identify SNPs from sequence data for any species. Confidence in the predicted SNPs is based on sequence redundancy, and haplotype co-segregation scores are calculated for a further independent measure of confidence. We have extended the autoSNP method to produce autoSNPdb, which integrates SNP and gene annotation information with a graphical viewer. We have applied this software to public barley expressed sequences, and the resulting database is available over the Internet. SNPs can be viewed and searched by sequence, functional annotation or predicted synteny with a reference genome, in this case rice. The correlation between SNPs and barley cultivar, expressed tissue type and development stage has been collated for ease of exploration. An average of one SNP per 240 bp was identified, with SNPs more prevalent in the 5' regions and simple sequence repeat (SSR) flanking sequences. Overall, autoSNPdb can provide a wealth of genetic polymorphism information for any species for which sequence data are available.
Collapse
Affiliation(s)
- Chris Duran
- Australian Centre for Plant Functional Genomics, School of Land, Crop and Food Sciences, Institute for Molecular Bioscience, University of Queensland, Brisbane, Qld 4072, Australia
| | | | | | | | | | | |
Collapse
|
2
|
Abstract
Over 3.5 million expressed sequence tags from the major cereal taxa were used to electronically mine over 176,000 putative single nucleotide polymorphisms (SNPs). The density, distribution and degree of linkage between these SNPs were compared among the different taxa. The frequency of sequence polymorphism was lowest in diploid taxa (rice, barley and sorghum), intermediate in tetraploid maize and highest in allohexaploid wheat and octoploid sugarcane. SNPs were further categorized as either intravarietal (differences between gene family members and homoeologues) or varietal (differences between two varieties), and as either co-segregating or non-co-segregating with neighbouring polymorphisms. Varietal co-segregating SNPs represent the best candidates for molecular markers as they show variation between varieties and have a high probability of being validated, as sequencing errors are unlikely to co-segregate with one another. This elite class of SNPs was most abundant in barley and least abundant in wheat and rice. Despite the large number of observed sequence polymorphisms in allohexaploid wheat, only a fraction of those available are likely to make good molecular markers. In addition, we found that rice SNPs up to 10 kb apart were in linkage disequilibrium (LD), but that high levels of LD attributable to population structure confounded the tracking of LD over greater distances.
Collapse
Affiliation(s)
- Gary L A Barker
- School of Biological Sciences, University of Bristol, Woodland Road, Bristol, BS8 1UG, UK.
| | | |
Collapse
|
3
|
An C, Saha S, Jenkins JN, Ma DP, Scheffler BE, Kohel RJ, Yu JZ, Stelly DM. Cotton (Gossypium spp.) R2R3-MYB transcription factors SNP identification, phylogenomic characterization, chromosome localization, and linkage mapping. Theor Appl Genet 2008; 116:1015-26. [PMID: 18338155 DOI: 10.1007/s00122-008-0732-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2007] [Accepted: 02/11/2008] [Indexed: 05/08/2023]
Abstract
R2R3-MYB transcription factors of plants are involved in the regulation of trichome length and density. Several of them are differentially expressed during initiation and elongation of cotton fibers. We report sequence phylogenomic characterization of the six MYB genes, their chromosomal localization, and linkage mapping via SNP marker in AD-genome cotton (2n = 52). Phylogenetic grouping and comparison to At- and Dt-genome putative ancestral diploid species of allotetraploid cotton facilitated differentiation between genome-specific polymorphisms (GSPs) and marker-suitable locus-specific polymorphisms (LSPs). The SNP frequency averaged one per 77 bases overall, and one per 106 and 30 bases in coding and non-coding regions, respectively. SNP-based multivariate relationships conformed to independent evolution of the six MYB homoeologous loci in the four tetraploid species. Nucleotide diversity analysis indicated that the six MYB loci evolved more quickly in the Dt- than At-genome. The greater variation in the Dt-D genome comparisons than that in At-A genome comparisons showed no significant bias among synonymous substitution, non-synonymous substitution, and nucleotide change in non-coding regions. SNPs were concordantly mapped by deletion analysis and linkage mapping, which confirmed their value as candidate gene markers and indicated the reliability of the SNP discovery strategy in tetraploid cotton species. We consider that these SNPs may be useful for genetic dissection of economically important fiber and yield traits because of the role of these genes in fiber development.
Collapse
Affiliation(s)
- Chuanfu An
- Department of Plant and Soil Sciences, Mississippi State University, Mississippi State, MS 39762, USA
| | | | | | | | | | | | | | | |
Collapse
|
4
|
Kota R, Varshney RK, Prasad M, Zhang H, Stein N, Graner A. EST-derived single nucleotide polymorphism markers for assembling genetic and physical maps of the barley genome. Funct Integr Genomics 2007; 8:223-33. [PMID: 17968603 DOI: 10.1007/s10142-007-0060-9] [Citation(s) in RCA: 78] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2007] [Revised: 08/17/2007] [Accepted: 09/15/2007] [Indexed: 11/24/2022]
Abstract
In a panel of seven genotypes, 437 expressed sequence tag (EST)-derived DNA fragments were sequenced. Single nucleotide polymorphisms (SNPs) that were polymorphic between the parents of three mapping populations were mapped by heteroduplex analysis and a genome-wide consensus map comprising 216 EST-derived SNPs and 4 InDel (insertion/deletion) markers was constructed. The average frequency of SNPs amounted to 1/130 bp and 1/107.8 bp for a set of randomly selected and a set of mapped ESTs, respectively. The calculated nucleotide diversities (pi) ranged from 0 to 40.0 x 10(-3) (average 3.1 x 10(-3)) and 0.52 x 10(-3) to 39.51 x 10(-3) (average 4.37 x 10(-3)) for random and mapped ESTs, respectively. The polymorphism information content value for mapped SNPs ranged from 0.24 to 0.50 with an average of 0.34. As expected, combination of SNPs present in an amplicon (haplotype) exhibited a higher information content ranging from 0.24 to 0.85 with an average of 0.50. Cleaved amplified polymorphic sequence assays (including InDels) were designed for a total of 87 (39.5%) SNP markers. The high abundance of SNPs in the barley genome provides avenues for the systematic development of saturated genetic maps and their integration with physical maps.
Collapse
Affiliation(s)
- R Kota
- Plant Disease Resistance Group, CSIRO-Plant Industry, PO Box 1600, Canberra ACT 2601, Australia
| | | | | | | | | | | |
Collapse
|
5
|
An C, Saha S, Jenkins JN, Scheffler BE, Wilkins TA, Stelly DM. Transcriptome profiling, sequence characterization, and SNP-based chromosomal assignment of the EXPANSIN genes in cotton. Mol Genet Genomics 2007; 278:539-53. [PMID: 17724613 DOI: 10.1007/s00438-007-0270-9] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2007] [Accepted: 06/15/2007] [Indexed: 10/22/2022]
Abstract
The knowledge of biological significance associated with DNA markers is very limited in cotton. SNPs are potential functional marker to tag genes of biological importance. Plant expansins are a group of extracellular proteins that directly modify the mechanical properties of cell walls, enable turgor-driven cell extension, and likely affect length and quality of cotton fibers. Here, we report the expression profiles of EXPANSIN transcripts during fiber elongation and the discovery of SNP markers, assess the SNP characteristics, and localize six EXPANSIN A genes to chromosomes. Transcriptome profiling of cotton fiber oligonucleotide microarrays revealed that seven EXPANSIN transcripts were differentially expressed when there was parallel polar elongation during morphogenesis at early stage of fiber development, suggesting that major and minor isoforms perform discrete functions during polar elongation and lateral expansion. Ancestral and homoeologous relationships of the six EXPANSIN A genes were revealed by phylogenetic grouping and comparison to extant A- and D-genome relatives of contemporary AD-genome cottons. The average rate of SNP per nucleotide was 2.35% (one SNP per 43 bp), with 1.74 and 3.99% occurring in coding and noncoding regions, respectively, in the selected genotypes. An unequal evolutionary rate of the EXPANSIN A genes at the subgenome level of tetraploid cotton was recorded. Chromosomal locations for each of the six EXPANSIN A genes were established by gene-specific SNP markers. Results revealed a strategy for discovering SNP markers in a polyploidy species like cotton. These markers could be useful to associate candidate genes with the complex fiber traits in MAS.
Collapse
Affiliation(s)
- Chuanfu An
- Department of Plant and Soil Sciences, Mississippi State University, Mississippi State, MS 39762, USA
| | | | | | | | | | | |
Collapse
|
6
|
Melotto M, Monteiro-Vitorello CB, Bruschi AG, Camargo LEA. Comparative bioinformatic analysis of genes expressed in common bean (Phaseolus vulgaris L.) seedlings. Genome 2007; 48:562-70. [PMID: 16121253 DOI: 10.1139/g05-010] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
To rapidly and cost-effectively generate gene expression data, we developed an annotated unigene database of common bean (Phaseolus vulgaris L.). In this study, 3 cDNA libraries were constructed from the bean breeding line SEL1308, 1 from young leaf and 2 from seedlings inoculated or not inoculated with the fungal pathogen Colletotrichum lindemuthianum (Sacc. & Magnus) Briosi & Cavara, which causes anthracnose in common bean. To this date, 5255 single-pass sequences have been included in the database after selection based on sequence quality. These ESTs were trimmed and clustered using the computer programs Phred and CAP3 to form a unigene collection of 3126 unique sequences. Within clusters, 318 single nucleotide polymorphisms (SNPs) and 68 insertions-deletions (indels) were found, indicating the presence of paralogous gene families in our database. Each unigene sequence was analyzed for possible function using their similarity to known genes represented in the GenBank database and classified into 14 categories. Only 314 unigenes showed significant similarities to Phaseolus genomic sequences and P. vulgaris ESTs, which indicates that 90% (2818 unigenes) of our database represent newly discovered common bean genes. In addition, 12% (387 unigenes) were shown to be specific to common bean. This study represents a first step towards the discovery of novel genes in beans and a valuable source of molecular markers for expressed gene tagging and mapping.
Collapse
Affiliation(s)
- Maeli Melotto
- Department de Fitopatologia, Laboratório de Genética Molecular, ESALQ, Universidade de São Paulo, Brazil.
| | | | | | | |
Collapse
|
7
|
Varshney RK, Beier U, Khlestkina EK, Kota R, Korzun V, Graner A, Börner A. Single nucleotide polymorphisms in rye (Secale cereale L.): discovery, frequency, and applications for genome mapping and diversity studies. Theor Appl Genet 2007; 114:1105-16. [PMID: 17345059 DOI: 10.1007/s00122-007-0504-6] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2006] [Accepted: 01/07/2007] [Indexed: 05/02/2023]
Abstract
To elucidate the potential of single nucleotide polymorphism (SNP) markers in rye, a set of 48 barley EST (expressed sequence tag) primer pairs was employed to amplify from DNA prepared from five rye inbred lines. A total of 96 SNPs and 26 indels (insertion-deletions) were defined from the sequences of 14 of the resulting amplicons, giving an estimated frequency of 1 SNP per 58 bp and 1 indel per 214 bp in the rye transcriptome. A mean of 3.4 haplotypes per marker with a mean expected heterozygosity of 0.66 were observed. The nucleotide diversity index (pi) was estimated to be in the range 0.0059-0.0530. To improve assay cost-effectiveness, 12 of the 14 SNPs were converted to a cleaved amplified polymorphic sequence (CAPS) format. The resulting 12 SNP loci mapped to chromosomes 1R, 3R, 4R, 5R, 6R, and 7R, at locations consistent with their known map positions in barley. SNP genotypic data were compared with genomic simple sequence repeat (SSR) and EST-derived SSR genotypic data collected from the same templates. This showed a broad equivalence with respect to genetic diversity between these different data types.
Collapse
Affiliation(s)
- R K Varshney
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstrasse 3, 06466 Gatersleben, Germany.
| | | | | | | | | | | | | |
Collapse
|
8
|
Morant M, Schoch GA, Ullmann P, Ertunç T, Little D, Olsen CE, Petersen M, Negrel J, Werck-Reichhart D. Catalytic activity, duplication and evolution of the CYP98 cytochrome P450 family in wheat. Plant Mol Biol 2007; 63:1-19. [PMID: 17160453 DOI: 10.1007/s11103-006-9028-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2006] [Accepted: 05/30/2006] [Indexed: 05/12/2023]
Abstract
A burst of evolutionary duplication upon land colonization seems to have led to the large superfamily of cytochromes P450 in higher plants. Within this superfamily some clans and families are heavily duplicated. Others, such as genes involved in the phenylpropanoid pathway have led to fewer duplication events. Eight coding sequences belonging to the CYP98 family reported to catalyze the 3-hydroxylation step in this pathway were isolated from Triticum aestivum (wheat) and expressed in yeast. Comparison of the catalytic properties of the recombinant enzymes with those of CYP98s from other plant taxa was coupled to phylogenetic analyses. Our results indicate that the unusually high frequency of gene duplication in the wheat CYP98 family is a direct or indirect result from ploidization. While ancient duplication led to evolution of enzymes with different substrate preferences, most of recent duplicates underwent silencing via degenerative mutations. Three of the eight tested CYP98s from wheat have phenol meta-hydroxylase activity, with p-coumaroylshikimate being the primary substrate for all of these, as it is the case for CYP98s from sweet basil and Arabidopsis thaliana. However, CYP98s from divergent taxa have acquired different additional subsidiary activities. Some of them might be significant in the metabolism of various free or conjugated phenolics in different plant species. One of the most significant is meta-hydroxylation of p-coumaroyltyramine, predominantly by the wheat enzymes, for the synthesis of suberin phenolic monomers. Homology modeling, confirmed by directed mutagenesis, provides information on the protein regions and structural features important for some observed changes in substrate selectivity. They indicate that the metabolism of quinate ester and tyramine amide of p-coumaric acid rely on the same recognition site in the protein.
Collapse
Affiliation(s)
- Marc Morant
- Department of Plant Stress Response, Institute of Plant Molecular Biology, CNRS-UPR 2357, Université Louis Pasteur, Centre National de la Recherche Scientifique, 67000, Strasbourg, France
| | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Cordeiro GM, Eliott F, McIntyre CL, Casu RE, Henry RJ. Characterisation of single nucleotide polymorphisms in sugarcane ESTs. Theor Appl Genet 2006; 113:331-43. [PMID: 16791699 DOI: 10.1007/s00122-006-0300-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2005] [Accepted: 04/21/2006] [Indexed: 05/04/2023]
Abstract
Commercial sugarcane cultivars (Saccharum spp. hybrids) are both polyploid and aneuploid with chromosome numbers in excess of 100; these chromosomes can be assigned to 8 homology groups. To determine the utility of single nucleotide polymorphisms (SNPs) as a means of improving our understanding of the complex sugarcane genome, we developed markers to a suite of SNPs identified in a list of sugarcane ESTs. Analysis of 69 EST contigs showed a median of 9 SNPs per EST and an average of 1 SNP per 50 bp of coding sequence. The quantitative presence of each base at 58 SNP loci within 19 contiguous sequence sets was accurately and reliably determined for 9 sugarcane genotypes, including both commercial cultivars and ancestral species, through the use of quantitative light emission technology in pyrophosphate sequencing. Across the 9 genotypes tested, 47 SNP loci were polymorphic and 11 monomorphic. Base frequency at individual SNP loci was found to vary approximately twofold between Australian sugarcane cultivars and more widely between cultivars and wild species. Base quantity was shown to segregate as expected in the IJ76-514 x Q165 sugarcane mapping population, indicating that SNPs that occur on one or two sugarcane chromosomes have the potential to be mapped. The use of SNP base frequencies from five of the developed markers was able to clearly distinguish all genotypes in the population. The use of SNP base frequencies from a further six markers within an EST contig was able to help establish the likely copy number of the locus in two genotypes tested. This is the first instance of a technology that has been able to provide an insight into the copy number of a specific gene locus in hybrid sugarcane. The identification of specific and numerous haplotypes/alleles present in a genotype by pyrophosphate sequencing or alternative techniques ultimately will provide the basis for identifying associations between specific alleles and phenotype and between allele dosage and phenotype in sugarcane.
Collapse
Affiliation(s)
- Giovanni M Cordeiro
- Centre for Plant Conservation Genetics, Southern Cross University, PO Box 157, Lismore 2480, Australia.
| | | | | | | | | |
Collapse
|
10
|
Cogan NOI, Ponting RC, Vecchies AC, Drayton MC, George J, Dracatos PM, Dobrowolski MP, Sawbridge TI, Smith KF, Spangenberg GC, Forster JW. Gene-associated single nucleotide polymorphism discovery in perennial ryegrass (Lolium perenne L.). Mol Genet Genomics 2006; 276:101-12. [PMID: 16708235 DOI: 10.1007/s00438-006-0126-8] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2006] [Accepted: 03/29/2006] [Indexed: 11/28/2022]
Abstract
Molecular genetic marker development in perennial ryegrass has largely been dependent on anonymous sequence variation. The availability of a large-scale EST resource permits the development of functionally-associated genetic markers based on SNP variation in candidate genes. Genic SNP loci and associated haplotypes are suitable for implementation in molecular breeding of outbreeding forage species. Strategies for in vitro SNP discovery through amplicon cloning and sequencing have been designed and implemented. Putative SNPs were identified within and between the parents of the F(1)(NA(6) x AU(6)) genetic mapping family and were validated among progeny individuals. Proof-of-concept for the process was obtained using the drought tolerance-associated LpASRa2 gene. SNP haplotype structures were determined and correlated with predicted amino acid changes. Gene-length LD was evaluated across diverse germplasm collections. A survey of SNP variation across 100 candidate genes revealed a high frequency of SNP incidence (c. 1 per 54 bp), with similar proportions in exons and introns. A proportion (c. 50%) of the validated genic SNPs were assigned to the F(1)(NA(6) x AU(6)) genetic map, showing high levels of coincidence with previously mapped RFLP loci. The perennial ryegrass SNP resource will enable genetic map integration, detailed LD studies and selection of superior allele content during varietal development.
Collapse
Affiliation(s)
- Noel O I Cogan
- Primary Industries Research Victoria and Molecular Plant Breeding Cooperative Research Centre, Victorian AgriBiosciences Centre, La Trobe Research and Development Park, Bundoora, VIC, 3083, Australia
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Bundock PC, Cross MJ, Shapter FM, Henry RJ. Robust allele-specific polymerase chain reaction markers developed for single nucleotide polymorphisms in expressed barley sequences. Theor Appl Genet 2006; 112:358-65. [PMID: 16328233 DOI: 10.1007/s00122-005-0137-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2005] [Accepted: 10/21/2005] [Indexed: 05/05/2023]
Abstract
Many methods have been developed to assay for single nucleotide polymorphisms (SNPs), but generally these depend on access to specialised equipment. Allele-specific polymerase chain reaction (AS-PCR) is a method that does not require specialised equipment (other than a thermocycler), but there is a common perception that AS-PCR markers can be unreliable. We have utilised a three primer AS-PCR method comprising of two flanking-primers combined with an internal allele-specific primer. We show here that this method produces a high proportion of robust markers (from candidate allele specific primers). Forty-nine inter-varietal SNP sites in 31 barley (Hordeum vulgare L.) genes were targeted for the development of AS-PCR assays. The SNP sites were found by aligning barley expressed sequence tags from public databases. The targeted genes correspond to cDNAs that have been used as restriction fragment length polymorphic probes for linkage mapping in barley. Two approaches were adopted in developing the markers. In the first approach, designed to maximise the successful development of markers to a SNP site, markers were developed for 18 sites from 19 targeted (95% success rate). With the second approach, designed to maximise the number of markers developed per primer synthesised, markers were developed for 18 SNP sites from 30 that were targeted (a 60% success rate). The robustness of markers was assessed from the range of annealing temperatures over which the PCR assay was allele-specific. The results indicate that this form of AS-PCR is highly successful for the development of robust SNP markers.
Collapse
Affiliation(s)
- P C Bundock
- Centre for Plant Conservation Genetics, Molecular Plant Breeding CRC, Southern Cross University, PO Box 157, Lismore, NSW, 2480, Australia.
| | | | | | | |
Collapse
|
12
|
Rostoks N, Mudie S, Cardle L, Russell J, Ramsay L, Booth A, Svensson JT, Wanamaker SI, Walia H, Rodriguez EM, Hedley PE, Liu H, Morris J, Close TJ, Marshall DF, Waugh R. Genome-wide SNP discovery and linkage analysis in barley based on genes responsive to abiotic stress. Mol Genet Genomics 2005. [PMID: 16244872 DOI: 10.1007/s00438‐005‐0046‐z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
More than 2,000 genome-wide barley single nucleotide polymorphisms (SNPs) were developed by resequencing unigene fragments from eight diverse accessions. The average genome-wide SNP frequency observed in 877 unigenes was 1 SNP per 200 bp. However, SNP frequency was highly variable with the least number of SNP and SNP haplotypes observed within European cultivated germplasm reflecting effects of breeding history on genetic diversity. More than 300 SNP loci were mapped genetically in three experimental mapping populations which allowed the construction of an integrated SNP map incorporating a large number of RFLP, AFLP and SSR markers (1,237 loci in total). The genes used for SNP discovery were selected based on their transcriptional response to a variety of abiotic stresses. A set of known barley abiotic stress QTL was positioned on the linkage map, while the available sequence and gene expression information facilitated the identification of genes potentially associated with these traits. Comparison of the sequenced SNP loci to the rice genome sequence identified several regions of highly conserved gene order providing a framework for marker saturation in barley genomic regions of interest. The integration of genome-wide SNP and expression data with available genetic and phenotypic information will facilitate the identification of gene function in barley and other non-model organisms.
Collapse
Affiliation(s)
- Nils Rostoks
- Genome Dynamics, Scottish Crop Research Institute, Invergowrie, Dundee, DD2 5DA, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Rostoks N, Mudie S, Cardle L, Russell J, Ramsay L, Booth A, Svensson JT, Wanamaker SI, Walia H, Rodriguez EM, Hedley PE, Liu H, Morris J, Close TJ, Marshall DF, Waugh R. Genome-wide SNP discovery and linkage analysis in barley based on genes responsive to abiotic stress. Mol Genet Genomics 2005; 274:515-27. [PMID: 16244872 DOI: 10.1007/s00438-005-0046-z] [Citation(s) in RCA: 173] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2005] [Accepted: 08/20/2005] [Indexed: 10/25/2022]
Abstract
More than 2,000 genome-wide barley single nucleotide polymorphisms (SNPs) were developed by resequencing unigene fragments from eight diverse accessions. The average genome-wide SNP frequency observed in 877 unigenes was 1 SNP per 200 bp. However, SNP frequency was highly variable with the least number of SNP and SNP haplotypes observed within European cultivated germplasm reflecting effects of breeding history on genetic diversity. More than 300 SNP loci were mapped genetically in three experimental mapping populations which allowed the construction of an integrated SNP map incorporating a large number of RFLP, AFLP and SSR markers (1,237 loci in total). The genes used for SNP discovery were selected based on their transcriptional response to a variety of abiotic stresses. A set of known barley abiotic stress QTL was positioned on the linkage map, while the available sequence and gene expression information facilitated the identification of genes potentially associated with these traits. Comparison of the sequenced SNP loci to the rice genome sequence identified several regions of highly conserved gene order providing a framework for marker saturation in barley genomic regions of interest. The integration of genome-wide SNP and expression data with available genetic and phenotypic information will facilitate the identification of gene function in barley and other non-model organisms.
Collapse
Affiliation(s)
- Nils Rostoks
- Genome Dynamics, Scottish Crop Research Institute, Invergowrie, Dundee, DD2 5DA, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Lidie KB, Ryan JC, Barbier M, Van Dolah FM. Gene expression in Florida red tide dinoflagellate Karenia brevis: analysis of an expressed sequence tag library and development of DNA microarray. Mar Biotechnol (NY) 2005; 7:481-93. [PMID: 15976935 DOI: 10.1007/s10126-004-4110-6] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2004] [Accepted: 12/15/2004] [Indexed: 05/03/2023]
Abstract
Karenia brevis (Davis) is the dinoflagellate responsible for nearly annual red tides in the Gulf of Mexico. Although the mechanisms regulating the growth and toxicity of this problematic organism are of considerable interest, little information is available on its molecular biology. We therefore constructed a complementary DNA library from which to gain insight into its expressed genome and to develop tools for studying its gene expression. Large-scale sequencing yielded 7001 high-quality expressed sequence tags (ESTs), which clustered into 5280 unique gene groups. The vast majority of genes expressed fell into a low-abundance class, with the highest expressed gene accounting for only 1% of the total ESTs. Approximately 29% of genes were found to have similarity to known sequences in other organisms after BLAST similarity comparisons to the GenBank public protein database using a cutoff of P < 10e(-4). We identified for the first time in a dinoflagellate a suite of conserved eukaryotic genes involved in cell cycle control, intracellular signaling, and the transcription and translation machinery. At least 40% of gene clusters displayed single nucleotide polymorphisms, suggesting the presence of multiple gene copies. The average GC content of ESTs was 51%, with a slight preference for G or C in the third codon position (53.5%). The ESTs were used to develop an oligonucleotide microarray containing 4629 unique features and 3462 replicate probes. Microarray labeling has been optimized, and the microarray has been validated for probe specificity and reproducibility. This is the first information to be developed on the expressed genome of K. brevis and provides the basis from which to begin functional genomic studies on this harmful algal bloom species.
Collapse
Affiliation(s)
- Kristy B Lidie
- Marine Biotoxins Program, NOAA Center for Coastal Environmental and Biomolecular Research, SC 29412, USA
| | | | | | | |
Collapse
|
15
|
Horn R, Lecouls AC, Callahan A, Dandekar A, Garay L, McCord P, Howad W, Chan H, Verde I, Main D, Jung S, Georgi L, Forrest S, Mook J, Zhebentyayeva T, Yu Y, Kim HR, Jesudurai C, Sosinski B, Arús P, Baird V, Parfitt D, Reighard G, Scorza R, Tomkins J, Wing R, Abbott AG. Candidate gene database and transcript map for peach, a model species for fruit trees. Theor Appl Genet 2005; 110:1419-28. [PMID: 15846479 DOI: 10.1007/s00122-005-1968-x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2004] [Accepted: 11/15/2004] [Indexed: 05/03/2023]
Abstract
Peach (Prunus persica) is a model species for the Rosaceae, which includes a number of economically important fruit tree species. To develop an extensive Prunus expressed sequence tag (EST) database for identifying and cloning the genes important to fruit and tree development, we generated 9,984 high-quality ESTs from a peach cDNA library of developing fruit mesocarp. After assembly and annotation, a putative peach unigene set consisting of 3,842 ESTs was defined. Gene ontology (GO) classification was assigned based on the annotation of the single "best hit" match against the Swiss-Prot database. No significant homology could be found in the GenBank nr databases for 24.3% of the sequences. Using core markers from the general Prunus genetic map, we anchored bacterial artificial chromosome (BAC) clones on the genetic map, thereby providing a framework for the construction of a physical and transcript map. A transcript map was developed by hybridizing 1,236 ESTs from the putative peach unigene set and an additional 68 peach cDNA clones against the peach BAC library. Hybridizing ESTs to genetically anchored BACs immediately localized 11.2% of the ESTs on the genetic map. ESTs showed a clustering of expressed genes in defined regions of the linkage groups. [The data were built into a regularly updated Genome Database for Rosaceae (GDR), available at (http://www.genome.clemson.edu/gdr/).].
Collapse
Affiliation(s)
- Renate Horn
- Department of Genetics, Biochemistry and Life Science Studies, Clemson University, Clemson, SC 29634, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Boisson M, Mondon K, Torney V, Nicot N, Laine AL, Bahrman N, Gouy A, Daniel-Vedele F, Hirel B, Sourdille P, Dardevet M, Ravel C, Le Gouis J. Partial sequences of nitrogen metabolism genes in hexaploid wheat. Theor Appl Genet 2005; 110:932-40. [PMID: 15714330 DOI: 10.1007/s00122-004-1913-4] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/02/2004] [Accepted: 12/15/2004] [Indexed: 05/21/2023]
Abstract
Our objective was to partially sequence genes controlling nitrogen metabolism in wheat species in order to find sequence polymorphism that would enable their mapping. Primers were designed for nitrate reductase, nitrite reductase, glutamate dehydrogenase and glutamate synthase (GOGAT), and gene fragments were amplified on Triticum aestivum, T. durum, T. monococcum, T. speltoides and T. tauschii. We obtained more than 8 kb of gene sequences, mainly as coding regions (60%). Polymorphism was quantified by comparing two-by-two the three genomes of the hexaploid cultivar Arche and genomes of diploid wheat species. On average, the polymorphism rate was higher for non-coding regions, where it ranged from 1/60 to 1/23, than for coding regions (range: 1/110-1/40) except when the hexaploid D genome was compared to that of T. tauschii (1/800 and 1/816, respectively). Genome-specific primers were devised for the ferredoxin-dependent (Fd)-GOGAT gene, and they enabled the mapping of this gene on homoeologous chromosomes of group 2 using Chinese Spring deletion lines. A single nucleotide polymorphism (SNP) detected between the two hexaploid wheat cultivars Arche and Recital was used to genetically map Fd-GOGAT on chromosome 2D using a population of dihaploid lines. Fd-GOGAT-specific primers were used to estimate the SNP rate on a set of 11 hexaploid and nine Durum wheat genotypes leading to the estimate of 1 SNP/515 bp. We demonstrate that polymorphism detection enables heterologous, homeologous and even paralogous copies to be assigned, even if the elaboration of specific primer pairs is time-consuming and expensive because of the sequencing.
Collapse
Affiliation(s)
- M Boisson
- INRA URGAP, Domaine de Brunehaut, Péronne, BP 136, 80200, Estrées-Mons, France
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Kasukabe Y, He L, Nada K, Misawa S, Ihara I, Tachibana S. Overexpression of Spermidine Synthase Enhances Tolerance to Multiple Environmental Stresses and Up-Regulates the Expression of Various Stress-Regulated Genes in Transgenic Arabidopsis thaliana. ACTA ACUST UNITED AC 2004; 45:712-22. [PMID: 15215506 DOI: 10.1093/pcp/pch083] [Citation(s) in RCA: 271] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
Polyamines play pivotal roles in plant defense to environmental stresses. However, stress tolerance of genetically engineered plants for polyamine biosynthesis has been little examined so far. We cloned spermidine synthase cDNA from Cucurbita ficifolia and the gene was introduced to Arabidopsis thaliana under the control of the cauliflower mosaic virus 35S promoter. The transgene was stably integrated and actively transcribed in the transgenic plants. As compared with the wild-type plants, the T2 and T3 transgenic plants exhibited a significant increase in spermidine synthase activity and spermidine content in leaves together with enhanced tolerance to various stresses including chilling, freezing, salinity, hyperosmosis, drought, and paraquat toxicity. During exposure to chilling stress (5 degrees C), the transgenics displayed a remarkable increase in arginine decarboxylase activity and conjugated spermidine contents in leaves compared to the wild type. A cDNA microarray analysis revealed that several genes were more abundantly transcribed in the transgenics than in the wild type under chilling stress. These genes included those for stress-responsive transcription factors such as DREB and stress-protective proteins like rd29A. These results strongly suggest an important role for spermidine as a signaling regulator in stress signaling pathways, leading to build-up of stress tolerance mechanisms in plants under stress conditions.
Collapse
|
18
|
Gupta PK, Rustgi S. Molecular markers from the transcribed/expressed region of the genome in higher plants. Funct Integr Genomics 2004; 4:139-62. [PMID: 15095058 DOI: 10.1007/s10142-004-0107-0] [Citation(s) in RCA: 128] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2003] [Revised: 12/16/2003] [Accepted: 12/19/2003] [Indexed: 10/26/2022]
Abstract
In recent years, molecular marker technology in higher plants has witnessed a shift from the so-called random DNA markers (RDMs), developed in the past arbitrarily from genomic DNA and cDNA, to the molecular markers representing the transcriptome and the other coding sequences. These markers have been described as gene targeted markers (GTMs). Another specific class of markers includes the so-called functional markers (FMs), which are supposed to have a cause and effect relationship with the traits of interest. In this review, we first describe the development of these markers representing the transcriptome or genes per se; we then discuss the uses of these markers in some detail and finally add a note on the future directions of research and the implications of the wider application of these markers in crop improvement programmes. Using suitable examples, we describe markers of different classes derived from cDNA clones, expressed sequence tags (ESTs), gene sequences and the unique (coding) sequences obtained through methyl filtration or genome normalization (high C(0) t fraction) from gDNA libraries. While we briefly describe RFLPs, SSRs, AFLPs and SNPs developed from the transcriptome (cDNA clones and EST databases), we have discussed in more detail some of the novel markers developed from the transcriptome and specific genes. These novel markers include expressed sequence tag polymorphisms (ESTPs), conserved orthologue set (COS) markers, amplified consensus genetic markers (ACGMs), gene specific tags (GSTs), resistance gene analogues (RGAs) and exon-retrotransposon amplification polymorphism (ERAP). Uses of these markers have been discussed in some detail under the following headings: development of transcript and functional maps, estimations of genetic diversity, marker-assisted selection (MAS), candidate-gene (CG) approach and map-based cloning, genetical genomics and identification of eQTLs, study of genome organization and taxonomic and phylogenetic studies. At the end, we also append a list of websites relevant to further studies on the transcriptome. For want of space, considerable information including voluminous data in the form of 12 tables, and a long list of references cited in these tables, has been placed on the Internet as electronic supplementary material (ESM), which the readers may find useful.
Collapse
Affiliation(s)
- P K Gupta
- Molecular Biology Laboratory, Department of Genetics and Plant Breeding, Ch. Charan Singh University, 250 004, Meerut, India.
| | | |
Collapse
|