1
|
Šarhanová P, Pfanzelt S, Brandt R, Himmelbach A, Blattner FR. SSR-seq: Genotyping of microsatellites using next-generation sequencing reveals higher level of polymorphism as compared to traditional fragment size scoring. Ecol Evol 2018; 8:10817-10833. [PMID: 30519409 PMCID: PMC6262739 DOI: 10.1002/ece3.4533] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2018] [Revised: 08/28/2018] [Accepted: 08/30/2018] [Indexed: 01/13/2023] Open
Abstract
Microsatellites (or simple sequence repeats, SSR) are widely used markers in population genetics. Traditionally, genotyping was and still is carried out through recording fragment length. Now, next-generation sequencing (NGS) makes it easy to obtain also sequence information for the loci of interest. This avoids misinterpretations that otherwise could arise due to size homoplasy. Here, an NGS strategy is described that allows to genotype hundreds of individuals at many custom-designed SSR loci simultaneously, combining multiplex PCR, barcoding, and Illumina sequencing. We created three different datasets for which alleles were coded according to (a) length of the repetitive region, (b) total fragment length, and (c) sequence identity, in order to evaluate the eventual benefits from having sequence data at hand, not only fragment length data. For each dataset, genetic diversity statistics, as well as F ST and R ST values, were calculated. The number of alleles per locus, as well as observed and expected heterozygosity, was highest in the sequence identity dataset, because of single-nucleotide polymorphisms and insertions/deletions in the flanking regions of the SSR motif. Size homoplasy was found to be very common, amounting to 44.7%-63.5% (mean over all loci) in the three study species. Thus, the information obtained by next-generation sequencing offers a better resolution than the traditional way of SSR genotyping and allows for more accurate evolutionary interpretations.
Collapse
Affiliation(s)
- Petra Šarhanová
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK)GaterslebenGermany
- Present address:
Department of Botany and Biodiversity ResearchUniversity of ViennaViennaAustria
| | - Simon Pfanzelt
- Institute of Biology and Environmental SciencesCarl von Ossietzky University OldenburgOldenburgGermany
- Present address:
Leibniz Institute of Plant Genetics and Crop Plant Research (IPK)GaterslebenGermany
| | - Ronny Brandt
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK)GaterslebenGermany
- Present address:
Max Planck Genome Centre CologneCologneGermany
| | - Axel Himmelbach
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK)GaterslebenGermany
| | - Frank R. Blattner
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK)GaterslebenGermany
| |
Collapse
|
2
|
Cornejo C, Chabanenko S, Scheidegger C. Are species-pairs diverging lineages? A nine-locus analysis uncovers speciation among species-pairs of the Lobaria meridionalis-group (Ascomycota). Mol Phylogenet Evol 2018; 129:48-59. [PMID: 30036698 DOI: 10.1016/j.ympev.2018.07.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Revised: 03/13/2018] [Accepted: 07/14/2018] [Indexed: 01/17/2023]
Abstract
In spite of considerable effort to verify the theory of species-pairs, uncertainty still exists about the relationship between sexually or vegetatively reproducing populations of morphologically indistinguishable, sympatric lichen species. The current paper studies putative species-pairs within the Asian Lobaria meridionalis-group, using a nine-locus and time calibrated species-tree approach. Analyses demonstrate that pairs of sexually or vegetatively reproducing lineages split into highly supported monophyletic clades-confirming molecularly the species-pair concept for the L. meridionalis-group. In the broader context of evolution and speciation dynamics in lichenized fungi, this paper attempts to synthesize molecular findings from the last two decades to promote a more modern perception of the species-pair concept. Taxonomically, eight species were found to currently conform to the L. meridionalis-group, which differentiated during the Pliocene and Pleistocene. The coincidence of paleoclimatic events with estimated dates of divergence support a bioclimatic hypothesis for the evolution of species in the L. meridionalis-group, which also explains their current eco-geographic distribution patterns. Greater recognition for species with a long and independent evolutionary history, which merit high conservation priority, will be especially critical for preserving geographically restricted endemics from Southeast Asia, where habitat loss is driving rapid declines.
Collapse
Affiliation(s)
- Carolina Cornejo
- Swiss Federal Research Institute WSL, Zuercherstrasse 111, 8903 Birmensdorf, Switzerland.
| | - Svetlana Chabanenko
- Sakhalin Branch of the Botanical Garden-Institute FEB RAS, Gorky-street 25, 693023 Yuzhno-Sakhalinsk, Russia.
| | - Christoph Scheidegger
- Swiss Federal Research Institute WSL, Zuercherstrasse 111, 8903 Birmensdorf, Switzerland.
| |
Collapse
|
3
|
Šarhanová P, Sharbel TF, Sochor M, Vašut RJ, Dančák M, Trávníček B. Hybridization drives evolution of apomicts in Rubus subgenus Rubus: evidence from microsatellite markers. ANNALS OF BOTANY 2017; 120:317-328. [PMID: 28402390 PMCID: PMC5737493 DOI: 10.1093/aob/mcx033] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2016] [Accepted: 03/02/2017] [Indexed: 05/18/2023]
Abstract
BACKGROUND AND AIMS Rubus subgenus Rubus is a group of mostly apomictic and polyploid species with a complicated taxonomy and history of ongoing hybridization. The only polyploid series with prevailing sexuality is the series Glandulosi , although the apomictic series Discolores and Radula also retain a high degree of sexuality, which is influenced by environmental conditions and/or pollen donors. The aim of this study is to detect sources of genetic variability, determine the origin of apomictic taxa and validate microsatellite markers by cloning and sequencing. METHODS A total of 206 individuals from two central European regions were genotyped for 11 nuclear microsatellite loci and the chloroplast trn L- trn F region. Microsatellite alleles were further sequenced in order to determine the exact repeat number and to detect size homoplasy due to insertions/deletions in flanking regions. KEY RESULTS The results confirm that apomictic microspecies of ser. Radula are derived from crosses between sexual series Glandulosi and apomictic series Discolores , whereby the apomict acts as pollen donor. Each apomictic microspecies is derived from a single distinct genotype differing from the parental taxa, suggesting stabilized clonal reproduction. Intraspecific variation within apomicts is considerably low compared with sexual series Glandulosi , and reflects somatic mutation accumulation. While facultative apomicts produce clonal offspring, sexual species are the conduits of origin for new genetically different apomictic lineages. CONCLUSIONS One of the main driving forces of evolution and speciation in the highly apomictic subgenus Rubus in central Europe is sexuality in the series Glandulosi . Palaeovegetation data suggest that initial hybridizations took place over different time periods in the two studied regions, and that the successful origin and spread of apomictic microspecies of the series Radula took place over several millennia. Additionally, the cloning and sequencing show that standard evaluations of microsatellite repeat numbers underestimate genetic variability considering homoplasy in allele size.
Collapse
Affiliation(s)
- Petra Šarhanová
- Leibniz Institute of Plant Genetics and Crop Research (IPK), Department of Experimental Taxonomy, Corrensstr. 3, D-06466 Gatersleben, Germany
- For correspondence. E-mail
| | - Timothy F. Sharbel
- Leibniz Institute of Plant Genetics and Crop Research (IPK), Department of Experimental Taxonomy, Corrensstr. 3, D-06466 Gatersleben, Germany
- Present address: Global Institute for Food Security, University of Saskatchewan, 110 Gymnasium Place, suite 1460 Saskatoon, SK S7N 4J8, Canada
| | - Michal Sochor
- Department of Botany, Faculty of Science, Palacký University in Olomouc, Šlechtitelů 27, CZ-78371 Olomouc, Czech Republic
- Centre of the Region Haná for Biotechnological and Agricultural Research, Department of Genetic Resources for Vegetables, Medicinal and Special Plants, Crop Research Institute, Šlechtitelů 29, CZ-78371 Olomouc, Czech Republic
| | - Radim J. Vašut
- Department of Botany, Faculty of Science, Palacký University in Olomouc, Šlechtitelů 27, CZ-78371 Olomouc, Czech Republic
| | - Martin Dančák
- Department of Ecology and Environmental Sciences, Faculty of Science, Palacký University in Olomouc, Šlechtitelů 27, CZ-78371 Olomouc, Czech Republic
| | - Bohumil Trávníček
- Department of Botany, Faculty of Science, Palacký University in Olomouc, Šlechtitelů 27, CZ-78371 Olomouc, Czech Republic
| |
Collapse
|
4
|
Germain-Aubrey CC, Nelson C, Soltis DE, Soltis PS, Gitzendanner MA. Are microsatellite fragment lengths useful for population-level studies? The case of Polygala lewtonii (Polygalaceae). APPLICATIONS IN PLANT SCIENCES 2016; 4:apps1500115. [PMID: 26949579 PMCID: PMC4760751 DOI: 10.3732/apps.1500115] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/13/2015] [Accepted: 01/05/2016] [Indexed: 05/10/2023]
Abstract
PREMISE OF THE STUDY Microsatellites, despite being commonly used population-level markers, contain biases because scoring relies solely on fragment length. Their complexity can lead to homoplasy, the effects of which are poorly understood. We investigated the impact of using fragment lengths, repeats, or flanking region sequences on common population-level analyses. METHODS Five polymorphic microsatellite markers amplified across the central Florida scrub endemic Polygala lewtonii (Polygalaceae) and its close, widespread congener P. polygama. We genotyped 147 individuals of P. lewtonii and 156 of P. polygama, and sequenced the amplicons of four markers across all individuals. We ran basic statistics, spatial clustering analysis, historical demographics, and migration tests. RESULTS One population of intermediate morphology was genetically clearly identified as P. polygama, making it the southernmost population of this species. Statistics were comparable between the fragment length and repeat numbers, with some notable differences. Flanking regions exhibited surprisingly high polymorphism between species, and between geographically distant conspecific populations. DISCUSSION The increasing use of markers developed in one species and amplified in another is only a good practice if precautions are taken, notably the sequencing of the fragments between species and populations. Flanking region sequences are a useful marker at the interspecific level.
Collapse
Affiliation(s)
- Charlotte C. Germain-Aubrey
- Department of Biology, University of Florida, Gainesville, Florida 32611 USA
- Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA
- Author for correspondence:
| | - Cory Nelson
- College of Medicine, University of Florida, Gainesville, Florida 32603 USA
| | - Douglas E. Soltis
- Department of Biology, University of Florida, Gainesville, Florida 32611 USA
- Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA
- UF Genetics Institute, University of Florida, Gainesville, Florida 32608 USA
| | - Pamela S. Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA
- UF Genetics Institute, University of Florida, Gainesville, Florida 32608 USA
| | - Matthew A. Gitzendanner
- Department of Biology, University of Florida, Gainesville, Florida 32611 USA
- Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA
- UF Genetics Institute, University of Florida, Gainesville, Florida 32608 USA
| |
Collapse
|
5
|
Ruperao P, Edwards D. Bioinformatics: identification of markers from next-generation sequence data. Methods Mol Biol 2015; 1245:29-47. [PMID: 25373747 DOI: 10.1007/978-1-4939-1966-6_3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
With the advent of sequencing technology, next-generation sequencing (NGS) technology has dramatically revolutionized plant genomics. NGS technology combined with new software tools enables the discovery, validation, and assessment of genetic markers on a large scale. Among different markers systems, simple sequence repeats (SSRs) and Single nucleotide polymorphisms (SNPs) are the markers of choice for genetics and plant breeding. SSR markers have been a choice for large-scale characterization of germplasm collections, construction of genetic maps, and QTL identification. Similarly, SNPs are the most abundant genetic variations with higher frequencies throughout the genome of plant species. This chapter discusses various tools available for genome assembly and widely focuses on SSR and SNP marker discovery.
Collapse
Affiliation(s)
- Pradeep Ruperao
- School of Agriculture and Food Sciences, University of Queensland, Brisbane, QLD, Australia
| | | |
Collapse
|
6
|
Barthe S, Gugerli F, Barkley NA, Maggia L, Cardi C, Scotti I. Always look on both sides: phylogenetic information conveyed by simple sequence repeat allele sequences. PLoS One 2012; 7:e40699. [PMID: 22808236 PMCID: PMC3396589 DOI: 10.1371/journal.pone.0040699] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2012] [Accepted: 06/12/2012] [Indexed: 11/18/2022] Open
Abstract
Simple sequence repeat (SSR) markers are widely used tools for inferences about genetic diversity, phylogeography and spatial genetic structure. Their applications assume that variation among alleles is essentially caused by an expansion or contraction of the number of repeats and that, accessorily, mutations in the target sequences follow the stepwise mutation model (SMM). Generally speaking, PCR amplicon sizes are used as direct indicators of the number of SSR repeats composing an allele with the data analysis either ignoring the extent of allele size differences or assuming that there is a direct correlation between differences in amplicon size and evolutionary distance. However, without precisely knowing the kind and distribution of polymorphism within an allele (SSR and the associated flanking region (FR) sequences), it is hard to say what kind of evolutionary message is conveyed by such a synthetic descriptor of polymorphism as DNA amplicon size. In this study, we sequenced several SSR alleles in multiple populations of three divergent tree genera and disentangled the types of polymorphisms contained in each portion of the DNA amplicon containing an SSR. The patterns of diversity provided by amplicon size variation, SSR variation itself, insertions/deletions (indels), and single nucleotide polymorphisms (SNPs) observed in the FRs were compared. Amplicon size variation largely reflected SSR repeat number. The amount of variation was as large in FRs as in the SSR itself. The former contributed significantly to the phylogenetic information and sometimes was the main source of differentiation among individuals and populations contained by FR and SSR regions of SSR markers. The presence of mutations occurring at different rates within a marker’s sequence offers the opportunity to analyse evolutionary events occurring on various timescales, but at the same time calls for caution in the interpretation of SSR marker data when the distribution of within-locus polymorphism is not known.
Collapse
Affiliation(s)
- Stéphanie Barthe
- Unité Mixte de Recherche “Ecologie des forêts de Guyane”, University of French West Indies and French Guiana, Kourou, French Guiana
| | - Felix Gugerli
- Biodiversity and Conservation Biology Research Unit, Swiss Federal Research Institute for Forest, Snow and Landscape Research, Birmensdorf, Switzerland
| | - Noelle A. Barkley
- Plant Genetic Resources Conservation Unit, United States Department of Agriculture – Agricultural Research Service, Griffin, Georgia, United States of America
| | - Laurent Maggia
- Unité Mixte de Recherche “Amélioration génétique et adaptation des plantes méditerranéennes et tropicales”, Institut Agronomique néo-Calédonien, Nouméa, New Caledonia
| | - Céline Cardi
- Unité Mixte de Recherche “Amélioration génétique et adaptation des plantes méditerranéennes et tropicales”, Centre de coopération internationale en recherche agronomique pour le développement, Montpellier, France
| | - Ivan Scotti
- Unité Mixte de Recherche Ecologie des forêts de Guyane, Institut National de la Recherche Agronomique, Kourou, French Guiana
- * E-mail:
| |
Collapse
|
7
|
Hayward A, Mason AS, Dalton-Morgan J, Zander M, Edwards D, Batley J. SNP discovery and applications in Brassica napus. ACTA ACUST UNITED AC 2012. [DOI: 10.5010/jpb.2012.39.1.049] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
8
|
Cieslarová J, Hanáček P, Fialová E, Hýbl M, Smýkal P. Estimation of pea (Pisum sativum L.) microsatellite mutation rate based on pedigree and single-seed descent analyses. J Appl Genet 2011; 52:391-401. [PMID: 21769669 DOI: 10.1007/s13353-011-0058-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2011] [Revised: 06/21/2011] [Accepted: 06/22/2011] [Indexed: 02/07/2023]
Abstract
Microsatellites, or simple sequence repeats (SSRs) are widespread class of repetitive DNA sequences, used in population genetics, genetic diversity and mapping studies. In spite of the SSR utility, the genetic and evolutionary mechanisms are not fully understood. We have investigated three microsatellite loci with different position in the pea (Pisum sativum L.) genome, the A9 locus residing in LTR region of abundant retrotransposon, AD270 as intergenic and AF016458 located in 5'untranslated region of expressed gene. Comparative analysis of a 35 pair samples from seven pea varieties propagated by single-seed descent for ten generations, revealed single 4 bp mutation in 10th generation sample at AD270 locus corresponding to stepwise increase in one additional ATCT repeat unit. The estimated mutation rate was 4.76 × 10(-3) per locus per generation, with a 95% confidence interval of 1.2 × 10(-4) to 2.7 × 10(-2). The comparison of cv. Bohatýr accessions retrieved from different collections, showed intra-, inter-accession variation and differences in flanking and repeat sequences. Fragment size and sequence alternations were also found in long term in vitro organogenic culture, established at 1983, indicative of somatic mutation process. The evidence of homoplasy was detected across of unrelated pea genotypes, which adversaly affects the reliability of diversity estimates not only for diverse germplasm but also highly bred material. The findings of this study have important implications for Pisum phylogeny studies, variety identification and registration process in pea breeding where mutation rate influences the genetic diversity and the effective population size estimates.
Collapse
Affiliation(s)
- Jaroslava Cieslarová
- Department of Plant Biology, Mendel University, Zemědělská, Brno, Czech Republic
| | | | | | | | | |
Collapse
|
9
|
Cowling W, Balázs E. Prospects and challenges for genome-wide association and genomic selection in oilseed Brassica speciesThis article is one of a selection of papers from the conference “Exploiting Genome-wide Association in Oilseed Brassicas: a model for genetic improvement of major OECD crops for sustainable farming”. Genome 2010; 53:1024-8. [DOI: 10.1139/g10-087] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- W.A. Cowling
- Department of Applied Genomics, H-2462 Martonvásár, Agricultural Research Institute, Brunszvik. u. 2, Hungary (previously Theme Coordinator Research Theme 3: The Food Chain [Plants], OECD Co-operative Research Programme: Biological Resource Management for Sustainable Agricultural Systems)
- Deputy Director, International Centre for Plant Breeding Education and Research, The UWA Institute of Agriculture, c/o School of Plant Biology M084, The University of Western Australia, 35 Stirling Highway, Crawley, Western Australia, Australia 6009
| | - E. Balázs
- Department of Applied Genomics, H-2462 Martonvásár, Agricultural Research Institute, Brunszvik. u. 2, Hungary (previously Theme Coordinator Research Theme 3: The Food Chain [Plants], OECD Co-operative Research Programme: Biological Resource Management for Sustainable Agricultural Systems)
- Deputy Director, International Centre for Plant Breeding Education and Research, The UWA Institute of Agriculture, c/o School of Plant Biology M084, The University of Western Australia, 35 Stirling Highway, Crawley, Western Australia, Australia 6009
| |
Collapse
|
10
|
Duran C, Appleby N, Vardy M, Imelfort M, Edwards D, Batley J. Single nucleotide polymorphism discovery in barley using autoSNPdb. PLANT BIOTECHNOLOGY JOURNAL 2009; 7:326-33. [PMID: 19386041 DOI: 10.1111/j.1467-7652.2009.00407.x] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Molecular markers are used to provide the link between genotype and phenotype, for the production of molecular genetic maps and to assess genetic diversity within and between related species. Single nucleotide polymorphisms (SNPs) are the most abundant molecular genetic marker. SNPs can be identified in silico, but care must be taken to ensure that the identified SNPs reflect true genetic variation and are not a result of errors associated with DNA sequencing. The SNP detection method autoSNP has been developed to identify SNPs from sequence data for any species. Confidence in the predicted SNPs is based on sequence redundancy, and haplotype co-segregation scores are calculated for a further independent measure of confidence. We have extended the autoSNP method to produce autoSNPdb, which integrates SNP and gene annotation information with a graphical viewer. We have applied this software to public barley expressed sequences, and the resulting database is available over the Internet. SNPs can be viewed and searched by sequence, functional annotation or predicted synteny with a reference genome, in this case rice. The correlation between SNPs and barley cultivar, expressed tissue type and development stage has been collated for ease of exploration. An average of one SNP per 240 bp was identified, with SNPs more prevalent in the 5' regions and simple sequence repeat (SSR) flanking sequences. Overall, autoSNPdb can provide a wealth of genetic polymorphism information for any species for which sequence data are available.
Collapse
Affiliation(s)
- Chris Duran
- Australian Centre for Plant Functional Genomics, School of Land, Crop and Food Sciences, Institute for Molecular Bioscience, University of Queensland, Brisbane, Qld 4072, Australia
| | | | | | | | | | | |
Collapse
|
11
|
ZENG XH, WEI YM, JIANG QT, QI PF, ZHENG YL. SNP Analysis and Haplotype Identification in Chymotrypsin Inhibitor-2 (CI-2) Gene of Barley. ACTA ACUST UNITED AC 2009. [DOI: 10.1016/s1671-2927(09)60003-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
12
|
Wu J, Zhang L, Luo H, Zhu Z, Zhang C, Hou Y. Association of matrix metalloproteinases-9 gene polymorphisms with genetic susceptibility to esophageal squamous cell carcinoma. DNA Cell Biol 2008; 27:553-7. [PMID: 18680431 DOI: 10.1089/dna.2008.0732] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
Matrix metalloproteinases-9 (MMP-9) plays important roles in tumor invasion and metastasis by degrading extracellular matrix components. Variations in the DNA sequence in the MMP-9 gene may lead to altered MMP-9 production and/or activity, and so this may modulate an individual's susceptibility to esophageal squamous cell carcinoma (ESCC). To test this hypothesis, we investigated the association of the MMP-9 polymorphisms and their haplotypes with the risk of ESCC in a Chinese population. There were significant differences in the genotype and allele distribution of P574R polymorphism of the MMP-9 gene among cases and controls. The P574R GG genotypes were associated with a significantly increased risk of ESCC as compared with the CC genotypes (odds ratio [OR] = 4.08; 95% confidence interval [CI]: 1.58-10.52; p = 0.00). Compared with 279R-574P haplotype, 279R-574R (OR = 3.52; 95% CI: 1.99-6.25) and 279Q-574P (OR = 2.16; 95% CI: 1.07-4.35) haplotypes can increase the onset risk of ESCC statistically, but the role of 279R-574R haplotype is more obvious. MMP-9 P574R polymorphisms and P574R-R279Q haplotype are significantly associated with the risk of ESCC. Our study shows for the first time that MMP-9 gene P574R polymorphism may contribute to a genetic risk factor for ESCC in a Chinese population.
Collapse
Affiliation(s)
- Jin Wu
- Department of Forensic Biology, West China School of Preclinical and Forensic Medicine, Sichuan University, Chengdu, China.
| | | | | | | | | | | |
Collapse
|
13
|
Tang J, Baldwin SJ, Jacobs JM, Linden CGVD, Voorrips RE, Leunissen JA, van Eck H, Vosman B. Large-scale identification of polymorphic microsatellites using an in silico approach. BMC Bioinformatics 2008; 9:374. [PMID: 18793407 PMCID: PMC2562394 DOI: 10.1186/1471-2105-9-374] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2008] [Accepted: 09/15/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Simple Sequence Repeat (SSR) or microsatellite markers are valuable for genetic research. Experimental methods to develop SSR markers are laborious, time consuming and expensive. In silico approaches have become a practicable and relatively inexpensive alternative during the last decade, although testing putative SSR markers still is time consuming and expensive. In many species only a relatively small percentage of SSR markers turn out to be polymorphic. This is particularly true for markers derived from expressed sequence tags (ESTs). In EST databases a large redundancy of sequences is present, which may contain information on length-polymorphisms in the SSR they contain, and whether they have been derived from heterozygotes or from different genotypes. Up to now, although a number of programs have been developed to identify SSRs in EST sequences, no software can detect putatively polymorphic SSRs. RESULTS We have developed PolySSR, a new pipeline to identify polymorphic SSRs rather than just SSRs. Sequence information is obtained from public EST databases derived from heterozygous individuals and/or at least two different genotypes. The pipeline includes PCR-primer design for the putatively polymorphic SSR markers, taking into account Single Nucleotide Polymorphisms (SNPs) in the flanking regions, thereby improving the success rate of the potential markers. A large number of polymorphic SSRs were identified using publicly available EST sequences of potato, tomato, rice, Arabidopsis, Brassica and chicken.The SSRs obtained were divided into long and short based on the number of times the motif was repeated. Surprisingly, the frequency of polymorphic SSRs was much higher in the short SSRs. CONCLUSION PolySSR is a very effective tool to identify polymorphic SSRs. Using PolySSR, several hundred putative markers were developed and stored in a searchable database. Validation experiments showed that almost all markers that were indicated as putatively polymorphic by polySSR were indeed polymorphic. This greatly improves the efficiency of marker development, especially in species where there are low levels of polymorphism, like tomato. When combined with the new sequencing technologies PolySSR will have a big impact on the development of polymorphic SSRs in any species.PolySSR and the polymorphic SSR marker database are available from http://www.bioinformatics.nl/tools/polyssr/.
Collapse
Affiliation(s)
- Jifeng Tang
- Laboratory of Bioinformatics, Wageningen University, PO Box 8128, 6700 ET Wageningen, the Netherlands.
| | | | | | | | | | | | | | | |
Collapse
|
14
|
Lia VV, Bracco M, Gottlieb AM, Poggio L, Confalonieri VA. Complex mutational patterns and size homoplasy at maize microsatellite loci. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2007; 115:981-91. [PMID: 17712542 DOI: 10.1007/s00122-007-0625-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/25/2006] [Accepted: 07/31/2007] [Indexed: 05/16/2023]
Abstract
Microsatellite markers have become one of the most popular tools for germplasm characterization, population genetics and evolutionary studies. To investigate the mutational mechanisms of maize microsatellites, nucleotide sequence information was obtained for ten loci. In addition, Single-Strand Conformation Polymorphism (SSCP) analysis was conducted to assess the occurrence of size homoplasy. Sequence analysis of 54 alleles revealed a complex pattern of mutation at 8/10 loci, with only 2 loci showing allele variation strictly consistent with stepwise mutations. The overall allelic diversity resulted from changes in the number of repeat units, base substitutions, and indels within repetitive and non-repetitive segments. Thirty-one electromorphs sampled from six maize landraces were considered for SSCP analysis. The number of conformers per electromorph ranged from 1 to 7, with 74.2% of the electromorphs showing more than one conformer. Size homoplasy was apparent within landraces and populations. Variation in the amount of size homoplasy was observed within and between loci, although no differences were detected among populations. The results of the present study provide useful information on the interpretation of genetic data derived from microsatellite markers. Further efforts are still needed to determine the impact of these findings on the estimation of population parameters and on the inference of phylogenetic relationships in maize investigations.
Collapse
Affiliation(s)
- V V Lia
- Laboratorio de Genética, Departamento de Ecología, Genética y Evolución, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina.
| | | | | | | | | |
Collapse
|
15
|
|
16
|
Faville MJ, Vecchies AC, Schreiber M, Drayton MC, Hughes LJ, Jones ES, Guthridge KM, Smith KF, Sawbridge T, Spangenberg GC, Bryan GT, Forster JW. Functionally associated molecular genetic marker map construction in perennial ryegrass (Lolium perenne L.). TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2004; 110:12-32. [PMID: 15526086 DOI: 10.1007/s00122-004-1785-7] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2004] [Accepted: 07/30/2004] [Indexed: 05/10/2023]
Abstract
A molecular marker-based map of perennial ryegrass (Lolium perenne L.) has been constructed through the use of polymorphisms associated with expressed sequence tags (ESTs). A pair-cross between genotypes from a North African ecotype and the cultivar Aurora was used to generate a two-way pseudo-testcross population. A selection of 157 cDNAs assigned to eight different functional categories associated with agronomically important biological processes was used to detect polymorphic EST-RFLP loci in the F(1)(NA(6) x AU(6)) population. A comprehensive set of EST-SSR markers was developed from the analysis of 14,767 unigenes, with 310 primer pairs showing efficient amplification and detecting 113 polymorphic loci. Two parental genetic maps were produced: the NA(6) genetic map contains 88 EST-RFLP and 71 EST-SSR loci with a total map length of 963 cM, while the AU(6) genetic map contains 67 EST-RFLP and 58 EST-SSR loci with a total map length of 757 cM. Bridging loci permitted the alignment of homologous chromosomes between the parental maps, and a sub-set of genomic DNA-derived SSRs was used to relate linkage groups to the perennial ryegrass reference map. Regions of segregation distortion were identified, in some instances in common with other perennial ryegrass maps. The EST-derived marker-based map provides the basis for in silico comparative genetic mapping, as well as the evaluation of co-location between QTLs and functionally associated genetic loci.
Collapse
Affiliation(s)
- M J Faville
- Grasslands Research Centre, AgResearch Ltd., Private Bag 11008, Palmerston North, New Zealand
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Guillet-Claude C, Birolleau-Touchard C, Manicacci D, Fourmann M, Barraud S, Carret V, Martinant JP, Barrière Y. Genetic diversity associated with variation in silage corn digestibility for three O-methyltransferase genes involved in lignin biosynthesis. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2004; 110:126-35. [PMID: 15536523 DOI: 10.1007/s00122-004-1808-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/26/2004] [Accepted: 08/24/2004] [Indexed: 05/12/2023]
Abstract
Polymorphisms within three candidate genes for lignin biosynthesis were investigated to identify alleles useful for the improvement of maize digestibility. The allelic diversity of two caffeoyl-CoA 3-O-methyltransferase genes, CCoAOMT2 and CCoAOMT1, as well as that of the aldehyde O-methyltransferase gene, AldOMT, was evaluated for 34 maize lines chosen for their varying degrees of cell wall digestibility. Frequency of nucleotide changes averaged one SNP every 35 bp. Ninety-one indels were identified in non-coding regions and only four in coding regions. Numerous distinct and highly diverse haplotypes were identified at each locus. Numerous sites were in linkage disequilibrium that declined rapidly within a few hundred bases. For F4, an early flint French line with high cell wall digestibility, the CCoAOMT2 first exon presented many non-synonymous polymorphisms. Notably we found an 18-bp indel, which resembled a microsatellite and was associated with cell wall digestibility variation. Additionally, the CCoAOMT2 gene co-localized with a QTL for cell wall digestibility and lignin content. Together, these results suggest that genetic diversity investigated on a broader genetic basis could contribute to the identification of favourable alleles to be used in the molecular breeding of elite maize germplasm.
Collapse
Affiliation(s)
- C Guillet-Claude
- Unité de Génétique et d'Amélioration des Plantes Fourragères, INRA, Route de Saintes, 86600 Lusignan, France.
| | | | | | | | | | | | | | | |
Collapse
|
18
|
Adams RI, Brown KM, Hamilton MB. The impact of microsatellite electromorph size homoplasy on multilocus population structure estimates in a tropical tree (Corythophora alta) and an anadromous fish (Morone saxatilis). Mol Ecol 2004; 13:2579-88. [PMID: 15315672 DOI: 10.1111/j.1365-294x.2004.02256.x] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Microsatellite allelic states are determined by electrophoretic sizing of polymerase chain reaction fragments to define electromorphs. Numerous studies have documented that identical microsatellite electromorphs are potentially heterogeneous at the DNA sequence level, a phenomenon called electromorph size homoplasy. Few studies have examined the impact of electromorph size homoplasy on estimates of population genetic parameters. We investigated the frequency of microsatellite electromorph size homoplasy for 12 loci in the tropical tree Corythophora alta and 11 loci in the anadromous fish Morone saxatilis by sequencing 14-23 homozygotes per locus sampled from multiple populations for a total of 453 sequences. Sequencing revealed no homoplasy for M. saxatilis loci. Seven C. alta loci exhibited homoplasy, including single and compound repeat motifs both with and without interruptions. Between 12.5 and 42.9% of electromorphs sampled per locus showed size homoplasy. Two methods of correction for homoplasy in C. alta generally produced little or no change in single-locus estimates of RST, except for two loci in which some additional differentiation among populations was revealed. Twelve-locus estimates of RST (including the seven loci corrected for homoplasy) were slightly greater than estimates from uncorrected data, although the 95% confidence intervals overlapped. The frequency of methodological errors such as clerical mistakes or sample mislabelling per genotype scored was estimated at 5.4 and 7.3% for C. alta and M. saxatilis, respectively. Simulations showed that the increase in RST produced by homoplasy correction was only slightly larger than variation in RST estimates expected to be caused by methodological errors.
Collapse
Affiliation(s)
- Rachel I Adams
- Georgetown University, Department of Biology, Reiss Sciences Building 406, 37th and O Streets NW, Washington, DC 20057-1229 USA
| | | | | |
Collapse
|
19
|
Bundock PC, Henry RJ. Single nucleotide polymorphism, haplotype diversity and recombination in the Isa gene of barley. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2004; 109:543-51. [PMID: 15146316 DOI: 10.1007/s00122-004-1675-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2003] [Accepted: 03/19/2004] [Indexed: 05/10/2023]
Abstract
The Isa gene from barley--an intronless gene expressed in maternal tissues of the seed--has a likely role in defence against pathogens. The protein product--bi-functional alpha-amylase/subtilisin inhibitor--inhibits the seed's own amylase in addition to the bacterial protease subtilisin and fungal xylanase. Sixteen barley genotypes were targeted to amplify and sequence the Isa gene region to detect sequence polymorphisms, since little is known about genetic diversity at this locus. A total of 80 single nucleotide polymorphisms (SNPs) and 23 indels were detected in 2,164 bp of sequence containing the Isa transcript, promoter and 3' non-transcribed region (overall one SNP per 27 bp and one indel per 94 bp), with eight sequence-based haplotypes distinguishable amongst the 16 varieties. Sequencing a polymorphic region in the promoter in an additional 27 barley genotypes increased the number of sequence-based haplotypes discovered to 11. However there is low haplotype diversity amongst the cultivated barley varieties sampled, with most varieties represented by a single haplotype. There was minor amino acid diversity in the protein, with five out of ten SNP sites in the coding region predicted to produce amino acid substitutions. SNP analysis indicated a history of recombination events--a minimum of seven based on the initial eight haplotypes from the whole sequenced region. Most of the recombination events occurred in the highly polymorphic regions, the 3' non-transcribed region and sequences flanking a microsatellite in the Isa promoter.
Collapse
Affiliation(s)
- P C Bundock
- Molecular Plant Breeding CRC, Centre for Plant Conservation Genetics, Southern Cross University, Lismore, NSW, Australia.
| | | |
Collapse
|
20
|
Gupta PK, Rustgi S. Molecular markers from the transcribed/expressed region of the genome in higher plants. Funct Integr Genomics 2004; 4:139-62. [PMID: 15095058 DOI: 10.1007/s10142-004-0107-0] [Citation(s) in RCA: 128] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2003] [Revised: 12/16/2003] [Accepted: 12/19/2003] [Indexed: 10/26/2022]
Abstract
In recent years, molecular marker technology in higher plants has witnessed a shift from the so-called random DNA markers (RDMs), developed in the past arbitrarily from genomic DNA and cDNA, to the molecular markers representing the transcriptome and the other coding sequences. These markers have been described as gene targeted markers (GTMs). Another specific class of markers includes the so-called functional markers (FMs), which are supposed to have a cause and effect relationship with the traits of interest. In this review, we first describe the development of these markers representing the transcriptome or genes per se; we then discuss the uses of these markers in some detail and finally add a note on the future directions of research and the implications of the wider application of these markers in crop improvement programmes. Using suitable examples, we describe markers of different classes derived from cDNA clones, expressed sequence tags (ESTs), gene sequences and the unique (coding) sequences obtained through methyl filtration or genome normalization (high C(0) t fraction) from gDNA libraries. While we briefly describe RFLPs, SSRs, AFLPs and SNPs developed from the transcriptome (cDNA clones and EST databases), we have discussed in more detail some of the novel markers developed from the transcriptome and specific genes. These novel markers include expressed sequence tag polymorphisms (ESTPs), conserved orthologue set (COS) markers, amplified consensus genetic markers (ACGMs), gene specific tags (GSTs), resistance gene analogues (RGAs) and exon-retrotransposon amplification polymorphism (ERAP). Uses of these markers have been discussed in some detail under the following headings: development of transcript and functional maps, estimations of genetic diversity, marker-assisted selection (MAS), candidate-gene (CG) approach and map-based cloning, genetical genomics and identification of eQTLs, study of genome organization and taxonomic and phylogenetic studies. At the end, we also append a list of websites relevant to further studies on the transcriptome. For want of space, considerable information including voluminous data in the form of 12 tables, and a long list of references cited in these tables, has been placed on the Internet as electronic supplementary material (ESM), which the readers may find useful.
Collapse
Affiliation(s)
- P K Gupta
- Molecular Biology Laboratory, Department of Genetics and Plant Breeding, Ch. Charan Singh University, 250 004, Meerut, India.
| | | |
Collapse
|
21
|
Kishore VK, Velasco P, Shintani DK, Rowe J, Rosato C, Adair N, Slabaugh MB, Knapp SJ. Conserved simple sequence repeats for the Limnanthaceae (Brassicales). TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2004; 108:450-457. [PMID: 14647896 DOI: 10.1007/s00122-003-1447-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2002] [Accepted: 08/20/2003] [Indexed: 05/24/2023]
Abstract
The Limnanthaceae (Order Brassicales) is a family of 18 taxa of Limnanthes (meadowfoam) native to California, Oregon, and British Columbia. Cultivated meadowfoam ( L. alba Benth.), a recently domesticated plant, has been the focus of research and development as an industrial oilseed for three decades. The goal of the present research was to develop several hundred simple sequence repeat (SSR) markers for genetic mapping, molecular breeding, and genomics research in wild and cultivated meadowfoam taxa. We developed 389 SSR markers for cultivated meadowfoam by isolating and sequencing 1,596 clones from L. alba genomic DNA libraries enriched for AG(n) or AC(n) repeats, identifying one or more unique SSRs in 696 clone sequences, and designing and testing primers for 624 unique SSRs. The SSR markers were screened for cross- taxa utility and polymorphisms among ten of 17 taxa in the Limnanthaceae; 373 of these markers were polymorphic and 106 amplified loci from every taxon. Cross-taxa amplification percentages ranged from 37.3% in L. douglasii ssp. rosea (145/389) to 85.6% in L. montana (333/389). The SSR markers amplified 4,160 unique bands from 14 genotypes sampled from ten taxa (10.7 unique bands per SSR marker), of which 972 were genotype-specific. Mean and maximum haplotype heterozygosities were 0.71 and 0.90, respectively, among six L. alba genotypes and 0.63 and 0.93, respectively, among 14 genotypes (ten taxa). The SSR markers supply a critical mass of high-throughput DNA markers for biological and agricultural research across the Limnanthaceae and open the way to the development of a genetic linkage map for meadowfoam ( x = 5).
Collapse
Affiliation(s)
- V K Kishore
- Department of Crop and Soil Science, Oregon State University, Corvallis, OR 97331, USA
| | | | | | | | | | | | | | | |
Collapse
|
22
|
Gupta PK, Rustgi S, Sharma S, Singh R, Kumar N, Balyan HS. Transferable EST-SSR markers for the study of polymorphism and genetic diversity in bread wheat. Mol Genet Genomics 2003; 270:315-23. [PMID: 14508680 DOI: 10.1007/s00438-003-0921-4] [Citation(s) in RCA: 265] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2003] [Accepted: 08/19/2003] [Indexed: 10/26/2022]
Abstract
Nearly 900 SSRs (simple sequence repeats) were identified among 15,000 ESTs (expressed sequence tags) belonging to bread wheat ( Triticum aestivumL.). The SSRs were defined by their minimum length, which ranged from 14 to 21 bp. The maximum length ranged from 24 to 87 bp depending upon the length of the repeat unit itself (1-7 bp). The average density of SSRs was one SSR per 9.2 kb of EST sequence screened. The trinucleotide repeats were the most abundant SSRs detected. As a representative sample, 78 primer pairs were designed, which were also used to screen the dbEST entries for Hordeum vulgare and Triticum tauschii (donor of the D-genome of cultivated wheat) using a cut-off E (expectation) value of 0.01. On the basis of in silico analysis, up to 55.12% of the primer pairs exhibited transferability from Triticum to Hordeum, indicating that the sequences flanking the SSRs are not only conserved within a single genus but also between related genera in Poaceae. Primer pairs for the 78 SSRs were synthesized and used successfully for the study of (1) their transferability to 18 related wild species and five cereal species (barley, oat, rye, rice and maize); and (2) polymorphism between the parents of four mapping populations available with us. A subset of 20 EST-SSR primers was also used to assess genetic diversity in a collection of 52 elite exotic wheat genotypes. This was done with a view to compare their utility relative to other molecular markers (gSSRs, AFLPs, and SAMPL) previously used by us for the same purpose with the same set of 52 bread wheat genotypes. Although only a low level of polymorphism was detected, relative to that observed with genomic SSRs, the study suggested that EST-SSRs can be successfully used for a variety of purposes, and may actually prove superior to SSR markers extracted from genomic libraries for diversity estimation and transferability.
Collapse
Affiliation(s)
- P K Gupta
- Molecular Biology Laboratory, Department of Agricultural Botany, Ch. Charan Singh University, 250 004 (U.P.), Meerut, India.
| | | | | | | | | | | |
Collapse
|
23
|
Batley J, Barker G, O'Sullivan H, Edwards KJ, Edwards D. Mining for single nucleotide polymorphisms and insertions/deletions in maize expressed sequence tag data. PLANT PHYSIOLOGY 2003; 132:84-91. [PMID: 12746514 PMCID: PMC166954 DOI: 10.1104/pp.102.019422] [Citation(s) in RCA: 121] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2002] [Revised: 02/17/2003] [Accepted: 02/26/2003] [Indexed: 05/20/2023]
Abstract
We have developed a computer based method to identify candidate single nucleotide polymorphisms (SNPs) and small insertions/deletions from expressed sequence tag data. Using a redundancy-based approach, valid SNPs are distinguished from erroneous sequence by their representation multiple times in an alignment of sequence reads. A second measure of validity was also calculated based on the cosegregation of the SNP pattern between multiple SNP loci in an alignment. The utility of this method was demonstrated by applying it to 102,551 maize (Zea mays) expressed sequence tag sequences. A total of 14,832 candidate polymorphisms were identified with an SNP redundancy score of two or greater. Segregation of these SNPs with haplotype indicates that candidate SNPs with high redundancy and cosegregation confidence scores are likely to represent true SNPs. This was confirmed by validation of 264 candidate SNPs from 27 loci, with a range of redundancy and cosegregation scores, in four inbred maize lines. The SNP transition/transversion ratio and insertion/deletion size frequencies correspond to those observed by direct sequencing methods of SNP discovery and suggest that the majority of predicted SNPs and insertion/deletions identified using this approach represent true genetic variation in maize.
Collapse
Affiliation(s)
- Jacqueline Batley
- School of Biological Sciences, University of Bristol, United Kingdom
| | | | | | | | | |
Collapse
|