26
|
Tian HF, Hu QM, Li Z. Genome-wide identification of simple sequence repeats and development of polymorphic SSR markers in swamp eel (Monopterus albus). Sci Prog 2021; 104:368504211035597. [PMID: 34375541 PMCID: PMC10358632 DOI: 10.1177/00368504211035597] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
OBJECTIVES Swamp eel is one model species for sexual reversion and an aquaculture fish in China. One local strain with deep yellow and big spots of Monopterus albus has been selected for consecutive selective breeding. The objectives of this study were characterizing the Simple Sequence Repeats (SSRs) of M. albus in the assembled genome obtained recently, and developing polymorphic SSRs for future breeding programs. METHODS The genome wide SSRs were mined by using MISA software, and their types and genomic distribution patterns were investigated. Based on the available flanking sequences, primer pairs were batched developed, and Polymorphic SSRs were identified by using Polymorphic SSR Retrieval tool. The obtained polymorphic SSRs were validated by using e-PCR and capillary electrophoresis, then they were used to investigate genetic diversity of one breeding population. RESULTS A total of 364,802 SSRs were identified in assembled M. albus genome. The total length, density and frequency of SSRs were 8,204,641 bp, 10,259 bp/Mb, and 456.16 loci/Mb, respectively. Mononucleotide repeats were predominant among SSRs (33.33%), and AC and AAT repeats were the most abundant di- and tri-nucleotide repeats motifs. A total of 287,189 primer pairs were designed, and a high-density physical map was constructed (359.11 markers per Mb). A total of 871 polymorphic SSRs were identified, and 38 SSRs of 101 randomly selected ones were validated by using e-PCR and capillary electrophoresis. Using these 38 polymorphic SSRs, 201 alleles were detected and genetic diversity level (Na, PIC, HO, and He) was evaluated. CONCLUSIONS The genome-wide SSRs and newly developed SSR markers will provide a useful tool for genetic mapping, diversity analysis studies in swamp eel in the future. The high level of genetic diversity (Na = 5.29, PIC = 0.5068, HO = 0.4665, He = 0.5525) but excess of homozygotes (FIS = 0.155) in one breeding population provide baseline information for future breeding program.
Collapse
|
27
|
Ye X, Lin J, Zhou M, He X, Yan M, Cheng R. The complete chloroplast genome of the medicinal plant Polygonum cuspidatum (Polygonaceae) and its phylogenetic implications within the subfamily Polygonoideae. MITOCHONDRIAL DNA PART B-RESOURCES 2021; 6:1563-1565. [PMID: 33969219 PMCID: PMC8078928 DOI: 10.1080/23802359.2021.1917313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Polygonum cuspidatum Siebold & Zucc. is a well-known and widely used medical plant to treat arthritis, gout and inflammation. In this study, we determined the complete chloroplast genome sequence of P. cuspidatum from Zhejiang Province. The assembled chloroplast (cp) genome was 163,183 bp in length, containing two inverted repeated (IR) regions of 30,859 bp each, a large single copy (LSC) region of 87,905 bp, and a small single copy (SSC) region of 13,560 bp. The genome encodes 131 genes, consisting of 86 protein-coding, 37 tRNA, and eight rRNA genes. The overall GC content of P. cuspidatum is 37.53%, with the highest GC content of 41.27% in the IR region. The 86 protein-coding genes encode 27,597 amino acids in total, most of which use the initiation codon ATG, except the ndhD gene which starts with ACG. The length of the tRNA genes range from 48 bp to 88 bp, with the highest GC content of 62.16% in tRNA-Arg (ACG) and tRNA-Asp (GUC). A total of 66 simple sequence repeats are identified in the cp of P. cuspidatum. Phylogenetic analysis indicated a sister relationship between P. cuspidatum and Fallopia sachalinensis, suggesting a close genetic relationship between the genera of Polygonum and Fallopia. This work provides basic genetic resources for investigating the evolutionary status and population genetics of this important medicinal species.
Collapse
|
28
|
Zhou M, Yan M, Ying Z, He X, Ge Y, Cheng R. Characterization of the complete chloroplast genome of Oxalis corymbosa DC. (Oxalidaceae), a medicinal plant from Zhejiang Province. Mitochondrial DNA B Resour 2021; 6:1138-1140. [PMID: 33796768 PMCID: PMC8006946 DOI: 10.1080/23802359.2021.1882905] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2020] [Accepted: 01/23/2021] [Indexed: 11/10/2022] Open
Abstract
Oxalis corymbosa DC. is an important medicinal and edible perennial herb belonging to the wood-sorrel family Oxalidaceae. In this study, we report the complete chloroplast (cp) genome sequence of O. corymbosa. The assembled chloroplast genome was 151,351 bp in length, containing two inverted repeated (IR) regions of 24,587 bp each, a large single copy (LSC) region of 85,476 bp, and a small single copy (SSC) region of 16,701 bp. The genome encodes 128 genes, consisting of 82 protein-coding genes, 37 tRNA genes, eight rRNA genes, and one pseudogene (ycf1). The 82 protein-coding genes encode 25,751 amino acids in total, most of which use the initiation codon ATG, except rps19 and psbC genes start with GTG. The lengths of the tRNA genes range from 71 bp to 93 bp, with the highest GC content of 62.16% in tRNA-Arg (ACG). The overall GC content of O. corymbosa is 36.47%, with the highest GC content of 42.64% in IR region. In addition, a total of 74 simple sequence repeats were identified in the cp genome of O. corymbosa. Phylogenetic analysis indicated a sister relationship between O. corymbosa and O. drummondii, suggesting a close genetic relationship between the two Oxalis species. This work provides basic genetic resources for investigating the evolutionary status and population genetics diversities for this medicinal species.
Collapse
|
29
|
Yang Y, Tian H, Wang R, Wang L, Yi H, Liu Y, Xu L, Fan Y, Zhao J, Wang F. Variety Discrimination Power: An Appraisal Index for Loci Combination Screening Applied to Plant Variety Discrimination. FRONTIERS IN PLANT SCIENCE 2021; 12:566796. [PMID: 33815430 PMCID: PMC8014032 DOI: 10.3389/fpls.2021.566796] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Accepted: 02/11/2021] [Indexed: 06/12/2023]
Abstract
Molecular marker technology is used widely in plant variety discrimination, molecular breeding, and other fields. To lower the cost of testing and improve the efficiency of data analysis, molecular marker screening is very important. Screening usually involves two phases: the first to control loci quality and the second to reduce loci quantity. To reduce loci quantity, an appraisal index that is very sensitive to a specific scenario is necessary to select loci combinations. In this study, we focused on loci combination screening for plant variety discrimination. A loci combination appraisal index, variety discrimination power (VDP), is proposed, and three statistical methods, probability-based VDP (P-VDP), comparison-based VDP (C-VDP), and ratio-based VDP (R-VDP), are described and compared. The results using the simulated data showed that VDP was sensitive to statistical populations with convergence toward the same variety, and the total probability of discrimination power (TDP) method was effective only for partial populations. R-VDP was more sensitive to statistical populations with convergence toward various varieties than P-VDP and C-VDP, which both had the same sensitivity; TDP was not sensitive at all. With the real data, R-VDP values for sorghum, wheat, maize and rice data begin to show downward tendency when the number of loci is 20, 7, 100, 100 respectively, while in the case of P-VDP and C-VDP (which have the same results), the number is 6, 4, 9, 19 respectively and in the case of TDP, the number is 6, 4, 4, 11 respectively. For the variety threshold setting, R-VDP values of loci combinations with different numbers of loci responded evenly to different thresholds. C-VDP values responded unevenly to different thresholds, and the extent of the response increased as the number of loci decreased. All the methods gave underestimations when data were missing, with systematic errors for TDP, C-VDP, and R-VDP going from smallest to biggest. We concluded that VDP was a better loci combination appraisal index than TDP for plant variety discrimination and the three VDP methods have different applications. We developed the software called VDPtools, which can calculate the values of TDP, P-VDP, C-VDP, and R-VDP. VDPtools is publicly available at https://github.com/caurwx1/VDPtools.git.
Collapse
|
30
|
Mizeriene G, Cerny K, Zyka V, Bakonyi J, Nagy ZÁ, Oliva J, Redondo MA, Corcobado T, Martín-García J, Prospero S. Patterns of Genetic Diversification in the Invasive Hybrid Plant Pathogen Phytophthora × alni and Its Parental Species P. uniformis. PHYTOPATHOLOGY 2020; 110:1959-1969. [PMID: 32633698 DOI: 10.1094/phyto-12-19-0475-r] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
In pathogenic fungi and oomycetes, interspecific hybridization may lead to the formation of new species having a greater impact on natural ecosystems than the parental species. From the early 1990s, a severe alder (Alnus spp.) decline due to an unknown Phytophthora species was observed in several European countries. Genetic analyses revealed that the disease was caused by the triploid hybrid P. × alni, which originated in Europe from the hybridization of P. uniformis and P. × multiformis. Here, we investigated the population structure of P. × alni (158 isolates) and P. uniformis (85 isolates) in several European countries using microsatellite markers. Our analyses confirmed the genetic structure previously observed in other European populations, with P. uniformis populations consisting of at most two multilocus genotypes (MLGs) and P. × alni populations dominated by MLG Pxa-1. The genetic structure of P. × alni populations in the Czech Republic, Hungary and Sweden seemed to reflect the physical isolation of river systems. Most rare P. × alni MLGs showed a loss of heterozygosity (LOH) at one or a few microsatellite loci compared with other MLGs. This LOH may allow a stabilization within the P. × alni genome or a rapid adaptation to stress situations. Alternatively, alleles may be lost because of random genetic drift in small, isolated populations, with no effect on fitness of P. × alni. Additional studies would be necessary to confirm these patterns of population diversification and to better understand the factors driving it.
Collapse
|
31
|
Jiang WZ, Yao FJ, Lu LX, Fang M, Wang P, Zhang YM, Meng JJ, Lu J, Ma XX, He Q, Shao KS. Genetic linkage map construction and quantitative trait loci mapping of agronomic traits in Gloeostereum incarnatum. J Microbiol 2020; 59:41-50. [PMID: 33201433 DOI: 10.1007/s12275-021-0242-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2020] [Revised: 08/18/2020] [Accepted: 09/15/2020] [Indexed: 11/30/2022]
Abstract
Gloeostereum incarnatum is an edible medicinal mushroom widely grown in China. Using the whole genome of G. incarnatum, simple sequence repeat (SSR) markers were developed and synthetic primers were designed to construct its first genetic linkage map. The 1,048.6 cm map is composed of 10 linkage groups and contains 183 SSR markers. In total, 112 genome assembly sequences were anchored, representing 16.43 Mb and covering 46.41% of the genome. Selfing populations were used for quantitative trait loci (QTL) targeting, and the composite interval mapping method was used to co-localize the mycelium growth rate (potato dextrose agar and sawdust), growth period, yield and fruiting body length, and width and thickness. The 14 QTLs of agronomic traits had LOD values of 3.20-6.51 and contribution rates of 2.22-13.18%. No linkage relationship was found between the mycelium growth rate and the growth period, but a linkage relationship was observed among the length, width and thickness of the fruiting bodies. Using NCBI's BLAST alignment, the genomic sequences corresponding to the QTL regions were compared, and a TPR-like protein candidate gene was selected. Using whole-genome data, 138 candidate genes were found in four sequence fragments of two SSR markers located in the same scaffold. The genetic map and QTLs established in this study will aid in developing selective markers for agronomic traits and identifying corresponding genes, thereby providing a scientific basis for the further gene mapping of quantitative traits and the marker-assisted selection of functional genes in G. incarnatum breeding programs.
Collapse
|
32
|
Hong F, Mo SH, Liu Y, Wei D. Transcriptomic Profiling of Various Developmental Stages of Aphis Aurantii to Provide a Genetic Resource for Gene Expression and SSR Analysis. Front Physiol 2020; 11:578939. [PMID: 33071832 PMCID: PMC7530277 DOI: 10.3389/fphys.2020.578939] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Accepted: 08/18/2020] [Indexed: 12/16/2022] Open
|
33
|
Eisenbart SK, Alzheimer M, Pernitzsch SR, Dietrich S, Stahl S, Sharma CM. A Repeat-Associated Small RNA Controls the Major Virulence Factors of Helicobacter pylori. Mol Cell 2020; 80:210-226.e7. [PMID: 33002424 DOI: 10.1016/j.molcel.2020.09.009] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2020] [Revised: 07/29/2020] [Accepted: 09/04/2020] [Indexed: 12/12/2022]
Abstract
Many bacterial pathogens regulate their virulence genes via phase variation, whereby length-variable simple sequence repeats control the transcription or coding potential of those genes. Here, we have exploited this relationship between DNA structure and physiological function to discover a globally acting small RNA (sRNA) regulator of virulence in the gastric pathogen Helicobacter pylori. Our study reports the first sRNA whose expression is affected by a variable thymine (T) stretch in its promoter. We show the sRNA post-transcriptionally represses multiple major pathogenicity factors of H. pylori, including CagA and VacA, by base pairing to their mRNAs. We further demonstrate transcription of the sRNA is regulated by the nickel-responsive transcriptional regulator NikR (thus named NikS for nickel-regulated sRNA), thereby linking virulence factor regulation to nickel concentrations. Using in-vitro infection experiments, we demonstrate NikS affects host cell internalization and epithelial barrier disruption. Together, our results show NikS is a phase-variable, post-transcriptional global regulator of virulence properties in H. pylori.
Collapse
|
34
|
Wang L, Lu G, Liu H, Huang L, Jiang W, Li P, Lu X. The complete chloroplast genome sequence of Gynostemma yixingense and comparative analysis with congeneric species. Genet Mol Biol 2020; 43:e20200092. [PMID: 33001132 PMCID: PMC7521087 DOI: 10.1590/1678-4685-gmb-2020-0092] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2020] [Accepted: 07/19/2020] [Indexed: 11/21/2022] Open
Abstract
Gynostemma yixingense, an important medicinal member of the Cucurbitaceae family, is an endemic herbaceous species distributed in East China. It is morphologically similar to the plants in the same genus, which resulted in some confusion in identification and application. Meanwhile, there are still some controversies in taxonomy. Herein, the complete chloroplast genome sequence of G. yixingense was obtained by Illumina paired-end sequencing technology and compared to other chloroplast genome sequences of congeneric species. The complete chloroplast genome of G. yixingense is 157,910 bp in length with 36.94% GC content and contains a large single-copy (LSC) region of 86,791 bp, a small single-copy (SSC) region of 18,635 bp and a pair of inverted repeat (IR) regions of 26,242 bp. The whole genome contains 133 unique genes, including 87 protein-coding genes, 37 tRNA genes, eight rRNA genes and one pseudogene. In addition, 74 simple sequence repeats (SSRs) were identified, most of which were A/T rich. The phylogenetic analysis indicated that G. yixingense had the closest relationship to G. laxiflorum. The result of this study provided an important theoretical basis for chloroplast genome and phylogenetic analysis of G. yixingense.
Collapse
|
35
|
Low Base-Substitution Mutation Rate but High Rate of Slippage Mutations in the Sequence Repeat-Rich Genome of Dictyostelium discoideum. G3-GENES GENOMES GENETICS 2020; 10:3445-3452. [PMID: 32732307 PMCID: PMC7466956 DOI: 10.1534/g3.120.401578] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
We describe the rate and spectrum of spontaneous mutations for the social amoeba Dictyostelium discoideum, a key model organism in molecular, cellular, evolutionary and developmental biology. Whole-genome sequencing of 37 mutation accumulation lines of D. discoideum after an average of 1,500 cell divisions yields a base-substitution mutation rate of 2.47 × 10−11 per site per generation, substantially lower than that of most eukaryotic and prokaryotic organisms, and of the same order of magnitude as in the ciliates Paramecium tetraurelia and Tetrahymena thermophila. Known for its high genomic AT content and abundance of simple sequence repeats, we observe that base-substitution mutations in D. discoideum are highly A/T biased. This bias likely contributes both to the high genomic AT content and to the formation of simple sequence repeats in the AT-rich genome of Dictyostelium discoideum. In contrast to the situation in other surveyed unicellular eukaryotes, indel rates far exceed the base-substitution mutation rate in this organism with a high proportion of 3n indels, particularly in regions without simple sequence repeats. Like ciliates, D. discoideum has a large effective population size, reducing the power of random genetic drift, magnifying the effect of selection on replication fidelity, in principle allowing D. discoideum to evolve an extremely low base-substitution mutation rate.
Collapse
|
36
|
Gou X, Shi H, Yu S, Wang Z, Li C, Liu S, Ma J, Chen G, Liu T, Liu Y. SSRMMD: A Rapid and Accurate Algorithm for Mining SSR Feature Loci and Candidate Polymorphic SSRs Based on Assembled Sequences. Front Genet 2020; 11:706. [PMID: 32849772 PMCID: PMC7398111 DOI: 10.3389/fgene.2020.00706] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Accepted: 06/10/2020] [Indexed: 12/16/2022] Open
Abstract
Microsatellites or simple sequence repeats (SSRs) are short tandem repeats of DNA widespread in genomes and transcriptomes of diverse organisms and are used in various genetic studies. Few software programs that mine SSRs can be further used to mine polymorphic SSRs, and these programs have poor portability, have slow computational speed, are highly dependent on other programs, and have low marker development rates. In this study, we develop an algorithm named Simple Sequence Repeat Molecular Marker Developer (SSRMMD), which uses improved regular expressions to rapidly and exhaustively mine perfect SSR loci from any size of assembled sequence. To mine polymorphic SSRs, SSRMMD uses a novel three-stage method to assess the conservativeness of SSR flanking sequences and then uses the sliding window method to fragment each assembled sequence to assess its uniqueness. Furthermore, molecular biology assays support the polymorphic SSRs identified by SSRMMD. SSRMMD is implemented using the Perl programming language and can be downloaded from https://github.com/GouXiangJian/SSRMMD.
Collapse
|
37
|
Herbert A. Simple Repeats as Building Blocks for Genetic Computers. Trends Genet 2020; 36:739-750. [PMID: 32690316 DOI: 10.1016/j.tig.2020.06.012] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2020] [Revised: 06/17/2020] [Accepted: 06/22/2020] [Indexed: 11/15/2022]
Abstract
Processing of RNA involves heterogeneous nuclear ribonucleoproteins. The simple sequence repeats (SSRs) they bind can also adopt alternative DNA structures, like Z DNA, triplexes, G quadruplexes, and I motifs. Those SSRs capable of switching conformation under physiological conditions (called flipons) are genetic elements that can encode alternative RNA processing by their effects on RNA processivity, most likely as DNA:RNA hybrids. Flipons are elements of a binary, instructive genetic code directing how genomic sequences are compiled into transcripts. The combinatorial nature of this code provides a rich set of options for creating genetic computers able to reproduce themselves and use a heritable and evolvable code to optimize survival. The underlying computational logic potentiates a diverse set of genetic programs that modify cis-mediated heritability and disease risk.
Collapse
|
38
|
Sasseron GR, Benchimol-Reis LL, Perseguini JMKC, Paulino JFC, Bajay MM, Carbonell SAM, Chiorato AF. Fusarium oxysporum f. sp. phaseoli genetic variability assessed by new developed microsatellites. Genet Mol Biol 2020; 43:e20190267. [PMID: 32478796 PMCID: PMC7263423 DOI: 10.1590/1678-4685-gmb-2019-0267] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2019] [Accepted: 03/11/2020] [Indexed: 11/22/2022] Open
Abstract
Fusarium oxysporum f. sp. phaseoli (Fop) J.B. Kendrich & W.C. Snyder is the causal agent of Fusarium wilt of common bean (Phaseolus vulgaris L.). The objective of this study was to develop microsatellite markers (SSRs) to characterize the genetic diversity of Fop. Two libraries enriched with SSRs were developed and a total of 40 pairs of SSRs were characterized. Out of these, 15 SSRs were polymorphic for 42 Fop isolates. The number of alleles varied from two to ten, with an average of four alleles per locus and an average PIC (Polymorphic Information Content) of 0.38. The genetic diversity assessed by microsatellites for Fop was low, as expected for an asexual fungus, and not associated with geographic origin, but they were able to detect enough genetic variability among isolates in order to differentiate them. Microsatellites are a robust tool widely used for genetic fingerprinting and population structure analyses. SSRs for Fop may be an efficient tool for a better understanding of the ecology, epidemiology and evolution of this pathogen.
Collapse
|
39
|
Li Y, Chen X, Wu K, Pan J, Long H, Yan Y. Characterization of Simple Sequence Repeats (SSRs) in Ciliated Protists Inferred by Comparative Genomics. Microorganisms 2020; 8:microorganisms8050662. [PMID: 32370063 PMCID: PMC7285179 DOI: 10.3390/microorganisms8050662] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Revised: 04/24/2020] [Accepted: 04/26/2020] [Indexed: 01/02/2023] Open
Abstract
Simple sequence repeats (SSRs) are prevalent in the genomes of all organisms. They are widely used as genetic markers, and are insertion/deletion mutation hotspots, which directly influence genome evolution. However, little is known about such important genomic components in ciliated protists, a large group of unicellular eukaryotes with extremely long evolutionary history and genome diversity. With recent publications of multiple ciliate genomes, we start to get a chance to explore perfect SSRs with motif size 1-100 bp and at least three motif repeats in nine species of two ciliate classes, Oligohymenophorea and Spirotrichea. We found that homopolymers are the most prevalent SSRs in these A/T-rich species, with AAA (lysine, charged amino acid; also seen as an SSR with one-adenine motif repeated three times) being the codons repeated at the highest frequencies in coding SSR regions, consistent with the widespread alveolin proteins rich in lysine repeats as found in Tetrahymena. Micronuclear SSRs are universally more abundant than the macronuclear ones of the same motif-size, except for the 8-bp-motif SSRs in extensively fragmented chromosomes. Both the abundance and A/T content of SSRs decrease as motif-size increases, while the abundance is positively correlated with the A/T content of the genome. Also, smaller genomes have lower proportions of coding SSRs out of all SSRs in Paramecium species. This genome-wide and cross-species analysis reveals the high diversity of SSRs and reflects the rapid evolution of these simple repetitive elements in ciliate genomes.
Collapse
|
40
|
Gao C, Wang Q, Ying Z, Ge Y, Cheng R. Molecular structure and phylogenetic analysis of complete chloroplast genomes of medicinal species Paeonia lactiflora from Zhejiang Province. MITOCHONDRIAL DNA PART B-RESOURCES 2020; 5:1077-1078. [PMID: 33366882 PMCID: PMC7748428 DOI: 10.1080/23802359.2020.1721372] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Paeonia lactiflora is a geo-authentic and superior medicinal plant in Zhejiang province. Here, we report the complete chloroplast genome sequence of P. lactiflora. The total genome size of P. lactiflora is 153,405 bp in length, including a small single-copy (SSC) region of 16,969 bp, a large single-copy (LSC) region of 84,340 bp separated by a pair of inverted repeats (IRs) of 26,048 bp. The overall annotated gene number is 109, containing 76 protein-coding genes, 29 tRNAs and 4 rRNAs. The entire GC content of P. lactiflora is 38.43%, with the highest GC content of 42.99% in IR region. A total of 52 simple sequence repeats are identified in the cp genome of P. lactiflora. Phylogenetic analysis indicated a sister relationship between P. lactiflora and P. veitchii, and supported a unique evolutionary status of Family Paeoniaceae. This work provides a valuable genetic resource to develop robust markers and investigate the population genetics diversities for this famous medicinal species.
Collapse
|
41
|
Li YG, Jiang WY, Zhang QF, Ali E, Ji P, Pan HY, Sun LP. Population structure and genetic diversity of Setosphaeria turcica from corn in Heilongjiang province, China. J Appl Microbiol 2019; 127:1814-1823. [PMID: 31517423 DOI: 10.1111/jam.14449] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2019] [Revised: 09/04/2019] [Accepted: 09/06/2019] [Indexed: 11/26/2022]
Abstract
AIM The aims of this study were to identify races and mating types of Setosphaeria turcica causing northern corn leaf blight in Heilongjiang province of China and analyse the genetic diversity of S. turcica isolates using SSR markers. METHODS AND RESULTS Based on gene-for-gene interactions, 13 races of S. turcica (races 0, 1, 2, 3, 12, 13, 23, 123, N, 1N, 12N, 3N and 23N) were isolated from infected corn plants in Heilongjiang province. Races 0 and 1 were the predominant races, and race 23N was identified for the first time in the region. Using two pairs of specific primers, three mating types, 'a', 'Aa' and 'A', were identified, with 'a' being the predominant mating type. SSR markers were used to analyse genetic diversity of 60 S. turcica isolates. Five SSR primers were polymorphic, which resulted in 45 reproducible bands with 2-15 bands for each primer. Cluster analysis separated the isolates into five groups at a similarity coefficient of 0·84. Analysis of molecular variance showed that there was significant correlation between SSR groups and mating type of the isolates. No significant correlation was found between SSR groups and physiological races or geographical location of the isolates. CONCLUSIONS The work reported that races 0 and 1 were the predominant races, and race 23N was identified for the first time in Heilongjiang province with 'a' being the predominant mating type. There was significant correlation between SSR groups and mating type of S. turcica isolates. SIGNIFICANCE AND IMPACT OF THE STUDY Our results provide information on population structure and genetic diversity of S. turcica causing Northern corn leaf blight, which will facilitate the development of effective disease management programs.
Collapse
|
42
|
Transcriptomic Analysis of Marine Gastropod Hemifusus tuba Provides Novel Insights into Conotoxin Genes. Mar Drugs 2019; 17:md17080466. [PMID: 31405144 PMCID: PMC6722550 DOI: 10.3390/md17080466] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2019] [Revised: 08/03/2019] [Accepted: 08/07/2019] [Indexed: 12/26/2022] Open
Abstract
The marine gastropod Hemifusus tuba is served as a luxury food in Asian countries and used in traditional Chinese medicine to treat lumbago and deafness. The lack of genomic data on H. tuba is a barrier to aquaculture development and functional characteristics of potential bioactive molecules are poorly understood. In the present study, we used high-throughput sequencing technologies to generate the first transcriptomic database of H. tuba. A total of 41 unique conopeptides were retrieved from 44 unigenes, containing 6-cysteine frameworks belonging to four superfamilies. Duplication of mature regions and alternative splicing were also found in some of the conopeptides, and the de novo assembly identified a total of 76,306 transcripts with an average length of 824.6 nt, of which including 75,620 (99.1%) were annotated. In addition, simple sequence repeats (SSRs) detection identified 14,000 unigenes containing 20,735 SSRs, among which, 23 polymorphic SSRs were screened. Thirteen of these markers could be amplified in Hemifusus ternatanus and seven in Rapana venosa. This study provides reports of conopeptide genes in Buccinidae for the first time as well as genomic resources for further drug development, gene discovery and population resource studies of this species.
Collapse
|
43
|
Mi R, Wang T, Dunn DW, Huang K, Li B. Development of simple sequence repeat markers for Chamerion angustifolium (Onagraceae). APPLICATIONS IN PLANT SCIENCES 2019; 7:e01244. [PMID: 31139510 PMCID: PMC6526638 DOI: 10.1002/aps3.1244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 12/02/2018] [Accepted: 01/09/2019] [Indexed: 06/09/2023]
Abstract
PREMISE Rosebay willowherb, or fireweed (Chamerion angustifolium: Onagraceae), has diploid, tetraploid, and hexaploid cytotypes. There are known physiological and ecological differences among the three cytotypes, but genetic differences remain undetermined. We developed simple sequence repeat (SSR) markers for this species. METHODS AND RESULTS Leaf samples were collected from three hexaploid C. angustifolium populations. We successfully amplified 16 SSR loci, which were found to be highly polymorphic. The number of alleles, the observed heterozygosity levels, and the expected heterozygosity levels ranged from four to 13, 0.286-0.899, and 0.372-0.871, respectively. Most primers could also be amplified successfully in C. conspersum and the closely related species Epilobium palustre. CONCLUSIONS The 16 polymorphic SSR markers developed here will be useful for genetic studies in C. angustifolium and related species.
Collapse
|
44
|
Alam CM, Iqbal A, Sharma A, Schulman AH, Ali S. Microsatellite Diversity, Complexity, and Host Range of Mycobacteriophage Genomes of the Siphoviridae Family. Front Genet 2019; 10:207. [PMID: 30923537 PMCID: PMC6426759 DOI: 10.3389/fgene.2019.00207] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Accepted: 02/26/2019] [Indexed: 01/21/2023] Open
Abstract
The incidence, distribution, and variation of simple sequence repeats (SSRs) in viruses is instrumental in understanding the functional and evolutionary aspects of repeat sequences. Full-length genome sequences retrieved from NCBI were used for extraction and analysis of repeat sequences using IMEx software. We have also developed two MATLAB-based tools for extraction of gene locations from GenBank in tabular format and simulation of this data with SSR incidence data. Present study encompassing 147 Mycobacteriophage genomes revealed 25,284 SSRs and 1,127 compound SSRs (cSSRs) through IMEx. Mono- to hexa-nucleotide motifs were present. The SSR count per genome ranged from 78 (M100) to 342 (M58) while cSSRs incidence ranged from 1 (M138) to 17 (M28, M73). Though cSSRs were present in all the genomes, their frequency and SSR to cSSR conversion percentage varied from 1.08 (M138 with 93 SSRs) to 8.33 (M116 with 96 SSRs). In terms of localization, the SSRs were predominantly localized to coding regions (∼78%). Interestingly, genomes of around 50 kb contained a similar number of SSRs/cSSRs to that in a 110 kb genome, suggesting functional relevance for SSRs which was substantiated by variation in motif constitution between species with different host range. The three species with broad host range (M97, M100, M116) have around 90% of their mono-nucleotide repeat motifs composed of G or C and only M16 has both A and T mononucleotide motifs. Around 20% of the di-nucleotide repeat motifs in the genomes exhibiting a broad host range were CT/TC, which were either absent or represented to a much lesser extent in the other genomes.
Collapse
|
45
|
De Novo Sequencing and Hybrid Assembly of the Biofuel Crop Jatropha curcas L.: Identification of Quantitative Trait Loci for Geminivirus Resistance. Genes (Basel) 2019; 10:genes10010069. [PMID: 30669588 PMCID: PMC6356885 DOI: 10.3390/genes10010069] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2018] [Revised: 12/03/2018] [Accepted: 12/07/2018] [Indexed: 12/26/2022] Open
Abstract
Jatropha curcas is an important perennial, drought tolerant plant that has been identified as a potential biodiesel crop. We report here the hybrid de novo genome assembly of J. curcas generated using Illumina and PacBio sequencing technologies, and identification of quantitative loci for Jatropha Mosaic Virus (JMV) resistance. In this study, we generated scaffolds of 265.7 Mbp in length, which correspond to 84.8% of the gene space, using Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis. Additionally, 96.4% of predicted protein-coding genes were captured in RNA sequencing data, which reconfirms the accuracy of the assembled genome. The genome was utilized to identify 12,103 dinucleotide simple sequence repeat (SSR) markers, which were exploited in genetic diversity analysis to identify genetically distinct lines. A total of 207 polymorphic SSR markers were employed to construct a genetic linkage map for JMV resistance, using an interspecific F₂ mapping population involving susceptible J. curcas and resistant Jatropha integerrima as parents. Quantitative trait locus (QTL) analysis led to the identification of three minor QTLs for JMV resistance, and the same has been validated in an alternate F₂ mapping population. These validated QTLs were utilized in marker-assisted breeding for JMV resistance. Comparative genomics of oil-producing genes across selected oil producing species revealed 27 conserved genes and 2986 orthologous protein clusters in Jatropha. This reference genome assembly gives an insight into the understanding of the complex genetic structure of Jatropha, and serves as source for the development of agronomically improved virus-resistant and oil-producing lines.
Collapse
|
46
|
D'Agostino N, Tamburino R, Cantarella C, De Carluccio V, Sannino L, Cozzolino S, Cardi T, Scotti N. The Complete Plastome Sequences of Eleven Capsicum Genotypes: Insights into DNA Variation and Molecular Evolution. Genes (Basel) 2018; 9:E503. [PMID: 30336638 PMCID: PMC6210379 DOI: 10.3390/genes9100503] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Revised: 10/11/2018] [Accepted: 10/11/2018] [Indexed: 11/16/2022] Open
Abstract
Members of the genus Capsicum are of great economic importance, including both wild forms and cultivars of peppers and chilies. The high number of potentially informative characteristics that can be identified through next-generation sequencing technologies gave a huge boost to evolutionary and comparative genomic research in higher plants. Here, we determined the complete nucleotide sequences of the plastomes of eight Capsicum species (eleven genotypes), representing the three main taxonomic groups in the genus and estimated molecular diversity. Comparative analyses highlighted a wide spectrum of variation, ranging from point mutations to small/medium size insertions/deletions (InDels), with accD, ndhB, rpl20, ycf1, and ycf2 being the most variable genes. The global pattern of sequence variation is consistent with the phylogenetic signal. Maximum-likelihood tree estimation revealed that Capsicum chacoense is sister to the baccatum complex. Divergence and positive selection analyses unveiled that protein-coding genes were generally well conserved, but we identified 25 positive signatures distributed in six genes involved in different essential plastid functions, suggesting positive selection during evolution of Capsicum plastomes. Finally, the identified sequence variation allowed us to develop simple PCR-based markers useful in future work to discriminate species belonging to different Capsicum complexes.
Collapse
|
47
|
Şahin EÇ, Kalenderoğlu A, Aydın Y, Evci G, Uncuoğlu AA. SSR Markers Suitable for Marker Assisted Selection in Sunflower for Downy Mildew Resistance. Open Life Sci 2018; 13:319-326. [PMID: 33817099 PMCID: PMC7874726 DOI: 10.1515/biol-2018-0039] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2018] [Accepted: 06/10/2018] [Indexed: 11/15/2022] Open
Abstract
The effectiveness of Pl genes is known to be resistant to downy mildew (DM) disease affected by fungus Plasmopara halstedii in sunflower. In this study phenotypic analysis was performed using inoculation tests and genotypic analysis were carried out with three DM resistance genes Plarg, Pl13 and Pl8. A total of 69 simple sequence repeat markers and 241 F2 individuals derived from a cross of RHA-419 (R) x P6LC (S), RHA-419 (R) x CL (S), RHA-419 (R) x OL (S), RHA419 (R) x 9758R (S), HA-R5 (R) x P6LC (S) and HA89 (R) x P6LC (S) parental lines were used to identify resistant hybrids in sunflower. Results of SSR analysis using markers linked with downy mildew resistance genes (Plarg, Pl8 and Pl13) and downy mildew inoculation tests were evaluated together and ORS716 (for Plarg and Pl13), HA4011 (for Pl8) markers showed positive correlation with their phenotypic results. These results suggest that these markers are associated with DM resistance and they can be used successfully in marker-assisted selection for sunflower breeding programs specific for downy mildew resistance.
Collapse
|
48
|
Molecular Genotyping (SSR) and Agronomic Phenotyping for Utilization of Durum Wheat ( Triticum durum Desf.) Ex Situ Collection from Southern Italy: A Combined Approach Including Pedigreed Varieties. Genes (Basel) 2018; 9:genes9100465. [PMID: 30241387 PMCID: PMC6211131 DOI: 10.3390/genes9100465] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2018] [Revised: 09/18/2018] [Accepted: 09/18/2018] [Indexed: 11/17/2022] Open
Abstract
In South Italy durum wheat (Triticum durum Desf.) has a long-time tradition of growing and breeding. Accessions collected and now preserved ex situ are a valuable genetic resource, but their effective use in agriculture and breeding programs remains very low. In this study, a small number (44) of simple sequence repeats (SSR) molecular markers were used to detect pattern of diversity for 136 accessions collected in South Italy over time, to identify the genepool of origin, and establish similarities with 28 Italian varieties with known pedigree grown in Italy over the same time-period. Phenotyping was conducted for 12 morphophysiological characters of agronomic interest. Based on discriminant analysis of principal components (DAPC) and STRUCTURE analysis six groups were identified, the assignment of varieties reflected the genetic basis and breeding strategies involved in their development. Some “old” varieties grown today are the result of evolution through natural hybridization and conservative pure line selection. A small number of molecular markers and little phenotyping coupled with powerful statistical analysis and comparison to pedigreed varieties can provide enough information on the genetic structure of durum wheat germplasm for a quick screening of the germplasm collection able to identify accessions for breeding or introduction in low input agriculture.
Collapse
|
49
|
Avvaru AK, Saxena S, Sowpati DT, Mishra RK. MSDB: A Comprehensive Database of Simple Sequence Repeats. Genome Biol Evol 2018; 9:1797-1802. [PMID: 28854643 PMCID: PMC5533116 DOI: 10.1093/gbe/evx132] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/12/2017] [Indexed: 11/13/2022] Open
Abstract
Microsatellites, also known as Simple Sequence Repeats (SSRs), are short tandem repeats of 1-6 nt motifs present in all genomes, particularly eukaryotes. Besides their usefulness as genome markers, SSRs have been shown to perform important regulatory functions, and variations in their length at coding regions are linked to several disorders in humans. Microsatellites show a taxon-specific enrichment in eukaryotic genomes, and some may be functional. MSDB (Microsatellite Database) is a collection of >650 million SSRs from 6,893 species including Bacteria, Archaea, Fungi, Plants, and Animals. This database is by far the most exhaustive resource to access and analyze SSR data of multiple species. In addition to exploring data in a customizable tabular format, users can view and compare the data of multiple species simultaneously using our interactive plotting system. MSDB is developed using the Django framework and MySQL. It is freely available at http://tdb.ccmb.res.in/msdb.
Collapse
|
50
|
Rouiss H, Bakry F, Froelicher Y, Navarro L, Aleza P, Ollitrault P. Origin of C. latifolia and C. aurantiifolia triploid limes: the preferential disomic inheritance of doubled-diploid 'Mexican' lime is consistent with an interploid hybridization hypothesis. ANNALS OF BOTANY 2018; 121:571-585. [PMID: 29293884 PMCID: PMC5838810 DOI: 10.1093/aob/mcx179] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Accepted: 11/14/2017] [Indexed: 05/23/2023]
Abstract
Background and Aims Two main types of triploid limes are produced worldwide. The 'Tahiti' lime type (Citrus latifolia) is predominant, while the 'Tanepao' type (C. aurantiifolia) is produced to a lesser extent. Both types result from natural interspecific hybridization involving a diploid gamete of C. aurantiifolia 'Mexican' lime type (itself a direct interspecific C. micrantha × C. medica hybrid). The meiotic behaviour of a doubled-diploid 'Mexican' lime, the interspecific micrantha/medica recombination and the resulting diploid gamete structures were analysed to investigate the possibility that 'Tahiti' and 'Tanepao' varieties are derived from natural interploid hybridization. Methods A population of 85 tetraploid hybrids was established between a doubled-diploid clementine and a doubled-diploid 'Mexican' lime and used to infer the genotypes of 'Mexican' lime diploid gametes. Meiotic behaviour was studied through combined segregation analysis of 35 simple sequenbce repeat (SSR) and single nucleotide polymorphismn (SNP) markers covering the nine citrus chromosomes and cytogenetic studies. It was supplemented by pollen viability assessment. Key Results Pollen viability of the doubled-diploid Mexican lime (64 %) was much higher than that of the diploid. On average, 65 % of the chromosomes paired as bivalents and 31.4 % as tetravalents. Parental heterozygosity restitution ranged from 83 to 99 %. Disomic inheritance with high preferential pairing values was deduced for three chromosomes. Intermediate inheritances, with disomic trend, were found for five chromosomes, and an intermediate inheritance was observed for one chromosome. The average effective interspecific recombination rate was low (1.2 cM Mb-1). Conclusion The doubled-diploid 'Mexican' lime had predominantly disomic segregation, producing interspecific diploid gamete structures with high C. medica/C. micrantha heterozygosity, compatible with the phylogenomic structures of triploid C. latifolia and C. aurantiifolia varieties. This disomic trend limits effective interspecific recombination and diversity of the diploid gamete population. Interploid reconstruction breeding using doubled-diploid lime as one parent is a promising approach for triploid lime diversification.
Collapse
|