1
|
Geraffi N, Gupta P, Wagner N, Barash I, Pupko T, Sessa G. Comparative sequence analysis of pPATH pathogenicity plasmids in Pantoea agglomerans gall-forming bacteria. Front Plant Sci 2023; 14:1198160. [PMID: 37583594 PMCID: PMC10425158 DOI: 10.3389/fpls.2023.1198160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 07/10/2023] [Indexed: 08/17/2023]
Abstract
Acquisition of the pathogenicity plasmid pPATH that encodes a type III secretion system (T3SS) and effectors (T3Es) has likely led to the transition of a non-pathogenic bacterium into the tumorigenic pathogen Pantoea agglomerans. P. agglomerans pv. gypsophilae (Pag) forms galls on gypsophila (Gypsophila paniculata) and triggers immunity on sugar beet (Beta vulgaris), while P. agglomerans pv. betae (Pab) causes galls on both gypsophila and sugar beet. Draft sequences of the Pag and Pab genomes were previously generated using the MiSeq Illumina technology and used to determine partial T3E inventories of Pab and Pag. Here, we fully assembled the Pab and Pag genomes following sequencing with PacBio technology and carried out a comparative sequence analysis of the Pab and Pag pathogenicity plasmids pPATHpag and pPATHpab. Assembly of Pab and Pag genomes revealed a ~4 Mbp chromosome with a 55% GC content, and three and four plasmids in Pab and Pag, respectively. pPATHpag and pPATHpab share 97% identity within a 74% coverage, and a similar GC content (51%); they are ~156 kb and ~131 kb in size and consist of 198 and 155 coding sequences (CDSs), respectively. In both plasmids, we confirmed the presence of highly similar gene clusters encoding a T3SS, as well as auxin and cytokinins biosynthetic enzymes. Three putative novel T3Es were identified in Pab and one in Pag. Among T3SS-associated proteins encoded by Pag and Pab, we identified two novel chaperons of the ShcV and CesT families that are present in both pathovars with high similarity. We also identified insertion sequences (ISs) and transposons (Tns) that may have contributed to the evolution of the two pathovars. These include seven shared IS elements, and three ISs and two transposons unique to Pab. Finally, comparative sequence analysis revealed plasmid regions and CDSs that are present only in pPATHpab or in pPATHpag. The high similarity and common features of the pPATH plasmids support the hypothesis that the two strains recently evolved into host-specific pathogens.
Collapse
Affiliation(s)
- Naama Geraffi
- School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Priya Gupta
- School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Naama Wagner
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Isaac Barash
- School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Tal Pupko
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Guido Sessa
- School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
2
|
Zehentner B, Ardern Z, Kreitmeier M, Scherer S, Neuhaus K. A Novel pH-Regulated, Unusual 603 bp Overlapping Protein Coding Gene pop Is Encoded Antisense to ompA in Escherichia coli O157:H7 (EHEC). Front Microbiol 2020; 11:377. [PMID: 32265854 PMCID: PMC7103648 DOI: 10.3389/fmicb.2020.00377] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2019] [Accepted: 02/20/2020] [Indexed: 12/23/2022] Open
Abstract
Antisense transcription is well known in bacteria. However, translation of antisense RNAs is typically not considered, as the implied overlapping coding at a DNA locus is assumed to be highly improbable. Therefore, such overlapping genes are systematically excluded in prokaryotic genome annotation. Here we report an exceptional 603 bp long open reading frame completely embedded in antisense to the gene of the outer membrane protein ompA. An active σ70 promoter, transcription start site (TSS), Shine-Dalgarno motif and rho-independent terminator were experimentally validated, providing evidence that this open reading frame has all the structural features of a functional gene. Furthermore, ribosomal profiling revealed translation of the mRNA, the protein was detected in Western blots and a pH-dependent phenotype conferred by the protein was shown in competitive overexpression growth experiments of a translationally arrested mutant versus wild type. We designate this novel gene pop (pH-regulated overlapping protein-coding gene), thus adding another example to the growing list of overlapping, protein coding genes in bacteria.
Collapse
Affiliation(s)
- Barbara Zehentner
- Chair for Microbial Ecology, Technical University of Munich, Freising, Germany
| | - Zachary Ardern
- Chair for Microbial Ecology, Technical University of Munich, Freising, Germany
| | - Michaela Kreitmeier
- Chair for Microbial Ecology, Technical University of Munich, Freising, Germany
| | - Siegfried Scherer
- Chair for Microbial Ecology, Technical University of Munich, Freising, Germany
- ZIEL – Institute for Food & Health, Technical University of Munich, Freising, Germany
| | - Klaus Neuhaus
- ZIEL – Institute for Food & Health, Technical University of Munich, Freising, Germany
- Core Facility Microbiome, ZIEL – Institute for Food & Health, Technical University of Munich, Freising, Germany
| |
Collapse
|
3
|
Hücker SM, Vanderhaeghen S, Abellan-Schneyder I, Scherer S, Neuhaus K. The Novel Anaerobiosis-Responsive Overlapping Gene ano Is Overlapping Antisense to the Annotated Gene ECs2385 of Escherichia coli O157:H7 Sakai. Front Microbiol 2018; 9:931. [PMID: 29867840 PMCID: PMC5960689 DOI: 10.3389/fmicb.2018.00931] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2018] [Accepted: 04/23/2018] [Indexed: 12/26/2022] Open
Abstract
Current notion presumes that only one protein is encoded at a given bacterial genetic locus. However, transcription and translation of an overlapping open reading frame (ORF) of 186 bp length were discovered by RNAseq and RIBOseq experiments. This ORF is almost completely embedded in the annotated L,D-transpeptidase gene ECs2385 of Escherichia coli O157:H7 Sakai in the antisense reading frame -3. The ORF is transcribed as part of a bicistronic mRNA, which includes the annotated upstream gene ECs2384, encoding a murein lipoprotein. The transcriptional start site of the operon resides 38 bp upstream of the ECs2384 start codon and is driven by a predicted σ70 promoter, which is constitutively active under different growth conditions. The bicistronic operon contains a ρ-independent terminator just upstream of the novel gene, significantly decreasing its transcription. The novel gene can be stably expressed as an EGFP-fusion protein and a translationally arrested mutant of ano, unable to produce the protein, shows a growth advantage in competitive growth experiments compared to the wild type under anaerobiosis. Therefore, the novel antisense overlapping gene is named ano (anaerobiosis responsive overlapping gene). A phylostratigraphic analysis indicates that ano originated very recently de novo by overprinting after the Escherichia/Shigella clade separated from other enterobacteria. Therefore, ano is one of the very rare cases of overlapping genes known in the genus Escherichia.
Collapse
Affiliation(s)
- Sarah M Hücker
- Chair for Microbial Ecology, Technical University of Munich, Freising, Germany
| | - Sonja Vanderhaeghen
- Chair for Microbial Ecology, Technical University of Munich, Freising, Germany
| | | | - Siegfried Scherer
- Chair for Microbial Ecology, Technical University of Munich, Freising, Germany.,Institute for Food & Health, Technical University of Munich, Freising, Germany
| | - Klaus Neuhaus
- Chair for Microbial Ecology, Technical University of Munich, Freising, Germany.,Core Facility Microbiome/NGS, Institute for Food & Health, Technical University of Munich, Freising, Germany
| |
Collapse
|
4
|
Lechner M, Nickel AI, Wehner S, Riege K, Wieseke N, Beckmann BM, Hartmann RK, Marz M. Genomewide comparison and novel ncRNAs of Aquificales. BMC Genomics 2014; 15:522. [PMID: 24965762 PMCID: PMC4227106 DOI: 10.1186/1471-2164-15-522] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2013] [Accepted: 05/08/2014] [Indexed: 12/05/2022] Open
Abstract
Background The Aquificales are a diverse group of thermophilic bacteria that thrive in terrestrial and marine hydrothermal environments. They can be divided into the families Aquificaceae, Desulfurobacteriaceae and Hydrogenothermaceae. Although eleven fully sequenced and assembled genomes are available, only little is known about this taxonomic order in terms of RNA metabolism. Results In this work, we compare the available genomes, extend their protein annotation, identify regulatory sequences, annotate non-coding RNAs (ncRNAs) of known function, predict novel ncRNA candidates, show idiosyncrasies of the genetic decoding machinery, present two different types of transfer-messenger RNAs and variations of the CRISPR systems. Furthermore, we performed a phylogenetic analysis of the Aquificales based on entire genome sequences, and extended this by a classification among all bacteria using 16S rRNA sequences and a set of orthologous proteins. Combining several in silico features (e.g. conserved and stable secondary structures, GC-content, comparison based on multiple genome alignments) with an in vivo dRNA-seq transcriptome analysis of Aquifex aeolicus, we predict roughly 100 novel ncRNA candidates in this bacterium. Conclusions We have here re-analyzed the Aquificales, a group of bacteria thriving in extreme environments, sharing the feature of a small, compact genome with a reduced number of protein and ncRNA genes. We present several classical ncRNAs and riboswitch candidates. By combining in silico analysis with dRNA-seq data of A. aeolicus we predict nearly 100 novel ncRNA candidates.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Roland K Hartmann
- Institut für Pharmazeutische Chemie, Philipps-Universität Marburg, Marbacher Weg 6, 35032 Marburg, Germany.
| | | |
Collapse
|
5
|
Rossbach S, Kunze K, Albert S, Zehner S, Göttfert M. The Sinorhizobium meliloti EmrAB efflux system is regulated by flavonoids through a TetR-like regulator (EmrR). Mol Plant Microbe Interact 2014; 27:379-387. [PMID: 24224534 DOI: 10.1094/mpmi-09-13-0282-r] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
The divergently oriented Sinorhizobium meliloti emrAB (SMc03168 and SMc03167) and emrR (SMc03169) genes are predicted to encode an efflux system of the major facilitator superfamily and a TetR-like transcriptional regulator, respectively. The transcription of the emrA gene was found to be inducible by flavonoids, including luteolin and apigenin, which are known inducers of the nodulation genes in S. meliloti. Interestingly, quercetin, which does not induce nodulation genes, was also a potent inducer of emrA, indicating that NodD is not directly involved in regulation of emrA. The likely regulator of emrAB is EmrR, which binds to palindrome-like sequences in the intergenic region. Several modifications of the palindromes, including an increase of the spacing between the two half sites, prevented binding of EmrR. Binding was also impaired by the presence of luteolin. Mutations in emrA had no obvious effect on symbiosis. This was in contrast to the emrR mutant, which exhibited a symbiotic deficiency with Medicago sativa. Conserved binding sites for TetR-like regulators within the intergenic regions between the emrAB and emrR genes were identified in many symbiotic and pathogenic members of the order Rhizobiales.
Collapse
|
6
|
Baudet M, Ortet P, Gaillard JC, Fernandez B, Guérin P, Enjalbal C, Subra G, de Groot A, Barakat M, Dedieu A, Armengaud J. Proteomics-based refinement of Deinococcus deserti genome annotation reveals an unwonted use of non-canonical translation initiation codons. Mol Cell Proteomics 2009; 9:415-26. [PMID: 19875382 PMCID: PMC2830850 DOI: 10.1074/mcp.m900359-mcp200] [Citation(s) in RCA: 79] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Deinococcaceae are a family of extremely radiation-tolerant bacteria that are currently subjected to numerous studies aimed at understanding the molecular mechanisms for such radiotolerance. To achieve a comprehensive and accurate annotation of the Deinococcus deserti genome, we performed an N terminus-oriented characterization of its proteome. For this, we used a labeling reagent, N-tris(2,4,6-trimethoxyphenyl)phosphonium acetyl succinimide, to selectively derivatize protein N termini. The large scale identification of N-tris(2,4,6-trimethoxyphenyl)phosphonium acetyl succinimide-modified N-terminal-most peptides by shotgun liquid chromatography-tandem mass spectrometry analysis led to the validation of 278 and the correction of 73 translation initiation codons in the D. deserti genome. In addition, four new genes were detected, three located on the main chromosome and one on plasmid P3. We also analyzed signal peptide cleavages on a genome-wide scale. Based on comparative proteogenomics analysis, we propose a set of 137 corrections to improve Deinococcus radiodurans and Deinococcus geothermalis gene annotations. Some of these corrections affect important genes involved in DNA repair mechanisms such as polA, ligA, and ddrB. Surprisingly, experimental evidences were obtained indicating that DnaA (the protein involved in the DNA replication initiation process) and RpsL (the S12 ribosomal conserved protein) translation is initiated in Deinococcaceae from non-canonical codons (ATC and CTG, respectively). Such use may be the basis of specific regulation mechanisms affecting replication and translation. We also report the use of non-conventional translation initiation codons for two other genes: Deide_03051 and infC. Whether such use of non-canonical translation initiation codons is much more frequent than for other previously reported bacterial phyla or restricted to Deinococcaceae remains to be investigated. Our results demonstrate that predicting translation initiation codons is still difficult for some bacteria and that proteomics-based refinement of genome annotations may be helpful in such cases.
Collapse
Affiliation(s)
- Mathieu Baudet
- Laboratoire de Biochimie des Systèmes Perturbés, Service de Biochimie et Toxicologie Nucléaire, Institut de Biologie Environnementale et Biotechnologie (iBEB), Direction des Sciences du Vivant (DSV), Commissariat à l'Energie Atomique et aux Energies Alternatives (CEA), F-30207 Bagnols-sur-Cèze, France
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
|
8
|
Gassel M, Möllenkamp T, Puppe W, Altendorf K. The KdpF subunit is part of the K(+)-translocating Kdp complex of Escherichia coli and is responsible for stabilization of the complex in vitro. J Biol Chem 1999; 274:37901-7. [PMID: 10608856 DOI: 10.1074/jbc.274.53.37901] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The kdpABC operon codes for the high affinity K(+)-translocating Kdp complex (P-type ATPase) of Escherichia coli. Upon expression of this operon in minicells, a so far unrecognized small hydrophobic polypeptide, KdpF, could be identified on high resolution SDS-polyacrylamide gels in addition to the subunits KdpA, KdpB, and KdpC. Furthermore, it could be demonstrated that KdpF remains associated with the purified complex. As determined by mass spectrometry, this peptide is present in its formylated form and has a molecular mass of 3100 Da. KdpF is not essential for growth on low K(+) (0.1 mM) medium, as shown by deletion analysis of kdpF, but proved to be indispensable for a functional enzyme complex in vitro. In the absence of KdpF, the ATPase activity of the membrane-bound Kdp complex was almost indistinguishable from that of the wild type. In contrast, the purified detergent-solubilized enzyme complex showed a dramatic decrease in enzymatic activity. However, addition of purified KdpF to the KdpABC complex restored the activity up to wild type level. It is interesting to note that the addition of high amounts of E. coli lipids had a similar effect. Although KdpF is not essential for the function of the Kdp complex in vivo, it is part of the complex and functions as a stabilizing element in vitro. The corresponding operon should now be referred to as kdpFABC.
Collapse
Affiliation(s)
- M Gassel
- Universität Osnabrück, Fachbereich Biologie/Chemie, Abteilung Mikrobiologie, D-49069 Osnabrück, Germany
| | | | | | | |
Collapse
|
9
|
Abstract
Some statistical properties of open reading frames in all currently available complete genome sequences are analyzed (seventeen prokatyotic genomes, and 16 chromosome sequences from the yeast genome). The size distribution of open reading frames is characterized by various techniques, such as quantile tables, QQ-plots, rank-size plots (Zipf's plots), and spatial densities. The issue of the influence of CG% on the size distribution is addressed. When yeast chromosomes are compared with archaeal and eubacterial genomes, they tend to have more long open reading frames. There is little or no evidence to reject the null hypothesis that open reading frames on six different reading frames and two strands distribute similarly. A topic of current interest, the base composition asymmetry in open reading frames between the two strands, is studied using regression analysis. The base composition asymmetry at three codon positions is analyzed separately. It was shown in these genome sequences that the first codon position is G- and A-rich (i.e. purine-rich); there is a co-existence of A- and T-rich branches at the second codon position; and the third codon position is weakly T-rich.
Collapse
Affiliation(s)
- W Li
- Laboratory of Statistical Genetics, Rockefeller University, New York, NY 10021, USA.
| |
Collapse
|
10
|
Abstract
Partial sequencing of a genomic library of the virulence plasmid of Salmonella enteritidis has been used to localize in the restriction map of this plasmid the genetic loci already described in other Salmonella plasmids. The comparison of the vestigial tra region with the corresponding genes in the F plasmid allowed us to define the extent of the deletions that the S. enteritidis plasmid should have suffered. The putative replicons of the plasmid, repB and repC, were isolated and both proved to be functional in Escherichia coli, but repC was segregationally unstable. The nucleotide sequence of repB showed the typical organization of RepFIIA replicons, although the similarity was lower than usual in this group of replicons. The highest homology was found with the replicon of the virulence plasmid pYVe439-80 from Yersinia enterocolitica (72.5%). Replicon repC also showed a maximum identity of 72.6% with known replicons, namely the RepFIB of pColV-K30 and P307, both virulence plasmids isolated from E. coli. We conclude that the S. enteritidis plasmid could arise from the S. typhimurium plasmid through deletions, and that they are evolutionary distant from other IncFI and IncFII plasmids.
Collapse
Affiliation(s)
- J M Rodríguez-Peña
- Departamento de Microbiologia II, Facultad de Farmacia, Universidad Complutense, Madrid, Spain
| | | | | | | |
Collapse
|
11
|
Abstract
Clinical isolates of Escherichia coli were found to host a multiplicity of plasmids. These were resolved from plasmid gel profiles, from the properties of various transconjugants and transformants of E. coli DH1, by the topoisomerase I relaxation of covalently closed circle plasmid DNA, by electron microscopy, and by the determination of their compatibilities. The majority of these were unusually small, cryptic plasmids (SCPs). From one strain, KL4, 13 electrophoretic bands were resolved to five plasmids, three of which were SCPs. SCPs were phenotypically barren, and the smallest of these, pKL1, contained barely enough information for self-replication. A derivative of pKL1, pKL1Km, in which the transposon was restricted to a small 350-bp region, was stably maintained in Shigella, Salmonella, Serratia, and Citrobacter species and its replication was polA independent. pKL1 encoded only a single protein, RepA (Mr 17960), which specifically bound to pKL1 DNA. No apparent homologies with other RepA protein sequences could be detected. Thus the SCP, pKL1, is a novel minimal plasmid replicon encoding only enough information to ensure perpetuation. A hypothesis is presented describing SCPs as a class of selfish DNA that persists simply due to its ability to replicate and to its stability based on high copy number.
Collapse
Affiliation(s)
- J Burian
- Faculty of Natural Sciences, Comenius University, Mlynská dolina B-2, Bratislava, Slovakia
| | | | | | | |
Collapse
|
12
|
Affiliation(s)
- D K Chattoraj
- Laboratory of Biochemistry NCI, NIH Bethesda, Maryland 20892, USA
| | | |
Collapse
|
13
|
Spiers AJ, Bhana N, Bergquist PL. Regulatory interactions between RepA, an essential replication protein, and the DNA repeats of RepFIB from plasmid P307. J Bacteriol 1993; 175:4016-24. [PMID: 8320218 PMCID: PMC204830 DOI: 10.1128/jb.175.13.4016-4024.1993] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
The control of RepFIB replication appears to rely on the interaction between an initiator protein (RepA) and two sets of DNA repeat elements located on either side of the repA gene (BCDD'D" and EFGHIJ). In vivo genetic tests demonstrate that the BCDD'D" repeats form part of the origin of replication, while some of the downstream repeat elements (HIJ) are involved in the sensing and setting of plasmid copy number. RepA DNA binding to these groups of repeats has been investigated in vivo by utilizing the fact that the replicon contains three active promoters (orip, repAp, and EFp), one of which has previously been shown to control the expression of repA (repAp). All three promoters are closely associated with the repeat elements flanking repA, and an investigation using lacZ or cml gene fusions has demonstrated that RepA expressed in trans is able to repress each promoter. However, these assays suggest that the transcriptional responses of orip and repAp to RepA repression are significantly different, despite the fact that both promoters are embedded within the BCDD'D" repeat elements. Extra copies of the BCDD'D" or EFG repeats in trans have no effect on RepA repression of repAp embedded in a second copy of the BCDD'D" repeats, but copies of the HIJ or EFGHIJ repeats are able to derepress repAp, suggesting that there is a fundamental difference between RepA-BCDD'D" or -HIJ complexes and RepA-EFG or -EFGHIJ complexes.
Collapse
Affiliation(s)
- A J Spiers
- Centre for Gene Technology, University of Auckland, New Zealand
| | | | | |
Collapse
|