1
|
Evolutionary and functional insights into the Ski2-like helicase family in Archaea: a comparison of Thermococcales ASH-Ski2 and Hel308 activities. NAR Genom Bioinform 2024; 6:lqae026. [PMID: 38500564 PMCID: PMC10946056 DOI: 10.1093/nargab/lqae026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 02/19/2024] [Accepted: 02/27/2024] [Indexed: 03/20/2024] Open
Abstract
RNA helicases perform essential housekeeping and regulatory functions in all domains of life by binding and unwinding RNA molecules. The Ski2-like proteins are primordial helicases that play an active role in eukaryotic RNA homeostasis pathways, with multiple homologs having specialized functions. The significance of the expansion and diversity of Ski2-like proteins in Archaea, the third domain of life, has not yet been established. Here, by studying the phylogenetic diversity of Ski2-like helicases among archaeal genomes and the enzymatic activities of those in Thermococcales, we provide further evidence of the function of this protein family in archaeal metabolism of nucleic acids. We show that, in the course of evolution, ASH-Ski2 and Hel308-Ski2, the two main groups of Ski2-like proteins, have diverged in their biological functions. Whereas Hel308 has been shown to mainly act on DNA, we show that ASH-Ski2, previously described to be associated with the 5'-3' aRNase J exonuclease, acts on RNA by supporting an efficient annealing activity, but also an RNA unwinding with a 3'-5' polarity. To gain insights into the function of Ski2, we also analyse the transcriptome of Thermococcus barophilus ΔASH-Ski2 mutant strain and provide evidence of the importance of ASH-Ski2 in cellular metabolism pathways related to translation.
Collapse
|
2
|
The NagY regulator: A member of the BglG/SacY antiterminator family conserved in Enterococcus faecalis and involved in virulence. Front Microbiol 2022; 13:1070116. [PMID: 36875533 PMCID: PMC9981650 DOI: 10.3389/fmicb.2022.1070116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 12/21/2022] [Indexed: 02/19/2023] Open
Abstract
Enterococcus faecalis is a commensal bacterium of the gastrointestinal tract but also a major nosocomial pathogen. This bacterium uses regulators like BglG/SacY family of transcriptional antiterminators to adapt its metabolism during host colonization. In this report, we investigated the role of the BglG/SacY family antiterminator NagY in the regulation of the nagY-nagE operon in presence of N-acetylglucosamine, with nagE encoding a transporter of this carbohydrate, as well as the expression of the virulence factor HylA. We showed that this last protein is involved in biofilm formation and glycosaminoglycans degradation that are important features in bacterial infection, confirmed in the Galleria mellonella model. In order to elucidate the evolution of these actors, we performed phylogenomic analyses on E. faecalis and Enterococcaceae genomes, identified orthologous sequences of NagY, NagE, and HylA, and we report their taxonomic distribution. The study of the conservation of the upstream region of nagY and hylA genes showed that the molecular mechanism of NagY regulation involves ribonucleic antiterminator sequence overlapping a rho-independent terminator, suggesting a regulation conforming to the canonical model of BglG/SacY family antiterminators. In the perspective of opportunism understanding, we offer new insights into the mechanism of host sensing thanks to the NagY antiterminator and its targets expression.
Collapse
|
3
|
RNA processing machineries in Archaea: the 5'-3' exoribonuclease aRNase J of the β-CASP family is engaged specifically with the helicase ASH-Ski2 and the 3'-5' exoribonucleolytic RNA exosome machinery. Nucleic Acids Res 2020; 48:3832-3847. [PMID: 32030412 PMCID: PMC7144898 DOI: 10.1093/nar/gkaa052] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Revised: 01/14/2020] [Accepted: 01/23/2020] [Indexed: 01/22/2023] Open
Abstract
A network of RNA helicases, endoribonucleases and exoribonucleases regulates the quantity and quality of cellular RNAs. To date, mechanistic studies focussed on bacterial and eukaryal systems due to the challenge of identifying the main drivers of RNA decay and processing in Archaea. Here, our data support that aRNase J, a 5'-3' exoribonuclease of the β-CASP family conserved in Euryarchaeota, engages specifically with a Ski2-like helicase and the RNA exosome to potentially exert control over RNA surveillance, at the vicinity of the ribosome. Proteomic landscapes and direct protein-protein interaction analyses, strengthened by comprehensive phylogenomic studies demonstrated that aRNase J interplay with ASH-Ski2 and a cap exosome subunit. Finally, Thermococcus barophilus whole-cell extract fractionation experiments provide evidences that an aRNase J/ASH-Ski2 complex might exist in vivo and hint at an association of aRNase J with the ribosome that is emphasised in absence of ASH-Ski2. Whilst aRNase J homologues are found among bacteria, the RNA exosome and the Ski2-like RNA helicase have eukaryotic homologues, underlining the mosaic aspect of archaeal RNA machines. Altogether, these results suggest a fundamental role of β-CASP RNase/helicase complex in archaeal RNA metabolism.
Collapse
|
4
|
J-Domain Proteins in Bacteria and Their Viruses. J Mol Biol 2020; 432:3771-3789. [DOI: 10.1016/j.jmb.2020.04.014] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2020] [Revised: 04/10/2020] [Accepted: 04/14/2020] [Indexed: 02/06/2023]
|
5
|
Insights into RNA-processing pathways and associated RNA-degrading enzymes in Archaea. FEMS Microbiol Rev 2018; 42:579-613. [PMID: 29684129 DOI: 10.1093/femsre/fuy016] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2018] [Accepted: 04/17/2018] [Indexed: 12/20/2022] Open
Abstract
RNA-processing pathways are at the centre of regulation of gene expression. All RNA transcripts undergo multiple maturation steps in addition to covalent chemical modifications to become functional in the cell. This includes destroying unnecessary or defective cellular RNAs. In Archaea, information on mechanisms by which RNA species reach their mature forms and associated RNA-modifying enzymes are still fragmentary. To date, most archaeal actors and pathways have been proposed in light of information gathered from Bacteria and Eukarya. In this context, this review provides a state of the art overview of archaeal endoribonucleases and exoribonucleases that cleave and trim RNA species and also of the key small archaeal proteins that bind RNAs. Furthermore, synthetic up-to-date views of processing and biogenesis pathways of archaeal transfer and ribosomal RNAs as well as of maturation of stable small non-coding RNAs such as CRISPR RNAs, small C/D and H/ACA box guide RNAs, and other emerging classes of small RNAs are described. Finally, prospective post-transcriptional mechanisms to control archaeal messenger RNA quality and quantity are discussed.
Collapse
|
6
|
Dynamic Modeling of Streptococcus pneumoniae Competence Provides Regulatory Mechanistic Insights Into Its Tight Temporal Regulation. Front Microbiol 2018; 9:1637. [PMID: 30087661 PMCID: PMC6066662 DOI: 10.3389/fmicb.2018.01637] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Accepted: 06/30/2018] [Indexed: 12/31/2022] Open
Abstract
In the human pathogen Streptococcus pneumoniae, the gene regulatory circuit leading to the transient state of competence for natural transformation is based on production of an auto-inducer that activates a positive feedback loop. About 100 genes are activated in two successive waves linked by a central alternative sigma factor ComX. This mechanism appears to be fundamental to the biological fitness of S. pneumoniae. We have developed a knowledge-based model of the competence cycle that describes average cell behavior. It reveals that the expression rates of the two competence operons, comAB and comCDE, involved in the positive feedback loop must be coordinated to elicit spontaneous competence. Simulations revealed the requirement for an unknown late com gene product that shuts of competence by impairing ComX activity. Further simulations led to the predictions that the membrane protein ComD bound to CSP reacts directly to pH change of the medium and that blindness to CSP during the post-competence phase is controlled by late DprA protein. Both predictions were confirmed experimentally.
Collapse
|
7
|
Single-strand DNA processing: phylogenomics and sequence diversity of a superfamily of potential prokaryotic HuH endonucleases. BMC Genomics 2018; 19:475. [PMID: 29914351 PMCID: PMC6006769 DOI: 10.1186/s12864-018-4836-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2018] [Accepted: 05/29/2018] [Indexed: 12/11/2022] Open
Abstract
Background Some mobile genetic elements target the lagging strand template during DNA replication. Bacterial examples are insertion sequences IS608 and ISDra2 (IS200/IS605 family members). They use obligatory single-stranded circular DNA intermediates for excision and insertion and encode a transposase, TnpAIS200, which recognizes subterminal secondary structures at the insertion sequence ends. Similar secondary structures, Repeated Extragenic Palindromes (REP), are present in many bacterial genomes. TnpAIS200-related proteins, TnpAREP, have been identified and could be responsible for REP sequence proliferation. These proteins share a conserved HuH/Tyrosine core domain responsible for catalysis and are involved in processes of ssDNA cleavage and ligation. Our goal is to characterize the diversity of these proteins collectively referred as the TnpAY1 family. Results A genome-wide analysis of sequences similar to TnpAIS200 and TnpAREP in prokaryotes revealed a large number of family members with a wide taxonomic distribution. These can be arranged into three distinct classes and 12 subclasses based on sequence similarity. One subclass includes sequences similar to TnpAIS200. Proteins from other subclasses are not associated with typical insertion sequence features. These are characterized by specific additional domains possibly involved in protein/DNA or protein/protein interactions. Their genes are found in more than 25% of species analyzed. They exhibit a patchy taxonomic distribution consistent with dissemination by horizontal gene transfers followed by loss. The tnpAREP genes of five subclasses are flanked by typical REP sequences in a REPtron-like arrangement. Four distinct REP types were characterized with a subclass specific distribution. Other subclasses are not associated with REP sequences but have a large conserved domain located in C-terminal end of their sequence. This unexpected diversity suggests that, while most likely involved in processing single-strand DNA, proteins from different subfamilies may play a number of different roles. Conclusions We established a detailed classification of TnpAY1 proteins, consolidated by the analysis of the conserved core domains and the characterization of additional domains. The data obtained illustrate the unexpected diversity of the TnpAY1 family and provide a strong framework for future evolutionary and functional studies. By their potential function in ssDNA editing, they may confer adaptive responses to host cell physiology and metabolism. Electronic supplementary material The online version of this article (10.1186/s12864-018-4836-1) contains supplementary material, which is available to authorized users.
Collapse
|
8
|
Bacterial transformation: distribution, shared mechanisms and divergent control. Nat Rev Microbiol 2014; 12:181-96. [DOI: 10.1038/nrmicro3199] [Citation(s) in RCA: 402] [Impact Index Per Article: 40.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
|
9
|
Diversity of β-lactam resistance mechanisms in cystic fibrosis isolates of Pseudomonas aeruginosa: a French multicentre study. J Antimicrob Chemother 2013; 68:1763-71. [PMID: 23629014 DOI: 10.1093/jac/dkt115] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
OBJECTIVES To investigate the resistance mechanisms of β-lactam-resistant Pseudomonas aeruginosa isolated from cystic fibrosis (CF) patients in France. METHODS Two-hundred-and-four P. aeruginosa CF isolates were collected in 10 French university hospitals in 2007. Their susceptibility to 14 antibiotics and their resistance mechanisms to β-lactams were investigated. Their β-lactamase contents were characterized by isoelectric focusing, PCR and enzymatic assays. Expression levels of efflux pumps and the intrinsic β-lactamase AmpC were quantified by reverse transcription real-time quantitative PCR. Genotyping was performed using multiple-locus variable number of tandem repeats analysis (MLVA). The oprD genes were sequenced and compared with those of reference P. aeruginosa strains. To assess deficient OprD production, western blotting experiments were carried out on outer membrane preparations. RESULTS MLVA typing discriminated 131 genotypes and 47 clusters. One-hundred-and-twenty-four isolates (60.8%) displayed a susceptible phenotype to β-lactams according to EUCAST breakpoints. In the 80 remaining isolates, resistance to β-lactams resulted from derepression of intrinsic cephalosporinase AmpC (61.3%) and/or acquisition of secondary β-lactamases (13.8%). Efflux pumps were up-regulated in 88.8% of isolates and porin OprD was lost in 53.8% of isolates due to frameshifting or nonsense mutations in the oprD gene. CONCLUSIONS β-Lactam resistance rates are quite high in CF strains of P. aeruginosa isolated in France and not really different from those reported for nosocomial strains. Development of β-lactam resistance is correlated with patient age. It results from intrinsic mechanisms sequentially accumulated by bacteria isolated from patients who have undergone repeated courses of chemotherapy.
Collapse
|
10
|
Structuring the bacterial genome: Y1-transposases associated with REP-BIME sequences. Nucleic Acids Res 2011; 40:3596-609. [PMID: 22199259 PMCID: PMC3333891 DOI: 10.1093/nar/gkr1198] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
REPs are highly repeated intergenic palindromic sequences often clustered into structures called BIMEs including two individual REPs separated by short linker of variable length. They play a variety of key roles in the cell. REPs also resemble the sub-terminal hairpins of the atypical IS200/605 family of insertion sequences which encode Y1 transposases (TnpA(IS200/IS605)). These belong to the HUH endonuclease family, carry a single catalytic tyrosine (Y) and promote single strand transposition. Recently, a new clade of Y1 transposases (TnpA(REP)) was found associated with REP/BIME in structures called REPtrons. It has been suggested that TnpA(REP) is responsible for REP/BIME proliferation over genomes. We analysed and compared REP distribution and REPtron structure in numerous available E. coli and Shigella strains. Phylogenetic analysis clearly indicated that tnpA(REP) was acquired early in the species radiation and was lost later in some strains. To understand REP/BIME behaviour within the host genome, we also studied E. coli K12 TnpA(REP) activity in vitro and demonstrated that it catalyses cleavage and recombination of BIMEs. While TnpA(REP) shared the same general organization and similar catalytic characteristics with TnpA(IS200/IS605) transposases, it exhibited distinct properties potentially important in the creation of BIME variability and in their amplification. TnpA(REP) may therefore be one of the first examples of transposase domestication in prokaryotes.
Collapse
|
11
|
The cyst-dividing bacterium Ramlibacter tataouinensis TTB310 genome reveals a well-stocked toolbox for adaptation to a desert environment. PLoS One 2011; 6:e23784. [PMID: 21912644 PMCID: PMC3164672 DOI: 10.1371/journal.pone.0023784] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2011] [Accepted: 07/25/2011] [Indexed: 11/22/2022] Open
Abstract
Ramlibacter tataouinensis TTB310(T) (strain TTB310), a betaproteobacterium isolated from a semi-arid region of South Tunisia (Tataouine), is characterized by the presence of both spherical and rod-shaped cells in pure culture. Cell division of strain TTB310 occurs by the binary fission of spherical "cyst-like" cells ("cyst-cyst" division). The rod-shaped cells formed at the periphery of a colony (consisting mainly of cysts) are highly motile and colonize a new environment, where they form a new colony by reversion to cyst-like cells. This unique cell cycle of strain TTB310, with desiccation tolerant cyst-like cells capable of division and desiccation sensitive motile rods capable of dissemination, appears to be a novel adaptation for life in a hot and dry desert environment. In order to gain insights into strain TTB310's underlying genetic repertoire and possible mechanisms responsible for its unusual lifestyle, the genome of strain TTB310 was completely sequenced and subsequently annotated. The complete genome consists of a single circular chromosome of 4,070,194 bp with an average G+C content of 70.0%, the highest among the Betaproteobacteria sequenced to date, with total of 3,899 predicted coding sequences covering 92% of the genome. We found that strain TTB310 has developed a highly complex network of two-component systems, which may utilize responses to light and perhaps a rudimentary circadian hourglass to anticipate water availability at the dew time in the middle/end of the desert winter nights and thus direct the growth window to cyclic water availability times. Other interesting features of the strain TTB310 genome that appear to be important for desiccation tolerance, including intermediary metabolism compounds such as trehalose or polyhydroxyalkanoate, and signal transduction pathways, are presented and discussed.
Collapse
|
12
|
The PprA-PprB two-component system activates CupE, the first non-archetypal Pseudomonas aeruginosa chaperone-usher pathway system assembling fimbriae. Environ Microbiol 2010; 13:666-83. [DOI: 10.1111/j.1462-2920.2010.02372.x] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
|
13
|
SpxA1, a novel transcriptional regulator involved in X-state (competence) development in Streptococcus pneumoniae. Mol Microbiol 2009; 73:492-506. [PMID: 19627499 DOI: 10.1111/j.1365-2958.2009.06789.x] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Streptococcus pneumoniae is a naturally transformable human pathogen. Genome and phylogenetic analyses uncovered two Spx-like global transcriptional regulators, SpxA1 and SpxA2, encoded by S. pneumoniae. spxA1 and spxA2 are not essential, but their simultaneous inactivation is lethal. SpxA1 represses transcription of the early competence operon comCDE and thereby negatively regulates the initiation of the X-state (competence). The molecular basis of this repression could be similar to that of SpxA of Bacillus subtilis, involving a specific interaction with the alpha subunit of RNA polymerase. S. pneumoniae lacks an SOS-like stress response and the X-state is proposed to be a general stress response mechanism in this species. In light of this, SpxA1-dependent repression could act to sense environmental or metabolic stresses and prevent launching of the X-state in the absence of stress.
Collapse
|
14
|
A key presynaptic role in transformation for a widespread bacterial protein: DprA conveys incoming ssDNA to RecA. Cell 2007; 130:824-36. [PMID: 17803906 DOI: 10.1016/j.cell.2007.07.038] [Citation(s) in RCA: 169] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2007] [Revised: 05/25/2007] [Accepted: 07/13/2007] [Indexed: 11/15/2022]
Abstract
Natural transformation is a mechanism for genetic exchange in many bacterial genera. It proceeds through the uptake of exogenous DNA and subsequent homology-dependent integration into the genome. In Streptococcus pneumoniae, this integration requires the ubiquitous recombinase, RecA, and DprA, a protein of unknown function widely conserved in bacteria. To unravel the role of DprA, we have studied the properties of the purified S. pneumoniae protein and its Bacillus subtilis ortholog (Smf). We report that DprA and Smf bind cooperatively to single-stranded DNA (ssDNA) and that these proteins both self-interact and interact with RecA. We demonstrate that DprA-RecA-ssDNA filaments are produced and that these filaments catalyze the homology-dependent formation of joint molecules. Finally, we show that while the Escherichia coli ssDNA-binding protein SSB limits access of RecA to ssDNA, DprA lowers this barrier. We propose that DprA is a new member of the recombination-mediator protein family, dedicated to natural bacterial transformation.
Collapse
|
15
|
Abstract
UNLABELLED We present a graphical tool dedicated to the exploration of bacterial genome rearrangements. The principle of this exploration relies on the reconstruction of ancestral genomes at each internal node of a gene-order-based phylogenetic tree. This tool allows the selection of internal nodes to visualize the rearrangements between the inferred chromosome of this node and its direct descendant on the tree. AVAILABILITY PEGR is available at the Genopole Toulouse Bioinformatics platform.
Collapse
|
16
|
Independent evolution of competence regulatory cascades in streptococci? Trends Microbiol 2006; 14:339-45. [PMID: 16820295 DOI: 10.1016/j.tim.2006.06.007] [Citation(s) in RCA: 96] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2006] [Revised: 05/11/2006] [Accepted: 06/15/2006] [Indexed: 02/02/2023]
Abstract
Natural genetic transformation is a mechanism of horizontal gene transfer that is widely distributed in bacteria and requires assembly of a DNA uptake machinery. Transformable bacteria use fundamentally the same machine, which in most species is assembled only in cells that are developing competence. Competence regulation usually differs between unrelated species. Here, we examine whether related streptococci use the same competence regulatory cascade. Phylogenetic analyses of streptococcal genome sequences reveal the existence of two paralogous two-component regulatory systems, either of which might control competence. This suggests the distribution of streptococci into two groups that use competence regulatory cascades that have at least partly evolved independently. Comparison of data obtained with two transformable streptococci, Streptococcus pneumoniae and Streptococcus mutans, provides support to this suggestion.
Collapse
|
17
|
ABCdb: an online resource for ABC transporter repertories from sequenced archaeal and bacterial genomes. FEMS Microbiol Lett 2006; 256:333-9. [PMID: 16499625 DOI: 10.1111/j.1574-6968.2006.00139.x] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
The ATP-binding cassette (ABC) transporters are one of the major classes of active transporters. They are widespread in archaea, bacteria, and eukaryota, indicating that they have arisen early in evolution. They are involved in many essential physiological processes, but the majority import or export a wide variety of compounds across cellular membranes. These systems share a common architecture composed of four (exporters) or five (importers) domains. To identify and reconstruct functional ABC transporters encoded by archaeal and bacterial genomes, we have developed a bioinformatic strategy. Cross-reference to the transport classification system is used to predict the type of compound transported. A high quality of annotation is achieved by manual verification of the predictions. However, in order to face the rapid increase in the number of published genomes, we also include analyses of genomes issuing directly from the automated strategy. Querying the database (http://www-abcdb.biotoul.fr) allows to easily retrieve ABC transporter repertories and related data. Additional query tools have been developed for the analysis of the ABC family from both functional and evolutionary perspectives.
Collapse
|
18
|
ISYMOD: a knowledge warehouse for the identification, assembly and analysis of bacterial integrated systems. Bioinformatics 2004; 21:1246-56. [PMID: 15531617 DOI: 10.1093/bioinformatics/bti137] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Complex biological functions emerge from interactions between proteins in stable supra-molecular assemblies and/or through transitory contacts. Most of the time protein partners of the assemblies are composed of one or several domains which exhibit different biochemical functions. Thus the study of cellular process requires the identification of different functional units and their integration in an interaction network; such complexes are referred to as integrated systems. In order to exploit with optimum efficiency the increased release of data, automated bioinformatics strategies are needed to identify, reconstruct and model such systems. For that purpose, we have developed a knowledge warehouse dedicated to the representation and acquisition of bacterial integrated systems involved in the exchange of the bacterial cell with its environment. RESULTS ISYMOD is a knowledge warehouse that consistently integrates in the same environment the data and the methods used for their acquisition. This is achieved through the construction of (1) a domain knowledge base (DKB) devoted to the storage of the knowledge about the systems, their functional specificities, their partners and how they are related and (2) a methodological knowledge base (MKB) which depicts the task layout used to identify and reconstruct functional integrated systems. Instantiation of the DKB is obtained by solving the tasks of the MKB, whereas some tasks need instances of the DKB to be solved. AROM, an object-based knowledge representation system, has been used to design the DKB, and its task manager, AROMTasks, for developing the MKB. In this study two integrated systems, ABC transporters and two component systems, both involved in adaptation processes of a bacterial cell to its biotope, have been used to evaluate the feasibility of the approach.
Collapse
|
19
|
Abstract
The action of sialyltransferases (STs) on cell surface glycoconjugates is a key process in shaping cell phenotype in a variety of cells mostly involved in migratory and adhesive pathways. The factors determining cell-specific pattern of glycosylation are so far poorly understood. Most STs are resident proteins of the Golgi apparatus, where acceptors are sialylated while they are in transit to the cell surface. To identify putative structural features that may account for their acceptor preference, we analyzed 53 cloned animal and human STs. We could identify conserved regions and peptide motifs representative of ST subfamilies, located at the C-terminal end of the hypervariable region upstream from the L-sialyl motif. Residues 93-100 in human ST6Gal I (hST6Gal I) were shown to be crucial for enzymatic activity when deleted and expressed in CHO cells. The Delta100 hST6Gal I mutant protein was fully recognized by polyclonal anti-hST6Gal I antibodies and followed the intracellular secretory pathway. This indicated that the conserved QVWxKDS sequence is essential for the whole catalytic domain to acquire a biologically active conformation. When full-length epitope-tagged hST6Gal I and hST6GalNAc I constructs were transfected in CHO cells, the alpha-2,6 sialylated glycotope was found to be largely restricted to intracellular resident acceptors and enzymatic activity based on fluorescent lectin staining. In contrast, both enzymes deprived of their membrane anchor and part of the hypervariable region but still possessing the conserved domains exhibited a very efficient transfer of sialic acid to cell surface glycoconjugates. Colocalization of the ST6Gal I mutant proteins with early and late Golgi markers such as giantin or rab6 proteins confirmed that soluble STs migrate forward in these subcompartments where they can act upon newly synthesized acceptors and follow the secretory pathway. It is thus concluded that downstream from the transmembrane domain, native STs possess peptide sequences that allow them to sialylate glycoprotein acceptors selectively along their transit within Golgi stacks.
Collapse
|
20
|
Regulatory relationship of two-component and ABC transport systems and clustering of their genes in the Bacillus/Clostridium group, suggest a functional link between them. J Mol Microbiol Biotechnol 2002; 4:503-13. [PMID: 12432961] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/27/2023] Open
Abstract
On the Bacillus subtilis chromosome there are five examples of genes encoding two-component systems with response regulators of the OmpR family adjacent to genes encoding sub-family 9 ABC transport systems. Three of these (yts, yvc, yxd) are very similar in gene organization and in sequence. We demonstrate that the TCS and ABC transporter genes do not belong to the same transcriptional unit. The ABC transport and TCS systems are functionally linked, each response regulator controlling the expression of its cognate ABC transporter genes but not its own. Analysis of 48 bacterial genomes revealed that such family clusters only exist in the Bacillus/Clostridium group. Evolutionary analyses indicated that almost all clustered OmpR response regulators constitute two groups ("GI" and "GIIl") whereas almost all clustered sub-family 9 nucleotide-binding domains belong to two other groups ("9A" and "9B"). Interestingly, there is a mutually exclusive clustering between genes encoding "GI" or a "GII" response regulators and genes encoding "9A" or a "9B" nucleotide binding proteins. We propose that a two-component system and its cognate ABC transporter genes have evolved as a unit in Bacillus/Clostridium, both systems participating in a common physiological process.
Collapse
|
21
|
Strategies for the identification, the assembly and the classification of integrated biological systems in completely sequenced genomes. COMPUTERS & CHEMISTRY 2002; 26:447-57. [PMID: 12144175 DOI: 10.1016/s0097-8485(02)00007-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The proteins involved in a single biological process may form a stable supra-molecular assembly or be transiently in interaction. Although, the first annotation steps of a complete genome may allow the identification of the different partners, their assembly in a functional system, referred to as an integrated system, is a domain where methodological effort has to be done. Indeed, the knowledge required to assemble partners of such systems should be explicitly included in annotation software. The availability of a complete genome, and therefore of all the proteins encoded by that genome, motivated the development of automated approaches through the coordinated combination of different bio-informatic methods allowing the identification of the different partners, their assembly and the classification of the reconstructed systems in functional categories. In this data flux, the identification of the sequence partners represents the principal bottleneck. Here, we describe and compare the results obtained with different classes of methods (BLASTP2, PSI-BLAST, MAST and META-MEME) applied to the identification in complete genomes of a given family of integrated systems: the ABC transporters. PSI-BLAST appears to significantly outperform motif-based methods, and the results are discussed according to the nature of the proteins and the structure of the sub-families.
Collapse
|
22
|
ABCdb: an ABC transporter database. J Mol Microbiol Biotechnol 2000; 2:501-4. [PMID: 11075924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023] Open
Abstract
We present the first release of a database devoted to the ATP-binding cassette (ABC) protein domains (ABCdb). The ABC proteins are involved in a wide variety of physiological processes in Archea, Bacteria and Eucaryota where they are encoded by large families of paralogous genes. The majority of ABC domains energize the transport of compounds across the membranes. In bacteria, ABC transporters are involved in the uptake of a wide range of molecules and in mechanisms of virulence and antibiotic resistance. In eukaryotes, most of them are involved in drug resistance and in human cells, many are associated with diseases. Sequence analysis reveals that members of the ABC superfamily can be organized into sub-families and suggests that they have diverged from common ancestral forms. In this release, ABCdb includes the inventory and assembly of the ABC transporter systems of completely sequenced genomes. In addition to the protein entries, the database comprises information on functional domains, sequence motifs, predicted trans-membrane segments, and signal peptides. It also includes a classification in sub-families of the ABC systems as well as a classification of the different partners of the systems. Evolutionary trees and specific sequence patterns are provided for each sub-family. The database is endowed with a powerful query system and it was interfaced with blastP2 program for similarity searches. ABCdb has been developed in the ACeDB format, a database system developed by Jean Thierry-Mieg and Richard Durbin. ABCdb can be accessed via the World Wide Web (http://ir2lcb.cnrs-mrs.fr/ABCdb/).
Collapse
|
23
|
Bacterial twin-arginine signal peptide-dependent protein translocation pathway: evolution and mechanism. J Mol Microbiol Biotechnol 2000; 2:179-89. [PMID: 10939242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023] Open
Abstract
The recently identified bacterial Tat pathway is capable of exporting proteins with a peculiar twin-arginine signal peptide in folded conformation independently of the Sec machinery. It is structurally and mechanistically similar to the delta pH-dependent pathway used for importing chloroplast proteins into the thylakoid. The tat genes are not ubiquitously present and are absent from half of the completely sequenced bacterial genomes. The presence of the tat genes seems to correlate with genome size and with the presence of important enzymes with a twin-arginine signal peptide. A minimal Tat system requires a copy of tatA and a copy of tatC. The composition and gene order of a tat locus are generally conserved within the same taxonomy group but vary considerably to other groups, which would exclude an acquisition of the Tat system by recent horizontal gene transfer. The tat genes are also found in the genomes of chloroplasts and plant mitochondria but are absent from animal mitochondrial genomes. The topology of evolution trees suggests a bacterial origin of the Tat system. In general, the twin-arginine signal peptide is capable of targeting any passenger protein to the Tat pathway. However, a structural signal carried by the mature part of a passenger protein can override targeting information in a signal peptide under certain circumstances. Tat systems show a substrate-Tat component specificity and a species specificity. The pore size of the Tat channel is estimated as being between 5 and 9 nm. Operational models of the Tat system are proposed.
Collapse
|
24
|
Abstract
The prediction of coding sequences has received a lot of attention during the last decade. We can distinguish two kinds of methods, those that rely on training with sets of example and counter-example sequences, and those that exploit the intrinsic properties of the DNA sequences to be analyzed. The former are generally more powerful but their domains of application are limited by the availability of a training set. The latter avoid this drawback but can only be applied to sequences that are long enough to allow computation of the statistics. Here, we present a method that fills the gap between the two approaches. A learning step is applied using a set of sequences that are assumed to contain coding and non-coding regions, but with the boundaries of these regions unknown. A test step then uses the discriminant function obtained during the learning to predict coding regions in sequences from the same organism. The learning relies upon a correspondence analysis and prediction is presented on a graphical display. The method has been evaluated on a sample of yeast sequences, and the analysis of a set of expressed sequence tags from the Eucalyptus globulus-Pisolithus tinctorius ectomycorrhiza illustrates the relevance of the approach in its biological context.
Collapse
|
25
|
Abstract
We have undertaken the inventory and assembly of the ATP binding cassette (ABC) transporter systems in the complete genome of Bacillus subtilis. We combined the identification of the three protein partners that compose an ABC transporter (nucleotide-binding domain, NBD; membrane spanning domain, MSD; and solute-binding protein, SBP) with constraints on the genetic organization. This strategy allowed the identification of 86 NBDs in 78 proteins, 103 MSD proteins and 37 SBPs. The analysis of transcriptional units allows the reconstruction of 59 ABC transporters, which include at least one NBD and one MSD. A particular class of five dimeric ATPases was not associated to MSD partners and is assumed to be involved either in macrolide resistance or regulation of translation elongation. In addition, we have detected five genes encoding ATPases without any gene coding for MSD protein in their neighborhood and 11 operons that encode only the membrane and solute-binding proteins. On the bases of similarities, three ATP-binding proteins are proposed to energize ten incomplete systems, suggesting that one ATPase may be recruited by more than one transporter. Finally, we estimate that the B. subtilis genome encodes for at least 78 ABC transporters that have been split in 38 importers and 40 extruders. The ABC systems have been further classified into 11 sub-families according to the tree obtained from the NBDs and the clustering of the MSDs and the SBPs. Comparisons with Escherichia coli show that the extruders are over-represented in B. subtilis, corresponding to an expansion of the sub-families of antibiotic and drug resistance systems.
Collapse
|
26
|
Abstract
The authors of the first yeast chromosome sequence defined a minimum threshold requirement of 100 codons, above which an open reading frame (ORF) is retained as a putative coding sequence. However, at least 58 yeast genes shorter than 100 codons have an assigned protein function. Therefore, the yeast genome may contain other tiny but functionally important genes that are discarded from analyses by this simple filtering rule. We have established discriminant functions from the in-phase hexamer frequencies of functional genes and of simulated ORFs derived from a stationary Markov chain model. Fifty-two out of the 58 genes were recognized as coding ORFs by our discriminating method. The test was also applied to all the small ORFs (36 to 100 codons) found in the intergenic regions of published chromosomes. It retained 140 new potential tiny coding sequences, among which we identified seven new genes by similarity searches. Our method, used conjointly with similarity searches, can also highlight sequencing errors resulting from the disruption of the coding frame of longer ORFs. This method, by its ability to detect potential coding ORFs, can be a very useful tool for functional analysis.
Collapse
|
27
|
|
28
|
Abstract
The compositional distribution of coding sequences from five vertebrates (Xenopus, chicken, mouse, rat, and human) is shifted toward higher GC values compared to that of the DNA molecules (in the 35-85-kb size range) isolated from the corresponding genomes. This shift is due to the lower GC levels of intergenic sequences compared to coding sequences. In the cold-blooded vertebrate, the two distributions are similar in that GC-poor genes and GC-poor DNA molecules are largely predominant. In contrast, in the warm-blooded vertebrates, GC-rich genes are largely predominant over GC-poor genes, whereas GC-poor DNA molecules are largely predominant over GC-rich DNA molecules. As a consequence, the genomes of warm-blooded vertebrates show a compositional gradient of gene concentration. The compositional distributions of coding sequences (as well as of DNA molecules) showed remarkable differences between chicken and mammals, and between mouse (or rat) and human. Differences were also detected in the compositional distribution of housekeeping and tissue-specific genes, the former being more abundant among GC-rich genes.
Collapse
|
29
|
Statistical method for predicting protein coding regions in nucleic acid sequences. COMPUTER APPLICATIONS IN THE BIOSCIENCES : CABIOS 1987; 3:287-95. [PMID: 3134115 DOI: 10.1093/bioinformatics/3.4.287] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Protein coding regions of a genome fragment can be mathematically predicted by studying variations in the statistical properties or by searching the signals characteristic of the junctions between the coding and non-coding regions. We propose here a new statistical method using correspondence analysis. This method does not use any reference codon set but takes into account the codon usage homogeneity along the studied genome fragment. Comparison with previously published methods especially the 'codon usage method' of Staden has been made, and two examples are presented here. Applications to analysis of prokaryotic operon and eukaryotic split genes are also discussed. Use of the method has also shown two structures not previously described: i) in the human prt gene, a strong triplet structure exists in a non-coding region; ii) in the human tp-a codon usage is not uniform between the different exons.
Collapse
|
30
|
Evaluation of the "Cobas-Bio" centrifugal microanalyzer. Clin Chem 1981; 27:337-8. [PMID: 7460289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
|
31
|
|