1
|
Orencio-Trejo M, Utrilla J, Fernández-Sandoval MT, Huerta-Beristain G, Gosset G, Martinez A. Engineering the Escherichia coli fermentative metabolism. ADVANCES IN BIOCHEMICAL ENGINEERING/BIOTECHNOLOGY 2010; 121:71-107. [PMID: 20182928 DOI: 10.1007/10_2009_61] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Fermentative metabolism constitutes a fundamental cellular capacity for industrial biocatalysis. Escherichia coli is an important microorganism in the field of metabolic engineering for its well-known molecular characteristics and its rapid growth. It can adapt to different growth conditions and is able to grow in the presence or absence of oxygen. Through the use of metabolic pathway engineering and bioprocessing techniques, it is possible to explore the fundamental cellular properties and to exploit its capacity to be applied as industrial biocatalysts to produce a wide array of chemicals. The objective of this chapter is to review the metabolic engineering efforts carried out with E. coli by manipulating the central carbon metabolism and fermentative pathways to obtain strains that produce metabolites with high titers, such as ethanol, alanine, lactate and succinate.
Collapse
Affiliation(s)
- M Orencio-Trejo
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, 510-3, Cuernavaca, Morelos, 62250, México
| | | | | | | | | | | |
Collapse
|
2
|
Dekhtyar M, Morin A, Sakanyan V. Triad pattern algorithm for predicting strong promoter candidates in bacterial genomes. BMC Bioinformatics 2008; 9:233. [PMID: 18471287 PMCID: PMC2412878 DOI: 10.1186/1471-2105-9-233] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2007] [Accepted: 05/09/2008] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND Bacterial promoters, which increase the efficiency of gene expression, differ from other promoters by several characteristics. This difference, not yet widely exploited in bioinformatics, looks promising for the development of relevant computational tools to search for strong promoters in bacterial genomes. RESULTS We describe a new triad pattern algorithm that predicts strong promoter candidates in annotated bacterial genomes by matching specific patterns for the group I sigma70 factors of Escherichia coli RNA polymerase. It detects promoter-specific motifs by consecutively matching three patterns, consisting of an UP-element, required for interaction with the alpha subunit, and then optimally-separated patterns of -35 and -10 boxes, required for interaction with the sigma70 subunit of RNA polymerase. Analysis of 43 bacterial genomes revealed that the frequency of candidate sequences depends on the A+T content of the DNA under examination. The accuracy of in silico prediction was experimentally validated for the genome of a hyperthermophilic bacterium, Thermotoga maritima, by applying a cell-free expression assay using the predicted strong promoters. In this organism, the strong promoters govern genes for translation, energy metabolism, transport, cell movement, and other as-yet unidentified functions. CONCLUSION The triad pattern algorithm developed for predicting strong bacterial promoters is well suited for analyzing bacterial genomes with an A+T content of less than 62%. This computational tool opens new prospects for investigating global gene expression, and individual strong promoters in bacteria of medical and/or economic significance.
Collapse
Affiliation(s)
| | - Amelie Morin
- Laboratoire de Biotechnologie, UMR CNRS 6204, Université de Nantes, 2 rue de la Houssinière, 44322 Nantes, France
| | - Vehary Sakanyan
- Laboratoire de Biotechnologie, UMR CNRS 6204, Université de Nantes, 2 rue de la Houssinière, 44322 Nantes, France
- ProtNeteomix, 2 rue de la Houssinière, 44322 Nantes, France
| |
Collapse
|
3
|
González-Díaz H, Pérez-Bello A, Uriarte E, González-Díaz Y. QSAR study for mycobacterial promoters with low sequence homology. Bioorg Med Chem Lett 2006; 16:547-53. [PMID: 16275068 DOI: 10.1016/j.bmcl.2005.10.057] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2005] [Revised: 10/13/2005] [Accepted: 10/18/2005] [Indexed: 11/27/2022]
Abstract
The general belief is that quantitative structure-activity relationship (QSAR) techniques work only for small molecules and, protein sequences or, more recently, DNA sequences. However, with non-branched graph for proteins and DNA sequences the QSAR often have to be based on powerful non-linear techniques such as support vector machines. In our opinion, linear QSAR models based on RNA could be useful to assign biological activity when alignment techniques fail due to low sequence homology. The idea bases the high level of branching for the RNA graph. This work introduces the so-called Markov electrostatic potentials (k)xi(M) as a new class of RNA 2D-structure descriptors. Subsequently, we validate these molecular descriptors solving a QSAR classification problem for mycobacterial promoter sequences (mps), which constitute a very low sequence homology problem. The model developed (mps=-4.664.(0)xi(M)+0. 991.(1)xi(M)-2.432) was intended to predict whether a naturally occurring sequence is an mps or not on the basis of the calculated (k)xi(M) value for the corresponding RNA secondary structure. The RNA-QSAR approach recognises 115/135mps (85.2%) and 100% of control sequences. Average predictability and robustness were greater than 95%. A previous non-linear model predicts mps with a slightly higher accuracy (97%) but uses a very large parameter space for DNA sequences. Conversely, the (k)xi(M)-based RNA-QSAR encodes more structural information and needs only two variables.
Collapse
|
4
|
Elloumi M, Maddouri M. New voting strategies designed for the classification of nucleic sequences. Knowl Inf Syst 2005. [DOI: 10.1007/s10115-004-0151-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
5
|
Kalate RN, Tambe SS, Kulkarni BD. Artificial neural networks for prediction of mycobacterial promoter sequences. Comput Biol Chem 2004; 27:555-64. [PMID: 14667783 DOI: 10.1016/j.compbiolchem.2003.09.004] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
A multilayered feed-forward ANN architecture trained using the error-back-propagation (EBP) algorithm has been developed for predicting whether a given nucleotide sequence is a mycobacterial promoter sequence. Owing to the high prediction capability ( congruent with 97%) of the developed network model, it has been further used in conjunction with the caliper randomization (CR) approach for determining the structurally/functionally important regions in the promoter sequences. The results obtained thereby indicate that: (i) upstream region of -35 box, (ii) -35 region, (iii) spacer region and, (iv) -10 box, are important for mycobacterial promoters. The CR approach also suggests that the -38 to -29 region plays a significant role in determining whether a given sequence is a mycobacterial promoter. In essence, the present study establishes ANNs as a tool for predicting mycobacterial promoter sequences and determining structurally/functionally important sub-regions therein.
Collapse
Affiliation(s)
- Rupali N Kalate
- Department of Biosciences and Informatics, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama 223-8522, Japan.
| | | | | |
Collapse
|
6
|
Abstract
This paper presents a survey of currently available mathematical models and algorithmical methods for trying to identify promoter sequences. The methods concern both searching in a genome for a previously defined consensus and extracting a consensus from a set of sequences. Such methods were often tailored for either eukaryotes or prokaryotes although this does not preclude use of the same method for both types of organisms. The survey therefore covers all methods; however, emphasis is placed on prokaryotic promoter sequence identification. Illustrative applications of the main extracting algorithms are given for three bacteria.
Collapse
Affiliation(s)
- A Vanet
- Institut de biologie physico-chimique, Paris, France
| | | | | |
Collapse
|
7
|
Wang JT, Rozen S, Shapiro BA, Shasha D, Wang Z, Yin M. New techniques for DNA sequence classification. J Comput Biol 1999; 6:209-18. [PMID: 10421523 DOI: 10.1089/cmb.1999.6.209] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
DNA sequence classification is the activity of determining whether or not an unlabeled sequence S belongs to an existing class C. This paper proposes two new techniques for DNA sequence classification. The first technique works by comparing the unlabeled sequence S with a group of active motifs discovered from the elements of C and by distinction with elements outside of C. The second technique generates and matches gapped fingerprints of S with elements of C. Experimental results obtained by running these algorithms on long and well conserved Alu sequences demonstrate the good performance of the presented methods compared with FASTA. When applied to less conserved and relatively short functional sites such as splice-junctions, a variation of the second technique combining fingerprinting with consensus sequence analysis gives better results than the current classifiers employing text compression and machine learning algorithms.
Collapse
Affiliation(s)
- J T Wang
- Department of Computer and Information Science, New Jersey Institute of Technology, University Heights, Newark 07102, USA.
| | | | | | | | | | | |
Collapse
|
8
|
Abstract
Recognition of function of newly sequenced DNA fragments is an important area of computational molecular biology. Here we present an extensive review of methods for prediction of functional sites, tRNA, and protein-coding genes and discuss possible further directions of research in this area.
Collapse
Affiliation(s)
- M S Gelfand
- Institute of Protein Research, Russian Academy of Sciences, Pushchino, Moscow region, Russia
| |
Collapse
|
9
|
Titgemeyer F, Reizer J, Reizer A, Tang J, Parr TR, Saier MH. Nucleotide sequence of the region between crr and cysM in Salmonella typhimurium: five novel ORFs including one encoding a putative transcriptional regulator of the phosphotransferase system. DNA SEQUENCE : THE JOURNAL OF DNA SEQUENCING AND MAPPING 1995; 5:145-52. [PMID: 7612925 DOI: 10.3109/10425179509029354] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
A 4471 bp region between crr and cysM on the Salmonella typhimurium chromosome (49.5 min) has been sequenced. Five ORFs were found within this region, one of which is likely to be the putative regulatory gene, ptsJ, that corresponds in map position to a gene which when mutated allows expression of a cryptic Enzyme I of the phosphotransferase system. The deduced amino acid sequence of the encoded protein is similar to those of several open reading frames (ORFs) including ORFT2 of Rhodobacter spheroides with which it is 28% identical throughout most of its length (comparison score of 21 S.D.). PtsJ exhibits a putative, N-terminal, helix-turn-helix, DNA binding domain that is similar in sequence to those in members of the GntR family of transcriptional regulators. Analyses of the sequences of the ORFs encoded within this region are presented.
Collapse
Affiliation(s)
- F Titgemeyer
- Department of Biology, University of California at San Diego, La Jolla 92093-0116, USA
| | | | | | | | | | | |
Collapse
|
10
|
Yomo T, Urabe I. A frame-specific symmetry of complementary strands of DNA suggests the existence of genes on the antisense strand. J Mol Evol 1994; 38:113-20. [PMID: 8169956 DOI: 10.1007/bf00166158] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
The bacterial DNA sequence in GenBank database were divided into coding and noncoding regions and examined for the base-trimer distribution in every triplet frame on the sense and antisense strands. The results revealed that for the noncoding region, both strands have very similar base-trimer distributions and have no frame specificity; that is, DNA is symmetric in the noncoding region. For the coding region, on the other hand, the symmetry is broken only in the triplet framework, and we found a special triplet-frame-specific symmetry which appears when the two complementary strands of the coding region are read from their 5' ends. In addition, the following frame specificity was also observed in the distribution of stop codons on the antisense strand of the coding region. When the antisense sequences of the open reading frames (ORFs) in the database are read in the three reading frames, the same reading frame as the corresponding ORF contains a significantly larger amount of long open frames without stop codons (i.e., nonstop frames [NSFs]) than expected, while the number of NSFs in the other two reading frames is similar to that of the expected one. That is, NSFs as well as ORFs are maintained in a frame-specific manner, and in this sense, DNA becomes symmetrical even in the coding region. These two kinds of frame-specific symmetries indicate that only an ORF and its complementary triplets are specifically recognized and maintained in DNA.(ABSTRACT TRUNCATED AT 250 WORDS)
Collapse
Affiliation(s)
- T Yomo
- Department of Biotechnology, Faculty of Engineering, Osaka University, Japan
| | | |
Collapse
|
11
|
Ozoline ON, Uteshev TA, Masulis IS, Kamzolova SG. Interaction of bacterial RNA-polymerase with two different promoters of phage T7 DNA. Conformational analysis. BIOCHIMICA ET BIOPHYSICA ACTA 1993; 1172:251-61. [PMID: 7916631 DOI: 10.1016/0167-4781(93)90211-u] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Using a rifampicin-resistant RNA polymerase with altered specificity to different promoters, the D promoter of T7 phage DNA with increased affinity to the mutant enzyme was chosen. This promoter and the T7 A1 promoter with unchanged affinity as well as some nonpromoter DNA fragments were used to compare temperature-induced conformational transitions of RNA polymerase in the course of complex formation. Conformational alterations of RNA polymerase were monitored by the fluorescent label method. It was shown that RNA polymerase undergoes a set of conformational transitions during complex formation with each promoter, some of which were similar by the character of change to spectral parameters of the label (reflecting RPi and, probably, RPo formation). The local structure of complexes formed above 33 degrees C differs for A1 and D. The conformational analysis reveals at least one temperature-dependent stage upon nonspecific interaction of the enzyme with nonpromoter DNA at 13-16 degrees C. Models of functional organization of the enzyme recognizing center and some features of the structure of the promoters which may be essential for their recognition are discussed.
Collapse
Affiliation(s)
- O N Ozoline
- Institute of Cell Biophysics, Russian Academy of Sciences, Pushchino, Moscow region
| | | | | | | |
Collapse
|
12
|
Coleman J. Characterization of the Escherichia coli gene for 1-acyl-sn-glycerol-3-phosphate acyltransferase (plsC). MOLECULAR & GENERAL GENETICS : MGG 1992; 232:295-303. [PMID: 1557036 DOI: 10.1007/bf00280009] [Citation(s) in RCA: 109] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
An Escherichia coli strain deficient in 1-acyl-sn-glycerol-3-phosphate acyltransferase activity has previously been isolated, and the gene (plsC) has been shown to map near min 65 on the chromosome. I precisely mapped the location of plsC on the chromosome, and determined its DNA sequence. plsC is located between parC and sufI, and is separated from sufI by 74 bp. Upstream of plsC is parC, separated by 233 bp, which includes an active promoter. parC, plsC, and sufI are all transcribed in the counterclockwise direction on the chromosome, possibly in an operon with multiple promoters. The amino-terminal sequence of the partially purified protein, combined with the DNA sequence, reveal 1-acyl-sn-glycerol-3-phosphate acyltransferase to be a 27.5 kDa highly basic protein. The plsC gene product, 1-acyl-sn-glycerol-3-phosphate acyltransferase, is localized to the cytoplasmic membrane of the cell. The amino-terminal sequence of the purified protein reveals the first amino acid to be a blocked methionine residue, most probably a formyl-methionine. The amino acid sequence of 1-acyl-sn-glycerol-3-phosphate acyltransferase has a short region of homology to two other E. coli acyltransferases that utilize acyl-acyl carrier protein as the acyl donor, sn-glycerol-3-phosphate acyltransferase and UDP-N-acetyl-glucosamine acyltransferase (involved in lipid A biosynthesis).
Collapse
Affiliation(s)
- J Coleman
- Department of Biochemistry and Molecular Biology, Louisiana State University-Medical Center, New Orleans 70112
| |
Collapse
|
13
|
Cardon LR, Stormo GD. Expectation maximization algorithm for identifying protein-binding sites with variable lengths from unaligned DNA fragments. J Mol Biol 1992; 223:159-70. [PMID: 1731067 DOI: 10.1016/0022-2836(92)90723-w] [Citation(s) in RCA: 97] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
An Expectation Maximization algorithm for identification of DNA binding sites is presented. The approach predicts the location of binding regions while allowing variable length spacers within the sites. In addition to predicting the most likely spacer length for a set of DNA fragments, the method identifies individual sites that differ in spacer size. No alignment of DNA sequences is necessary. The method is illustrated by application to 231 Escherichia coli DNA fragments known to contain promoters with variable spacings between their consensus regions. Maximum-likelihood tests of the differences between the spacing classes indicate that the consensus regions of the spacing classes are not distinct. Further tests suggest that several positions within the spacing region may contribute to promoter specificity.
Collapse
Affiliation(s)
- L R Cardon
- Institute for Behavior Genetics, University of Colorado Boulder 80309-0447
| | | |
Collapse
|
14
|
Somerville R. The Trp repressor, a ligand-activated regulatory protein. PROGRESS IN NUCLEIC ACID RESEARCH AND MOLECULAR BIOLOGY 1992; 42:1-38. [PMID: 1574585 DOI: 10.1016/s0079-6603(08)60572-3] [Citation(s) in RCA: 27] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Affiliation(s)
- R Somerville
- Department of Biochemistry, Purdue University, West Lafayette, Indiana 47907
| |
Collapse
|
15
|
Vartak NB, Reizer J, Reizer A, Gripp JT, Groisman EA, Wu LF, Tomich JM, Saier MH. Sequence and evolution of the FruR protein of Salmonella typhimurium: a pleiotropic transcriptional regulatory protein possessing both activator and repressor functions which is homologous to the periplasmic ribose-binding protein. Res Microbiol 1991; 142:951-63. [PMID: 1805309 DOI: 10.1016/0923-2508(91)90005-u] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The repressor of the fructose (fru) operon of Salmonella typhimurium (FruR) has been implicated in the transcriptional regulation of dozens of genes concerned with central metabolic pathways of carbon utilization. We here report the nucleotide sequence of the gene encoding FruR and analyse both its operator-promoter region and its deduced amino acyl sequence. The FruR protein was overexpressed and was shown to have a molecular weight of about 36 kDa in agreement with the molecular weight deduced from the gene sequence. Sequence analyses revealed that FruR is homologous to 9 distinct bacterial DNA-binding proteins, most of which recognize sugar inducers and all of which possess helix-turn-helix motifs within their N-terminal regions and exhibit sequence identity throughout most of their lengths. FruR is also homologous to the periplasmic ribose-binding protein which serves as a constituent of the ribose transport/chemoreception system. The ribose-binding protein is in turn homologous to binding proteins specific for arabinose and galactose. The periplasmic binding proteins, the structures of some of which have been elucidated in three dimensions, lack the N-terminal helix-turn-helix region, but instead possess N-terminal hydrophobic signal sequences which target them to the periplasm. A phylogenetic tree for the more closely related proteins of this superfamily was constructed, and a signature motif was identified which should facilitate future detection of additional transcriptional regulatory proteins belonging to this family.
Collapse
Affiliation(s)
- N B Vartak
- Department of Biology, University of California at San Diego, La Jolla 92093-0116
| | | | | | | | | | | | | | | |
Collapse
|
16
|
Sharrocks AD, Hornby DP. Transcriptional analysis of the restriction and modification genes of bacteriophage P1. Mol Microbiol 1991; 5:685-94. [PMID: 2046552 DOI: 10.1111/j.1365-2958.1991.tb00740.x] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Bacteriophage P1 res and mod genes encode the restriction and modification polypeptides of the Type III restriction enzyme EcoP1. Northern blot analysis using res- and mod-specific probes revealed the presence of two separate transcripts in strains harbouring the EcoP1 restriction and modification genes. Furthermore, by constructing a series of fusions with a promoter less lacZ gene, we show that both the res and mod genes are transcribed from separate promoters. A more detailed investigation of the mod promoter region revealed two promoters located some 70 and 140bp upstream from the translational start codon. In addition, another pair of promoters and a further separate promoter are located more than 500bp upstream from this start codon. Two short open reading frames are located between these distal and proximal promoter clusters. Transcription of the res gene is initiated from within the mod open reading frame from two adjacent promoters. In addition a functional promoter is located on the antisense strand close to the res promoter region. The relationship between the transcription units of the res and mod genes is discussed.
Collapse
Affiliation(s)
- A D Sharrocks
- Department of Molecular Biology, Krebs Institute, University of Sheffield, UK
| | | |
Collapse
|
17
|
Jiang XM, Neal B, Santiago F, Lee SJ, Romana LK, Reeves PR. Structure and sequence of the rfb (O antigen) gene cluster of Salmonella serovar typhimurium (strain LT2). Mol Microbiol 1991; 5:695-713. [PMID: 1710759 DOI: 10.1111/j.1365-2958.1991.tb00741.x] [Citation(s) in RCA: 280] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The rfb gene cluster of Salmonella LT2 has been cloned and sequenced. The genes rfbA, rfbB, rfbD, rfbF, rfbG, rfbK, rfbM and rfbP were located individually and the gene rfbL was located outside the cluster. Approximately 16 open reading frames were found in the region which is essential for the expression of O antigen. The gene products of rfbB and rfbG were found to have homology with the group of dehydrogenase and related enzymes described previously. Analysis of the G + C ratio of the rfb cluster extended the area of low-G + C composition previously found in the sequence of rfbJ to the whole rfb gene cluster. Three to five segments with discrete G + C contents and codon adaptation indices are present in the rfb region, indicating a heterogeneous origin of these segments. Potential promoters were found near the start of the rfb region, supporting the possibility that the rfb gene cluster is an operon.
Collapse
Affiliation(s)
- X M Jiang
- Department of Microbiology, University of Sydney, New South Wales, Australia
| | | | | | | | | | | |
Collapse
|
18
|
|