1
|
Avram O, Rapoport D, Portugez S, Pupko T. M1CR0B1AL1Z3R-a user-friendly web server for the analysis of large-scale microbial genomics data. Nucleic Acids Res 2019; 47:W88-W92. [PMID: 31114912 PMCID: PMC6602433 DOI: 10.1093/nar/gkz423] [Citation(s) in RCA: 84] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2019] [Revised: 04/29/2019] [Accepted: 05/06/2019] [Indexed: 11/21/2022] Open
Abstract
Large-scale mining and analysis of bacterial datasets contribute to the comprehensive characterization of complex microbial dynamics within a microbiome and among different bacterial strains, e.g., during disease outbreaks. The study of large-scale bacterial evolutionary dynamics poses many challenges. These include data-mining steps, such as gene annotation, ortholog detection, sequence alignment and phylogeny reconstruction. These steps require the use of multiple bioinformatics tools and ad-hoc programming scripts, making the entire process cumbersome, tedious and error-prone due to manual handling. This motivated us to develop the M1CR0B1AL1Z3R web server, a 'one-stop shop' for conducting microbial genomics data analyses via a simple graphical user interface. Some of the features implemented in M1CR0B1AL1Z3R are: (i) extracting putative open reading frames and comparative genomics analysis of gene content; (ii) extracting orthologous sets and analyzing their size distribution; (iii) analyzing gene presence-absence patterns; (iv) reconstructing a phylogenetic tree based on the extracted orthologous set; (v) inferring GC-content variation among lineages. M1CR0B1AL1Z3R facilitates the mining and analysis of dozens of bacterial genomes using advanced techniques, with the click of a button. M1CR0B1AL1Z3R is freely available at https://microbializer.tau.ac.il/.
Collapse
Affiliation(s)
- Oren Avram
- The School of Molecular Cell Biology & Biotechnology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Dana Rapoport
- The School of Molecular Cell Biology & Biotechnology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Shir Portugez
- The School of Molecular Cell Biology & Biotechnology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Tal Pupko
- The School of Molecular Cell Biology & Biotechnology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| |
Collapse
|
2
|
Pannier L, Merino E, Marchal K, Collado-Vides J. Effect of genomic distance on coexpression of coregulated genes in E. coli. PLoS One 2017; 12:e0174887. [PMID: 28419102 PMCID: PMC5395161 DOI: 10.1371/journal.pone.0174887] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2016] [Accepted: 03/16/2017] [Indexed: 12/26/2022] Open
Abstract
In prokaryotes, genomic distance is a feature that in addition to coregulation affects coexpression. Several observations, such as genomic clustering of highly coexpressed small regulons, support the idea that coexpression behavior of coregulated genes is affected by the distance between the coregulated genes. However, the specific contribution of distance in addition to coregulation in determining the degree of coexpression has not yet been studied systematically. In this work, we exploit the rich information in RegulonDB to study how the genomic distance between coregulated genes affects their degree of coexpression, measured by pairwise similarity of expression profiles obtained under a large number of conditions. We observed that, in general, coregulated genes display higher degrees of coexpression as they are more closely located on the genome. This contribution of genomic distance in determining the degree of coexpression was relatively small compared to the degree of coexpression that was determined by the tightness of the coregulation (degree of overlap of regulatory programs) but was shown to be evolutionary constrained. In addition, the distance effect was sufficient to guarantee coexpression of coregulated genes that are located at very short distances, irrespective of their tightness of coregulation. This is partly but definitely not always because the close distance is also the cause of the coregulation. In cases where it is not, we hypothesize that the effect of the distance on coexpression could be caused by the fact that coregulated genes closely located to each other are also relatively more equidistantly located from their common TF and therefore subject to more similar levels of TF molecules. The absolute genomic distance of the coregulated genes to their common TF-coding gene tends to be less important in determining the degree of coexpression. Our results pinpoint the importance of taking into account the combined effect of distance and coregulation when studying prokaryotic coexpression and transcriptional regulation.
Collapse
Affiliation(s)
- Lucia Pannier
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Enrique Merino
- Departamento de Microbiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Kathleen Marchal
- Department of Microbial and Molecular Systems, KU Leuven, Centre of Microbial and Plant Genetics, Leuven, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark, Ghent, Belgium
- Department of Information Technology, Ghent University, IMinds, Ghent, Belgium
- Department of Genetics, University of Pretoria, Hatfield Campus, Pretoria, South Africa
| | - Julio Collado-Vides
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| |
Collapse
|
3
|
Oliver P, Peralta-Gil M, Tabche ML, Merino E. Molecular and structural considerations of TF-DNA binding for the generation of biologically meaningful and accurate phylogenetic footprinting analysis: the LysR-type transcriptional regulator family as a study model. BMC Genomics 2016; 17:686. [PMID: 27567672 PMCID: PMC5002191 DOI: 10.1186/s12864-016-3025-3] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2016] [Accepted: 08/18/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The goal of most programs developed to find transcription factor binding sites (TFBSs) is the identification of discrete sequence motifs that are significantly over-represented in a given set of sequences where a transcription factor (TF) is expected to bind. These programs assume that the nucleotide conservation of a specific motif is indicative of a selective pressure required for the recognition of a TF for its corresponding TFBS. Despite their extensive use, the accuracies reached with these programs remain low. In many cases, true TFBSs are excluded from the identification process, especially when they correspond to low-affinity but important binding sites of regulatory systems. RESULTS We developed a computational protocol based on molecular and structural criteria to perform biologically meaningful and accurate phylogenetic footprinting analyses. Our protocol considers fundamental aspects of the TF-DNA binding process, such as: i) the active homodimeric conformations of TFs that impose symmetric structures on the TFBSs, ii) the cooperative binding of TFs, iii) the effects of the presence or absence of co-inducers, iv) the proximity between two TFBSs or one TFBS and a promoter that leads to very long spurious motifs, v) the presence of AT-rich sequences not recognized by the TF but that are required for DNA flexibility, and vi) the dynamic order in which the different binding events take place to determine a regulatory response (i.e., activation or repression). In our protocol, the abovementioned criteria were used to analyze a profile of consensus motifs generated from canonical Phylogenetic Footprinting Analyses using a set of analysis windows of incremental sizes. To evaluate the performance of our protocol, we analyzed six members of the LysR-type TF family in Gammaproteobacteria. CONCLUSIONS The identification of TFBSs based exclusively on the significance of the over-representation of motifs in a set of sequences might lead to inaccurate results. The consideration of different molecular and structural properties of the regulatory systems benefits the identification of TFBSs and enables the development of elaborate, biologically meaningful and precise regulatory models that offer a more integrated view of the dynamics of the regulatory process of transcription.
Collapse
Affiliation(s)
- Patricia Oliver
- Departmento de Microbiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Martín Peralta-Gil
- Escuela Superior de Apan de la Universidad Autónoma del Estado de Hidalgo, Carretera Apan-Calpulalpan, Km 8, Chimalpa Tlalayote s/n, Colonia Chimalpa, Apan, Hidalgo, México
| | - María-Luisa Tabche
- Departmento de Microbiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Enrique Merino
- Departmento de Microbiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México.
| |
Collapse
|
4
|
González A, Angarica VE, Sancho J, Fillat MF. The FurA regulon in Anabaena sp. PCC 7120: in silico prediction and experimental validation of novel target genes. Nucleic Acids Res 2014; 42:4833-46. [PMID: 24503250 PMCID: PMC4005646 DOI: 10.1093/nar/gku123] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
In the filamentous cyanobacterium Anabaena sp. PCC 7120, the ferric uptake regulator FurA functions as a global transcriptional regulator. Despite several analyses have focused on elucidating the FurA-regulatory network, the number of target genes described for this essential transcription factor is limited to a handful of examples. In this article, we combine an in silico genome-wide predictive approach with experimental determinations to better define the FurA regulon. Predicted FurA-binding sites were identified upstream of 215 genes belonging to diverse functional categories including iron homeostasis, photosynthesis and respiration, heterocyst differentiation, oxidative stress defence and light-dependent signal transduction mechanisms, among others. The probabilistic model proved to be effective at discerning FurA boxes from non-cognate sequences, while subsequent electrophoretic mobility shift assay experiments confirmed the in vitro specific binding of FurA to at least 20 selected predicted targets. Gene-expression analyses further supported the dual role of FurA as transcriptional modulator that can act both as repressor and as activator. In either role, the in vitro affinity of the protein to its target sequences is strongly dependent on metal co-regulator and reducing conditions, suggesting that FurA couples in vivo iron homeostasis and the response to oxidative stress to major physiological processes in cyanobacteria.
Collapse
Affiliation(s)
- Andrés González
- Departamento de Bioquímica y Biología Molecular y Celular, Universidad de Zaragoza, 50009 Zaragoza, Spain, Instituto de Biocomputación y Física de Sistemas Complejos (BIFI), Universidad de Zaragoza, 50018 Zaragoza, Spain and Unidad Asociada BIFI-IQFR (CSIC), 28006 Madrid, Spain
| | | | | | | |
Collapse
|
5
|
Latorre M, Galloway-Peña J, Roh JH, Budinich M, Reyes-Jara A, Murray BE, Maass A, González M. Enterococcus faecalis reconfigures its transcriptional regulatory network activation at different copper levels. Metallomics 2014; 6:572-81. [PMID: 24382465 DOI: 10.1039/c3mt00288h] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
A global transcriptional regulatory network was generated in the pathogenic bacterium Enterococcus faecalis in order to understand how this organism can activate and coordinate its expression at different copper concentrations. The topological evaluation of the network showed common patterns described in other organisms. Integrating microarray experiments allowed the identification of two sub-networks activated at low (0.05 mM CuSO4) and high (0.5 mM CuSO4) concentrations of copper. The analysis indicates the presence of specific functionally activated modules induced by copper levels, highlighting the regulons LysR and ArgR as global regulators and CopY, Fur and LexA as local regulators. Taking advantage of the fact that E. faecalis presented a homeostatic module, we produced an in vivo intervention by removing this system from the cell without affecting the connectivity of the global transcriptional network. This strategy led us to find that this bacterium can reconfigure its gene expression to maintain cellular homeostasis, activating new modules principally related to glucose metabolism and transcriptional processes. Finally, these results position E. faecalis as the most complete and controllable systemic model organism for copper homeostasis available to date.
Collapse
Affiliation(s)
- Mauricio Latorre
- Laboratorio de Bioinformática y Expresión Génica, INTA, Universidad de Chile, El Líbano 5524, Santiago 11, Chile. ,
| | | | | | | | | | | | | | | |
Collapse
|
6
|
Faria JP, Overbeek R, Xia F, Rocha M, Rocha I, Henry CS. Genome-scale bacterial transcriptional regulatory networks: reconstruction and integrated analysis with metabolic models. Brief Bioinform 2013; 15:592-611. [DOI: 10.1093/bib/bbs071] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
|
7
|
Keseler IM, Mackie A, Peralta-Gil M, Santos-Zavaleta A, Gama-Castro S, Bonavides-Martínez C, Fulcher C, Huerta AM, Kothari A, Krummenacker M, Latendresse M, Muñiz-Rascado L, Ong Q, Paley S, Schröder I, Shearer AG, Subhraveti P, Travers M, Weerasinghe D, Weiss V, Collado-Vides J, Gunsalus RP, Paulsen I, Karp PD. EcoCyc: fusing model organism databases with systems biology. Nucleic Acids Res 2012; 41:D605-12. [PMID: 23143106 PMCID: PMC3531154 DOI: 10.1093/nar/gks1027] [Citation(s) in RCA: 428] [Impact Index Per Article: 32.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
EcoCyc (http://EcoCyc.org) is a model organism database built on the genome sequence of Escherichia coli K-12 MG1655. Expert manual curation of the functions of individual E. coli gene products in EcoCyc has been based on information found in the experimental literature for E. coli K-12-derived strains. Updates to EcoCyc content continue to improve the comprehensive picture of E. coli biology. The utility of EcoCyc is enhanced by new tools available on the EcoCyc web site, and the development of EcoCyc as a teaching tool is increasing the impact of the knowledge collected in EcoCyc.
Collapse
Affiliation(s)
- Ingrid M Keseler
- SRI International, 333 Ravenswood Avenue, Menlo Park, CA 94025, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Krueger B, Friedrich T, Förster F, Bernhardt J, Gross R, Dandekar T. Different evolutionary modifications as a guide to rewire two-component systems. Bioinform Biol Insights 2012; 6:97-128. [PMID: 22586357 PMCID: PMC3348925 DOI: 10.4137/bbi.s9356] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
Two-component systems (TCS) are short signalling pathways generally occurring in prokaryotes. They frequently regulate prokaryotic stimulus responses and thus are also of interest for engineering in biotechnology and synthetic biology. The aim of this study is to better understand and describe rewiring of TCS while investigating different evolutionary scenarios. Based on large-scale screens of TCS in different organisms, this study gives detailed data, concrete alignments, and structure analysis on three general modification scenarios, where TCS were rewired for new responses and functions: (i) exchanges in the sequence within single TCS domains, (ii) exchange of whole TCS domains; (iii) addition of new components modulating TCS function. As a result, the replacement of stimulus and promotor cassettes to rewire TCS is well defined exploiting the alignments given here. The diverged TCS examples are non-trivial and the design is challenging. Designed connector proteins may also be useful to modify TCS in selected cases.
Collapse
Affiliation(s)
- Beate Krueger
- Dept of Bioinformatics, Biocenter, Am Hubland, University of Würzburg, D-97074 Würzburg, Germany
| | | | | | | | | | | |
Collapse
|
9
|
Li G, Liu B, Ma Q, Xu Y. A new framework for identifying cis-regulatory motifs in prokaryotes. Nucleic Acids Res 2010; 39:e42. [PMID: 21149261 PMCID: PMC3074163 DOI: 10.1093/nar/gkq948] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
We present a new algorithm, BOBRO, for prediction of cis-regulatory motifs in a given set of promoter sequences. The algorithm substantially improves the prediction accuracy and extends the scope of applicability of the existing programs based on two key new ideas: (i) we developed a highly effective method for reliably assessing the possibility for each position in a given promoter to be the (approximate) start of a conserved sequence motif; and (ii) we developed a highly reliable way for recognition of actual motifs from the accidental ones based on the concept of ‘motif closure’. These two key ideas are embedded in a classical framework for motif finding through finding cliques in a graph but have made this framework substantially more sensitive as well as more selective in motif finding in a very noisy background. A comparative analysis shows that the performance coefficient was improved from 29% to 41% by our program compared to the best among other six state-of-the-art prediction tools on a large-scale data sets of promoters from one genome, and also consistently improved by substantial margins on another kind of large-scale data sets of orthologous promoters across multiple genomes. The power of BOBRO in dealing with noisy data was further demonstrated through identification of the motifs of the global transcriptional regulators by running it over 2390 promoter sequences of Escherichia coli K12.
Collapse
Affiliation(s)
- Guojun Li
- Computational Systems Biology Laboratory, Department of Biochemistry and Molecular Biology and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | | | | | | |
Collapse
|
10
|
Baumbach J. On the power and limits of evolutionary conservation--unraveling bacterial gene regulatory networks. Nucleic Acids Res 2010; 38:7877-84. [PMID: 20699275 PMCID: PMC3001071 DOI: 10.1093/nar/gkq699] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The National Center for Biotechnology Information (NCBI) recently announced ‘1000 prokaryotic genomes are now completed and available in the Genome database’. The increasing trend will provide us with thousands of sequenced microbial organisms over the next years. However, this is only the first step in understanding how cells survive, reproduce and adapt their behavior while being exposed to changing environmental conditions. One major control mechanism is transcriptional gene regulation. Here, striking is the direct juxtaposition of the handful of bacterial model organisms to the 1000 prokaryotic genomes. Next-generation sequencing technologies will further widen this gap drastically. However, several computational approaches have proven to be helpful. The main idea is to use the known transcriptional regulatory network of reference organisms as template in order to unravel evolutionarily conserved gene regulations in newly sequenced species. This transfer essentially depends on the reliable identification of several types of conserved DNA sequences. We decompose this problem into three computational processes, review the state of the art and illustrate future perspectives.
Collapse
Affiliation(s)
- Jan Baumbach
- Algorithms Group, International Computer Science Institute, Berkeley, USA.
| |
Collapse
|
11
|
Bai J, Wang J, Xue F, Li J, Bu L, Hu J, Xu G, Bao Q, Zhao G, Ding X, Yan J, Wu J. proTF: a comprehensive data and phylogenomics resource for prokaryotic transcription factors. Bioinformatics 2010; 26:2493-5. [DOI: 10.1093/bioinformatics/btq432] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
12
|
Karpinets TV, Romine MF, Schmoyer DD, Kora GH, Syed MH, Leuze MR, Serres MH, Park BH, Samatova NF, Uberbacher EC. Shewanella knowledgebase: integration of the experimental data and computational predictions suggests a biological role for transcription of intergenic regions. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2010; 2010:baq012. [PMID: 20627862 PMCID: PMC2911847 DOI: 10.1093/database/baq012] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Shewanellae are facultative γ-proteobacteria whose remarkable respiratory versatility has resulted in interest in their utility for bioremediation of heavy metals and radionuclides and for energy generation in microbial fuel cells. Extensive experimental efforts over the last several years and the availability of 21 sequenced Shewanella genomes made it possible to collect and integrate a wealth of information on the genus into one public resource providing new avenues for making biological discoveries and for developing a system level understanding of the cellular processes. The Shewanella knowledgebase was established in 2005 to provide a framework for integrated genome-based studies on Shewanella ecophysiology. The present version of the knowledgebase provides access to a diverse set of experimental and genomic data along with tools for curation of genome annotations and visualization and integration of genomic data with experimental data. As a demonstration of the utility of this resource, we examined a single microarray data set from Shewanella oneidensis MR-1 for new insights into regulatory processes. The integrated analysis of the data predicted a new type of bacterial transcriptional regulation involving co-transcription of the intergenic region with the downstream gene and suggested a biological role for co-transcription that likely prevents the binding of a regulator of the upstream gene to the regulator binding site located in the intergenic region. Database URL:http://shewanella-knowledgebase.org:8080/Shewanella/ or http://spruce.ornl.gov:8080/Shewanella/
Collapse
|
13
|
Veiga DFT, Dutta B, Balázsi G. Network inference and network response identification: moving genome-scale data to the next level of biological discovery. MOLECULAR BIOSYSTEMS 2010; 6:469-80. [PMID: 20174676 PMCID: PMC3087299 DOI: 10.1039/b916989j] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The escalating amount of genome-scale data demands a pragmatic stance from the research community. How can we utilize this deluge of information to better understand biology, cure diseases, or engage cells in bioremediation or biomaterial production for various purposes? A research pipeline moving new sequence, expression and binding data towards practical end goals seems to be necessary. While most individual researchers are not motivated by such well-articulated pragmatic end goals, the scientific community has already self-organized itself to successfully convert genomic data into fundamentally new biological knowledge and practical applications. Here we review two important steps in this workflow: network inference and network response identification, applied to transcriptional regulatory networks. Among network inference methods, we concentrate on relevance networks due to their conceptual simplicity. We classify and discuss network response identification approaches as either data-centric or network-centric. Finally, we conclude with an outlook on what is still missing from these approaches and what may be ahead on the road to biological discovery.
Collapse
Affiliation(s)
- Diogo F T Veiga
- Department of Systems Biology-Unit 950, The University of Texas M. D. Anderson Cancer Center, Houston, TX, USA.
| | | | | |
Collapse
|
14
|
Qu Y, Brown P, Barbe JF, Puljic M, Merino E, Adkins RM, Lovley DR, Krushkal J. GSEL Version 2, an Online Genome-Wide Query System of Operon Organization and Regulatory Sequence Elements of Geobacter sulfurreducens. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2009; 13:439-49. [DOI: 10.1089/omi.2009.0081] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Affiliation(s)
- Yanhua Qu
- Department of Preventive Medicine, The University of Tennessee Health Science Center, Memphis, Tennessee
| | - Peter Brown
- Department of Microbiology, University of Massachusetts, Amherst, Massachusetts
| | - Jose F. Barbe
- Department of Preventive Medicine, The University of Tennessee Health Science Center, Memphis, Tennessee
| | - Marko Puljic
- Department of Preventive Medicine, The University of Tennessee Health Science Center, Memphis, Tennessee
- Department of Mathematical Sciences, University of Memphis, Memphis, Tennessee
| | - Enrique Merino
- Department of Pediatrics, The University of Tennessee Health Science Center, Memphis, Tennessee
| | - Ronald M. Adkins
- Department of Molecular Microbiology, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Morelos, México
| | - Derek R. Lovley
- Department of Microbiology, University of Massachusetts, Amherst, Massachusetts
| | - Julia Krushkal
- Department of Preventive Medicine, The University of Tennessee Health Science Center, Memphis, Tennessee
| |
Collapse
|
15
|
Pérez AG, Angarica VE, Collado-Vides J, Vasconcelos ATR. From sequence to dynamics: the effects of transcription factor and polymerase concentration changes on activated and repressed promoters. BMC Mol Biol 2009; 10:92. [PMID: 19772633 PMCID: PMC2761915 DOI: 10.1186/1471-2199-10-92] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2009] [Accepted: 09/22/2009] [Indexed: 11/25/2022] Open
Abstract
Background The fine tuning of two features of the bacterial regulatory machinery have been known to contribute to the diversity of gene expression within the same regulon: the sequence of Transcription Factor (TF) binding sites, and their location with respect to promoters. While variations of binding sequences modulate the strength of the interaction between the TF and its binding sites, the distance between binding sites and promoters alter the interaction between the TF and the RNA polymerase (RNAP). Results In this paper we estimated the dissociation constants (Kd) of several E. coli TFs in their interaction with variants of their binding sequences from the scores resulting from aligning them to Positional Weight Matrices. A correlation coefficient of 0.78 was obtained when pooling together sites for different TFs. The theoretically estimated Kd values were then used, together with the dissociation constants of the RNAP-promoter interaction to analyze activated and repressed promoters. The strength of repressor sites -- i.e., the strength of the interaction between TFs and their binding sites -- is slightly higher than that of activated sites. We explored how different factors such as the variation of binding sequences, the occurrence of more than one binding site, or different RNAP concentrations may influence the promoters' response to the variations of TF concentrations. We found that the occurrence of several regulatory sites bound by the same TF close to a promoter -- if they are bound by the TF in an independent manner -- changes the effect of TF concentrations on promoter occupancy, with respect to individual sites. We also found that the occupancy of a promoter will never be more than half if the RNAP concentration-to-Kp ratio is 1 and the promoter is subject to repression; or less than half if the promoter is subject to activation. If the ratio falls to 0.1, the upper limit of occupancy probability for repressed drops below 10%; a descent of the limits occurs also for activated promoters. Conclusion The number of regulatory sites may thus act as a versatility-producing device, in addition to serving as a source of robustness of the transcription machinery. Furthermore, our results show that the effects of TF concentration fluctuations on promoter occupancy are constrained by RNAP concentrations.
Collapse
Affiliation(s)
- Abel González Pérez
- Centro Nacional de Bioinformática, Industria y San José, Capitolio Nacional, CP 10200, Habana Vieja, Ciudad de la Habana, Cuba.
| | | | | | | |
Collapse
|
16
|
Janky R, Helden JV, Babu MM. Investigating transcriptional regulation: from analysis of complex networks to discovery of cis-regulatory elements. Methods 2009; 48:277-86. [PMID: 19450688 DOI: 10.1016/j.ymeth.2009.04.022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2009] [Revised: 04/17/2009] [Accepted: 04/18/2009] [Indexed: 10/20/2022] Open
Abstract
Regulation of gene expression at the transcriptional level is a fundamental mechanism that is well conserved in all cellular systems. Due to advances in large-scale experimental analyses, we now have a wealth of information on gene regulation such as mRNA expression level across multiple conditions, genome-wide location data of transcription factors and data on transcription factor binding sites. This knowledge can be used to reconstruct transcriptional regulatory networks. Such networks are usually represented as directed graphs where regulatory interactions are depicted as directed edges from the transcription factor nodes to the target gene nodes. This abstract representation allows us to apply graph theory to study transcriptional regulation at global and local levels, to predict regulatory motifs and regulatory modules such as regulons and to compare the regulatory network of different genomes. Here we review some of the available computational methodologies for studying transcriptional regulatory networks as well as their interpretation.
Collapse
Affiliation(s)
- Rekin's Janky
- Structural Studies Division, Medical Research Council - Laboratory of Molecular Biology, Hills Road, Cambridge CB20QH, United Kingdom.
| | | | | |
Collapse
|
17
|
Reliable transfer of transcriptional gene regulatory networks between taxonomically related organisms. BMC SYSTEMS BIOLOGY 2009; 3:8. [PMID: 19146695 PMCID: PMC2653031 DOI: 10.1186/1752-0509-3-8] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/27/2008] [Accepted: 01/15/2009] [Indexed: 11/10/2022]
Abstract
BACKGROUND Transcriptional regulation of gene activity is essential for any living organism. Transcription factors therefore recognize specific binding sites within the DNA to regulate the expression of particular target genes. The genome-scale reconstruction of the emerging regulatory networks is important for biotechnology and human medicine but cost-intensive, time-consuming, and impossible to perform for any species separately. By using bioinformatics methods one can partially transfer networks from well-studied model organisms to closely related species. However, the prediction quality is limited by the low level of evolutionary conservation of the transcription factor binding sites, even within organisms of the same genus. RESULTS Here we present an integrated bioinformatics workflow that assures the reliability of transferred gene regulatory networks. Our approach combines three methods that can be applied on a large-scale: re-assessment of annotated binding sites, subsequent binding site prediction, and homology detection. A gene regulatory interaction is considered to be conserved if (1) the transcription factor, (2) the adjusted binding site, and (3) the target gene are conserved. The power of the approach is demonstrated by transferring gene regulations from the model organism Corynebacterium glutamicum to the human pathogens C. diphtheriae, C. jeikeium, and the biotechnologically relevant C. efficiens. For these three organisms we identified reliable transcriptional regulations for ~40% of the common transcription factors, compared to ~5% for which knowledge was available before. CONCLUSION Our results suggest that trustworthy genome-scale transfer of gene regulatory networks between organisms is feasible in general but still limited by the level of evolutionary conservation.
Collapse
|
18
|
|
19
|
Klein J, Leupold S, Münch R, Pommerenke C, Johl T, Kärst U, Jänsch L, Jahn D, Retter I. ProdoNet: identification and visualization of prokaryotic gene regulatory and metabolic networks. Nucleic Acids Res 2008; 36:W460-4. [PMID: 18440972 PMCID: PMC2447764 DOI: 10.1093/nar/gkn217] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
ProdoNet is a web-based application for the mapping of prokaryotic genes and the corresponding proteins to common gene regulatory and metabolic networks. For a given list of genes, the system detects shared operons, identifies co-expressed genes and deduces joint regulators. In addition, the contribution to shared metabolic pathways becomes visible on KEGG maps. Furthermore, the co-occurrence of genes of interest in gene expression profiles can be added to the visualization of the global network. In this way, ProdoNet provides the basis for functional genomics approaches and for the interpretation of transcriptomics and proteomics data. As an example, we present an investigation of an experimental membrane subproteome analysis of Pseudomonas aeruginosa with ProdoNet. The ProdoNet dataset on transcriptional regulation is based on the PRODORIC Prokaryotic Database of Gene Regulation and the Virtual Footprint tool. ProdoNet is accessible at http://www.prodonet.tu-bs.de.
Collapse
Affiliation(s)
- Johannes Klein
- Institute for Microbiology, Technische Universität Braunschweig, Spielmannstr. 7, D-38106 Braunschweig, Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
20
|
González Pérez AD, González González E, Espinosa Angarica V, Vasconcelos ATR, Collado-Vides J. Impact of Transcription Units rearrangement on the evolution of the regulatory network of gamma-proteobacteria. BMC Genomics 2008; 9:128. [PMID: 18366643 PMCID: PMC2329645 DOI: 10.1186/1471-2164-9-128] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2007] [Accepted: 03/17/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In the past years, several studies begun to unravel the structure, dynamical properties, and evolution of transcriptional regulatory networks. However, even those comparative studies that focus on a group of closely related organisms are limited by the rather scarce knowledge on regulatory interactions outside a few model organisms, such as E. coli among the prokaryotes. RESULTS In this paper we used the information annotated in Tractor_DB (a database of regulatory networks in gamma-proteobacteria) to calculate a normalized Site Orthology Score (SOS) that quantifies the conservation of a regulatory link across thirty genomes of this subclass. Then we used this SOS to assess how regulatory connections have evolved in this group, and how the variation of basic regulatory connection is reflected on the structure of the chromosome. We found that individual regulatory interactions shift between different organisms, a process that may be described as rewiring the network. At this evolutionary scale (the gamma-proteobacteria subclass) this rewiring process may be an important source of variation of regulatory incoming interactions for individual networks. We also noticed that the regulatory links that form feed forward motifs are conserved in a better correlated manner than triads of random regulatory interactions or pairs of co-regulated genes. Furthermore, the rewiring process that takes place at the most basic level of the regulatory network may be linked to rearrangements of genetic material within bacterial chromosomes, which change the structure of Transcription Units and therefore the regulatory connections between Transcription Factors and structural genes. CONCLUSION The rearrangements that occur in bacterial chromosomes-mostly inversion or horizontal gene transfer events - are important sources of variation of gene regulation at this evolutionary scale.
Collapse
Affiliation(s)
- Abel D González Pérez
- Centro Nacional de Bioinformática. Industria y San José, Capitolio Nacional, CP. 10200, Habana Vieja, Ciudad de la Habana, Cuba.
| | | | | | | | | |
Collapse
|
21
|
Cross-talk Between Iron and Nitrogen Regulatory Networks in Anabaena (Nostoc) sp. PCC 7120: Identification of Overlapping Genes in FurA and NtcA Regulons. J Mol Biol 2007; 374:267-81. [DOI: 10.1016/j.jmb.2007.09.010] [Citation(s) in RCA: 86] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2007] [Revised: 08/30/2007] [Accepted: 09/04/2007] [Indexed: 01/26/2023]
|
22
|
Affiliation(s)
- Dmitry A Rodionov
- Burnham Institute for Medical Research, La Jolla, California 92037, USA.
| |
Collapse
|