1
|
Chalupa D, Nielsen P. Parameter-free and cooperative local search algorithms for graph colouring. Soft comput 2021. [DOI: 10.1007/s00500-021-06347-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
2
|
Hashim FA, Mabrouk MS, Atabany WA. Comparative Analysis of DNA Motif Discovery Algorithms: A Systemic Review. CURRENT CANCER THERAPY REVIEWS 2019. [DOI: 10.2174/1573394714666180417161728] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
Bioinformatics is an interdisciplinary field that combines biology and information
technology to study how to deal with the biological data. The DNA motif discovery
problem is the main challenge of genome biology and its importance is directly proportional to increasing
sequencing technologies which produce large amounts of data. DNA motif is a repeated
portion of DNA sequences of major biological interest with important structural and functional
features. Motif discovery plays a vital role in the antibody-biomarker identification which is useful
for diagnosis of disease and to identify Transcription Factor Binding Sites (TFBSs) that help in
learning the mechanisms for regulation of gene expression. Recently, scientists discovered that the
TFs have a mutation rate five times higher than the flanking sequences, so motif discovery also
has a crucial role in cancer discovery.
Methods:
Over the past decades, many attempts use different algorithms to design fast and accurate
motif discovery tools. These algorithms are generally classified into consensus or probabilistic
approach.
Results:
Many of DNA motif discovery algorithms are time-consuming and easily trapped in a local
optimum.
Conclusion:
Nature-inspired algorithms and many of combinatorial algorithms are recently proposed
to overcome the problems of consensus and probabilistic approaches. This paper presents a
general classification of motif discovery algorithms with new sub-categories. It also presents a
summary comparison between them.
Collapse
Affiliation(s)
- Fatma A. Hashim
- Department of Biomedical Engineering, Helwan University, Helwan, Egypt
| | - Mai S. Mabrouk
- Department of Biomedical Engineering, Misr University for Science and Technology (MUST), Cairo, Egypt
| | | |
Collapse
|
3
|
Teixeira MC, Monteiro PT, Sá-Correia I. Predicting Gene and Genomic Regulation in Saccharomyces cerevisiae, using the YEASTRACT Database: A Step-by-Step Guided Analysis. Methods Mol Biol 2015; 1361:391-404. [PMID: 26483034 DOI: 10.1007/978-1-4939-3079-1_22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
Abstract
Transcriptional regulation is one of the key steps in the control of gene expression, with huge impact on the survival, adaptation, and fitness of all organisms. However, it is becoming increasingly clear that transcriptional regulation is far more complex than initially foreseen. In model organisms such as the yeast Saccharomyces cerevisiae evidence has been piling up showing that the expression of each gene can be controlled by several transcription factors, in the close dependency of the environmental conditions. Furthermore, transcription factors work in intricate networks, being themselves regulated at the transcriptional, post-transcriptional, and post-translational levels, working in cooperation or antagonism in the promoters of their target genes.In this chapter, a step-by-step guide using the YEASTRACT database is provided, for the prediction and ranking of the transcription factors required for the regulation of the expression a single gene and of a genome-wide response. These analyses are illustrated with the regulation of the PDR18 gene and of the transcriptome-wide changes induced upon exposure to the herbicide 2,4-Dichlorophenoxyacetic acid (2,4-D), respectively. The newest potentialities of this information system are explored, and the various results obtained in the dependency of the querying criteria are discussed in terms of the knowledge gathered on the biological responses considered as case studies.
Collapse
Affiliation(s)
- Miguel C Teixeira
- Biological Sciences Research Group, Department of Bioengineering, Instituto Superior Técnico, IBB - Institute for Bioengineering and Biosciences, Universidade de Lisboa, Av. Rovisco Pais, 1049-001, Lisbon, Portugal.
| | - Pedro T Monteiro
- INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, Rua Alves Redol 9, 1000-029 Lisbon, Portugal
| | - Isabel Sá-Correia
- Biological Sciences Research Group, Department of Bioengineering, Instituto Superior Técnico, IBB - Institute for Bioengineering and Biosciences, Universidade de Lisboa, Av. Rovisco Pais, 1049-001, Lisbon, Portugal
| |
Collapse
|
4
|
Becker JD, Takeda S, Borges F, Dolan L, Feijó JA. Transcriptional profiling of Arabidopsis root hairs and pollen defines an apical cell growth signature. BMC PLANT BIOLOGY 2014; 14:197. [PMID: 25080170 PMCID: PMC4236730 DOI: 10.1186/s12870-014-0197-3] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/11/2014] [Accepted: 07/14/2014] [Indexed: 05/20/2023]
Abstract
BACKGROUND Current views on the control of cell development are anchored on the notion that phenotypes are defined by networks of transcriptional activity. The large amounts of information brought about by transcriptomics should allow the definition of these networks through the analysis of cell-specific transcriptional signatures. Here we test this principle by applying an analogue to comparative anatomy at the cellular level, searching for conserved transcriptional signatures, or conserved small gene-regulatory networks (GRNs) on root hairs (RH) and pollen tubes (PT), two filamentous apical growing cells that are a striking example of conservation of structure and function in plants. RESULTS We developed a new method for isolation of growing and mature root hair cells, analysed their transcriptome by microarray analysis, and further compared it with pollen and other single cell transcriptomics data. Principal component analysis shows a statistical relation between the datasets of RHs and PTs which is suggestive of a common transcriptional profile pattern for the apical growing cells in a plant, with overlapping profiles and clear similarities at the level of small GTPases, vesicle-mediated transport and various specific metabolic responses. Furthermore, cis-regulatory element analysis of co-regulated genes between RHs and PTs revealed conserved binding sequences that are likely required for the expression of genes comprising the apical signature. This included a significant occurrence of motifs associated to a defined transcriptional response upon anaerobiosis. CONCLUSIONS Our results suggest that maintaining apical growth mechanisms synchronized with energy yielding might require a combinatorial network of transcriptional regulation. We propose that this study should constitute the foundation for further genetic and physiological dissection of the mechanisms underlying apical growth of plant cells.
Collapse
Affiliation(s)
- Jörg D Becker
- Instituto Gulbenkian de Ciência, 2780-156, Oeiras, Portugal
| | - Seiji Takeda
- Department of Cell and Developmental Biology, John Innes Centre, Norwich NR4 7UH, UK
- Present address: Cell and Genome Biology, Graduate School of Life and Environmental Sciences, Kyoto Prefectural University, Kitaina-Yazuma Oji 74, Seika-cho, Soraku-gun, Kyoto 619-0244, Japan
| | - Filipe Borges
- Instituto Gulbenkian de Ciência, 2780-156, Oeiras, Portugal
- Present address: Cold Spring Harbor Laboratory, Cold Spring Harbor 11724, NY, USA
| | - Liam Dolan
- Department of Cell and Developmental Biology, John Innes Centre, Norwich NR4 7UH, UK
- Department of Plant Sciences, University of Oxford, South Parks Road, Oxford OX1 3RB, UK
| | - José A Feijó
- Instituto Gulbenkian de Ciência, 2780-156, Oeiras, Portugal
- Department of Cell Biology and Molecular Genetics, University of Maryland, 0118 BioScience Research Bldg, College Park 20742-5815, MD, USA
| |
Collapse
|
5
|
Lim JH, Iggo RD, Barker D. Models incorporating chromatin modification data identify functionally important p53 binding sites. Nucleic Acids Res 2013; 41:5582-93. [PMID: 23599002 PMCID: PMC3675478 DOI: 10.1093/nar/gkt260] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Genome-wide prediction of transcription factor binding sites is notoriously difficult. We have developed and applied a logistic regression approach for prediction of binding sites for the p53 transcription factor that incorporates sequence information and chromatin modification data. We tested this by comparison of predicted sites with known binding sites defined by chromatin immunoprecipitation (ChIP), by the location of predictions relative to genes, by the function of nearby genes and by analysis of gene expression data after p53 activation. We compared the predictions made by our novel model with predictions based only on matches to a sequence position weight matrix (PWM). In whole genome assays, the fraction of known sites identified by the two models was similar, suggesting that there was little to be gained from including chromatin modification data. In contrast, there were highly significant and biologically relevant differences between the two models in the location of the predicted binding sites relative to genes, in the function of nearby genes and in the responsiveness of nearby genes to p53 activation. We propose that these contradictory results can be explained by PWM and ChIP data reflecting primarily biophysical properties of protein–DNA interactions, whereas chromatin modification data capture biologically important functional information.
Collapse
Affiliation(s)
- Ji-Hyun Lim
- Sir Harold Mitchell Building, School of Biology, University of St Andrews, St Andrews, Fife, KY16 9TH, UK
| | | | | |
Collapse
|
6
|
Sahu TK, Rao AR, Vasisht S, Singh N, Singh UP. Computational approaches, databases and tools for in silico motif discovery. Interdiscip Sci 2012; 4:239-255. [PMID: 23354813 DOI: 10.1007/s12539-012-0141-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2011] [Revised: 04/12/2012] [Accepted: 06/13/2012] [Indexed: 06/01/2023]
Abstract
Motifs are the biologically significant fragments of nucleotide or peptide sequences in a specific pattern. Motifs are categorized as structural motifs and sequence motifs. These are discovered by phylogenetic studies of similar genes across species. Structural motifs are formed by three dimensional arrangements of amino acids consisting of two or more α helices or β strands whereas sequence motifs are formed by the nucleotide fragments appearing in the exons of a gene. The arrangement of residues in structural motifs may not be continuous while it is continuous in sequence motifs. Sequence motifs may encode to the structural motifs. The algorithms used for motif discovery are important part of the bio-computational studies. The purpose of motif discovery is to identify patterns in biopolymer (nucleotide or protein) sequences to understand the structure and function of the molecules and their evolutionary aspects. The main aim of this paper is to provide systematic compilation of a review on different approaches, databases and tools used in motif discovery.
Collapse
Affiliation(s)
- Tanmaya Kumar Sahu
- Centre for Agricultural Bioinformatics, Indian Agricultural Statistics Research Institute, New Delhi, India
| | | | | | | | | |
Collapse
|
7
|
Federico M, Leoncini M, Montangero M, Valente P. Direct vs 2-stage approaches to structured motif finding. Algorithms Mol Biol 2012; 7:20. [PMID: 22908910 PMCID: PMC3564690 DOI: 10.1186/1748-7188-7-20] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2011] [Accepted: 07/25/2012] [Indexed: 12/03/2022] Open
Abstract
Background The notion of DNA motif is a mathematical abstraction used to model regions of the DNA (known as Transcription Factor Binding Sites, or TFBSs) that are bound by a given Transcription Factor to regulate gene expression or repression. In turn, DNA structured motifs are a mathematical counterpart that models sets of TFBSs that work in concert in the gene regulations processes of higher eukaryotic organisms. Typically, a structured motif is composed of an ordered set of isolated (or simple) motifs, separated by a variable, but somewhat constrained number of “irrelevant” base-pairs. Discovering structured motifs in a set of DNA sequences is a computationally hard problem that has been addressed by a number of authors using either a direct approach, or via the preliminary identification and successive combination of simple motifs. Results We describe a computational tool, named SISMA, for the de-novo discovery of structured motifs in a set of DNA sequences. SISMA is an exact, enumerative algorithm, meaning that it finds all the motifs conforming to the specifications. It does so in two stages: first it discovers all the possible component simple motifs, then combines them in a way that respects the given constraints. We developed SISMA mainly with the aim of understanding the potential benefits of such a 2-stage approach w.r.t. direct methods. In fact, no 2-stage software was available for the general problem of structured motif discovery, but only a few tools that solved restricted versions of the problem. We evaluated SISMA against other published tools on a comprehensive benchmark made of both synthetic and real biological datasets. In a significant number of cases, SISMA outperformed the competitors, exhibiting a good performance also in most of the cases in which it was inferior. Conclusions A reflection on the results obtained lead us to conclude that a 2-stage approach can be implemented with many advantages over direct approaches. Some of these have to do with greater modularity, ease of parallelization, and the possibility to perform adaptive searches of structured motifs. As another consideration, we noted that most hard instances for SISMA were easy to detect in advance. In these cases one may initially opt for a direct method; or, as a viable alternative in most laboratories, one could run both direct and 2-stage tools in parallel, halting the computations when the first halts.
Collapse
|
8
|
Brush GS, Najor NA, Dombkowski AA, Cukovic D, Sawarynski KE. Yeast IME2 functions early in meiosis upstream of cell cycle-regulated SBF and MBF targets. PLoS One 2012; 7:e31575. [PMID: 22393365 PMCID: PMC3290606 DOI: 10.1371/journal.pone.0031575] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2011] [Accepted: 01/13/2012] [Indexed: 01/17/2023] Open
Abstract
Background In Saccharomyces cerevisiae, the G1 cyclin/cyclin-dependent kinase (CDK) complexes Cln1,-2,-3/Cdk1 promote S phase entry during the mitotic cell cycle but do not function during meiosis. It has been proposed that the meiosis-specific protein kinase Ime2, which is required for normal timing of pre-meiotic DNA replication, is equivalent to Cln1,-2/Cdk1. These two CDK complexes directly catalyze phosphorylation of the B-type cyclin/CDK inhibitor Sic1 during the cell cycle to enable its destruction. As a result, Clb5,-6/Cdk1 become activated and facilitate initiation of DNA replication. While Ime2 is required for Sic1 destruction during meiosis, evidence now suggests that Ime2 does not directly catalyze Sic1 phosphorylation to target it for destabilization as Cln1,-2/Cdk1 do during the cell cycle. Methodology/Principal Findings We demonstrated that Sic1 is eventually degraded in meiotic cells lacking the IME2 gene (ime2Δ), supporting an indirect role of Ime2 in Sic1 destruction. We further examined global RNA expression comparing wild type and ime2Δ cells. Analysis of these expression data has provided evidence that Ime2 is required early in meiosis for normal transcription of many genes that are also periodically expressed during late G1 of the cell cycle. Conclusions/Significance Our results place Ime2 at a position in the early meiotic pathway that lies upstream of the position occupied by Cln1,-2/Cdk1 in the analogous cell cycle pathway. Thus, Ime2 may functionally resemble Cln3/Cdk1 in promoting S phase entry, or it could play a role even further upstream in the corresponding meiotic cascade.
Collapse
Affiliation(s)
- George S Brush
- Department of Oncology, Wayne State University School of Medicine, Detroit, Michigan, United States of America.
| | | | | | | | | |
Collapse
|
9
|
Sá-Correia I, Teixeira MC. 2D electrophoresis-based expression proteomics: a microbiologist's perspective. Expert Rev Proteomics 2011; 7:943-53. [PMID: 21142894 DOI: 10.1586/epr.10.76] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Quantitative proteomics based on 2D electrophoresis (2-DE) coupled with peptide mass fingerprinting is still one of the most widely used quantitative proteomics approaches in microbiology research. Our view on the exploitation of this global expression analysis technique and its contribution and potential to push forward the field of molecular microbial physiology towards a molecular systems microbiology perspective is discussed in this article. The advances registered in 2-DE-based quantitative proteomic analysis leading to increased protein resolution, sensitivity and accuracy, and the promising use of 2-DE to gain insights into post-translational modifications at a proteome-wide level (considering all the proteins/protein forms expressed by the genome) are focused on. Given the progress made in this field, it is foreseen that the 2-DE-based approach to quantitative proteomics will continue to be a fundamental tool for microbiologists working at a genome-wide scale. Guidelines are also provided for the exploitation of expression proteomics data, based on useful computational tools, and for the integration of these data with other genome-wide expression information. The advantages and limitations of a complete 2-DE-based expression proteomics analysis, envisaging the quantification of the global changes occurring in the proteome of a given cell depending on environmental or genetic manipulations, are discussed from the microbiologist's perspective. Particular focus is given to the emerging field of toxicoproteomics, a new systems toxicity approach that offers a powerful tool to directly monitor the earliest stages of the toxicological response by identifying critical proteins and pathways that are affected by, and respond to, a chemical stress. The experimental design and the bioinformatics analysis of data used in our laboratory to gain mechanistic insights through expression proteomics into the responses of the eukaryotic model Saccharomyces cerevisiae or of Pseudomonas strains to environmental toxicants are presented as case studies.
Collapse
Affiliation(s)
- Isabel Sá-Correia
- Institute for Biotechnology and Bioengineering, Biological Sciences Research Group, Centro de Engenharia Biológica e Química, Instituto Superior Técnico, Technical University of Lisbon, Av. Rovisco Pais, Lisbon, Portugal.
| | | |
Collapse
|
10
|
Tauber E, Miller-Fleming L, Mason RP, Kwan W, Clapp J, Butler NJ, Outeiro TF, Muchowski PJ, Giorgini F. Functional gene expression profiling in yeast implicates translational dysfunction in mutant huntingtin toxicity. J Biol Chem 2010; 286:410-9. [PMID: 21044956 PMCID: PMC3012999 DOI: 10.1074/jbc.m110.101527] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Huntington disease (HD) is a neurodegenerative disorder caused by the expansion of a polyglutamine tract in the huntingtin (htt) protein. To uncover candidate therapeutic targets and networks involved in pathogenesis, we integrated gene expression profiling and functional genetic screening to identify genes critical for mutant htt toxicity in yeast. Using mRNA profiling, we have identified genes differentially expressed in wild-type yeast in response to mutant htt toxicity as well as in three toxicity suppressor strains: bna4Δ, mbf1Δ, and ume1Δ. BNA4 encodes the yeast homolog of kynurenine 3-monooxygenase, a promising drug target for HD. Intriguingly, despite playing diverse cellular roles, these three suppressors share common differentially expressed genes involved in stress response, translation elongation, and mitochondrial transport. We then systematically tested the ability of the differentially expressed genes to suppress mutant htt toxicity when overexpressed and have thereby identified 12 novel suppressors, including genes that play a role in stress response, Golgi to endosome transport, and rRNA processing. Integrating the mRNA profiling data and the genetic screening data, we have generated a robust network that shows enrichment in genes involved in rRNA processing and ribosome biogenesis. Strikingly, these observations implicate dysfunction of translation in the pathology of HD. Recent work has shown that regulation of translation is critical for life span extension in Drosophila and that manipulation of this process is protective in Parkinson disease models. In total, these observations suggest that pharmacological manipulation of translation may have therapeutic value in HD.
Collapse
Affiliation(s)
- Eran Tauber
- Department of Genetics, University of Leicester, Leicester LE1 7RH, United Kingdom
| | | | | | | | | | | | | | | | | |
Collapse
|
11
|
|
12
|
Monteiro PT, Mendes ND, Teixeira MC, d'Orey S, Tenreiro S, Mira NP, Pais H, Francisco AP, Carvalho AM, Lourenço AB, Sá-Correia I, Oliveira AL, Freitas AT. YEASTRACT-DISCOVERER: new tools to improve the analysis of transcriptional regulatory associations in Saccharomyces cerevisiae. Nucleic Acids Res 2007; 36:D132-6. [PMID: 18032429 PMCID: PMC2238916 DOI: 10.1093/nar/gkm976] [Citation(s) in RCA: 122] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
The Yeast search for transcriptional regulators and consensus tracking (YEASTRACT) information system (www.yeastract.com) was developed to support the analysis of transcription regulatory associations in Saccharomyces cerevisiae. Last updated in September 2007, this database contains over 30 990 regulatory associations between Transcription Factors (TFs) and target genes and includes 284 specific DNA binding sites for 108 characterized TFs. Computational tools are also provided to facilitate the exploitation of the gathered data when solving a number of biological questions, in particular the ones that involve the analysis of global gene expression results. In this new release, YEASTRACT includes DISCOVERER, a set of computational tools that can be used to identify complex motifs over-represented in the promoter regions of co-regulated genes. The motifs identified are then clustered in families, represented by a position weight matrix and are automatically compared with the known transcription factor binding sites described in YEASTRACT. Additionally, in this new release, it is possible to generate graphic depictions of transcriptional regulatory networks for documented or potential regulatory associations between TFs and target genes. The visual display of these networks of interactions is instrumental in functional studies. Tutorials are available on the system to exemplify the use of all the available tools.
Collapse
Affiliation(s)
- Pedro T Monteiro
- INESC-ID, Knowledge Discovery and Bioinformatics Group, R. Alves Redol, 9, 1000-029 Lisbon, Portugal
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Abstract
BACKGROUND Unraveling the mechanisms that regulate gene expression is a major challenge in biology. An important task in this challenge is to identify regulatory elements, especially the binding sites in deoxyribonucleic acid (DNA) for transcription factors. These binding sites are short DNA segments that are called motifs. Recent advances in genome sequence availability and in high-throughput gene expression analysis technologies have allowed for the development of computational methods for motif finding. As a result, a large number of motif finding algorithms have been implemented and applied to various motif models over the past decade. This survey reviews the latest developments in DNA motif finding algorithms. RESULTS Earlier algorithms use promoter sequences of coregulated genes from single genome and search for statistically overrepresented motifs. Recent algorithms are designed to use phylogenetic footprinting or orthologous sequences and also an integrated approach where promoter sequences of coregulated genes and phylogenetic footprinting are used. All the algorithms studied have been reported to correctly detect the motifs that have been previously detected by laboratory experimental approaches, and some algorithms were able to find novel motifs. However, most of these motif finding algorithms have been shown to work successfully in yeast and other lower organisms, but perform significantly worse in higher organisms. CONCLUSION Despite considerable efforts to date, DNA motif finding remains a complex challenge for biologists and computer scientists. Researchers have taken many different approaches in developing motif discovery tools and the progress made in this area of research is very encouraging. Performance comparison of different motif finding tools and identification of the best tools have proven to be a difficult task because tools are designed based on algorithms and motif models that are diverse and complex and our incomplete understanding of the biology of regulatory mechanism does not always provide adequate evaluation of underlying algorithms over motif models.
Collapse
Affiliation(s)
- Modan K Das
- Computer Science Department, Oklahoma State University, Stillwater, Oklahoma 74078, USA
- USDA-ARS, Department of Plant Sciences, University of Arizona, Tucson, Arizona 85721, USA
| | - Ho-Kwok Dai
- Computer Science Department, Oklahoma State University, Stillwater, Oklahoma 74078, USA
| |
Collapse
|
14
|
Santos PM, Roma V, Benndorf D, von Bergen M, Harms H, Sá-Correia I. Mechanistic Insights Into the Global Response to Phenol in the Phenol-biodegrading StrainPseudomonassp. M1 Revealed by Quantitative Proteomics. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2007; 11:233-51. [PMID: 17883337 DOI: 10.1089/omi.2007.0009] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Quantitative proteomics was used to gain insights into the global adaptive response to phenol in the phenol-biodegrading strain Pseudomonas sp. M1 when an alternative carbon source (pyruvate or succinate) is present. A phylogenetic analysis indicated Pseudomonas citronellolis as the closest species to the environmental strain M1, while P. aeruginosa is the closest species with the genome sequence available. After two-dimensional gel electrophoresis (2-DE) separation, protein identification by MS/MS ion search allowed the assignment of 87 out of 136 selected protein spots, 56 of which matched P. aeruginosa proteins present in databases. Coordinate induction of six enzymes of the phenol catabolic pathway in cells grown in pyruvate and phenol was revealed by expression proteomics. When succinate was the alternative carbon source (C-source), these catabolic proteins were not expressed. The global response of Pseudomonas sp. M1 to phenol-induced stress involved, among others, proteins of the energy metabolism, stress response proteins, and transport proteins. Quantitative and/or qualitative differences were registered in M1 response to different phenol concentrations or to identical phenol concentrations when cells were grown in pyruvate or succinate medium. They were attributed to differences observed in the specific growth rate, in the expression of phenol catabolism, and in resistance to phenol of Pseudomonas sp. M1 grown under different conditions.
Collapse
Affiliation(s)
- Pedro M Santos
- Institute for Biotechnology and Bioengineering, Centre for Biological and Chemical Engineering, Instituto Superior Técnico, Lisboa, Portugal
| | | | | | | | | | | |
Collapse
|
15
|
Li L, Liang Y, Bass RL. GAPWM: a genetic algorithm method for optimizing a position weight matrix. Bioinformatics 2007; 23:1188-94. [PMID: 17341493 DOI: 10.1093/bioinformatics/btm080] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
MOTIVATION Position weight matrices (PMWs) are simple models commonly used in motif-finding algorithms to identify short functional elements, such as cis-regulatory motifs, on genes. When few experimentally verified motifs are available, estimation of the PWM may be poor. The resultant PWM may not reliably discriminate a true motif from a false one. While experimentally identifying such motifs remains time-consuming and expensive, low-resolution binding data from techniques such as ChIP-on-chip and ChIP-PET have become available. We propose a novel but simple method to improve a poorly estimated PWM using ChIP data. METHODOLOGY Starting from an existing PWM, a set of ChIP sequences, and a set of background sequences, our method, GAPWM, derives an improved PWM via a genetic algorithm that maximizes the area under the receiver operating characteristic (ROC) curve. GAPWM can easily incorporate prior information such as base conservation. We tested our method on two PMWs (Oct4/Sox2 and p53) using three recently published ChIP data sets (human Oct4, mouse Oct4 and human p53). RESULTS GAPWM substantially increased the sensitivity/specificity of a poorly estimated PWM and further improved the quality of a good PWM. Furthermore, it still functioned when the starting PWM contained a major error. The ROC performance of GAPWM compared favorably with that of MEME and others. With increasing availability of ChIP data, our method provides an alternative for obtaining high-quality PWMs for genome-wide identification of transcription factor binding sites. AVAILABILITY The C source code and all data used in this report are available at http://dir.niehs.nih.gov/dirbb/gapwm. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Leping Li
- Biostatistics Branch, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA.
| | | | | |
Collapse
|