1
|
Karp PD, Paley S, Caspi R, Kothari A, Krummenacker M, Midford PE, Moore LR, Subhraveti P, Gama-Castro S, Tierrafria VH, Lara P, Muñiz-Rascado L, Bonavides-Martinez C, Santos-Zavaleta A, Mackie A, Sun G, Ahn-Horst TA, Choi H, Juenemann R, Knudsen CNM, Covert MW, Collado-Vides J, Paulsen I. The EcoCyc database (2025). EcoSal Plus 2025:eesp00192024. [PMID: 40304522 DOI: 10.1128/ecosalplus.esp-0019-2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2024] [Accepted: 03/18/2025] [Indexed: 05/02/2025]
Abstract
EcoCyc is a bioinformatics database (DB) available at EcoCyc.org that describes the genome and the biochemical machinery of Escherichia coli K-12 MG1655. The long-term goal of the project was to describe the complete molecular catalog of the E. coli cell, as well as the functions of each of its molecular parts, to facilitate a system-level understanding of E. coli. EcoCyc is an electronic reference source for E. coli biologists and for biologists who work with related microorganisms. The database includes information pages on each E. coli gene product, metabolite, reaction, operon, and metabolic pathway. The database also includes information on the regulation of gene expression, E. coli gene essentiality, and nutrient conditions that do or do not support the growth of E. coli. The website and downloadable software contain tools for the analysis of high-throughput data sets. In addition, a steady-state metabolic flux model is generated from each new version of EcoCyc and can be executed via EcoCyc.org. The model can predict metabolic flux rates, nutrient uptake rates, and growth rates for different gene knockouts and nutrient conditions. Data generated from a whole-cell model that is parameterized from the latest data on EcoCyc is also available. This review outlines the data content of EcoCyc and the procedures by which this content is generated.
Collapse
Affiliation(s)
- Peter D Karp
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Suzanne Paley
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Ron Caspi
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Anamika Kothari
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Markus Krummenacker
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Peter E Midford
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Lisa R Moore
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Pallavi Subhraveti
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Socorro Gama-Castro
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
| | - Víctor H Tierrafria
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
| | - Paloma Lara
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
| | - Luis Muñiz-Rascado
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
| | - César Bonavides-Martinez
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
| | - Alberto Santos-Zavaleta
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
| | - Amanda Mackie
- School of Natural Sciences, Macquarie University, Sydney, New South Wales, Australia
| | - Gwanggyu Sun
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Travis A Ahn-Horst
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Heejo Choi
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Riley Juenemann
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Cyrus N M Knudsen
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Markus W Covert
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Julio Collado-Vides
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
| | - Ian Paulsen
- School of Natural Sciences, Macquarie University, Sydney, New South Wales, Australia
| |
Collapse
|
2
|
Karp PD, Paley S, Caspi R, Kothari A, Krummenacker M, Midford PE, Moore LR, Subhraveti P, Gama-Castro S, Tierrafria VH, Lara P, Muñiz-Rascado L, Bonavides-Martinez C, Santos-Zavaleta A, Mackie A, Sun G, Ahn-Horst TA, Choi H, Covert MW, Collado-Vides J, Paulsen I. The EcoCyc Database (2023). EcoSal Plus 2023; 11:eesp00022023. [PMID: 37220074 PMCID: PMC10729931 DOI: 10.1128/ecosalplus.esp-0002-2023] [Citation(s) in RCA: 44] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 04/04/2023] [Indexed: 01/28/2024]
Abstract
EcoCyc is a bioinformatics database available online at EcoCyc.org that describes the genome and the biochemical machinery of Escherichia coli K-12 MG1655. The long-term goal of the project is to describe the complete molecular catalog of the E. coli cell, as well as the functions of each of its molecular parts, to facilitate a system-level understanding of E. coli. EcoCyc is an electronic reference source for E. coli biologists and for biologists who work with related microorganisms. The database includes information pages on each E. coli gene product, metabolite, reaction, operon, and metabolic pathway. The database also includes information on the regulation of gene expression, E. coli gene essentiality, and nutrient conditions that do or do not support the growth of E. coli. The website and downloadable software contain tools for the analysis of high-throughput data sets. In addition, a steady-state metabolic flux model is generated from each new version of EcoCyc and can be executed online. The model can predict metabolic flux rates, nutrient uptake rates, and growth rates for different gene knockouts and nutrient conditions. Data generated from a whole-cell model that is parameterized from the latest data on EcoCyc are also available. This review outlines the data content of EcoCyc and of the procedures by which this content is generated.
Collapse
Affiliation(s)
- Peter D. Karp
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Suzanne Paley
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Ron Caspi
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Anamika Kothari
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Markus Krummenacker
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Peter E. Midford
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Lisa R. Moore
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Pallavi Subhraveti
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Socorro Gama-Castro
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Victor H. Tierrafria
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Paloma Lara
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Luis Muñiz-Rascado
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - César Bonavides-Martinez
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Alberto Santos-Zavaleta
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Amanda Mackie
- Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, New South Wales, Australia
| | - Gwanggyu Sun
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Travis A. Ahn-Horst
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Heejo Choi
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Markus W. Covert
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Julio Collado-Vides
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Ian Paulsen
- School of Natural Sciences, Macquarie University, Sydney, New South Wales, Australia
| |
Collapse
|
3
|
Reus E, Nielsen MR, Frandsen RJN. Metabolic and regulatory insights from the experimental horizontal gene transfer of the aurofusarin and bikaverin gene clusters to
Aspergillus nidulans. Mol Microbiol 2019; 112:1684-1700. [DOI: 10.1111/mmi.14376] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/18/2019] [Indexed: 12/19/2022]
Affiliation(s)
- Elise Reus
- Department of Biotechnology and Bioengineering Technical University of Denmark Kongens Lyngby Denmark
| | | | | |
Collapse
|
4
|
Karp PD, Ong WK, Paley S, Billington R, Caspi R, Fulcher C, Kothari A, Krummenacker M, Latendresse M, Midford PE, Subhraveti P, Gama-Castro S, Muñiz-Rascado L, Bonavides-Martinez C, Santos-Zavaleta A, Mackie A, Collado-Vides J, Keseler IM, Paulsen I. The EcoCyc Database. EcoSal Plus 2018; 8:10.1128/ecosalplus.ESP-0006-2018. [PMID: 30406744 PMCID: PMC6504970 DOI: 10.1128/ecosalplus.esp-0006-2018] [Citation(s) in RCA: 63] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2018] [Indexed: 01/28/2023]
Abstract
EcoCyc is a bioinformatics database available at EcoCyc.org that describes the genome and the biochemical machinery of Escherichia coli K-12 MG1655. The long-term goal of the project is to describe the complete molecular catalog of the E. coli cell, as well as the functions of each of its molecular parts, to facilitate a system-level understanding of E. coli. EcoCyc is an electronic reference source for E. coli biologists and for biologists who work with related microorganisms. The database includes information pages on each E. coli gene product, metabolite, reaction, operon, and metabolic pathway. The database also includes information on E. coli gene essentiality and on nutrient conditions that do or do not support the growth of E. coli. The website and downloadable software contain tools for analysis of high-throughput data sets. In addition, a steady-state metabolic flux model is generated from each new version of EcoCyc and can be executed via EcoCyc.org. The model can predict metabolic flux rates, nutrient uptake rates, and growth rates for different gene knockouts and nutrient conditions. This review outlines the data content of EcoCyc and of the procedures by which this content is generated.
Collapse
Affiliation(s)
- Peter D Karp
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | - Wai Kit Ong
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | - Suzanne Paley
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | | | - Ron Caspi
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | - Carol Fulcher
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | - Anamika Kothari
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | | | - Mario Latendresse
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | - Peter E Midford
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | | | - Socorro Gama-Castro
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
| | - Luis Muñiz-Rascado
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
| | - César Bonavides-Martinez
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
| | - Alberto Santos-Zavaleta
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
| | - Amanda Mackie
- Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, NSW 2109, Australia
| | - Julio Collado-Vides
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
| | - Ingrid M Keseler
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | - Ian Paulsen
- Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, NSW 2109, Australia
| |
Collapse
|
5
|
Inter-replicon Gene Flow Contributes to Transcriptional Integration in the Sinorhizobium meliloti Multipartite Genome. G3-GENES GENOMES GENETICS 2018; 8:1711-1720. [PMID: 29563186 PMCID: PMC5940162 DOI: 10.1534/g3.117.300405] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Integration of newly acquired genes into existing regulatory networks is necessary for successful horizontal gene transfer (HGT). Ten percent of bacterial species contain at least two DNA replicons over 300 kilobases in size, with the secondary replicons derived predominately through HGT. The Sinorhizobium meliloti genome is split between a 3.7 Mb chromosome, a 1.7 Mb chromid consisting largely of genes acquired through ancient HGT, and a 1.4 Mb megaplasmid consisting primarily of recently acquired genes. Here, RNA-sequencing is used to examine the transcriptional consequences of massive, synthetic genome reduction produced through the removal of the megaplasmid and/or the chromid. Removal of the pSymA megaplasmid influenced the transcription of only six genes. In contrast, removal of the chromid influenced expression of ∼8% of chromosomal genes and ∼4% of megaplasmid genes. This was mediated in part by the loss of the ETR DNA region whose presence on pSymB is due to a translocation from the chromosome. No obvious functional bias among the up-regulated genes was detected, although genes with putative homologs on the chromid were enriched. Down-regulated genes were enriched in motility and sensory transduction pathways. Four transcripts were examined further, and in each case the transcriptional change could be traced to loss of specific pSymB regions. In particularly, a chromosomal transporter was induced due to deletion of bdhA likely mediated through 3-hydroxybutyrate accumulation. These data provide new insights into the evolution of the multipartite bacterial genome, and more generally into the integration of horizontally acquired genes into the transcriptome.
Collapse
|
6
|
Abstract
Lateral gene transfer (LGT) profoundly shapes the evolution of bacterial lineages. LGT across disparate phylogenetic groups and genome content diversity between related organisms suggest a model of bacterial evolution that views LGT as rampant and promiscuous. It has even driven the argument that species concepts and tree-based phylogenetics cannot be applied to bacteria. Here, we show that acquisition and retention of genes through LGT are surprisingly rare in the ubiquitous and biomedically important bacterial genus Streptomyces Using a molecular clock, we estimate that the Streptomyces bacteria are ~380 million years old, indicating that this bacterial genus is as ancient as land vertebrates. Calibrating LGT rate to this geologic time span, we find that on average only 10 genes per million years were acquired and subsequently maintained. Over that same time span, Streptomyces accumulated thousands of point mutations. By explicitly incorporating evolutionary timescale into our analyses, we provide a dramatically different view on the dynamics of LGT and its impact on bacterial evolution.IMPORTANCE Tree-based phylogenetics and the use of species as units of diversity lie at the foundation of modern biology. In bacteria, these pillars of evolutionary theory have been called into question due to the observation of thousands of lateral gene transfer (LGT) events within and between lineages. Here, we show that acquisition and retention of genes through LGT are exceedingly rare in the bacterial genus Streptomyces, with merely one gene acquired in Streptomyces lineages every 100,000 years. These findings stand in contrast to the current assumption of rampant genetic exchange, which has become the dominant hypothesis used to explain bacterial diversity. Our results support a more nuanced understanding of genetic exchange, with LGT impacting evolution over short timescales but playing a significant role over long timescales. Deeper understanding of LGT provides new insight into the evolutionary history of life on Earth, as the vast majority of this history is microbial.
Collapse
|
7
|
Kurata T, Nakanishi S, Hashimoto M, Taoka M, Yamazaki Y, Isobe T, Kato JI. Novel essential gene Involved in 16S rRNA processing in Escherichia coli. J Mol Biol 2014; 427:955-965. [PMID: 25545592 DOI: 10.1016/j.jmb.2014.12.013] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2014] [Revised: 12/15/2014] [Accepted: 12/16/2014] [Indexed: 12/21/2022]
Abstract
Biogenesis of ribosomes is a complex process mediated by many factors. While its transcription proceeds, ribosomal RNA (rRNA) folds itself into a characteristic three-dimensional structure through interaction with ribosomal proteins, during which its ends are processed. Here, we show that the essential protein YqgF, a RuvC family protein with an RNase-H-like motif, is involved in the processing of pre-16S rRNA during ribosome maturation. Indeed, pre-16S rRNA accumulated in cells of a temperature-sensitive yqgF mutant (yqgF(ts)) cultured at a non-permissive temperature. In addition, purified YqgF was shown to process the 5' end of pre-16S rRNA within 70S ribosomes in vitro. Mass spectrometry analysis of the total proteins in the yqgF(ts) mutant cells showed that the expression of genes containing multiple Shine-Dalgarno-like sequences was observed to be lower than in wild type. These results are interpreted to indicate that YqgF is involved in a novel enzymic activity necessary for the processing of pre-16S rRNA, thereby affecting elongation of translation.
Collapse
Affiliation(s)
- Tatsuaki Kurata
- Department of Biological Sciences, Graduate Schools of Science and Engineering, Tokyo Metropolitan University, Tokyo 192-0397, Japan
| | - Shinobu Nakanishi
- Department of Biological Sciences, Graduate Schools of Science and Engineering, Tokyo Metropolitan University, Tokyo 192-0397, Japan
| | - Masayuki Hashimoto
- Institute of Molecular Medicine, Medical College, National Cheng-Kung University, Tainan City 704, Taiwan; Infectious Disease and Signal Transduction Center, Medical College, National Cheng-Kung University, Tainan City 704, Taiwan
| | - Masato Taoka
- Department of Chemistry, Graduate Schools of Science and Engineering, Tokyo Metropolitan University, Tokyo 192-0397, Japan
| | - Yukiko Yamazaki
- Genetic Informatics Laboratory, Genetic Strains Research Center, National Institute of Genetics, Shizuoka 411-8540, Japan
| | - Toshiaki Isobe
- Department of Chemistry, Graduate Schools of Science and Engineering, Tokyo Metropolitan University, Tokyo 192-0397, Japan
| | - Jun-Ichi Kato
- Department of Biological Sciences, Graduate Schools of Science and Engineering, Tokyo Metropolitan University, Tokyo 192-0397, Japan.
| |
Collapse
|
8
|
Efficient and cost effective production of active-form human PKB using silkworm larvae. Sci Rep 2014; 4:6016. [PMID: 25125290 PMCID: PMC4133716 DOI: 10.1038/srep06016] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2014] [Accepted: 07/23/2014] [Indexed: 01/01/2023] Open
Abstract
Protein kinase B (PKB) also known as Akt is involved in many signal transduction pathways. As alterations of the PKB pathway are found in a number of human malignancies, PKB is considered an important drug target for cancer therapy. However, production of sufficient amounts of active PKB for biochemical and structural studies is very costly because of the necessity of using a higher organism expression system to obtain phosphorylated PKB. Here, we report efficient production of active PKBα using the BmNPV bacmid expression system with silkworm larvae. Following direct injection of bacmid DNA, recombinant PKBα protein was highly expressed in the fat bodies of larvae, and could be purified using a GST-tag and then cleaved. A final yield of approximately 1 mg PKBα/20 larvae was recorded. Kinase assays showed that the recombinant PKBα possessed high phosphorylation activity. We further confirmed phosphorylation on the activation loop by mass spectrometric analysis. Our results indicate that the silkworm expression system is of value for preparation of active-form PKBα with phosphorylation on the activation loop. This efficient production of the active protein will facilitate further biochemical and structural studies and stimulate subsequent drug development.
Collapse
|
9
|
Pelchovich G, Nadejda S, Dana A, Tuller T, Bravo IG, Gophna U. Ribosomal mutations affecting the translation of genes that use non-optimal codons. FEBS J 2014; 281:3701-18. [PMID: 24966114 DOI: 10.1111/febs.12892] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2014] [Revised: 06/10/2014] [Accepted: 06/23/2014] [Indexed: 01/26/2023]
Abstract
Genes that are laterally acquired by a new host species often contain codons that are non-optimal to the tRNA repertoire of the new host, which may lead to insufficient translational levels. Inefficient translation can be overcome by different mechanisms, such as incremental amelioration of the coding sequence, compensatory mutations in the regulatory sequences leading to increased transcription or increase in gene copy number. However, there is also a possibility that ribosomal mutations can improve the expression of such genes. To test this hypothesis, we examined the effects of point mutations in the endogenous ribosomal proteins S12 and S5 in Escherichia coli, which are known to be involved in the decoding of the mRNA, on the efficiency of translation of exogenous genes that use non-optimal codons, in vivo. We show that an S12 mutant in E. coli is able to express exogenous genes, with non-optimal codons, to higher levels than the wild-type, and explore the mechanisms underlying this phenomenon in this mutant. Our results suggest that the transient emergence of mutants that allow efficient expression of exogenous genes with non-optimal codons could also increase the chances of fixation of laterally transferred genes.
Collapse
Affiliation(s)
- Gidi Pelchovich
- Department of Molecular Microbiology and Biotechnology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Israel
| | | | | | | | | | | |
Collapse
|
10
|
Karp PD, Weaver D, Paley S, Fulcher C, Kubo A, Kothari A, Krummenacker M, Subhraveti P, Weerasinghe D, Gama-Castro S, Huerta AM, Muñiz-Rascado L, Bonavides-Martinez C, Weiss V, Peralta-Gil M, Santos-Zavaleta A, Schröder I, Mackie A, Gunsalus R, Collado-Vides J, Keseler IM, Paulsen I. The EcoCyc Database. EcoSal Plus 2014; 6:10.1128/ecosalplus.ESP-0009-2013. [PMID: 26442933 PMCID: PMC4243172 DOI: 10.1128/ecosalplus.esp-0009-2013] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2014] [Indexed: 11/20/2022]
Abstract
EcoCyc is a bioinformatics database available at EcoCyc.org that describes the genome and the biochemical machinery of Escherichia coli K-12 MG1655. The long-term goal of the project is to describe the complete molecular catalog of the E. coli cell, as well as the functions of each of its molecular parts, to facilitate a system-level understanding of E. coli. EcoCyc is an electronic reference source for E. coli biologists and for biologists who work with related microorganisms. The database includes information pages on each E. coli gene, metabolite, reaction, operon, and metabolic pathway. The database also includes information on E. coli gene essentiality and on nutrient conditions that do or do not support the growth of E. coli. The website and downloadable software contain tools for analysis of high-throughput data sets. In addition, a steady-state metabolic flux model is generated from each new version of EcoCyc. The model can predict metabolic flux rates, nutrient uptake rates, and growth rates for different gene knockouts and nutrient conditions. This review provides a detailed description of the data content of EcoCyc and of the procedures by which this content is generated.
Collapse
Affiliation(s)
- Peter D Karp
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | - Daniel Weaver
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | - Suzanne Paley
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | - Carol Fulcher
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | - Aya Kubo
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | - Anamika Kothari
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | | | | | | | - Socorro Gama-Castro
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
| | - Araceli M Huerta
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
| | - Luis Muñiz-Rascado
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
| | - César Bonavides-Martinez
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
| | - Verena Weiss
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
| | - Martin Peralta-Gil
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
| | - Alberto Santos-Zavaleta
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
| | - Imke Schröder
- Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, CA 90095
- UCLA Institute of Genomics and Proteomics, University of California, Los Angeles, CA 90095
| | - Amanda Mackie
- Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, NSW 2109, Australia
| | - Robert Gunsalus
- Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, CA 90095
| | - Julio Collado-Vides
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, A.P. 565-A, Cuernavaca, Morelos 62100, México
| | - Ingrid M Keseler
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025
| | - Ian Paulsen
- Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, NSW 2109, Australia
| |
Collapse
|
11
|
Wang Y, Huang KY, Huo Y. Proteomic comparison between Salmonella Typhimurium and Salmonella Typhi. J Microbiol 2014; 52:71-6. [PMID: 24390840 DOI: 10.1007/s12275-014-3204-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2013] [Revised: 07/30/2013] [Accepted: 08/05/2013] [Indexed: 11/26/2022]
Abstract
The genus Salmonella contains more than 2500 serovars. While most cause the self-limiting gastroenteritis, a few serovars can elicit typhoid fever, a severe systemic infection. S. enterica subsp. enterica serovar Typhimurium and S. Typhi are the representatives of the gastroenteritis and typhoid fever types of Salmonella. In this study, we adopted Stable Isotope Labeling with Amino acids in Cell culture (SILAC) technology to quantitatively compare the proteomes of the two serovars. We found several proteins with serovar-specific expression, which could be developed as new biomarkers for clinical serotype diagnosis. We found that flagella and chemotaxis genes were down-regulated in S. Typhi in comparison with S. Typhimurium. We attributed this observation to the fact that the smooth cellular structure of S. Typhi may better fit its systemic lifestyle. Instead of known virulence factors that were located within Salmonella Pathogenecity Islands, a number of core genes, which were involved in metabolism and transport of carbohydrates and amino acids, showed differential expression between the two serovars. Further studies on the roles of these differentially-expressed genes in the pathogenesis should be undertaken.
Collapse
Affiliation(s)
- Yue Wang
- Department of Gynecology and Obstetrics, Women's Hospital School of Medicine Zhejiang University, Hangzhou, P. R. China
| | | | | |
Collapse
|
12
|
SUN JS, HAHN TW. Comparative Proteomic Analysis of Salmonella enterica Serovars Enteritidis, Typhimurium and Gallinarum. J Vet Med Sci 2012; 74:285-91. [DOI: 10.1292/jvms.11-0366] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Affiliation(s)
- Ji-Sun SUN
- College of Veterinary Medicine and Institute of Veterinary Science, Kangwon National University
| | - Tae-Wook HAHN
- College of Veterinary Medicine and Institute of Veterinary Science, Kangwon National University
| |
Collapse
|
13
|
Characterizing the Escherichia coli O157:H7 proteome including protein associations with higher order assemblies. PLoS One 2011; 6:e26554. [PMID: 22087229 PMCID: PMC3210124 DOI: 10.1371/journal.pone.0026554] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2011] [Accepted: 09/28/2011] [Indexed: 11/19/2022] Open
Abstract
Background The recent outbreak of severe infections with Shiga toxin (Stx) producing Escherichia coli (STEC) serotype O104:H4 highlights the need to understand horizontal gene transfer among E. coli strains, identify novel virulence factors and elucidate their pathogenesis. Quantitative shotgun proteomics can contribute to such objectives, allowing insights into the part of the genome translated into proteins and the connectivity of biochemical pathways and higher order assemblies of proteins at the subcellular level. Methodology/Principal Findings We examined protein profiles in cell lysate fractions of STEC strain 86-24 (serotype O157:H7), following growth in cell culture or bacterial isolation from intestines of infected piglets, in the context of functionally and structurally characterized biochemical pathways of E. coli. Protein solubilization in the presence of Triton X-100, EDTA and high salt was followed by size exclusion chromatography into the approximate Mr ranges greater than 280 kDa, 280-80 kDa and 80-10 kDa. Peptide mixtures resulting from these and the insoluble fraction were analyzed by quantitative 2D-LC-nESI-MS/MS. Of the 2521 proteins identified at a 1% false discovery rate, representing 47% of all predicted E. coli O157:H7 gene products, the majority of integral membrane proteins were enriched in the high Mr fraction. Hundreds of proteins were enriched in a Mr range higher than that predicted for a monomer supporting their participation in protein complexes. The insoluble STEC fraction revealed enrichment of aggregation-prone proteins, including many that are part of large structure/function entities such as the ribosome, cytoskeleton and O-antigen biosynthesis cluster. Significance Nearly all E. coli O157:H7 proteins encoded by prophage regions were expressed at low abundance levels or not detected. Comparative quantitative analyses of proteins from distinct cell lysate fractions allowed us to associate uncharacterized proteins with membrane attachment, potential participation in stable protein complexes, and susceptibility to aggregation as part of larger structural assemblies.
Collapse
|
14
|
Rabhi M, Espéli O, Schwartz A, Cayrol B, Rahmouni AR, Arluison V, Boudvillain M. The Sm-like RNA chaperone Hfq mediates transcription antitermination at Rho-dependent terminators. EMBO J 2011; 30:2805-16. [PMID: 21673658 DOI: 10.1038/emboj.2011.192] [Citation(s) in RCA: 84] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2010] [Accepted: 05/26/2011] [Indexed: 11/09/2022] Open
Abstract
In Escherichia coli, the essential motor protein Rho promotes transcription termination in a tightly controlled manner that is not fully understood. Here, we show that the general post-transcriptional regulatory protein Hfq associates with Rho to regulate Rho function. The Hfq:Rho complex can be further stabilized by RNA bridging both factors in a configuration that inhibits the ATP hydrolysis and duplex unwinding activities of Rho and that mediates transcription antitermination at Rho-dependent terminators in vitro and in vivo. Antitermination at a prototypical terminator (λtR1) requires Hfq binding to an A/U-rich transcript region directly upstream from the terminator. Antitermination is modulated by trans-acting factors (NusG or nucleic acid competitors) that affect Hfq association with Rho or RNA. These data unveil a new Hfq function and a novel transcription regulatory mechanism with potentially important implications for bacterial RNA metabolism, gene silencing, and pathogenicity.
Collapse
Affiliation(s)
- Makhlouf Rabhi
- Centre de Biophysique Moléculaire, CNRS UPR4301, rue Charles Sadron, Orléans Cedex 2, France
| | | | | | | | | | | | | |
Collapse
|
15
|
Tuller T, Girshovich Y, Sella Y, Kreimer A, Freilich S, Kupiec M, Gophna U, Ruppin E. Association between translation efficiency and horizontal gene transfer within microbial communities. Nucleic Acids Res 2011; 39:4743-55. [PMID: 21343180 PMCID: PMC3113575 DOI: 10.1093/nar/gkr054] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Horizontal gene transfer (HGT) is a major force in microbial evolution. Previous studies have suggested that a variety of factors, including restricted recombination and toxicity of foreign gene products, may act as barriers to the successful integration of horizontally transferred genes. This study identifies an additional central barrier to HGT-the lack of co-adaptation between the codon usage of the transferred gene and the tRNA pool of the recipient organism. Analyzing the genomic sequences of more than 190 microorganisms and the HGT events that have occurred between them, we show that the number of genes that were horizontally transferred between organisms is positively correlated with the similarity between their tRNA pools. Those genes that are better adapted to the tRNA pools of the target genomes tend to undergo more frequent HGT. At the community (or environment) level, organisms that share a common ecological niche tend to have similar tRNA pools. These results remain significant after controlling for diverse ecological and evolutionary parameters. Our analysis demonstrates that there are bi-directional associations between the similarity in the tRNA pools of organisms and the number of HGT events occurring between them. Similar tRNA pools between a donor and a host tend to increase the probability that a horizontally acquired gene will become fixed in its new genome. Our results also suggest that frequent HGT may be a homogenizing force that increases the similarity in the tRNA pools of organisms within the same community.
Collapse
Affiliation(s)
- Tamir Tuller
- Faculty of Mathematics and Computer Science, Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Blavatnik School of Computer Science, School of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel
| | | | | | | | | | | | | | | |
Collapse
|
16
|
Zhou F, Cardoza JD, Ficarro SB, Adelmant GO, Lazaro JB, Marto JA. Online nanoflow RP-RP-MS reveals dynamics of multicomponent Ku complex in response to DNA damage. J Proteome Res 2010; 9:6242-55. [PMID: 20873769 PMCID: PMC2997187 DOI: 10.1021/pr1004696] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Tandem affinity purification (TAP) coupled with mass spectrometry has become the technique of choice for characterization of multicomponent protein complexes. While current TAP protocols routinely provide high yield and specificity for proteins expressed under physiologically relevant conditions, analytical figures of merit required for efficient and in-depth LC-MS analysis remain unresolved. Here we implement a multidimensional chromatography platform, based on two stages of reversed-phase (RP) separation operated at high and low pH, respectively. We compare performance metrics for RP-RP and SCX-RP for the analysis of complex peptide mixtures derived from cell lysate, as well as protein complexes purified via TAP. Our data reveal that RP-RP fractionation outperforms SCX-RP primarily due to increased peak capacity in the first dimension separation. We integrate this system with miniaturized LC assemblies to achieve true online fractionation at low (≤5 nL/min) effluent flow rates. Stable isotope labeling is used to monitor the dynamics of the multicomponent Ku protein complex in response to DNA damage induced by γ radiation.
Collapse
Affiliation(s)
- Feng Zhou
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School
| | - Job D. Cardoza
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School
| | - Scott B. Ficarro
- Department of Cancer Biology and Blais Proteomics Center, Dana-Farber Cancer Institute
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School
| | - Guillaume O. Adelmant
- Department of Cancer Biology and Blais Proteomics Center, Dana-Farber Cancer Institute
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School
| | | | - Jarrod A. Marto
- Department of Cancer Biology and Blais Proteomics Center, Dana-Farber Cancer Institute
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School
| |
Collapse
|
17
|
Soares MR, Facincani AP, Ferreira RM, Moreira LM, de Oliveira JC, Ferro JA, Ferro MI, Meneghini R, Gozzo FC. Proteome of the phytopathogen Xanthomonas citri subsp. citri: a global expression profile. Proteome Sci 2010; 8:55. [PMID: 21062441 PMCID: PMC2996358 DOI: 10.1186/1477-5956-8-55] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2010] [Accepted: 11/09/2010] [Indexed: 01/28/2023] Open
Abstract
Background Citrus canker is a disease caused by Xantomonas citri subsp.citri (Xac), and has emerged as one of the major threats to the worldwide citrus crop because it affects all commercial citrus varieties, decreases the production and quality of the fruits and can spread rapidly in citrus growing areas. In this work, the first proteome of Xac was analyzed using two methodologies, two-dimensional liquid chromatography (2D LC) and tandem mass spectrometry (MS/MS). Results In order to gain insight into the metabolism of Xac, cells were grown on two different media (NB - Nutrient Broth and TSE - Tryptone Sucrose broth enriched with glutamic acid), and proteins were proteolyzed with trypsin and examined by 2D LC-MS/MS. Approximately 39% of all predicted proteins by annotation of Xac were identified with their component peptides unambiguously assigned to tandem mass spectra. The proteins, about 1,100, were distributed in all annotated functional categories. Conclusions This is the first proteomic reference map for the most aggressive strain of Xanthomonas pathogen of all orange varieties. The compilation of metabolic pathways involved with bacterial growth showed that Xac expresses a complete central and intermediary metabolism, replication, transcription and translation machineries and regulation factors, distinct membrane transporters (ABC, MFS and pumps) and receptors (MCP, TonB dependent and metabolites acquisition), two-component systems (sensor and regulatory components) and response regulators. These data corroborate the growth curve in vitro and are the first reports indicating that many of these genome annotated genes are translated into operative in Xac. This proteomic analysis also provided information regarding the influence of culture medium on growth and protein expression of Xac.
Collapse
Affiliation(s)
- Márcia R Soares
- Laboratório Nacional de Luz Sincrotron (LNLS), Campinas, SP, Brazil.
| | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Babu MM. Early Career Research Award Lecture. Structure, evolution and dynamics of transcriptional regulatory networks. Biochem Soc Trans 2010; 38:1155-78. [PMID: 20863280 DOI: 10.1042/bst0381155] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
The availability of entire genome sequences and the wealth of literature on gene regulation have enabled researchers to model an organism's transcriptional regulation system in the form of a network. In such a network, TFs (transcription factors) and TGs (target genes) are represented as nodes and regulatory interactions between TFs and TGs are represented as directed links. In the present review, I address the following topics pertaining to transcriptional regulatory networks. (i) Structure and organization: first, I introduce the concept of networks and discuss our understanding of the structure and organization of transcriptional networks. (ii) Evolution: I then describe the different mechanisms and forces that influence network evolution and shape network structure. (iii) Dynamics: I discuss studies that have integrated information on dynamics such as mRNA abundance or half-life, with data on transcriptional network in order to elucidate general principles of regulatory network dynamics. In particular, I discuss how cell-to-cell variability in the expression level of TFs could permit differential utilization of the same underlying network by distinct members of a genetically identical cell population. Finally, I conclude by discussing open questions for future research and highlighting the implications for evolution, development, disease and applications such as genetic engineering.
Collapse
Affiliation(s)
- M Madan Babu
- MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 0QH, UK.
| |
Collapse
|
19
|
Müller SA, Kohajda T, Findeiss S, Stadler PF, Washietl S, Kellis M, von Bergen M, Kalkhof S. Optimization of parameters for coverage of low molecular weight proteins. Anal Bioanal Chem 2010; 398:2867-81. [PMID: 20803007 PMCID: PMC2990009 DOI: 10.1007/s00216-010-4093-x] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2010] [Revised: 07/26/2010] [Accepted: 08/03/2010] [Indexed: 11/16/2022]
Abstract
Proteins with molecular weights of <25 kDa are involved in major biological processes such as ribosome formation, stress adaption (e.g., temperature reduction) and cell cycle control. Despite their importance, the coverage of smaller proteins in standard proteome studies is rather sparse. Here we investigated biochemical and mass spectrometric parameters that influence coverage and validity of identification. The underrepresentation of low molecular weight (LMW) proteins may be attributed to the low numbers of proteolytic peptides formed by tryptic digestion as well as their tendency to be lost in protein separation and concentration/desalting procedures. In a systematic investigation of the LMW proteome of Escherichia coli, a total of 455 LMW proteins (27% of the 1672 listed in the SwissProt protein database) were identified, corresponding to a coverage of 62% of the known cytosolic LMW proteins. Of these proteins, 93 had not yet been functionally classified, and five had not previously been confirmed at the protein level. In this study, the influences of protein extraction (either urea or TFA), proteolytic digestion (solely, and the combined usage of trypsin and AspN as endoproteases) and protein separation (gel- or non-gel-based) were investigated. Compared to the standard procedure based solely on the use of urea lysis buffer, in-gel separation and tryptic digestion, the complementary use of TFA for extraction or endoprotease AspN for proteolysis permits the identification of an extra 72 (32%) and 51 proteins (23%), respectively. Regarding mass spectrometry analysis with an LTQ Orbitrap mass spectrometer, collision-induced fragmentation (CID and HCD) and electron transfer dissociation using the linear ion trap (IT) or the Orbitrap as the analyzer were compared. IT-CID was found to yield the best identification rate, whereas IT-ETD provided almost comparable results in terms of LMW proteome coverage. The high overlap between the proteins identified with IT-CID and IT-ETD allowed the validation of 75% of the identified proteins using this orthogonal fragmentation technique. Furthermore, a new approach to evaluating and improving the completeness of protein databases that utilizes the program RNAcode was introduced and examined.
Collapse
Affiliation(s)
- Stephan A Müller
- Department of Proteomics, UFZ, Helmholtz-Centre for Environmental Research, Permoserstraße 15, 04318 Leipzig, Germany
| | | | | | | | | | | | | | | |
Collapse
|
20
|
Taoka M, Ikumi M, Nakayama H, Masaki S, Matsuda R, Nobe Y, Yamauchi Y, Takeda J, Takahashi N, Isobe T. In-Gel Digestion for Mass Spectrometric Characterization of RNA from Fluorescently Stained Polyacrylamide Gels. Anal Chem 2010; 82:7795-803. [DOI: 10.1021/ac101623j] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Affiliation(s)
- Masato Taoka
- Department of Chemistry, Graduate School of Sciences and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan, Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Sanbancho 5, Chiyoda-ku, Tokyo 102-0075, Japan, Biomolecular Characterization Team, RIKEN Advanced Science Institute, Hirosawa 2-1, Wako, Saitama 351-0198, Japan, and Department of Biotechnology, United Graduate School of Agriculture, Tokyo University of
| | - Maki Ikumi
- Department of Chemistry, Graduate School of Sciences and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan, Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Sanbancho 5, Chiyoda-ku, Tokyo 102-0075, Japan, Biomolecular Characterization Team, RIKEN Advanced Science Institute, Hirosawa 2-1, Wako, Saitama 351-0198, Japan, and Department of Biotechnology, United Graduate School of Agriculture, Tokyo University of
| | - Hiroshi Nakayama
- Department of Chemistry, Graduate School of Sciences and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan, Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Sanbancho 5, Chiyoda-ku, Tokyo 102-0075, Japan, Biomolecular Characterization Team, RIKEN Advanced Science Institute, Hirosawa 2-1, Wako, Saitama 351-0198, Japan, and Department of Biotechnology, United Graduate School of Agriculture, Tokyo University of
| | - Shunpei Masaki
- Department of Chemistry, Graduate School of Sciences and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan, Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Sanbancho 5, Chiyoda-ku, Tokyo 102-0075, Japan, Biomolecular Characterization Team, RIKEN Advanced Science Institute, Hirosawa 2-1, Wako, Saitama 351-0198, Japan, and Department of Biotechnology, United Graduate School of Agriculture, Tokyo University of
| | - Ryozo Matsuda
- Department of Chemistry, Graduate School of Sciences and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan, Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Sanbancho 5, Chiyoda-ku, Tokyo 102-0075, Japan, Biomolecular Characterization Team, RIKEN Advanced Science Institute, Hirosawa 2-1, Wako, Saitama 351-0198, Japan, and Department of Biotechnology, United Graduate School of Agriculture, Tokyo University of
| | - Yuko Nobe
- Department of Chemistry, Graduate School of Sciences and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan, Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Sanbancho 5, Chiyoda-ku, Tokyo 102-0075, Japan, Biomolecular Characterization Team, RIKEN Advanced Science Institute, Hirosawa 2-1, Wako, Saitama 351-0198, Japan, and Department of Biotechnology, United Graduate School of Agriculture, Tokyo University of
| | - Yoshio Yamauchi
- Department of Chemistry, Graduate School of Sciences and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan, Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Sanbancho 5, Chiyoda-ku, Tokyo 102-0075, Japan, Biomolecular Characterization Team, RIKEN Advanced Science Institute, Hirosawa 2-1, Wako, Saitama 351-0198, Japan, and Department of Biotechnology, United Graduate School of Agriculture, Tokyo University of
| | - Jun Takeda
- Department of Chemistry, Graduate School of Sciences and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan, Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Sanbancho 5, Chiyoda-ku, Tokyo 102-0075, Japan, Biomolecular Characterization Team, RIKEN Advanced Science Institute, Hirosawa 2-1, Wako, Saitama 351-0198, Japan, and Department of Biotechnology, United Graduate School of Agriculture, Tokyo University of
| | - Nobuhiro Takahashi
- Department of Chemistry, Graduate School of Sciences and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan, Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Sanbancho 5, Chiyoda-ku, Tokyo 102-0075, Japan, Biomolecular Characterization Team, RIKEN Advanced Science Institute, Hirosawa 2-1, Wako, Saitama 351-0198, Japan, and Department of Biotechnology, United Graduate School of Agriculture, Tokyo University of
| | - Toshiaki Isobe
- Department of Chemistry, Graduate School of Sciences and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan, Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Sanbancho 5, Chiyoda-ku, Tokyo 102-0075, Japan, Biomolecular Characterization Team, RIKEN Advanced Science Institute, Hirosawa 2-1, Wako, Saitama 351-0198, Japan, and Department of Biotechnology, United Graduate School of Agriculture, Tokyo University of
| |
Collapse
|
21
|
Iwasaki M, Miwa S, Ikegami T, Tomita M, Tanaka N, Ishihama Y. One-dimensional capillary liquid chromatographic separation coupled with tandem mass spectrometry unveils the Escherichia coli proteome on a microarray scale. Anal Chem 2010; 82:2616-20. [PMID: 20222674 DOI: 10.1021/ac100343q] [Citation(s) in RCA: 118] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We successfully identified the proteome expressed in Escherichia coli (E. coli) cells on a microarray scale using one-dimensional capillary liquid chromatography-tandem mass spectrometry (LC-MS/MS) with a 350 cm long, 100 microm i. d., monolithic silica-C(18) capillary column. E. coli tryptic digest (4 microg) was injected onto the column, and a 41 h gradient was applied with a flow rate of 500 nL/min at less than 20 MPa. In total, 22,196 nonredundant tryptic peptides from 2602 proteins, including 830 membrane proteins, were identified from the E. coli cells (triplicate analysis), in which an equivalent number of genes was detected by transcriptome analysis. Approximately a 5-fold larger peak response on average was obtained in this system, compared with that obtained by conventional capillary LC-MS/MS analysis with a 15 cm long, 3 microm diameter C(18) silica particle-packed column. The higher response suggests that the influence of ionization suppression was drastically reduced by the high-efficiency separation on the long monolithic silica column coupled with the shallow gradient. Because this high-resolution system does not require any additional separation prior to LC-MS/MS, this "one-shot" proteomics approach can simplify the workflow of shotgun proteomics and minimize the sample amount, as well as reduce the total analysis time, despite the use of prolonged shallow gradient elution.
Collapse
|
22
|
Taoka M, Yamauchi Y, Nobe Y, Masaki S, Nakayama H, Ishikawa H, Takahashi N, Isobe T. An analytical platform for mass spectrometry-based identification and chemical analysis of RNA in ribonucleoprotein complexes. Nucleic Acids Res 2010; 37:e140. [PMID: 19740761 PMCID: PMC2790879 DOI: 10.1093/nar/gkp732] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
We describe here a mass spectrometry (MS)-based analytical platform of RNA, which combines direct nano-flow reversed-phase liquid chromatography (RPLC) on a spray tip column and a high-resolution LTQ-Orbitrap mass spectrometer. Operating RPLC under a very low flow rate with volatile solvents and MS in the negative mode, we could estimate highly accurate mass values sufficient to predict the nucleotide composition of a approximately 21-nucleotide small interfering RNA, detect post-transcriptional modifications in yeast tRNA, and perform collision-induced dissociation/tandem MS-based structural analysis of nucleolytic fragments of RNA at a sub-femtomole level. Importantly, the method allowed the identification and chemical analysis of small RNAs in ribonucleoprotein (RNP) complex, such as the pre-spliceosomal RNP complex, which was pulled down from cultured cells with a tagged protein cofactor as bait. We have recently developed a unique genome-oriented database search engine, Ariadne, which allows tandem MS-based identification of RNAs in biological samples. Thus, the method presented here has broad potential for automated analysis of RNA; it complements conventional molecular biology-based techniques and is particularly suited for simultaneous analysis of the composition, structure, interaction, and dynamics of RNA and protein components in various cellular RNP complexes.
Collapse
Affiliation(s)
- Masato Taoka
- Department of Chemistry, Graduate School of Sciences and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan
| | | | | | | | | | | | | | | |
Collapse
|
23
|
Williams TA, Codoñer FM, Toft C, Fares MA. Two chaperonin systems in bacterial genomes with distinct ecological roles. Trends Genet 2009; 26:47-51. [PMID: 20036437 DOI: 10.1016/j.tig.2009.11.009] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2009] [Revised: 11/27/2009] [Accepted: 11/30/2009] [Indexed: 10/20/2022]
Abstract
Bacterial chaperonins are essential to cell viability and have a role in endosymbiosis, which leads to increased biological complexity. However, the extent to which chaperonins promote ecological innovation is unknown. We screened 622 bacterial genomes for genes encoding chaperonins, and found archaeal-like chaperonins in bacteria that inhabit archaeal ecological niches. We found that chaperonins encoded in pathogenic bacteria are the most functionally divergent. We identified the molecular basis of the dramatic structural changes in mitochondrial GROEL, a highly derived chaperonin gene. Our analysis suggests that chaperonins are important capacitors of evolutionary and ecological change.
Collapse
Affiliation(s)
- Tom A Williams
- Department of Genetics, University of Dublin, Trinity College, Dublin, Ireland
| | | | | | | |
Collapse
|
24
|
Identification of functional marker proteins in the mammalian growth cone. Proc Natl Acad Sci U S A 2009; 106:17211-6. [PMID: 19805073 DOI: 10.1073/pnas.0904092106] [Citation(s) in RCA: 75] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Identification of proteins in the mammalian growth cone has the potential to advance our understanding of this critical regulator of neuronal growth and formation of neural circuit; however, to date, only one growth cone marker protein, GAP-43, has been reported. Here, we successfully used a proteomic approach to identify 945 proteins present in developing rat forebrain growth cones, including highly abundant, membrane-associated and actin-associated proteins. Almost 100 of the proteins appear to be highly enriched in the growth cone, as determined by quantitative immunostaining, and for 17 proteins, the results of RNAi suggest a role in axon growth. Most of the proteins we identified have not previously been implicated in axon growth and thus their identification presents a significant step forward, providing marker proteins and candidate neuronal growth-associated proteins.
Collapse
|
25
|
The Conflict Between Horizontal Gene Transfer and the Safeguard of Identity: Origin of Meiotic Sexuality. J Mol Evol 2009; 69:470-80. [DOI: 10.1007/s00239-009-9277-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2009] [Accepted: 09/08/2009] [Indexed: 02/05/2023]
|
26
|
Masuda T, Saito N, Tomita M, Ishihama Y. Unbiased quantitation of Escherichia coli membrane proteome using phase transfer surfactants. Mol Cell Proteomics 2009; 8:2770-7. [PMID: 19767571 DOI: 10.1074/mcp.m900240-mcp200] [Citation(s) in RCA: 133] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
We developed a sample preparation protocol for rapid and unbiased analysis of the membrane proteome using an alimentary canal-mimicking system in which proteases are activated in the presence of bile salts. In this rapid and unbiased protocol, immobilized trypsin is used in the presence of deoxycholate and lauroylsarcosine to increase digestion efficiency as well as to increase the solubility of the membrane proteins. Using 22.5 microg of Escherichia coli whole cell lysate, we quantitatively demonstrated that membrane proteins were extracted and digested at the same level as soluble proteins without any solubility-related bias. The recovery of membrane proteins was independent of the number of transmembrane domains per protein. In the analysis of the membrane-enriched fraction from 22.5 microg of E. coli cell lysate, the abundance distribution of the membrane proteins was in agreement with that of the membrane protein-coding genes when this protocol, coupled with strong cation exchange prefractionation prior to nano-LC-MS/MS analysis, was used. Because this protocol allows unbiased sample preparation, protein abundance estimation based on the number of observed peptides per protein was applied to both soluble and membrane proteins simultaneously, and the copy numbers per cell for 1,453 E. coli proteins, including 545 membrane proteins, were successfully obtained. Finally, this protocol was applied to quantitative analysis of guanosine tetra- and pentaphosphate-dependent signaling in E. coli wild-type and relA knock-out strains.
Collapse
Affiliation(s)
- Takeshi Masuda
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata 997-0017, Japan
| | | | | | | |
Collapse
|
27
|
Comparative proteomic analysis on Salmonella Gallinarum and Salmonella Enteritidis exploring proteins that may incorporate host adaptation in poultry. J Proteomics 2009; 72:815-21. [DOI: 10.1016/j.jprot.2009.05.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2009] [Revised: 04/22/2009] [Accepted: 05/18/2009] [Indexed: 11/18/2022]
|
28
|
Baños RC, Vivero A, Aznar S, García J, Pons M, Madrid C, Juárez A. Differential regulation of horizontally acquired and core genome genes by the bacterial modulator H-NS. PLoS Genet 2009; 5:e1000513. [PMID: 19521501 PMCID: PMC2686267 DOI: 10.1371/journal.pgen.1000513] [Citation(s) in RCA: 88] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2009] [Accepted: 05/12/2009] [Indexed: 11/20/2022] Open
Abstract
Horizontal acquisition of DNA by bacteria dramatically increases genetic diversity and hence successful bacterial colonization of several niches, including the human host. A relevant issue is how this newly acquired DNA interacts and integrates in the regulatory networks of the bacterial cell. The global modulator H-NS targets both core genome and HGT genes and silences gene expression in response to external stimuli such as osmolarity and temperature. Here we provide evidence that H-NS discriminates and differentially modulates core and HGT DNA. As an example of this, plasmid R27-encoded H-NS protein has evolved to selectively silence HGT genes and does not interfere with core genome regulation. In turn, differential regulation of both gene lineages by resident chromosomal H-NS requires a helper protein: the Hha protein. Tight silencing of HGT DNA is accomplished by H-NS-Hha complexes. In contrast, core genes are modulated by H-NS homoligomers. Remarkably, the presence of Hha-like proteins is restricted to the Enterobacteriaceae. In addition, conjugative plasmids encoding H-NS variants have hitherto been isolated only from members of the family. Thus, the H-NS system in enteric bacteria presents unique evolutionary features. The capacity to selectively discriminate between core and HGT DNA may help to maintain horizontally transmitted DNA in silent form and may give these bacteria a competitive advantage in adapting to new environments, including host colonization. Acquisition of DNA by horizontal gene transfer (HGT) significantly increases bacterial genetic variability. Relevant issues are the mechanisms that bacterial cells have evolved to efficiently integrate the newly acquired DNA into the host cell regulatory machinery. In Gram negative cells, the nucleoid associated protein H-NS has been shown to bind AT-rich sequences of HGT DNA and silence unwanted expression of these genes. This has led to consider H-NS as a “genome sentinel.” Nevertheless, this proposed role must be compatible with its role modulating core genome genes. Weak expression of recently transferred genes must be coordinated with proper expression levels of housekeeping genes. In this paper, we describe a strategy that enteric bacteria have developed to differentially modulate HGT and core genome genes. Two independent lines of experimental evidence suggest that the H-NS system of enteric bacteria may have evolved to discriminate between core genome and HGT DNA. The plasmid R27-encoded H-NS protein selectively modulates HGT genes. This avoids plasmid-encoded H-NS interfering with modulation of core functions. We also show that, for efficient silencing of HGT genes, resident chromosomal H-NS recruits the Hha protein and forms heteromeric complexes with DNA. In contrast, housekeeping genes are modulated by H-NS alone.
Collapse
Affiliation(s)
- Rosa C. Baños
- Institut de Bioenginyeria de Catalunya (IBEC), Parc Científic de Barcelona, Barcelona, Spain
| | - Aitziber Vivero
- Departament de Microbiologia, Facultat de Biologia, Universitat de Barcelona, Barcelona, Spain
| | - Sonia Aznar
- Departament de Microbiologia, Facultat de Biologia, Universitat de Barcelona, Barcelona, Spain
| | - Jesús García
- Institute for Research in Biomedicine (IRB Barcelona), Parc Científic de Barcelona, Barcelona, Spain
| | - Miquel Pons
- Institute for Research in Biomedicine (IRB Barcelona), Parc Científic de Barcelona, Barcelona, Spain
- Departament de Química Orgànica, Universitat de Barcelona, Barcelona, Spain
| | - Cristina Madrid
- Departament de Microbiologia, Facultat de Biologia, Universitat de Barcelona, Barcelona, Spain
- * E-mail: (CM); (AJ)
| | - Antonio Juárez
- Institut de Bioenginyeria de Catalunya (IBEC), Parc Científic de Barcelona, Barcelona, Spain
- Departament de Microbiologia, Facultat de Biologia, Universitat de Barcelona, Barcelona, Spain
- * E-mail: (CM); (AJ)
| |
Collapse
|
29
|
Abstract
Escherichia coli is not considered naturally competent, yet it has homologues of the genes that most competent bacteria use for DNA uptake and processing. In Haemophilus influenzae and Vibrio cholerae, these genes are regulated by the Sxy and cyclic AMP receptor (CRP) proteins. We used microarrays to find out whether similar regulation occurs in E. coli. Expression of sxy strongly induced 63 transcriptional units, 34 of which required CRP for transcriptional activation and had promoter sites resembling the Sxy- and CRP-dependent CRP-S motif previously characterized in H. influenzae. As previously reported, sxy expression also induced the sigma-H regulon. Flagellar operons were downregulated by sxy expression, although motility remained unaffected. The CRP-S regulon included all of E. coli's known competence gene homologues, so we investigated Sxy's effect on competence-associated phenotypes. A sxy knockout reduced both "natural" plasmid transformation and competitive fitness in long-term culture. In addition, expression of plasmid-borne sxy led to production of type IV pilin, the main subunit of the DNA uptake machinery of most bacteria. Although H. influenzae Sxy only weakly activated the E. coli Sxy regulon, induction was dramatically improved when it was coexpressed with its cognate CRP, suggesting that intimate interactions between Sxy and CRP are required for transcriptional activation at CRP-S sites.
Collapse
|
30
|
Balleza E, López-Bojorquez LN, Martínez-Antonio A, Resendis-Antonio O, Lozada-Chávez I, Balderas-Martínez YI, Encarnación S, Collado-Vides J. Regulation by transcription factors in bacteria: beyond description. FEMS Microbiol Rev 2009; 33:133-51. [PMID: 19076632 PMCID: PMC2704942 DOI: 10.1111/j.1574-6976.2008.00145.x] [Citation(s) in RCA: 138] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Transcription is an essential step in gene expression and its understanding has been one of the major interests in molecular and cellular biology. By precisely tuning gene expression, transcriptional regulation determines the molecular machinery for developmental plasticity, homeostasis and adaptation. In this review, we transmit the main ideas or concepts behind regulation by transcription factors and give just enough examples to sustain these main ideas, thus avoiding a classical ennumeration of facts. We review recent concepts and developments: cis elements and trans regulatory factors, chromosome organization and structure, transcriptional regulatory networks (TRNs) and transcriptomics. We also summarize new important discoveries that will probably affect the direction of research in gene regulation: epigenetics and stochasticity in transcriptional regulation, synthetic circuits and plasticity and evolution of TRNs. Many of the new discoveries in gene regulation are not extensively tested with wetlab approaches. Consequently, we review this broad area in Inference of TRNs and Dynamical Models of TRNs. Finally, we have stepped backwards to trace the origins of these modern concepts, synthesizing their history in a timeline schema.
Collapse
Affiliation(s)
- Enrique Balleza
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
| | | | | | | | | | | | | | | |
Collapse
|
31
|
Díaz-Mejía JJ, Babu M, Emili A. Computational and experimental approaches to chart the Escherichia coli cell-envelope-associated proteome and interactome. FEMS Microbiol Rev 2008; 33:66-97. [PMID: 19054114 PMCID: PMC2704936 DOI: 10.1111/j.1574-6976.2008.00141.x] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
The bacterial cell-envelope consists of a complex arrangement of lipids, proteins and carbohydrates that serves as the interface between a microorganism and its environment or, with pathogens, a human host. Escherichia coli has long been investigated as a leading model system to elucidate the fundamental mechanisms underlying microbial cell-envelope biology. This includes extensive descriptions of the molecular identities, biochemical activities and evolutionary trajectories of integral transmembrane proteins, many of which play critical roles in infectious disease and antibiotic resistance. Strikingly, however, only half of the c. 1200 putative cell-envelope-related proteins of E. coli currently have experimentally attributed functions, indicating an opportunity for discovery. In this review, we summarize the state of the art of computational and proteomic approaches for determining the components of the E. coli cell-envelope proteome, as well as exploring the physical and functional interactions that underlie its biogenesis and functionality. We also provide a comprehensive comparative benchmarking analysis on the performance of different bioinformatic and proteomic methods commonly used to determine the subcellular localization of bacterial proteins.
Collapse
Affiliation(s)
- Juan Javier Díaz-Mejía
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada
| | | | | |
Collapse
|
32
|
Abstract
Bacteria experience a continual influx of novel genetic material from a wide range of sources and yet their genomes remain relatively small. This aspect of bacterial evolution indicates that most newly arriving sequences are rapidly eliminated; however, numerous new genes persist, as evident from the presence of unique genes in almost all bacterial genomes. This review summarizes the methods for identifying new genes in bacterial genomes and examines the features that promote the retention and elimination of these evolutionary novelties.
Collapse
Affiliation(s)
- Chih-Horng Kuo
- Department of Ecology & Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
| | | |
Collapse
|
33
|
Jungblut PR, Holzhütter HG, Apweiler R, Schlüter H. The speciation of the proteome. Chem Cent J 2008; 2:16. [PMID: 18638390 PMCID: PMC2492845 DOI: 10.1186/1752-153x-2-16] [Citation(s) in RCA: 188] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2008] [Accepted: 07/18/2008] [Indexed: 11/13/2022] Open
Abstract
INTRODUCTION In proteomics a paradox situation developed in the last years. At one side it is basic knowledge that proteins are post-translationally modified and occur in different isoforms. At the other side the protein expression concept disclaims post-translational modifications by connecting protein names directly with function. DISCUSSION Optimal proteome coverage is today reached by bottom-up liquid chromatography/mass spectrometry. But quantification at the peptide level in shotgun or bottom-up approaches by liquid chromatography and mass spectrometry is completely ignoring that a special peptide may exist in an unmodified form and in several-fold modified forms. The acceptance of the protein species concept is a basic prerequisite for meaningful quantitative analyses in functional proteomics. In discovery approaches only top-down analyses, separating the protein species before digestion, identification and quantification by two-dimensional gel electrophoresis or protein liquid chromatography, allow the correlation between changes of a biological situation and function. CONCLUSION To obtain biological relevant information kinetics and systems biology have to be performed at the protein species level, which is the major challenge in proteomics today.
Collapse
Affiliation(s)
- Peter R Jungblut
- Max Planck Institute for Infection Biology, Core Facility Protein Analysis, Berlin, Germany
| | | | - Rolf Apweiler
- European Bioinformatics Institute, Cambridge CB10 1SD, UK
| | | |
Collapse
|
34
|
Grasselli E, François P, Gutacker M, Gettler B, Benagli C, Convert M, Boerlin P, Schrenzel J, Piffaretti JC. Evidence of horizontal gene transfer between human and animal commensal Escherichia coli strains identified by microarray. ACTA ACUST UNITED AC 2008; 53:351-8. [PMID: 18557937 DOI: 10.1111/j.1574-695x.2008.00434.x] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Bacteria exchange genetic material by horizontal gene transfer (HGT). To evaluate the impact of HGT on Escherichia coli genome plasticity, 19 commensal strains collected from the intestinal floras of humans and animals were analyzed by microarrays. Strains were hybridized against an oligoarray containing 2700 E. coli K12 chromosomal genes. A core (genes shared among compared genomes) and a flexible gene pool (genes unique for each genome) have been identified. Analysis of hybridization signals evidenced 1015 divergent genes among the 19 strains and each strain showed a specific genomic variability pattern. Four hundred and fifty-eight genes were characterized by higher rates of interstrain variation and were considered hyperdivergent. These genes are not randomly distributed onto the chromosome but are clustered in precise regions. Hyperdivergent genes belong to the flexible gene pool and show a specific GC content, differing from that of the chromosome, indicating acquisition by HGT. Among these genes, those involved in defense mechanisms and cell motility as well as intracellular trafficking and secretion were far more represented than others. The observed genome plasticity contributes to the maintenance of genetic diversity and may therefore be a source of evolutionary adaptation and survival.
Collapse
Affiliation(s)
- Elena Grasselli
- Istituto Cantonale di Microbiologia, via Mirasole, Bellinzona, Switzerland
| | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Kamiie J. [Progress of drug transport study based on absolute quantitative method for membrane transporter proteins]. YAKUGAKU ZASSHI 2008; 128:507-12. [PMID: 18379169 DOI: 10.1248/yakushi.128.507] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Knowing the amount of membrane transporter expression in human tissue is one of the key issues in the rational and reliable prediction of pharmacokinetic profiles in humans. Recently, we have developed a simultaneous and highly sensitive method for the absolute quantification of multiple membrane transporter proteins in mammalian tissues. To develop quantitative analysis of high molecular-weight membrane proteins, we have solved problems using proteomics technology as follows: 1) The target proteins are detected via tryptic peptides that can be dissolved and analyzed with LC-MS/MS, while membrane protein is difficult to dissolve. 2) LC-MS/MS in multiple reaction monitoring (MRM) mode produce a highly sensitive and selective response for transporter proteins with low expression by separation from highly abundant molecules. 3) Analyte specificity for each peptide was demonstrated in amino acid sequences using multiple MRM detection. Selection of the peptide probe was very important for highly sensitive analysis with LC-MS/MS. We set criteria for peptide selection using an informatics approach. Peptides without unstable residue, double basic residues, and integral membrane domain were selected as useful probes. The developed method will provide an inclusive assay platform for all transporter proteins expressed in both animal/human tissues and will contribute to progress in drug discovery and development.
Collapse
Affiliation(s)
- Junichi Kamiie
- Graduate School of Pharmaceutical Sciences, Tohoku University, SORST.JST, Aoba, Aramaki, Aoba-ku, Sendai, Japan.
| |
Collapse
|
36
|
Ishihama Y, Schmidt T, Rappsilber J, Mann M, Hartl FU, Kerner MJ, Frishman D. Protein abundance profiling of the Escherichia coli cytosol. BMC Genomics 2008; 9:102. [PMID: 18304323 PMCID: PMC2292177 DOI: 10.1186/1471-2164-9-102] [Citation(s) in RCA: 371] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2008] [Accepted: 02/27/2008] [Indexed: 11/10/2022] Open
Abstract
Background Knowledge about the abundance of molecular components is an important prerequisite for building quantitative predictive models of cellular behavior. Proteins are central components of these models, since they carry out most of the fundamental processes in the cell. Thus far, protein concentrations have been difficult to measure on a large scale, but proteomic technologies have now advanced to a stage where this information becomes readily accessible. Results Here, we describe an experimental scheme to maximize the coverage of proteins identified by mass spectrometry of a complex biological sample. Using a combination of LC-MS/MS approaches with protein and peptide fractionation steps we identified 1103 proteins from the cytosolic fraction of the Escherichia coli strain MC4100. A measure of abundance is presented for each of the identified proteins, based on the recently developed emPAI approach which takes into account the number of sequenced peptides per protein. The values of abundance are within a broad range and accurately reflect independently measured copy numbers per cell. As expected, the most abundant proteins were those involved in protein synthesis, most notably ribosomal proteins. Proteins involved in energy metabolism as well as those with binding function were also found in high copy number while proteins annotated with the terms metabolism, transcription, transport, and cellular organization were rare. The barrel-sandwich fold was found to be the structural fold with the highest abundance. Highly abundant proteins are predicted to be less prone to aggregation based on their length, pI values, and occurrence patterns of hydrophobic stretches. We also find that abundant proteins tend to be predominantly essential. Additionally we observe a significant correlation between protein and mRNA abundance in E. coli cells. Conclusion Abundance measurements for more than 1000 E. coli proteins presented in this work represent the most complete study of protein abundance in a bacterial cell so far. We show significant associations between the abundance of a protein and its properties and functions in the cell. In this way, we provide both data and novel insights into the role of protein concentration in this model organism.
Collapse
Affiliation(s)
- Yasushi Ishihama
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata 997-0017, Japan.
| | | | | | | | | | | | | |
Collapse
|
37
|
de Groot MJL, Daran-Lapujade P, van Breukelen B, Knijnenburg TA, de Hulster EAF, Reinders MJT, Pronk JT, Heck AJR, Slijper M. Quantitative proteomics and transcriptomics of anaerobic and aerobic yeast cultures reveals post-transcriptional regulation of key cellular processes. MICROBIOLOGY-SGM 2008; 153:3864-3878. [PMID: 17975095 DOI: 10.1099/mic.0.2007/009969-0] [Citation(s) in RCA: 90] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Saccharomyces cerevisiae is unique among yeasts in its ability to grow rapidly in the complete absence of oxygen. S. cerevisiae is therefore an ideal eukaryotic model to study physiological adaptation to anaerobiosis. Recent transcriptome analyses have identified hundreds of genes that are transcriptionally regulated by oxygen availability but the relevance of this cellular response has not been systematically investigated at the key control level of the proteome. Therefore, the proteomic response of S. cerevisiae to anaerobiosis was investigated using metabolic stable-isotope labelling in aerobic and anaerobic glucose-limited chemostat cultures, followed by relative quantification of protein expression. Using independent replicate cultures and stringent statistical filtering, a robust dataset of 474 quantified proteins was generated, of which 249 showed differential expression levels. While some of these changes were consistent with previous transcriptome studies, many of the responses of S. cerevisiae to oxygen availability were, to our knowledge, previously unreported. Comparison of transcriptomes and proteomes from identical cultivations yielded strong evidence for post-transcriptional regulation of key cellular processes, including glycolysis, amino-acyl-tRNA synthesis, purine nucleotide synthesis and amino acid biosynthesis. The use of chemostat cultures provided well-controlled and reproducible culture conditions, which are essential for generating robust datasets at different cellular information levels. Integration of transcriptome and proteome data led to new insights into the physiology of anaerobically growing yeast that would not have been apparent from differential analyses at either the mRNA or protein level alone, thus illustrating the power of multi-level studies in yeast systems biology.
Collapse
Affiliation(s)
- Marco J L de Groot
- Netherlands Proteomics Centre, Utrecht, The Netherlands
- Kluyver Centre for Genomics of Industrial Fermentation, Delft, The Netherlands
- Department of Biomolecular Mass Spectrometry, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Sorbonnelaan 16, 3584 CA Utrecht, The Netherlands
| | - Pascale Daran-Lapujade
- Kluyver Centre for Genomics of Industrial Fermentation, Delft, The Netherlands
- Department of Biotechnology, Delft University of Technology, Julianalaan 67, 2628 BC Delft, The Netherlands
| | - Bas van Breukelen
- Netherlands Proteomics Centre, Utrecht, The Netherlands
- Department of Biomolecular Mass Spectrometry, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Sorbonnelaan 16, 3584 CA Utrecht, The Netherlands
| | - Theo A Knijnenburg
- Kluyver Centre for Genomics of Industrial Fermentation, Delft, The Netherlands
- Information and Communication Theory Group, Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, Mekelweg 4, 2628 CD Delft, The Netherlands
| | - Erik A F de Hulster
- Kluyver Centre for Genomics of Industrial Fermentation, Delft, The Netherlands
- Department of Biotechnology, Delft University of Technology, Julianalaan 67, 2628 BC Delft, The Netherlands
| | - Marcel J T Reinders
- Kluyver Centre for Genomics of Industrial Fermentation, Delft, The Netherlands
- Information and Communication Theory Group, Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, Mekelweg 4, 2628 CD Delft, The Netherlands
| | - Jack T Pronk
- Kluyver Centre for Genomics of Industrial Fermentation, Delft, The Netherlands
- Department of Biotechnology, Delft University of Technology, Julianalaan 67, 2628 BC Delft, The Netherlands
| | - Albert J R Heck
- Netherlands Proteomics Centre, Utrecht, The Netherlands
- Department of Biomolecular Mass Spectrometry, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Sorbonnelaan 16, 3584 CA Utrecht, The Netherlands
| | - Monique Slijper
- Netherlands Proteomics Centre, Utrecht, The Netherlands
- Department of Biomolecular Mass Spectrometry, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Sorbonnelaan 16, 3584 CA Utrecht, The Netherlands
| |
Collapse
|
38
|
Kaji H, Kamiie JI, Kawakami H, Kido K, Yamauchi Y, Shinkawa T, Taoka M, Takahashi N, Isobe T. Proteomics Reveals N-Linked Glycoprotein Diversity in Caenorhabditis elegans and Suggests an Atypical Translocation Mechanism for Integral Membrane Proteins. Mol Cell Proteomics 2007; 6:2100-9. [PMID: 17761667 DOI: 10.1074/mcp.m600392-mcp200] [Citation(s) in RCA: 86] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Protein glycosylation is one of the most common post-translational modifications in eukaryotes and affects various aspects of protein structure and function. To facilitate studies of protein glycosylation, we paired glycosylation site-specific stable isotope tagging of lectin affinity-captured N-linked glycopeptides with mass spectrometry and determined 1,465 N-glycosylated sites on 829 proteins expressed in Caenorhabditis elegans. The analysis shows the diversity of protein glycosylation in eukaryotes in terms of glycosylation sites and oligosaccharide structures attached to polypeptide chains and suggests the substrate specificity of oligosaccharyltransferase, a single multienzyme complex in C. elegans that incorporates an oligosaccharide moiety en bloc to newly synthesized polypeptides. In addition, topological analysis of 257 N-glycosylated proteins containing a putative single transmembrane segment that were identified based on the relative positions of glycosylation sites and transmembrane segments suggests that an atypical non-cotranslational mechanism translocates large N-terminal segments from the cytosol to the endoplasmic reticulum lumen in the absence of signal sequence function.
Collapse
Affiliation(s)
- Hiroyuki Kaji
- Department of Chemistry, Graduate School of Science and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji, Tokyo 192-0397, Japan.
| | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Shapira R, Mimran E. Isolation and characterization of Escherichia coli mutants exhibiting altered response to thymol. Microb Drug Resist 2007; 13:157-65. [PMID: 17949301 DOI: 10.1089/mdr.2007.731] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The antimicrobial mode of action of the plant essential oil thymol was studied with Escherichia coli. Random transposon-insertion mutants were screened for altered response to thymol. Of four mutants showing more sensitivity, three were found in rfaQ, whose product is involved in lipopolysaccharide biosynthesis, and the fourth in the quorum-sensing gene qseC. Mutants showing more resistance had mutations in genes whose products are involved in the degradation of short-lived regulatory and abnormal proteins (the lon gene), menaquinone biosynthesis (menA), an unknown function of a putative membrane protein (yagF), synthesis of a small hypothetical protein (an intergene region between the two small genes yiiE and yiiF), and the efflux pump of cadaverine and lysine (cadB). The antibacterial activities of carvacrol, menthol, and cymene, essential oils structurally similar to thymol, were also determined. Although the level of resistance toward thymol was conserved in the respective mutants qseC, menA, and cadB, knockout mutants displayed different levels of tolerance to carvacrol; inconsistencies in resistance levels were also noted in mutants challenged with menthol. Wild-type and mutant E. coli responded to thymol exposure with a massive potassium efflux that generally corresponded to the resistant rate. The verity of the loci accounting for E. coli response suggests a multitarget mode of the antimicrobial activity of thymol and multitolerance mechanisms.
Collapse
Affiliation(s)
- Roni Shapira
- Institute of Biochemistry, Food Science and Nutrition, Faculty of Agricultural, Food and Environmental Quality Sciences, The Hebrew University, Rehovot 76100, Israel.
| | | |
Collapse
|
40
|
Maillet I, Berndt P, Malo C, Rodriguez S, Brunisholz RA, Pragai Z, Arnold S, Langen H, Wyss M. From the genome sequence to the proteome and back: evaluation of E. coli genome annotation with a 2-D gel-based proteomics approach. Proteomics 2007; 7:1097-106. [PMID: 17366475 DOI: 10.1002/pmic.200600599] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
The ambition of systems biology to understand complex biological systems at the molecular level implies that we need to have a concrete and correct understanding of each molecular entity and its function. However, even for the best-studied organism, Escherichia coli, a large number of proteins have never been identified and characterised from wild-type cells, and/or await unravelling of their biological role. Instead, the ORF models for these proteins have been predicted by suitable algorithms and/or through comparison with known, homologous proteins from other organisms, approaches which may be prone to error. In the present study, we used a combination of 2-DE, MALDI-TOF-MS and PMF to identify 1151 different proteins in E. coli K12 JM109. Comparison of the experimental with the theoretical Mr and pI values (4000 experimental values each) allowed the identification of numerous proteins with incorrect or incomplete ORF annotations in the current E. coli genome databases. Several inconsistencies in genome annotation were verified experimentally, and up to 55 candidates await further investigation. Our findings demonstrate how an up-to-date 2-D gel-based proteomics approach can be used for improving the annotation of prokaryotic genomes. They also highlight the need for harmonization among the different E. coli genome databases.
Collapse
|
41
|
Graham RLJ, Graham C, McMullan G. Microbial proteomics: a mass spectrometry primer for biologists. Microb Cell Fact 2007; 6:26. [PMID: 17697372 PMCID: PMC1971468 DOI: 10.1186/1475-2859-6-26] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2007] [Accepted: 08/15/2007] [Indexed: 11/29/2022] Open
Abstract
It is now more than 10 years since the publication of the first microbial genome sequence and science is now moving towards a post genomic era with transcriptomics and proteomics offering insights into cellular processes and function. The ability to assess the entire protein network of a cell at a given spatial or temporal point will have a profound effect upon microbial science as the function of proteins is inextricably linked to phenotype. Whilst such a situation is still beyond current technologies rapid advances in mass spectrometry, bioinformatics and protein separation technologies have produced a step change in our current proteomic capabilities. Subsequently a small, but steadily growing, number of groups are taking advantage of this cutting edge technology to discover more about the physiology and metabolism of microorganisms. From this research it will be possible to move towards a systems biology understanding of a microorganism. Where upon researchers can build a comprehensive cellular map for each microorganism that links an accurately annotated genome sequence to gene expression data, at a transcriptomic and proteomic level.In order for microbiologists to embrace the potential that proteomics offers, an understanding of a variety of analytical tools is required. The aim of this review is to provide a basic overview of mass spectrometry (MS) and its application to protein identification. In addition we will describe how the protein complexity of microbial samples can be reduced by gel-based and gel-free methodologies prior to analysis by MS. Finally in order to illustrate the power of microbial proteomics a case study of its current application within the Bacilliaceae is given together with a description of the emerging discipline of metaproteomics.
Collapse
Affiliation(s)
- Robert LJ Graham
- School of Biomedical Sciences, University of Ulster, Coleraine, County Londonderry, BT52 1SA, UK
| | - Ciaren Graham
- School of Biomedical Sciences, University of Ulster, Coleraine, County Londonderry, BT52 1SA, UK
| | - Geoff McMullan
- School of Biomedical Sciences, University of Ulster, Coleraine, County Londonderry, BT52 1SA, UK
| |
Collapse
|
42
|
Fournier ML, Gilmore JM, Martin-Brown SA, Washburn MP. Multidimensional Separations-Based Shotgun Proteomics. Chem Rev 2007; 107:3654-86. [PMID: 17649983 DOI: 10.1021/cr068279a] [Citation(s) in RCA: 143] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
43
|
Wolff S, Antelmann H, Albrecht D, Becher D, Bernhardt J, Bron S, Büttner K, van Dijl JM, Eymann C, Otto A, Tam LT, Hecker M. Towards the entire proteome of the model bacterium Bacillus subtilis by gel-based and gel-free approaches. J Chromatogr B Analyt Technol Biomed Life Sci 2007; 849:129-40. [PMID: 17055787 DOI: 10.1016/j.jchromb.2006.09.029] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2006] [Revised: 08/25/2006] [Accepted: 09/08/2006] [Indexed: 11/29/2022]
Abstract
With the emergence of mass spectrometry in protein science and the availability of complete genome sequences, proteomics has gone through a rapid development. The soil bacterium Bacillus subtilis, as one of the first DNA sequenced species, represents a model for Gram-positive bacteria and its proteome was extensively studied throughout the years. Having the final goal to elucidate how life really functions, one basic requirement is to know the entirety of cellular proteins. This review presents how far we have got in unraveling the proteome of B. subtilis. The application of gel-based and gel-free technologies, the analyses of different subcellular proteome fractions, and the pursuance of various physiological strategies resulted in a coverage of more than one-third of B. subtilis theoretical proteome.
Collapse
Affiliation(s)
- Susanne Wolff
- Institut für Mikrobiologie, Ernst-Moritz-Arndt-Universität Greifswald, Friedrich-Ludwig-Jahn-Str. 15, D-17487 Greifswald, Germany
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Wilson GA, Feil EJ, Lilley AK, Field D. Large-scale comparative genomic ranking of taxonomically restricted genes (TRGs) in bacterial and archaeal genomes. PLoS One 2007; 2:e324. [PMID: 17389915 PMCID: PMC1824705 DOI: 10.1371/journal.pone.0000324] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2007] [Accepted: 02/18/2007] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Lineage-specific, or taxonomically restricted genes (TRGs), especially those that are species and strain-specific, are of special interest because they are expected to play a role in defining exclusive ecological adaptations to particular niches. Despite this, they are relatively poorly studied and little understood, in large part because many are still orphans or only have homologues in very closely related isolates. This lack of homology confounds attempts to establish the likelihood that a hypothetical gene is expressed and, if so, to determine the putative function of the protein. METHODOLOGY/PRINCIPAL FINDINGS We have developed "QIPP" ("Quality Index for Predicted Proteins"), an index that scores the "quality" of a protein based on non-homology-based criteria. QIPP can be used to assign a value between zero and one to any protein based on comparing its features to other proteins in a given genome. We have used QIPP to rank the predicted proteins in the proteomes of Bacteria and Archaea. This ranking reveals that there is a large amount of variation in QIPP scores, and identifies many high-scoring orphans as potentially "authentic" (expressed) orphans. There are significant differences in the distributions of QIPP scores between orphan and non-orphan genes for many genomes and a trend for less well-conserved genes to have lower QIPP scores. CONCLUSIONS The implication of this work is that QIPP scores can be used to further annotate predicted proteins with information that is independent of homology. Such information can be used to prioritize candidates for further analysis. Data generated for this study can be found in the OrphanMine at http://www.genomics.ceh.ac.uk/orphan_mine.
Collapse
Affiliation(s)
- Gareth A Wilson
- Centre for Ecology and Hydrology (CEH) Oxford, Oxford, United Kindgom.
| | | | | | | |
Collapse
|
45
|
Dworzanski JP, Snyder AP. Classification and identification of bacteria using mass spectrometry-based proteomics. Expert Rev Proteomics 2007; 2:863-78. [PMID: 16307516 DOI: 10.1586/14789450.2.6.863] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Timely classification and identification of bacteria is of vital importance in many areas of public health. Mass spectrometry-based methods provide an attractive alternative to well-established microbiologic procedures. Mass spectrometry methods can be characterized by the relatively high speed of acquiring taxonomically relevant information. Gel-free mass spectrometry proteomics techniques allow for rapid fingerprinting of bacterial proteins using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry or, for high-throughput sequencing of peptides from protease-digested cellular proteins, using mass analysis of fragments from collision-induced dissociation of peptide ions. The latter technique uses database searching of product ion mass spectra. A database contains a comprehensive list of protein sequences translated from protein-encoding open reading frames found in bacterial genomes. The results of such searches allow the assignment of experimental peptide sequences to matching theoretical bacterial proteomes. Phylogenetic profiles of sequenced peptides are then used to create a matrix of sequence-to-bacterium assignments, which are analyzed using numerical taxonomy tools. The results thereof reveal the relatedness between bacteria, and allow the taxonomic position of an investigated strain to be inferred.
Collapse
Affiliation(s)
- Jacek P Dworzanski
- Science Applications International Corporation (SAIC), PO Box 68, Aberdeen Proving Ground, MD 21010-0068, USA.
| | | |
Collapse
|
46
|
Stegmeier JF, Glück A, Sukumaran S, Mäntele W, Andersen C. Characterisation of YtfM, a second member of the Omp85 family in Escherichia coli. Biol Chem 2007; 388:37-46. [PMID: 17214547 DOI: 10.1515/bc.2007.004] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Omp85 proteins form a ubiquitous protein family, members of which are found in all Gram-negative bacteria. Omp85 of Neisseria meningitidis and YaeT of Escherichia coli are shown to be essential for outer membrane biogenesis. Interestingly, there exists a homologue to YaeT in E. coli and many proteobacteria, denoted YtfM, the function of which has not been described yet. Like YaeT, YtfM is predicted to consist of an amino-terminal periplasmic domain and a membrane-located carboxy-terminal domain. In this study, we present a first characterisation of YtfM by comparison to YaeT concerning structural, biochemical and electrophysiological properties. Furthermore, a knockout strain revealed that ytfM is a non-essential gene and lack of the protein had no effect on outer membrane composition and integrity. The only observable phenotype was strongly reduced growth, indicating an important role of YtfM in vivo.
Collapse
Affiliation(s)
- Johannes F Stegmeier
- Lehrstuhl für Biotechnologie, Biozentrum der Universität Würzburg, Am Hubland, D-97074 Würzburg, Germany
| | | | | | | | | |
Collapse
|
47
|
Oyama M, Kozuka-Hata H, Suzuki Y, Semba K, Yamamoto T, Sugano S. Diversity of translation start sites may define increased complexity of the human short ORFeome. Mol Cell Proteomics 2007; 6:1000-6. [PMID: 17317662 DOI: 10.1074/mcp.m600297-mcp200] [Citation(s) in RCA: 81] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Our previous proteomics analysis of small proteins expressed in human K562 cells provided the first direct evidence of translation of upstream ORFs in human full-length cDNAs (Oyama, M., Itagaki, C., Hata, H., Suzuki, Y., Izumi, T., Natsume, T., Isobe, T., and Sugano, S. (2004) Analysis of small human proteins reveals the translation of upstream open reading frames of mRNAs. Genome Res. 14, 2048-2052). In the present study, we performed an in-depth proteomics analysis of human K562 and HEK293 cells using a two-dimensional nano-liquid chromatography-tandem mass spectrometry system. The results led to the identification of eight protein-coding regions besides 197 small proteins with a theoretical mass less than 20 kDa that were already annotated coding sequences in the curated mRNA database. In addition to the upstream ORFs in the presumed 5'-untranslated regions of mRNAs, bioinformatics analysis based on accumulated 5'-end cDNA sequence data provided evidence of novel short coding regions that were likely to be translated from the upstream non-AUG start site or from the new short transcript variants generated by utilization of downstream alternative promoters. Protein expression analysis of the GRINL1A gene revealed that translation from the most upstream start site occurred on the minor alternative splicing transcript, whereas this initiation site was not utilized on the major mRNA, resulting in translation of the downstream ORF from the second initiation codon. These findings reveal a novel post-transcriptional system that can augment the human proteome via the alternative use of diverse translation start sites coupled with transcriptional regulation through alternative promoters or splicing, leading to increased complexity of short protein-coding regions defined by the human transcriptome.
Collapse
Affiliation(s)
- Masaaki Oyama
- Medical Proteomics Laboratory, University of Tokyo, Japan.
| | | | | | | | | | | |
Collapse
|
48
|
Abstract
Background Bacterial genomes develop new mechanisms to tide them over the imposing conditions they encounter during the course of their evolution. Acquisition of new genes by lateral gene transfer may be one of the dominant ways of adaptation in bacterial genome evolution. Lateral gene transfer provides the bacterial genome with a new set of genes that help it to explore and adapt to new ecological niches. Methods A maximum likelihood analysis was done on the five sequenced corynebacterial genomes to model the rates of gene insertions/deletions at various depths of the phylogeny. Results The study shows that most of the laterally acquired genes are transient and the inferred rates of gene movement are higher on the external branches of the phylogeny and decrease as the phylogenetic depth increases. The newly acquired genes are under relaxed selection and evolve faster than their older counterparts. Analysis of some of the functionally characterised LGTs in each species has indicated that they may have a possible adaptive role. Conclusion The five Corynebacterial genomes sequenced to date have evolved by acquiring between 8 – 14% of their genomes by LGT and some of these genes may have a role in adaptation.
Collapse
Affiliation(s)
- Pradeep Reddy Marri
- Department of Biology, McMaster University, Hamilton, Ontario L8S 4K1, Canada
| | - Weilong Hao
- Department of Biology, McMaster University, Hamilton, Ontario L8S 4K1, Canada
| | - G Brian Golding
- Department of Biology, McMaster University, Hamilton, Ontario L8S 4K1, Canada
| |
Collapse
|
49
|
Kaji H, Yamauchi Y, Takahashi N, Isobe T. Mass spectrometric identification of N-linked glycopeptides using lectin-mediated affinity capture and glycosylation site–specific stable isotope tagging. Nat Protoc 2007; 1:3019-27. [PMID: 17406563 DOI: 10.1038/nprot.2006.444] [Citation(s) in RCA: 125] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Protein post-translational modifications (PTMs), such as glycosylation and phosphorylation, are crucial for various signaling and regulatory events, and are therefore an important objective of proteomics research. We describe here a protocol for isotope-coded glycosylation site-specific tagging (IGOT), a method for the large-scale identification of N-linked glycoproteins from complex biological samples. The steps of this approach are: (1) lectin column-mediated affinity capture of glycopeptides generated by protease digestion of protein mixtures; (2) purification of the enriched glycopeptides by hydrophilic interaction chromatography (HIC); (3) peptide-N-glycanase-mediated incorporation of a stable isotope tag, 18O18O, specifically at the N-glycosylation site; and (4) identification of 18O-tagged peptides by liquid chromatography-coupled mass spectrometry (LC/MS)-based proteomics technology. The application of this protocol to the characterization of N-linked glycoproteins from crude extracts of the nematode Caenorhabditis elegans or mouse liver provides a list of hundreds to a thousand glycoproteins and their sites of glycosylation within a week.
Collapse
Affiliation(s)
- Hiroyuki Kaji
- Department of Chemistry, Graduate School of Science and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji, Tokyo 192-0397, Japan
| | | | | | | |
Collapse
|
50
|
Suen G, Arshinoff BI, Taylor RG, Welch RD. Practical Applications of Bacterial Functional Genomics. Biotechnol Genet Eng Rev 2007; 24:213-42. [DOI: 10.1080/02648725.2007.10648101] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|