1
Armengaud J, Cardon T, Cristobal S, Matallana-Surget S, Bertile F. Novel model organisms and proteomics for a better biological understanding. J Proteomics 2025; 316:105441. [PMID: 40216077 DOI: 10.1016/j.jprot.2025.105441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/22/2024] [Revised: 01/26/2025] [Accepted: 04/08/2025] [Indexed: 04/17/2025]
Abstract
The concept of « model organisms » is being revisited in the light of the latest advances in multi-omics technologies that can now capture the full range of molecular events that occur over time, regardless of the organism studied. Classic, well-studied models, such as Escherichia coli, Saccharomyces cerevisiae, to name a few, have long been valuable for hypothesis testing, reproducibility, and sharing common platforms among researchers. However, they are not suitable for all types of research. The complexity of unanswered questions in biology demands more elaborated systems, particularly to study plant and animal biodiversity, microbial ecosystems and their interactions with their hosts if any. More integrated systems, known as « holobionts », are emerging to describe and unify host organisms and associated microorganisms, providing an overview of all their possible interactions and trajectories. Comparative evolutionary proteomics offers interesting prospects for extrapolating knowledge from a few selected model organisms to others. This approach enables a deeper characterization of the diversity of proteins and proteoforms across the three branches of the tree of life, i.e. Bacteria, Archaea, and Eukarya. It also provides a powerful means to address remaining biological questions, such as identifying the key molecular players in organisms when they are confronted to environmental challenges, like anthropogenic toxicants, pathogens, dietary shifts or climate stressors, and proposing long-term sustainable solutions. SIGNIFICANCE: In this commentary, we reevaluated the concept of "model organisms" in light of advancements in multi-omics technologies. Traditional models have proven invaluable for hypothesis testing, reproducibility, and fostering shared research frameworks. However, we discussed that they are not universally applicable. 
To address complexities such as biodiversity and understand microbial ecosystems and their host interactions, integrated systems like "holobionts," which encompass host organisms and their associated microbes, are gaining prominence. Comparative evolutionary proteomics further enhances our understanding by enabling detailed exploration of protein diversity across organisms. This approach also facilitates the identification of critical molecular players in organisms facing environmental challenges, such as pollutants, pathogens, dietary changes, or climate stress, and contributes to developing sustainable long-term solutions.
Affiliation(s)
- Jean Armengaud
  - Université Paris-Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SPI, 30200 Bagnols-sur-Cèze, France
- Tristan Cardon
  - Univ. Lille, Inserm, CHU Lille, U1192 - Protéomique Réponse Inflammatoire Spectrométrie de Masse - PRISM, F-59000 Lille, France
- Susana Cristobal
  - Department of Biomedical and Clinical Sciences, Cell Biology, Faculty of Medicine, Linköping University, Linköping 581 85, Sweden
  - Ikerbasque, Basque Foundation for Sciences, Department of Physiology, Faculty of Medicine, and Nursing, University of the Basque Country UPV/EHU, Leioa 489 40, Spain
- Sabine Matallana-Surget
  - Division of Biological and Environmental Sciences, Faculty of Natural Sciences, University of Stirling, Stirling, Scotland, FK9 4LA, United Kingdom
- Fabrice Bertile
  - University of Strasbourg, CNRS, Institut Pluridisciplinaire Hubert Curien, UMR 7178, Laboratoire de Spectrométrie de Masse BioOrganique, Strasbourg 67000, France
2
Yamauchi T, Kikuchi M, Iizuka Y, Tsunoda M. X-ray crystal structure of proliferating cell nuclear antigen 1 from Aeropyrum pernix. Acta Crystallogr F Struct Biol Commun 2024; 80:294-301. [PMID: 39382846 PMCID: PMC11533367 DOI: 10.1107/s2053230x24009518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/09/2024] [Accepted: 09/26/2024] [Indexed: 10/10/2024] [Open Access]
Abstract
Proliferating cell nuclear antigen (PCNA) plays a critical role in DNA replication by enhancing the activity of various proteins involved in replication. In this study, the crystal structure of ApePCNA1, one of three PCNAs from the thermophilic archaeon Aeropyrum pernix, was elucidated. ApePCNA1 was cloned and expressed in Escherichia coli and the protein was purified and crystallized. The resulting crystal structure determined at 2.00 Å resolution revealed that ApePCNA1 does not form a trimeric ring, unlike PCNAs from other domains of life. It has unique structural features, including a long interdomain-connecting loop and a PIP-box-like sequence at the N-terminus, indicating potential interactions with other proteins. These findings provide insights into the functional mechanisms of PCNAs in archaea and their evolutionary conservation across different domains of life. A modified medium and protocol were used to express recombinant protein containing the lac operon. The expression of the target protein increased and the total incubation time decreased when using this system compared with those of previous expression protocols.
Affiliation(s)
- Takahiro Yamauchi
  - Graduate School of Life Science and Technology, Iryo Sosei University, Iwaki, Fukushima, Japan
  - Department of Pharmacy, Fukushima Rosai Hospital, Iwaki, Fukushima, Japan
- Makiko Kikuchi
  - Graduate School of Science and Engineering, Iryo Sosei University, Iwaki, Fukushima, Japan
  - Faculty of Pharmacy, Iryo Sosei University, Iwaki, Fukushima, Japan
- Yasuhito Iizuka
  - Graduate School of Life Science and Technology, Iryo Sosei University, Iwaki, Fukushima, Japan
  - Faculty of Pharmacy, Iryo Sosei University, Iwaki, Fukushima, Japan
- Masaru Tsunoda
  - Graduate School of Life Science and Technology, Iryo Sosei University, Iwaki, Fukushima, Japan
  - Graduate School of Science and Engineering, Iryo Sosei University, Iwaki, Fukushima, Japan
  - Faculty of Pharmacy, Iryo Sosei University, Iwaki, Fukushima, Japan
3
Gemayel K, Lomsadze A, Borodovsky M. StartLink and StartLink+: Prediction of Gene Starts in Prokaryotic Genomes. Frontiers in Bioinformatics 2021; 1:704157. [PMID: 36303749 PMCID: PMC9581028 DOI: 10.3389/fbinf.2021.704157] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/01/2021] [Accepted: 11/04/2021] [Indexed: 12/31/2022] [Open Access]
Abstract
State-of-the-art algorithms of ab initio gene prediction for prokaryotic genomes were shown to be sufficiently accurate. A pair of algorithms would agree on predictions of gene 3'ends. Nonetheless, predictions of gene starts would not match for 15-25% of genes in a genome. This discrepancy is a serious issue that is difficult to be resolved due to the absence of sufficiently large sets of genes with experimentally verified starts. We have introduced StartLink that infers gene starts from conservation patterns revealed by multiple alignments of homologous nucleotide sequences. We also have introduced StartLink+ combining both ab initio and alignment-based methods. The ability of StartLink to predict the start of a given gene is restricted by the availability of homologs in a database. We observed that StartLink made predictions for 85% of genes per genome on average. The StartLink+ accuracy was shown to be 98-99% on the sets of genes with experimentally verified starts. In comparison with database annotations, we observed that the annotated gene starts deviated from the StartLink+ predictions for ∼5% of genes in AT-rich genomes and for 10-15% of genes in GC-rich genomes on average. The use of StartLink+ has a potential to significantly improve gene start annotation in genomic databases.
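One way to picture how an ab initio caller and an alignment-based caller can be combined is a simple agreement filter: report a gene start only when both methods make a call and the calls coincide. The sketch below is an illustration under that assumption. The abstract states only that StartLink+ combines both kinds of method, so the agreement rule, the function names, and the coordinate representation here are all hypothetical and are not taken from the StartLink software.

```python
from typing import Iterable, Optional, Tuple


def consensus_start(ab_initio_start: Optional[int],
                    alignment_start: Optional[int]) -> Optional[int]:
    """Return a start coordinate only when both predictors agree.

    Assumption (not from the abstract): None means the method made no call,
    for example because no homologs were available for the alignment step.
    """
    if ab_initio_start is None or alignment_start is None:
        return None
    if ab_initio_start == alignment_start:
        return ab_initio_start
    return None


def disagreement_rate(pairs: Iterable[Tuple[Optional[int], Optional[int]]]) -> float:
    """Fraction of genes, among those called by both methods, whose starts differ."""
    both = [(a, b) for a, b in pairs if a is not None and b is not None]
    if not both:
        return 0.0
    return sum(1 for a, b in both if a != b) / len(both)
```

Under this assumed rule, coverage falls (genes without an alignment-based call get no combined prediction) in exchange for keeping only the calls that two independent lines of evidence support.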
Affiliation(s)
- Karl Gemayel
  - School of Computational Science and Engineering, Georgia Tech, Atlanta, GA, United States
- Alexandre Lomsadze
  - Wallace H Coulter Department of Biomedical Engineering, Georgia Tech and Emory University, Atlanta, GA, United States
- Mark Borodovsky
  - School of Computational Science and Engineering, Georgia Tech, Atlanta, GA, United States
  - Wallace H Coulter Department of Biomedical Engineering, Georgia Tech and Emory University, Atlanta, GA, United States
  - Moscow Institute of Physics and Technology, Dolgoprudny, Moscow, Russia
4
Lomsadze A, Gemayel K, Tang S, Borodovsky M. Modeling leaderless transcription and atypical genes results in more accurate gene prediction in prokaryotes. Genome Res 2018; 28:1079-1089. [PMID: 29773659 PMCID: PMC6028130 DOI: 10.1101/gr.230615.117] [Citation(s) in RCA: 120] [Impact Index Per Article: 17.1] [Received: 09/29/2017] [Accepted: 05/16/2018] [Indexed: 11/24/2022]
Abstract
In a conventional view of the prokaryotic genome organization, promoters precede operons and ribosome binding sites (RBSs) with Shine-Dalgarno consensus precede genes. However, recent experimental research suggesting a more diverse view motivated us to develop an algorithm with improved gene-finding accuracy. We describe GeneMarkS-2, an ab initio algorithm that uses a model derived by self-training for finding species-specific (native) genes, along with an array of precomputed "heuristic" models designed to identify harder-to-detect genes (likely horizontally transferred). Importantly, we designed GeneMarkS-2 to identify several types of distinct sequence patterns (signals) involved in gene expression control, among them the patterns characteristic for leaderless transcription as well as noncanonical RBS patterns. To assess the accuracy of GeneMarkS-2, we used genes validated by COG (Clusters of Orthologous Groups) annotation, proteomics experiments, and N-terminal protein sequencing. We observed that GeneMarkS-2 performed better on average in all accuracy measures when compared with the current state-of-the-art gene prediction tools. Furthermore, the screening of ∼5000 representative prokaryotic genomes made by GeneMarkS-2 predicted frequent leaderless transcription in both archaea and bacteria. We also observed that the RBS sites in some species with leadered transcription did not necessarily exhibit the Shine-Dalgarno consensus. The modeling of different types of sequence motifs regulating gene expression prompted a division of prokaryotic genomes into five categories with distinct sequence patterns around the gene starts.
Affiliation(s)
- Alexandre Lomsadze
  - Wallace H. Coulter Department of Biomedical Engineering, Georgia Tech, Atlanta, Georgia 30332, USA
  - Gene Probe, Incorporated, Atlanta, Georgia 30324, USA
- Karl Gemayel
  - School of Computational Science and Engineering, Georgia Tech, Atlanta, Georgia 30332, USA
- Shiyuyun Tang
  - School of Biological Sciences, Georgia Tech, Atlanta, Georgia 30332, USA
- Mark Borodovsky
  - Wallace H. Coulter Department of Biomedical Engineering, Georgia Tech, Atlanta, Georgia 30332, USA
  - Gene Probe, Incorporated, Atlanta, Georgia 30324, USA
  - School of Computational Science and Engineering, Georgia Tech, Atlanta, Georgia 30332, USA
  - School of Biological Sciences, Georgia Tech, Atlanta, Georgia 30332, USA
  - Department of Biological and Medical Physics, Moscow Institute of Physics and Technology, Moscow, 141700, Russia
5
Daimon K, Ishino S, Imai N, Nagumo S, Yamagami T, Matsukawa H, Ishino Y. Two Family B DNA Polymerases From Aeropyrum pernix, Based on Revised Translational Frames. Front Mol Biosci 2018; 5:37. [PMID: 29713633 PMCID: PMC5911459 DOI: 10.3389/fmolb.2018.00037] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 01/31/2018] [Accepted: 03/28/2018] [Indexed: 11/23/2022] [Open Access]
Abstract
Living organisms are divided into three domains, Bacteria, Eukarya, and Archaea. Comparative studies in the three domains have provided useful information to understand the evolution of the DNA replication machinery. DNA polymerase is the central enzyme of DNA replication. The presence of multiple family B DNA polymerases is unique in Crenarchaeota, as compared with other archaeal phyla, which have a single enzyme each for family B (PolB) and family D (PolD). We analyzed PolB1 and PolB3 in the hyperthermophilic crenarchaeon, Aeropyrum pernix, and found that they are larger proteins than those predicted from the coding regions in our previous study and from public database annotations. The recombinant larger PolBs exhibited the same DNA polymerase activities as previously reported. However, the larger PolB3 showed remarkably higher thermostability, which made this enzyme applicable to PCR. In addition, the high tolerance to salt and heparin suggests that PolB3 will be useful for amplification from the samples with contaminants, and therefore it has a great potential for diagnostic use in the medical and environmental field.
Affiliation(s)
- Katsuya Daimon
  - Graduate School of Bioresource and Bioenvironmental Sciences, Kyushu University, Fukuoka, Japan
- Sonoko Ishino
  - Graduate School of Bioresource and Bioenvironmental Sciences, Kyushu University, Fukuoka, Japan
- Namiko Imai
  - Graduate School of Bioresource and Bioenvironmental Sciences, Kyushu University, Fukuoka, Japan
- Sachiyo Nagumo
  - Graduate School of Bioresource and Bioenvironmental Sciences, Kyushu University, Fukuoka, Japan
- Takeshi Yamagami
  - Graduate School of Bioresource and Bioenvironmental Sciences, Kyushu University, Fukuoka, Japan
- Hiroaki Matsukawa
  - Graduate School of Bioresource and Bioenvironmental Sciences, Kyushu University, Fukuoka, Japan
- Yoshizumi Ishino
  - Graduate School of Bioresource and Bioenvironmental Sciences, Kyushu University, Fukuoka, Japan
6
Demey LM, Miller CR, Manzella MP, Spurbeck RR, Sandhu SK, Reguera G, Kashefi K. The draft genome of the hyperthermophilic archaeon Pyrodictium delaneyi strain hulk, an iron and nitrate reducer, reveals the capacity for sulfate reduction. Stand Genomic Sci 2017; 12:47. [PMID: 28814988 PMCID: PMC5556600 DOI: 10.1186/s40793-017-0260-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Received: 08/06/2017] [Accepted: 08/08/2017] [Indexed: 01/01/2023] [Open Access]
Abstract
Pyrodictium delaneyi strain Hulk is a newly sequenced strain isolated from chimney samples collected from the Hulk sulfide mound on the main Endeavour Segment of the Juan de Fuca Ridge (47.9501 latitude, -129.0970 longitude, depth 2200 m) in the Northeast Pacific Ocean. The draft genome of strain Hulk shared 99.77% similarity with the complete genome of the type strain Su06T, which shares with strain Hulk the ability to reduce iron and nitrate for respiration. The annotation of the genome of strain Hulk identified genes for the reduction of several sulfur-containing electron acceptors, an unsuspected respiratory capability in this species that was experimentally confirmed for strain Hulk. This makes P. delaneyi strain Hulk the first hyperthermophilic archaeon known to gain energy for growth by reduction of iron, nitrate, and sulfur-containing electron acceptors. Here we present the most notable features of the genome of P. delaneyi strain Hulk and identify genes encoding proteins critical to its respiratory versatility at high temperatures. The description presented here corresponds to a draft genome sequence containing 2,042,801 bp in 9 contigs, 2019 protein-coding genes, 53 RNA genes, and 1365 hypothetical genes.
Affiliation(s)
- Lucas M. Demey
  - Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI USA
- Caitlin R. Miller
  - Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI USA
- Michael P Manzella
  - Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI USA
  - Present address: Natural Resource Ecology Laboratory, Colorado State University, Fort Collins, Colorado, USA
- Rachel R. Spurbeck
  - Applied Genomics and Biology Group, Department of CBRNE Defense, Battelle Memorial Institute, Columbus, OH USA
- Gemma Reguera
  - Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI USA
- Kazem Kashefi
  - Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI USA
7
Willems P, Ndah E, Jonckheere V, Stael S, Sticker A, Martens L, Van Breusegem F, Gevaert K, Van Damme P. N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana. Mol Cell Proteomics 2017; 16:1064-1080. [PMID: 28432195 PMCID: PMC5461538 DOI: 10.1074/mcp.m116.066662] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Received: 12/22/2016] [Revised: 04/11/2017] [Indexed: 01/05/2023] [Open Access]
Abstract
Proteogenomics is an emerging research field yet lacking a uniform method of analysis. Proteogenomic studies in which N-terminal proteomics and ribosome profiling are combined, suggest that a high number of protein start sites are currently missing in genome annotations. We constructed a proteogenomic pipeline specific for the analysis of N-terminal proteomics data, with the aim of discovering novel translational start sites outside annotated protein coding regions. In summary, unidentified MS/MS spectra were matched to a specific N-terminal peptide library encompassing protein N termini encoded in the Arabidopsis thaliana genome. After a stringent false discovery rate filtering, 117 protein N termini compliant with N-terminal methionine excision specificity and indicative of translation initiation were found. These include N-terminal protein extensions and translation from transposable elements and pseudogenes. Gene prediction provided supporting protein-coding models for approximately half of the protein N termini. Besides the prediction of functional domains (partially) contained within the newly predicted ORFs, further supporting evidence of translation was found in the recently released Araport11 genome re-annotation of Arabidopsis and computational translations of sequences stored in public repositories. Most interestingly, complementary evidence by ribosome profiling was found for 23 protein N termini. Finally, by analyzing protein N-terminal peptides, an in silico analysis demonstrates the applicability of our N-terminal proteogenomics strategy in revealing protein-coding potential in species with well- and poorly-annotated genomes.
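The filter on "N-terminal methionine excision specificity" mentioned above can be illustrated with the commonly cited rule that the initiator methionine is removed when the second residue has a small side chain. The sketch below assumes the residue set A, C, G, P, S, T, V for that rule. The abstract does not state the exact criteria the authors applied, so the residue set and the function names are assumptions made for illustration only.

```python
# Residues at position 2 that are commonly reported to permit removal of the
# initiator methionine (assumed set; not taken from the abstract).
SMALL_SECOND_RESIDUES = frozenset("ACGPSTV")


def mature_n_terminus(protein: str) -> str:
    """Apply the simplified methionine excision rule to a protein sequence."""
    if len(protein) >= 2 and protein[0] == "M" and protein[1] in SMALL_SECOND_RESIDUES:
        return protein[1:]  # initiator Met removed
    return protein  # Met retained, or sequence does not start with Met


def is_nme_compliant(observed_peptide: str, candidate_protein: str) -> bool:
    """Check that an observed N-terminal peptide matches the predicted mature N terminus.

    A peptide starting at residue 2 is accepted only if the rule predicts Met
    removal; a peptide starting at Met is accepted only if Met is predicted
    to be retained.
    """
    if not observed_peptide:
        return False
    return mature_n_terminus(candidate_protein).startswith(observed_peptide)
```

For example, a peptide `ASK` is consistent with translation starting at the Met of a hypothetical candidate `MASKQ`, whereas a peptide `KLV` observed for a candidate `MKLV` is not, because Met is expected to be retained before Lys.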
Affiliation(s)
- Patrick Willems
  - VIB/UGent Center for Plant Systems Biology, 9052 Ghent, Belgium
  - Ghent University, Department of Plant Biotechnology and Bioinformatics, 9052 Ghent
  - VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium
  - Ghent University, Department of Biochemistry, 9000 Ghent, Belgium
- Elvis Ndah
  - VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium
  - Ghent University, Department of Biochemistry, 9000 Ghent, Belgium
  - Ghent University, Department of Mathematical Modeling, Statistics and Bioinformatics, 9000 Ghent, Belgium
- Veronique Jonckheere
  - VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium
  - Ghent University, Department of Biochemistry, 9000 Ghent, Belgium
- Simon Stael
  - VIB/UGent Center for Plant Systems Biology, 9052 Ghent, Belgium
  - Ghent University, Department of Plant Biotechnology and Bioinformatics, 9052 Ghent
  - VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium
  - Ghent University, Department of Biochemistry, 9000 Ghent, Belgium
- Adriaan Sticker
  - VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium
  - Ghent University, Department of Biochemistry, 9000 Ghent, Belgium
  - Ghent University, Department of Mathematical Modeling, Statistics and Bioinformatics, 9000 Ghent, Belgium
- Lennart Martens
  - VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium
  - Ghent University, Department of Biochemistry, 9000 Ghent, Belgium
  - Ghent University, Department of Mathematical Modeling, Statistics and Bioinformatics, 9000 Ghent, Belgium
- Frank Van Breusegem
  - VIB/UGent Center for Plant Systems Biology, 9052 Ghent, Belgium
  - Ghent University, Department of Plant Biotechnology and Bioinformatics, 9052 Ghent
- Kris Gevaert
  - VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium
  - Ghent University, Department of Biochemistry, 9000 Ghent, Belgium
- Petra Van Damme
  - VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium
  - Ghent University, Department of Biochemistry, 9000 Ghent, Belgium
8
Marshall NC, Finlay BB, Overall CM. Sharpening Host Defenses during Infection: Proteases Cut to the Chase. Mol Cell Proteomics 2017; 16:S161-S171. [PMID: 28179412 PMCID: PMC5393396 DOI: 10.1074/mcp.o116.066456] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Received: 12/16/2016] [Revised: 02/03/2017] [Indexed: 01/14/2023] [Open Access]
Abstract
The human immune system consists of an intricate network of tightly controlled pathways, where proteases are essential instigators and executioners at multiple levels. Invading microbial pathogens also encode proteases that have evolved to manipulate and dysregulate host proteins, including host proteases during the course of disease. The identification of pathogen proteases as well as their substrates and mechanisms of action have empowered significant developments in therapeutics for infectious diseases. Yet for many pathogens, there remains a great deal to be discovered. Recently, proteomic techniques have been developed that can identify proteolytically processed proteins across the proteome. These “degradomics” approaches can identify human substrates of microbial proteases during infection in vivo and expose the molecular-level changes that occur in the human proteome during infection as an operational network to develop hypotheses for further research as well as new therapeutics. This Perspective Article reviews how proteases are utilized during infection by both the human host and invading bacterial pathogens, including archetypal virulence-associated microbial proteases, such as the Clostridia spp. botulinum and tetanus neurotoxins. We highlight the potential knowledge that degradomics studies of host–pathogen interactions would uncover, as well as how degradomics has been successfully applied in similar contexts, including use with a viral protease. We review how microbial proteases have been targeted in current therapeutic approaches and how microbial proteases have shaped and even contributed to human therapeutics beyond infectious disease. Finally, we discuss how, moving forward, degradomics research can greatly contribute to our understanding of how microbial pathogens cause disease in vivo and lead to the identification of novel substrates in vivo, and the development of improved therapeutics to counter these pathogens.
Affiliation(s)
- Natalie C Marshall
  - Department of Microbiology & Immunology
  - Michael Smith Laboratories
- B Brett Finlay
  - Department of Microbiology & Immunology
  - Michael Smith Laboratories
  - Department of Biochemistry & Molecular Biology
- Christopher M Overall
  - Department of Biochemistry & Molecular Biology
  - Department of Oral Biological & Medical Sciences, Centre for Blood Research, Life Sciences Institute, University of British Columbia, Vancouver, British Columbia, Canada
9
Abstract
Omics approaches have become popular in biology as powerful discovery tools, and currently gain in interest for diagnostic applications. Establishing the accurate genome sequence of any organism is easy, but the outcome of its annotation by means of automatic pipelines remains imprecise. Some protein-encoding genes may be missed as soon as they are specific and poorly conserved in a given taxon, while important to explain the specific traits of the organism. Translational starts are also poorly predicted in a relatively important number of cases, thus impacting the protein sequence database used in proteomics, comparative genomics, and systems biology. The use of high-throughput proteomics data to improve genome annotation is an attractive option to obtain a more comprehensive molecular picture of a given organism. Here, protocols for reannotating prokaryote genomes are described based on shotgun proteomics and derivatization of protein N-termini with a positively charged reagent coupled to high-resolution tandem mass spectrometry.
10
Liebensteiner MG, Pinkse MWH, Nijsse B, Verhaert PDEM, Tsesmetzis N, Stams AJM, Lomans BP. Perchlorate and chlorate reduction by the Crenarchaeon Aeropyrum pernix and two thermophilic Firmicutes. Environmental Microbiology Reports 2015; 7:936-945. [PMID: 26332065 DOI: 10.1111/1758-2229.12335] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Received: 08/27/2015] [Accepted: 08/27/2015] [Indexed: 06/05/2023]
Abstract
This study reports the ability of one hyperthermophilic and two thermophilic microorganisms to grow anaerobically by the reduction of chlorate and perchlorate. Physiological, genomic and proteome analyses suggest that the Crenarchaeon Aeropyrum pernix reduces perchlorate with a periplasmic enzyme related to nitrate reductases, but that it lacks a functional chlorite-disproportionating enzyme (Cld) to complete the pathway. Aeropyrum pernix, previously described as a strictly aerobic microorganism, seems to rely on the chemical reactivity of reduced sulfur compounds with chlorite, a mechanism previously reported for perchlorate-reducing Archaeoglobus fulgidus. The chemical oxidation of thiosulfate (in excessive amounts present in the medium) and the reduction of chlorite result in the release of sulfate and chloride, which are the products of a biotic-abiotic perchlorate reduction pathway in Ae. pernix. The apparent absence of Cld in two other perchlorate-reducing microorganisms, Carboxydothermus hydrogenoformans and Moorella glycerini strain NMP, and their dependence on sulfide for perchlorate reduction is consistent with the observations made on Ar. fulgidus. Our findings suggest that microbial perchlorate reduction at high temperature differs notably from the physiology of perchlorate- and chlorate-reducing mesophiles and that it is characterized by the lack of a chlorite dismutase and is enabled by a combination of biotic and abiotic reactions.
Affiliation(s)
- Martin G Liebensteiner
  - Laboratory of Microbiology, Wageningen University, Dreijenplein 10, 6703 HB, Wageningen, The Netherlands
- Martijn W H Pinkse
  - Analytical Biotechnology Section, Department of Biotechnology, Delft University of Technology, Julianalaan 67, 2628 BC, Delft, The Netherlands
  - Netherlands Proteomics Centre, Julianalaan 67, 2628 BC, Delft, The Netherlands
- Bart Nijsse
  - Laboratory of Systems and Synthetic Biology, Wageningen University, Dreijenplein 10, 6703 HB, Wageningen, The Netherlands
- Peter D E M Verhaert
  - Analytical Biotechnology Section, Department of Biotechnology, Delft University of Technology, Julianalaan 67, 2628 BC, Delft, The Netherlands
  - Netherlands Proteomics Centre, Julianalaan 67, 2628 BC, Delft, The Netherlands
- Nicolas Tsesmetzis
  - Shell International Exploration and Production Inc., 3333 Highway 6 South, Houston, TX, 77082, USA
- Alfons J M Stams
  - Laboratory of Microbiology, Wageningen University, Dreijenplein 10, 6703 HB, Wageningen, The Netherlands
  - Centre of Biological Engineering, University of Minho, Campus de Gualtar, 4710-057, Braga, Portugal
- Bart P Lomans
  - Shell Global Solutions International B.V., Kessler Park 1, 2288 GS, Rijswijk, The Netherlands
11
Berry IJ, Steele JR, Padula MP, Djordjevic SP. The application of terminomics for the identification of protein start sites and proteoforms in bacteria. Proteomics 2015; 16:257-72. [DOI: 10.1002/pmic.201500319] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Received: 08/06/2015] [Revised: 09/21/2015] [Accepted: 09/30/2015] [Indexed: 01/11/2023]
Affiliation(s)
- Iain J. Berry
  - The ithree Institute; University of Technology Sydney; Broadway NSW Australia
  - Proteomics Core Facility; University of Technology Sydney; Broadway NSW Australia
- Joel R. Steele
  - Proteomics Core Facility; University of Technology Sydney; Broadway NSW Australia
- Matthew P. Padula
  - The ithree Institute; University of Technology Sydney; Broadway NSW Australia
  - Proteomics Core Facility; University of Technology Sydney; Broadway NSW Australia
- Steven P. Djordjevic
  - The ithree Institute; University of Technology Sydney; Broadway NSW Australia
  - Proteomics Core Facility; University of Technology Sydney; Broadway NSW Australia
12
Kumar D, Mondal AK, Kutum R, Dash D. Proteogenomics of rare taxonomic phyla: A prospective treasure trove of protein coding genes. Proteomics 2015; 16:226-40. [PMID: 26773550 DOI: 10.1002/pmic.201500263] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Received: 06/30/2015] [Revised: 09/18/2015] [Accepted: 09/28/2015] [Indexed: 01/04/2023]
Abstract
Sustainable innovations in sequencing technologies have resulted in a torrent of microbial genome sequencing projects. However, the prokaryotic genomes sequenced so far are unequally distributed along their phylogenetic tree; a few phyla contain the majority, the rest only a few representatives. Accurate genome annotation lags far behind genome sequencing. While automated computational prediction, aided by comparative genomics, remains a popular choice for genome annotation, a substantial fraction of these annotations are erroneous. Proteogenomics utilizes protein level experimental observations to annotate protein coding genes on a genome wide scale. Benefits of proteogenomics include discovery and correction of gene annotations regardless of their phylogenetic conservation. This not only allows detection of common, conserved proteins but also the discovery of protein products of rare genes that may be horizontally transferred or taxonomy specific. Chances of encountering such genes are higher in rare phyla that comprise a small number of complete genome sequences. We collated all bacterial and archaeal proteogenomic studies carried out to date and reviewed them in the context of genome sequencing projects. Here, we present a comprehensive list of microbial proteogenomic studies and their taxonomic distribution, and urge targeted proteogenomics of underexplored taxa to build an extensive reference of protein coding genes.
Affiliation(s)
- Dhirendra Kumar
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
- Anupam Kumar Mondal
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
- Rintu Kutum
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
- Debasis Dash
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
13
|
Leon DR, Ytterberg AJ, Boontheung P, Kim U, Loo JA, Gunsalus RP, Ogorzalek Loo RR. Mining proteomic data to expose protein modifications in Methanosarcina mazei strain Gö1. Front Microbiol 2015; 6:149. [PMID: 25798134 PMCID: PMC4350412 DOI: 10.3389/fmicb.2015.00149] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Received: 11/01/2014] [Accepted: 02/09/2015] [Indexed: 12/11/2022] Open
Abstract
Proteomic tools identify constituents of complex mixtures, often delivering long lists of identified proteins. The high-throughput methods excel at matching tandem mass spectrometry data to spectra predicted from sequence databases. Unassigned mass spectra are ignored, but could, in principle, provide valuable information on unanticipated modifications and improve protein annotations while consuming limited quantities of material. Strategies to "mine" information from these discards are presented, along with discussion of features that, when present, provide strong support for modifications. In this study we mined LC-MS/MS datasets of proteolytically-digested concanavalin A pull down fractions from Methanosarcina mazei Gö1 cell lysates. Analyses identified 154 proteins. Many of the observed proteins displayed post-translationally modified forms, including O-formylated and methyl-esterified segments that appear biologically relevant (i.e., not artifacts of sample handling). Interesting cleavages and modifications (e.g., S-cyanylation and trimethylation) were observed near catalytic sites of methanogenesis enzymes. Of 31 Methanosarcina protein N-termini recovered by concanavalin A binding or from a previous study, only M. mazei S-layer protein MM1976 and its M. acetivorans C2A orthologue, MA0829, underwent signal peptide excision. Experimental results contrast with predictions from algorithms SignalP 3.0 and Exprot, which were found to over-predict the presence of signal peptides. Proteins MM0002, MM0716, MM1364, and MM1976 were found to be glycosylated, and employing chromatography tailored specifically for glycopeptides will likely reveal more. This study supplements limited, existing experimental datasets of mature archaeal N-termini, including presence or absence of signal peptides, translation initiation sites, and other processing. Methanosarcina surface and membrane proteins are richly modified.
Affiliation(s)
- Deborah R Leon
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, USA
- A Jimmy Ytterberg
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, USA
- Pinmanee Boontheung
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, USA
- Unmi Kim
- Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, Los Angeles, CA, USA
- Joseph A Loo
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, USA; Department of Biological Chemistry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA; UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, Los Angeles, CA, USA
- Robert P Gunsalus
- Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, Los Angeles, CA, USA; UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, Los Angeles, CA, USA
- Rachel R Ogorzalek Loo
- Department of Biological Chemistry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA; UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, Los Angeles, CA, USA
14
|
Kucharova V, Wiker HG. Proteogenomics in microbiology: taking the right turn at the junction of genomics and proteomics. Proteomics 2014; 14:2360-675. [PMID: 25263021 DOI: 10.1002/pmic.201400168] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Received: 04/28/2014] [Revised: 08/18/2014] [Accepted: 09/23/2014] [Indexed: 12/14/2022]
Abstract
High-accuracy and high-throughput proteomic methods have completely changed the way we can identify and characterize proteins. MS-based proteomics can now provide a unique supplement to genomic data and add a new level of information to the interpretation of genomic sequences. Proteomics-driven genome annotation has become especially relevant in microbiology where genomes are sequenced on a daily basis and limitations of an in silico driven annotation process are well recognized. In this review paper, we outline different strategies on how one can design a proteogenomic experiment, for example on genome-sequenced (synonymous proteogenomics) versus unsequenced organisms (ortho-proteogenomics) or with the aid of other "omic" data such as RNA-seq. We touch upon many challenges that are encountered during a typical proteogenomic study, mostly concerning bioinformatics methods and downstream data analysis, but also related to creation and use of sequence databases. A large list of proteogenomic case studies of different microorganisms is provided to illustrate the mapping of MS/MS-derived peptide spectra to genomic DNA sequences. These investigations have led to accurate determination of translational initiation sites, pointed out eventual read-throughs or programmed frameshifts, detected signal peptide processing or other protein maturation events, removed questionable annotation assignments, and provided evidence for predicted hypothetical proteins.
Affiliation(s)
- Veronika Kucharova
- Department of Clinical Science, The Gade Research Group for Infection and Immunity, University of Bergen, Norway
15
|
Armengaud J, Hartmann EM, Bland C. Proteogenomics for environmental microbiology. Proteomics 2013; 13:2731-42. [PMID: 23636904 DOI: 10.1002/pmic.201200576] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.4] [Received: 12/19/2012] [Revised: 03/06/2013] [Accepted: 04/09/2013] [Indexed: 11/09/2022]
Abstract
Proteogenomics sensu stricto refers to the use of proteomic data to refine the annotation of genomes from model organisms. Because of the limitations of automatic annotation pipelines, a relatively high number of errors occur during the structural annotation of genes coding for proteins. Whether putative orphan sequences or short genes encoding low-molecular-weight proteins really exist is still frequently a mystery. Whether start codons are well defined is also an open debate. These problems are exacerbated for genomes of microorganisms belonging to poorly documented genera, as related sequences are not always available for homology-guided annotation. The functional annotation of a significant proportion of genes is also another well-known issue when annotating environmental microorganisms. High-throughput shotgun proteomics has recently greatly evolved, allowing the exploration of the proteome from any microorganism at an unprecedented depth. The structural and functional annotation process may be usefully complemented with experimental data. Indeed, proteogenomic mapping has been successfully performed for a wide variety of organisms. Specific approaches devoted to systematically establishing the N-termini of a large set of proteins are being developed. N-terminomics is giving rise to datasets of experimentally proven translational start codons as well as validated peptide signals for secreted proteins. By extension, combining genomic and proteomic data is becoming routine in many research projects. The proteomic analysis of organisms with unfinished genome sequences, the so-called composite proteomics, and the search for microbial biomarkers by bottom-up and top-down combined approaches are some examples of proteogenomic-flavored studies. They illustrate the advent of a new era of environmental microbiology where proteomics and genomics are intimately integrated to answer key biological questions.
Affiliation(s)
- Jean Armengaud
- CEA, DSV, IBEB, Lab Biochim System Perturb, Bagnols-sur-Cèze, France
16
|
Guo FB, Xiong L, Teng JLL, Yuen KY, Lau SKP, Woo PCY. Re-annotation of protein-coding genes in 10 complete genomes of Neisseriaceae family by combining similarity-based and composition-based methods. DNA Res 2013; 20:273-86. [PMID: 23571676 PMCID: PMC3686433 DOI: 10.1093/dnares/dst009] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Indexed: 12/31/2022] Open
Abstract
In this paper, we performed a comprehensive re-annotation of protein-coding genes by a systematic method combining composition- and similarity-based approaches in 10 complete bacterial genomes of the family Neisseriaceae. First, 418 hypothetical genes were predicted as non-coding using the composition-based method and 413 were eliminated from the gene list. Both the scatter plot and cluster of orthologous groups (COG) fraction analyses supported the result. Second, from 20 to 400 hypothetical proteins were assigned functions in each of the 10 strains based on the homology search. Among the newly assigned functions, 397 are detailed enough to have definite gene names. Third, 106 genes missed by the original annotations were picked up by an ab initio gene finder combined with similarity alignment. Transcriptional experiments validated the effectiveness of this method in Laribacter hongkongensis and Chromobacterium violaceum. Among the 106 newly found genes, some deserve particular interest. For example, 27 transposases were newly found in Neisseria meningitidis alpha14. In Neisseria gonorrhoeae NCCP11945, four new genes with putative functions and definite names (nusG, rpsN, rpmD and infA) were found, and their homologues are usually essential for survival in bacteria. The updated annotations for the 10 Neisseriaceae genomes provide a more accurate prediction of protein-coding genes and more detailed functional information on hypothetical proteins. It will benefit research into the lifestyle, metabolism, environmental adaptation and pathogenicity of the Neisseriaceae species. The re-annotation procedure could be used directly, or after the adaptation of detailed methods, for checking annotations of any other bacterial or archaeal genomes.
Affiliation(s)
- Feng-Biao Guo
- Department of Microbiology, The University of Hong Kong, Special Administrative Region, Hong Kong, People's Republic of China
17
|
Abstract
High-throughput identification of proteins with the latest generation of hybrid high-resolution mass spectrometers is opening new perspectives in microbiology. I present, here, an overview of tandem mass spectrometry technology and bioinformatics for shotgun proteomics that make 2D-PAGE approaches obsolete. Non-labelling quantitative approaches have become more popular than labelling techniques on most proteomic platforms because they are easier to carry out while their quantitative outcome is rather robust. Parameters for recording mass spectrometry data, however, need to be chosen carefully and statistics to assess the confidence of the results should not be neglected. Interestingly, next-generation sequencing methodologies make any microbial model quickly amenable to proteomics, leading to the documentation of a wide range of organisms from diverse environments. Some recent discoveries made using microbial proteomics have challenged some biological dogma, such as: (i) initiation of the translation does not occur predominantly from ATG codons in some microorganisms, (ii) non-canonical initiation codons are used to regulate the production of specific but important proteins and (iii) a gene may code for multiple polypeptide species, heterogeneous in terms of sequences. Microbial diversity and microbial physiology can now be revisited by means of exhaustive comparative proteomic surveys where thousands of proteins are detected and quantified. Proteogenomics, consisting of better annotating of genomes with the help of proteomic evidence, is paving the way for integrated multi-omic approaches in microbiology. Finally, meta-proteomic tools and approaches are emerging for tackling the high complexity of the microbial world as a whole, opening new perspectives for assessing how microbial communities function.
Affiliation(s)
- Jean Armengaud
- CEA, DSV, IBEB, Lab Biochim System Perturb, F-30207 Bagnols-sur-Cèze, France.
18
|
Okamoto A, Yamada K. Proteome driven re-evaluation and functional annotation of the Streptococcus pyogenes SF370 genome. BMC Microbiol 2011; 11:249. [PMID: 22070424 PMCID: PMC3224786 DOI: 10.1186/1471-2180-11-249] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Received: 05/11/2011] [Accepted: 11/10/2011] [Indexed: 12/02/2022] Open
Abstract
Background: The genome data of Streptococcus pyogenes SF370 has been widely used by many researchers and provides a vast array of interesting findings. Nevertheless, approximately 40% of genes remain classified as hypothetical proteins, and several coding sequences (CDSs) have been unrecognized. In this study, we attempted a shotgun proteomic analysis with a six-frame database that was independent of genome annotation. Results: Nine proteins encoded by novel ORFs were found by shotgun proteomic analysis, and their specific mRNAs were verified by reverse transcriptional PCR (RT-PCR). We also provided functional annotations for hypothetical genes using proteomic analysis from three different culture conditions that were separated into three fractions: supernatant, soluble, and insoluble. Consequently, we identified 567 proteins on re-evaluation of the proteomic data using an in-house database comprising 1,697 annotated and nine non-annotated CDSs. We provided functional annotations for 126 hypothetical proteins (18.9% out of the 668 hypothetical proteins) based on their cellular fractions and expression profiles under different culture conditions. Conclusions: The list of amino acid sequences that were annotated by genome analysis contains outdated information and unrecognized protein-coding sequences. We suggest that the six-frame database derived from actual DNA sequences be used for reliable proteomic analysis. In addition, the experimental evidence from functional proteomic analysis is useful for the re-evaluation of previously sequenced genomes.
Affiliation(s)
- Akira Okamoto
- Department of Molecular Bacteriology, Nagoya University Graduate School of Medicine, 65 Tsurumai-cho, Showa-ku, Nagoya, Aichi 466-8550, Japan.
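The abstract of entry 18 recommends searching spectra against a six-frame database derived from the DNA sequence rather than from the existing annotation. As a rough illustration of that idea only, and not the authors' actual pipeline, the sketch below translates a DNA string in all six reading frames using the standard genetic code. The function names (`six_frame_translate`, `translate`, `reverse_complement`) and the toy input sequence are assumptions introduced here; a real database builder would also split each frame at stop codons and write the resulting segments to a FASTA file.

```python
from typing import Dict

# Standard genetic code, with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
_CODONS = [a + b + c for a in _BASES for b in _BASES for c in _BASES]
CODON_TABLE: Dict[str, str] = dict(zip(_CODONS, _AMINO_ACIDS))

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(_COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons only; unknown codons (e.g. with N) become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translate(dna: str) -> Dict[str, str]:
    """Translate a DNA sequence in frames +1..+3 and -1..-3 ('*' marks a stop)."""
    dna = dna.upper()
    rc = reverse_complement(dna)
    frames: Dict[str, str] = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(dna[offset:])
        frames[f"-{offset + 1}"] = translate(rc[offset:])
    return frames


if __name__ == "__main__":
    # Hypothetical toy sequence, used only to show the call.
    for frame, protein in six_frame_translate("ATGGCCTAA").items():
        print(frame, protein)
```

Because all six frames are kept, peptides from unannotated or mis-annotated coding sequences remain searchable, at the cost of a much larger search space than an annotation-derived protein list.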
19
|
Mochizuki T, Sako Y, Prangishvili D. Provirus induction in hyperthermophilic archaea: characterization of Aeropyrum pernix spindle-shaped virus 1 and Aeropyrum pernix ovoid virus 1. J Bacteriol 2011; 193:5412-9. [PMID: 21784945 PMCID: PMC3187419 DOI: 10.1128/jb.05101-11] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.1] [Received: 04/15/2011] [Accepted: 06/28/2011] [Indexed: 01/29/2023] Open
Abstract
By in silico analysis, we have identified two putative proviruses in the genome of the hyperthermophilic archaeon Aeropyrum pernix, and under special conditions of A. pernix growth, we were able to induce their replication. Both viruses were isolated and characterized. Negatively stained virions of one virus appeared as pleomorphic spindle-shaped particles, 180 to 210 nm by 40 to 55 nm, with tails of heterogeneous lengths in the range of 0 to 300 nm. This virus was named Aeropyrum pernix spindle-shaped virus 1 (APSV1). Negatively stained virions of the other virus appeared as slightly irregular oval particles with one pointed end, while in cryo-electron micrographs, the virions had a regular oval shape and uniform size (70 by 55 nm). The virus was named Aeropyrum pernix ovoid virus 1 (APOV1). Both viruses have circular, double-stranded DNA genomes of 38,049 bp for APSV1 and 13,769 bp for APOV1. Similarities to proteins of other archaeal viruses were limited to the integrase and Dna1-like protein. We propose to classify APOV1 into the family Guttaviridae.
Affiliation(s)
- Tomohiro Mochizuki
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Department of Microbiology, Institut Pasteur, 75015 Paris, France
- Yoshihiko Sako
- Laboratory of Marine Microbiology, Graduate School of Agriculture, Kyoto University, Kyoto 606-8502, Japan
- David Prangishvili
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Department of Microbiology, Institut Pasteur, 75015 Paris, France
20
|
Enrichment and proteome analysis of a hyperthermostable protein set of archaeon Thermococcus onnurineus NA1. Extremophiles 2011; 15:451-61. [DOI: 10.1007/s00792-011-0376-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Received: 09/04/2010] [Accepted: 04/06/2011] [Indexed: 10/18/2022]
21
|
Lee YG, Kang SG, Lee JH, Kim SI, Chung YH. Characterization of hyperthermostable fructose-1,6-bisphosphatase from Thermococcus onnurineus NA1. J Microbiol 2011; 48:803-7. [PMID: 21221938 DOI: 10.1007/s12275-010-0377-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Received: 09/16/2010] [Accepted: 11/11/2010] [Indexed: 11/26/2022]
Abstract
To understand the physiological functions of thermostable fructose-1,6-bisphosphatase (TNA1-Fbp) from Thermococcus onnurineus NA1, its recombinant enzyme was overexpressed in Escherichia coli, purified, and the enzymatic properties were characterized. The enzyme showed maximal activity for fructose-1,6-bisphosphate at 95 °C and pH 8.0 with a half-life (t1/2) of about 8 h. TNA1-Fbp had broad substrate specificities for fructose-1,6-bisphosphate and its analogues including fructose-1-phosphate, glucose-1-phosphate, and phosphoenolpyruvate. In addition, its enzyme activity was increased five-fold by addition of 1 mM Mg²⁺, while Li⁺ did not enhance enzymatic activity. TNA1-Fbp activity was inhibited by ATP, ADP, and phosphoenolpyruvate, but AMP up to 100 mM did not have any effect. TNA1-Fbp is currently defined as a class V fructose-1,6-bisphosphatase (FBPase) because it is very similar to FBPase of Thermococcus kodakaraensis KOD1 based on sequence homology. However, this enzyme shows a different range of substrate specificities. These results suggest that TNA1-Fbp can establish a new criterion for class V FBPases.
Affiliation(s)
- Yeol Gyun Lee
- Division of Life Science, Korea Basic Science Institute, Daejeon, 305-806, Republic of Korea
22
|
Baker BJ, Comolli LR, Dick GJ, Hauser LJ, Hyatt D, Dill BD, Land ML, VerBerkmoes NC, Hettich RL, Banfield JF. Enigmatic, ultrasmall, uncultivated Archaea. Proc Natl Acad Sci U S A 2010; 107:8806-11. [PMID: 20421484 PMCID: PMC2889320 DOI: 10.1073/pnas.0914470107] [Citation(s) in RCA: 214] [Impact Index Per Article: 14.3] [Indexed: 11/18/2022] Open
Abstract
Metagenomics has provided access to genomes of as yet uncultivated microorganisms in natural environments, yet there are gaps in our knowledge, particularly for Archaea that occur at relatively low abundance and in extreme environments. Ultrasmall cells (<500 nm in diameter) from lineages without cultivated representatives that branch near the crenarchaeal/euryarchaeal divide have been detected in a variety of acidic ecosystems. We reconstructed composite, near-complete approximately 1-Mb genomes for three lineages, referred to as ARMAN (archaeal Richmond Mine acidophilic nanoorganisms), from environmental samples and a biofilm filtrate. Genes of two lineages are among the smallest yet described, enabling a 10% higher coding density than found in genomes of the same size, and there are noncontiguous genes. No biological function could be inferred for up to 45% of genes and no more than 63% of the predicted proteins could be assigned to a revised set of archaeal clusters of orthologous groups. Some core metabolic genes are more common in Crenarchaeota than Euryarchaeota, up to 21% of genes have the highest sequence identity to bacterial genes, and 12 belong to clusters of orthologous groups that were previously exclusive to bacteria. A small subset of 3D cryo-electron tomographic reconstructions clearly show penetration of the ARMAN cell wall and cytoplasmic membranes by protuberances extended from cells of the archaeal order Thermoplasmatales. Interspecies interactions, the presence of a unique internal tubular organelle [Comolli, et al. (2009) ISME J 3:159-167], and many genes previously only affiliated with Crenarchaea or Bacteria indicate extensive unique physiology in organisms that branched close to the time that Cren- and Euryarchaeotal lineages diverged.
Affiliation(s)
- Luis R. Comolli
- Lawrence Berkeley National Laboratories, Berkeley, CA 94720
- Brian D. Dill
- Chemical Sciences Divisions, Oak Ridge National Laboratory, Oak Ridge, TN 37831
- Robert L. Hettich
- Chemical Sciences Divisions, Oak Ridge National Laboratory, Oak Ridge, TN 37831
- Jillian F. Banfield
- Department of Earth and Planetary Science and Environmental Science, Policy, and Management, University of California, Berkeley, CA 94720
23
|
Mochizuki T, Yoshida T, Tanaka R, Forterre P, Sako Y, Prangishvili D. Diversity of viruses of the hyperthermophilic archaeal genus Aeropyrum, and isolation of the Aeropyrum pernix bacilliform virus 1, APBV1, the first representative of the family Clavaviridae. Virology 2010; 402:347-54. [PMID: 20430412 DOI: 10.1016/j.virol.2010.03.046] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.7] [Received: 12/16/2009] [Revised: 01/19/2010] [Accepted: 03/29/2010] [Indexed: 10/19/2022]
Abstract
We have surveyed the morphological diversity of viruses infecting the archaeon Aeropyrum pernix, the most thermophilic species among aerobic organisms, growing optimally at 90 °C, and isolated and characterized a novel virus, Aeropyrum pernix bacilliform virus 1, APBV1. This is the first virus to be described of the genus Aeropyrum and the archaeal order Desulfurococcales. The virion of APBV1 has rigid bacilliform morphology, about 140 × 20 nm, with one end pointed and the other rounded. It contains a highly glycosylated single major protein and three minor proteins. The circular, double-stranded DNA genome comprising 5278 bp is the smallest for known archaeal viruses. None of the 14 putative genes, all on the same DNA strand, shows significant similarity to sequences in the public databases. The APBV1 infection caused neither retardation of host growth nor lysis of host cells, and integration of the viral genome into the host chromosome was not detected. On the basis of unusual morphological and genomic properties, we propose to consider APBV1 as the first representative of a new viral family, the Clavaviridae.
Affiliation(s)
- Tomohiro Mochizuki
- Institute Pasteur, Molecular Biology of the Gene in Extremophiles Unit, Department of Microbiology, F-75015 Paris, France
24
|
Armengaud J. Proteogenomics and systems biology: quest for the ultimate missing parts. Expert Rev Proteomics 2010; 7:65-77. [DOI: 10.1586/epr.09.104] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Indexed: 08/30/2023]
25
|
Lee DK, Hwang JY, Cho HY, Kong KH. A Thermostable Aspartate Aminotransferase from Aeropyrum pernix K1. Bull Korean Chem Soc 2009. [DOI: 10.5012/bkcs.2009.30.12.3143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/20/2022]
26
|
Ellen AF, Albers SV, Driessen AJM. Comparative study of the extracellular proteome of Sulfolobus species reveals limited secretion. Extremophiles 2009; 14:87-98. [PMID: 19957093 PMCID: PMC2797410 DOI: 10.1007/s00792-009-0290-y] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Received: 11/05/2009] [Accepted: 11/10/2009] [Indexed: 01/01/2023]
Abstract
Although a large number of potentially secreted proteins can be predicted on the basis of genomic distribution of signal sequence-bearing proteins, protein secretion in Archaea has barely been studied. A proteomic inventory and comparison of the growth medium proteins in three hyperthermoacidophiles, i.e., Sulfolobus solfataricus, S. acidocaldarius and S. tokodaii, indicates that only a few proteins are freely secreted into the growth medium and that the majority originates from cell envelope bound forms. In S. acidocaldarius both cell-associated and secreted α-amylase activities are detected. Inactivation of the amyA gene resulted in a complete loss of activity, suggesting that the same protein is responsible for the α-amylase activity at both locations. It is concluded that protein secretion in Sulfolobus is a limited process, and it is suggested that the S-layer may act as a barrier for the free diffusion of folded proteins into the medium.
Affiliation(s)
- Albert F Ellen
- Department of Molecular Microbiology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Kerklaan 30, 9751 NN Haren, The Netherlands
27
|
Baudet M, Ortet P, Gaillard JC, Fernandez B, Guérin P, Enjalbal C, Subra G, de Groot A, Barakat M, Dedieu A, Armengaud J. Proteomics-based refinement of Deinococcus deserti genome annotation reveals an unwonted use of non-canonical translation initiation codons. Mol Cell Proteomics 2009; 9:415-26. [PMID: 19875382 PMCID: PMC2830850 DOI: 10.1074/mcp.m900359-mcp200] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.9] [Indexed: 01/28/2023] Open
Abstract
Deinococcaceae are a family of extremely radiation-tolerant bacteria that are currently subjected to numerous studies aimed at understanding the molecular mechanisms for such radiotolerance. To achieve a comprehensive and accurate annotation of the Deinococcus deserti genome, we performed an N terminus-oriented characterization of its proteome. For this, we used a labeling reagent, N-tris(2,4,6-trimethoxyphenyl)phosphonium acetyl succinimide, to selectively derivatize protein N termini. The large scale identification of N-tris(2,4,6-trimethoxyphenyl)phosphonium acetyl succinimide-modified N-terminal-most peptides by shotgun liquid chromatography-tandem mass spectrometry analysis led to the validation of 278 and the correction of 73 translation initiation codons in the D. deserti genome. In addition, four new genes were detected, three located on the main chromosome and one on plasmid P3. We also analyzed signal peptide cleavages on a genome-wide scale. Based on comparative proteogenomics analysis, we propose a set of 137 corrections to improve Deinococcus radiodurans and Deinococcus geothermalis gene annotations. Some of these corrections affect important genes involved in DNA repair mechanisms such as polA, ligA, and ddrB. Surprisingly, experimental evidence was obtained indicating that DnaA (the protein involved in the DNA replication initiation process) and RpsL (the S12 ribosomal conserved protein) translation is initiated in Deinococcaceae from non-canonical codons (ATC and CTG, respectively). Such use may be the basis of specific regulation mechanisms affecting replication and translation. We also report the use of non-conventional translation initiation codons for two other genes: Deide_03051 and infC. Whether such use of non-canonical translation initiation codons is much more frequent than for other previously reported bacterial phyla or restricted to Deinococcaceae remains to be investigated. Our results demonstrate that predicting translation initiation codons is still difficult for some bacteria and that proteomics-based refinement of genome annotations may be helpful in such cases.
Affiliation(s)
- Mathieu Baudet
- Laboratoire de Biochimie des Systèmes Perturbés, Service de Biochimie et Toxicologie Nucléaire, Institut de Biologie Environnementale et Biotechnologie (iBEB), Direction des Sciences du Vivant (DSV), Commissariat à l'Energie Atomique et aux Energies Alternatives (CEA), F-30207 Bagnols-sur-Cèze, France
28
|
Palmieri G, Cannio R, Fiume I, Rossi M, Pocsfalvi G. Outside the unusual cell wall of the hyperthermophilic archaeon Aeropyrum pernix K1. Mol Cell Proteomics 2009; 8:2570-81. [PMID: 19640852 DOI: 10.1074/mcp.m900012-mcp200] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Indexed: 11/06/2022] Open
Abstract
In contrast to the extensively studied eukaryal and bacterial protein secretion systems, comparatively less is known about how and which proteins cross the archaeal cell membrane. To identify secreted proteins of the hyperthermophilic archaeon Aeropyrum pernix K1 we used a proteomics approach to analyze the extracellular and cell surface protein fractions. The experimentally obtained data comprising 107 proteins were compared with the in silico predicted secretome. Because of the lack of signal peptide and cellular localization prediction tools specific for archaeal species, programs trained on eukaryotic and/or Gram-positive and Gram-negative bacterial signal peptide data sets were used. PSortB Gram-negative and Gram-positive analysis predicted 21 (1.2% of total ORFs) and 24 (1.4% of total ORFs) secreted proteins, respectively, from the entire A. pernix K1 proteome, 12 of which were experimentally identified in this work. Six additional proteins were predicted to follow non-classical secretion mechanisms using SecP algorithms. According to at least one of the two PSortB predictions, 48 proteins identified in the two fractions possess an unknown localization site. In addition, more than half of the proteins do not contain signal peptides recognized by current prediction programs. This suggests that known mechanisms only partly describe archaeal protein secretion. The most striking characteristic of the secretome was the high number of transport-related proteins identified from the ATP-binding cassette (ABC), tripartite ATP-independent periplasmic, ATPase, small conductance mechanosensitive ion channel (MscS), and dicarboxylate amino acid-cation symporter transporter families. In particular, identification of 21 solute-binding receptors of the ABC superfamily of the 24 predicted in silico confirms that ABC-mediated transport represents the most frequent strategy adopted by A. pernix for solute translocation across the cell membrane.
Affiliation(s)
- Gianna Palmieri
- Institute of Protein Biochemistry-National Research Council, 80131 Naples, Italy
|
29
|
Zivanovic Y, Armengaud J, Lagorce A, Leplat C, Guérin P, Dutertre M, Anthouard V, Forterre P, Wincker P, Confalonieri F. Genome analysis and genome-wide proteomics of Thermococcus gammatolerans, the most radioresistant organism known amongst the Archaea. Genome Biol 2009; 10:R70. [PMID: 19558674 PMCID: PMC2718504 DOI: 10.1186/gb-2009-10-6-r70] [Citation(s) in RCA: 110] [Impact Index Per Article: 6.9] [Received: 03/24/2009] [Revised: 05/29/2009] [Accepted: 06/26/2009] [Indexed: 11/15/2022]
Abstract
The genome sequence of Thermococcus gammatolerans, a radioresistant archaeon, is described; a proteomic analysis reveals that radioresistance may be due to unknown DNA repair enzymes. Background: Thermococcus gammatolerans was isolated from samples collected from hydrothermal chimneys. It is one of the most radioresistant organisms known amongst the Archaea. We report the determination and annotation of its complete genome sequence, its comparison with other Thermococcales genomes, and a proteomic analysis. Results: T. gammatolerans has a circular chromosome of 2.045 Mbp without any extra-chromosomal elements, coding for 2,157 proteins. A thorough comparative genomics analysis revealed important but unsuspected genome plasticity differences between sequenced Thermococcus and Pyrococcus species that could not be attributed to the presence of specific mobile elements. Two virus-related regions, tgv1 and tgv2, are the only mobile elements identified in this genome. A proteogenome analysis was performed by a shotgun liquid chromatography-tandem mass spectrometry approach, allowing the identification of 10,931 unique peptides corresponding to 951 proteins. This information concurrently validates the accuracy of the genome annotation. Semi-quantification of proteins by spectral count was done on exponential- and stationary-phase cells. Insights into general catabolism, hydrogenase complexes, detoxification systems, and the DNA repair toolbox of this archaeon are revealed through this genome and proteome analysis. Conclusions: This work is the first archaeal proteome investigation done at the stage of primary genome annotation. This archaeon is shown to use a large variety of metabolic pathways even under a rich medium growth condition. This proteogenomic study also indicates that the high radiotolerance of T. gammatolerans is probably due to proteins that remain to be characterized rather than a larger arsenal of known DNA repair enzymes.
Affiliation(s)
- Yvan Zivanovic
- Laboratoire de Génomique des Archae, Université Paris-Sud 11, CNRS, UMR8621, Bât400 F-91405 Orsay, France.
|
30
|
Lee AM, Sevinsky JR, Bundy JL, Grunden AM, Stephenson JL. Proteomics of Pyrococcus furiosus, a Hyperthermophilic Archaeon Refractory to Traditional Methods. J Proteome Res 2009; 8:3844-51. [DOI: 10.1021/pr801119h] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Indexed: 02/07/2023]
Affiliation(s)
- Alice M. Lee
- Department of Microbiology, P.O. Box 7615, North Carolina State University, Raleigh, North Carolina 27695, and Biomarkers and Systems Biology Center, Research Triangle Institute, 3040 Cornwallis Road, Research Triangle Park, North Carolina 27709
- Joel R. Sevinsky
- Department of Microbiology, P.O. Box 7615, North Carolina State University, Raleigh, North Carolina 27695, and Biomarkers and Systems Biology Center, Research Triangle Institute, 3040 Cornwallis Road, Research Triangle Park, North Carolina 27709
- Jonathan L. Bundy
- Department of Microbiology, P.O. Box 7615, North Carolina State University, Raleigh, North Carolina 27695, and Biomarkers and Systems Biology Center, Research Triangle Institute, 3040 Cornwallis Road, Research Triangle Park, North Carolina 27709
- Amy M. Grunden
- Department of Microbiology, P.O. Box 7615, North Carolina State University, Raleigh, North Carolina 27695, and Biomarkers and Systems Biology Center, Research Triangle Institute, 3040 Cornwallis Road, Research Triangle Park, North Carolina 27709
- James L. Stephenson
- Department of Microbiology, P.O. Box 7615, North Carolina State University, Raleigh, North Carolina 27695, and Biomarkers and Systems Biology Center, Research Triangle Institute, 3040 Cornwallis Road, Research Triangle Park, North Carolina 27709
|
31
|
Refolding, characterization and crystal structure of (S)-malate dehydrogenase from the hyperthermophilic archaeon Aeropyrum pernix. Biochim Biophys Acta Proteins Proteom 2009; 1794:1496-504. [PMID: 19555779 DOI: 10.1016/j.bbapap.2009.06.014] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Received: 01/27/2009] [Revised: 06/15/2009] [Accepted: 06/16/2009] [Indexed: 11/21/2022]
Abstract
Tartrate oxidation activity was found in the crude extract of an aerobic hyperthermophilic archaeon Aeropyrum pernix, and the enzyme was identified as (S)-malate dehydrogenase (MDH), which, when produced in Escherichia coli, was mainly obtained as an inactive inclusion body. The inclusion body was dissolved in 6 M guanidine-HCl and gradually refolded to the active enzyme through dilution of the denaturant. The purified recombinant enzyme consisted of four identical subunits with a molecular mass of about 110 kDa. NADP was preferred as a coenzyme over NAD for (S)-malate oxidation and, unlike MDHs from other sources, this enzyme readily catalyzed the oxidation of (2S,3S)-tartrate and (2S,3R)-tartrate. The tartrate oxidation activity was also observed in MDHs from the hyperthermophilic archaea Methanocaldococcus jannaschii and Archaeoglobus fulgidus, suggesting these hyperthermophilic MDHs loosely bind their substrates. The refolded A. pernix MDH was also crystallized, and the structure was determined at a resolution of 2.9 Å. Its overall structure was similar to those of the M. jannaschii, Chloroflexus aurantiacus, Chlorobium vibrioforme and Cryptosporidium parvum [lactate dehydrogenase-like] MDHs with root-mean-square-deviation values between 1.4 and 2.1 Å. Consistent with earlier reports, Ala at position 53 was responsible for coenzyme specificity, and the next residue, Arg, was important for NADP binding. Structural comparison revealed that the hyperthermostability of the A. pernix MDH is likely attributable to its smaller cavity volume and larger numbers of ion pairs and ion-pair networks, but the molecular strategy for thermostability may be specific for each enzyme.
|
32
|
Barbe V, Cruveiller S, Kunst F, Lenoble P, Meurice G, Sekowska A, Vallenet D, Wang T, Moszer I, Médigue C, Danchin A. From a consortium sequence to a unified sequence: the Bacillus subtilis 168 reference genome a decade later. Microbiology (Reading) 2009; 155:1758-1775. [PMID: 19383706 PMCID: PMC2885750 DOI: 10.1099/mic.0.027839-0] [Citation(s) in RCA: 264] [Impact Index Per Article: 16.5] [Received: 01/26/2009] [Revised: 02/25/2009] [Accepted: 02/25/2009] [Indexed: 11/18/2022]
Abstract
Comparative genomics is the cornerstone of identification of gene functions. The immense number of living organisms precludes experimental identification of functions except in a handful of model organisms. The bacterial domain is split into large branches, among which the Firmicutes occupy a considerable space. Bacillus subtilis has been the model of Firmicutes for decades and its genome has been a reference for more than 10 years. Sequencing the genome involved more than 30 laboratories, with different areas of expertise, in an attempt to make the most of the experimental information that could be associated with the sequence. This had the expected drawback that the sequencing expertise was quite varied among the groups involved, especially at a time when sequencing genomes was extremely hard work. The recent development of very efficient, fast and accurate sequencing techniques, in parallel with the development of high-level annotation platforms, motivated the present resequencing work. The updated sequence has been reannotated in agreement with the UniProt protein knowledge base, keeping in perspective the split between the paleome (genes necessary for sustaining and perpetuating life) and the cenome (genes required for occupation of a niche, suggesting here that B. subtilis is an epiphyte). This should permit investigators to make reliable inferences to prepare validation experiments in a variety of domains of bacterial growth and development as well as build up accurate phylogenies.
Affiliation(s)
- Valérie Barbe
- CEA, Institut de Génomique, Génoscope, 2 rue Gaston Crémieux, 91057 Évry, France
- Stéphane Cruveiller
- CEA, Institut de Génomique, Laboratoire de Génomique Comparative/CNRS UMR8030, Génoscope, 2 rue Gaston Crémieux, 91057 Évry, France
- Frank Kunst
- CEA, Institut de Génomique, Génoscope, 2 rue Gaston Crémieux, 91057 Évry, France
- Patricia Lenoble
- CEA, Institut de Génomique, Génoscope, 2 rue Gaston Crémieux, 91057 Évry, France
- Guillaume Meurice
- Institut Pasteur, Intégration et Analyse Génomiques, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France
- Agnieszka Sekowska
- Institut Pasteur, Génétique des Génomes Bactériens/CNRS URA2171, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France
- David Vallenet
- CEA, Institut de Génomique, Laboratoire de Génomique Comparative/CNRS UMR8030, Génoscope, 2 rue Gaston Crémieux, 91057 Évry, France
- Tingzhang Wang
- Institut Pasteur, Génétique des Génomes Bactériens/CNRS URA2171, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France
- Ivan Moszer
- Institut Pasteur, Intégration et Analyse Génomiques, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France
- Claudine Médigue
- CEA, Institut de Génomique, Laboratoire de Génomique Comparative/CNRS UMR8030, Génoscope, 2 rue Gaston Crémieux, 91057 Évry, France
- Antoine Danchin
- Institut Pasteur, Génétique des Génomes Bactériens/CNRS URA2171, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France
|
33
|
Armengaud J. A perfect genome annotation is within reach with the proteomics and genomics alliance. Curr Opin Microbiol 2009; 12:292-300. [PMID: 19410500 DOI: 10.1016/j.mib.2009.03.005] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.9] [Received: 01/12/2009] [Revised: 03/26/2009] [Accepted: 03/26/2009] [Indexed: 11/17/2022]
Abstract
High-throughput identification of proteins and their accurate partial sequencing by shotgun nanoLC-MS/MS are now feasible for any cellular model at a full genomic scale. Proteogenomics is the integration of these data with the genome. Mining microbial proteomes allows validation of predicted orphan genes and correction of genome annotation errors such as discovery of unannotated genes, reversal of reading frames and identification of translational start sites, stop codon read-throughs or programmed frameshifts. Recent advances have been achieved in database searches, N-terminal oriented proteomics and homology-driven proteogenomics. From now on, proteogenomics on newly sequenced model genomes can be carried out at the earliest stage of the genome project as already exemplified by Mycoplasma mobile and Deinococcus deserti genomes. The proteomics and genomics alliance produces almost complete and accurate gene catalogues for small microbial genomes, a comprehensiveness which is essential for efficient systems biology.
Affiliation(s)
- Jean Armengaud
- CEA, DSV, IBEB, Lab Biochim System Perturb, Bagnols-sur-Cèze, France.
|
34
|
Hu GQ, Guo JT, Liu YC, Zhu H. MetaTISA: Metagenomic Translation Initiation Site Annotator for improving gene start prediction. Bioinformatics 2009; 25:1843-5. [PMID: 19389734 DOI: 10.1093/bioinformatics/btp272] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Indexed: 11/14/2022]
Abstract
Summary: We proposed a tool named MetaTISA with an aim to improve TIS prediction of current gene-finders for metagenomes. The method employs a two-step strategy to predict translation initiation sites (TISs) by first clustering metagenomic fragments into phylogenetic groups and then predicting TISs independently for each group in an unsupervised manner. As evaluated on experimentally verified TISs, MetaTISA greatly improves the accuracies of TIS prediction of current gene-finders. Availability: The C++ source code is freely available under the GNU GPL license via http://mech.ctb.pku.edu.cn/MetaTISA/.
Affiliation(s)
- Gang-Qing Hu
- State Key Laboratory for Turbulence and Complex Systems, Department of Biomedical Engineering and Center for Theoretical Biology, Peking University, Beijing 100871, China
|
35
|
Guo FB, Lin Y. Identify Protein-coding Genes in the Genomes of Aeropyrum pernix K1 and Chlorobium tepidum TLS. J Biomol Struct Dyn 2009; 26:413-20. [DOI: 10.1080/07391102.2009.10507256] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Indexed: 10/28/2022]
|
36
|
Kwon SO, Kang SG, Park SH, Kim YH, Choi JS, Lee JH, Kim SI. Proteomic characterization of the sulfur-reducing hyperthermophilic archaeon Thermococcus onnurineus NA1 by 2-DE/MS-MS. Extremophiles 2009; 13:379-87. [PMID: 19132287 DOI: 10.1007/s00792-008-0220-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Received: 07/08/2008] [Accepted: 12/09/2008] [Indexed: 11/28/2022]
Abstract
Thermococcus onnurineus NA1, a sulfur-reducing hyperthermophilic archaeon, was isolated from a deep-sea hydrothermal vent area in Papua New Guinea. The strain requires elemental sulfur as a terminal electron acceptor for heterotrophic growth on peptides, amino acids and sugars. Recently, genome sequencing of Thermococcus onnurineus NA1 was completed. In this study, 2-DE/MS-MS analysis of the cytosolic proteome was performed to characterize the metabolism of Thermococcus onnurineus NA1 at the protein level. Among the 1,136 visualized protein spots, 110 proteins were identified. Enzymes related to metabolic pathways of amino acid utilization, glycolysis, pyruvate conversion, ATP synthesis, and protein synthesis were identified as abundant proteins, highlighting the fact that these are major metabolic pathways in Thermococcus onnurineus NA1. Interestingly, multiple spots of phosphoenolpyruvate synthetase and elongation factor Tu, generated by truncation at the N-terminus, were found on 2D gels, suggesting cellular regulation of these proteins by protease degradation. In addition to the proteins involved in metabolic systems, we also identified various proteases and stress-related proteins. The proteomic characterization of abundantly induced proteins using 2-DE/MS-MS enables a better understanding of Thermococcus onnurineus NA1 metabolism.
Affiliation(s)
- Sang Oh Kwon
- Korea Basic Science Institute, Daejeon 305-333, South Korea
|
37
|
Hu GQ, Zheng X, Zhu HQ, She ZS. Prediction of translation initiation site for microbial genomes with TriTISA. Bioinformatics 2008; 25:123-5. [PMID: 19015130 DOI: 10.1093/bioinformatics/btn576] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Indexed: 11/12/2022]
Abstract
We report a new and simple method, TriTISA, for accurate prediction of translation initiation site (TIS) of microbial genomes. TriTISA classifies all candidate TISs into three categories based on evolutionary properties, and characterizes them in terms of Markov models. Then, it employs a Bayesian methodology for the selection of true TIS with a non-supervised, iterative procedure. Assessment on experimentally verified TIS data shows that TriTISA is overall better than all other state-of-the-art methods for microbial genome TIS prediction. In particular, TriTISA is shown to have a robust accuracy independent of the quality of initial annotation. Availability: The C++ source code is freely available under the GNU GPL license via http://mech.ctb.pku.edu.cn/protisa/TriTISA.
Affiliation(s)
- Gang-Qing Hu
- State Key Lab for Turbulence and Complex Systems, Department of Biomedical Engineering, College of Engineering and Center for Theoretical Biology, Peking University, Beijing 100871, China
|
38
|
The two PAN ATPases from Halobacterium display N-terminal heterogeneity and form labile complexes with the 20S proteasome. Biochem J 2008; 411:387-97. [PMID: 18215129 DOI: 10.1042/bj20071502] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Indexed: 11/17/2022]
Abstract
The PAN (proteasome-activating nucleotidase) proteins from archaea represent homologues of the eukaryotic 26S proteasome regulatory ATPases. In vitro the PAN complex has been previously shown to have a stimulatory effect on the peptidase activities of the 20S core. By using gradient ultracentrifugation we found that, in cellular extracts, the two PAN proteins from Halobacterium do not form stable high-molecular-mass complexes. Only PAN B was found to associate transiently with the 20S proteasome, thus suggesting that the two PAN proteins are not functionally redundant. The PAN B-20S proteasome complexes associate in an ATP-dependent manner and are stabilized upon nucleotide binding. The two PAN proteins were immunodetected in cellular extracts as N-terminal-truncated polypeptides. RNA-mapping experiments and sequence analysis indicated that this process involved transcript heterogeneities and dual translational initiation mechanisms. Taken together, our results suggest that PAN N-terminal modifications and their intracellular dynamics of assembly/association may constitute important determinants of proteolysis regulation.
|
39
|
Indicators from archaeal secretomes. Microbiol Res 2008; 165:1-10. [PMID: 18407482 DOI: 10.1016/j.micres.2008.03.002] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Received: 11/13/2007] [Revised: 02/14/2008] [Accepted: 03/01/2008] [Indexed: 11/21/2022]
Abstract
Just as in the Eukarya and the Bacteria, members of the Archaea need to export proteins beyond the cell membrane. This would be required to fulfill a variety of essential functions such as nutrient acquisition and biotransformations, maintenance of extracellular structures and more. Unlike the Eukarya and the Bacteria, however, members of the Archaea share a number of unique characteristics. Does this uniqueness extend to the protein secretion system? It was the objective of this study to answer this question. To overcome the limited experimental information on secreted proteins in Archaea, this study was carried out by subjecting the available archaeal genomes, which represent halophiles, thermophiles, and extreme thermophiles, to bioinformatics analysis. Specifically, the properties of the secretomes of the Archaea were examined using the ExProt program. A total of 24 genomes were analyzed. Secretomes were found to fall in the range of 6% of total ORFs (Methanopyrus kandleri) to 19% (Halobacterium sp. NRC-1). Methanosarcina acetivorans has the highest fraction of lipoproteins (at 89), whereas the lowest (at 1) was found in members of the Thermoplasma, Pyrobaculum aerophilum, and Nanoarchaeum equitans. Based on the Tat consensus sequence, the contribution of these secreted proteins to the secretomes was negligible, making up 8 proteins out of a total of 7105 predicted exported proteins. Amino acid composition of predicted archaeal signal peptides, an attribute of signal peptides not used as a selection criterion by ExProt, shows that in the haloarchaeal secretomes the frequency of the amino acid Lys is much lower than that seen in bacterial signal peptides, but is compensated for by a higher frequency of Arg. It also shows that higher frequencies of Thr, Val, and Gly contribute to the hydrophobic character of haloarchaeal signal peptides, unlike bacterial signal peptides, in which the hydrophobic character is dominated by Leu and Ile.
|
40
|
Shuttle vector expression in Thermococcus kodakaraensis: contributions of cis elements to protein synthesis in a hyperthermophilic archaeon. Appl Environ Microbiol 2008; 74:3099-104. [PMID: 18378640 DOI: 10.1128/aem.00305-08] [Citation(s) in RCA: 96] [Impact Index Per Article: 5.6] [Indexed: 11/20/2022]
Abstract
Shuttle vectors that replicate stably and express selectable phenotypes in both Thermococcus kodakaraensis and Escherichia coli have been constructed. Plasmid pTN1 from Thermococcus nautilis was ligated to the commercial vector pCR2.1-TOPO, and selectable markers were added so that T. kodakaraensis transformants could be selected by ΔtrpE complementation and/or mevinolin resistance. Based on Western blot measurements, shuttle vector expression of RpoL-HA, a hemagglutinin (HA) epitope-tagged subunit of T. kodakaraensis RNA polymerase (RNAP), was approximately 8-fold higher than chromosome expression. An idealized ribosome binding sequence (5'-AGGTGG) was incorporated for RpoL-HA expression, and changes to this sequence reduced expression. Changing the translation initiation codon from AUG to GUG did not reduce RpoL-HA expression, but replacing AUG with UUG dramatically reduced RpoL-HA synthesis. When functioning as translation initiation codons, AUG, GUG, and UUG all directed the incorporation of methionine as the N-terminal residue of RpoL-HA synthesized in T. kodakaraensis. Affinity purification confirmed that an HA- plus six-histidine-tagged RpoL subunit (RpoL-HA-his6) synthesized ectopically from a shuttle vector was assembled in vivo into RNAP holoenzymes that were active and could be purified directly from T. kodakaraensis cell lysates by Ni²⁺ binding and imidazole elution.
|
41
|
Kiyonari S, Kamigochi T, Ishino Y. A single amino acid substitution in the DNA-binding domain of Aeropyrum pernix DNA ligase impairs its interaction with proliferating cell nuclear antigen. Extremophiles 2007; 11:675-84. [PMID: 17487442 DOI: 10.1007/s00792-007-0083-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Received: 01/22/2007] [Accepted: 04/09/2007] [Indexed: 11/25/2022]
Abstract
Proliferating cell nuclear antigen (PCNA) is known as a DNA sliding clamp that acts as a platform for the assembly of enzymes involved in DNA replication and repair. Previously, it was reported that a crenarchaeal PCNA formed a heterotrimeric structure, and that each PCNA subunit has distinct binding specificity to PCNA-binding proteins. Here we describe the PCNA-binding properties of a DNA ligase from the hyperthermophilic crenarchaeon Aeropyrum pernix K1. Based on our findings on the Pyrococcus furiosus DNA ligase-PCNA interaction, we predicted that the aromatic residue, Phe132, in the DNA-binding domain of A. pernix DNA ligase (ApeLig) would play a critical role in binding to A. pernix PCNA (ApePCNA). Surface plasmon resonance analyses revealed that the ApeLig F132A mutant does not interact with an immobilized subunit of ApePCNA. Furthermore, we could not detect any stimulation of the ligation activity of the ApeLig F132A protein by ApePCNA in vitro. These results indicated that the phenylalanine, which is located in our predicted PCNA-binding region in ApeLig, has a critical role for the physical and functional interaction with ApePCNA.
Affiliation(s)
- Shinichi Kiyonari
- Department of Genetic Resources Technology, Faculty of Agriculture, Kyushu University, 6-10-1 Hakozaki, Higashi-ku, Fukuoka-shi, Fukuoka, 812-8581, Japan
|
42
|
Otsuka R, Sasaki K, Mise M, Ataku H, Nishijima K, Yamazaki J, Yamazaki S. Development of Anion Exchange Chromatography for Proteome Analysis and Its Application to Detection of Membrane Proteins of the Hyper-Thermophilic Crenarchaeon, Aeropyrum pernix K1. Bunseki Kagaku 2006. [DOI: 10.2116/bunsekikagaku.55.963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/23/2022]
Affiliation(s)
- Rie Otsuka
- Genome Analysis Center, Department of Biotechnology, National Institute of Technology and Evaluation (NITE)
- Kazumi Sasaki
- Genome Analysis Center, Department of Biotechnology, National Institute of Technology and Evaluation (NITE)
- Miyako Mise
- Genome Analysis Center, Department of Biotechnology, National Institute of Technology and Evaluation (NITE)
- Hanako Ataku
- Genome Analysis Center, Department of Biotechnology, National Institute of Technology and Evaluation (NITE)
- Keiko Nishijima
- Genome Analysis Center, Department of Biotechnology, National Institute of Technology and Evaluation (NITE)
- Jun Yamazaki
- Genome Analysis Center, Department of Biotechnology, National Institute of Technology and Evaluation (NITE)
- Syuji Yamazaki
- Genome Analysis Center, Department of Biotechnology, National Institute of Technology and Evaluation (NITE)
|